CN110684755B - 基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的识别 - Google Patents

基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的识别 Download PDF

Info

Publication number
CN110684755B
CN110684755B CN201810731984.9A CN201810731984A CN110684755B CN 110684755 B CN110684755 B CN 110684755B CN 201810731984 A CN201810731984 A CN 201810731984A CN 110684755 B CN110684755 B CN 110684755B
Authority
CN
China
Prior art keywords
mutation
lys
asn
seq
leu
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810731984.9A
Other languages
English (en)
Other versions
CN110684755A (zh
Inventor
谢震
马大程
张昭煜
许志锰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN201810731984.9A priority Critical patent/CN110684755B/zh
Priority to PCT/CN2019/094585 priority patent/WO2020007325A1/en
Publication of CN110684755A publication Critical patent/CN110684755A/zh
Application granted granted Critical
Publication of CN110684755B publication Critical patent/CN110684755B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Zoology (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Medicinal Chemistry (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Mycology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

本发明提出了一种Cas9蛋白突变体。该Cas9蛋白突变体具有:框架区;和PAM识别区,所述PAM识别区识别下列核酸序列的至少之一:5’‑NNNRRT‑3’;5’‑NNNRRN‑3’;5’‑NNNRCN‑3’;5’‑NNNRTN‑3’;5’‑NNNCAA‑3’;5’‑NNNCAT‑3’;5’‑NNNCGT‑3’;5’‑NNNCGC‑3’;5’‑NNNGTN‑3’;5’‑NNNTCN‑3’;5’‑NNNTTC‑3’;5’‑NNNTTG‑3’;5’‑NNNTTT‑3’;N=A、T、G或C,R=A或G。

Description

基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的 识别
技术领域
本发明涉及生物技术领域,具体地,本发明涉及Cas9蛋白突变体、核酸、试剂盒、对细胞进行基因改造的方法以及细胞。
背景技术
CRISPR/Cas9核酶在多种物种和不同的细胞中实现了有效的基因编辑。通过人为将crRNA和tracRNA进行偶联形成guide RNA,从而指引Cas9识别不同的位置。但是Cas9仍然需要结合在特定的PAM序列前。
被广泛使用的SpCas9识别NGG PAM序列,而另一个SaCas9蛋白识别“NNGRRT”的PAM序列,PAM序列限制了SaCas9识别的范围。为了扩展SpCas9的识别范围,通过利用直接进化在细菌中进行筛选的方法,多种不同PAM被挖掘。同时,通过引入三点突变,KKH-SaCas9的PAM被拓展成NNNRRT。尽管相比较SaCas9,KKH-SaCas9的PAM识别范围更加扩展,然而,理论上,KKH-SaCas9只能结合1/16的区域。
虽然Cas9同源蛋白广泛分布于细菌中,很多不同的Cas9同源蛋白已经被鉴定。然而很少被鉴定出来在哺乳动物细胞中可以有效进行基因编辑。
因此,具有PAM广泛识别能力的Cas9需要科研工作者不断的开发和改进,使得CRISPR/Cas9系统的基因编辑能力变的更加强大。
发明内容
本申请是基于发明人对以下事实和问题的发现和认识做出的:
本申请的发明人通过进化信息、基因挖掘的方法,发现一系列不同的SaCas9同源蛋白,进而发明人以KKH SaCas9作为骨架,通过将PAM作用区域的13个氨基酸残基肽段替换为其他同源蛋白序列,设计了一系列的不同Cas9嵌合体(cCas9)。这些不同的cCas9具有不同的PAM特异性,除了NNNRRT,不同的突变体还可以识别包括NNNRRN,NNNRCN,NNNRTN、NNNCAA、NNNCAT、NNNCGT、NNNCGC、NNNGTN、NNNTCN、NNNTTC、NNNTTG、NNNTTT的PAM区(N=A、T、G或C、R=A或G)。本申请的发明人成功地将Cas9的PAM识别范围扩展到大于1/2,(上面所列的PAM共有49种,而PAM总共是64种,此识别范围的概率是49/64)。既拓展了PAM的倾向性,又发现了多个新的嵌合体。
为此,在本发明的第一方面,本发明提出了一种Cas9蛋白突变体。根据本发明的实施例,其具有:框架区;和PAM识别区,所述PAM识别区识别下列核酸序列的至少之一:
5’-NNNRRT-3’,N=A、T、G或C、R=A或G;
5’-NNNRRN-3’,N=A、T、G或C、R=A或G;
5’-NNNRCN-3’,N=A、T、G或C、R=A或G;
5’-NNNRTN-3’,N=A、T、G或C、R=A或G;
5’-NNNCAA-3’,N=A、T、G或C;
5’-NNNCAT-3’,N=A、T、G或C;
5’-NNNCGT-3’,N=A、T、G或C;
5’-NNNCGC-3’,N=A、T、G或C;
5’-NNNGTN-3’,N=A、T、G或C;
5’-NNNTCN-3’,N=A、T、G或C;
5’-NNNTTC-3’,N=A、T、G或C;
5’-NNNTTG-3’,N=A、T、G或C;
5’-NNNTTT-3’,N=A、T、G或C。
根据本发明实施例的Cas9蛋白突变体相较于Cas9,PAM识别范围扩展到接近1/2,极大拓展了PAM的倾向性。根据本发明实施例的Cas9蛋白突变体在guide RNA指引下,能够结合的dsDNA区域得到极大的拓展,CRISPR/Cas9系统的基因编辑能力变的更加强大。
根据本发明的实施例,上述Cas9蛋白突变体还可以进一步包括如下附加技术特征至少之一:
根据本发明的实施例,所述Cas9蛋白突变体的框架区与下列野生型蛋白的框架区具有至少70%的同源性;优选地,具有至少80%的同源性;更优选地,具有至少90%的同源性;更优选地,具有至少95%的同源性;更优选地,具有至少99%的同源性;
O13、O40、O23、O39、O26、O18、O38、O12、O36、O27、O10、O33、O34、O14、O44、O15、O28、O42、O20、O37、O24、O43、O30、O31、O32、O29、O16、O19、O25、O21、O17、O35、O22、saCas9、SaCas9-KKH。
根据本发明实施例的上述Cas9蛋白突变体相较于Cas9,PAM识别范围广,根据本发明实施例的Cas9蛋白突变体在guide RNA指引下,能够结合的dsDNA区域得到极大的拓展,CRISPR/Cas9系统的基因编辑能力变的更加强大。
根据本发明的实施例,所述框架区与saCas9具有至少90%的同源性;更优选地,具有至少95%的同源性;更优选地,具有至少99%的同源性。根据本发明实施例的Cas9蛋白突变体相较于saCas9,PAM识别范围更广,可扩展到接近1/2。
根据本发明的具体实施例,所述框架区具有SEQ ID NO:1~2、130所示的氨基酸序列。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRV(SEQ ID NO:1)。
NMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQ IDNO:2)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNAKTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATARLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRV(SEQ ID NO:130)。
其中,SEQ ID NO:1或SEQ ID NO:130所示的氨基酸序列是位于PAM识别区5’端的框架区序列,SEQ ID NO:2所示的氨基酸序列是位于PAM识别区3’端的框架区序列,即所述PAM识别区位于5’端的框架区序列和3’端的框架区序列之间,SEQ ID NO:1或SEQID NO:130所示的氨基酸序列的3’端与PAM识别区5’端相连,SEQ ID NO:2所示的氨基酸序列的5’端与PAM识别区3’端相连。
根据本发明的实施例,相对于saCas9,所述PAM识别区与982IGVNNDLLNRIEV994相比具有至少一个突变。
根据本发明的实施例,相对于saCas9,所述PAM识别区与982IGVNNDLLNRIEV 994相比具有至多13个突变,优选地,具有至多8个突变,或具有至多7个突变,或具有至多6个突变,或具有至多5个突变,或具有至多4个突变,或具有至多3个突变。
根据本发明的实施例,所述Cas9蛋白突变体与所述Cas9蛋白相比具有第982位~994位任一点或多点的突变。发明人发现,SaCas9在PAM直接相互作用的氨基酸残基更加的不保守,暗示这些不同的Cas9同源蛋白可能识别不同的PAM序列。而SaCas9上负责PAM相互作用的氨基酸残基的旁边的序列更加的保守,而且,985、986、991位三个PAM相互作用的氨基酸残基在蛋白序列上临近分布。因此,发明人将不同来源的Cas9同源蛋白中PAM相互作用区域的短肽直接替换到SaCas9上,从而开发出了一系列识别不同嵌合蛋白。选择对应于SaCas9中序列为982-994的氨基酸肽(PAM识别区)段进行替换,筛选获得的SaCas9嵌合体与PAM相互作用的成功率、活性更高。
根据本发明的实施例,相对于saCas9,所述PAM识别区与982IGVNNDLLNRIEV 994相比具有下列突变的至少之一:第982位突变为T、K、R或L,第983位突变为A、C或S,第984位突变为T、D,第985位突变为F、S、A、N,第986位突变为E、D、H、A、M,第987位突变为S、G、N、S、D、E、P,第988位突变为D、K、T、S、T、D、K、R、E、A,第989位突变为R、A、N、Q、G、E、T、K、S、G、H、V,第990位突变为S,第991位突变为I、V、L、K、T、M,第992位突变为V、L,第993位突变为Q,第994位突变为L、M、C、I、A。发明人发现,具有上述至少之一突变的根据本申请实施例的Cas9蛋白突变体的PAM识别范围广,可扩展到接近1/2。
根据本发明的实施例,相对于saCas9,所述PAM识别区与982IGVNNDLLNRIEV 994相比,在985位突变为S的前提下,第986位突变为S和第991位突变为R;
优选地,在985位突变为A的前提下,第986位突变为M和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为K;
优选地,在985位突变为A的前提下,第986位突变为N和第991位突变为I;
优选地,在所985位突变为N的前提下,第986位突变为H和第991位突变为R;
优选地,在985位突变为N的前提下,第986位突变为S和第991位突变为L;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为I;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为V;
优选地,在985位突变为N的前提下,第986位突变为D和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为S和第991位突变为I;
优选地,在985位突变为N的前提下,第986位突变为D和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为R;
优选地,在985位突变为N的前提下,第986位突变为H和第991位突变为R;
优选地,在985位突变为N的前提下,第986位突变为E和第991位突变为I;
优选地,在985位突变为S的前提下,第986位突变为S和第991位突变为R;
优选地,在985位突变为N的前提下,第986位突变为D和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为A和第991位突变为T;
优选地,在985位突变为N的前提下,第986位突变为D和第991位突变为T;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为R;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为V;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为V;
优选地,在985位突变为S的前提下,第986位突变为M和第991位突变为K;
优选地,在985位突变为A的前提下,第986位突变为M和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为L;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为M;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为R;
优选地,在985位突变为N的前提下,第986位突变为D和第991位突变为T;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为M;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为K;
优选地,在985位突变为F的前提下,第986位突变为S和第991位突变为L;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为V;
优选地,在985位突变为N的前提下,第986位突变为S和第991位突变为L;
优选地,在985位突变为N的前提下,第986位突变为S和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为S和第991位突变为R;
优选地,在985位突变为N的前提下,第986位突变为S和第991位突变为R;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为K;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为I;
优选地,在985位突变为N的前提下,第986位突变为N和第991位突变为I。
进而,根据本申请实施例的Cas9蛋白突变体PAM识别范围更广。
根据本发明的实施例,所述Cas9蛋白突变体的PAM识别区具有SEQ ID NO:3~43所示的氨基酸序列。
RSDSSPRENRLEV(SEQ ID NO:3)。
KGDAMPRGNKIEI(SEQ ID NO:4)。
TATNNDKSNKIEV(SEQ ID NO:5)。
LGDANSRQNILEA(SEQ ID NO:6)。
IGVNHDEGNRIEM(SEQ ID NO:7)。
IGVNSDKNNLIEV(SEQ ID NO:8)。
IGVNNSTRNIVEL(SEQ ID NO:9)。
RGDNNPRQNKLEV(SEQ ID NO:10)。
IGVNNDKNNVIEL(SEQ ID NO:11)。
IGINDNKHNKIEL(SEQ ID NO:12)。
IGVNSDDRNIIEL(SEQ ID NO:13)。
IGVNDSEKNKIQL(SEQ ID NO:14)。
KCINNEKTHRIEI(SEQ ID NO:15)。
IGVNHDKTNRIEC(SEQ ID NO:16)。
IGVNEDKRNIIEL(SEQ ID NO:17)。
RGDSSPRENRFEV(SEQ ID NO:18)。
RGDNDPKANKIEV(SEQ ID NO:19)。
IGVNAEKRNTIEV(SEQ ID NO:20)。
IGVNDDAKNTLEL(SEQ ID NO:21)。
VGVNNDSVNRVEL(SEQ ID NO:22)。
VGVNNDTRNVVEL(SEQ ID NO:23)。
VGVNNDSRNVVEL(SEQ ID NO:24)。
RGDSMPRQNKIEM(SEQ ID NO:25)。
RGDAMPRDNKIEV(SEQ ID NO:26)。
IGINNGDKNLVEL(SEQ ID NO:27)。
RGDNNPRQNMIEV(SEQ ID NO:28)。
IGVNNDSTNRVEL(SEQ ID NO:29)。
RGDNDPRRSTIEL(SEQ ID NO:30)。
RGDNNPRQNKLEV(SEQ ID NO:31)。
TATNNDKKNMIEV(SEQ ID NO:32)。
IGVNNNRLNKIEL(SEQ ID NO:33)。
IGVFSDAGNLLEV(SEQ ID NO:34)。
IGDNNPRNNVIEV(SEQ ID NO:35)。
IGVNSDDRNLIEL(SEQ ID NO:36)。
IGVNSDDRNKIEL(SEQ ID NO:37)。
IGVNSDDRNRIEL(SEQ ID NO:38)。
IGVNSDLLNRIEV(SEQ ID NO:39)。
IGVNNNLLNKIEV(SEQ ID NO:40)。
IGVNNDLLNKIEV(SEQ ID NO:41)。
IGVNNSTRNIKEL(SEQ ID NO:42)。
IGVNNSTRNILEL(SEQ ID NO:43)。
发明人发现,具有上述PAM识别序列的Cas9蛋白突变体在guide RNA指引下,能够结合非常广泛的dsDNA区域,PAM识别范围扩展到接近1/2,CRISPR/Cas9系统的基因编辑能力更加强大。
根据本发明的实施例,所述Cas9蛋白突变体具有SEQ ID NO:44~85、131所示的氨基酸序列。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRSDSSPRENRLEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:44)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVKGDAMPRGNKIEINMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:45)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVTATNNDKSNKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:46)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVLGDANSRQNILEANMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:47)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNHDEGNRIEMNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:48)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDKNNLIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:49)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNSTRNIVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:50)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNNPRQNKLEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:51)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNDKNNVIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:52)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGINDNKHNKIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:53)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNIIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:54)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNDSEKNKIQLNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:55)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVKCINNEKTHRIEINMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:56)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNHDKTNRIECNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:57)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNEDKRNIIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:58)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDSSPRENRFEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:59)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNDPKANKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:60)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNAEKRNTIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:61)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNDDAKNTLELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:62)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVVGVNNDSVNRVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:63)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVVGVNNDTRNVVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:64)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVVGVNNDSRNVVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:65)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDSMPRQNKIEMNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:66)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDAMPRDNKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:67)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGINNGDKNLVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:68)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNNPRQNMIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:69)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNDSTNRVELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:70)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNDPRRSTIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:71)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVRGDNNPRQNKLEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:72)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVTATNNDKKNMIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:73)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNNRLNKIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:74)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVFSDAGNLLEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:75)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGDNNPRNNVIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:76)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNLIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:77)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNKIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:78)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNRIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:79)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDLLNRIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:80)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNNLLNKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:81)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNDLLNKIEVNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:82)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNSTRNIKELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:83)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNNSTRNILELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:84)。
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNAKTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATARLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRKLINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYKNDLIKINGELYRVIGVNSDDRNRIELNMIDITYREYLENMNDKRPPHIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG(SEQID NO:131)。
具有上述氨基酸序列的Cas9蛋白突变体相较于Cas9,PAM识别范围扩展到接近1/2,极大拓展了PAM的倾向性,其中,具有SEQ ID NO:131所示氨基酸序列的Cas9蛋白突变体其脱靶效率进一步降低。
在本发明的第二方面,本发明提出了一种核酸。根据本发明的实施例,所述核酸编码前面所述的Cas9蛋白突变体。进而将根据本发明实施例的核酸导入受体细胞后,在合适的条件下,获得前面所述的Cas9蛋白突变体。根据本发明实施例的核酸可作为CRISPR/Cas9系统的前导核酸,在导入细胞成功表达前面所述的Cas9蛋白突变体,例如SaCas9嵌合体后,可实现更为强大的基因编辑功能。
根据本发明的实施例,上述核酸还可以进一步包括如下附加技术特征至少之一:
根据本发明的实施例,所述核酸具有SEQ ID NO:85~125、132任一所述的核苷酸序列。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGAGCGACAGCAGCCCCAGGGAGAACAGGCTGGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:85)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAAGGGCGACGCCATGCCCAGGGGCAACAAGATCGAGATCTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:86)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGACCGCCACCAACAACGACAAGAGCAACAAGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:87)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGCTGGGCGACGCCAACAGCAGGCAGAACATCCTGGAGGCCTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:88)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACCACGACGAGGGCAACAGGATCGAGATGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:89)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACAAGAACAACCTGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:90)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAGCACCAGGAACATCGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:91)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACAACCCCAGGCAGAACAAGCTGGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:92)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACGACAAGAACAACGTGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:93)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCATCAACGACAACAAGCACAACAAGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:94)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACAGGAACATCATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:95)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACGACAGCGAGAAGAACAAGATCCAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:96)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAAGTGCATCAACAACGAGAAGACCCACAGGATCGAGATCTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:97)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACCACGACAAGACCAACAGGATCGAGTGCTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:98)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACGAGGACAAGAGGAACATCATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:99)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAGCAGCCCCAGGGAGAACAGGTTCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:100)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACGACCCCAAGGCCAACAAGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:101)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACGCCGAGAAGAGGAACACCATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:102)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACGACGACGCCAAGAACACCCTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:103)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGGTGGGCGTGAACAACGACAGCGTGAACAGGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:104)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGGTGGGCGTGAACAACGACACCAGGAACGTGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:105)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGGTGGGCGTGAACAACGACAGCAGGAACGTGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:106)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAGCATGCCCAGGCAGAACAAGATCGAGATGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:107)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACGCCATGCCCAGGGACAACAAGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:108)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCATCAACAACGGCGACAAGAACCTGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:109)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACAACCCCAGGCAGAACATGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:110)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACGACAGCACCAACAGGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:111)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACGACCCCAGGAGGAGCACCATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:112)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGAGGGGCGACAACAACCCCAGGCAGAACAAGCTGGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:113)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGACCGCCACCAACAACGACAAGAAGAACATGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:114)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAACAGGCTGAACAAGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:115)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGTTCAGCGACGCCGGCAACCTGCTGGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:116)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGACAACAACCCCAGGAACAACGTGATCGAGGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:117)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAACCTGCTGAACAAGATCGAAGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:118)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACTCGGACCTGCTGAACCGGATCGAAGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:119)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACGACCTGCTGAACAAGATCGAAGTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:120)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAGCACCAGGAACAAGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:121)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAACAGCACCAGGAACCTGGTGGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:122)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACAGGAACAAGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:123)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACAGGAACCTGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:124)。
ATGAAGCGGAACTACATCCTGGGCCTGGCCATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACCGGCAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAGCCAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCAGAGGCCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACAGGAACAGGATCGAGCTGTGAGACGGGCCATACTCGTCTCGAACATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:125)。
ATGAAGCGGAACTACATCCTGGGCCTGGACATCGGCATCACCAGCGTGGGCTACGGCATCATCGACTACGAGACACGGGACGTGATCGATGCCGGCGTGCGGCTGTTCAAAGAGGCCAACGTGGAAAACAACGAGGGCAGGCGGAGCAAGAGAGGCGCCAGAAGGCTGAAGCGGCGGAGGCGGCATAGAATCCAGAGAGTGAAGAAGCTGCTGTTCGACTACAACCTGCTGACCGACCACAGCGAGCTGAGCGGCATCAACCCCTACGAGGCCAGAGTGAAGGGCCTGAGCCAGAAGCTGAGCGAGGAAGAGTTCTCTGCCGCCCTGCTGCACCTGGCCAAGAGAAGAGGCGTGCACAACGTGAACGAGGTGGAAGAGGACACCGGCAACGAGCTGTCCACCAAAGAGCAGATCAGCCGGAACAGCAAGGCCCTGGAAGAGAAATACGTGGCCGAACTGCAGCTGGAACGGCTGAAGAAAGACGGCGAAGTGCGGGGCAGCATCAACAGATTCAAGACCAGCGACTACGTGAAAGAAGCCAAACAGCTGCTGAAGGTGCAGAAGGCCTACCACCAGCTGGACCAGAGCTTCATCGACACCTACATCGACCTGCTGGAAACCCGGCGGACCTACTATGAGGGACCTGGCGAGGGCAGCCCCTTCGGCTGGAAGGACATCAAAGAATGGTACGAGATGCTGATGGGCCACTGCACCTACTTCCCCGAGGAACTGCGGAGCGTGAAGTACGCCTACAACGCCGACCTGTACAACGCCCTGAACGACCTGAACAATCTCGTGATCACCAGGGACGAGAACGAGAAGCTGGAATATTACGAGAAGTTCCAGATCATCGAGAACGTGTTCAAGCAGAAGAAGAAGCCCACCCTGAAGCAGATCGCCAAAGAAATCCTCGTGAACGAAGAGGATATTAAGGGCTACAGAGTGACCAGCACCGGCAAGCCCGAGTTCACCAACCTGAAGGTGTACCACGACATCAAGGACATTACCGCCCGGAAAGAGATTATTGAGAACGCCGAGCTGCTGGATCAGATTGCCAAGATCCTGACCATCTACCAGAGCAGCGAGGACATCCAGGAAGAACTGACCAATCTGAACTCCGAGCTGACCCAGGAAGAGATCGAGCAGATCTCTAATCTGAAGGGCTATACCGGCACCCACAACCTGAGCCTGAAGGCCATCAACCTGATCCTGGACGAGCTGTGGCACACCAACGACAACCAGATCGCTATCTTCAACCGGCTGAAGCTGGTGCCCAAGAAGGTGGACCTGTCCCAGCAGAAAGAGATCCCCACCACCCTGGTGGACGACTTCATCCTGAGCCCCGTCGTGAAGAGAAGCTTCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAACGACATCATTATCGAGCTGGCCCGCGAGAAGAACTCCAAGGACGCCCAGAAAATGATCAACGAGATGCAGAAGCGGAACGCCAAGACCAACGAGCGGATCGAGGAAATCATCCGGACCACCGGCAAAGAGAACGCCAAGTACCTGATCGAGAAGATCAAGCTGCACGACATGCAGGAAGGCAAGTGCCTGTACAGCCTGGAAGCCATCCCTCTGGAAGATCTGCTGAACAACCCCTTCAACTATGAGGTGGACCACATCATCCCCAGAAGCGTGTCCTTCGACAACAGCTTCAACAACAAGGTGCTCGTGAAGCAGGAAGAAAACAGCAAGAAGGGCAACCGGACCCCATTCCAGTACCTGAGCAGCAGCGACAGCAAGATCAGCTACGAAACCTTCAAGAAGCACATCCTGAATCTGGCCAAGGGCAAGGGCAGAATCAGCAAGACCAAGAAAGAGTATCTGCTGGAAGAACGGGACATCAACAGGTTCTCCGTGCAGAAAGACTTCATCAACCGGAACCTGGTGGATACCAGATACGCCACCGCCCGGCTGATGAACCTGCTGCGGAGCTACTTCAGAGTGAACAACCTGGACGTGAAAGTGAAGTCCATCAATGGCGGCTTCACCAGCTTTCTGCGGCGGAAGTGGAAGTTTAAGAAAGAGCGGAACAAGGGGTACAAGCACCACGCCGAGGACGCCCTGATCATTGCCAACGCCGATTTCATCTTCAAAGAGTGGAAGAAACTGGACAAGGCCAAAAAAGTGATGGAAAACCAGATGTTCGAGGAAAAGCAGGCCGAGAGCATGCCCGAGATCGAAACCGAGCAGGAGTACAAAGAGATCTTCATCACCCCCCACCAGATCAAGCACATTAAGGACTTCAAGGACTACAAGTACAGCCACCGGGTGGACAAGAAGCCTAATAGAAAGCTGATTAACGACACCCTGTACTCCACCCGGAAGGACGACAAGGGCAACACCCTGATCGTGAACAATCTGAACGGCCTGTACGACAAGGACAATGACAAGCTGAAAAAGCTGATCAACAAGAGCCCCGAAAAGCTGCTGATGTACCACCACGACCCCCAGACCTACCAGAAACTGAAGCTGATTATGGAACAGTACGGCGACGAGAAGAATCCCCTGTACAAGTACTACGAGGAAACCGGGAACTACCTGACCAAGTACTCCAAAAAGGACAACGGCCCCGTGATCAAGAAGATTAAGTATTACGGCAACAAACTGAACGCCCATCTGGACATCACCGACGACTACCCCAACAGCAGAAACAAGGTCGTGAAGCTGTCCCTGAAGCCCTACAGATTCGACGTGTACCTGGACAATGGCGTGTACAAGTTCGTGACCGTGAAGAATCTGGATGTGATCAAAAAAGAAAACTACTACGAAGTGAATAGCAAGTGCTATGAGGAAGCTAAGAAGCTGAAGAAGATCAGCAACCAGGCCGAGTTTATCGCCTCCTTCTACAAGAACGATCTGATCAAGATCAACGGCGAGCTGTATAGAGTGATCGGCGTGAACAGCGACGACCGGAACCGGATCGAAGTGCTGATGATCGACATCACCTACCGCGAGTACCTGGAAAACATGAACGACAAGAGGCCCCCCCACATCATTAAGACAATCGCCTCCAAGACCCAGAGCATTAAGAAGTACAGCACAGACATTCTGGGCAACCTGTATGAAGTGAAATCTAAGAAGCACCCTCAGATCATCAAAAAGGGC(SEQ ID NO:132)。
具有上述核苷酸序列的核酸可有效编码上述Cas9蛋白突变体,表达效率更高。
在本发明的第三方面,本发明提出了一种试剂盒。根据本发明的实施例,所述试剂盒包括:第一核酸分子,所述第一核酸分子编码前面所述的Cas9蛋白突变体;以及第二核酸分子,所述第二核酸分子编码gRNA。根据本发明实施例的试剂盒中的第一核酸分子编码的Cas9蛋白突变体,如SaCas9嵌合体,可在guide RNA指引下,结合非常广泛的dsDNA区域,PAM识别范围扩展到接近1/2,CRISPR/Cas9系统的基因编辑能力强大。
根据本发明的实施例,上述试剂盒还可以进一步包括如下附加技术特征至少之一:
根据本发明的实施例,所述第一核酸分子具有SEQ ID NO:85~125、132任一项所述的核苷酸序列。具有上述核苷酸序列的第一核酸可有效编辑上述Cas9蛋白突变体,表达效率更高。
根据本发明的实施例,所述第二核酸分子的编码gRNA骨架序列的核苷酸序列与野生型gRNA骨架序列的核苷酸序列相比,具有下列突变的至少之一:U3C,U4A,U4C,U5C,A6G,A32G,A31T,A31G,A30G,T29C。其中,野生型gRNA骨架序列具有SEQ ID NO:126所示的核苷酸序列。
GTTTTAGTACTCTGGAAACAGAATCTACTAAAACAAGGCAAAATGCCGTGTTTATCTCGTCAACTTGTTGGCGAGATTTTTT(SEQ ID NO:126)。
通过序列比对,发明人发现gRNA架序列的第4-6位置更加的保守。并且发明人前期的工作表明,可以通过更改4-6位置的碱基序列从而提高guide RNA的活性,因此,申请人认为这一区域Cas9与guide RNA骨架的相互作用并没有蛋白-特定序列RNA相互作用,这些区域突变会导致crRNA PAM识别区域的改变,很有可能产生自我非我识别的干扰。由于gRNA骨架中的连续的4个T能够引起III类聚合酶的提前终止,发明人惊喜地发现,通过将第4位进行AU翻转,和第五位将AU突变为GC,可有效提高guide RNA活性。然而在4-6位的更改改变gRNA的原有的自我非我识别能力。为了不改变4-6位置的序列同时改变连续4个T序列导致的提前终止,发明人将第三位的U更改为C,同时,发明人观察到,第V38变体对应的crRNA的4-6位置,不同于其他crRNA 4-6位置的TTR为TCG,因此,可能这个突变体具有不一样的PAM倾向性,有可能可以识别TTA,为了使发明人的筛选更加的稳健,发明人同时更改4-6位置为CCG。根据本发明的实施例,具有上述突变位点的第二核酸分子编码获得的gRNA,saCas9的PAM结合活性显著提高,CRISPR/Cas9系统的基因编辑能力进一步显著增强。
根据本发明的实施例,所述第二核酸分子具有SEQ ID NO:127~129任一所述的核苷酸序列。
GTCTTAGTACTCTGGAAACAGAATCTACTAAGACAAGGCAAAATGCCGTGTTTATCTCGTCAACTTGTTGGCGAGATTTTTT(SEQ ID NO:127)。
GTTATAGTACTCTGGAAACAGAATCTACTATAACAAGGCAAAATGCCGTGTTTATCTCGTCAACTTGTTGGCGAGATTTTTTT(SEQ ID NO:128)。
GTTCCGGTACTCTGGAAACAGAATCTACCGGAACAAGGCAAAATGCCGTGTTTATCTCGTCAACTTGTTGGCGAGATTTTTTT(SEQ ID NO:129)。
具有上述核苷酸序列的第二核酸分子编码gRNA的效率高,编码获得的gRNA引导saCas9的PAM结合活性显著提高,CRISPR/Cas9系统的基因编辑能力进一步增强。
根据本发明的实施例,所述第一核酸分子、第二核酸分子负载在同一表达载体上。
根据本发明的实施例,所述同一个载体为腺病毒载体。
在本发明的第四方面,本发明提出了一种对细胞进行基因改造的方法。根据本发明的实施例,将第一核酸分子和第二核酸分子引入待改造的细胞中,所述第一核酸分子和所述第二核酸分子是如前面所限定的。根据本发明实施例的方法可实现对细胞基因组的预定位点进行基因改造,改造成功率高、效率高、活性强。
根据本发明的实施例,上述方法还可以进一步包括如下附加技术特征至少之一:
根据本发明的实施例,所述Cas9蛋白突变体的PAM识别区序列和所述gRNA序列是基于待改造的基因序列确定的。
根据本发明的实施例,所述PAM识别区序列和所述gRNA的序列是基于下列关系确定的:
Figure BDA0001721117680001081
Figure BDA0001721117680001091
Figure BDA0001721117680001101
Figure BDA0001721117680001111
发明人通过实验验证了,利用根据本发明实施例的基因改造的方法,所述PAM识别区序列和所述gRNA的序列在上述识别对应关系下,实现了对EMX1、IL1RN、RUNX1、ZSCAN2基因的成功改造,改造成功率和效率高。
根据本发明的实施例,所述基因改造包括对预定位点进行基因敲除或表达调控。
在本发明的第五方面,本发明提出了一种细胞。根据本发明的实施例,上述细胞是根据前面所述的方法获得的细胞。根据本发明实施例的细胞的基因组得到了有效地靶向改造。
根据本发明的实施例,所述细胞为动物细胞、植物细胞或微生物细胞。进而利用根据本发明实施例的基因改造的方法可以获得基因组得到有效地靶向改造的各种细胞,例如动物细胞、植物细胞或微生物细胞,进而基于上述方法,也可以获得相应的特定基因改造的动物模型、转基因植物或微生物。
附图说明
图1是根据本发明实施例的SaCas9同源蛋白的进化树分析;
图2是根据本发明实施例的SaCas9同源蛋白PAM作用区域的同源序列比对结果;
图3是根据本发明实施例的EFYP重组荧光报告系统用于探究cCas9的倾向性的示意图;
图4是根据本发明实施例的优化gRNA骨架促进Cas9活性的结果图;
图5是根据本发明实施例的32种不同的cCas9的PAM识别活性的结果图;
图6是根据本发明实施例的V42在内源基因位置的编辑以及基因激活的活性的结果图;
图7是根据本发明实施例的V42、V17K在RRV PAM位置具备较高的活性的结果图;
图8是根据本发明实施例的cCas9系列突变体中具备增强的PAM的识别性的结果图;
图9是根据本发明实施例的SaCas9V21R和V21L在RRN位置的编辑效率的结果图;
图10是根据本发明实施例的评价SaCas9-KKH和V21R高保真版本的脱靶效应的结果图;
图11是根据本发明实施例的SaCas9同源蛋白的序列比对结果图;以及
图12是根据本发明实施例的正交全长Cas9在哺乳动物细胞中的切割活性的结果图。
具体实施方式
下面详细描述本发明的实施例,所述实施例的示例在附图中示出。下面通过参考附图描述的实施例是示例性的,旨在用于解释本发明,而不能理解为对本发明的限制。
需要说明的是,如无特别说明,本申请所述的“野生型蛋白”既指自然界天然存在的,又指现有技术已经改造存在的,如本申请所述的O13、O40、O23、O39、O26、O18、O38、O12、O36、O27、O10、O33、O34、O14、O44、O15、O28、O42、O20、O37、O24、O43、O30、O31、O32、O29、O16、O19、O25、O21、O17、O35、O22是自然界天然存在的与saCas9具有同源性的蛋白(参考图1右侧所标示的蛋白编号,此处以蛋白标号表示具有同源性的蛋白的名称),saCas9、SaCas9-KKH是现有技术中存在的,已经对自然界中存在的cas9蛋白进行基因改造后的蛋白,其中,saCas9的氨基酸序列如下所示:
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRELINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYNNDLIKINGELYRVIGVNNDLLNRIEVNMIDITYREYLENMNDKRPPRIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG
SaCas9-KKH相较于saCas9序列,具有E782K、N968K、R1015H突变。
如无特别说明,本申请所述的saCas9的PAM识别区是指982~994位氨基酸位置区段,其余氨基酸区段为框架区。
CRISPR/Cas9技术的出现为从基础生物研究到临床应用带来了革命性的突破。尽管Cas9蛋白在微生物界广泛存在,然而Cas9能够靶向的范围受到PAM的限制。在本申请的实施例中,发明人通过基因挖掘获得一系列SaCas9高度同源的蛋白,将其替换到SaCas9-KKH中PI结构域中负责与PAM直接相互作用区域。发明人构建了一系列嵌合Cas9(chimericCas9,cCas9),通过功能性测试,发明人选择PAM 4-6位置的64种不同的PAM组合。发明人鉴定出来多个cCas9突变体,具有在RRN的PAM位置具有增强的识别能力。同时拓展了在ACT、ATG、ATT、GCT、GTG、以及GTT PAM位置的识别能力。
整体说来,在以下实施例中,发明人提供了一系列cCas9突变体可以实现在哺乳动物细胞中1/3的位置可以被编辑。
实施例1基因组挖掘SaCas9高度同源蛋白
首先,发明人在NCBI数据库中通过BLAST程序搜索了SaCas9全长的同源蛋白。如图1所示(其中,进化树分析SaCas9的同源蛋白,其中除SaCas9以外,右列显示的是对应Cas9的编号),发明人发现了33中SaCas9的同源蛋白。其中11个同源蛋白来自Staphylococcus属,并且与SaCas9具有更高的序列同源性。并且发现有趣的现象,来自同一属的细菌中Cas9蛋白,往往序列同源性更高。对于不同的Cas9同源蛋白,发明人命名为O+数字,例如O21,O22等。
SaCas9-KKH识别PAM为NNNRRT,为了叙述方便,发明人简写作为RRT。如图2所示,通过同源序列比对,发明人发现与PAM相互作用的985、986和991所在的蛋白质区域具有较高的保守性。可以看到,982、983、990、992、993和994对应的位置就有较高的保守性。而直接负责与PAM相互作用的986和991位置具有较大的不同。因此发明人猜想,有可能这些不同的SaCas9具有不同的PAM识别能力。
其中第18号和39号同源蛋白具有相同的982-994肽段。同时,982-994氨基酸肽段的两端具备保守性。因此发明人将不同的同源蛋白的982-994位置的肽段嵌合到SaCas9-KKH中,共开发了32种嵌合Cas9蛋白(cCas9),其中,在以下实施例中,发明人将嵌合Cas9命名方式为:V+数字。例如,来源于O32Cas9的对应的982-994肽段插入到骨架中,构建的嵌合Cas9蛋白命名为V32,此外,发明人在嵌合Cas9蛋白的基础上的进一步突变,如在嵌合蛋白V21的991位置做进一步的点突变,突变为R,则命名为V21R。
实施例2cCas9展现了不同的PAM倾向性
发明人和其他课题组通过改变gRNA骨架中的第三位和第四位中的U可以用来改变gRNA骨架中的连续4个U序列,从而减少由于连续U被聚合酶III识别成提前终止信号。发明人利用CIRPSRfinder程序发现对应的不同的Cas9同源蛋白所在菌中的CRISPR位点。发明人分析出不同的Cas9蛋白所对应的crRNA的序列。有趣的是,发明人发现所有的SaCas9同源蛋白对应的crRNA直接重复区域序列具有序列不一致性,除了5‘端的6nt的序列具有序列一致性。发明人猜想,这是由于为了防止Cas9靶向切割编码crRNA的DNA序列。为了避免优化gRNA骨架导致4-6位置的gRNA的crRNA直接重复区域发生序列改变。发明人选择第三位进行改变。发明人通过将第三位的U改变为C产生了称为优化gRNA-2(optimized gRNA-2)的新的骨架。
发明人利用EYFP重组实验探究不同的cCas9对于不同的PAM位置的活性。如图3所示(其中,HDR表示同源重组,通过利用gRNA指导cCas9结合到EYFP中的结合位点,然后通过同源重组成全长EYFP荧光蛋白基因),在EYFP重组实验中,发明人分别构建了EYFP的N端和C端片段,N端和C端在其中具有序列的重合性。在N端和C端中间植入gRNA结合序列和对应的PAM序列,当Cas9在gRNA的驱动下,识别gRNA结合序列,在特定PAM下,具备切割活性,则可以使DNA发生断裂。断裂的DNA因为N端和C端具有序列的重合性,则可以发生同源重组。发生了同源重组的EYFP,具备完整的表达框,可以表达出来完整的荧光蛋白。发明人可以通过荧光蛋白的活性强弱来反应特定的cCas9对于特定PAM的识别能力。
对于PAM的4,5,6三个不同的位置,共有64种不同的组合。在HEK293FT细胞系中,发明人利用EYFP同源重组实验探究不同gRNA骨架对于SaCas9活性的影响。转染后三天,发明人通过FACS实验衡量了不同PAM下的SaCas9-KKH的活性。与前人报道的相一致,SaCas9-KKH当在原始gRNA骨架下载RRT的PAM位置展现很强的活性。同时在GGA,GGC和AGC展现了微弱的活性。当使用优化版本的gRNA时,发明人发现SaCas9-KKH在RRT的位置保持了高的活性。同时在RRV(V=A,C和G)、ATT和CGT展现了较弱的活性。表明优化后的gRNA表达有助于精确评价较弱的SaCas9的PAM倾向性,结果参见图4。因此在接下来的实验中,发明人使用优化的gRNA-2骨架作为gRNA骨架。
发明人遍历的测试了32种cCas9这64种不同的组合活性的影响。如图5所示,发明人发现2/3的cCas9具备明显的PAM识别性。其中V42,V17,V31,V32,V35突变体相对于SaCas9-KKH具有明显的在RRV(其中V=A,C和G)的活性拓展能力。其中V32在ACG和ACT也有明显的活性。V42展现在RRV最强的活性。
其次V24,V16和V21在ATG,ATT,GTG,GTT等位置展现拓展的PAM活性。V18虽然活性整体较弱,但展现较为不同的PAM倾向性。另外V15在ATA,ATC,GTA,GTC位置展现了活性。
整体说来,发明人发现了多个cCas9展现了不同的PAM倾向性。
实施例3
接下来,发明人选择突变体V17和V42进行进一步的分析。在上面的遍历研究中,发明人发现V17和V42展现相似的PAM倾向性,相比较SaCas9-KKH而言,在RRV的PAM位置上有增强的活性。发明人首先分析了982-994位置的序列比较。其中V42和V17以及SaCas9-KKH在986位置都是N,同时V42和V17SaCas9在991位置不相同。SaCas9-KKH与V42共有三个氨基酸的不同。因此接下来,发明人逐步进行氨基酸的突变,探究PAM的变化。考虑到991位置直接参与到与DNA的相互作用,发明人首先突变了SaCas9-KKH的991位置,将991R突变成K,发明人发现SaCas9-KKH(R991K)的突变体,相比较SaCas9-KKH而言,展现明显的RRV所在的PAM位置的增强,同时V42突变体与SaCas9-KKH在987位置的氨基酸也不相同,因此,发明人在991突变的基础上,进一步将987位置的氨基酸D突变成N,发明人发现在RRV位置,突变体的活性在RRV位置进一步提高。同时发明人衡量V42在64种不同的PAM处的突变体活性,发明人发现V42在RRN均展现非常强的活性。说明V42突变体具备识别RRN的扩展PAM活性。
考虑到V17也具备明显的RRV活性的拓展。因此,发明人也进一步分析了V17在64种不同PAM的活性拓展。发明人发现V17在RRN中,也有明显的活性。同时发明人将V17的991对应位置的氨基酸分别突变成K和L,发明人发现两个突变体维持了和V17一样的RRN的PAM倾向性。但是在一些活性较弱的位置,不尽相同。其中V17I991K突变体,在GCC和GCG有着明显的报告基因的活性。而V17I991L在GCA和GCT有着明显的活性。
既然发明人发现了多个扩展了RRV PAM倾向性的Cas9突变体。发明人进一步横向比较了V17I991K(V17K),V17L I991L(V17L),V42和SaCas9-KKH(KKH)在RRN16种不同PAM对应的活性。
发明人发现相对应于KKH,V17K,V17L和V42在RRV中均有明显的增强。而且在RRT的四个PAM位置上也有着一致的高活性。接下来,发明人使用V42,进一步验证在内源基因位置的编辑以及基因激活的活性。
发明人选择EMX1和ZSCAN2基因进行编辑,在HEK293FT细胞中,发明人转染靶向不同PAM的gRNA和突变体,发明人选择GGC,GAA,AGG,AGC进行测试,发明人选择1,2,4和8天四个时间节点进行观察,通过T7E1实验,发明人观察到在这四个位点上,V42更早的实现饱和,在1,2,4天均比SaCas9-KKH基因编辑的活性要高。而在AGT PAM的位置上,没有明显的差异。而在第八天这些PAM的位置都达到了饱和,同时不同的Cas9没有明显的差异,结果参见图6。
同时发明人测试了一些其他的RRN位置的PAM,发现在第8天后,V17,V42和SaCas9-KKH并没有明显的差异。同时,发明人设计gRNA靶向IL1RN的启动子区域,发明人针对于每一个PAM,设计四条gRNA,并且将12.5ng 4条gRNA与50ng dCas9:VPR融合蛋白共同转染到96孔板的HEK293FT细胞系中,经过rt—PCR实验,发明人发现不同的PAM位置对应gRNA均促进IL1RN的mRNA表达水平有明显的提高。其中,在GAT PAM对应的位置V42dCas9:VPR与dSaCas9-KKH:VPR没有明显的差异,同时在GGT位置,dSaCas9-KKH:VPR激活的效率比V42dCas9:VPR要高。但是在RRN的位置,三个不同的PAM位置均实现了明显的增强。综上,发明人发现的V42突变体增强了RRV PAM处的活性,结果参见图7(其中,A显示了V17K、V42和KKH在RRNPAM位置,经过8天,具备无差异的切割活性的结果图,B显示了V42突变体增强了RRVPAM处的活性的结果图)。
除了V17和V42之外,发明人发现V16和V21具备不相同的PAM倾向性。因此发明人进一步分析V16和V21的PAM特异性。参见图8B和C所示,发明人衡量了V16和V21在64种不同PAM的活性,发明人发现V16和V21在ATG、ATT、GTG、GTT有明显的报告系统的活性。类似的,发明人进一步突变V21的991位置,发明人发现当V21的991位置突变到L、K和R后,具备类似的PAM倾向性。同时V21I991R具备增强的ATG、ATT、GTG和GTT的PAM报告系统的活性。另外,V21I991R(V21R)在ACT位置也有明显的PAM活性。首先通过序列比对,发明人发现V16和V21在986位置均为S,不同于SaCas9-KKH的N。前人的三维结构解析发现,SaCas9-KKH中的986位置负责与4,5,6三个位置的DNA均有相互作用。因此,如图8D,发明人将SaCas9-KKH的986位置由N突变为S,发明人衡量了新的突变体的PAM倾向性,发明人发现的确,在突变到S后,有着相似的PAM倾向性。并且在TTT位置也有微弱的活性。
统计起来,发明人在ACT、ATG、ATT、GCT、GTG和GTT发现了一系列新的Cas9突变体,为了比较这些突变体在这些新的PAM位置的活性,发明人同时测定了突变体的活性。发明人发现在ACT中,V21I991R(V21R)的活性高于V16的活性。在ATG PAM的测试中发明人发现V21R和V21I991L(V21L)的活性远高于SaCas9-KKH的活性,同时V21L的活性较好。在ATT PAM中,SaCas9-KKH具有较弱的报告系统的活性,而V21L和V21R的活性高于SaCas9-KKH。在前面,发明人发现V42在GCT中也有一定的活性,因此发明人比较了V42、V21R和SaCas9-KKH的活性,发明人发现V42和V21的活性高于SaCas9-KKH的活性。最后,在GTG和GTT中,SaCas9-KKH只有本底值的表达,而V21L和V21R的活性远高于SaCas9-KKH。
实施例4
为了验证内源基因的编辑活性,发明人测试了不同位点的基因编辑效率。发明人首先衡量了ACT位置的编辑活性,发明人选择了四个不同的gRNA,通过横向比较,向HEK293FT细胞中转染50ng Cas9突变体与50ng gRNA编码质粒和编码嘌呤霉素抗性基因的质粒。经过8天的转染,利用T7E1检测方法,发明人发现SaCas9只有很弱的切割活性,而V21R可以实现约为15%的内源基因编辑强度。同时,在ATG的PAM位置,V21L和V21R实现了较为相似的编辑活性,均可以可以产生10%以上的indels,而SaCas9-KKH不能实现有效的切割。在ATT的PAM位置,发明人发现有一部分的gRNA,SaCas9-KKH可以实现较低水平的切割,而其他没有切割活性。V21R的活性高于V21L,平均indels的效率高于20%。在GCT的PAM中,SaCas9-KKH可以实现平均10%的indels,而V21R可以产生平均25%以上的indels。同时V42可以产生接近20%的indels。在GTG PAM的位置,发明人选择了15个不同的gRNA,在这15个位置当中,SaCas9-KKH只有极弱的活性。而V21L和V21R均可以实现20%左右的基因编辑活性。在GTT位置,没有检测到SaCas9-KKH的编辑活性,在V21L和V21R中可以检测到相应的indel产生。
综上,发明人验证了不同突变体在新发现的6个不同的PAM上编辑活性,提高了SaCas9编辑的靶向范围。
除了新发现的PAM的活性,同时发明人也衡量了V21R和V21L在RRN PAM上编辑活性。
如图9,发明人选择8个不同的gRNA,在这8个不同的位置,经过8天的转染,可以观察到5%-30%的编辑活性。其中,图9显示了在HEK293FT细胞中转染,经过8天,由SaCas9突变体产生的Indels效率通过T7E1实验衡量。数据有均值±均方差表示(n=3次独立转染实验)。N.D.表示没有检测到。V21R是V21I991R的缩写;V21L是I991L的缩写。
为了评价发明人发现的嵌合Cas9以及相关突变体的脱靶效应,发明人产生了一系列gRNA靶向AGT PAM的位置,与原始的gRNA相比,具有连续的2nt的突变。如图10所示(其中,图10显示了SaCas9-KKH(KKH)、cCas9V21R和V21R-HF(包含R499A、Q500K、R654A和G655R四个突变)在靶向位置以及脱靶位置的切割效率在HEK293FT细胞中通过EYFP重组报告系统衡量,所有的实验均在AGT PAM下进行测试,状图表示均值±均方差表示(n=3次独立转染实验),发明人发现对于SaCas9-KKH,在多个突变gRNA中,均可以检测到明显的EYFP的活性,提示脱靶效应的存在。同时在V21R中,可以看到类似的活性。最近已经报道,可以通过改变DNA/RNA异源二聚体结合区域的氨基酸残基的电荷分布从而可以改善特异性。因此发明人通过进一步将cCas9V21R进行氨基酸R499A、Q500K、R654A和G655R突变(V21R-HF)。通过EYFP重组报告系统,发明人发现在gRNA完全匹配的情况下,V21R-HF的活性有较高的活性,相比较cCas9V21R而言,保持了65%的活性。发明人通过改变gRNA的序列模拟脱靶位置,发明人遍历测试了连续两个碱基突变的情况,在20种不同的脱靶位置情况下,SaCas9-HF和V21R均可以检测到明显的脱靶效应。而测试的V21R-HF的则没有明显的报告系统的活性。
另外,发明人在进行嵌合测试的同时,发明人也在尝试进行嵌合的反向工程,试图克隆全长的SaCas9高度同源蛋白,分析PAM倾向性。首先发明人对SaCas9同源蛋白Orthorlog 32(O32,SshCas9)和Orthorlog35(O35,SlCas9)进行序列比对分析,如图11(其中,同源蛋白的序列比对通过Espript服务器产生,三角号表示在SaCas9-KKH(KKH)中对应的E782K/N968K/R1015H的三个点突变对应的位置)所示。通过序列比对发明人发现O32和O35与SaCas9具备高度的序列同源性,前人的工作表明,可以通过三个氨基酸突变将SaCas9的PAM从NNGRRT更改为NNNRRT。发明人发现这三个氨基酸在O32和O35中具备非常明显的对应关系。
如图12A所示,因此发明人将对应的三个氨基酸分别进行突变,分别命名为SshCas9-KKH和SlCas9-KKH。发明人首先遍历测试了这两个新的同源蛋白在PAM4,5,6位置的倾向性。然后发明人横向比较两种全长蛋白与对应嵌合蛋白的PAM倾向性的差异,如图12B所示。在图12中,A显示了Ortholog 32(O32,SshCas9)在E782K/N968K/R1015H处的三个点突变(SshCas9-KKH),Ortholog35(O35,SlCas9)在Q782K/Y968K/R1013H处的三个点突变(SlCas9-KKH)。B和C显示了在HEK293FT细胞中,经过3天的转染使用EYFP重组实验研究SaCas9同源蛋白和嵌合蛋白在64种不同PAM位置的切割活性。数据是经过三次独立转染的均值。D显示了经过8天的转染,利用T7E1实验测试SlCas9-KKH和SshCas9-KKH在6个PAM位置处的基因编辑活性。数据表示均值±均方差(n=3次独立转染实验)。N.D.表示未检测到。
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。
尽管上面已经示出和描述了本发明的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本发明的限制,本领域的普通技术人员在本发明的范围内可以对上述实施例进行变化、修改、替换和变型。
SEQUENCE LISTING
<110> 清华大学
<120> 基于进化信息构建嵌合 SaCas9用于增强和扩展PAM 位点的识别
<130> PIDC3181386
<160> 132
<170> PatentIn version 3.3
<210> 1
<211> 981
<212> PRT
<213> Artificial
<220>
<223> 框架区序列
<400> 1
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val
980
<210> 2
<211> 59
<212> PRT
<213> Artificial
<220>
<223> 框架区序列
<400> 2
Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met Asn Asp
1 5 10 15
Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys Thr Gln Ser
20 25 30
Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu Tyr Glu Val Lys
35 40 45
Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
50 55
<210> 3
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 3
Arg Ser Asp Ser Ser Pro Arg Glu Asn Arg Leu Glu Val
1 5 10
<210> 4
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 4
Lys Gly Asp Ala Met Pro Arg Gly Asn Lys Ile Glu Ile
1 5 10
<210> 5
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 5
Thr Ala Thr Asn Asn Asp Lys Ser Asn Lys Ile Glu Val
1 5 10
<210> 6
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 6
Leu Gly Asp Ala Asn Ser Arg Gln Asn Ile Leu Glu Ala
1 5 10
<210> 7
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 7
Ile Gly Val Asn His Asp Glu Gly Asn Arg Ile Glu Met
1 5 10
<210> 8
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 8
Ile Gly Val Asn Ser Asp Lys Asn Asn Leu Ile Glu Val
1 5 10
<210> 9
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 9
Ile Gly Val Asn Asn Ser Thr Arg Asn Ile Val Glu Leu
1 5 10
<210> 10
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 10
Arg Gly Asp Asn Asn Pro Arg Gln Asn Lys Leu Glu Val
1 5 10
<210> 11
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 11
Ile Gly Val Asn Asn Asp Lys Asn Asn Val Ile Glu Leu
1 5 10
<210> 12
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 12
Ile Gly Ile Asn Asp Asn Lys His Asn Lys Ile Glu Leu
1 5 10
<210> 13
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区
<400> 13
Ile Gly Val Asn Ser Asp Asp Arg Asn Ile Ile Glu Leu
1 5 10
<210> 14
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 14
Ile Gly Val Asn Asp Ser Glu Lys Asn Lys Ile Gln Leu
1 5 10
<210> 15
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 15
Lys Cys Ile Asn Asn Glu Lys Thr His Arg Ile Glu Ile
1 5 10
<210> 16
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 16
Ile Gly Val Asn His Asp Lys Thr Asn Arg Ile Glu Cys
1 5 10
<210> 17
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 17
Ile Gly Val Asn Glu Asp Lys Arg Asn Ile Ile Glu Leu
1 5 10
<210> 18
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 18
Arg Gly Asp Ser Ser Pro Arg Glu Asn Arg Phe Glu Val
1 5 10
<210> 19
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 19
Arg Gly Asp Asn Asp Pro Lys Ala Asn Lys Ile Glu Val
1 5 10
<210> 20
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 20
Ile Gly Val Asn Ala Glu Lys Arg Asn Thr Ile Glu Val
1 5 10
<210> 21
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 21
Ile Gly Val Asn Asp Asp Ala Lys Asn Thr Leu Glu Leu
1 5 10
<210> 22
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 22
Val Gly Val Asn Asn Asp Ser Val Asn Arg Val Glu Leu
1 5 10
<210> 23
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 23
Val Gly Val Asn Asn Asp Thr Arg Asn Val Val Glu Leu
1 5 10
<210> 24
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 24
Val Gly Val Asn Asn Asp Ser Arg Asn Val Val Glu Leu
1 5 10
<210> 25
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 25
Arg Gly Asp Ser Met Pro Arg Gln Asn Lys Ile Glu Met
1 5 10
<210> 26
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 26
Arg Gly Asp Ala Met Pro Arg Asp Asn Lys Ile Glu Val
1 5 10
<210> 27
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 27
Ile Gly Ile Asn Asn Gly Asp Lys Asn Leu Val Glu Leu
1 5 10
<210> 28
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 28
Arg Gly Asp Asn Asn Pro Arg Gln Asn Met Ile Glu Val
1 5 10
<210> 29
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 29
Ile Gly Val Asn Asn Asp Ser Thr Asn Arg Val Glu Leu
1 5 10
<210> 30
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 30
Arg Gly Asp Asn Asp Pro Arg Arg Ser Thr Ile Glu Leu
1 5 10
<210> 31
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 31
Arg Gly Asp Asn Asn Pro Arg Gln Asn Lys Leu Glu Val
1 5 10
<210> 32
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 32
Thr Ala Thr Asn Asn Asp Lys Lys Asn Met Ile Glu Val
1 5 10
<210> 33
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 33
Ile Gly Val Asn Asn Asn Arg Leu Asn Lys Ile Glu Leu
1 5 10
<210> 34
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 34
Ile Gly Val Phe Ser Asp Ala Gly Asn Leu Leu Glu Val
1 5 10
<210> 35
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 35
Ile Gly Asp Asn Asn Pro Arg Asn Asn Val Ile Glu Val
1 5 10
<210> 36
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 36
Ile Gly Val Asn Ser Asp Asp Arg Asn Leu Ile Glu Leu
1 5 10
<210> 37
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 37
Ile Gly Val Asn Ser Asp Asp Arg Asn Lys Ile Glu Leu
1 5 10
<210> 38
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 38
Ile Gly Val Asn Ser Asp Asp Arg Asn Arg Ile Glu Leu
1 5 10
<210> 39
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 39
Ile Gly Val Asn Ser Asp Leu Leu Asn Arg Ile Glu Val
1 5 10
<210> 40
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 40
Ile Gly Val Asn Asn Asn Leu Leu Asn Lys Ile Glu Val
1 5 10
<210> 41
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 41
Ile Gly Val Asn Asn Asp Leu Leu Asn Lys Ile Glu Val
1 5 10
<210> 42
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区
<400> 42
Ile Gly Val Asn Asn Ser Thr Arg Asn Ile Lys Glu Leu
1 5 10
<210> 43
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体的PAM识别区序列
<400> 43
Ile Gly Val Asn Asn Ser Thr Arg Asn Ile Leu Glu Leu
1 5 10
<210> 44
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 44
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Ser Asp Ser Ser Pro Arg Glu Asn Arg Leu
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 45
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 45
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Lys Gly Asp Ala Met Pro Arg Gly Asn Lys Ile
980 985 990
Glu Ile Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 46
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 46
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Thr Ala Thr Asn Asn Asp Lys Ser Asn Lys Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 47
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 47
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Leu Gly Asp Ala Asn Ser Arg Gln Asn Ile Leu
980 985 990
Glu Ala Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 48
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 48
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn His Asp Glu Gly Asn Arg Ile
980 985 990
Glu Met Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 49
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 49
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Ser Asp Lys Asn Asn Leu Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 50
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 50
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Ser Thr Arg Asn Ile Val
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 51
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 51
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Gly Asp Asn Asn Pro Arg Gln Asn Lys Leu
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 52
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 52
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Lys Asn Asn Val Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 53
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 53
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Ile Asn Asp Asn Lys His Asn Lys Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 54
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 54
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Ser Asp Asp Arg Asn Ile Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 55
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 55
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asp Ser Glu Lys Asn Lys Ile
980 985 990
Gln Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 56
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 56
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Lys Cys Ile Asn Asn Glu Lys Thr His Arg Ile
980 985 990
Glu Ile Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 57
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 57
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn His Asp Lys Thr Asn Arg Ile
980 985 990
Glu Cys Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 58
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 58
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Glu Asp Lys Arg Asn Ile Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 59
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 59
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Gly Asp Ser Ser Pro Arg Glu Asn Arg Phe
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 60
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 60
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Gly Asp Asn Asp Pro Lys Ala Asn Lys Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 61
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 61
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Ala Glu Lys Arg Asn Thr Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 62
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 62
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asp Asp Ala Lys Asn Thr Leu
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 63
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 63
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Val Gly Val Asn Asn Asp Ser Val Asn Arg Val
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 64
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 64
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Val Gly Val Asn Asn Asp Thr Arg Asn Val Val
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 65
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 65
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Val Gly Val Asn Asn Asp Ser Arg Asn Val Val
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 66
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 66
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Gly Asp Ser Met Pro Arg Gln Asn Lys Ile
980 985 990
Glu Met Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 67
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 67
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Gly Asp Ala Met Pro Arg Asp Asn Lys Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 68
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 68
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Ile Asn Asn Gly Asp Lys Asn Leu Val
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 69
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 69
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Gly Asp Asn Asn Pro Arg Gln Asn Met Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 70
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 70
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Ser Thr Asn Arg Val
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 71
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 71
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Gly Asp Asn Asp Pro Arg Arg Ser Thr Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 72
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 72
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Arg Gly Asp Asn Asn Pro Arg Gln Asn Lys Leu
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 73
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 73
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Thr Ala Thr Asn Asn Asp Lys Lys Asn Met Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 74
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 74
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asn Arg Leu Asn Lys Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 75
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 75
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Phe Ser Asp Ala Gly Asn Leu Leu
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 76
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 76
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Asp Asn Asn Pro Arg Asn Asn Val Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 77
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 77
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Ser Asp Asp Arg Asn Leu Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 78
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 78
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Ser Asp Asp Arg Asn Lys Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 79
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 79
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Ser Asp Asp Arg Asn Arg Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 80
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 80
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Ser Asp Leu Leu Asn Arg Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 81
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 81
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asn Leu Leu Asn Lys Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 82
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 82
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Lys Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 83
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 83
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Ser Thr Arg Asn Ile Lys
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 84
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 84
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Ser Thr Arg Asn Ile Leu
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 85
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸的核苷酸序列
<400> 85
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaggagcg acagcagccc cagggagaac aggctggagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 86
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸的核苷酸序列
<400> 86
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaagggcg acgccatgcc caggggcaac aagatcgaga tctgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 87
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体序列
<400> 87
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaccgcca ccaacaacga caagagcaac aagatcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 88
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体核酸的序列
<400> 88
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgctgggcg acgccaacag caggcagaac atcctggagg cctgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 89
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸的序列
<400> 89
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaaccacga cgagggcaac aggatcgaga tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 90
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸的序列
<400> 90
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacagcga caagaacaac ctgatcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 91
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 91
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacaacag caccaggaac atcgtggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 92
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 92
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaggggcg acaacaaccc caggcagaac aagctggagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 93
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 93
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacaacga caagaacaac gtgatcgagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 94
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码前面所述的Cas9蛋白突变体的核酸序列
<400> 94
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggca tcaacgacaa caagcacaac aagatcgagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 95
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸的核苷酸序列
<400> 95
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacagcga cgacaggaac atcatcgagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 96
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸的序列
<400> 96
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacgacag cgagaagaac aagatccagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 97
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 97
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaagtgca tcaacaacga gaagacccac aggatcgaga tctgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 98
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸的序列
<400> 98
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaaccacga caagaccaac aggatcgagt gctgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 99
<211> 3182
<212> PRT
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 99
Ala Thr Gly Ala Ala Gly Cys Gly Gly Ala Ala Cys Thr Ala Cys Ala
1 5 10 15
Thr Cys Cys Thr Gly Gly Gly Cys Cys Thr Gly Gly Cys Cys Ala Thr
20 25 30
Cys Gly Gly Cys Ala Thr Cys Ala Cys Cys Ala Gly Cys Gly Thr Gly
35 40 45
Gly Gly Cys Thr Ala Cys Gly Gly Cys Ala Thr Cys Ala Thr Cys Gly
50 55 60
Ala Cys Thr Ala Cys Gly Ala Gly Ala Cys Ala Cys Gly Gly Gly Ala
65 70 75 80
Cys Gly Thr Gly Ala Thr Cys Gly Ala Thr Gly Cys Cys Gly Gly Cys
85 90 95
Gly Thr Gly Cys Gly Gly Cys Thr Gly Thr Thr Cys Ala Ala Ala Gly
100 105 110
Ala Gly Gly Cys Cys Ala Ala Cys Gly Thr Gly Gly Ala Ala Ala Ala
115 120 125
Cys Ala Ala Cys Gly Ala Gly Gly Gly Cys Ala Gly Gly Cys Gly Gly
130 135 140
Ala Gly Cys Ala Ala Gly Ala Gly Ala Gly Gly Cys Gly Cys Cys Ala
145 150 155 160
Gly Ala Ala Gly Gly Cys Thr Gly Ala Ala Gly Cys Gly Gly Cys Gly
165 170 175
Gly Ala Gly Gly Cys Gly Gly Cys Ala Thr Ala Gly Ala Ala Thr Cys
180 185 190
Cys Ala Gly Ala Gly Ala Gly Thr Gly Ala Ala Gly Ala Ala Gly Cys
195 200 205
Thr Gly Cys Thr Gly Thr Thr Cys Gly Ala Cys Thr Ala Cys Ala Ala
210 215 220
Cys Cys Thr Gly Cys Thr Gly Ala Cys Cys Gly Ala Cys Cys Ala Cys
225 230 235 240
Ala Gly Cys Gly Ala Gly Cys Thr Gly Ala Gly Cys Gly Gly Cys Ala
245 250 255
Thr Cys Ala Ala Cys Cys Cys Cys Thr Ala Cys Gly Ala Gly Gly Cys
260 265 270
Cys Ala Gly Ala Gly Thr Gly Ala Ala Gly Gly Gly Cys Cys Thr Gly
275 280 285
Ala Gly Cys Cys Ala Gly Ala Ala Gly Cys Thr Gly Ala Gly Cys Gly
290 295 300
Ala Gly Gly Ala Ala Gly Ala Gly Thr Thr Cys Thr Cys Thr Gly Cys
305 310 315 320
Cys Gly Cys Cys Cys Thr Gly Cys Thr Gly Cys Ala Cys Cys Thr Gly
325 330 335
Gly Cys Cys Ala Ala Gly Ala Gly Ala Ala Gly Ala Gly Gly Cys Gly
340 345 350
Thr Gly Cys Ala Cys Ala Ala Cys Gly Thr Gly Ala Ala Cys Gly Ala
355 360 365
Gly Gly Thr Gly Gly Ala Ala Gly Ala Gly Gly Ala Cys Ala Cys Cys
370 375 380
Gly Gly Cys Ala Ala Cys Gly Ala Gly Cys Thr Gly Thr Cys Cys Ala
385 390 395 400
Cys Cys Ala Ala Ala Gly Ala Gly Cys Ala Gly Ala Thr Cys Ala Gly
405 410 415
Cys Cys Gly Gly Ala Ala Cys Ala Gly Cys Ala Ala Gly Gly Cys Cys
420 425 430
Cys Thr Gly Gly Ala Ala Gly Ala Gly Ala Ala Ala Thr Ala Cys Gly
435 440 445
Thr Gly Gly Cys Cys Gly Ala Ala Cys Thr Gly Cys Ala Gly Cys Thr
450 455 460
Gly Gly Ala Ala Cys Gly Gly Cys Thr Gly Ala Ala Gly Ala Ala Ala
465 470 475 480
Gly Ala Cys Gly Gly Cys Gly Ala Ala Gly Thr Gly Cys Gly Gly Gly
485 490 495
Gly Cys Ala Gly Cys Ala Thr Cys Ala Ala Cys Ala Gly Ala Thr Thr
500 505 510
Cys Ala Ala Gly Ala Cys Cys Ala Gly Cys Gly Ala Cys Thr Ala Cys
515 520 525
Gly Thr Gly Ala Ala Ala Gly Ala Ala Gly Cys Cys Ala Ala Ala Cys
530 535 540
Ala Gly Cys Thr Gly Cys Thr Gly Ala Ala Gly Gly Thr Gly Cys Ala
545 550 555 560
Gly Ala Ala Gly Gly Cys Cys Thr Ala Cys Cys Ala Cys Cys Ala Gly
565 570 575
Cys Thr Gly Gly Ala Cys Cys Ala Gly Ala Gly Cys Thr Thr Cys Ala
580 585 590
Thr Cys Gly Ala Cys Ala Cys Cys Thr Ala Cys Ala Thr Cys Gly Ala
595 600 605
Cys Cys Thr Gly Cys Thr Gly Gly Ala Ala Ala Cys Cys Cys Gly Gly
610 615 620
Cys Gly Gly Ala Cys Cys Thr Ala Cys Thr Ala Thr Gly Ala Gly Gly
625 630 635 640
Gly Ala Cys Cys Thr Gly Gly Cys Gly Ala Gly Gly Gly Cys Ala Gly
645 650 655
Cys Cys Cys Cys Thr Thr Cys Gly Gly Cys Thr Gly Gly Ala Ala Gly
660 665 670
Gly Ala Cys Ala Thr Cys Ala Ala Ala Gly Ala Ala Thr Gly Gly Thr
675 680 685
Ala Cys Gly Ala Gly Ala Thr Gly Cys Thr Gly Ala Thr Gly Gly Gly
690 695 700
Cys Cys Ala Cys Thr Gly Cys Ala Cys Cys Thr Ala Cys Thr Thr Cys
705 710 715 720
Cys Cys Cys Gly Ala Gly Gly Ala Ala Cys Thr Gly Cys Gly Gly Ala
725 730 735
Gly Cys Gly Thr Gly Ala Ala Gly Thr Ala Cys Gly Cys Cys Thr Ala
740 745 750
Cys Ala Ala Cys Gly Cys Cys Gly Ala Cys Cys Thr Gly Thr Ala Cys
755 760 765
Ala Ala Cys Gly Cys Cys Cys Thr Gly Ala Ala Cys Gly Ala Cys Cys
770 775 780
Thr Gly Ala Ala Cys Ala Ala Thr Cys Thr Cys Gly Thr Gly Ala Thr
785 790 795 800
Cys Ala Cys Cys Ala Gly Gly Gly Ala Cys Gly Ala Gly Ala Ala Cys
805 810 815
Gly Ala Gly Ala Ala Gly Cys Thr Gly Gly Ala Ala Thr Ala Thr Thr
820 825 830
Ala Cys Gly Ala Gly Ala Ala Gly Thr Thr Cys Cys Ala Gly Ala Thr
835 840 845
Cys Ala Thr Cys Gly Ala Gly Ala Ala Cys Gly Thr Gly Thr Thr Cys
850 855 860
Ala Ala Gly Cys Ala Gly Ala Ala Gly Ala Ala Gly Ala Ala Gly Cys
865 870 875 880
Cys Cys Ala Cys Cys Cys Thr Gly Ala Ala Gly Cys Ala Gly Ala Thr
885 890 895
Cys Gly Cys Cys Ala Ala Ala Gly Ala Ala Ala Thr Cys Cys Thr Cys
900 905 910
Gly Thr Gly Ala Ala Cys Gly Ala Ala Gly Ala Gly Gly Ala Thr Ala
915 920 925
Thr Thr Ala Ala Gly Gly Gly Cys Thr Ala Cys Ala Gly Ala Gly Thr
930 935 940
Gly Ala Cys Cys Ala Gly Cys Ala Cys Cys Gly Gly Cys Ala Ala Gly
945 950 955 960
Cys Cys Cys Gly Ala Gly Thr Thr Cys Ala Cys Cys Ala Ala Cys Cys
965 970 975
Thr Gly Ala Ala Gly Gly Thr Gly Thr Ala Cys Cys Ala Cys Gly Ala
980 985 990
Cys Ala Thr Cys Ala Ala Gly Gly Ala Cys Ala Thr Thr Ala Cys Cys
995 1000 1005
Gly Cys Cys Cys Gly Gly Ala Ala Ala Gly Ala Gly Ala Thr Thr
1010 1015 1020
Ala Thr Thr Gly Ala Gly Ala Ala Cys Gly Cys Cys Gly Ala Gly
1025 1030 1035
Cys Thr Gly Cys Thr Gly Gly Ala Thr Cys Ala Gly Ala Thr Thr
1040 1045 1050
Gly Cys Cys Ala Ala Gly Ala Thr Cys Cys Thr Gly Ala Cys Cys
1055 1060 1065
Ala Thr Cys Thr Ala Cys Cys Ala Gly Ala Gly Cys Ala Gly Cys
1070 1075 1080
Gly Ala Gly Gly Ala Cys Ala Thr Cys Cys Ala Gly Gly Ala Ala
1085 1090 1095
Gly Ala Ala Cys Thr Gly Ala Cys Cys Ala Ala Thr Cys Thr Gly
1100 1105 1110
Ala Ala Cys Thr Cys Cys Gly Ala Gly Cys Thr Gly Ala Cys Cys
1115 1120 1125
Cys Ala Gly Gly Ala Ala Gly Ala Gly Ala Thr Cys Gly Ala Gly
1130 1135 1140
Cys Ala Gly Ala Thr Cys Thr Cys Thr Ala Ala Thr Cys Thr Gly
1145 1150 1155
Ala Ala Gly Gly Gly Cys Thr Ala Thr Ala Cys Cys Gly Gly Cys
1160 1165 1170
Ala Cys Cys Cys Ala Cys Ala Ala Cys Cys Thr Gly Ala Gly Cys
1175 1180 1185
Cys Thr Gly Ala Ala Gly Gly Cys Cys Ala Thr Cys Ala Ala Cys
1190 1195 1200
Cys Thr Gly Ala Thr Cys Cys Thr Gly Gly Ala Cys Gly Ala Gly
1205 1210 1215
Cys Thr Gly Thr Gly Gly Cys Ala Cys Ala Cys Cys Ala Ala Cys
1220 1225 1230
Gly Ala Cys Ala Ala Cys Cys Ala Gly Ala Thr Cys Gly Cys Thr
1235 1240 1245
Ala Thr Cys Thr Thr Cys Ala Ala Cys Cys Gly Gly Cys Thr Gly
1250 1255 1260
Ala Ala Gly Cys Thr Gly Gly Thr Gly Cys Cys Cys Ala Ala Gly
1265 1270 1275
Ala Ala Gly Gly Thr Gly Gly Ala Cys Cys Thr Gly Thr Cys Cys
1280 1285 1290
Cys Ala Gly Cys Ala Gly Ala Ala Ala Gly Ala Gly Ala Thr Cys
1295 1300 1305
Cys Cys Cys Ala Cys Cys Ala Cys Cys Cys Thr Gly Gly Thr Gly
1310 1315 1320
Gly Ala Cys Gly Ala Cys Thr Thr Cys Ala Thr Cys Cys Thr Gly
1325 1330 1335
Ala Gly Cys Cys Cys Cys Gly Thr Cys Gly Thr Gly Ala Ala Gly
1340 1345 1350
Ala Gly Ala Ala Gly Cys Thr Thr Cys Ala Thr Cys Cys Ala Gly
1355 1360 1365
Ala Gly Cys Ala Thr Cys Ala Ala Ala Gly Thr Gly Ala Thr Cys
1370 1375 1380
Ala Ala Cys Gly Cys Cys Ala Thr Cys Ala Thr Cys Ala Ala Gly
1385 1390 1395
Ala Ala Gly Thr Ala Cys Gly Gly Cys Cys Thr Gly Cys Cys Cys
1400 1405 1410
Ala Ala Cys Gly Ala Cys Ala Thr Cys Ala Thr Thr Ala Thr Cys
1415 1420 1425
Gly Ala Gly Cys Thr Gly Gly Cys Cys Cys Gly Cys Gly Ala Gly
1430 1435 1440
Ala Ala Gly Ala Ala Cys Thr Cys Cys Ala Ala Gly Gly Ala Cys
1445 1450 1455
Gly Cys Cys Cys Ala Gly Ala Ala Ala Ala Thr Gly Ala Thr Cys
1460 1465 1470
Ala Ala Cys Gly Ala Gly Ala Thr Gly Cys Ala Gly Ala Ala Gly
1475 1480 1485
Cys Gly Gly Ala Ala Cys Cys Gly Gly Cys Ala Gly Ala Cys Cys
1490 1495 1500
Ala Ala Cys Gly Ala Gly Cys Gly Gly Ala Thr Cys Gly Ala Gly
1505 1510 1515
Gly Ala Ala Ala Thr Cys Ala Thr Cys Cys Gly Gly Ala Cys Cys
1520 1525 1530
Ala Cys Cys Gly Gly Cys Ala Ala Ala Gly Ala Gly Ala Ala Cys
1535 1540 1545
Gly Cys Cys Ala Ala Gly Thr Ala Cys Cys Thr Gly Ala Thr Cys
1550 1555 1560
Gly Ala Gly Ala Ala Gly Ala Thr Cys Ala Ala Gly Cys Thr Gly
1565 1570 1575
Cys Ala Cys Gly Ala Cys Ala Thr Gly Cys Ala Gly Gly Ala Ala
1580 1585 1590
Gly Gly Cys Ala Ala Gly Thr Gly Cys Cys Thr Gly Thr Ala Cys
1595 1600 1605
Ala Gly Cys Cys Thr Gly Gly Ala Ala Gly Cys Cys Ala Thr Cys
1610 1615 1620
Cys Cys Thr Cys Thr Gly Gly Ala Ala Gly Ala Thr Cys Thr Gly
1625 1630 1635
Cys Thr Gly Ala Ala Cys Ala Ala Cys Cys Cys Cys Thr Thr Cys
1640 1645 1650
Ala Ala Cys Thr Ala Thr Gly Ala Gly Gly Thr Gly Gly Ala Cys
1655 1660 1665
Cys Ala Cys Ala Thr Cys Ala Thr Cys Cys Cys Cys Ala Gly Ala
1670 1675 1680
Ala Gly Cys Gly Thr Gly Thr Cys Cys Thr Thr Cys Gly Ala Cys
1685 1690 1695
Ala Ala Cys Ala Gly Cys Thr Thr Cys Ala Ala Cys Ala Ala Cys
1700 1705 1710
Ala Ala Gly Gly Thr Gly Cys Thr Cys Gly Thr Gly Ala Ala Gly
1715 1720 1725
Cys Ala Gly Gly Ala Ala Gly Ala Ala Gly Cys Cys Ala Gly Cys
1730 1735 1740
Ala Ala Gly Ala Ala Gly Gly Gly Cys Ala Ala Cys Cys Gly Gly
1745 1750 1755
Ala Cys Cys Cys Cys Ala Thr Thr Cys Cys Ala Gly Thr Ala Cys
1760 1765 1770
Cys Thr Gly Ala Gly Cys Ala Gly Cys Ala Gly Cys Gly Ala Cys
1775 1780 1785
Ala Gly Cys Ala Ala Gly Ala Thr Cys Ala Gly Cys Thr Ala Cys
1790 1795 1800
Gly Ala Ala Ala Cys Cys Thr Thr Cys Ala Ala Gly Ala Ala Gly
1805 1810 1815
Cys Ala Cys Ala Thr Cys Cys Thr Gly Ala Ala Thr Cys Thr Gly
1820 1825 1830
Gly Cys Cys Ala Ala Gly Gly Gly Cys Ala Ala Gly Gly Gly Cys
1835 1840 1845
Ala Gly Ala Ala Thr Cys Ala Gly Cys Ala Ala Gly Ala Cys Cys
1850 1855 1860
Ala Ala Gly Ala Ala Ala Gly Ala Gly Thr Ala Thr Cys Thr Gly
1865 1870 1875
Cys Thr Gly Gly Ala Ala Gly Ala Ala Cys Gly Gly Gly Ala Cys
1880 1885 1890
Ala Thr Cys Ala Ala Cys Ala Gly Gly Thr Thr Cys Thr Cys Cys
1895 1900 1905
Gly Thr Gly Cys Ala Gly Ala Ala Ala Gly Ala Cys Thr Thr Cys
1910 1915 1920
Ala Thr Cys Ala Ala Cys Cys Gly Gly Ala Ala Cys Cys Thr Gly
1925 1930 1935
Gly Thr Gly Gly Ala Thr Ala Cys Cys Ala Gly Ala Thr Ala Cys
1940 1945 1950
Gly Cys Cys Ala Cys Cys Ala Gly Ala Gly Gly Cys Cys Thr Gly
1955 1960 1965
Ala Thr Gly Ala Ala Cys Cys Thr Gly Cys Thr Gly Cys Gly Gly
1970 1975 1980
Ala Gly Cys Thr Ala Cys Thr Thr Cys Ala Gly Ala Gly Thr Gly
1985 1990 1995
Ala Ala Cys Ala Ala Cys Cys Thr Gly Gly Ala Cys Gly Thr Gly
2000 2005 2010
Ala Ala Ala Gly Thr Gly Ala Ala Gly Thr Cys Cys Ala Thr Cys
2015 2020 2025
Ala Ala Thr Gly Gly Cys Gly Gly Cys Thr Thr Cys Ala Cys Cys
2030 2035 2040
Ala Gly Cys Thr Thr Thr Cys Thr Gly Cys Gly Gly Cys Gly Gly
2045 2050 2055
Ala Ala Gly Thr Gly Gly Ala Ala Gly Thr Thr Thr Ala Ala Gly
2060 2065 2070
Ala Ala Ala Gly Ala Gly Cys Gly Gly Ala Ala Cys Ala Ala Gly
2075 2080 2085
Gly Gly Gly Thr Ala Cys Ala Ala Gly Cys Ala Cys Cys Ala Cys
2090 2095 2100
Gly Cys Cys Gly Ala Gly Gly Ala Cys Gly Cys Cys Cys Thr Gly
2105 2110 2115
Ala Thr Cys Ala Thr Thr Gly Cys Cys Ala Ala Cys Gly Cys Cys
2120 2125 2130
Gly Ala Thr Thr Thr Cys Ala Thr Cys Thr Thr Cys Ala Ala Ala
2135 2140 2145
Gly Ala Gly Thr Gly Gly Ala Ala Gly Ala Ala Ala Cys Thr Gly
2150 2155 2160
Gly Ala Cys Ala Ala Gly Gly Cys Cys Ala Ala Ala Ala Ala Ala
2165 2170 2175
Gly Thr Gly Ala Thr Gly Gly Ala Ala Ala Ala Cys Cys Ala Gly
2180 2185 2190
Ala Thr Gly Thr Thr Cys Gly Ala Gly Gly Ala Ala Ala Ala Gly
2195 2200 2205
Cys Ala Gly Gly Cys Cys Gly Ala Gly Ala Gly Cys Ala Thr Gly
2210 2215 2220
Cys Cys Cys Gly Ala Gly Ala Thr Cys Gly Ala Ala Ala Cys Cys
2225 2230 2235
Gly Ala Gly Cys Ala Gly Gly Ala Gly Thr Ala Cys Ala Ala Ala
2240 2245 2250
Gly Ala Gly Ala Thr Cys Thr Thr Cys Ala Thr Cys Ala Cys Cys
2255 2260 2265
Cys Cys Cys Cys Ala Cys Cys Ala Gly Ala Thr Cys Ala Ala Gly
2270 2275 2280
Cys Ala Cys Ala Thr Thr Ala Ala Gly Gly Ala Cys Thr Thr Cys
2285 2290 2295
Ala Ala Gly Gly Ala Cys Thr Ala Cys Ala Ala Gly Thr Ala Cys
2300 2305 2310
Ala Gly Cys Cys Ala Cys Cys Gly Gly Gly Thr Gly Gly Ala Cys
2315 2320 2325
Ala Ala Gly Ala Ala Gly Cys Cys Thr Ala Ala Thr Ala Gly Ala
2330 2335 2340
Ala Ala Gly Cys Thr Gly Ala Thr Thr Ala Ala Cys Gly Ala Cys
2345 2350 2355
Ala Cys Cys Cys Thr Gly Thr Ala Cys Thr Cys Cys Ala Cys Cys
2360 2365 2370
Cys Gly Gly Ala Ala Gly Gly Ala Cys Gly Ala Cys Ala Ala Gly
2375 2380 2385
Gly Gly Cys Ala Ala Cys Ala Cys Cys Cys Thr Gly Ala Thr Cys
2390 2395 2400
Gly Thr Gly Ala Ala Cys Ala Ala Thr Cys Thr Gly Ala Ala Cys
2405 2410 2415
Gly Gly Cys Cys Thr Gly Thr Ala Cys Gly Ala Cys Ala Ala Gly
2420 2425 2430
Gly Ala Cys Ala Ala Thr Gly Ala Cys Ala Ala Gly Cys Thr Gly
2435 2440 2445
Ala Ala Ala Ala Ala Gly Cys Thr Gly Ala Thr Cys Ala Ala Cys
2450 2455 2460
Ala Ala Gly Ala Gly Cys Cys Cys Cys Gly Ala Ala Ala Ala Gly
2465 2470 2475
Cys Thr Gly Cys Thr Gly Ala Thr Gly Thr Ala Cys Cys Ala Cys
2480 2485 2490
Cys Ala Cys Gly Ala Cys Cys Cys Cys Cys Ala Gly Ala Cys Cys
2495 2500 2505
Thr Ala Cys Cys Ala Gly Ala Ala Ala Cys Thr Gly Ala Ala Gly
2510 2515 2520
Cys Thr Gly Ala Thr Thr Ala Thr Gly Gly Ala Ala Cys Ala Gly
2525 2530 2535
Thr Ala Cys Gly Gly Cys Gly Ala Cys Gly Ala Gly Ala Ala Gly
2540 2545 2550
Ala Ala Thr Cys Cys Cys Cys Thr Gly Thr Ala Cys Ala Ala Gly
2555 2560 2565
Thr Ala Cys Thr Ala Cys Gly Ala Gly Gly Ala Ala Ala Cys Cys
2570 2575 2580
Gly Gly Gly Ala Ala Cys Thr Ala Cys Cys Thr Gly Ala Cys Cys
2585 2590 2595
Ala Ala Gly Thr Ala Cys Thr Cys Cys Ala Ala Ala Ala Ala Gly
2600 2605 2610
Gly Ala Cys Ala Ala Cys Gly Gly Cys Cys Cys Cys Gly Thr Gly
2615 2620 2625
Ala Thr Cys Ala Ala Gly Ala Ala Gly Ala Thr Thr Ala Ala Gly
2630 2635 2640
Thr Ala Thr Thr Ala Cys Gly Gly Cys Ala Ala Cys Ala Ala Ala
2645 2650 2655
Cys Thr Gly Ala Ala Cys Gly Cys Cys Cys Ala Thr Cys Thr Gly
2660 2665 2670
Gly Ala Cys Ala Thr Cys Ala Cys Cys Gly Ala Cys Gly Ala Cys
2675 2680 2685
Thr Ala Cys Cys Cys Cys Ala Ala Cys Ala Gly Cys Ala Gly Ala
2690 2695 2700
Ala Ala Cys Ala Ala Gly Gly Thr Cys Gly Thr Gly Ala Ala Gly
2705 2710 2715
Cys Thr Gly Thr Cys Cys Cys Thr Gly Ala Ala Gly Cys Cys Cys
2720 2725 2730
Thr Ala Cys Ala Gly Ala Thr Thr Cys Gly Ala Cys Gly Thr Gly
2735 2740 2745
Thr Ala Cys Cys Thr Gly Gly Ala Cys Ala Ala Thr Gly Gly Cys
2750 2755 2760
Gly Thr Gly Thr Ala Cys Ala Ala Gly Thr Thr Cys Gly Thr Gly
2765 2770 2775
Ala Cys Cys Gly Thr Gly Ala Ala Gly Ala Ala Thr Cys Thr Gly
2780 2785 2790
Gly Ala Thr Gly Thr Gly Ala Thr Cys Ala Ala Ala Ala Ala Ala
2795 2800 2805
Gly Ala Ala Ala Ala Cys Thr Ala Cys Thr Ala Cys Gly Ala Ala
2810 2815 2820
Gly Thr Gly Ala Ala Thr Ala Gly Cys Ala Ala Gly Thr Gly Cys
2825 2830 2835
Thr Ala Thr Gly Ala Gly Gly Ala Ala Gly Cys Thr Ala Ala Gly
2840 2845 2850
Ala Ala Gly Cys Thr Gly Ala Ala Gly Ala Ala Gly Ala Thr Cys
2855 2860 2865
Ala Gly Cys Ala Ala Cys Cys Ala Gly Gly Cys Cys Gly Ala Gly
2870 2875 2880
Thr Thr Thr Ala Thr Cys Gly Cys Cys Thr Cys Cys Thr Thr Cys
2885 2890 2895
Thr Ala Cys Ala Ala Gly Ala Ala Cys Gly Ala Thr Cys Thr Gly
2900 2905 2910
Ala Thr Cys Ala Ala Gly Ala Thr Cys Ala Ala Cys Gly Gly Cys
2915 2920 2925
Gly Ala Gly Cys Thr Gly Thr Ala Thr Ala Gly Ala Gly Thr Gly
2930 2935 2940
Ala Thr Cys Gly Gly Cys Gly Thr Gly Ala Ala Cys Gly Ala Gly
2945 2950 2955
Gly Ala Cys Ala Ala Gly Ala Gly Gly Ala Ala Cys Ala Thr Cys
2960 2965 2970
Ala Thr Cys Gly Ala Gly Cys Thr Gly Thr Gly Ala Gly Ala Cys
2975 2980 2985
Gly Gly Gly Cys Cys Ala Thr Ala Cys Thr Cys Gly Thr Cys Thr
2990 2995 3000
Cys Gly Ala Ala Cys Ala Thr Gly Ala Thr Cys Gly Ala Cys Ala
3005 3010 3015
Thr Cys Ala Cys Cys Thr Ala Cys Cys Gly Cys Gly Ala Gly Thr
3020 3025 3030
Ala Cys Cys Thr Gly Gly Ala Ala Ala Ala Cys Ala Thr Gly Ala
3035 3040 3045
Ala Cys Gly Ala Cys Ala Ala Gly Ala Gly Gly Cys Cys Cys Cys
3050 3055 3060
Cys Cys Cys Ala Cys Ala Thr Cys Ala Thr Thr Ala Ala Gly Ala
3065 3070 3075
Cys Ala Ala Thr Cys Gly Cys Cys Thr Cys Cys Ala Ala Gly Ala
3080 3085 3090
Cys Cys Cys Ala Gly Ala Gly Cys Ala Thr Thr Ala Ala Gly Ala
3095 3100 3105
Ala Gly Thr Ala Cys Ala Gly Cys Ala Cys Ala Gly Ala Cys Ala
3110 3115 3120
Thr Thr Cys Thr Gly Gly Gly Cys Ala Ala Cys Cys Thr Gly Thr
3125 3130 3135
Ala Thr Gly Ala Ala Gly Thr Gly Ala Ala Ala Thr Cys Thr Ala
3140 3145 3150
Ala Gly Ala Ala Gly Cys Ala Cys Cys Cys Thr Cys Ala Gly Ala
3155 3160 3165
Thr Cys Ala Thr Cys Ala Ala Ala Ala Ala Gly Gly Gly Cys
3170 3175 3180
<210> 100
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 100
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaggggcg acagcagccc cagggagaac aggttcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 101
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 101
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaggggcg acaacgaccc caaggccaac aagatcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 102
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 102
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacgccga gaagaggaac accatcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 103
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 103
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacgacga cgccaagaac accctggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 104
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 104
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtggtgggcg tgaacaacga cagcgtgaac agggtggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 105
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 105
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtggtgggcg tgaacaacga caccaggaac gtggtggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 106
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 106
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtggtgggcg tgaacaacga cagcaggaac gtggtggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 107
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 107
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaggggcg acagcatgcc caggcagaac aagatcgaga tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 108
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 108
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaggggcg acgccatgcc cagggacaac aagatcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 109
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 109
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggca tcaacaacgg cgacaagaac ctggtggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 110
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 110
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaggggcg acaacaaccc caggcagaac atgatcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 111
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 111
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacaacga cagcaccaac agggtggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 112
<211> 3182
<212> PRT
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 112
Ala Thr Gly Ala Ala Gly Cys Gly Gly Ala Ala Cys Thr Ala Cys Ala
1 5 10 15
Thr Cys Cys Thr Gly Gly Gly Cys Cys Thr Gly Gly Cys Cys Ala Thr
20 25 30
Cys Gly Gly Cys Ala Thr Cys Ala Cys Cys Ala Gly Cys Gly Thr Gly
35 40 45
Gly Gly Cys Thr Ala Cys Gly Gly Cys Ala Thr Cys Ala Thr Cys Gly
50 55 60
Ala Cys Thr Ala Cys Gly Ala Gly Ala Cys Ala Cys Gly Gly Gly Ala
65 70 75 80
Cys Gly Thr Gly Ala Thr Cys Gly Ala Thr Gly Cys Cys Gly Gly Cys
85 90 95
Gly Thr Gly Cys Gly Gly Cys Thr Gly Thr Thr Cys Ala Ala Ala Gly
100 105 110
Ala Gly Gly Cys Cys Ala Ala Cys Gly Thr Gly Gly Ala Ala Ala Ala
115 120 125
Cys Ala Ala Cys Gly Ala Gly Gly Gly Cys Ala Gly Gly Cys Gly Gly
130 135 140
Ala Gly Cys Ala Ala Gly Ala Gly Ala Gly Gly Cys Gly Cys Cys Ala
145 150 155 160
Gly Ala Ala Gly Gly Cys Thr Gly Ala Ala Gly Cys Gly Gly Cys Gly
165 170 175
Gly Ala Gly Gly Cys Gly Gly Cys Ala Thr Ala Gly Ala Ala Thr Cys
180 185 190
Cys Ala Gly Ala Gly Ala Gly Thr Gly Ala Ala Gly Ala Ala Gly Cys
195 200 205
Thr Gly Cys Thr Gly Thr Thr Cys Gly Ala Cys Thr Ala Cys Ala Ala
210 215 220
Cys Cys Thr Gly Cys Thr Gly Ala Cys Cys Gly Ala Cys Cys Ala Cys
225 230 235 240
Ala Gly Cys Gly Ala Gly Cys Thr Gly Ala Gly Cys Gly Gly Cys Ala
245 250 255
Thr Cys Ala Ala Cys Cys Cys Cys Thr Ala Cys Gly Ala Gly Gly Cys
260 265 270
Cys Ala Gly Ala Gly Thr Gly Ala Ala Gly Gly Gly Cys Cys Thr Gly
275 280 285
Ala Gly Cys Cys Ala Gly Ala Ala Gly Cys Thr Gly Ala Gly Cys Gly
290 295 300
Ala Gly Gly Ala Ala Gly Ala Gly Thr Thr Cys Thr Cys Thr Gly Cys
305 310 315 320
Cys Gly Cys Cys Cys Thr Gly Cys Thr Gly Cys Ala Cys Cys Thr Gly
325 330 335
Gly Cys Cys Ala Ala Gly Ala Gly Ala Ala Gly Ala Gly Gly Cys Gly
340 345 350
Thr Gly Cys Ala Cys Ala Ala Cys Gly Thr Gly Ala Ala Cys Gly Ala
355 360 365
Gly Gly Thr Gly Gly Ala Ala Gly Ala Gly Gly Ala Cys Ala Cys Cys
370 375 380
Gly Gly Cys Ala Ala Cys Gly Ala Gly Cys Thr Gly Thr Cys Cys Ala
385 390 395 400
Cys Cys Ala Ala Ala Gly Ala Gly Cys Ala Gly Ala Thr Cys Ala Gly
405 410 415
Cys Cys Gly Gly Ala Ala Cys Ala Gly Cys Ala Ala Gly Gly Cys Cys
420 425 430
Cys Thr Gly Gly Ala Ala Gly Ala Gly Ala Ala Ala Thr Ala Cys Gly
435 440 445
Thr Gly Gly Cys Cys Gly Ala Ala Cys Thr Gly Cys Ala Gly Cys Thr
450 455 460
Gly Gly Ala Ala Cys Gly Gly Cys Thr Gly Ala Ala Gly Ala Ala Ala
465 470 475 480
Gly Ala Cys Gly Gly Cys Gly Ala Ala Gly Thr Gly Cys Gly Gly Gly
485 490 495
Gly Cys Ala Gly Cys Ala Thr Cys Ala Ala Cys Ala Gly Ala Thr Thr
500 505 510
Cys Ala Ala Gly Ala Cys Cys Ala Gly Cys Gly Ala Cys Thr Ala Cys
515 520 525
Gly Thr Gly Ala Ala Ala Gly Ala Ala Gly Cys Cys Ala Ala Ala Cys
530 535 540
Ala Gly Cys Thr Gly Cys Thr Gly Ala Ala Gly Gly Thr Gly Cys Ala
545 550 555 560
Gly Ala Ala Gly Gly Cys Cys Thr Ala Cys Cys Ala Cys Cys Ala Gly
565 570 575
Cys Thr Gly Gly Ala Cys Cys Ala Gly Ala Gly Cys Thr Thr Cys Ala
580 585 590
Thr Cys Gly Ala Cys Ala Cys Cys Thr Ala Cys Ala Thr Cys Gly Ala
595 600 605
Cys Cys Thr Gly Cys Thr Gly Gly Ala Ala Ala Cys Cys Cys Gly Gly
610 615 620
Cys Gly Gly Ala Cys Cys Thr Ala Cys Thr Ala Thr Gly Ala Gly Gly
625 630 635 640
Gly Ala Cys Cys Thr Gly Gly Cys Gly Ala Gly Gly Gly Cys Ala Gly
645 650 655
Cys Cys Cys Cys Thr Thr Cys Gly Gly Cys Thr Gly Gly Ala Ala Gly
660 665 670
Gly Ala Cys Ala Thr Cys Ala Ala Ala Gly Ala Ala Thr Gly Gly Thr
675 680 685
Ala Cys Gly Ala Gly Ala Thr Gly Cys Thr Gly Ala Thr Gly Gly Gly
690 695 700
Cys Cys Ala Cys Thr Gly Cys Ala Cys Cys Thr Ala Cys Thr Thr Cys
705 710 715 720
Cys Cys Cys Gly Ala Gly Gly Ala Ala Cys Thr Gly Cys Gly Gly Ala
725 730 735
Gly Cys Gly Thr Gly Ala Ala Gly Thr Ala Cys Gly Cys Cys Thr Ala
740 745 750
Cys Ala Ala Cys Gly Cys Cys Gly Ala Cys Cys Thr Gly Thr Ala Cys
755 760 765
Ala Ala Cys Gly Cys Cys Cys Thr Gly Ala Ala Cys Gly Ala Cys Cys
770 775 780
Thr Gly Ala Ala Cys Ala Ala Thr Cys Thr Cys Gly Thr Gly Ala Thr
785 790 795 800
Cys Ala Cys Cys Ala Gly Gly Gly Ala Cys Gly Ala Gly Ala Ala Cys
805 810 815
Gly Ala Gly Ala Ala Gly Cys Thr Gly Gly Ala Ala Thr Ala Thr Thr
820 825 830
Ala Cys Gly Ala Gly Ala Ala Gly Thr Thr Cys Cys Ala Gly Ala Thr
835 840 845
Cys Ala Thr Cys Gly Ala Gly Ala Ala Cys Gly Thr Gly Thr Thr Cys
850 855 860
Ala Ala Gly Cys Ala Gly Ala Ala Gly Ala Ala Gly Ala Ala Gly Cys
865 870 875 880
Cys Cys Ala Cys Cys Cys Thr Gly Ala Ala Gly Cys Ala Gly Ala Thr
885 890 895
Cys Gly Cys Cys Ala Ala Ala Gly Ala Ala Ala Thr Cys Cys Thr Cys
900 905 910
Gly Thr Gly Ala Ala Cys Gly Ala Ala Gly Ala Gly Gly Ala Thr Ala
915 920 925
Thr Thr Ala Ala Gly Gly Gly Cys Thr Ala Cys Ala Gly Ala Gly Thr
930 935 940
Gly Ala Cys Cys Ala Gly Cys Ala Cys Cys Gly Gly Cys Ala Ala Gly
945 950 955 960
Cys Cys Cys Gly Ala Gly Thr Thr Cys Ala Cys Cys Ala Ala Cys Cys
965 970 975
Thr Gly Ala Ala Gly Gly Thr Gly Thr Ala Cys Cys Ala Cys Gly Ala
980 985 990
Cys Ala Thr Cys Ala Ala Gly Gly Ala Cys Ala Thr Thr Ala Cys Cys
995 1000 1005
Gly Cys Cys Cys Gly Gly Ala Ala Ala Gly Ala Gly Ala Thr Thr
1010 1015 1020
Ala Thr Thr Gly Ala Gly Ala Ala Cys Gly Cys Cys Gly Ala Gly
1025 1030 1035
Cys Thr Gly Cys Thr Gly Gly Ala Thr Cys Ala Gly Ala Thr Thr
1040 1045 1050
Gly Cys Cys Ala Ala Gly Ala Thr Cys Cys Thr Gly Ala Cys Cys
1055 1060 1065
Ala Thr Cys Thr Ala Cys Cys Ala Gly Ala Gly Cys Ala Gly Cys
1070 1075 1080
Gly Ala Gly Gly Ala Cys Ala Thr Cys Cys Ala Gly Gly Ala Ala
1085 1090 1095
Gly Ala Ala Cys Thr Gly Ala Cys Cys Ala Ala Thr Cys Thr Gly
1100 1105 1110
Ala Ala Cys Thr Cys Cys Gly Ala Gly Cys Thr Gly Ala Cys Cys
1115 1120 1125
Cys Ala Gly Gly Ala Ala Gly Ala Gly Ala Thr Cys Gly Ala Gly
1130 1135 1140
Cys Ala Gly Ala Thr Cys Thr Cys Thr Ala Ala Thr Cys Thr Gly
1145 1150 1155
Ala Ala Gly Gly Gly Cys Thr Ala Thr Ala Cys Cys Gly Gly Cys
1160 1165 1170
Ala Cys Cys Cys Ala Cys Ala Ala Cys Cys Thr Gly Ala Gly Cys
1175 1180 1185
Cys Thr Gly Ala Ala Gly Gly Cys Cys Ala Thr Cys Ala Ala Cys
1190 1195 1200
Cys Thr Gly Ala Thr Cys Cys Thr Gly Gly Ala Cys Gly Ala Gly
1205 1210 1215
Cys Thr Gly Thr Gly Gly Cys Ala Cys Ala Cys Cys Ala Ala Cys
1220 1225 1230
Gly Ala Cys Ala Ala Cys Cys Ala Gly Ala Thr Cys Gly Cys Thr
1235 1240 1245
Ala Thr Cys Thr Thr Cys Ala Ala Cys Cys Gly Gly Cys Thr Gly
1250 1255 1260
Ala Ala Gly Cys Thr Gly Gly Thr Gly Cys Cys Cys Ala Ala Gly
1265 1270 1275
Ala Ala Gly Gly Thr Gly Gly Ala Cys Cys Thr Gly Thr Cys Cys
1280 1285 1290
Cys Ala Gly Cys Ala Gly Ala Ala Ala Gly Ala Gly Ala Thr Cys
1295 1300 1305
Cys Cys Cys Ala Cys Cys Ala Cys Cys Cys Thr Gly Gly Thr Gly
1310 1315 1320
Gly Ala Cys Gly Ala Cys Thr Thr Cys Ala Thr Cys Cys Thr Gly
1325 1330 1335
Ala Gly Cys Cys Cys Cys Gly Thr Cys Gly Thr Gly Ala Ala Gly
1340 1345 1350
Ala Gly Ala Ala Gly Cys Thr Thr Cys Ala Thr Cys Cys Ala Gly
1355 1360 1365
Ala Gly Cys Ala Thr Cys Ala Ala Ala Gly Thr Gly Ala Thr Cys
1370 1375 1380
Ala Ala Cys Gly Cys Cys Ala Thr Cys Ala Thr Cys Ala Ala Gly
1385 1390 1395
Ala Ala Gly Thr Ala Cys Gly Gly Cys Cys Thr Gly Cys Cys Cys
1400 1405 1410
Ala Ala Cys Gly Ala Cys Ala Thr Cys Ala Thr Thr Ala Thr Cys
1415 1420 1425
Gly Ala Gly Cys Thr Gly Gly Cys Cys Cys Gly Cys Gly Ala Gly
1430 1435 1440
Ala Ala Gly Ala Ala Cys Thr Cys Cys Ala Ala Gly Gly Ala Cys
1445 1450 1455
Gly Cys Cys Cys Ala Gly Ala Ala Ala Ala Thr Gly Ala Thr Cys
1460 1465 1470
Ala Ala Cys Gly Ala Gly Ala Thr Gly Cys Ala Gly Ala Ala Gly
1475 1480 1485
Cys Gly Gly Ala Ala Cys Cys Gly Gly Cys Ala Gly Ala Cys Cys
1490 1495 1500
Ala Ala Cys Gly Ala Gly Cys Gly Gly Ala Thr Cys Gly Ala Gly
1505 1510 1515
Gly Ala Ala Ala Thr Cys Ala Thr Cys Cys Gly Gly Ala Cys Cys
1520 1525 1530
Ala Cys Cys Gly Gly Cys Ala Ala Ala Gly Ala Gly Ala Ala Cys
1535 1540 1545
Gly Cys Cys Ala Ala Gly Thr Ala Cys Cys Thr Gly Ala Thr Cys
1550 1555 1560
Gly Ala Gly Ala Ala Gly Ala Thr Cys Ala Ala Gly Cys Thr Gly
1565 1570 1575
Cys Ala Cys Gly Ala Cys Ala Thr Gly Cys Ala Gly Gly Ala Ala
1580 1585 1590
Gly Gly Cys Ala Ala Gly Thr Gly Cys Cys Thr Gly Thr Ala Cys
1595 1600 1605
Ala Gly Cys Cys Thr Gly Gly Ala Ala Gly Cys Cys Ala Thr Cys
1610 1615 1620
Cys Cys Thr Cys Thr Gly Gly Ala Ala Gly Ala Thr Cys Thr Gly
1625 1630 1635
Cys Thr Gly Ala Ala Cys Ala Ala Cys Cys Cys Cys Thr Thr Cys
1640 1645 1650
Ala Ala Cys Thr Ala Thr Gly Ala Gly Gly Thr Gly Gly Ala Cys
1655 1660 1665
Cys Ala Cys Ala Thr Cys Ala Thr Cys Cys Cys Cys Ala Gly Ala
1670 1675 1680
Ala Gly Cys Gly Thr Gly Thr Cys Cys Thr Thr Cys Gly Ala Cys
1685 1690 1695
Ala Ala Cys Ala Gly Cys Thr Thr Cys Ala Ala Cys Ala Ala Cys
1700 1705 1710
Ala Ala Gly Gly Thr Gly Cys Thr Cys Gly Thr Gly Ala Ala Gly
1715 1720 1725
Cys Ala Gly Gly Ala Ala Gly Ala Ala Gly Cys Cys Ala Gly Cys
1730 1735 1740
Ala Ala Gly Ala Ala Gly Gly Gly Cys Ala Ala Cys Cys Gly Gly
1745 1750 1755
Ala Cys Cys Cys Cys Ala Thr Thr Cys Cys Ala Gly Thr Ala Cys
1760 1765 1770
Cys Thr Gly Ala Gly Cys Ala Gly Cys Ala Gly Cys Gly Ala Cys
1775 1780 1785
Ala Gly Cys Ala Ala Gly Ala Thr Cys Ala Gly Cys Thr Ala Cys
1790 1795 1800
Gly Ala Ala Ala Cys Cys Thr Thr Cys Ala Ala Gly Ala Ala Gly
1805 1810 1815
Cys Ala Cys Ala Thr Cys Cys Thr Gly Ala Ala Thr Cys Thr Gly
1820 1825 1830
Gly Cys Cys Ala Ala Gly Gly Gly Cys Ala Ala Gly Gly Gly Cys
1835 1840 1845
Ala Gly Ala Ala Thr Cys Ala Gly Cys Ala Ala Gly Ala Cys Cys
1850 1855 1860
Ala Ala Gly Ala Ala Ala Gly Ala Gly Thr Ala Thr Cys Thr Gly
1865 1870 1875
Cys Thr Gly Gly Ala Ala Gly Ala Ala Cys Gly Gly Gly Ala Cys
1880 1885 1890
Ala Thr Cys Ala Ala Cys Ala Gly Gly Thr Thr Cys Thr Cys Cys
1895 1900 1905
Gly Thr Gly Cys Ala Gly Ala Ala Ala Gly Ala Cys Thr Thr Cys
1910 1915 1920
Ala Thr Cys Ala Ala Cys Cys Gly Gly Ala Ala Cys Cys Thr Gly
1925 1930 1935
Gly Thr Gly Gly Ala Thr Ala Cys Cys Ala Gly Ala Thr Ala Cys
1940 1945 1950
Gly Cys Cys Ala Cys Cys Ala Gly Ala Gly Gly Cys Cys Thr Gly
1955 1960 1965
Ala Thr Gly Ala Ala Cys Cys Thr Gly Cys Thr Gly Cys Gly Gly
1970 1975 1980
Ala Gly Cys Thr Ala Cys Thr Thr Cys Ala Gly Ala Gly Thr Gly
1985 1990 1995
Ala Ala Cys Ala Ala Cys Cys Thr Gly Gly Ala Cys Gly Thr Gly
2000 2005 2010
Ala Ala Ala Gly Thr Gly Ala Ala Gly Thr Cys Cys Ala Thr Cys
2015 2020 2025
Ala Ala Thr Gly Gly Cys Gly Gly Cys Thr Thr Cys Ala Cys Cys
2030 2035 2040
Ala Gly Cys Thr Thr Thr Cys Thr Gly Cys Gly Gly Cys Gly Gly
2045 2050 2055
Ala Ala Gly Thr Gly Gly Ala Ala Gly Thr Thr Thr Ala Ala Gly
2060 2065 2070
Ala Ala Ala Gly Ala Gly Cys Gly Gly Ala Ala Cys Ala Ala Gly
2075 2080 2085
Gly Gly Gly Thr Ala Cys Ala Ala Gly Cys Ala Cys Cys Ala Cys
2090 2095 2100
Gly Cys Cys Gly Ala Gly Gly Ala Cys Gly Cys Cys Cys Thr Gly
2105 2110 2115
Ala Thr Cys Ala Thr Thr Gly Cys Cys Ala Ala Cys Gly Cys Cys
2120 2125 2130
Gly Ala Thr Thr Thr Cys Ala Thr Cys Thr Thr Cys Ala Ala Ala
2135 2140 2145
Gly Ala Gly Thr Gly Gly Ala Ala Gly Ala Ala Ala Cys Thr Gly
2150 2155 2160
Gly Ala Cys Ala Ala Gly Gly Cys Cys Ala Ala Ala Ala Ala Ala
2165 2170 2175
Gly Thr Gly Ala Thr Gly Gly Ala Ala Ala Ala Cys Cys Ala Gly
2180 2185 2190
Ala Thr Gly Thr Thr Cys Gly Ala Gly Gly Ala Ala Ala Ala Gly
2195 2200 2205
Cys Ala Gly Gly Cys Cys Gly Ala Gly Ala Gly Cys Ala Thr Gly
2210 2215 2220
Cys Cys Cys Gly Ala Gly Ala Thr Cys Gly Ala Ala Ala Cys Cys
2225 2230 2235
Gly Ala Gly Cys Ala Gly Gly Ala Gly Thr Ala Cys Ala Ala Ala
2240 2245 2250
Gly Ala Gly Ala Thr Cys Thr Thr Cys Ala Thr Cys Ala Cys Cys
2255 2260 2265
Cys Cys Cys Cys Ala Cys Cys Ala Gly Ala Thr Cys Ala Ala Gly
2270 2275 2280
Cys Ala Cys Ala Thr Thr Ala Ala Gly Gly Ala Cys Thr Thr Cys
2285 2290 2295
Ala Ala Gly Gly Ala Cys Thr Ala Cys Ala Ala Gly Thr Ala Cys
2300 2305 2310
Ala Gly Cys Cys Ala Cys Cys Gly Gly Gly Thr Gly Gly Ala Cys
2315 2320 2325
Ala Ala Gly Ala Ala Gly Cys Cys Thr Ala Ala Thr Ala Gly Ala
2330 2335 2340
Ala Ala Gly Cys Thr Gly Ala Thr Thr Ala Ala Cys Gly Ala Cys
2345 2350 2355
Ala Cys Cys Cys Thr Gly Thr Ala Cys Thr Cys Cys Ala Cys Cys
2360 2365 2370
Cys Gly Gly Ala Ala Gly Gly Ala Cys Gly Ala Cys Ala Ala Gly
2375 2380 2385
Gly Gly Cys Ala Ala Cys Ala Cys Cys Cys Thr Gly Ala Thr Cys
2390 2395 2400
Gly Thr Gly Ala Ala Cys Ala Ala Thr Cys Thr Gly Ala Ala Cys
2405 2410 2415
Gly Gly Cys Cys Thr Gly Thr Ala Cys Gly Ala Cys Ala Ala Gly
2420 2425 2430
Gly Ala Cys Ala Ala Thr Gly Ala Cys Ala Ala Gly Cys Thr Gly
2435 2440 2445
Ala Ala Ala Ala Ala Gly Cys Thr Gly Ala Thr Cys Ala Ala Cys
2450 2455 2460
Ala Ala Gly Ala Gly Cys Cys Cys Cys Gly Ala Ala Ala Ala Gly
2465 2470 2475
Cys Thr Gly Cys Thr Gly Ala Thr Gly Thr Ala Cys Cys Ala Cys
2480 2485 2490
Cys Ala Cys Gly Ala Cys Cys Cys Cys Cys Ala Gly Ala Cys Cys
2495 2500 2505
Thr Ala Cys Cys Ala Gly Ala Ala Ala Cys Thr Gly Ala Ala Gly
2510 2515 2520
Cys Thr Gly Ala Thr Thr Ala Thr Gly Gly Ala Ala Cys Ala Gly
2525 2530 2535
Thr Ala Cys Gly Gly Cys Gly Ala Cys Gly Ala Gly Ala Ala Gly
2540 2545 2550
Ala Ala Thr Cys Cys Cys Cys Thr Gly Thr Ala Cys Ala Ala Gly
2555 2560 2565
Thr Ala Cys Thr Ala Cys Gly Ala Gly Gly Ala Ala Ala Cys Cys
2570 2575 2580
Gly Gly Gly Ala Ala Cys Thr Ala Cys Cys Thr Gly Ala Cys Cys
2585 2590 2595
Ala Ala Gly Thr Ala Cys Thr Cys Cys Ala Ala Ala Ala Ala Gly
2600 2605 2610
Gly Ala Cys Ala Ala Cys Gly Gly Cys Cys Cys Cys Gly Thr Gly
2615 2620 2625
Ala Thr Cys Ala Ala Gly Ala Ala Gly Ala Thr Thr Ala Ala Gly
2630 2635 2640
Thr Ala Thr Thr Ala Cys Gly Gly Cys Ala Ala Cys Ala Ala Ala
2645 2650 2655
Cys Thr Gly Ala Ala Cys Gly Cys Cys Cys Ala Thr Cys Thr Gly
2660 2665 2670
Gly Ala Cys Ala Thr Cys Ala Cys Cys Gly Ala Cys Gly Ala Cys
2675 2680 2685
Thr Ala Cys Cys Cys Cys Ala Ala Cys Ala Gly Cys Ala Gly Ala
2690 2695 2700
Ala Ala Cys Ala Ala Gly Gly Thr Cys Gly Thr Gly Ala Ala Gly
2705 2710 2715
Cys Thr Gly Thr Cys Cys Cys Thr Gly Ala Ala Gly Cys Cys Cys
2720 2725 2730
Thr Ala Cys Ala Gly Ala Thr Thr Cys Gly Ala Cys Gly Thr Gly
2735 2740 2745
Thr Ala Cys Cys Thr Gly Gly Ala Cys Ala Ala Thr Gly Gly Cys
2750 2755 2760
Gly Thr Gly Thr Ala Cys Ala Ala Gly Thr Thr Cys Gly Thr Gly
2765 2770 2775
Ala Cys Cys Gly Thr Gly Ala Ala Gly Ala Ala Thr Cys Thr Gly
2780 2785 2790
Gly Ala Thr Gly Thr Gly Ala Thr Cys Ala Ala Ala Ala Ala Ala
2795 2800 2805
Gly Ala Ala Ala Ala Cys Thr Ala Cys Thr Ala Cys Gly Ala Ala
2810 2815 2820
Gly Thr Gly Ala Ala Thr Ala Gly Cys Ala Ala Gly Thr Gly Cys
2825 2830 2835
Thr Ala Thr Gly Ala Gly Gly Ala Ala Gly Cys Thr Ala Ala Gly
2840 2845 2850
Ala Ala Gly Cys Thr Gly Ala Ala Gly Ala Ala Gly Ala Thr Cys
2855 2860 2865
Ala Gly Cys Ala Ala Cys Cys Ala Gly Gly Cys Cys Gly Ala Gly
2870 2875 2880
Thr Thr Thr Ala Thr Cys Gly Cys Cys Thr Cys Cys Thr Thr Cys
2885 2890 2895
Thr Ala Cys Ala Ala Gly Ala Ala Cys Gly Ala Thr Cys Thr Gly
2900 2905 2910
Ala Thr Cys Ala Ala Gly Ala Thr Cys Ala Ala Cys Gly Gly Cys
2915 2920 2925
Gly Ala Gly Cys Thr Gly Thr Ala Thr Ala Gly Ala Gly Thr Gly
2930 2935 2940
Ala Gly Gly Gly Gly Cys Gly Ala Cys Ala Ala Cys Gly Ala Cys
2945 2950 2955
Cys Cys Cys Ala Gly Gly Ala Gly Gly Ala Gly Cys Ala Cys Cys
2960 2965 2970
Ala Thr Cys Gly Ala Gly Cys Thr Gly Thr Gly Ala Gly Ala Cys
2975 2980 2985
Gly Gly Gly Cys Cys Ala Thr Ala Cys Thr Cys Gly Thr Cys Thr
2990 2995 3000
Cys Gly Ala Ala Cys Ala Thr Gly Ala Thr Cys Gly Ala Cys Ala
3005 3010 3015
Thr Cys Ala Cys Cys Thr Ala Cys Cys Gly Cys Gly Ala Gly Thr
3020 3025 3030
Ala Cys Cys Thr Gly Gly Ala Ala Ala Ala Cys Ala Thr Gly Ala
3035 3040 3045
Ala Cys Gly Ala Cys Ala Ala Gly Ala Gly Gly Cys Cys Cys Cys
3050 3055 3060
Cys Cys Cys Ala Cys Ala Thr Cys Ala Thr Thr Ala Ala Gly Ala
3065 3070 3075
Cys Ala Ala Thr Cys Gly Cys Cys Thr Cys Cys Ala Ala Gly Ala
3080 3085 3090
Cys Cys Cys Ala Gly Ala Gly Cys Ala Thr Thr Ala Ala Gly Ala
3095 3100 3105
Ala Gly Thr Ala Cys Ala Gly Cys Ala Cys Ala Gly Ala Cys Ala
3110 3115 3120
Thr Thr Cys Thr Gly Gly Gly Cys Ala Ala Cys Cys Thr Gly Thr
3125 3130 3135
Ala Thr Gly Ala Ala Gly Thr Gly Ala Ala Ala Thr Cys Thr Ala
3140 3145 3150
Ala Gly Ala Ala Gly Cys Ala Cys Cys Cys Thr Cys Ala Gly Ala
3155 3160 3165
Thr Cys Ala Thr Cys Ala Ala Ala Ala Ala Gly Gly Gly Cys
3170 3175 3180
<210> 113
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 113
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaggggcg acaacaaccc caggcagaac aagctggagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 114
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 114
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgaccgcca ccaacaacga caagaagaac atgatcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 115
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 115
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacaacaa caggctgaac aagatcgagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 116
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 116
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgttcagcga cgccggcaac ctgctggagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 117
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 117
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg acaacaaccc caggaacaac gtgatcgagg tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 118
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 118
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacaacaa cctgctgaac aagatcgaag tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 119
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 119
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaactcgga cctgctgaac cggatcgaag tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 120
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 120
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacaacga cctgctgaac aagatcgaag tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 121
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 121
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacaacag caccaggaac aaggtggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 122
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 122
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacaacag caccaggaac ctggtggagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 123
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 123
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacagcga cgacaggaac aagatcgagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 124
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 124
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacagcga cgacaggaac ctgatcgagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 125
<211> 3182
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 125
atgaagcgga actacatcct gggcctggcc atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaagcc 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacagcga cgacaggaac aggatcgagc tgtgagacgg gccatactcg 3000
tctcgaacat gatcgacatc acctaccgcg agtacctgga aaacatgaac gacaagaggc 3060
ccccccacat cattaagaca atcgcctcca agacccagag cattaagaag tacagcacag 3120
acattctggg caacctgtat gaagtgaaat ctaagaagca ccctcagatc atcaaaaagg 3180
gc 3182
<210> 126
<211> 82
<212> DNA
<213> Artificial
<220>
<223> 野生型gRNA骨架序列
<400> 126
gttttagtac tctggaaaca gaatctacta aaacaaggca aaatgccgtg tttatctcgt 60
caacttgttg gcgagatttt tt 82
<210> 127
<211> 82
<212> DNA
<213> Artificial
<220>
<223> 第二核酸分子序列
<400> 127
gtcttagtac tctggaaaca gaatctacta agacaaggca aaatgccgtg tttatctcgt 60
caacttgttg gcgagatttt tt 82
<210> 128
<211> 83
<212> DNA
<213> Artificial
<220>
<223> 第二核酸分子序列
<400> 128
gttatagtac tctggaaaca gaatctacta taacaaggca aaatgccgtg tttatctcgt 60
caacttgttg gcgagatttt ttt 83
<210> 129
<211> 83
<212> DNA
<213> Artificial
<220>
<223> 第二核酸分子序列
<400> 129
gttccggtac tctggaaaca gaatctaccg gaacaaggca aaatgccgtg tttatctcgt 60
caacttgttg gcgagatttt ttt 83
<210> 130
<211> 981
<212> PRT
<213> Artificial
<220>
<223> 框架区序列
<400> 130
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Ala Lys Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Ala Arg Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val
980
<210> 131
<211> 1053
<212> PRT
<213> Artificial
<220>
<223> Cas9蛋白突变体序列
<400> 131
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Ala Lys Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Ala Arg Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Ser Asp Asp Arg Asn Arg Ile
980 985 990
Glu Leu Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 132
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> 编码Cas9蛋白突变体的核酸序列
<400> 132
atgaagcgga actacatcct gggcctggac atcggcatca ccagcgtggg ctacggcatc 60
atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac 120
gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg 180
cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac 240
agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg 300
agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac 360
gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg 420
aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa 480
gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc 540
aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc 600
tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc 660
ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc 720
cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac 780
gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag 840
ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc 900
aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag 960
cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag 1020
attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc 1080
agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc 1140
gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc 1200
aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg 1260
ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg 1320
gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg 1380
atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc 1440
gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaacgccaag 1500
accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg 1560
atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc 1620
atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc 1680
agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaaaac 1740
agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc 1800
tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag 1860
accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac 1920
ttcatcaacc ggaacctggt ggataccaga tacgccaccg cccggctgat gaacctgctg 1980
cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc 2040
accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac 2100
cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa 2160
ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc 2280
aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat 2340
agaaagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg 2400
atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc 2460
aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg 2520
aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa 2580
accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt 2640
aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc 2700
agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat 2760
ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac 2820
gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc 2880
gagtttatcg cctccttcta caagaacgat ctgatcaaga tcaacggcga gctgtataga 2940
gtgatcggcg tgaacagcga cgaccggaac cggatcgaag tgctgatgat cgacatcacc 3000
taccgcgagt acctggaaaa catgaacgac aagaggcccc cccacatcat taagacaatc 3060
gcctccaaga cccagagcat taagaagtac agcacagaca ttctgggcaa cctgtatgaa 3120
gtgaaatcta agaagcaccc tcagatcatc aaaaagggc 3159

Claims (14)

1.一种Cas9蛋白突变体,其特征在于,其具有:
框架区;和
PAM识别区,所述PAM识别区识别下列核酸序列的至少之一:
5’-NNNRRT-3’,N=A、T、G或C、R=A或G;
5’-NNNRRN-3’,N=A、T、G或C、R=A或G;
5’-NNNRCN-3’,N=A、T、G或C、R=A或G;
5’-NNNRTN-3’,N=A、T、G或C、R=A或G;
5’-NNNCAA-3’,N=A、T、G或C;
5’-NNNCAT-3’,N=A、T、G或C;
5’-NNNCGT-3’,N=A、T、G或C;
5’-NNNCGC-3’,N=A、T、G或C;
5’-NNNGTN-3’,N=A、T、G或C;
5’-NNNTCN-3’,N=A、T、G或C;
5’-NNNTTC-3’,N=A、T、G或C;
5’-NNNTTG-3’,N=A、T、G或C;
5’-NNNTTT-3’,N=A、T、G或C,
其中,相对于saCas9,所述PAM识别区与982IGVNNDLLNRIEV 994相比具有下列突变的至少之一:
第982位突变为T、K、R或L,
第983位突变为A、C或S,
第984位突变为T、D,
第985位突变为F、S、A、N,
第986位突变为E、D、H、A、M,
第987位突变为S、G、N、S、D、E、P,
第988位突变为D、K、T、S、T、D、K、R、E、A,
第989位突变为R、A、N、Q、G、E、T、K、S、G、H、V,
第990位突变为S,
第991位突变为I、V、L、K、T、M,
第992位突变为V、L,
第993位突变为Q,
第994位突变为L、M、C、I、A;
其中,所述PAM识别区的氨基酸序列如SEQ ID NO:3~SEQ ID NO:43所示;所述框架区的氨基酸序列如SEQ ID NO:1~2、130所示;所述Cas9蛋白突变体的氨基酸序列如SEQ IDNO:44~84、131所示。
2.一种核酸,其特征在于,所述核酸编码权利要求1所述的Cas9蛋白突变体。
3.根据权利要求2所述的核酸,其特征在于,所述核酸具有SEQ ID NO:85~125、132任一所述的核苷酸序列。
4.一种试剂盒,其特征在于,包括:
第一核酸分子,所述第一核酸分子编码权利要求1所述的Cas9蛋白突变体;以及
第二核酸分子,所述第二核酸分子编码gRNA。
5.根据权利要求4所述的试剂盒,其特征在于,所述第一核酸分子具有SEQ ID NO:85~125、132任一所述的核苷酸序列。
6.根据权利要求4所述的试剂盒,其特征在于,所述第二核酸分子是SEQ ID NO:127~129任一所述的核苷酸序列。
7.根据权利要求4所述的试剂盒,其特征在于,所述第一核酸分子、第二核酸分子负载在同一表达载体上。
8.根据权利要求7所述的试剂盒,其特征在于,所述同一表达载体为腺病毒载体。
9.一种对细胞进行基因改造的方法,其特征在于,将第一核酸分子和第二核酸分子引入待改造的细胞中,所述第一核酸分子和所述第二核酸分子是如权利要求4~8任一项所述的,所述方法用于非疾病诊断或治疗目的。
10.根据权利要求9所述的方法,其特征在于,所述Cas9蛋白突变体的PAM识别区序列和所述gRNA序列是基于待改造的基因序列确定的。
11.根据权利要求9所述的方法,其特征在于,所述PAM识别区序列和所述gRNA的序列是基于下列关系确定的:
Figure FDA0003346642420000021
Figure FDA0003346642420000031
Figure FDA0003346642420000041
Figure FDA0003346642420000051
Figure FDA0003346642420000061
Figure FDA0003346642420000071
Figure FDA0003346642420000081
Figure FDA0003346642420000091
Figure FDA0003346642420000101
Figure FDA0003346642420000111
Figure FDA0003346642420000121
12.根据权利要求9所述的方法,其特征在于,所述基因改造包括对预定位点进行基因敲除或表达调控。
13.一种细胞,其特征在于,是根据权利要求9~12任一项所述的方法获得的。
14.根据权利要求13所述的细胞,其特征在于,所述细胞为动物细胞或微生物细胞。
CN201810731984.9A 2018-07-05 2018-07-05 基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的识别 Active CN110684755B (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201810731984.9A CN110684755B (zh) 2018-07-05 2018-07-05 基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的识别
PCT/CN2019/094585 WO2020007325A1 (en) 2018-07-05 2019-07-03 Cas9 variants and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810731984.9A CN110684755B (zh) 2018-07-05 2018-07-05 基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的识别

Publications (2)

Publication Number Publication Date
CN110684755A CN110684755A (zh) 2020-01-14
CN110684755B true CN110684755B (zh) 2021-12-31

Family

ID=69060169

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810731984.9A Active CN110684755B (zh) 2018-07-05 2018-07-05 基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的识别

Country Status (2)

Country Link
CN (1) CN110684755B (zh)
WO (1) WO2020007325A1 (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111718954B (zh) * 2020-06-29 2021-12-31 合肥戬谷生物科技有限公司 一种基因组编辑工具及其应用
CN116004572A (zh) * 2021-02-05 2023-04-25 山东舜丰生物科技有限公司 Crispr酶和系统以及应用
EP4144841A1 (en) * 2021-09-07 2023-03-08 Bayer AG Novel small rna programmable endonuclease systems with impoved pam specificity and uses thereof
CN117866926A (zh) * 2024-03-07 2024-04-12 珠海舒桐医疗科技有限公司 一种CRISPR-FrCas9蛋白突变体及应用

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016205759A1 (en) * 2015-06-18 2016-12-22 The Broad Institute Inc. Engineering and optimization of systems, methods, enzymes and guide scaffolds of cas9 orthologs and variants for sequence manipulation
CN107236739A (zh) * 2017-06-12 2017-10-10 上海捷易生物科技有限公司 CRISPR/SaCas9特异性敲除人CXCR4基因的方法
CN107532161A (zh) * 2015-03-03 2018-01-02 通用医疗公司 具有改变的PAM特异性的工程化CRISPR‑Cas9核酸酶

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2752834C2 (ru) * 2015-06-18 2021-08-09 Те Брод Инститьют, Инк. Мутации фермента crispr, уменьшающие нецелевые эффекты
IL294014B1 (en) * 2015-10-23 2024-03-01 Harvard College Nucleobase editors and their uses
CN107012250B (zh) * 2017-05-16 2021-01-29 上海交通大学 一种适用于CRISPR/Cas9系统的基因组DNA片段编辑精准度的分析方法及应用

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107532161A (zh) * 2015-03-03 2018-01-02 通用医疗公司 具有改变的PAM特异性的工程化CRISPR‑Cas9核酸酶
WO2016205759A1 (en) * 2015-06-18 2016-12-22 The Broad Institute Inc. Engineering and optimization of systems, methods, enzymes and guide scaffolds of cas9 orthologs and variants for sequence manipulation
CN107236739A (zh) * 2017-06-12 2017-10-10 上海捷易生物科技有限公司 CRISPR/SaCas9特异性敲除人CXCR4基因的方法

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Broadening Staphylococcus aureus Cas9 Targeting Range by Modifying PAM Recognition;Benjamin P.Kleinstiver et al.;《Nat Biotechnol》;20151231;第33卷(第12期);第1293-1298页 *
CRISPR/Cas9技术存在的问题及其改进措施的研究进展;袁伟曦等;《生物技术通报》;20171231;第33卷(第4期);第70-77页 *
Crystal structure of Staphylococcus aureus Cas9;Hiroshi Nishimasu et al.;《Cell》;20150827;第162卷(第5期);摘要,第8页倒数第1段,图5-6 *
Engineer chimeric Cas9 to expand PAM recognition based on evolutionary information;Dacheng Maet al.;《NATURE COMMUNICATIONS》;20190204;第10卷;第1-9页 *

Also Published As

Publication number Publication date
CN110684755A (zh) 2020-01-14
WO2020007325A1 (en) 2020-01-09

Similar Documents

Publication Publication Date Title
CN110684755B (zh) 基于进化信息构建嵌合SaCas9用于增强和扩展PAM位点的识别
US20240093241A1 (en) Crispr enabled multiplexed genome engineering
AU2017280353B2 (en) Methods for generating barcoded combinatorial libraries
CN106957831B (zh) 一种Cas9核酸酶K918A及其用途
CN106967697B (zh) 一种Cas9核酸酶G915F及其用途
CN106939303B (zh) 一种Cas9核酸酶R919P及其用途
CN106957830B (zh) 一种Cas9核酸酶ΔF916及其用途
Buchholz et al. Improved properties of FLP recombinase evolved by cycling mutagenesis
JP2019525755A (ja) ゲノム編集
WO2016135507A1 (en) Nucleic acid editing systems
WO2011053957A9 (en) Compositions and methods for the regulation of multiple genes of interest in a cell
EA020657B1 (ru) Специализированная многосайтовая комбинаторная сборка
JP2004507243A (ja) 大量特異的変異導入方法
CA2573023A1 (en) Generation of recombinant genes in bacteriophages
CA3206795A1 (en) Methods and systems for generating nucleic acid diversity
WO2023102176A1 (en) Crispr-associated transposases and methods of use thereof
EP1838851B1 (en) Polypeptide mutagenesis method
WO2022155445A1 (en) Non-naturally occurring host cells for enhanced plant growth
JP5246904B2 (ja) 外来遺伝子導入用ベクター及び外来遺伝子が導入されたベクターの製造方法
Hansson et al. [28] Use of chimeras generated by DNA shuffling: Probing structure-function relationships among glutathione transferases
CN114026226A (zh) 靶特异性crispr突变体
WO2024017189A1 (en) Tnpb-based genome editor
Fauser et al. Systematic Development of Reprogrammed Modular Integrases Enables Precise Genomic Integration of Large DNA Sequences
KR20240049267A (ko) Dna 절단 활성의 증진을 나타내는 광범위한 스캐닝 돌연변이 유발에 의해 발견된 스트렙토코커스 피오게네스 cas9에서의 신규한 돌연변이
CA3163369A1 (en) Variant cas9

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant