KR20220145438A - An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof - Google Patents

An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof Download PDF

Info

Publication number
KR20220145438A
KR20220145438A KR1020210051552A KR20210051552A KR20220145438A KR 20220145438 A KR20220145438 A KR 20220145438A KR 1020210051552 A KR1020210051552 A KR 1020210051552A KR 20210051552 A KR20210051552 A KR 20210051552A KR 20220145438 A KR20220145438 A KR 20220145438A
Authority
KR
South Korea
Prior art keywords
seq
sequence
engineered
cas12f1
region
Prior art date
Application number
KR1020210051552A
Other languages
Korean (ko)
Inventor
김용삼
김도연
Original Assignee
주식회사 진코어
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 진코어 filed Critical 주식회사 진코어
Priority to KR1020210051552A priority Critical patent/KR20220145438A/en
Priority to US18/030,624 priority patent/US20240254479A1/en
Priority to JP2023521464A priority patent/JP2023544817A/en
Priority to EP21878063.3A priority patent/EP4227411A1/en
Priority to AU2021357377A priority patent/AU2021357377A1/en
Priority to PCT/KR2021/013923 priority patent/WO2022075813A1/en
Priority to CN202180082426.4A priority patent/CN116806261A/en
Priority to CA3198429A priority patent/CA3198429A1/en
Publication of KR20220145438A publication Critical patent/KR20220145438A/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Mycology (AREA)
  • Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)

Abstract

In the present invention, an engineered Cas12f1 guide RNA for increasing an intracellular gene editing activity of a CRISPR/Cas12f1 system is provided by overcoming limitations of a prior art. The engineered Cas12f1 guide RNA is a modification of a part of a structure of a guide RNA found in nature. In the engineered Cas12f1 guide RNA, at least a part of a scaffold region having a role to interact with Cas12f1 protein is modified. The engineered scaffold region is different from a scaffold region of the guide RNA found in nature. The engineered guide RNA comprises the engineered scaffold region and a spacer.

Description

CRISPR/Cas12f1 시스템 효율화를 위한 엔지니어링 된 가이드 RNA 및 그 용도{An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof}An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof

본 명세서에서는 CRISPR/Cas 시스템, 특히 CRISPR/Cas12f1 시스템을 유전자 편집에 사용하는 분야에 대한 기술을 개시한다.Disclosed herein is a technology for the field of using the CRISPR/Cas system, particularly the CRISPR/Cas12f1 system, for gene editing.

CRISPR/Cas12f1 시스템은 Class 2, Type V로 분류되는 CRISPR/Cas 시스템이다. 선행연구(Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018))에서 최초로 고균 유래의 CRISPR/Cas 시스템인 CRISPR/Cas14 시스템에 대해 밝혀졌다. 그 후 후속 연구(Karvelis et al., Nucleic Acids Research, Vol. 48, No. 9 5017 (2020))에서 상기 CRISPR/Cas14 시스템을 CRISPR/Cas12f 시스템으로 분류하였다. CRISPR/Cas12f1 시스템은 상기 Class 2, Type V로 분류되는 CRISPR/Cas 시스템 중 하나의 Subtype인 V-F1 시스템에 속하며, 이는 Cas14a를 이펙터 단백질로 가지는 CRISPR/Cas14a 시스템을 포함한다. 상기 CRISPR/Cas12f1 시스템은 CRISPR/Cas9 시스템에 비하여 이펙터 단백질의 크기가 현저히 작다는 특징이 있다. 다만, 선행 연구(Harington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018), US 2020/0190494 A1)에서 밝혀진 바로는, 상기 CRISPR/Cas12f1 시스템, 특히 CRISPR/Cas14a 시스템은 단일가닥 DNA의 절단 능력은 보이나, 이중가닥 DNA에 대해서는 절단 활성이 없거나, 극히 낮아 유전자 편집 기술에 응용하는 데 한계가 있었다.The CRISPR/Cas12f1 system is a CRISPR/Cas system classified as Class 2, Type V. In a previous study (Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018)), the CRISPR/Cas14 system, a CRISPR/Cas system derived from archaebacteria for the first time, was revealed. Then, in a subsequent study (Karvelis et al., Nucleic Acids Research, Vol. 48, No. 9 5017 (2020)), the CRISPR/Cas14 system was classified as a CRISPR/Cas12f system. The CRISPR/Cas12f1 system belongs to the V-F1 system, which is a subtype of one of the CRISPR/Cas systems classified into Class 2 and Type V, and includes the CRISPR/Cas14a system having Cas14a as an effector protein. The CRISPR/Cas12f1 system is characterized in that the size of the effector protein is significantly smaller than that of the CRISPR/Cas9 system. However, as revealed in previous studies (Harington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018), US 2020/0190494 A1), the CRISPR/Cas12f1 system, in particular CRISPR/ The Cas14a system has the ability to cut single-stranded DNA, but has no or extremely low cleavage activity for double-stranded DNA, limiting its application to gene editing technology.

계속된 연구로 최근 선행문헌(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))에서는 CRISPR/Cas12f1 시스템에서 Cas12f1 단백질이 이량체(dimer)를 형성한다는 사실, 및 가이드 RNA의 구조에 대해 밝혔다. 상기 선행문헌에 의해 가이드 RNA 부분 중 Cas12f1 단백질과 직접적으로 상호작용하지 않는, 이른바 disordered region이 있다는 것이 밝혀졌으며, 또 다른 선행문헌(Xiao et al., Structural basis for the dimerization-dependent CRISPR-Cas12f nuclease, bioRxiv (2020))에서는 상기 disordered region 부분을 제거하고 in vitro에서 이중가닥 DNA 절단 효율을 살핀 바 있다. 하지만, 상기 선행 연구들은 1) 세포 내 유전자 편집 활성(예를 들어, indel 발생 효율)을 보이거나, 이를 높이는 방안에 대한 연구는 아니며, 2) disordered region을 제거한 실험은 있지만, 오히려 절단 활성이 낮아지는 것으로 보고되었으며, 3) 더욱이 상기 실험은 in vitro에서 진행한 실험으로, 세포 내 유전자 편집 활성을 나타내거나, 효율을 높이기 위한 변형이 무엇인지 밝히는데는 실패하였다.As a continuing study, in the recent prior literature (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)), Cas12f1 protein is a dimer in the CRISPR / Cas12f1 system. , and the structure of guide RNAs. It was found by the above literature that there is a so-called disordered region that does not directly interact with the Cas12f1 protein in the guide RNA portion, and another prior literature (Xiao et al., Structural basis for the dimerization-dependent CRISPR-Cas12f nuclease, In bioRxiv (2020)), the disordered region was removed and double-stranded DNA cleavage efficiency was examined in vitro. However, the preceding studies do not 1) show or increase intracellular gene editing activity (eg, indel generation efficiency), and 2) there are experiments that remove the disordered region, but rather the cleavage activity is low. 3) Moreover, as the above experiment was conducted in vitro, it failed to reveal what modifications were made to show intracellular gene editing activity or to increase efficiency.

본 명세서에서는 CRISPR/Cas12f1 시스템에 사용되어 유전자 편집 효율을 증가시킬 수 있는 엔지니어링 된 Cas12f1 가이드 RNA를 제공하고자 한다.An object of the present specification is to provide an engineered Cas12f1 guide RNA that can be used in the CRISPR/Cas12f1 system to increase gene editing efficiency.

본 명세서에서는 상기 엔지니어링 된 Cas12f1 가이드 RNA에 포함되어 유전자 편집 효율을 증가시킬 수 있는 엔지니어링 된 스캐폴드 영역을 제공하고자 한다.In the present specification, it is intended to provide an engineered scaffold region that can be included in the engineered Cas12f1 guide RNA to increase gene editing efficiency.

본 명세서에서는 유전자 편집 효율이 증가된 엔지니어링 된 CRISPR/Cas12f1 복합체를 제공하고자 한다.An object of the present specification is to provide an engineered CRISPR/Cas12f1 complex with increased gene editing efficiency.

본 명세서에서는 유전자 편집 효율이 증가된 엔지니어링 된 CRISPR/Cas12f1 시스템을 제공하고자 한다.An object of the present specification is to provide an engineered CRISPR/Cas12f1 system with increased gene editing efficiency.

본 명세서에서는 상기 엔지니어링 된 CRISPR/Cas12f1 시스템의 각 구성요소를 암호화하는 핵산 서열을 가지는 벡터를 제공하고자 한다.An object of the present specification is to provide a vector having a nucleic acid sequence encoding each component of the engineered CRISPR/Cas12f1 system.

본 명세서에서는 상기 엔지니어링 된 CRISPR/Cas12f1 시스템을 사용한 유전자 편집 방법을 제공하고자 한다.An object of the present specification is to provide a gene editing method using the engineered CRISPR/Cas12f1 system.

본 명세서에서는 상기 엔지니어링 된 CRISPR/Cas12f1 시스템의 용도를 제공하고자 한다.In the present specification, it is intended to provide the use of the engineered CRISPR/Cas12f1 system.

상기 과제를 해결하기 위해, 본 명세서에서는 다음을 포함하는, CRISPR/Cas12f1 시스템을 위한 엔지니어링 된 가이드 RNA를 개시한다:In order to solve the above problem, herein disclosed is an engineered guide RNA for the CRISPR/Cas12f1 system, comprising:

엔지니어링 된 스캐폴드 영역; 및engineered scaffold area; and

스페이서;spacer;

이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,

상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 하며,The sequence of the engineered scaffold region is characterized in that it differs from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 7),

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and

5'-AUGCAAC-3'.5'-AUGCAAC-3'.

상기 과제를 해결하기 위해, 본 명세서에서는 다음을 포함하는, CRISPR/Cas12f1 시스템을 위한 엔지니어링 된 가이드 RNA를 개시한다:In order to solve the above problem, herein disclosed is an engineered guide RNA for the CRISPR/Cas12f1 system, comprising:

엔지니어링 된 스캐폴드 영역; 및engineered scaffold area; and

스페이서; 및spacer; and

이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,

상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3'(서열번호 315)과 상이한 것을 특징으로 하며,The sequence of the engineered scaffold region is 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUUCCUCUCCCAAAAUUCGAGAACCAAUGAAUGAAUUCGAUGAAGAAUGAAUGAAUGAUGAAGUGAGAAUGAAUGAAUGAUGAUGAGAAUGAAUGAAUGAUGAAUGAAGAAU characterized by that the sequence differs by the sequence

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3'(서열번호 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3'(서열번호 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGA-3'(서열번호 295), 5'-AACAAAUUCAUUUUUCCGAAAAGACGAAUGAAGGA-3'(서열번호 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3'(서열번호 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3'(서열번호 298), 5'-AACAAAUUCAUUUUUCCUCUGAAAAAUAGACGAAUGAAGGA-3'(서열번호 299), 5'-AACAAAUUCAUUUUUCCUCUCGAAAGAAUAGACGAAUGAAGGA-3'(서열번호 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3'(서열번호 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3'(서열번호 302), 5'-AACAAAUUCAUUUUUCCUCUCCAAGAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 303), 5'-AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 312), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 313), 및 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 314)로 이뤄진 군에서 선택된 서열; 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) No. 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3' (SEQ ID NO: 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3' (SEQ ID NO: 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAAU-GAAAAAGGAUGA (SEQ ID NO: 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 298), 5'-AACAAAUUCAUUCAUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAACUGAAUGAAUUUUUCCUCUGAAUGAAGAAUUCA 3' (SEQ ID NO: 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAGU-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAU AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308 ), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열 312), 5'-AACAAAUUCAUUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 313), and 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO:314) selected from the group consisting of; and

5'-AUGCAAC-3'.5'-AUGCAAC-3'.

상기 과제를 해결하기 위해, 본 명세서에서는 다음을 포함하는, CRISPR/Cas12f1 시스템을 위한 엔지니어링 된 가이드 RNA를 개시한다:In order to solve the above problem, herein disclosed is an engineered guide RNA for the CRISPR/Cas12f1 system, comprising:

엔지니어링 된 스캐폴드; 및engineered scaffolds; and

스페이서,spacer,

이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,

상기 스페이서는 10 내지 30 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가지며,The spacer has a length of 10 to 30 nucleotides and has a sequence complementary to the target sequence,

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:

5'-A-3'로 표현되는 제1 서열;a first sequence represented by 5'-A-3';

5'-CCGCUUCAC-3'(서열번호 432)로 표현되는 제2 서열;a second sequence represented by 5'-CCGCUUCAC-3' (SEQ ID NO: 432);

5'-UUAG-3'로 표현되는 제3 서열;a third sequence represented by 5'-UUAG-3';

5'-AGUGAAGGUGG-3'(서열번호 433)로 표현되는 제4 서열;a fourth sequence represented by 5'-AGUGAAGGUGG-3' (SEQ ID NO: 433);

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11)로 표현되는 제5 서열;a fifth sequence represented by 5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAA-3'로 표현되는 제6 서열;a sixth sequence represented by 5'-AACAAA-3';

링커;linker;

5'-GGA-3'로 표현되는 제7 서열; 및a seventh sequence represented by 5'-GGA-3'; and

5'-AUGCAAC-3'로 표현되는 제8 서열.The eighth sequence represented by 5'-AUGCAAC-3'.

상기 과제를 해결하기 위해, 본 명세서에서는 다음을 포함하는 CRISPR/Cas12f1 시스템을 위한 엔지니어링 된 가이드 RNA를 개시한다:In order to solve the above problem, herein disclosed is an engineered guide RNA for CRISPR/Cas12f1 system comprising:

엔지니어링 된 스캐폴드 영역; 및engineered scaffold area; and

스페이서;spacer;

이때, 상기 스페이서는 10 내지 30 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가지며,In this case, the spacer has a length of 10 to 30 nucleotides and has a sequence complementary to the target sequence,

상기 엔지니어링 된 스캐폴드 영역의 서열은 다음 서열을 포함함:The sequence of the engineered scaffold region comprises the following sequence:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), a sequence selected from the group consisting of 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9),

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열,5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order,

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11), 및5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11), and

5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAUUCA-3'(서열번호 66), 5'-AACAAAUUCAU-3'(서열번호 67), 5'-AACAAAUUCAUU-3'(서열번호 68), 및 5'-AACAAAUUCAUUU-3'(서열번호 12)로 이뤄진 군에서 선택된 서열이 연결된 엔지니어링 된 tracrRNA; 및5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAAUUCA-3' (SEQ ID NO: 66), 5'-AACAAAUUCAU- an engineered tracrRNA to which a sequence selected from the group consisting of 3' (SEQ ID NO: 67), 5'-AACAAAUUCAUU-3' (SEQ ID NO: 68), and 5'-AACAAAUUCAUUU-3' (SEQ ID NO: 12) is linked; and

5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5'-AAUGAAGGA-3', 및 5'-GAAUGAAGGA-3'(서열번호 14)으로 이뤄진 군에서 선택된 서열, 및5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5' -AAUGAAGGA-3', and a sequence selected from the group consisting of 5'-GAAUGAAGGA-3' (SEQ ID NO: 14), and

5'-AUGCAAC-3'가 연결된 엔지니어링 된 crRNA 반복 서열 부분,5'-AUGCAAC-3' linked engineered crRNA repeat sequence portion,

이때, 상기 엔지니어링 된 crRNA 반복 서열 부분의 3'말단은 상기 스페이서의 5'말단과 연결 되어 있으며,At this time, the 3' end of the engineered crRNA repeat sequence portion is connected to the 5' end of the spacer,

상기 엔지니어링 된 tracrRNA의 서열이 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 1)과 동일하고, 및 상기 엔지니어링 된 crRNA 반복 서열이 5'-GAAUGAAGGAAUGCAAC-3'(서열번호 3)과 동일한 경우는 제외되는 것을 특징으로 함.If the sequence of the engineered tracrRNA is identical to 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3' (SEQ ID NO: 1), except that the sequence is identical to that of the engineered tracrRNA repeats that the sequence is identical to 5'-CUUCACUGAUAAAGUGGAGAGAACCGCUUCACCAAAAGCUGUCCCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3' (SEQ ID NO: 1). characterized.

상기 과제를 해결하기 위해, 본 명세서에서는 다음을 포함하는, 표적 서열을 포함하는 핵산을 편집할 수 있는 엔지니어링 된 CRISPR/Cas12f1 복합체를 개시한다:In order to solve the above problem, disclosed herein is an engineered CRISPR/Cas12f1 complex capable of editing a nucleic acid comprising a target sequence, comprising:

Cas12f1 단백질; 및Cas12f1 protein; and

엔지니어링 된 가이드 RNA,engineered guide RNA,

이때, 상기 엔지니어링 된 가이드 RNA는 다음을 포함함:In this case, the engineered guide RNA comprises:

엔지니어링 된 스캐폴드 영역; 및engineered scaffold area; and

스페이서;spacer;

이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,

상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 하며,The sequence of the engineered scaffold region is characterized in that it differs from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 7),

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and

5'-AUGCAAC-3'.5'-AUGCAAC-3'.

상기 과제를 해결하기 위해, 본 명세서에서는 다음을 포함하는, CRISPR/Cas12f1 시스템의 각 구성요소를 발현할 수 있는 벡터를 개시한다:In order to solve the above problem, the present specification discloses a vector capable of expressing each component of the CRISPR/Cas12f1 system, comprising:

Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 제1 서열;a first sequence comprising a nucleic acid sequence encoding a Cas12f1 protein;

상기 제1 서열과 작동 가능하게 연결된 제1 프로모터 서열;a first promoter sequence operably linked to said first sequence;

엔지니어링 된 가이드 RNA를 암호화하는 핵산 서열을 포함하는 제2 서열; 및a second sequence comprising a nucleic acid sequence encoding the engineered guide RNA; and

상기 제2 서열과 작동 가능하게 연결된 제2 프로모터 서열,a second promoter sequence operably linked to said second sequence;

이때, 상기 엔지니어링 된 가이드 RNA는 다음을 포함함:In this case, the engineered guide RNA comprises:

엔지니어링 된 스캐폴드 영역; 및engineered scaffold area; and

스페이서;spacer;

이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,

상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 하며,The sequence of the engineered scaffold region is characterized in that it differs from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 7),

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and

5'-AUGCAAC-3'.5'-AUGCAAC-3'.

상기 과제를 해결하기 위해, 본 명세서에서는 다음을 포함하는, 세포 내에서 표적 서열을 포함하는 핵산을 편집하는 방법을 개시한다:In order to solve the above problem, disclosed herein is a method for editing a nucleic acid comprising a target sequence in a cell, comprising:

Cas12f1 단백질 또는 이를 암호화하는 핵산, 및 엔지니어링 된 가이드 RNA 또는 이름 암호화하는 핵산을 세포 내로 전달하는 것,delivering the Cas12f1 protein or a nucleic acid encoding the same, and an engineered guide RNA or a nucleic acid encoding the name into a cell;

이로 인해 상기 세포 내에서 CRISPR/Cas12f1 복합체가 형성될 수 있으며,Due to this, the CRISPR/Cas12f1 complex may be formed in the cell,

이로 인해 상기 표적 서열을 포함하는 핵산이 CRISPR/Cas12f1 복합체에 의해 편집될 수 있고,Due to this, the nucleic acid comprising the target sequence can be edited by the CRISPR/Cas12f1 complex,

상기 엔지니어링 된 가이드 RNA는 다음을 포함하는 것을 특징으로 함:The engineered guide RNA is characterized in that it comprises:

엔지니어링 된 스캐폴드 영역; 및engineered scaffold area; and

스페이서;spacer;

이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,

상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 하며,The sequence of the engineered scaffold region is characterized in that it differs from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 7),

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and

5'-AUGCAAC-3'.5'-AUGCAAC-3'.

상기 과제를 해결하기 위해, 본 명세서에서는 다음을 포함하는, 세포 내에서 표적 서열을 포함하는 핵산을 편집하는 방법을 개시한다:In order to solve the above problem, disclosed herein is a method for editing a nucleic acid comprising a target sequence in a cell, comprising:

Cas12f1 단백질 또는 이를 암호화하는 핵산, 및 엔지니어링 된 가이드 RNA 또는 이름 암호화하는 핵산을 세포 내로 전달하는 것,delivering the Cas12f1 protein or a nucleic acid encoding the same, and an engineered guide RNA or a nucleic acid encoding the name into a cell;

이로 인해 상기 세포 내에서 CRISPR/Cas12f1 복합체가 형성될 수 있으며,Due to this, the CRISPR/Cas12f1 complex may be formed in the cell,

이로 인해 상기 표적 서열을 포함하는 핵산이 CRISPR/Cas12f1 복합체에 의해 편집될 수 있고,Due to this, the nucleic acid comprising the target sequence can be edited by the CRISPR/Cas12f1 complex,

상기 엔지니어링 된 가이드 RNA는 다음을 포함하는 것을 특징으로 함:The engineered guide RNA is characterized in that it comprises:

엔지니어링 된 스캐폴드 영역; 및engineered scaffold area; and

스페이서; 및spacer; and

이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,

상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3'(서열번호 315)과 상이한 것을 특징으로 하며,The sequence of the engineered scaffold region is 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUUCCUCUCCCAAAAUUCGAGAACCAAUGAAUGAAUUCGAUGAAGAAUGAAUGAAUGAUGAAGUGAGAAUGAAUGAAUGAUGAUGAGAAUGAAUGAAUGAUGAAUGAAGAAU characterized by that the sequence differs by the sequence

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3'(서열번호 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3'(서열번호 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGA-3'(서열번호 295), 5'-AACAAAUUCAUUUUUCCGAAAAGACGAAUGAAGGA-3'(서열번호 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3'(서열번호 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3'(서열번호 298), 5'-AACAAAUUCAUUUUUCCUCUGAAAAAUAGACGAAUGAAGGA-3'(서열번호 299), 5'-AACAAAUUCAUUUUUCCUCUCGAAAGAAUAGACGAAUGAAGGA-3'(서열번호 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3'(서열번호 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3'(서열번호 302), 5'-AACAAAUUCAUUUUUCCUCUCCAAGAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 303), 5'-AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 312), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 313), 및 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 314)로 이뤄진 군에서 선택된 서열; 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) No. 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3' (SEQ ID NO: 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3' (SEQ ID NO: 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAAU-GAAAAAGGAUGA (SEQ ID NO: 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 298), 5'-AACAAAUUCAUUCAUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAACUGAAUGAAUUUUUCCUCUGAAUGAAGAAUUCA 3' (SEQ ID NO: 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAGU-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAU AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308 ), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열 312), 5'-AACAAAUUCAUUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 313), and 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO:314) selected from the group consisting of; and

5'-AUGCAAC-3'.5'-AUGCAAC-3'.

본 명세서에서 제공하는 엔지니어링 된 스캐폴드 영역을 가지는 엔지니어링 된 Cas12f1 가이드 RNA를 포함하는 CRISPR/Cas12f1 시스템을 유전자 편집에 사용하는 경우, 자연계에서 발견되는 CRISPR/Cas12f1 시스템을 사용할 때와 비교해 높은 유전자 편집 효율을 나타낸다.When the CRISPR/Cas12f1 system including the engineered Cas12f1 guide RNA having the engineered scaffold region provided herein is used for gene editing, high gene editing efficiency is achieved compared to when using the CRISPR/Cas12f1 system found in nature. indicates.

도 1은 본 명세서에서 개시하는 엔지니어링 된 Cas12f1 가이드 RNA를 예시적으로 나타낸 모식도이다.
도 2 내지 도 4는 실험예 2에 개시된 실시예 중, DY2를 표적으로 하는 Example 1.1 내지 Example 1.13에 대한 평균 indel 효율을 나타낸 그래프이다. 이때, Ex 는 Example, Comp는 Comparative Example을 나타내는 약어이며, Control은 CRISPR/Cas12f1을 처리하지 않은 음성 대조군을 나타낸다.
도 5 내지 도 8은 실험예 2에 개시된 실시예 중, DY10을 표적으로 하는 Example 2.1 내지 Example 2.13에 대한 평균 indel 효율을 나타낸 그래프이다. 이때, Ex 는 Example, Comp는 Comparative Example을 나타내는 약어이며, Control은 CRISPR/Cas12f1을 처리하지 않은 음성 대조군을 나타낸다.
도 9 내지 도 12는 실험예 2에 개시된 실시예 중, Intergenic-22를 표적으로 하는 Example 3.1 내지 Example 3.13에 대한 평균 indel 효율을 나타낸 그래프이다. 이때, Ex 는 Example, Comp는 Comparative Example을 나타내는 약어이며, Control은 CRISPR/Cas12f1을 처리하지 않은 음성 대조군을 나타낸다.
도 13은 실험예 2에 개시된 실시예 중, DY2를 표적으로 하는 Example 1.13 내지 Example 1.14, DY10을 표적으로 하는 Example 2.13 내지 Example 2.14, Intergenic-22를 표적으로 하는 Example 3.13 내지 Example 3.14에 대한 평균 indel 효율을 나타낸 그래프이다. 이때, Ex 는 Example, Comp는 Comparative Example을 나타내는 약어이며, Control은 CRISPR/Cas12f1을 처리하지 않은 음성 대조군을 나타낸다.
1 is a schematic diagram illustratively showing the engineered Cas12f1 guide RNA disclosed herein.
2 to 4 are graphs showing average indel efficiencies for Examples 1.1 to 1.13 targeting DY2 among the examples disclosed in Experimental Example 2. FIG. In this case, Ex is an abbreviation for Example, Comp is an abbreviation for Comparative Example, and Control is a negative control that is not treated with CRISPR/Cas12f1.
5 to 8 are graphs showing average indel efficiencies for Examples 2.1 to 2.13 targeting DY10 among the examples disclosed in Experimental Example 2. FIG. In this case, Ex is an abbreviation for Example, Comp is an abbreviation for Comparative Example, and Control is a negative control that is not treated with CRISPR/Cas12f1.
9 to 12 are graphs showing average indel efficiencies for Examples 3.1 to 3.13 targeting Intergenic-22 among the Examples disclosed in Experimental Example 2; In this case, Ex is an abbreviation for Example, Comp is an abbreviation for Comparative Example, and Control is a negative control that is not treated with CRISPR/Cas12f1.
13 shows the average indels for Examples 1.13 to Example 1.14 targeting DY2, Examples 2.13 to Example 2.14 targeting DY10, and Examples 3.13 to Example 3.14 targeting Intergenic-22 among the examples disclosed in Experimental Example 2; This is a graph showing the efficiency. In this case, Ex is an abbreviation for Example, Comp is an abbreviation for Comparative Example, and Control is a negative control that is not treated with CRISPR/Cas12f1.

용어의 정의Definition of Terms

approximately

본 명세서에서 사용되는 "약"이라는 용어는 참조 양, 수준, 값, 수, 빈도, 퍼센트, 치수, 크기, 양, 중량 또는 길이에 대해 30, 25, 20, 25, 10, 9, 8, 7, 6, 5, 4, 3, 2 또는 1% 정도로 변하는 양, 수준, 값, 수, 빈도, 퍼센트, 치수, 크기, 양, 중량 또는 길이를 의미한다.As used herein, the term “about” refers to 30, 25, 20, 25, 10, 9, 8, 7 with respect to a reference amount, level, value, number, frequency, percent, dimension, size, amount, weight or length. , means an amount, level, value, number, frequency, percentage, dimension, size, amount, weight or length varying by 6, 5, 4, 3, 2 or 1%.

A,T,C,G, 및 UA,T,C,G, and U

본 명세서에서 사용되는 A, T, C, G 및 U 기호는 당업계 통상의 기술자가 이해하는 의미로 해석된다. 문맥 및 기술에 따라 DNA 또는 RNA 상에서 염기, 뉴클레오사이드 또는 뉴클레오타이드로 적절히 해석될 수 있다. 예를 들어, 염기를 의미하는 경우는 각각 아데닌(A), 티민(T), 시토신(C), 구아닌(G) 또는 우라실(U) 자체로 해석될 수 있고, 뉴클레오사이드를 의미하는 경우는 각각 아데노신(A), 티민(T), 시티딘(C), 구아노신(G) 또는 유리딘(U)으로 해석될 수 있으며, 서열에서 뉴클레오타이드를 의미하는 경우는 상기 각각의 뉴클레오사이드를 포함하는 뉴클레오타이드를 의미하는 것으로 해석되어야 한다.As used herein, the symbols A, T, C, G and U are interpreted as meanings understood by those of ordinary skill in the art. It may be appropriately interpreted as a base, nucleoside or nucleotide on DNA or RNA according to context and technology. For example, when it means a base, it can be interpreted as adenine (A), thymine (T), cytosine (C), guanine (G) or uracil (U) itself, respectively, and when it means a nucleoside, It can be interpreted as adenosine (A), thymine (T), cytidine (C), guanosine (G) or uridine (U), respectively, and if it means a nucleotide in the sequence, it includes each of the nucleosides should be construed as meaning a nucleotide that

작동 가능하게 연결된(operably linked)operably linked

본 명세서에서 사용되는 "작동 가능하게 연결된"이라는 용어는 유전자 발현 기술에 있어서, 특정 구성이 다른 구성과 연결되어, 상기 특정 구성이 의도된 방식대로 기능할 수 있도록 연결되어 있는 것을 의미한다. 예를 들어, 프로모터 서열이 암호화 서열과 작동적으로 연결되었다고 할 때, 상기 프로모터가 상기 암호화 서열의 세포 내에서의 전사 및/또는 발현에 영향을 미칠 수 있도록 연결된 것을 의미한다. 또한, 상기 용어는 당업계 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절히 해석될 수 있다.The term "operably linked" as used herein, in gene expression technology, means that a specific component is linked to another component, such that the specific component is linked such that it functions in an intended manner. For example, when a promoter sequence is operatively linked with a coding sequence, it is meant that the promoter is linked so as to affect the transcription and/or expression of the coding sequence in a cell. In addition, the term includes all meanings recognized by those skilled in the art, and may be appropriately interpreted according to the context.

표적 유전자 또는 표적 핵산target gene or target nucleic acid

본 명세서에서 사용되는 "표적 유전자" 또는 "표적 핵산"은 기본적으로, 유전자 편집의 대상이 되는 세포 내 유전자, 또는 핵산을 의미한다. 상기 표적 유전자 또는 표적 핵산은 혼용될 수 있으며, 서로 동일한 대상을 지칭할 수 있다. 상기 표적 유전자 또는 표적 핵산은 달리 기재되지 않은 한, 대상 세포가 가진 고유한 유전자 또는 핵산, 혹은 외부 유래의 유전자 또는 핵산 모두를 의미할 수 있으며, 유전자 편집의 대상이 될 수 있다면 특별히 제한되지 않는다. 상기 표적 유전자 또는 표적 핵산은 단일가닥 DNA, 이중가닥 DNA, 및/또는 RNA일 수 있다. 또한, 상기 용어는 당업계 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절히 해석될 수 있다."Target gene" or "target nucleic acid" as used herein basically refers to a gene or nucleic acid in a cell to be subjected to gene editing. The target gene or target nucleic acid may be used interchangeably, and may refer to the same subject. The target gene or target nucleic acid may refer to both a unique gene or nucleic acid possessed by a target cell, or an externally-derived gene or nucleic acid, unless otherwise specified, and is not particularly limited as long as it can be subjected to gene editing. The target gene or target nucleic acid may be single-stranded DNA, double-stranded DNA, and/or RNA. In addition, the term includes all meanings recognized by those skilled in the art, and may be appropriately interpreted according to the context.

표적 서열target sequence

본 명세서에서 사용되는 "표적 서열"은 CRISPR/Cas 복합체가 표적 유전자 또는 표적 핵산을 절단하기 위해 인식하는 특정 서열을 의미한다. 상기 표적 서열은 그 목적에 따라 적절히 선택될 수 있다. 구체적으로, "표적 서열"은 표적 유전자 또는 표적 핵산 서열 내에 포함된 서열이며, 본 명세서에서 제공하는 가이드 RNA, 또는 엔지니어링 된 가이드 RNA에 포함된 스페이서 서열과 상보성을 가지는 서열을 의미한다. 일반적으로, 상기 스페이서 서열은 표적 유전자 또는 표적 핵산의 서열 및 CRISPR/Cas 시스템의 이펙터 단백질이 인식하는 PAM 서열을 고려하여 결정된다. 상기 표적 서열은 CRISPR/Cas 복합체의 가이드 RNA와 상보적으로 결합하는 특정 가닥만을 지칭할 수 있으며, 상기 특정 가닥 부분을 포함하는 표적 이중 가닥 전체를 지칭할 수도 있으며, 이는 문맥에 따라 적절히 해석된다. 또한, 상기 용어는 당업계 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절히 해석될 수 있다.As used herein, “target sequence” refers to a specific sequence recognized by the CRISPR/Cas complex to cleave a target gene or target nucleic acid. The target sequence may be appropriately selected depending on the purpose. Specifically, "target sequence" refers to a sequence included in a target gene or target nucleic acid sequence, and refers to a sequence having complementarity with a spacer sequence included in a guide RNA provided herein or an engineered guide RNA. In general, the spacer sequence is determined in consideration of the sequence of the target gene or target nucleic acid and the PAM sequence recognized by the effector protein of the CRISPR/Cas system. The target sequence may refer only to a specific strand complementary to the guide RNA of the CRISPR/Cas complex, or may refer to the entire target double strand including the specific strand portion, which is appropriately interpreted according to the context. In addition, the term includes all meanings recognized by those skilled in the art, and may be appropriately interpreted according to the context.

벡터vector

본 명세서에서 사용되는 "벡터"는 달리 특정되지 않는 한, 유전 물질을 세포 내로 운반할 수 있는 모든 물질을 통틀어 일컫는다. 예를 들어, 벡터는 대상이 되는 유전 물질, 예를 들어 CRISPR/Cas 시스템의 이펙터 단백질을 암호화하는 핵산, 및/또는 가이드 RNA를 암호화하는 핵산을 포함하는 DNA 분자일 수 있으나, 이에 제한되는 것은 아니다. 상기 용어는 당업계 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절히 해석될 수 있다.As used herein, unless otherwise specified, "vector" refers to any material capable of transporting genetic material into a cell. For example, a vector may be, but is not limited to, a DNA molecule comprising the genetic material of interest, for example, a nucleic acid encoding an effector protein of the CRISPR/Cas system, and/or a nucleic acid encoding a guide RNA. . The above terms include all meanings recognized by those skilled in the art, and may be appropriately interpreted according to the context.

자연계에서 발견되는found in nature

본 명세서에서 "자연계에서 발견되는" 이라는 용어는, 자연계에서 발견되는, 변형되지 않은 대상을 의미하며, 인위적인 변형이 가해진 "엔지니어링 된 대상"과 구분하기 위해 사용된다. 상기 "자연계에서 발견되는" 유전자, 핵산, DNA, RNA 등은 야생형 및 mature form (active form)의 유전자, 핵산, DNA, RNA를 모두 포괄하는 개념으로 사용된다. 상기 용어는 그 외 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절히 해석되어야 한다.As used herein, the term "found in nature" means an unmodified object found in the natural world, and is used to distinguish it from an "engineered object" that has been artificially deformed. The "genes, nucleic acids, DNA, RNA, etc. found in nature" are used as a concept encompassing all genes, nucleic acids, DNA, and RNA in wild-type and mature form (active form). The term includes all other meanings recognized by those skilled in the art, and should be appropriately interpreted according to the context.

엔지니어링 된engineered

본 명세서에서 사용되는 "엔지니어링 된"이란 용어는 자연계에 이미 존재하는 구성을 가진 물질, 분자 등과 구분하기 위해 사용하는 용어로, 상기 물질, 분자 등에 인위적인 변형이 가해진 것을 의미한다. 예를 들어, "엔지니어링 된 가이드 RNA"의 경우, 자연계에 존재하는 가이드 RNA의 구성에 인위적인 변경이 가해진 가이드 RNA를 의미한다. 또한, 상기 용어는 당업계 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절히 해석될 수 있다.As used herein, the term "engineered" is a term used to distinguish substances, molecules, etc. having a constitution that already exist in nature, and means that artificial modifications are applied to the substances, molecules, etc. For example, in the case of "engineered guide RNA", it refers to a guide RNA in which an artificial change is applied to the composition of a guide RNA existing in nature. In addition, the term includes all meanings recognized by those skilled in the art, and may be appropriately interpreted according to the context.

NLS(Nuclear Localization Sequence, or Signal)NLS (Nuclear Localization Sequence, or Signal)

본 명세서에서 "NLS"라 함은, 핵 수송(nuclear transport) 작용으로 세포 핵 외부의 물질을 핵 내부로 수송할 때, 수송 대상인 단백질에 붙어 일종의 "태그"역할을 하는 일정 길이의 펩타이드, 또는 그 서열을 의미한다. 구체적으로, 상기 NLS는 아미노산 서열 PKKKRKV(서열번호 277)를 갖는 SV40 바이러스 대형 T-항원의 NLS; 뉴클레오플라스민(nucleoplasmin)으로부터의 NLS(예를 들어, 서열 KRPAATKKAGQAKKKK(서열번호 278)를 갖는 뉴클레오플라스민 이분(bipartite) NLS); 아미노산 서열 PAAKRVKLD(서열번호 279) 또는 RQRRNELKRSP(서열번호 280)를 갖는 c-myc NLS; 서열 NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY(서열번호 281)를 갖는 hRNPA1 M9 NLS; 임포틴-알파로부 터의 IBB 도메인의 서열 RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV(서열번호 282); 마이오 마(myoma) T 단백질의 서열 VSRKRPRP(서열번호 283) 및 PPKKARED(서열번호 284); 인간 p53의 서열 PQPKKKPL(서열번호 285); 마우스 c-abl IV의 서열 SALIKKKKKMAP(서열번호 286); 인플루엔자 바이러스 NS1의 서열 DRLRR(서열번호 287) 및 PKQKKRK(서열번호 288); 간염 바이러스 델타 항원의 서열 RKLKKKIKKL(서열번호 289); 마우스 Mx1 단백질의 서열 REKKKFLKRR(서열번호 290); 인간 폴리(ADP-리보스) 중합효소의 서열 KRKGDEVDGVDEVAKKKSKK(서열번호 291); 또는 스테로이드 호르몬 수용체(인간) 글루코코르티코이드의 서열 RKCLQAGMNLEARKTKK(서열번호 292)로부터 유래된 NLS 서열일 수 있으나, 이에 제한되지 않는다. 본 명세서에서 사용되는 "NLS"라는 용어는 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절하게 해석될 수 있다.As used herein, the term “NLS” refers to a peptide of a certain length that acts as a kind of “tag” attached to a protein to be transported when a substance outside the cell nucleus is transported into the nucleus by nuclear transport, or its means sequence. Specifically, the NLS is the NLS of the SV40 virus large T-antigen having the amino acid sequence PKKKRKV (SEQ ID NO: 277); NLS from nucleoplasmin (eg, nucleoplasmin bipartite NLS having the sequence KRPAATKKAGQAKKKK (SEQ ID NO: 278)); c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO: 279) or RQRRNELKRSP (SEQ ID NO: 280); hRNPA1 M9 NLS having the sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 281); sequence RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV (SEQ ID NO: 282) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO: 283) and PPKKARED (SEQ ID NO: 284) of the myoma T protein; the sequence PQPKKKPL of human p53 (SEQ ID NO: 285); sequence SALIKKKKKMAP of mouse c-abl IV (SEQ ID NO: 286); sequences DRLRR (SEQ ID NO: 287) and PKQKKRK (SEQ ID NO: 288) of influenza virus NS1; sequence RKLKKKIKKL (SEQ ID NO: 289) of hepatitis virus delta antigen; sequence REKKKFLKRR (SEQ ID NO: 290) of mouse Mx1 protein; sequence KRKGDEVDGVDEVAKKKSKK (SEQ ID NO: 291) of human poly(ADP-ribose) polymerase; or an NLS sequence derived from the sequence RKCLQAGMNLEARKTKK (SEQ ID NO: 292) of a steroid hormone receptor (human) glucocorticoid. As used herein, the term “NLS” includes all meanings recognized by those of ordinary skill in the art, and may be appropriately interpreted according to the context.

NES(Nuclear Export Sequence, or Signal)NES (Nuclear Export Sequence, or Signal)

본 명세서에서 "NES"라 함은, 핵 수송(nuclear transport) 작용으로 세포 핵 내부의 물질을 핵 외부로 수송할 때, 수송 대상인 단백질에 붙어 일종의 "태그"역할을 하는 일정 길이의 펩타이드, 또는 그 서열을 의미한다. 본 명세서에서 사용되는 "NES"라는 용어는 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절하게 해석될 수 있다.As used herein, the term "NES" refers to a peptide of a certain length that acts as a kind of "tag" attached to a protein to be transported when a material inside the cell nucleus is transported to the outside of the nucleus by nuclear transport, or its means sequence. As used herein, the term “NES” includes all meanings recognized by those skilled in the art, and may be appropriately interpreted according to the context.

태그tag

본 명세서에서 "태그"라 함은, 펩타이드, 또는 단백질의 추적 및/또는 분리정제를 쉽게 하기 위하여 부가되는 기능적 도메인을 통틀어 일컫는다. 구체적으로, 상기 태그는 히스티딘(His) 태그, V5 태그, FLAG 태그, 인플루엔자 헤마글루 티닌(HA) 태그, Myc 태그, VSV-G 태그 및 티오레독신(Trx) 태그 등의 태그 단백질, 녹색 형광 단백질(GFP), 황색 형광 단백질(YFP), 청록색 형관 단백질(CFP), 청색 형광 단백질(BFP), HcRED, DsRed 등의 자가형광 단백질, 및 글루타티온-S-트랜스 퍼라제(GST), 호스라디시(horseradish) 과산화효소(HRP), 클로람페니콜 아세틸트랜스퍼라제(CAT) 베타-갈락토시다제, 베타 -글루쿠로니다제, 루시퍼라제 등의 리포터 유전자를 포함하나, 이에 제한되는 것은 아니다. 본 명세서에서 사용되는 "태그"라는 용어는 통상의 기술자가 인식할 수 있는 의미를 모두 포함하며, 문맥에 따라 적절하게 해석될 수 있다.As used herein, the term “tag” refers to a functional domain added to facilitate the tracking and/or separation and purification of peptides or proteins. Specifically, the tag includes a histidine (His) tag, a V5 tag, a FLAG tag, an influenza hemagglutinin (HA) tag, a Myc tag, a VSV-G tag, and a tag protein such as a thioredoxin (Trx) tag, a green fluorescent protein (GFP), yellow fluorescent protein (YFP), cyan fluorescent protein (CFP), blue fluorescent protein (BFP), autofluorescent proteins such as HcRED, DsRed, and glutathione-S-transferase (GST), horseradish ( horseradish) peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, beta-glucuronidase, reporter genes such as luciferase, but are not limited thereto. As used herein, the term "tag" includes all meanings recognizable to those of ordinary skill in the art, and may be appropriately interpreted according to the context.

배경기술 - CRISPR/Cas12f1 복합체의 구조Background - Structure of the CRISPR/Cas12f1 complex

CRISPR/Cas12f 시스템CRISPR/Cas12f system

CRISPR/Cas12f 시스템은 type V CRISPR/Cas 시스템 중 V-F 서브타입에 속하고, 이는 다시 V-F1 내지 V-F3의 베리언트로 나뉜다. CRISPR/Cas12f 시스템은 선행연구(Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018))에서 Cas14로 명명된 이펙터 단백질 중, Cas14a, Cas14b, 및 Cas14c 변이체를 포함하는 CRISPR/Cas14 시스템을 포함한다. 이 중, Cas14a 이펙터 단백질을 포함하는 CRISPR/Cas14a 시스템은 CRISPR/Cas12f1 시스템으로 분류된다(Makarova et al., Nature Reviews, Microbiology volume 18, 67 (2020)). 최근 선행 연구(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021), Xiao et al., Structural basis for the dimerization-dependent CRISPR-Cas12f nuclease, bioRxiv (2020)) 등을 통해 상기 CRISPR/Cas12f1 복합체의 구조가 밝혀진 바, 이하 이에 대해 간략히 서술한다.The CRISPR/Cas12f system belongs to the V-F subtype of the type V CRISPR/Cas system, which is further divided into V-F1 to V-F3 variants. The CRISPR/Cas12f system is one of the effector proteins named Cas14 in a previous study (Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018)), Cas14a, Cas14b, and Cas14c variants. CRISPR/Cas14 systems comprising Among them, the CRISPR/Cas14a system including the Cas14a effector protein is classified as the CRISPR/Cas12f1 system (Makarova et al., Nature Reviews, Microbiology volume 18, 67 (2020)). Recent previous studies (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021), Xiao et al., Structural basis for the dimerization-dependent CRISPR-Cas12f nuclease, bioRxiv (2020)), etc. have revealed the structure of the CRISPR/Cas12f1 complex, which will be briefly described below.

CRISPR/Cas12f1 복합체의 구조Structure of the CRISPR/Cas12f1 complex

CRISPR/Cas12f1 복합체 내에서, Cas12f1 단백질은 두 개의 분자가 이량체 형태로 가이드 RNA와 결합하여 복합체를 이루고 있음이 밝혀졌다. 상기 Cas12f1 단백질은 amino-terminal domain (NTD) 및 carboxy-terminal domin(CTD)로 나뉘며, 상기 두 도메인이 링커 루프(linker loop)를 통해 연결되어 있는 구조이다. 상기 NTD는 wedge (WED), recognition (REC), 및 zinc finger (ZF) 도메인으로 구성되며, 상기 CTD는 또 다른 ZF 도메인 및 RuvC 도메인으로 구성된다. Cas12f1 단백질 이량체의 구조는 크게 REC 로브(REC lobe) 및 nuclease 로브(NUC lobe)로 나눌 수 있다. 상기 REC 로브는 이량체를 구성하는 하나의 Cas12f1 단백질의 WED 도메인, ZF 도메인, 및 REC 도메인과 다른 하나의 Cas12f1 단백질의 WED 도메인, ZF 도메인, 및 REC 도메인으로 구성된다. 상기 nuclease 로브는 이량체를 구성하는 하나의 Cas12f1 단백질의 RuvC 도메인 및 TNB 도메인과 다른 하나의 Cas12f1 단백질의 RuvC 도메인 및 TNB 도메인으로 구성된다. 상기 Cas12f1 단백질의 각 도메인의 전부 또는 일부는 각각 Cas12f1 가이드 RNA의 스캐폴드 영역의 특정 부분을 인식하며, CRISPR/Cas12f1 복합체를 이룬다.Within the CRISPR/Cas12f1 complex, it was found that two molecules of the Cas12f1 protein bind to the guide RNA in the form of a dimer to form a complex. The Cas12f1 protein is divided into an amino-terminal domain (NTD) and a carboxy-terminal domin (CTD), and has a structure in which the two domains are connected through a linker loop. The NTD consists of wedge (WED), recognition (REC), and zinc finger (ZF) domains, and the CTD consists of another ZF domain and a RuvC domain. The structure of the Cas12f1 protein dimer can be largely divided into a REC lobe and a nuclease lobe. The REC lobe is composed of a WED domain, a ZF domain, and a REC domain of one Cas12f1 protein constituting a dimer and a WED domain, a ZF domain, and a REC domain of the other Cas12f1 protein. The nuclease lobe is composed of the RuvC domain and TNB domain of one Cas12f1 protein constituting the dimer, and the RuvC domain and TNB domain of the other Cas12f1 protein. All or part of each domain of the Cas12f1 protein recognizes a specific part of the scaffold region of the Cas12f1 guide RNA, respectively, and forms a CRISPR/Cas12f1 complex.

Cas12f1 가이드 RNA의 구조Structure of Cas12f1 guide RNA

Cas12f1 가이드 RNA는 크게 스페이서, 및 스캐폴드 영역으로 나눌 수 있으며, 상기 스캐폴드 영역은 다섯개의 Stem (Stem 1 내지 Stem 5로 명명) 및 하나의 pseudoknot (PK)로 구성된다. 상기 Cas12f1 가이드 RNA는 tracrRNA의 일부(tracrRNA anti-repeat), 및 crRNA 반복 부분(crRNA repeat)의 일부가 상보적으로 결합하여 듀플렉스(duplex)를 이루고 있는 구조를 2개 포함하며, 이를 편의상 crRNA repeat-tracrRNA anti-repeat(R:AR) 부분으로 명명한다. 상기 Stem 5(R:AR2), 및 PK(R:AR1)는 이러한 crRNA repeat-tracrRNA anti-repeat 듀플렉스 구조를 이루고 있다. CRISPR/Cas12f1 복합체에서, 상기 Cas12f1 가이드 RNA 중 Stem 1 부분, Stem 2 중 일부, 및 Stem 5(R:AR2) 부분은 상기 Cas12f1 이량체와 상호작용하지 않는 부분임이 밝혀졌으며, 이를 disordered region이라 한다.Cas12f1 guide RNA can be largely divided into a spacer and a scaffold region, and the scaffold region is composed of five stems (named Stem 1 to Stem 5) and one pseudoknot (PK). The Cas12f1 guide RNA includes two structures in which a part of tracrRNA (tracrRNA anti-repeat) and a part of crRNA repeat (crRNA repeat) are complementary to form a duplex. Named as tracrRNA anti-repeat (R:AR) moiety. The Stem 5 (R:AR2), and PK (R:AR1) form this crRNA repeat-tracrRNA anti-repeat duplex structure. In the CRISPR/Cas12f1 complex, it was found that the Stem 1 part, Stem 2 part, and Stem 5 (R:AR2) part of the Cas12f1 guide RNA do not interact with the Cas12f1 dimer, and this is called a disordered region.

배경기술 - CRISPR/Cas 시스템 발현 벡터 설계Background - CRISPR/Cas system expression vector design

벡터 설계 개괄Vector design overview

CRISPR/Cas 시스템을 유전자 편집에 사용하기 위해서, 상기 CRISPR/Cas 시스템의 각 구성을 암호화하는 서열을 가지는 벡터를 세포 내에 도입시켜, 세포 내에서 상기 CRISPR/Cas 시스템의 각 구성이 발현되도록 하는 방법이 널리 이용되고 있다. 이하, CRISPR/Cas 시스템이 세포 내에서 발현되도록 하는 벡터의 구성 요소에 대해 설명한다.In order to use the CRISPR/Cas system for gene editing, a vector having a sequence encoding each component of the CRISPR/Cas system is introduced into a cell, and each component of the CRISPR/Cas system is expressed in the cell. It is widely used. Hereinafter, the components of the vector that allow the CRISPR/Cas system to be expressed in cells will be described.

CRISPR/Cas 시스템 구성 요소를 암호화하는 핵산Nucleic Acids Encoding CRISPR/Cas System Components

상기 벡터의 목적이 CRISPR/Cas 시스템 각 구성요소를 세포 내에서 발현되도록 하는 것이므로, 상기 벡터의 서열은 CRISPR/Cas 시스템의 각 구성요소를 암호화하는 핵산 서열 중 하나 이상을 필수적으로 포함해야 한다. 구체적으로, 상기 벡터의 서열은 발현하고자 하는 CRISPR/Cas 시스템에 포함된 가이드 RNA, 및/또는 Cas 단백질을 암호화하는 핵산 서열을 포함한다. 이때, 상기 벡터의 서열은 야생형의 가이드 RNA 및 야생형의 Cas 단백질을 암호화하는 핵산 서열 뿐 아니라 그 목적에 따라 엔지니어링 된 Cas12f1 가이드 RNA 및 코돈 최적화된 Cas 단백질을 암호화하는 핵산 서열 또는 엔지니어링 된 Cas 단백질을 암호화하는 핵산 서열을 포함할 수 있다.Since the purpose of the vector is to allow expression of each component of the CRISPR/Cas system in a cell, the sequence of the vector must necessarily include at least one of the nucleic acid sequences encoding each component of the CRISPR/Cas system. Specifically, the sequence of the vector includes a guide RNA included in the CRISPR/Cas system to be expressed, and/or a nucleic acid sequence encoding a Cas protein. In this case, the sequence of the vector encodes a nucleic acid sequence encoding a Cas12f1 guide RNA engineered according to the purpose and a codon-optimized Cas protein or an engineered Cas protein as well as a nucleic acid sequence encoding a wild-type guide RNA and a wild-type Cas protein. It may contain a nucleic acid sequence that

조절/제어 구성요소Regulating/Control Components

상기 벡터를 세포 내에서 발현시키기 위해서는, 하나 이상의 조절/제어 구성요소를 포함해야 한다. 구체적으로, 상기 조절/제어 구성요소는 프로모터, 인핸서, 인트론, 폴리아데닐화 신호, 코작 공통(Kozak consensus) 서열, 내부 리보솜 유입 부위(IRES, Internal Ribosome Entry Site), 스플라이스 억셉터, 2A 서열 및/또는 복제원점(replication origin)을 포함할 수 있으나, 이에 제한되는 것은 아니다. 이때, 상기 복제원점은 f1 복제원점, SV40 복제원점, pMB1 복제원점, 아데노 복제원점, AAV 복제원점, 및/또는 BBV 복제원점일 수 있으나, 이에 제한되는 것은 아니다.In order to express the vector in a cell, it must contain one or more regulatory/control elements. Specifically, the regulatory/control elements include a promoter, an enhancer, an intron, a polyadenylation signal, a Kozak consensus sequence, an Internal Ribosome Entry Site (IRES), a splice acceptor, a 2A sequence and / or may include a replication origin, but is not limited thereto. In this case, the origin of replication may be an f1 origin of replication, an SV40 origin of replication, a pMB1 origin of replication, an adeno origin of replication, an AAV origin of replication, and/or a BBV origin of replication, but is not limited thereto.

프로모터promoter

상기 벡터의 발현 대상을 세포 내에서 발현시키려면, 각 구성 요소를 암호화하는 서열에 프로모터 서열을 작동적으로 연결시켜 세포 내에서 RNA 전사 인자가 활성화될 수 있도록 해야 한다. 상기 프로모터 서열은 대응하는 RNA 전사 인자, 또는 발현 환경에 따라 달리 설계할 수 있으며, CRISPR/Cas 시스템의 구성 요소를 세포 내에서 적절히 발현시킬 수 있는 것이라면 제한되지 않는다. 상기 프로모터 서열은 RNA 중합효소(예를 들어,RNA Pol I, Pol II, 또는 Pol III)의 전사를 촉진시키는 프로모터일 수 있다. 예를 들어, 상기 프로모터는 SV40 초기 프로모터, mouse mammary tumor virus long terminal repeat(LTR) 프로모터, adenovirus major late 프로모터 (Ad MLP), herpes simplex virus (HSV) 프로모터, CMV immediate early promoter region (CMVIE)와 같은 cytomegalovirus (CMV) 프로모터, rous sarcoma virus (RSV) 프로모터, human U6 small nuclear 프로모터 (U6) (Miyagishi et al., Nature Biotechnology 20, 497 - 500 (2002)), enhanced U6 프로모터 (e.g., Xia et al., Nucleic Acids Res. 2003 Sep 1;31(17)), human H1 프로모터 (H1), 및 7SK 중 하나 수 있으나, 이에 제한되는 것은 아니다.In order to express the expression target of the vector in a cell, a promoter sequence must be operatively linked to a sequence encoding each component so that the RNA transcription factor can be activated in the cell. The promoter sequence may be designed differently depending on the corresponding RNA transcription factor or expression environment, and is not limited as long as it can properly express the components of the CRISPR/Cas system in cells. The promoter sequence may be a promoter that promotes transcription of RNA polymerase (eg, RNA Pol I, Pol II, or Pol III). For example, the promoter is SV40 early promoter, mouse mammary tumor virus long terminal repeat (LTR) promoter, adenovirus major late promoter (Ad MLP), herpes simplex virus (HSV) promoter, such as CMV immediate early promoter region (CMVIE) cytomegalovirus (CMV) promoter, rous sarcoma virus (RSV) promoter, human U6 small nuclear promoter (U6) (Miyagishi et al., Nature Biotechnology 20, 497 - 500 (2002)), enhanced U6 promoter (e.g., Xia et al. , Nucleic Acids Res. 2003 Sep 1;31(17)), human H1 promoter (H1), and 7SK, but is not limited thereto.

종결 신호end signal

상기 벡터 서열이 상기 프로모터 서열을 포함하는 경우, RNA 전사 인자에 의해 상기 프로모터와 작동 가능하게 연결된 서열의 전사가 유도되는데, 이러한 RNA 전사 인자의 전사 종결을 유도하는 서열을 종결 신호라고 일컫는다. 상기 종결 신호는, 프로모터 서열의 종류에 따라 달라질 수 있다. 예를 들어, 상기 프로모터가 U6, 또는 H1 프로모터일 경우, 상기 프로모터는 티미딘 연속 서열(예를 들어, TTTTTT (T6) 서열)을 종결 신호로 인식한다.When the vector sequence includes the promoter sequence, transcription of a sequence operably linked to the promoter is induced by an RNA transcription factor. A sequence inducing transcription termination of the RNA transcription factor is referred to as a termination signal. The termination signal may vary depending on the type of promoter sequence. For example, when the promoter is a U6 or H1 promoter, the promoter recognizes a thymidine continuation sequence (eg, a TTTTTT (T6) sequence) as a termination signal.

부가 발현 요소additional expression elements

상기 벡터는 야생형의 CRISPR/Cas 시스템 구성, 및/또는 엔지니어링 된 CRISPR/Cas 시스템 외 통상의 기술자가 필요에 의해 발현시키고자 하는 부가 발현 요소를 암호화하는 핵산 서열을 포함하고 있을 수 있다. 예를 들어, 상기 부가 발현 요소는, "용어의 설명" 중 "태그" 단락에서 설명된 태그 중 하나일 수 있으나, 이에 제한되는 것은 아니다. 예를 들어, 상기 부가 발현 요소는, 글리포세이트(glyphosate), 글루포시네이트암모늄 (glufosinate ammonium) 또는 포스피노트리신(phosphinothricin)과 같은 제초제 저 항성 유전자, 암피실린(ampicillin), 카나마이신(kanamycin), G418, 블레오마이신 (Bleomycin), 하이그로마이신(hygromycin), 클로람페니콜(chloramphenicol)과 같은 항생제 내성 유전자일 수 있으나, 이에 제한되는 것은 아니다.The vector may contain a nucleic acid sequence encoding an additional expression element to be expressed by a person skilled in the art other than the wild-type CRISPR/Cas system configuration and/or the engineered CRISPR/Cas system as needed. For example, the additional expression element may be, but is not limited to, one of the tags described in the “tag” section of “description of terms”. For example, the additional expression element is a herbicide resistance gene such as glyphosate, glufosinate ammonium or phosphinothricin, ampicillin, kanamycin, It may be an antibiotic resistance gene such as G418, bleomycin, hygromycin, or chloramphenicol, but is not limited thereto.

발현 벡터의 형태Form of expression vector

상기 발현 벡터는 선형, 또는 원형 벡터 형태로 설계될 수 있다.The expression vector may be designed in the form of a linear or circular vector.

종래 기술의 한계점Limitations of the prior art

CRISPR/Cas12f1 시스템은 상대적으로 크기가 작기 때문에, 유전자 편집 기술에 적용하기 매력적인 시스템으로, 최초로 보고된 이후(Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018)) 이에 대한 연구가 진행되고 있다(Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018), US 2020/0190494 A1). 하지만, 세포(예를 들어, 진핵 세포) 내에서 유전자 편집 활성이 나타나지 않거나, 지나치게 낮아 이를 활용하는 데 걸림돌이 되고 있다.Since the CRISPR/Cas12f1 system is relatively small in size, it has been reported as an attractive system for gene editing technology (Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018). )) is being studied (Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018), US 2020/0190494 A1). However, gene editing activity in cells (eg, eukaryotic cells) does not appear or is too low, which is an obstacle to its utilization.

계속된 연구로 최근 선행문헌(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))에서는 CRISPR/Cas12f1 시스템에서 Cas12f1 단백질이 이량체(dimer)를 형성한다는 사실, 및 가이드 RNA의 구조에 대해 밝혔다. 상기 선행문헌에 의해 가이드 RNA 부분 중 Cas12f1 단백질과 직접적으로 상호작용하지 않는, 이른바 disordered region이 있다는 것이 밝혀졌으며, 또 다른 선행문헌(Xiao et al., Structural basis for the dimerization-dependent CRISPR-Cas12f nuclease, bioRxiv (2020))에서는 상기 disordered region 부분을 제거하고 in vitro에서 이중가닥 DNA 절단 효율을 살핀 바 있다. 하지만, 상기 선행 연구들은 1) 세포 내 유전자 편집 활성(예를 들어, indel 발생 효율)을 보이거나, 이를 높이는 방안에 대한 연구는 아니며, 2) disordered region을 제거한 실험은 있지만, 오히려 절단 활성이 낮아지는 것으로 보고되었으며, 3) 더욱이 상기 실험은 in vitro에서 진행한 실험으로, 세포 내 유전자 편집 활성을 나타내거나, 효율을 높이기 위한 변형이 무엇인지 밝히는데는 실패하였다.As a continuing study, in the recent prior literature (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)), Cas12f1 protein is a dimer in the CRISPR / Cas12f1 system. , and the structure of guide RNAs. It was found by the above literature that there is a so-called disordered region that does not directly interact with the Cas12f1 protein in the guide RNA portion, and another prior literature (Xiao et al., Structural basis for the dimerization-dependent CRISPR-Cas12f nuclease, In bioRxiv (2020)), the disordered region was removed and double-stranded DNA cleavage efficiency was examined in vitro. However, the preceding studies do not 1) show or increase intracellular gene editing activity (eg, indel generation efficiency), and 2) there are experiments that remove the disordered region, but rather the cleavage activity is low. 3) Moreover, as the above experiment was conducted in vitro, it failed to reveal what modifications were made to show intracellular gene editing activity or to increase efficiency.

따라서 여전히, CRISPR/Cas12f1 시스템의 세포 내 유전자 편집 활성을 높이는 것이 중요한 과제로 남아 있다.Therefore, it remains an important task to enhance the intracellular gene editing activity of the CRISPR/Cas12f1 system.

엔지니어링 된 Cas12f1 가이드 RNA의 특징Characterization of engineered Cas12f1 guide RNAs

엔지니어링 된 Cas12f1 가이드 RNA 개괄Engineered Cas12f1 Guide RNA Overview

본 명세서에서는 종래 기술의 한계점을 극복하여, CRISPR/Cas12f1 시스템의 세포 내 유전자 편집 활성을 높이기 위한, 엔지니어링 된 Cas12f1 가이드 RNA를 제공한다. 상기 엔지니어링 된 Cas12f1 가이드 RNA는 자연계에서 발견되는 가이드 RNA의 구조 일부를 변형한 것이다. 상기 엔지니어링 된 Cas12f1 가이드 RNA는 그 구성 중 Cas12f1 단백질과 상호작용하는 역할을 하는 스캐폴드 영역의 적어도 일부가 변형된 것을 특징으로 한다.The present specification provides an engineered Cas12f1 guide RNA for increasing the intracellular gene editing activity of the CRISPR/Cas12f1 system by overcoming the limitations of the prior art. The engineered Cas12f1 guide RNA is a modified guide RNA structure found in nature. The engineered Cas12f1 guide RNA is characterized in that at least a part of the scaffold region that interacts with the Cas12f1 protein is modified.

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 엔지니어링 된 스캐폴드 영역 및 스페이서를 포함할 수 있다. 이때, 상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 가이드 RNA의 스캐폴드 영역과는 다른 것을 특징으로 한다.In one embodiment, the engineered Cas12f1 guide RNA may include an engineered scaffold region and a spacer. At this time, the engineered scaffold region is characterized in that it is different from the scaffold region of guide RNA found in nature.

엔지니어링 된 Cas12f1 가이드 RNA의 특징 - 스캐폴드 영역의 한 군데 이상이 변형됨Characteristics of Engineered Cas12f1 Guide RNAs - Modified at least one site in the scaffold region

본 명세서에서 제공하는 엔지니어링 된 Cas12f1 가이드 RNA는 자연계에서 발견되는 가이드 RNA와 비교하여, 그 스캐폴드 부분의 일부가 변형된 것을 특징으로 한다. 스캐폴드 영역은 tracrRNA 및 crRNA의 일부를 포함하는 영역으로, Cas12f1 단백질과 상호작용하는 기능을 한다. 상기 스캐폴드 영역에 대해서는 이하 더 자세히 설명할 것이다.The engineered Cas12f1 guide RNA provided herein is characterized in that a part of its scaffold portion is modified compared to the guide RNA found in nature. The scaffold region is a region containing tracrRNA and a portion of crRNA, and functions to interact with the Cas12f1 protein. The scaffold region will be described in more detail below.

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 엔지니어링 된 스캐폴드 영역을 포함할 수 있다. 이때, 상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 가이드 RNA의 스캐폴드 영역이 변형된 것임을 특징으로 한다. 따라서, 상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 가이드 RNA의 스캐폴드 영역과는 상이한 서열을 가진다. 일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 가이드 RNA의 스캐폴드 영역 중 일부 영역이 제거된 것일 수 있다. 일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 가이드 RNA의 스캐폴드 영역에 포함된 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the engineered Cas12f1 guide RNA may include an engineered scaffold region. In this case, the engineered scaffold region is characterized in that the scaffold region of the guide RNA found in nature is modified. Therefore, the engineered scaffold region has a sequence different from that of the guide RNA found in nature. In one embodiment, the engineered scaffold region may be one in which some regions of the scaffold region of guide RNA found in nature have been removed. In one embodiment, the engineered scaffold region may have one or more nucleotides removed from the scaffold region of guide RNA found in nature.

엔지니어링 된 Cas12f1 가이드 RNA의 효과Effect of engineered Cas12f1 guide RNA

본 명세서에서 제공하는 엔지니어링 된 Cas12f1 가이드 RNA를 CRISPR/Cas12f1 시스템에 사용하는 경우, 자연계에서 발견되는 가이드 RNA를 사용하는 경우에 비해 세포 내에서 유전자 편집 활성이 극적으로 향상되는 효과가 나타난다. 본 발명자들은 실험을 통해, 자연계에서 발견되는 가이드 RNA에 어떤 구성을 추가해야 하는지, 혹은 그 스캐폴드 영역에 어떤 변형을 가해야 유전자 편집 효율이 향상되는지 상세하게 밝혔다. 상기 엔지니어링 된 Cas12f1 가이드 RNA를 사용하면 전술한 종래 기술의 한계점을 극복하여 세포 내에서 높은 효율로 유전자를 편집할 수 있다. 또한, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 자연계에서 발견되는 가이드 RNA와 길이가 동등하거나, 더 짧은 길이를 가져 유전자 편집 기술 분야에서 응용 가능성이 높다. 상기 엔지니어링 된 Cas12f1 가이드 RNA를 사용하면 CRISPR/Cas12f1 시스템의 장점 (예를 들어, 크기가 매우 작다는 장점)을 충분히 유전자 편집 기술에 활용할 수 있게 된다.When the engineered Cas12f1 guide RNA provided herein is used in the CRISPR/Cas12f1 system, the gene editing activity in cells is dramatically improved compared to when the guide RNA found in nature is used. The present inventors have revealed in detail what composition should be added to the guide RNA found in nature, or what kind of modification should be applied to the scaffold region to improve gene editing efficiency through experiments. By using the engineered Cas12f1 guide RNA, it is possible to edit genes with high efficiency in cells by overcoming the limitations of the prior art. In addition, the engineered Cas12f1 guide RNA has a length equal to or shorter than that of a guide RNA found in nature, so it has high application potential in the field of gene editing technology. If the engineered Cas12f1 guide RNA is used, the advantages of the CRISPR/Cas12f1 system (eg, the advantage of very small size) can be fully utilized in gene editing technology.

엔지니어링 된 Cas12f1 가이드 RNA의 용도Uses of engineered Cas12f1 guide RNAs

본 명세서에서 제공하는 엔지니어링 된 Cas12f1 가이드 RNA는 Cas12f1 단백질과 함께 유전자 편집 및/또는 유전자 치료제 용도로 사용될 수 있다. 또한, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 유전자 편집용 조성물을 제조하기 위한 용도로 사용할 수 있다.The engineered Cas12f1 guide RNA provided herein can be used for gene editing and/or gene therapy together with Cas12f1 protein. In addition, the engineered Cas12f1 guide RNA can be used for preparing a composition for gene editing.

용어의 설명 - Cas12f1 가이드 RNA의 부분을 지칭하는 용어Explanation of Terms - Terms referring to the portion of the Cas12f1 guide RNA

Cas12f1 가이드 RNA의 부분 - 개괄Part of the Cas12f1 guide RNA - an overview

자연계에서 발견되는 Cas12f1 가이드 RNA는 tracrRNA 및 crRNA로 나뉘며, 상기 crRNA는 다시 crRNA 반복 서열 부분 및 스페이서로 나눌 수 있다는 것이 통상의 기술자에게 잘 알려져 있다.It is well known to those skilled in the art that Cas12f1 guide RNA found in nature is divided into tracrRNA and crRNA, and the crRNA can be further divided into a crRNA repeat sequence portion and a spacer.

상기 기준과는 별개로, 본 명세서에서는 상기 Cas12f1 가이드 RNA 중 Cas12f1 단백질과 상호작용하는 부분을 통틀어 스캐폴드 영역이라 지칭한다. 상기 스캐폴드 영역은 tracrRNA 및 crRNA의 일부를 포함하며, 반드시 한 분자의 RNA를 지칭하는 것은 아니다. 상기 스캐폴드 영역은 다시 제1 영역, 제2 영역, 제3 영역, 제4 영역, 제5 영역, 및 제6 영역으로 세분화될 수 있다. 상기 세분화된 영역을 tracrRNA, crRNA 상에서 서술하면, 상기 제1 영역 내지 제4 영역은 tracrRNA에 포함되고, 상기 제5 영역 내지 상기 제6 영역은 crRNA, 구체적으로 crRNA 반복 서열 부분에 포함된다.Apart from the above criteria, in the present specification, the portion of the Cas12f1 guide RNA that interacts with the Cas12f1 protein is collectively referred to as a scaffold region. The scaffold region includes tracrRNA and a portion of crRNA, but does not necessarily refer to one molecule of RNA. The scaffold region may be subdivided into a first region, a second region, a third region, a fourth region, a fifth region, and a sixth region. When the subdivided region is described on tracrRNA or crRNA, the first to fourth regions are included in tracrRNA, and the fifth to sixth regions are included in crRNA, specifically crRNA repeat sequence.

이하 서술되는 "제n 영역", 또는 "자연계에서 발견되는 제n 영역" (n은 1 이상 6이하의 정수)은 기본적으로 자연계에서 발견되는 Cas12f1 가이드 RNA의 각 부분을 지칭한다. 엔지니어링 된 Cas12f1 가이드 RNA 내 상기 분류 기준과 대응되는 영역은 일반적으로 "변형된 제n 영역" 내지 "엔지니어링 된 스캐폴드 영역의 제n 영역"으로 서술된다.The "n-th region" or "n-th region found in nature" (n is an integer greater than or equal to 1 and less than or equal to 6) described below basically refers to each part of the Cas12f1 guide RNA found in nature. The region corresponding to the above classification criteria in the engineered Cas12f1 guide RNA is generally described as “modified n-th region” to “n-th region of engineered scaffold region”.

다만, 엔지니어링 된 스캐폴드 영역에 포함된 제n 영역일지라도, 달리 변형되지 않아 자연계에서 발견되는 제n 영역과 동일한 경우가 있을 수 있는데, 그 때에 한해 "제n 영역"이라는 용어는 혼용될 수 있다. 이때, 상기 "제n 영역"이 지칭하는 대상(예를 들어, 엔지니어링 된 Cas12f1 가이드 RNA에 포함된 영역인지, 자연계에서 발견되는 가이드 RNA에 포함된 영역인지 여부)은 문맥에 따라 적절히 해석되어야 한다.However, even in the n-th region included in the engineered scaffold region, there may be cases where it is not otherwise deformed and is the same as the n-th region found in nature. Only in that case, the term “n-th region” may be used interchangeably. In this case, the target referred to by the "n-th region" (eg, whether it is a region included in an engineered Cas12f1 guide RNA or a region included in a guide RNA found in nature) should be appropriately interpreted according to the context.

tracrRNA, crRNAtracrRNA, crRNA

본 명세서에서 "tracrRNA", "crRNA"라고 쓰는 경우, CRISPR/Cas 기술 분야에서 통상의 기술자가 인식할 수 있는 의미를 모두 포함한다. 이는, 자연계에서 발견되는 듀얼 가이드 RNA의 각 분자를 지칭하는 용어로 사용되는 것이 일반적이지만, 상기 tracrRNA 및 crRNA를 링커로 연결한 싱글 가이드 RNA의 각 해당 부분을 지칭하는데도 사용될 수 있다. 달리 서술하지 않는 한, tracrRNA 및 crRNA라고만 기재하는 경우 CRISPR/Cas12f1 시스템을 구성하는 tracrRNA 및 crRNA를 의미한다.When used as "tracrRNA" or "crRNA" in the present specification, it includes all meanings that can be recognized by those skilled in the art of CRISPR/Cas. This is generally used as a term to refer to each molecule of dual guide RNA found in nature, but it can also be used to refer to each corresponding portion of a single guide RNA in which the tracrRNA and crRNA are linked by a linker. Unless otherwise specified, when only tracrRNA and crRNA are described, it means tracrRNA and crRNA constituting the CRISPR/Cas12f1 system.

일 구현예로, tracrRNA의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3' (서열번호 1) 또는 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAA-3' (서열번호 2)일 수 있다. 일 구현예로, 상기 tracrRNA는 제1 영역, 제2 영역, 제3 영역, 및 제4 영역을 포함한다. 일 구현예로, 상기 tracrRNA는 5'말단에서 3'말단 방향으로, 제1 영역, 제2 영역, 제3 영역, 및 제4 영역이 순서대로 연결된 것이다.일 구현예로, tracrRNA의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3' (서열번호 1) 또는 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAA-3' (서열번호 2)일 수 있다. In one embodiment, the tracrRNA comprises a first region, a second region, a third region, and a fourth region. In one embodiment, the tracrRNA is one in which the first region, the second region, the third region, and the fourth region are sequentially linked from the 5' end to the 3' end.

일 구현예로, crRNA의 서열은 crRNA 반복 서열 및 스페이서 서열을 포함한다. 이때, 상기 crRNA 반복 서열은 5'-GAAUGAAGGAAUGCAAC-3' (서열번호 3) 또는 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3' (서열번호 4일) 수 있다. 상기 crRNA 반복 서열은 제5 영역, 및 제6 영역을 포함한다. 상기 스페이서 서열은 표적 서열에 따라 달라질 수 있으며, 일반적으로 10 내지 50개의 뉴클레오타이드를 포함한다. 일 구현예로, 상기 crRNA는 5'말단에서 3'말단 방향으로, 제5 영역, 제6 영역, 및 스페이서가 순서대로 연결된 것이다.In one embodiment, the sequence of crRNA comprises a crRNA repeat sequence and a spacer sequence. In this case, the crRNA repeat sequence may be 5'-GAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 3) or 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 4). The crRNA repeat sequence comprises a fifth region and a sixth region. The spacer sequence may vary depending on the target sequence and generally comprises 10 to 50 nucleotides. In one embodiment, the crRNA is one in which a fifth region, a sixth region, and a spacer are sequentially linked in a direction from the 5' end to the 3' end.

스캐폴드 영역 - 개괄Scaffold Area - Overview

본 명세서에서 "스캐폴드 영역"이라고 쓰는 경우, 자연계에서 발견되는 가이드 RNA의 부분 중 스페이서를 제외한 나머지 부분을 통틀어 지칭한다. 구체적으로, 상기 스캐폴드 영역은 tracrRNA, 및 crRNA의 일부를 포함한다. 구체적으로, 상기 crRNA의 일부는 crRNA 반복 서열 부분일 수 있다. 상기 스캐폴드 영역은 일반적으로 Cas 단백질과 상호작용할 수 있는 부분으로 알려져있다. 본 명세서에서는 상기 스캐폴드 영역을 제1 영역 내지 제6 영역으로 나누어 서술하며, 각 영역에 대해서는 이하 더 자세히 설명한다.When used herein as a “scaffold region”, it refers to the rest of the guide RNAs found in nature except for the spacer. Specifically, the scaffold region includes tracrRNA, and a portion of crRNA. Specifically, a portion of the crRNA may be a portion of a crRNA repeat sequence. The scaffold region is generally known as a portion capable of interacting with a Cas protein. In the present specification, the scaffold region is divided into first to sixth regions, and each region will be described in more detail below.

스캐폴드 영역 1 - 제1 영역Scaffold region 1 - first region

본 명세서에서 "제1 영역"이라고 쓰는 경우, tracrRNA의 5'말단을 포함하는 영역을 지칭한다. 상기 제1 영역은 CRISPR/Cas12f1 복합체 내에서 Stem 구조를 형성하는 뉴클레오타이드를 포함하고, 이와 인접한 뉴클레오타이드를 포함할 수 있다.When used herein as a "first region", it refers to a region including the 5' end of tracrRNA. The first region may include nucleotides forming a Stem structure in the CRISPR/Cas12f1 complex, and may include nucleotides adjacent thereto.

상기 제1 영역은 Stem 1 부분(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))을 포함한다. 상기 제1 영역은 상기 Stem 1 부분과 인접한 하나 이상의 뉴클레오타이드를 포함할 수 있다.The first region includes a Stem 1 moiety (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)). The first region may include one or more nucleotides adjacent to the Stem 1 portion.

상기 제1 영역은 CRISPR/Cas12f1 복합체에서, Cas12f1 단백질과 상호작용하지 않는 disordered region을 포함한다.The first region comprises a disordered region in the CRISPR/Cas12f1 complex that does not interact with the Cas12f1 protein.

일 구현예로, 상기 제1 영역은 서열번호 1 또는 2로 표현되는 tracrRNA의 5'말단으로부터 1번째 뉴클레오타이드부터 11번째 뉴클레오타이드까지를 의미할 수 있다. 일 구현예로, 상기 제1 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAA-3' (서열번호 9)일 수 있다.In one embodiment, the first region may mean from the 1st nucleotide to the 11th nucleotide from the 5' end of the tracrRNA represented by SEQ ID NO: 1 or 2. In one embodiment, the sequence of the first region may be 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9).

스캐폴드 영역 2 - 제2 영역Scaffold region 2 - second region

본 명세서에서 "제2 영역"이라고 쓰는 경우, tracrRNA 내 상기 제1 영역의 3'말단 방향에 위치한 영역을 지칭한다. 상기 제2 영역은 CRISPR/Cas12f1 복합체 내에서 Stem 구조를 형성하는 뉴클레오타이드를 포함하고, 이와 인접한 뉴클레오타이드를 포함할 수 있다. 이때, 상기 Stem 구조는 상기 제1 영역에 포함된 Stem과는 다른 것이다.When used herein as a "second region", it refers to a region located in the 3'-end direction of the first region in tracrRNA. The second region may include nucleotides forming a Stem structure in the CRISPR/Cas12f1 complex, and may include nucleotides adjacent thereto. In this case, the stem structure is different from the stem included in the first region.

상기 제2 영역은 Stem 2 부분(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))을 포함한다. 상기 제2 영역은 상기 Stem 2 부분과 인접한 하나 이상의 뉴클레오타이드를 포함할 수 있다.The second region includes a Stem 2 moiety (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)). The second region may include one or more nucleotides adjacent to the Stem 2 portion.

상기 제2 영역은 CRISPR/Cas12f1 복합체에서, 이량체를 이루는 하나의 Cas12f1 단백질의 RuvC 도메인, 및/또는 이량체 이루는 다른 하나의 Cas12f1 단백질의 RuvC 도메인과 상호작용하는 하나 이상의 뉴클레오타이드를 포함할 수 있다. 상기 제2 영역은 CRISPR/Cas12f1 복합체에서, Cas12f1 단백질과 상호작용하지 않는 disordered region을 포함한다.The second region may include one or more nucleotides that interact with the RuvC domain of one Cas12f1 protein forming a dimer and/or the RuvC domain of another Cas12f1 protein forming a dimer in the CRISPR/Cas12f1 complex. The second region comprises a disordered region in the CRISPR/Cas12f1 complex that does not interact with the Cas12f1 protein.

일 구현예로, 상기 제2 영역은 서열번호 1 또는 2로 표현되는 tracrRNA의 5'말단으로부터 22번째 뉴클레오타이드부터 71번째 뉴클레오타이드까지를 의미할 수 있다. 일 구현예로, 상기 제2 영역의 서열은 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUG-3' (서열번호 10일) 수 있다.In one embodiment, the second region may mean from the 22nd nucleotide to the 71st nucleotide from the 5' end of the tracrRNA represented by SEQ ID NO: 1 or 2. In one embodiment, the sequence of the second region may be 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUG-3' (SEQ ID NO: 10).

스캐폴드 영역 3 - 제3 영역Scaffold Region 3 - Third Region

본 명세서에서 "제3 영역"이라고 쓰는 경우, tracrRNA 내 상기 제2 영역의 3'말단 방향에 위치한 영역을 지칭한다. 상기 제3 영역은 CRISPR/Cas12f1 복합체 내에서 Stem 구조를 형성하는 뉴클레오타이드, 및 crRNA에 포함된 일부 뉴클레오타이드와 상보적인 결합을 형성하고 있는 뉴클레오타이드를 포함하고, 이와 인접한 뉴클레오타이드를 포함할 수 있다.When used herein as a "third region", it refers to a region located in the 3'-end direction of the second region in tracrRNA. The third region may include nucleotides forming a stem structure in the CRISPR/Cas12f1 complex and nucleotides forming a complementary bond with some nucleotides included in crRNA, and may include nucleotides adjacent thereto.

상기 제3 영역은 Stem 4 부분(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)) 및 Stem 3-PK (R:AR-1) 부분 중 tracrRNA에 속한 뉴클레오타이드(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))를 포함한다. 상기 제3 영역은 상기 Stem 4 부분 및/또는 Stem 3-PK (R:AR-1) 부분 중 tracrRNA에 속한 뉴클레오타이드와 인접한 하나 이상의 뉴클레오타이드를 포함할 수 있다.The third region consists of a Stem 4 portion (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)) and a Stem 3-PK (R:AR-1) portion and nucleotides belonging to the heavy tracrRNA (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)). The third region may include one or more nucleotides adjacent to nucleotides belonging to tracrRNA among the Stem 4 portion and/or the Stem 3-PK (R:AR-1) portion.

상기 제3 영역은 CRISPR/Cas12f1 복합체에서, 이량체를 이루는 하나의 Cas12f1 단백질의 WED 도메인, 및/또는 RuvC 도메인과 상호작용하는 하나 이상의 뉴클레오타이드를 포함한다. 이때, 상기 뉴클레오타이드는 Stem 3-PK (R:AR-1) 부분 중 tracrRNA에 속한 뉴클레오타이드일 수 있다.The third region comprises one or more nucleotides that interact with the WED domain and/or the RuvC domain of one Cas12f1 protein forming a dimer in the CRISPR/Cas12f1 complex. In this case, the nucleotide may be a nucleotide belonging to tracrRNA in the Stem 3-PK (R:AR-1) portion.

상기 제3 영역은 CRISPR/Cas12f1 복합체에서, 이량체를 이루는 하나의 Cas12f1 단백질의 RuvC 도메인 및/또는 이량체를 이루는 다른 하나의 Cas12f1 단백질의 REC 도메인과 상호작용하는 하나 이상의 뉴클레오타이드를 포함한다. 이때, 상기 뉴클레오타이드는 Stem 4 부분에 포함된 뉴클레오타이드일 수 있다.The third region includes one or more nucleotides that interact with the RuvC domain of one Cas12f1 protein forming a dimer and/or the REC domain of another Cas12f1 protein forming a dimer in the CRISPR/Cas12f1 complex. In this case, the nucleotide may be a nucleotide included in the Stem 4 portion.

상기 제3 영역은 crRNA의 제6 영역에 포함된 하나 이상의 뉴클레오타이드와 상보적으로 결합하는 하나 이상의 뉴클레오타이드를 포함한다.The third region comprises one or more nucleotides complementary to one or more nucleotides included in the sixth region of the crRNA.

일 구현예로, 상기 제3 영역은 서열번호 1 또는 2로 표현되는 tracrRNA의 5'말단으로부터 72번째 뉴클레오타이드부터 127번째 뉴클레오타이드까지를 의미할 수 있다. 일 구현예로, 상기 제3 영역의 서열은 5'-GGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (서열번호 11일) 수 있다.In one embodiment, the third region may mean from the 72nd nucleotide to the 127th nucleotide from the 5' end of the tracrRNA represented by SEQ ID NO: 1 or 2. In one embodiment, the sequence of the third region may be 5'-GGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11).

스캐폴드 영역 4 - 제4 영역Scaffold Region 4 - Fourth Region

본 명세서에서 "제4 영역"이라고 쓰는 경우, tracrRNA의 제3 영역의 3'말단 방향에 위치한 영역을 지칭한다. 상기 제4 영역은 CRISPR/Cas12f1 복합체 내에서 crRNA에 포함된 일부 뉴클레오타이드와 상보적인 결합을 형성할 수 있는 뉴클레오타이드를 포함하고, 이와 인접한 뉴클레오타이드를 포함할 수 있다.When used herein as a "fourth region", it refers to a region located at the 3' end of the third region of tracrRNA. The fourth region may include nucleotides capable of forming a complementary bond with some nucleotides included in crRNA in the CRISPR/Cas12f1 complex, and may include nucleotides adjacent thereto.

상기 제4 영역은 Stem 5 (R:AR-2) 중 tracrRNA에 속한 뉴클레오타이드를 포함한다(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)). 상기 제4 영역은 상기 Stem 5 (R:AR-2) 중 tracrRNA에 속한 뉴클레오타이드와 인접한 하나 이상의 뉴클레오타이드를 포함할 수 있다.The fourth region includes a nucleotide belonging to tracrRNA in Stem 5 (R:AR-2) (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)) ). The fourth region may include one or more nucleotides adjacent to a nucleotide belonging to tracrRNA in Stem 5 (R:AR-2).

상기 제4 영역은 CRISPR/Cas12f1 복합체에서, 이량체를 이루는 하나의 Cas12f1 단백질의 WED 도메인, 및/또는 ZF 도메인과 상호작용하는 하나 이상의 뉴클레오타이드를 포함할 수 있다. 이때 상기 뉴클레오타이드는 Stem 5 (R:AR-2) 중 tracrRNA에 속한 뉴클레오타이드일 수 있다.The fourth region may include one or more nucleotides that interact with the WED domain and/or the ZF domain of one Cas12f1 protein forming a dimer in the CRISPR/Cas12f1 complex. In this case, the nucleotide may be a nucleotide belonging to tracrRNA in Stem 5 (R:AR-2).

상기 제4 영역은 crRNA의 제5 영역에 포함된 하나 이상의 뉴클레오타이드와 상보적으로 결합하는 하나 이상의 뉴클레오타이드를 포함할 수 있다. 상기 제4 영역은 CRISPR/Cas12f1 복합체에서, Cas12f1 단백질과 상호작용하지 않는 disordered region을 포함한다.The fourth region may include one or more nucleotides complementary to one or more nucleotides included in the fifth region of the crRNA. The fourth region comprises a disordered region in the CRISPR/Cas12f1 complex that does not interact with the Cas12f1 protein.

일 구현예로, 상기 제4 영역은 서열번호 1으로 표현되는 tracrRNA의 5'말단으로부터 128번째 뉴클레오타이드부터 140번째 뉴클레오타이드까지를 의미할 수 있다. 일 구현예로, 상기 제4 영역은 서열번호 2으로 표현되는 tracrRNA의 5'말단으로부터 128번째 뉴클레오타이드부터 162번째 뉴클레오타이드까지를 의미할 수 있다.In one embodiment, the fourth region may mean from the 128th nucleotide to the 140th nucleotide from the 5' end of the tracrRNA represented by SEQ ID NO: 1. In one embodiment, the fourth region may mean from the 128th nucleotide to the 162th nucleotide from the 5' end of the tracrRNA represented by SEQ ID NO: 2.

일 구현예로, 상기 제4 영역의 서열은 5'-AACAAAUUCAUUU-3' (서열번호 12) 또는 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAA-3' (서열번호 13일) 수 있다.In one embodiment, the sequence of the fourth region may be 5'-AACAAAUUCAUUU-3' (SEQ ID NO: 12) or 5'-AACAAAUUCAUUUUUCCUCUCCCAAUUCUGCACAA-3' (SEQ ID NO: 13).

스캐폴드 영역 5 - 제5 영역Scaffold Region 5 - Fifth Region

본 명세서에서 "제5 영역"이라고 쓰는 경우, crRNA 5'말단을 포함하는 영역을 지칭한다. 상기 제5 영역은 CRISPR/Cas12f1 복합체 내에서 상기 제4 영역의 하나 이상의 뉴클레오타이드와 상보적인 결합을 형성하는 뉴클레오타이드를 포함하며, 이와 인접한 뉴클레오타이드를 포함할 수 있다.When used herein as a "fifth region", it refers to a region including the crRNA 5' end. The fifth region includes nucleotides that form a complementary bond with one or more nucleotides of the fourth region in the CRISPR/Cas12f1 complex, and may include nucleotides adjacent thereto.

상기 제5 영역은 Stem 5 (R:AR-2) 부분 중 crRNA에 속한 뉴클레오타이드를 포함한다(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)). 상기 제5 영역은 상기 Stem 5 (R:AR-2) 부분 중 crRNA에 속한 뉴클레오타이드와 인접한 하나 이상의 뉴클레오타이드를 포함할 수 있다.The fifth region includes a nucleotide belonging to the crRNA of the Stem 5 (R:AR-2) portion (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021) )). The fifth region may include one or more nucleotides adjacent to a nucleotide belonging to the crRNA of the Stem 5 (R:AR-2) portion.

상기 제5 영역은 CRISPR/Cas12f1 복합체에서, 이량체를 이루는 하나의 Cas12f1 단백질의 WED 도메인, REC 도메인 및/또는 ZF 도메인과 상호작용하는 하나 이상의 뉴클레오타이드를 포함할 수 있다. 이때 상기 뉴클레오타이드는 Stem 5 (R:AR-2) 중 crRNA에 속한 뉴클레오타이드일 수 있다.The fifth region may include one or more nucleotides that interact with the WED domain, the REC domain and/or the ZF domain of one Cas12f1 protein constituting a dimer in the CRISPR/Cas12f1 complex. In this case, the nucleotide may be a nucleotide belonging to crRNA in Stem 5 (R:AR-2).

상기 제5 영역은 상기 제4 영역에 포함된 하나 이상의 뉴클레오타이드와 상보적으로 결합하는 하나 이상의 뉴클레오타이드를 포함할 수 있다. 상기 제5 영역은 CRISPR/Cas12f1 복합체에서, Cas12f1 단백질과 상호작용하지 않는 disordered region을 포함한다.The fifth region may include one or more nucleotides complementary to one or more nucleotides included in the fourth region. The fifth region comprises a disordered region in the CRISPR/Cas12f1 complex that does not interact with the Cas12f1 protein.

일 구현예로, 상기 제5 영역은 서열번호 3로 표현되는 crRNA의 5'말단으로부터 1번째 뉴클레오타이드부터 10번째 뉴클레오타이드까지를 의미할 수 있다. 일 구현예로, 상기 제5 영역은 서열번호 4로 표현되는 crRNA의 5'말단으로부터 1번째 뉴클레오타이드부터 30번째 뉴클레오타이드까지를 의미할 수 있다. 일 구현예로, 상기 제5 영역의 서열은 5'-GAAUGAAGGA-3' (서열번호 14) 또는 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (서열번호 15)일 수 있다.In one embodiment, the fifth region may mean from the 1st nucleotide to the 10th nucleotide from the 5' end of the crRNA represented by SEQ ID NO: 3. In one embodiment, the fifth region may mean from the 1st nucleotide to the 30th nucleotide from the 5' end of the crRNA represented by SEQ ID NO: 4. In one embodiment, the sequence of the fifth region may be 5'-GAAUGAAGGA-3' (SEQ ID NO: 14) or 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 15).

스캐폴드 영역 6 - 제6 영역Scaffold Region 6 - Sixth Region

본 명세서에서 "제6 영역"이라고 쓰는 경우, crRNA 내 상기 제5 영역의 3'말단 방향에 위치한 영역을 지칭한다. 상기 제6 영역은 CRISPR/Cas12f1 복합체 내에서 상기 제3 영역의 하나 이상의 뉴클레오타이드와 상보적인 결합을 형성하는 뉴클레오타이드를 포함하며, 이와 인접한 뉴클레오타이드를 포함할 수 있다.When used herein as a "sixth region", it refers to a region located in the 3'-end direction of the fifth region in crRNA. The sixth region includes nucleotides that form a complementary bond with one or more nucleotides of the third region in the CRISPR/Cas12f1 complex, and may include nucleotides adjacent thereto.

상기 제6 영역은 PK (R:AR-1) 부분 중 crRNA에 속한 뉴클레오타이드를 포함한다(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)). 상기 제6 영역은 상기 PK (R:AR-1) 부분 중 crRNA에 속한 뉴클레오타이드와 인접한 하나 이상의 뉴클레오타이드를 포함할 수 있다.The sixth region includes a nucleotide belonging to crRNA in the PK (R:AR-1) portion (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)) ). The sixth region may include one or more nucleotides adjacent to a nucleotide belonging to the crRNA of the PK (R:AR-1) portion.

상기 제6 영역은 CRISPR/Cas12f1 복합체에서, 이량체를 이루는 하나의 Cas12f1 단백질의 WED 도메인, ZF 도메인 및/또는 RuvC 도메인과 상호작용하는 하나 이상의 뉴클레오타이드를 포함한다. 이때, 상기 뉴클레오타이드는 Stem 3-PK (R:AR-1) 부분 중 crRNA에 속한 뉴클레오타이드일 수 있다.The sixth region includes one or more nucleotides that interact with the WED domain, ZF domain and/or RuvC domain of one Cas12f1 protein constituting a dimer in the CRISPR/Cas12f1 complex. In this case, the nucleotide may be a nucleotide belonging to crRNA in the Stem 3-PK (R:AR-1) portion.

일 구현예로, 상기 제6 영역은 서열번호 3로 표현되는 crRNA의 5'말단으로부터 11번째 뉴클레오타이드부터 17번째 뉴클레오타이드까지를 의미할 수 있다. 일 구현예로, 상기 제6 영역은 서열번호 4로 표현되는 crRNA의 5'말단으로부터 31번째 뉴클레오타이드부터 37번째 뉴클레오타이드까지를 의미할 수 있다. 일 구현예로, 상기 제6 영역의 서열은 5'-AUGCAAC-3'일 수 있다.In one embodiment, the sixth region may mean from the 11th nucleotide to the 17th nucleotide from the 5' end of the crRNA represented by SEQ ID NO: 3. In one embodiment, the sixth region may mean from the 31st nucleotide to the 37th nucleotide from the 5' end of the crRNA represented by SEQ ID NO: 4. In one embodiment, the sequence of the sixth region may be 5'-AUGCAAC-3'.

스페이서spacer

본 명세서에서 "스페이서"라 쓰는 경우, CRISPR/Cas12f1 시스템에서 표적 서열 부분과 혼성화되는 하나 이상의 뉴클레오타이드를 지칭한다. 상기 스페이서는, CRISPR/Cas12f1 시스템에서 가이드 RNA의 crRNA의 3'말단 부근의 10개 내지 50개의 연속된 뉴클레오타이드를 지칭한다. 상기 스페이서는 CRISPR/Cas12f1 시스템을 사용하여 편집하고자 하는 표적 핵산의 표적 서열에 대응하여 설계된다. 달리 표현하면, 상기 스페이서는 표적 핵산의 표적 서열에 따라 다양한 서열을 가질 수 있다.When used herein, "spacer" refers to one or more nucleotides that hybridize with a portion of a target sequence in the CRISPR/Cas12f1 system. The spacer refers to 10 to 50 consecutive nucleotides near the 3' end of the crRNA of the guide RNA in the CRISPR/Cas12f1 system. The spacer is designed to correspond to the target sequence of the target nucleic acid to be edited using the CRISPR/Cas12f1 system. In other words, the spacer may have various sequences depending on the target sequence of the target nucleic acid.

엔지니어링 된 스캐폴드 영역 - 개괄Engineered Scaffold Area - Overview

엔지니어링 된 스캐폴드 영역 개괄Engineered Scaffold Area Overview

본 명세서에서는 CRISPR/Cas12f1 시스템의 유전자 편집 효율 향상을 위해 도입할 수 있는 엔지니어링 된 스캐폴드 영역을 제공한다. 상기 엔지니어링 된 스캐폴드 영역은 엔지니어링 된 Cas12f1 가이드 RNA가 사용된 CRISPR/Cas12f1 시스템의 유전자 편집 효율을 향상시킨다. 상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 Cas12f1 가이드 RNA의 스캐폴드 영역(이하, 자연계에서 발견되는 스캐폴드 영역)에서 한 군데 이상이 변형되어, 이와는 다른 서열 및/또는 구조를 가지게 된 것이 특징이다.Herein, an engineered scaffold region that can be introduced to improve the gene editing efficiency of the CRISPR/Cas12f1 system is provided. The engineered scaffold region improves the gene editing efficiency of the CRISPR/Cas12f1 system in which the engineered Cas12f1 guide RNA is used. The engineered scaffold region is characterized in that it has a sequence and/or structure different from that of the scaffold region of Cas12f1 guide RNA found in nature (hereinafter, the scaffold region found in nature) is modified in one or more places. .

이때, 상기 엔지니어링 된 스캐폴드 영역의 기능은 자연계에서 발견되는 스캐폴드 영역과 동일하거나, 유사한 기능을 가진다. 구체적으로, 상기 엔지니어링 된 스캐폴드 영역은 CRISPR/Cas12f1 복합체의 Cas12f1 단백질 이량체와 상호작용하는 기능을 가진다.In this case, the function of the engineered scaffold region is the same as or has a function similar to that of the scaffold region found in nature. Specifically, the engineered scaffold region has the function of interacting with the Cas12f1 protein dimer of the CRISPR/Cas12f1 complex.

상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 스캐폴드 영역의 각 부분에 대응하는 영역을 포함한다. 구체적으로, 상기 엔지니어링 된 스캐폴드 영역은 제1 영역, 제2 영역, 제3 영역, 제4 영역, 제5 영역, 및 제6 영역을 포함하며, 이는 자연계에서 발견되는 스캐폴드 영역에 포함된 제1 영역 내지 제6 영역에 각각 대응된다.The engineered scaffold region includes regions corresponding to each portion of the scaffold region found in nature. Specifically, the engineered scaffold region includes a first region, a second region, a third region, a fourth region, a fifth region, and a sixth region, which is the first region contained in the scaffold region found in nature. Each of the first to sixth areas corresponds to each other.

싱글 가이드 RNA를 만들기 위한 변형Modifications to make single guide RNAs

본 명세서에서 제공하는 엔지니어링 된 Cas12f1 가이드 RNA는 한 분자의 싱글 가이드 RNA일 수 있다. 따라서, 본 명세서에서 제공하는 엔지니어링 된 스캐폴드 영역은 각 영역 중 하나 이상이 변형된 것이고, 추가적으로 tracrRNA 제4 영역의 3'말단 및 crRNA 제5 영역의 5' 말단이 링커를 통해 연결된 것일 수 있다.The engineered Cas12f1 guide RNA provided herein may be a single guide RNA molecule. Accordingly, the engineered scaffold region provided herein may be one in which one or more of each region is modified, and additionally, the 3' end of the tracrRNA fourth region and the 5' end of the crRNA fifth region may be linked through a linker.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 스캐폴드 영역에서 한 군데 이상이 변형되고, 상기 제4 영역의 3'말단 및 상기 제5 영역의 5'말단이 링커를 통해 연결된 것일 수 있다. 이때, 상기 링커는 5'-GAAA-3'일 수 있다.In one embodiment, the engineered scaffold region is one or more modified in a scaffold region found in nature, and the 3' end of the fourth region and the 5' end of the fifth region are linked through a linker. can In this case, the linker may be 5'-GAAA-3'.

엔지니어링 된 스캐폴드 영역 1 - 제1 영역 변형Engineered Scaffold Area 1 - First Area Variant

제1 영역 변형 개괄Overview of the first region transformation

본 명세서에서 제공하는 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 스캐폴드 영역에서 제1 영역이 변형된 것일 수 있다.The engineered scaffold region provided herein may be a first region modified from a scaffold region found in nature.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제1 영역을 포함할 수 있다. 이때, 상기 변형된 제1 영역은 자연계에서 발견되는 스캐폴드 영역의 제1 영역에서 하나 이상의 뉴클레오타이드가 제거된 것이다. 이때, 상기 제거된 뉴클레오타이드는 CRISPR/Cas12f1 복합체에서 Stem 구조를 형성하는 영역에서 선택된 뉴클레오타이드이다.In one embodiment, the engineered scaffold region may include a deformed first region. In this case, the modified first region has one or more nucleotides removed from the first region of the scaffold region found in nature. In this case, the removed nucleotide is a nucleotide selected from the region forming the Stem structure in the CRISPR/Cas12f1 complex.

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA에 포함된 엔지니어링 된 스캐폴드 영역은, 자연계에서 발견되는 스캐폴드 영역 중 제1 영역에 포함된 하나 이상의 뉴클레오타이드가 제거된 것을 포함할 수 있다. 일 구현예로, 상기 제거된 뉴클레오타이드는 상기 자연계에서 발견되는 제1 영역 중 CRISPR/Cas12f1 복합체에서 Stem 구조를 형성 부분에 포함된 뉴클레오타이드일 수 있다. 일 구현예로, 상기 제거된 뉴클레오타이드는 상기 자연계에서 발견되는 제1 영역 중 Stem 1 (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))에 속하는 뉴클레오타이드일 수 있다. 일 구현예로, 상기 제거된 뉴클레오타이드는 상기 자연계에서 발견되는 제1 영역 중 CRISPR/Cas12f1 복합체에서 Cas12f1 단백질과 상호작용하지 않는 뉴클레오타이드일 수 있다.In one embodiment, the engineered scaffold region included in the engineered Cas12f1 guide RNA may include one in which one or more nucleotides included in the first region among scaffold regions found in nature have been removed. In one embodiment, the removed nucleotide may be a nucleotide included in a portion forming a stem structure in the CRISPR/Cas12f1 complex among the first regions found in nature. In one embodiment, the removed nucleotide is Stem 1 (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)) in the first region found in nature. It may be a nucleotide belonging to In one embodiment, the removed nucleotide may be a nucleotide that does not interact with the Cas12f1 protein in the CRISPR/Cas12f1 complex among the first regions found in nature.

일 구현예로, 상기 변형된 제1 영역은 3'말단 방향에 5'-A-3' 서열을 포함하는 것을 특징으로 한다.In one embodiment, the modified first region is characterized in that it comprises a 5'-A-3' sequence in the 3'-terminal direction.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역을 자연계에서 발견되는 스캐폴드 영역 중 제1 영역이 제거된 것일 수 있다. 달리 표현해, 상기 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 스캐폴드 영역 중 제1 영역과 대응되는 영역을 포함하지 않을 수 있다.In one embodiment, the engineered scaffold region may be one in which a first region among scaffold regions found in nature has been removed. In other words, the engineered scaffold region may not include a region corresponding to the first region among scaffold regions found in nature.

제1 영역 변형 내용 - 일부 뉴클레오타이드 제거First region modification - some nucleotides removed

상기 엔지니어링 된 스캐폴드 영역의 제1 영역은 자연계에서 발견되는 스캐폴드 영역의 제1 영역에서 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다.The first region of the engineered scaffold region may have one or more nucleotides removed from the first region of the scaffold region found in nature.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역의 변형된 제1 영역은 자연계에서 발견되는 스캐폴드 영역의 제1 영역에서 5'말단의 1개 내지 20개의 뉴클레오타이드가 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제1 영역은 자연계에서 발견되는 스캐폴드 영역의 제1 영역에서 5'말단의 1개, 2개, 3개, 4개, 5개, 6개, 7개, 8개, 9개, 10개, 11개, 12개, 13개, 14개, 15개, 16개, 17개, 18개, 19개, 또는 20개의 연속된 뉴클레오타이드가 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제1 영역은 자연계에서 발견되는 스캐폴드 영역의 제1 영역에서 5'말단에서 바로 이전 문장에서 선택된 두 개의 수치범위 내 연속된 뉴클레오타이드가 제거된 것일 수 있다. 예를 들어, 자연계에서 발견되는 스캐폴드 영역의 제1 영역에서 5'말단의 1개 내지 3개의 연속된 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified first region of the engineered scaffold region may have 1 to 20 nucleotides removed from the 5' end of the first region of the scaffold region found in nature. In one embodiment, the modified first region is 1, 2, 3, 4, 5, 6, 7, 8 at the 5' end of the first region of the scaffold region found in nature. 5, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 consecutive nucleotides may be removed. In one embodiment, the modified first region may be one in which consecutive nucleotides within two numerical ranges selected in the previous sentence are removed from the 5' end of the first region of the scaffold region found in nature. For example, 1 to 3 consecutive nucleotides at the 5' end may be removed from the first region of the scaffold region found in nature.

일 구현예로, 상기 변형된 제1 영역은 적어도 하나의 뉴클레오타이드를 포함하며, 이는 5'-A-3'일 수 있다.In one embodiment, the modified first region comprises at least one nucleotide, which may be 5'-A-3'.

변형된 제1 영역 서열 예시Modified first region sequence example

일 구현예로, 상기 변형된 제1 영역의 서열은 5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 및 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26)에서 선택된 것일 수 있다.In one embodiment, the sequence of the modified first region is 5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA -3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5 '-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5'-UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20) , 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23) 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25), and 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26).

일 구현예로, 제1 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the first region is modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', a sequence selected from the group consisting of SEQ ID NOs: 16 to 26;

서열번호 10,SEQ ID NO: 10,

서열번호 11, 및SEQ ID NO: 11, and

서열번호 12이 연결된 서열; 및sequence to which SEQ ID NO: 12 is linked; and

5'말단에서 3'말단 방향으로, 서열번호 14 및 5'-AUGCAAC-3'가 연결된 서열.A sequence in which SEQ ID NO: 14 and 5'-AUGCAAC-3' are linked in the direction from the 5' end to the 3' end.

일 구현예로, 제1 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the first region is modified may comprise:

5'-ACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 118), 5'-AACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 119), 5'-GAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 120), 5'-AGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 121), 5'-GAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 122), 5'-GGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 123), 5'-UGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 124), 5'-GUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 125), 5'-AGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 126), 5'-AAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 127), 5'-AAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 128), 5'-UAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 129), 5'-AUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 130), 5'-GAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 131), 5'-UGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 132), 5'-CUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 133), 5'-ACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 134), 5'-CACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 135), 5'-UCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 136), 및 5'-UUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 137)로 이뤄진 군에서 선택된 서열; 및 5'-GAAUGAAGGAAUGCAAC-3'(서열번호 3).5'-ACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 118), 5'-AACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 119), 5'-GAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 120), 5'-AGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 121 ), 5'-GAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 122), 5'-GGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 123), 5'-UGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 124), 5'-GUGGAGAACC GCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 125), 5'-AGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 126), 5'-AAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 127), 5'-AAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 128), 5 '-UAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 129), 5'-AUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 130), 5'-GAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAA AUUCAUUU-3'(서열번호 131), 5'-UGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 132), 5'-CUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 133), 5'-ACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 134), 5 '-CACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 135), 5'-UCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 136), 및 5'-UUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 137)로 이뤄진 군에서 선택된 서열; and 5'-GAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 3).

일 구현예로, 제1 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the first region is modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 및 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', and a sequence selected from the group consisting of SEQ ID NOs: 16-26;

서열번호 10,SEQ ID NO: 10,

서열번호 11, 및SEQ ID NO: 11, and

서열번호 13이 연결된 서열; 및sequence to which SEQ ID NO: 13 is linked; and

5'말단에서 3'말단 방향으로, 서열번호 15 및 5'-AUGCAAC-3'가 연결된 서열.A sequence in which SEQ ID NOs: 15 and 5'-AUGCAAC-3' are linked in a direction from the 5' end to the 3' end.

일 구현예로, 제1 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'에서 3'말단 방향으로,In one embodiment, the sequence of the engineered scaffold region in which the first region is modified is in the 5' to 3' terminal direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 및 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', and a sequence selected from the group consisting of SEQ ID NOs: 16-26;

서열번호 10,SEQ ID NO: 10,

서열번호 11,SEQ ID NO: 11,

서열번호 12,SEQ ID NO: 12,

링커,linker,

서열번호 14, 및SEQ ID NO: 14, and

5'-AUGCAAC-3'가 연결된 것일 수 있다.5'-AUGCAAC-3' may be linked.

이때, 상기 링커는 5'-GAAA-3'일 수 있다.In this case, the linker may be 5'-GAAA-3'.

일 구현예로, 제1 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-ACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 167), 5'-AACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 168), 5'-GAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 169), 5'-AGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 170), 5'-GAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 171), 5'-GGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 172), 5'-UGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 173), 5'-GUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 174), 5'-AGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 175), 5'-AAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 176), 5'-AAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 177), 5'-UAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 178), 5'-AUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 179), 5'-GAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 180), 5'-UGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 181), 5'-CUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 182), 5'-ACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 183), 5'-CACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 184), 5'-UCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 185), 및 5'-UUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 186)에서 선택된 서열일 수 있다.일 구현예로, 제1 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-ACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 167), 5'-AACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 168), 5'-GAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC- 3'(서열번호 169), 5'-AGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 170), 5'-GAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 171), 5'-GGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열 번호 172), 5'-UGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 173), 5'-GUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 174), 5'-AGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 175), 5'-AAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (서열번호 176), 5'-AAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 177), 5'-UAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC- 3'(서열번호 178), 5'-AUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 179), 5'-GAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 180), 5'-UGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 181), 5'- CUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 182), 5'-ACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 183), 5'-CACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAAC CCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 184), 5'-UCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 185), 및 5'-UUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 186)에서 선택된 서열일 수 있다.

엔지니어링 된 스캐폴드 영역 2 - 제2 영역 변형Engineered Scaffold Region 2 - Second Region Variant

제2 영역 변형 개괄Overview of the second area transformation

본 명세서에서 제공하는 엔지니어링 돤 가이드 RNA에 포함된 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 스캐폴드 영역에서 제2 영역이 변형된 것일 수 있다.The engineered scaffold region included in the engineered guide RNA provided herein may be a modified second region from the scaffold region found in nature.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제2 영역을 포함할 수 있다. 이때, 상기 변형된 제2 영역은 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서 하나 이상의 뉴클레오타이드가 제거된 것이다. 이때, 상기 제거된 뉴클레오타이드는 CRISPR/Cas12f1 복합체에서 Stem 구조를 형성하는 영역에서 선택된 뉴클레오타이드이다.In one embodiment, the engineered scaffold region may include a modified second region. In this case, the modified second region has one or more nucleotides removed from the second region of the scaffold region found in nature. In this case, the removed nucleotide is a nucleotide selected from the region forming the Stem structure in the CRISPR/Cas12f1 complex.

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA에 포함된 엔지니어링 된 스캐폴드 영역은, 자연계에서 발견되는 스캐폴드 영역 중 제2 영역에 포함된 하나 이상의 뉴클레오타이드가 제거된 것을 포함할 수 있다. 일 구현예로, 상기 뉴클레오타이드의 제거는 상기 자연계에서 발견되는 제2 영역 중 Stem 구조를 형성하는 부분에서 일어난 것이고, 뉴클레오타이드가 베이스 페어 단위로 제거된 것일 수 있다. 일 구현예로, 상기 제거된 뉴클레오타이드는 상기 자연계에서 발견되는 제2 영역 중 CRISPR/Cas12f1 복합체에서 Stem 구조를 형성하는 부분에 포함된 뉴클레오타이드일 수 있다. 일 구현예로, 상기 제거된 뉴클레오타이드는 상기 자연계에서 발견되는 제2 영역 중 Stem 2 (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))에 속하는 뉴클레오타이드일 수 있다. 일 구현예로, 상기 제거된 뉴클레오타이드는 상기 자연계에서 발견되는 제2 영역 중 CRISPR/Cas12f1 복합체에서 Cas12f1 단백질과 상호작용하지 않는 뉴클레오타이드일 수 있다.In one embodiment, the engineered scaffold region included in the engineered Cas12f1 guide RNA may include one in which one or more nucleotides included in the second region among scaffold regions found in nature have been removed. In one embodiment, the removal of the nucleotide may occur in a portion forming a stem structure among the second region found in the natural world, and the nucleotide may be removed in units of a base pair. In one embodiment, the removed nucleotide may be a nucleotide included in a portion forming a stem structure in the CRISPR/Cas12f1 complex among the second region found in nature. In one embodiment, the removed nucleotide is Stem 2 (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)) in the second region found in nature. It may be a nucleotide belonging to In one embodiment, the removed nucleotide may be a nucleotide that does not interact with the Cas12f1 protein in the CRISPR/Cas12f1 complex among the second region found in nature.

일 구현예로, 상기 변형된 제2 영역은 5'말단 방향에 5'-CCGCUUCAC-3'(서열번호 432) 서열을 가지는 것을 특징으로 한다. 일 구현예로, 상기 변형된 제2 영역은 3'말단 방향에 5'-UGAGUGAAGG-3'(서열번호 433) 서열을 가지는 것을 특징으로 한다.In one embodiment, the modified second region is characterized in that it has a 5'-CCGCUUCAC-3' (SEQ ID NO: 432) sequence in the 5'-terminal direction. In one embodiment, the modified second region is characterized in that it has a 5'-UGAGUGAAGG-3' (SEQ ID NO: 433) sequence in the 3' terminal direction.

제2 영역 변형 내용 1 - 일부 뉴클레오타이드 제거Second Region Modification 1 - Remove some nucleotides

상기 엔지니어링 된 스캐폴드 영역의 제2 영역은 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다.The second region of the engineered scaffold region may have one or more nucleotides removed from the second region of the scaffold region found in nature.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역의 변형된 제2 영역은 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서 1개 내지 27개의 뉴클레오타이드가 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제2 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서, 서열번호 10 서열 기준, 5'말단으로부터 12번째 내지 24번째 뉴클레오타이드, 및/또는 27번째 내지 40번째 뉴클레오타이드 중 하나 이상이 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제2 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서, 서열번호 10 서열 기준, 5'말단으로부터 12번째 내지 24번째 뉴클레오타이드 중 1개, 2개, 3개, 4개, 5개, 6개, 7개, 8개, 9개, 10개, 11개, 12개, 또는 13개의 연속된 뉴클레오타이드가 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제2 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서, 서열번호 10 서열 기준, 5'말단으로부터 27번째 내지 38번째 뉴클레오타이드 중 1개, 2개, 3개, 4개, 5개, 6개, 7개, 8개, 9개, 10개, 11개, 12개, 13개, 또는 14개의 연속된 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified second region of the engineered scaffold region may have 1 to 27 nucleotides removed from the second region of the scaffold region found in nature. In one embodiment, the modified second region is, in the second region of the scaffold region found in nature, based on the sequence of SEQ ID NO: 10, the 12th to 24th nucleotides from the 5' end, and/or the 27th to 40th One or more of the second nucleotides may be removed. In one embodiment, the modified second region is 1, 2, 3 of the 12th to 24th nucleotides from the 5' end based on the sequence of SEQ ID NO: 10 in the second region of the scaffold region found in nature. 5, 4, 5, 6, 7, 8, 9, 10, 11, 12, or 13 consecutive nucleotides may be removed. In one embodiment, the modified second region is 1, 2, 3 of the 27th to 38th nucleotides from the 5' end, based on the sequence of SEQ ID NO: 10, in the second region of the scaffold region found in nature. Dogs, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, or 14 consecutive nucleotides may be removed.

일 구현예로, 상기 변형된 제2 영역의 서열은 적어도 5'-CCGCUUCAC-3'(서열번호 432) 및 5'-UGAGUGAAGG-3'(서열번호 433)을 포함하는 것을 특징으로 한다. 이때, 상기 변형된 제2 영역의 서열은 5'말단에서 3'말단 방향으로 5'-CCGCUUCAC-3'(서열번호 432) 및 5'-UGAGUGAAGG-3'(서열번호 433)가 순서대로 연결되어 있을 수 있으며, 적절한 중간 서열을 통해 연결되어 있을 수 있다. 일 예로, 상기 중간 서열은 5'-UUAG-3', 5'-AUUAGU-3', 5'-AAUUAGCU-3', 5'-AAAUUAGACU-3'(서열번호 57), 5'-AAAGUUAGAACU-3'(서열번호 58), 5'-AAAGCUUAGGAACU-3'(서열번호 59), 5'-AAAGCUUUAGAGAACU-3'(서열번호 60), 5'-AAAGCUGUUAGUUAGAACU-3'(서열번호 61), 5'-AAAGCUGUUAGUAGAACU-3'(서열번호 62), 5'-AAAGCUGUUUAGAUUAGAACU-3'(서열번호 63), 5'-AAAGCUGUCUUAGGAUUAGAACU-3'(서열번호 64), 5'-AAAGCUGUCCUUAGGGAUUAGAACU-3'(서열번호 65), 5'-AAAAGCUGUCCCUUAGGGGAUUAGAACUU-3'(서열번호 434), 및 5'-CAAAAGCUGUCCCUUAGGGGAUUAGAACUUG-3'(서열번호 435)로 이뤄진 군에서 선택된 것일 수 있다.In one embodiment, the sequence of the modified second region is characterized in that it comprises at least 5'-CCGCUUCAC-3' (SEQ ID NO: 432) and 5'-UGAGUGAAGG-3' (SEQ ID NO: 433). At this time, in the sequence of the modified second region, 5'-CCGCUUCAC-3' (SEQ ID NO: 432) and 5'-UGAGUGAAGG-3' (SEQ ID NO: 433) are linked in order from the 5' end to the 3' end. There may be, and may be linked through an appropriate intermediate sequence. In one example, the intermediate sequence is 5'-UUAG-3', 5'-AUUAGU-3', 5'-AAUUAGCU-3', 5'-AAAUUAGACU-3' (SEQ ID NO: 57), 5'-AAAGUUAGAACU-3 '(SEQ ID NO: 58), 5'-AAAGCUUAGGAACU-3' (SEQ ID NO: 59), 5'-AAAGCUUUAGAGAACU-3' (SEQ ID NO: 60), 5'-AAAGCUGUUAGUUAGAACU-3' (SEQ ID NO: 61), 5'-AAAGCUGUUAGUAGAACU -3' (SEQ ID NO: 62), 5'-AAAGCUGUUUAGAUUAGAACU-3' (SEQ ID NO: 63), 5'-AAAGCUGUCUUAGGAUUAGAACU-3' (SEQ ID NO: 64), 5'-AAAGCUGUGUCCUUAGGGAUUAGAACU-3' (SEQ ID NO: 65), 5' -AAAAGCUGUCCCUUAGGGGAUUAGAACUU-3' (SEQ ID NO: 434), and 5'-CAAAAGCUGUCCCUUAGGGGAUUAGAACUUG-3' (SEQ ID NO: 435) may be selected from the group consisting of.

제2 영역 변형 내용 2 - 베이스 페어 단위 제거2nd Area Variant 2 - Remove Base Pair Unit

상기 제2 영역의 변형은 Stem 구조를 형성하는 부분에 포함된 서로 상보적인 결합을 하는 한 쌍 이상의 뉴클레오타이드가 제거된 것일 수 있다.The modification of the second region may be one in which one or more pairs of nucleotides complementary to each other included in the portion forming the stem structure are removed.

일 구현예로, 상기 변형된 제2 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서, 서열번호 10 서열 기준, 5'말단으로부터 12번째 내지 24번째 뉴클레오타이드, 및 27번째 내지 40번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified second region is the 12th to 24th nucleotides, and the 27th to 40th nucleotides from the 5' end, based on the sequence of SEQ ID NO: 10, in the second region of the scaffold region found in nature. In the CRISPR/Cas12f1 complex, one or more pairs of nucleotides constituting a base pair may be removed.

일 구현예로, 상기 변형된 제2 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서, 서열번호 10 서열 기준, 5'말단으로부터 12번째 내지 24번째 뉴클레오타이드, 및 27번째 내지 40번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드 및/또는 베이스 페어를 이루지 않는 1개 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified second region is the 12th to 24th nucleotides, and the 27th to 40th nucleotides from the 5' end, based on the sequence of SEQ ID NO: 10, in the second region of the scaffold region found in nature. In the CRISPR/Cas12f1 complex, one or more pairs of nucleotides forming a base pair and/or one or more nucleotides not forming a base pair may be removed.

일 구현예로, 상기 변형된 제2 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서, 서열번호 10 서열 기준, 5'말단으로부터 12번째 내지 24번째 뉴클레오타이드, 및 27번째 내지 40번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드 및/또는 미스매치인 1쌍 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified second region is the 12th to 24th nucleotides, and the 27th to 40th nucleotides from the 5' end, based on the sequence of SEQ ID NO: 10, in the second region of the scaffold region found in nature. In the CRISPR/Cas12f1 complex, one or more pairs of nucleotides constituting a base pair and/or one or more pairs of mismatched nucleotides may be removed.

변형된 제2 영역 서열 예시Modified second region sequence example

일 구현예로, 상기 변형된 제2 영역의 서열은 5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUG-3'(서열번호 48), 및 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUG-3'(서열번호 49)로 이뤄진 군에서 선택된 서열일 수 있다.In one embodiment, the sequence of the modified second region is 5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUG-3' (SEQ ID NO: 431) 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUG-3' (SEQ ID NO: 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUG-3' (SEQ ID NO: 41), 5'-CCGCUUCUGAGAAGG-3'-CCGCUUCUGAGAAGG-3' SEQ ID NO: 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUG-3' (SEQ ID NO: 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUG-3' (SEQ ID NO: 44), 5'-CCGCUUCUCACCAAAAGCUGUUAGUUAGAACUUGUAGUUAGUUAGAACUUGAGUGAAGGAAUGAGAG' (SEQ ID NO: 45), '(SEQ ID NO: 46), 5'-CCGCUUCACCAAAAGCUGUUUUAGAUUAGAACUUGAGUGAAGGUG-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUG-3' (SEQ ID NO: 48), and 5'-CCGCUUCACCAAUGAAUGUGUCUAGGGAA from group SEQ ID NO: 49) It may be a selected sequence.

일 구현예로, 제2 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the second region is modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

서열번호 9,SEQ ID NO: 9,

서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431로 이뤄진 군에서 선택된 서열,SEQ ID NO: 38 to SEQ ID NO: 49, and a sequence selected from the group consisting of SEQ ID NO: 430 to SEQ ID NO: 431,

서열번호 11, 및SEQ ID NO: 11, and

서열번호 12이 연결된 서열; 및sequence to which SEQ ID NO: 12 is linked; and

5'말단에서 3'말단 방향으로, 서열번호 14 및 5'-AUGCAAC-3'가 연결된 서열.A sequence in which SEQ ID NO: 14 and 5'-AUGCAAC-3' are linked in the direction from the 5' end to the 3' end.

일 구현예로, 제2 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the second region is modified may comprise:

5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 138), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAUUAGUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 139), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAUUAGACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 140), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 141), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 142), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 143), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 144), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 145), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 146), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 147), 및 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 148)로 이뤄진 군에서 선택된 서열; 및5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 138), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAUUAGUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 139), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAUUAGACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 140), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 141 ), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 142), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 143), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 144), 5'- CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 145), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 146), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 147), 및 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 148)로 a sequence selected from the group consisting of; and

서열번호 3.SEQ ID NO: 3.

일 구현예로, 제2 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the second region is modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

서열번호 9,SEQ ID NO: 9,

서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431에서 선택된 서열,a sequence selected from SEQ ID NO: 38 to SEQ ID NO: 49, and SEQ ID NO: 430 to SEQ ID NO: 431;

서열번호 11,SEQ ID NO: 11,

서열번호 13이 연결된 서열; 및sequence to which SEQ ID NO: 13 is linked; and

5'말단에서 3'말단 방향으로, 서열번호 15 및 5'-AUGCAAC-3'가 연결된 서열.A sequence in which SEQ ID NOs: 15 and 5'-AUGCAAC-3' are linked in a direction from the 5' end to the 3' end.

일 구현예로, 제2 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로, 서열번호 9, 서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431에서 선택된 서열, 서열번호 11, 서열번호 12, 링커, 서열번호 14, 및 5'-AUGCAAC-3'가 연결된 것일 수 있다.In one embodiment, the sequence of the engineered scaffold region in which the second region is modified is 5' to 3' end in SEQ ID NO: 9, SEQ ID NO: 38 to SEQ ID NO: 49, and SEQ ID NO: 430 to SEQ ID NO: 431 The selected sequence, SEQ ID NO: 11, SEQ ID NO: 12, linker, SEQ ID NO: 14, and 5'-AUGCAAC-3' may be linked.

이때, 상기 링커는 5'-GAAA-3'일 수 있다.In this case, the linker may be 5'-GAAA-3'.

일 구현예로, 제2 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 187), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAUUAGUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 188), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAUUAGCUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 189), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 190), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 191), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 192), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 193), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 194), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 195), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 196), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 197), 및 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 198)로 이뤄진 군에서 선택된 것일 수 있다.일 구현예로, 제2 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 187), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAUUAGUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 188), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAUUAGCUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC- 3'(서열번호 189), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 190), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 191), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 192), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 193), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 194), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 195), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'( 서열번호 196), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 197), 및 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAU UCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 198) may be selected from the group consisting of.

엔지니어링 된 스캐폴드 영역 3 - 제3 영역 변형Engineered Scaffold Region 3 - 3rd Region Variant

본 명세서에서 제공하는 엔지니어링 된 스캐폴드에서 제3 영역은 DNA 절단 활성에 핵심적인 영향을 미치는 Stem 4 부분을 포함하는 영역이다. 따라서, 상기 엔지니어링 된 스캐폴드의 제3 영역은 자연계에서 발견되는 스캐폴드의 제3 영역과 동일하거나, 상기 제3 영역의 기능이 손상되지 않는 한도 내에서 변형된 것일 수 있다.The third region in the engineered scaffold provided herein is a region containing a Stem 4 moiety that has a key effect on DNA cleavage activity. Accordingly, the third region of the engineered scaffold may be the same as the third region of the scaffold found in nature, or may be modified within the extent that the function of the third region is not impaired.

엔지니어링 된 스캐폴드 영역 4 - 제4 영역 및 제5 영역 변형Engineered Scaffold Region 4 - 4th and 5th Region Variants

제4 영역 및 제5 영역 변형 개괄4th and 5th Region Variants Overview

본 명세서에서 제공하는 엔지니어링 된 스캐폴드 영역은 자연계에서 발견되는 스캐폴드 영역에서 제4 영역 및 제5 영역이 변형된 것일 수 있다. 상기 제4 영역 및 제5 영역은 CRISPR/Cas12f1 복합체 내에서 서로 혼성화되어 Stem을 구성하는 부분을 포함하므로, 해당 부분이 같이 변형되어 엔지니어링 된 스캐폴드 영역을 구성할 수 있다.The engineered scaffold region provided herein may be a modified region of the fourth and fifth regions from the scaffold region found in nature. Since the fourth region and the fifth region include a portion that hybridizes with each other in the CRISPR/Cas12f1 complex to constitute a stem, the corresponding portion may be modified together to constitute an engineered scaffold region.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제4 영역 및/또는 변형된 제5 영역을 포함할 수 있다.In one embodiment, the engineered scaffold region may include a modified fourth region and/or a modified fifth region.

변형된 제4 영역은 자연계에서 발견되는 스캐폴드 영역의 제4 영역에서 하나 이상의 뉴클레오타이드가 제거된 것을 특징으로 한다. 변형된 제5 영역은 자연계에서 발견되는 스캐폴드 영역의 제5 영역에서 하나 이상의 뉴클레오타이드가 제거된 것을 특징으로 한다.The modified fourth region is characterized in that one or more nucleotides have been removed from the fourth region of the scaffold region found in nature. The modified fifth region is characterized in that one or more nucleotides have been removed from the fifth region of the scaffold region found in nature.

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA에 포함된 엔지니어링 된 스캐폴드 영역은, 자연계에서 발견되는 스캐폴드 영역 중 제4 영역 및/또는 제5 영역의 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the engineered scaffold region included in the engineered Cas12f1 guide RNA may have one or more nucleotides removed from the fourth region and/or the fifth region among scaffold regions found in nature.

일 구현예로, 상기 변형된 제4 영역은 5'말단 방향에 5'-AACAAA-3' 서열을 가지는 것을 특징으로 한다. 일 구현예로, 상기 변형된 제5 영역은 3'말단 방향에 5'-GGA-3' 서열을 가지는 것을 특징으로 한다.In one embodiment, the modified fourth region is characterized in that it has a 5'-AACAAA-3' sequence in the 5'-terminal direction. In one embodiment, the modified fifth region is characterized in that it has a 5'-GGA-3' sequence in the 3'-terminal direction.

제4 영역 및 제5 영역 변형 내용 1 - 일부 뉴클레오타이드 제거4th and 5th Region Modifications 1 - Remove some nucleotides

상기 엔지니어링 된 스캐폴드 영역의 제4 영역은 자연계에서 발견되는 스캐폴드 영역 중 제4 영역에서 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다. 상기 엔지니어링 된 스캐폴드 영역의 제5 영역은 자연계에서 발견되는 스캐폴드 영역 중 제5 영역에서 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다.The fourth region of the engineered scaffold region may be one in which one or more nucleotides have been removed from the fourth region among scaffold regions found in nature. The fifth region of the engineered scaffold region may have one or more nucleotides removed from the fifth region among scaffold regions found in nature.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역의 변형된 제4 영역은 자연계에서 발견되는 스캐폴드 영역의 제4 영역에서 1개 내지 7개의 뉴클레오타이드가 제거된 것일 수 있다. 일 구현예로, 상기 엔지니어링 된 스캐폴드 영역의 변형된 제4 영역은 자연계에서 발견되는 스캐폴드 영역의 제4 영역에서 1개 내지 28개의 뉴클레오타이드가 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제4 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제4 영역에서, 서열번호 12 서열 기준, 5'말단으로부터 7번째 내지 13번째 뉴클레오타이드 중 하나 이상이 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제4 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제4 영역에서, 서열번호 13 서열 기준, 5'말단으로부터 7번째 내지 34번째 뉴클레오타이드 중 하나 이상이 제거된 것일 수 있다.In one embodiment, the modified fourth region of the engineered scaffold region may have 1 to 7 nucleotides removed from the fourth region of the scaffold region found in nature. In one embodiment, the modified fourth region of the engineered scaffold region may have 1 to 28 nucleotides removed from the fourth region of the scaffold region found in nature. In one embodiment, the modified fourth region may be one in which at least one of the 7th to 13th nucleotides from the 5' end has been removed from the fourth region of the scaffold region found in nature, based on the sequence of SEQ ID NO: 12. have. In one embodiment, the modified fourth region may be one in which one or more of the 7th to 34th nucleotides from the 5' end have been removed from the fourth region of the scaffold region found in nature, based on the sequence of SEQ ID NO: 13. have.

일 구현예로, 상기 변형된 제4 영역의 서열은 적어도 5'-AACAAA-3'을 포함하는 것을 특징으로 한다.In one embodiment, the sequence of the modified fourth region is characterized in that it comprises at least 5'-AACAAA-3'.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역의 변형된 제5 영역은 자연계에서 발견되는 스캐폴드 영역의 제5 영역에서 1개 내지 7개의 뉴클레오타이드가 제거된 것일 수 있다. 일 구현예로, 상기 엔지니어링 된 스캐폴드 영역의 변형된 제5 영역은 자연계에서 발견되는 스캐폴드 영역의 제5 영역에서 1개 내지 27개의 뉴클레오타이드가 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제5 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제5 영역에서, 서열번호 14 서열 기준, 5'말단으로부터 1번째 내지 7번째 뉴클레오타이드 중 하나 이상이 제거된 것일 수 있다. 일 구현예로, 상기 변형된 제5 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제5 영역에서, 서열번호 15 서열 기준, 5'말단으로부터 1번째 내지 27번째 뉴클레오타이드 중 하나 이상이 제거된 것일 수 있다.In one embodiment, the modified fifth region of the engineered scaffold region may have 1 to 7 nucleotides removed from the fifth region of the scaffold region found in nature. In one embodiment, the modified fifth region of the engineered scaffold region may have 1 to 27 nucleotides removed from the fifth region of the scaffold region found in nature. In one embodiment, the modified fifth region may be one in which at least one of the 1st to 7th nucleotides from the 5' end has been removed from the fifth region of the scaffold region found in nature, based on the sequence of SEQ ID NO: 14. have. In one embodiment, the modified fifth region may be one in which one or more of the 1st to 27th nucleotides from the 5' end have been removed from the fifth region of the scaffold region found in nature, based on the sequence of SEQ ID NO: 15. have.

일 구현예로, 상기 변형된 제5 영역의 서열은 적어도 5'-GGA-3'을 포함하는 것을 특징으로 한다.In one embodiment, the sequence of the modified fifth region comprises at least 5'-GGA-3'.

제4 영역 및 제5 영역 변형 내용 3 - 베이스 페어 단위 제거4th area and 5th area variation content 3 - Remove base pair unit

상기 제4 영역 및 제5 영역은 CRISPR/Cas12 복합체 내에서 서로 상보적으로 결합하여 Stem을 이루는 것으로 알려져 있다. 전술한 제4 영역 및 제5 영역의 변형은 이러한 Stem을 구성하는 하나 이상의 뉴클레오타이드를 대상으로 하므로, 상기 제4 영역 및 제5 영역의 변형은 Stem을 구성하는 뉴클레오타이드들을 베이스 페어 단위로 제거하는 것일 수 있다.It is known that the fourth region and the fifth region complement each other in the CRISPR/Cas12 complex to form a stem. Since the above-described modification of the fourth region and the fifth region targets one or more nucleotides constituting the stem, the modification of the fourth region and the fifth region may be to remove the nucleotides constituting the stem in units of a base pair. have.

일 구현예로, 상기 변형된 제4 영역 및 제5 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제4 영역 및 제5 영역에서, 서열번호 12 서열 기준, 7번째 내지 13번째 뉴클레오타이드 및, 서열번호 14 서열 기준, 1번째 내지 7번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified fourth region and the fifth region are, in the fourth region and the fifth region of the scaffold region found in nature, based on SEQ ID NO: 12, the 7th to 13th nucleotides, and SEQ ID NO: 14 Based on the sequence, one or more pairs of nucleotides constituting a base pair in the CRISPR/Cas12f1 complex among the 1st to 7th nucleotides may be removed.

일 구현예로, 상기 변형된 제4 영역 및 제5 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제4 영역 및 제5 영역에서, 서열번호 12 서열 기준, 7번째 내지 13번째 뉴클레오타이드 및, 서열번호 14 서열 기준, 1번째 내지 7번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드 및/또는 베이스 페어를 이루지 않는 1개 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified fourth region and the fifth region are, in the fourth region and the fifth region of the scaffold region found in nature, based on SEQ ID NO: 12, the 7th to 13th nucleotides, and SEQ ID NO: 14 Based on the sequence, one or more pairs of nucleotides forming a base pair and/or one or more nucleotides not forming a base pair in the CRISPR/Cas12f1 complex among the 1st to 7th nucleotides may be removed.

일 구현예로, 상기 변형된 제4 영역 및 제5 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제4 영역 및 제5 영역에서, 서열번호 12 서열 기준, 7번째 내지 13번째 뉴클레오타이드 및, 서열번호 14 서열 기준, 1번째 내지 7번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드 및/또는 미스매치인 1쌍 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified fourth region and the fifth region are, in the fourth region and the fifth region of the scaffold region found in nature, based on SEQ ID NO: 12, the 7th to 13th nucleotides, and SEQ ID NO: 14 Based on the sequence, one or more pairs of nucleotides constituting a base pair in the CRISPR/Cas12f1 complex among the first to seventh nucleotides and/or one or more pairs of mismatched nucleotides may be removed.

일 구현예로, 상기 변형된 제4 영역 및 제5 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제4 영역 및 제5 영역에서, 서열번호 13 서열 기준, 7번째 내지 34번째 뉴클레오타이드 및, 서열번호 15 서열 기준, 1번째 내지 27번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified fourth region and the fifth region are, in the fourth region and the fifth region of the scaffold region found in nature, based on SEQ ID NO: 13, the 7th to 34th nucleotides, and SEQ ID NO: Based on the 15th sequence, one or more pairs of nucleotides constituting a base pair in the CRISPR/Cas12f1 complex among the 1st to 27th nucleotides may be removed.

일 구현예로, 상기 변형된 제4 영역 및 제5 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제4 영역 및 제5 영역에서, 서열번호 13 서열 기준, 7번째 내지 34번째 뉴클레오타이드 및, 서열번호 15 서열 기준, 1번째 내지 27번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드 및/또는 베이스 페어를 이루지 않는 1개 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified fourth region and the fifth region are, in the fourth region and the fifth region of the scaffold region found in nature, based on SEQ ID NO: 13, the 7th to 34th nucleotides, and SEQ ID NO: Based on the 15th sequence, one or more pairs of nucleotides forming a base pair and/or one or more nucleotides not forming a base pair in the CRISPR/Cas12f1 complex among the 1st to 27th nucleotides may be removed.

일 구현예로, 상기 변형된 제4 영역 및 제5 영역은 상기 자연계에서 발견되는 스캐폴드 영역의 제4 영역 및 제5 영역에서, 서열번호 13 서열 기준, 7번째 내지 34번째 뉴클레오타이드 및, 서열번호 15 서열 기준, 1번째 내지 27번째 뉴클레오타이드 중 CRISPR/Cas12f1 복합체에서 베이스 페어를 이루는 1쌍 이상의 뉴클레오타이드 및/또는 미스매치인 1쌍 이상의 뉴클레오타이드가 제거된 것일 수 있다.In one embodiment, the modified fourth region and the fifth region are, in the fourth region and the fifth region of the scaffold region found in nature, based on SEQ ID NO: 13, the 7th to 34th nucleotides, and SEQ ID NO: Based on the 15th sequence, one or more pairs of nucleotides constituting a base pair in the CRISPR/Cas12f1 complex and/or one or more pairs of mismatched nucleotides among the 1st to 27th nucleotides may be removed.

변형된 제4 영역 및 제5 영역 서열 예시Examples of Modified Fourth and Fifth Region Sequences

일 구현예로, 상기 변형된 제4 영역의 서열은 5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAUUCA-3'(서열번호 66), 5'-AACAAAUUCAU-3'(서열번호 67), 및 5'-AACAAAUUCAUU-3'(서열번호 68)로 이뤄진 군에서 선택된 것일 수 있다.In one embodiment, the sequence of the modified fourth region is 5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAAUUCA It may be selected from the group consisting of -3' (SEQ ID NO: 66), 5'-AACAAAUUCAU-3' (SEQ ID NO: 67), and 5'-AACAAAUUCAUU-3' (SEQ ID NO: 68).

일 구현예로, 상기 변형된 제4 영역의 서열은 5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAUUCA-3'(서열번호 66), 5'-AACAAAUUCAU-3'(서열번호 67), 5'-AACAAAUUCAUU-3'(서열번호 68), 5'-AACAAAUUCAUUU-3'(서열번호 69), 5'-AACAAAUUCAUUUU-3'(서열번호 70), 5'-AACAAAUUCAUUUUU-3'(서열번호 71), 5'-AACAAAUUCAUUUUUC-3'(서열번호 72), 5'-AACAAAUUCAUUUUUCC-3'(서열번호 73), 5'-AACAAAUUCAUUUUUCCU-3'(서열번호 74), 5'-AACAAAUUCAUUUUUCCUC-3'(서열번호 75), 5'-AACAAAUUCAUUUUUCCUCU-3'(서열번호 76), 5'-AACAAAUUCAUUUUUCCUCUC-3'(서열번호 77), 5'-AACAAAUUCAUUUUUCCUCUCC-3'(서열번호 78), 5'-AACAAAUUCAUUUUUCCUCUCCA-3'(서열번호 79), 5'-AACAAAUUCAUUUUUCCUCUCCAA-3'(서열번호 80), 5'-AACAAAUUCAUUUUUCCUCUCCAAU-3'(서열번호 81), 5'-AACAAAUUCAUUUUUCCUCUCCAAUU-3'(서열번호 82), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUC-3'(서열번호 83), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCU-3'(서열번호 84), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUG-3'(서열번호 85), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGC-3'(서열번호 86), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCA-3'(서열번호 87), 5'-AAACAAAUUCAUUUUUCCUCUCCAAUUCUGCAC-3'(서열번호 88), 및 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACA-3'(서열번호 89)로 이뤄진 군에서 선택된 것일 수 있다.In one embodiment, the sequence of the modified fourth region is 5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAAUUCA -3' (SEQ ID NO: 66), 5'-AACAAAUUCAU-3' (SEQ ID NO: 67), 5'-AACAAAUUCAUU-3' (SEQ ID NO: 68), 5'-AACAAAUUCAUUU-3' (SEQ ID NO: 69), 5' -AACAAAUUCAUUUU-3' (SEQ ID NO: 70), 5'-AACAAAUUCAUUUUU-3' (SEQ ID NO: 71), 5'-AACAAAUUCAUUUUUC-3' (SEQ ID NO: 72), 5'-AACAAAUUCAUUUUUCC-3' (SEQ ID NO: 73), 5'-AACAAAUUCAUUUUUCCU-3' (SEQ ID NO: 74), 5'-AACAAAUUCAUUUUUCCUC-3' (SEQ ID NO: 75), 5'-AACAAAUUCAUUUUUUCCUCU-3' (SEQ ID NO: 76), 5'-AACAAAUUCAUUUUUCCUCUC-3' (SEQ ID NO: 77) ), 5'-AACAAAUUCAUUUUUCCUCUCC-3' (SEQ ID NO: 78), 5'-AACAAAUUCAUUUUUCCUCUCCA-3' (SEQ ID NO: 79), 5'-AACAAAUUCAUUUUUCCUCUCCAA-3' (SEQ ID NO: 80), 5'-AACAAAUUCAUUUUUCCUCUCUCCAAU-3' (SEQ ID NO: 80) No. 81), 5'-AACAAAUUCAUUUUUCCUCUCCAAUU-3' (SEQ ID NO: 82), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUC-3' (SEQ ID NO: 83), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUUCCUCUUCCAAUUCUAUUCUUCCAAUUCUAUUCUUCCUCUCUCUCU-3' (SEQ ID NO: 84), 5'-ACCAA (SEQ ID NO: 85), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGC-3' (SEQ ID NO: 86), 5'-AACAAAUUCAUUUUUCCUCUCCCAAUUCUGCA-3' (SEQ ID NO: 87), 5'-AAACAAAUUCAUUUUUCCUCUCUCCAUUCUCCCUCCAUAAUUG'-ACAUCAUCAUUCCUCCUCAAUUGAC-3' (SEQ ID NO: 86) It may be selected from the group consisting of A-3' (SEQ ID NO: 89).

일 구현예로, 상기 변형된 제5 영역의 서열은 5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 및 5'-AAUGAAGGA-3'에서 선택된 것일 수 있다.In one embodiment, the sequence of the modified fifth region is 5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA It may be selected from -3', 5'-AUGAAGGA-3', and 5'-AAUGAAGGA-3'.

일 구현예로, 상기 변형된 제5 영역의 서열은 5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5'-AAUGAAGGA-3', 5'-GAAUGAAGGA-3'(서열번호 90), 5'-CGAAUGAAGGA-3'(서열번호 91), 5'-ACGAAUGAAGGA-3'(서열번호 92), 5'-GACGAAUGAAGGA-3'(서열번호 93), 5'-AGACGAAUGAAGGA-3'(서열번호 94), 5'-UAGACGAAUGAAGGA-3'(서열번호 95), 5'-AUAGACGAAUGAAGGA-3'(서열번호 96), 5'-AAUAGACGAAUGAAGGA-3'(서열번호 97), 5'-GAAUAGACGAAUGAAGGA-3'(서열번호 98), 5'-CGAAUAGACGAAUGAAGGA-3'(서열번호 99), 5'-CCGAAUAGACGAAUGAAGGA-3'(서열번호 100), 5'-CCCGAAUAGACGAAUGAAGGA-3'(서열번호 101), 5'-ACCCGAAUAGACGAAUGAAGGA-3'(서열번호 102), 5'-AACCCGAAUAGACGAAUGAAGGA-3'(서열번호 103), 5'-GAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 104), 5'-AGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 105), 5'-CAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 106), 5'-GCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 107), 5'-UGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 108), 및 5'-UUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 109)로 이뤄진 군에서 선택된 것일 수 있다.In one embodiment, the sequence of the modified fifth region is 5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA -3', 5'-AUGAAGGA-3', 5'-AAUGAAGGA-3', 5'-GAAUGAAGGA-3' (SEQ ID NO: 90), 5'-CGAAUGAAGGA-3' (SEQ ID NO: 91), 5'-ACGAAUGAAGGA -3' (SEQ ID NO: 92), 5'-GACGAAUGAAGGA-3' (SEQ ID NO: 93), 5'-AGACGAAUGAAGGA-3' (SEQ ID NO: 94), 5'-UAGACGAAUGAAGGA-3' (SEQ ID NO: 95), 5' -AUAGACGAAUGAAGGA-3' (SEQ ID NO: 96), 5'-AAUAGACGAAUGAAGGA-3' (SEQ ID NO: 97), 5'-GAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 98), 5'-CGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 99), 5'-CCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 100), 5'-CCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 101), 5'-ACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 102), 5'-AACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 103) ), 5'-GAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 104), 5'-AGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 105), 5'-CAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 106), 5'-GCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 105) 107), 5'-UGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 108), and 5'-UUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 109).

일 구현예로, 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the fourth region and the fifth region are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

서열번호 9,SEQ ID NO: 9,

서열번호 10,SEQ ID NO: 10,

서열번호 11, 및SEQ ID NO: 11, and

5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3',서열번호 66 내지 68로 이뤄진 군에서 선택된 서열이 연결된 서열; 및5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', a sequence to which a sequence selected from the group consisting of SEQ ID NOs: 66 to 68 is linked; and

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 및 5'-AAUGAAGGA-3'로 이뤄진 군에서 선택된 서열, 및5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', and 5 a sequence selected from the group consisting of '-AAUGAAGGA-3', and

5'-AUGCAAC-3'가 연결된 서열.5'-AUGCAAC-3' linked sequence.

일 구현예로, 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the fourth region and the fifth region are modified may comprise:

5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAA-3'(서열번호 149), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAU-3'(서열번호 150), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUU-3'(서열번호 151), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUC-3'(서열번호 152), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCA-3'(서열번호 153), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAU-3'(서열번호 154), 및 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUU-3'(서열번호 155)로 이뤄진 군에서 선택된 서열; 및5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAA-3'(서열번호 149), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAU-3'(서열번호 150), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUU-3'(서열번호 151), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUC-3'(서열번호 152 ), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCA-3'(서열번호 153), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAU-3'(서열번호 154), 및 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGC a sequence selected from the group consisting of UGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUU-3' (SEQ ID NO: 155); and

5'-GGAAUGCAAC-3', 5'-AGGAAUGCAAC-3', 5'-AAGGAAUGCAAC-3', 5'-GAAGGAAUGCAAC-3', 5'-UGAAGGAAUGCAAC-3', 5'-AUGAAGGAAUGCAAC-3', 및 5'-AAUGAAGGAAUGCAAC-3'로 이뤄진 군에서 선택된 서열.5'-GGAAUGCAAC-3', 5'-AGGAAUGCAAC-3', 5'-AAGGAAUGCAAC-3', 5'-GAAGGAAUGCAAC-3', 5'-UGAAGGAAUGCAAC-3', 5'-AUGAAGGAAUGCAAC-3', and 5 A sequence selected from the group consisting of '-AAUGAAGGAAUGCAAC-3'.

일 구현예로, 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 다음을 포함할 수 있다:In one embodiment, the sequence of the engineered scaffold region in which the fourth region and the fifth region are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

서열번호 9,SEQ ID NO: 9,

서열번호 10,SEQ ID NO: 10,

서열번호 11, 및SEQ ID NO: 11, and

5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 서열번호 66 내지 89로 이뤄진 군에서 선택된 서열이 연결된 서열; 및5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', a sequence to which a sequence selected from the group consisting of SEQ ID NOs: 66 to 89 is linked; and

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5'-AAUGAAGGA-3', 서열번호 90 내지 109로 이뤄진 군에서 선택된 서열, 및5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5' -AAUGAAGGA-3', a sequence selected from the group consisting of SEQ ID NOs: 90-109, and

5'-AUGCAAC-3'가 연결된 서열.5'-AUGCAAC-3' linked sequence.

일 구현예로, 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로,In one embodiment, the sequence of the engineered scaffold region in which the fourth region and the fifth region are modified is from the 5' end to the 3' end direction,

서열번호 9,SEQ ID NO: 9,

서열번호 10,SEQ ID NO: 10,

서열번호 11, 및SEQ ID NO: 11, and

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 및 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116)에서 선택된 서열, 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), and 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and

5'-AUGCAAC-3'가 연결된 것일 수 있다.5'-AUGCAAC-3' may be linked.

일 구현예로, 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAGAAAGGAAUGCAAC-3'(서열번호 199), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUGAAAAGGAAUGCAAC-3'(서열번호 200), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUGAAAAAGGAAUGCAAC-3'(서열번호 201), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCGAAAGAAGGAAUGCAAC-3'(서열번호 202), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAGAAAUGAAGGAAUGCAAC-3'(서열번호 203), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUGAAAAUGAAGGAAUGCAAC-3'(서열번호 204), 및 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUGAAAAAUGAAGGAAUGCAAC-3'(서열번호 205)로 이뤄진 군에서 선택된 것일 수 있다.일 구현예로, 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAGAAAGGAAUGCAAC-3'(서열번호 199), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUGAAAAGGAAUGCAAC-3'(서열번호 200), 5 '-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUGAAAAAGGAAUGCAAC-3'(서열번호 201), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCGAAAGAAGGAAUGCAAC-3'(서열번호 202), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAGAAAUGAAGGAAUGCAAC-3'(서열번호 203), 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGC UUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUGAAAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 204), and 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGUGCAGUGGGCUGCUUGAAUCAGAUACCCUAAUGUCGAGAGAAGUGAAA number of columns selected from the group consisting of:

엔지니어링 된 스캐폴드 영역 5 - 제6 영역 변형Engineered Scaffold Area 5 - 6th Area Variant

본 명세서에서 제공하는 엔지니어링 된 스캐폴드에서 제6 영역은 PK (R:AR-1) 부분 중 crRNA에 속한 뉴클레오타이드를 포함하는 영역이다. 전술한 바, 상기 제6 영역은 CRISPR/Cas12f1 복합체에서, 이량체를 이루는 하나의 Cas12f1 단백질의 WED 도메인, ZF 도메인 및/또는 RuvC 도메인과 상호작용하는 하나 이상의 뉴클레오타이드를 포함한다. 상기 엔지니어링 된 스캐폴드의 제6 영역은 자연계에서 발견되는 스캐폴드의 제6 영역과 동일하거나, 상기 제6 영역의 기능이 손상되지 않는 한도 내에서 변형된 것일 수 있다.In the engineered scaffold provided herein, the sixth region is a region including nucleotides belonging to crRNA in the PK (R:AR-1) region. As described above, the sixth region includes one or more nucleotides that interact with the WED domain, the ZF domain and/or the RuvC domain of one Cas12f1 protein constituting a dimer in the CRISPR/Cas12f1 complex. The sixth region of the engineered scaffold may be the same as the sixth region of the scaffold found in nature, or may be modified within the extent that the function of the sixth region is not impaired.

엔지니어링 된 스캐폴드 영역 6 - 각 변형의 조합Engineered Scaffold Area 6 - Combination of Each Variant

각 변형의 조합 개괄Combination overview of each variant

본 명세서에서 제공하는 엔지니어링 된 Cas12f1 가이드 RNA에 포함된 엔지니어링 된 스캐폴드 영역은, 자연계에서 발견되는 스캐폴드 영역에 전술한 각 영역 별 변형이 하나 이상 조합된 것일 수 있다.The engineered scaffold region included in the engineered Cas12f1 guide RNA provided herein may be a combination of one or more modifications for each region described above with a scaffold region found in nature.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제1 영역, 및 변형된 제2 영역을 포함할 수 있다.In one embodiment, the engineered scaffold region may include a deformed first region and a deformed second region.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제1 영역, 및 변형된 제4 영역 및 제5 영역을 포함할 수 있다.In one embodiment, the engineered scaffold region may include a deformed first region, and a deformed fourth region and a fifth region.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제2 영역, 및 변형된 제4 영역 및 제5 영역을 포함할 수 있다.In one embodiment, the engineered scaffold region may include a modified second region, and a modified fourth region and a fifth region.

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제1 영역, 변형된 제2 영역, 및 변형된 제4 영역 및 제5 영역을 포함할 수 있다.In one embodiment, the engineered scaffold region may include a deformed first region, a deformed second region, and a fourth and fifth deformed region.

이때, 상기 변형된 영역은 전술한 각 영역의 변형 단락에서 설명된 바와 같다.In this case, the deformed region is the same as described in the section on deformation of each region.

각 변형의 조합 1 - 제1 영역 변형 및 제2 영역 변형Combination of each variant 1 - first region variant and second region variant

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제1 영역 및 변형된 제2 영역을 포함한다. 이때, 상기 변형된 제1 영역은 "엔지니어링 된 스캐폴드 영역 1 - 제1 영역 변형" 단락에서 설명된 변형을 모두 포함한다. 이때, 상기 변형된 제2 영역은 "엔지니어링 된 스캐폴드 영역 2 - 제2 영역 변형" 단락에서 설명된 변형을 모두 포함한다.In one embodiment, the engineered scaffold region comprises a deformed first region and a deformed second region. In this case, the modified first region includes all the modifications described in the paragraph “Engineered scaffold region 1 - first region deformation”. In this case, the modified second region includes all of the modifications described in the paragraph “Engineered scaffold region 2 - second region deformation”.

일 구현예로, 제1 영역 및 제2 영역이 변형된 엔지니어링 된 스캐폴드 서열은 다음을 포함할 수 있다:In one embodiment, the engineered scaffold sequence in which the first region and the second region are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', a sequence selected from the group consisting of SEQ ID NOs: 16 to 26;

서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431로 이뤄진 군에서 선택된 서열,SEQ ID NO: 38 to SEQ ID NO: 49, and a sequence selected from the group consisting of SEQ ID NO: 430 to SEQ ID NO: 431,

서열번호 11, 및SEQ ID NO: 11, and

서열번호 12이 연결된 서열; 및sequence to which SEQ ID NO: 12 is linked; and

5'말단에서 3'말단 방향으로, 서열번호 14 및 5'-AUGCAAC-3'가 연결된 서열.A sequence in which SEQ ID NO: 14 and 5'-AUGCAAC-3' are linked in the direction from the 5' end to the 3' end.

일 구현예로, 제1 영역 및 제2 영역이 변형된 엔지니어링 된 스캐폴드 서열은 다음을 포함할 수 있다:In one embodiment, the engineered scaffold sequence in which the first region and the second region are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-ACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 156); 및5'-ACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3' (SEQ ID NO: 156); and

5'-GAAUGAAGGAAUGCAAC-3'(서열번호 3).5'-GAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 3).

일 구현예로, 제1 영역 및 제2 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'에서 3'말단 방향으로,In one embodiment, the sequence of the engineered scaffold region in which the first region and the second region are modified is in the 5' to 3' terminal direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', a sequence selected from the group consisting of SEQ ID NOs: 16 to 26;

서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431로 이뤄진 군에서 선택된 서열,SEQ ID NO: 38 to SEQ ID NO: 49, and a sequence selected from the group consisting of SEQ ID NO: 430 to SEQ ID NO: 431,

서열번호 11,SEQ ID NO: 11,

서열번호 12,SEQ ID NO: 12,

링커,linker,

서열번호 14, 및SEQ ID NO: 14, and

5'-AUGCAAC-3'가 연결된 것일 수 있다.5'-AUGCAAC-3' may be linked.

이때, 상기 링커는 5'-GAAA-3'일 수 있다.In this case, the linker may be 5'-GAAA-3'.

일 구현예로, 제1 영역 및 제2 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-ACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 206)일 수 있다.In one embodiment, the sequence of the engineered scaffold region in which the first region and the second region are modified may be 5'-ACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 206).

각 변형의 조합 2 - 제1 영역 변형 및 제4 영역 및 제5 영역 변형Combination of each variant 2 - first region variant and fourth region and fifth region variant

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제1 영역, 및 변형된 제4 영역 및 제5 영역을 포함한다. 이때, 상기 변형된 제1 영역은 "엔지니어링 된 스캐폴드 영역 1 - 제1 영역 변형" 단락에서 설명된 변형을 모두 포함한다. 이때, 상기 변형된 제4 영역 및 제5 영역은 "엔지니어링 된 스캐폴드 영역 3 - 제4 영역 및 제5 영역 변형" 단락에서 설명된 변형을 모두 포함한다.In one embodiment, the engineered scaffold region comprises a deformed first region, and a deformed fourth region and a fifth region. In this case, the modified first region includes all the modifications described in the paragraph “Engineered scaffold region 1 - first region deformation”. In this case, the modified fourth region and the fifth region include all of the modifications described in the paragraph “Engineered scaffold region 3 - fourth region and fifth region deformation”.

일 구현예로, 제1 영역, 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 서열은 다음을 포함할 수 있다:In one embodiment, the engineered scaffold sequence in which the first region and the fourth and fifth regions are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', a sequence selected from the group consisting of SEQ ID NOs: 16 to 26;

서열번호 10,SEQ ID NO: 10,

서열번호 11, 및SEQ ID NO: 11, and

5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 서열번호 66 내지 68로 이뤄진 군에서 선택된 서열이 연결된 서열; 및5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', a sequence to which a sequence selected from the group consisting of SEQ ID NOs: 66 to 68 is linked; and

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 및 5'-AAUGAAGGA-3'로 이뤄진 군에서 선택된 서열, 및5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', and 5 a sequence selected from the group consisting of '-AAUGAAGGA-3', and

5'-AUGCAAC-3'가 연결된 서열.5'-AUGCAAC-3' linked sequence.

일 구현예로, 제1 영역, 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 서열은 다음을 포함할 수 있다:In one embodiment, the engineered scaffold sequence in which the first region and the fourth and fifth regions are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-ACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAA-3'(서열번호 157); 및5'-ACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAA-3' (SEQ ID NO: 157); and

5'-GGAAUGCAAC-3'(서열번호 160).5'-GGAAUGCAAC-3' (SEQ ID NO: 160).

일 구현예로, 제1 영역 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'에서 3'말단 방향으로,In one embodiment, the sequence of the engineered scaffold region in which the first region and the fourth region and the fifth region are modified is in the 5′ to 3′ terminal direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', a sequence selected from the group consisting of SEQ ID NOs: 16 to 26;

서열번호 10,SEQ ID NO: 10,

서열번호 11,SEQ ID NO: 11,

서열번호 110 내지 116에서 선택된 서열, 및a sequence selected from SEQ ID NOs: 110-116, and

5'-AUGCAAC-3'가 연결된 것일 수 있다.5'-AUGCAAC-3' may be linked.

일 구현예로, 제1 영역 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-ACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAGAAAGGAAUGCAAC-3'(서열번호 207)일 수 있다.In one embodiment, the sequence of the engineered scaffold region in which the first region and the fourth region and the fifth region are modified may be 5'-ACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAGAAAGGAAUGCAAC-3' (SEQ ID NO: 207).

각 변형의 조합 3 - 제2 영역 변형 및 제4 영역 및 제5 영역 변형Combination of each variant 3 - second region variant and fourth region and fifth region variant

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제2 영역 및 변형된 제4 영역 및 제5 영역을 포함한다. 이때, 상기 변형된 제2 영역은 "엔지니어링 된 스캐폴드 영역 2 - 제2 영역 변형" 단락에서 설명된 변형을 모두 포함한다. 이때, 상기 변형된 제4 영역 및 제5 영역은 "엔지니어링 된 스캐폴드 영역 3 - 제4 영역 및 제5 영역 변형" 단락에서 설명된 변형을 모두 포함한다.In one embodiment, the engineered scaffold region comprises a modified second region and a modified fourth region and a fifth region. In this case, the modified second region includes all of the modifications described in the paragraph “Engineered scaffold region 2 - second region deformation”. In this case, the modified fourth region and the fifth region include all of the modifications described in the paragraph “Engineered scaffold region 3 - fourth region and fifth region deformation”.

일 구현예로, 제2 영역, 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 서열은 다음을 포함할 수 있다:In one embodiment, the engineered scaffold sequence in which the second region and the fourth and fifth regions are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

서열번호 9,SEQ ID NO: 9,

서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431로 이뤄진 군에서 선택된 서열,SEQ ID NO: 38 to SEQ ID NO: 49, and a sequence selected from the group consisting of SEQ ID NO: 430 to SEQ ID NO: 431,

서열번호 11, 및SEQ ID NO: 11, and

5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 서열번호 66 내지 68로 이뤄진 군에서 선택된 서열이 연결된 서열; 및5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', a sequence to which a sequence selected from the group consisting of SEQ ID NOs: 66 to 68 is linked; and

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 및 5'-AAUGAAGGA-3'로 이뤄진 군에서 선택된 서열, 및5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', and 5 a sequence selected from the group consisting of '-AAUGAAGGA-3', and

5'-AUGCAAC-3'가 연결된 서열.5'-AUGCAAC-3' linked sequence.

일 구현예로, 제2 영역, 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 서열은 다음을 포함할 수 있다:In one embodiment, the engineered scaffold sequence in which the second region and the fourth and fifth regions are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAA-3'(서열번호 158); 및5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAA-3' (SEQ ID NO: 158); and

5'-GGAAUGCAAC-3'(서열번호 160).5'-GGAAUGCAAC-3' (SEQ ID NO: 160).

일 구현예로, 제2 영역 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'에서 3'말단 방향으로,In one embodiment, the sequence of the engineered scaffold region in which the second region and the fourth region and the fifth region are modified is in the 5′ to 3′ terminal direction,

서열번호 9,SEQ ID NO: 9,

서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431로 이뤄진 군에서 선택된 서열,SEQ ID NO: 38 to SEQ ID NO: 49, and a sequence selected from the group consisting of SEQ ID NO: 430 to SEQ ID NO: 431,

서열번호 11,SEQ ID NO: 11,

서열번호 110 내지 116에서 선택된 서열, 및a sequence selected from SEQ ID NOs: 110-116, and

5'-AUGCAAC-3'가 연결된 것일 수 있다.5'-AUGCAAC-3' may be linked.

일 구현예로, 제2 영역 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAGAAAGGAAUGCAAC-3'(서열번호 208)일 수 있다.In one embodiment, the sequence of the engineered scaffold region in which the second and fourth and fifth regions are modified may be 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAGAAAGGAAUGCAAC-3' (SEQ ID NO:208).

각 변형의 조합 4 - 제1 영역 변형, 제2 영역 변형, 및 제4 영역 및 제5 영역 변형Combination of each variant 4 - first region variant, second region variant, and fourth region and fifth region variant

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역은 변형된 제1 영역, 변형된 제2 영역 및 변형된 제4 영역 및 제5 영역을 포함한다. 이때, 상기 변형된 제1 영역은 "엔지니어링 된 스캐폴드 영역 1 - 제1 영역 변형" 단락에서 설명된 변형을 모두 포함한다. 이때, 상기 변형된 제2 영역은 "엔지니어링 된 스캐폴드 영역 2 - 제2 영역 변형" 단락에서 설명된 변형을 모두 포함한다. 이때, 상기 변형된 제4 영역 및 제5 영역은 "엔지니어링 된 스캐폴드 영역 3 - 제4 영역 및 제5 영역 변형" 단락에서 설명된 변형을 모두 포함한다.In one embodiment, the engineered scaffold region comprises a deformed first region, a deformed second region, and a fourth and fifth deformed region. In this case, the modified first region includes all the modifications described in the paragraph “Engineered scaffold region 1 - first region deformation”. In this case, the modified second region includes all of the modifications described in the paragraph “Engineered scaffold region 2 - second region deformation”. In this case, the modified fourth region and the fifth region include all of the modifications described in the paragraph “Engineered scaffold region 3 - fourth region and fifth region deformation”.

일 구현예로, 제1 영역, 제2 영역, 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 서열은 다음을 포함할 수 있다:In one embodiment, the engineered scaffold sequence in which the first region, the second region, and the fourth and fifth regions are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', a sequence selected from the group consisting of SEQ ID NOs: 16 to 26;

서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431로 이뤄진 군에서 선택된 서열,SEQ ID NO: 38 to SEQ ID NO: 49, and a sequence selected from the group consisting of SEQ ID NO: 430 to SEQ ID NO: 431,

서열번호 11, 및SEQ ID NO: 11, and

5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 서열번호 66 내지 68로 이뤄진 군에서 선택된 서열이 연결된 서열; 및5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', a sequence to which a sequence selected from the group consisting of SEQ ID NOs: 66 to 68 is linked; and

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 및 5'-AAUGAAGGA-3'로 이뤄진 군에서 선택된 서열, 및5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', and 5 a sequence selected from the group consisting of '-AAUGAAGGA-3', and

5'-AUGCAAC-3'가 연결된 서열.5'-AUGCAAC-3' linked sequence.

일 구현예로, 제1 영역, 제2 영역, 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 서열은 다음을 포함할 수 있다:In one embodiment, the engineered scaffold sequence in which the first region, the second region, and the fourth and fifth regions are modified may comprise:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-ACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAA-3'(서열번호 159); 및5'-ACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAA-3' (SEQ ID NO: 159); and

5'-GGAAUGCAAC-3'(서열번호 160).5'-GGAAUGCAAC-3' (SEQ ID NO: 160).

일 구현예로, 제1 영역, 제2 영역 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'에서 3'말단 방향으로,In one embodiment, the sequence of the engineered scaffold region in which the first region, the second region and the fourth region and the fifth region are modified is in the 5′ to 3′ terminal direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 서열번호 16 내지 26으로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', a sequence selected from the group consisting of SEQ ID NOs: 16 to 26;

서열번호 38 내지 서열번호 49, 및 서열번호 430 내지 서열번호 431로 이뤄진 군에서 선택된 서열,SEQ ID NO: 38 to SEQ ID NO: 49, and a sequence selected from the group consisting of SEQ ID NO: 430 to SEQ ID NO: 431,

서열번호 11,SEQ ID NO: 11,

서열번호 110 내지 116에서 선택된 서열, 및a sequence selected from SEQ ID NOs: 110-116, and

5'-AUGCAAC-3'가 연결된 것일 수 있다.5'-AUGCAAC-3' may be linked.

일 구현예로, 제1 영역, 제2 영역 및 제4 영역 및 제5 영역이 변형된 엔지니어링 된 스캐폴드 영역의 서열은 5'-ACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAGAAAGGAAUGCAAC-3'(서열번호 209)일 수 있다.In one embodiment, the sequence of the engineered scaffold region in which the first region, the second region and the fourth region and the fifth region are modified may be 5'-ACCGCUUCACCAUUAGUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAGAAAGGAAUGCAAC-3' (SEQ ID NO: 209).

각 변형의 조합 5 - 제3 영역, 및 제6 영역의 추가적인 변형Combination of each variant 5 - the third region, and further variants of the sixth region

전술한 바, 제3 영역 및 제6 영역 또한 그 기능이 손상되지 않는 범위 내에서 변형될 수 있으므로, 본 명세서에서 제공하는 엔지니어링 된 스캐폴드 영역은 전술한 제1 영역, 제2 영역, 제4 영역, 및/또는 제5 영역의 변형 외, 제3 영역 및/또는 제6 영역이 추가적으로 변형된 것일 수 있다.As described above, since the third region and the sixth region can also be deformed within a range in which their functions are not impaired, the engineered scaffold region provided herein is the first region, the second region, and the fourth region. , and/or in addition to the deformation of the fifth region, the third region and/or the sixth region may be additionally deformed.

엔지니어링 된 스캐폴드 영역 7 - 상동성 있는 서열 포함Engineered Scaffold Region 7 - Contains Homologous Sequences

본 명세서에서 제공하는 엔지니어링 된 스캐폴드 영역은 "엔지니어링 된 스캐폴드 영역 1 - 제1 영역 변형", "엔지니어링 된 스캐폴드 영역 2 - 제2 영역 변형", "엔지니어링 된 스캐폴드 영역 3 - 제3 영역 변형", "엔지니어링 된 스캐폴드 영역 4 - 제4 영역 및 제5 영역 변형", 및 "엔지니어링 된 스캐폴드 영역 5 - 각 변형의 조합" 단락에서 서술된 엔지니어링 된 스캐폴드 영역(이하, 전술한 엔지니어링 된 스캐폴드 영역)의 서열과 상동성 있는 서열을 포함한다.Engineered scaffold regions provided herein include "Engineered Scaffold Region 1 - First Region Variant", "Engineered Scaffold Region 2 - Second Region Variant", "Engineered Scaffold Region 3 - Third Region Variant" The engineered scaffold regions described in the paragraphs “Engineered Scaffold Region 4 - Fourth Region and Fifth Region Modifications”, and “Engineered Scaffold Region 5 - Combination of Each Variant” (hereinafter referred to as engineering scaffold regions). and a sequence homologous to the sequence of the scaffold region).

일 구현예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 전술한 엔지니어링 된 스캐폴드 영역의 서열 중 어느 하나와 100%, 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 52%, 51%, 또는 50% 일치하는, 또는 상동성 있는 서열일 수 있다. 일 구현예로, 상기 스캐폴드 서열은 전술한 엔지니어링 된 스캐폴드 영역의 서열 중 어느 하나와 바로 이전 문장에서 선택된 두 수치 범위 내 일치하는 서열일 수 있다. 예를 들어, 상기 스캐폴드 서열은 전술한 엔지니어링 된 스캐폴드 영역의 서열 중 어느 하나와 90% 내지 100% 일치하는 서열일 수 있다.In one embodiment, the sequence of the engineered scaffold region is 100%, 99%, 98%, 97%, 96%, 95%, 94%, 93% of any one of the sequences of the engineered scaffold region described above. , 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76 %, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 52%, 51%, or 50% identical, or homologous sequences. In one embodiment, the scaffold sequence may be a sequence that matches any one of the sequences of the engineered scaffold region described above within two numerical ranges selected in the immediately preceding sentence. For example, the scaffold sequence may be a sequence that is 90% to 100% identical to any one of the sequences of the engineered scaffold region described above.

엔지니어링 된 Cas12f1 가이드 RNAEngineered Cas12f1 guide RNA

엔지니어링 된 Cas12f1 가이드 RNA 개괄Engineered Cas12f1 Guide RNA Overview

본 명세서에서는 CRISPR/Cas12f1 시스템의 세포 내 유전자 편집 효율을 높이기 위한, 엔지니어링 된 Cas12f1 가이드 RNA를 제공한다. 상기 엔지니어링 된 Cas12f1 가이드 RNA는 엔지니어링 된 스캐폴드, 및 스페이서를 포함한다. 이때, 상기 엔지니어링 된 스캐폴드는 전술한 "엔지니어링 된 스캐폴드 영역"에서 설명된 것 중 어느 하나일 수 있다.Herein, an engineered Cas12f1 guide RNA is provided to increase the intracellular gene editing efficiency of the CRISPR/Cas12f1 system. The engineered Cas12f1 guide RNA comprises an engineered scaffold, and a spacer. In this case, the engineered scaffold may be any one of those described in the above-mentioned "engineered scaffold region".

싱글 가이드 RNA 또는 듀얼 가이드 RNASingle guide RNA or dual guide RNA

상기 엔지니어링 된 Cas12f1 가이드 RNA는 싱글 가이드 RNA 또는 듀얼 가이드 RNA일 수 있다. 상기 듀얼 가이드 RNA는 가이드 RNA가 tracrRNA 및 crRNA의 두 분자 RNA로 구성된 것을 의미한다. 상기 싱글 가이드 RNA는 (엔지니어링 된) tracrRNA의 3'말단 및 (엔지니어링 된) crRNA의 5' 말단이 링커를 통해 연결된 것을 의미한다. 달리 표현하면, 상기 듀얼 가이드 RNA에서, 엔지니어링 된 스캐폴드에 포함된 제4 영역의 3'말단 및 제5 영역의 5'말단이 링커를 통해 연결된 것을 의미한다. 이때, 엔지니어링 된 스캐폴드의 각 영역은 "엔지니어링 된 스캐폴드 영역" 단락들에서 설명된 변형, 및 그 구체적인 서열을 모두 포함할 수 있다.The engineered Cas12f1 guide RNA may be a single guide RNA or a dual guide RNA. The dual guide RNA means that the guide RNA is composed of two molecular RNAs, tracrRNA and crRNA. The single guide RNA means that the 3' end of the (engineered) tracrRNA and the 5' end of the (engineered) crRNA are connected through a linker. In other words, in the dual guide RNA, it means that the 3' end of the fourth region and the 5' end of the fifth region included in the engineered scaffold are linked through a linker. In this case, each region of the engineered scaffold may include all of the modifications described in the “Engineered scaffold region” paragraphs and a specific sequence thereof.

엔지니어링 된 싱글 가이드 RNA 예시 1Engineered single guide RNA Example 1

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결된 것일 수 있다.In one embodiment, the engineered Cas12f1 guide RNA may be one in which an engineered scaffold region and a spacer are connected in order from the 5' end to the 3' end.

상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가진다.The spacer has a length of 10 to 50 nucleotides and has a sequence complementary to the target sequence.

상기 엔지니어링 된 스캐폴드 영역은 5'말단에서 3'말단 방향으로, 자연계에서 발견되는 스캐폴드 영역과 대응되는 제1 영역, 제2 영역, 제3 영역, 제4 영역, 링커, 제5 영역, 제6 영역이 순서대로 연결된 것이며, 자연계에서 발견되는 스캐폴드 영역과 비교해 제1 영역, 제2 영역, 제4 영역, 및 제5 영역에서 선택된 하나 이상의 영역이 변형된 것일 수 있다.The engineered scaffold region is a first region, a second region, a third region, a fourth region, a linker, a fifth region, The six regions are sequentially connected, and one or more regions selected from the first region, the second region, the fourth region, and the fifth region may be modified compared to the scaffold region found in nature.

일 예로, 상기 엔지니어링 된 스캐폴드 영역의 제1 영역이 변형된 경우, 상기 변형된 제1 영역은 자연계에서 발견되는 스캐폴드 영역의 제1 영역에서 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다. 이때, 상기 제거된 뉴클레오타이드는 상기 제1 영역 중 Stem 1 (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))에 속하는 뉴클레오타이드일 수 있다. 이때, 상기 변형된 제1 영역의 서열은 5'-A-3'을 포함하는 것을 특징으로 한다.For example, when the first region of the engineered scaffold region is modified, the modified first region may have one or more nucleotides removed from the first region of the scaffold region found in nature. In this case, the removed nucleotide may be a nucleotide belonging to Stem 1 (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)) in the first region. In this case, the sequence of the modified first region is characterized in that it includes 5'-A-3'.

또 다른 예로, 상기 엔지니어링 된 스캐폴드 영역의 제2 영역이 변형된 경우, 상기 변형된 제2 영역은 자연계에서 발견되는 스캐폴드 영역의 제2 영역에서 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다. 이때, 상기 뉴클레오타이드의 제거는 상기 제2 영역 중 Stem 2 구조(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))를 형성하는 부분에서 일어난 것이고, 서로 베이스 페어를 이루는 뉴클레오타이드가 쌍 단위로 제거된 것일 수 있다. 이때, 상기 변형된 제2 영역의 서열은 적어도 5'-CCGCUUCAC-3'(서열번호 432) 및 5'-UGAGUGAAGG-3'(서열번호 433)을 포함하는 것을 특징으로 한다. 더 구체적으로, 상기 변형된 제2 영역의 서열은 5'말단에서 3'말단 방향으로 5'-CCGCUUCAC-3'(서열번호 432) 및 5'-UGAGUGAAGG-3'(서열번호 433)가 순서대로 연결되어 있을 수 있으며, 적절한 중간 서열을 통해 연결되어 있을 수 있다. 일 예로, 상기 중간 서열은 5'-UUAG-3', 5'-AUUAGU-3', 5'-AAUUAGCU-3', 5'-AAAUUAGACU-3'(서열번호 57), 5'-AAAGUUAGAACU-3'(서열번호 58), 5'-AAAGCUUAGGAACU-3'(서열번호 59), 5'-AAAGCUUUAGAGAACU-3'(서열번호 60), 5'-AAAGCUGUUAGUUAGAACU-3'(서열번호 61), 5'-AAAGCUGUUAGUAGAACU-3'(서열번호 62), 5'-AAAGCUGUUUAGAUUAGAACU-3'(서열번호 63), 5'-AAAGCUGUCUUAGGAUUAGAACU-3'(서열번호 64), 5'-AAAGCUGUCCUUAGGGAUUAGAACU-3'(서열번호 65), 5'-AAAAGCUGUCCCUUAGGGGAUUAGAACUU-3'(서열번호 434), 및 5'-CAAAAGCUGUCCCUUAGGGGAUUAGAACUUG-3'(서열번호 435)로 이뤄진 군에서 선택된 것일 수 있다.As another example, when the second region of the engineered scaffold region is modified, the modified second region may have one or more nucleotides removed from the second region of the scaffold region found in nature. At this time, the removal of the nucleotide occurred in the portion forming the Stem 2 structure (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021)) of the second region. and nucleotides constituting a base pair with each other may be removed in pairs. In this case, the sequence of the modified second region is characterized in that it comprises at least 5'-CCGCUUCAC-3' (SEQ ID NO: 432) and 5'-UGAGUGAAGG-3' (SEQ ID NO: 433). More specifically, the sequence of the modified second region is 5'-CCGCUUCAC-3' (SEQ ID NO: 432) and 5'-UGAGUGAAGG-3' (SEQ ID NO: 433) in the order from the 5' end to the 3' end. They may be linked, and may be linked through an appropriate intermediate sequence. In one example, the intermediate sequence is 5'-UUAG-3', 5'-AUUAGU-3', 5'-AAUUAGCU-3', 5'-AAAUUAGACU-3' (SEQ ID NO: 57), 5'-AAAGUUAGAACU-3 '(SEQ ID NO: 58), 5'-AAAGCUUAGGAACU-3' (SEQ ID NO: 59), 5'-AAAGCUUUAGAGAACU-3' (SEQ ID NO: 60), 5'-AAAGCUGUUAGUUAGAACU-3' (SEQ ID NO: 61), 5'-AAAGCUGUUAGUAGAACU -3' (SEQ ID NO: 62), 5'-AAAGCUGUUUAGAUUAGAACU-3' (SEQ ID NO: 63), 5'-AAAGCUGUCUUAGGAUUAGAACU-3' (SEQ ID NO: 64), 5'-AAAGCUGUGUCCUUAGGGAUUAGAACU-3' (SEQ ID NO: 65), 5' -AAAAGCUGUCCCUUAGGGGAUUAGAACUU-3' (SEQ ID NO: 434), and 5'-CAAAAGCUGUCCCUUAGGGGAUUAGAACUUG-3' (SEQ ID NO: 435) may be selected from the group consisting of.

또 다른 예로, 상기 엔지니어링 된 스캐폴드 영역의 제4 영역 및 제5 영역이 변형된 경우, 상기 변형된 제4 영역 및 제5 영역은 자연계에서 발견되는 스캐폴드 영역 중 제4 영역 및/또는 제5 영역의 하나 이상의 뉴클레오타이드가 제거된 것일 수 있다. 이때, 상기 뉴클레오타이드의 제거는 상기 제4 영역 및 제5 영역 중 Stem 5 (R:AR-2) 구조(Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1-13 (2021))를 형성하는 부분에서 일어난 것이고, 서로 베이스 페어를 이루는 뉴클레오타이드가 쌍 단위로 제거된 것일 수 있다. 이때, 상기 변형된 제4 영역의 서열은 적어도 5'-AACAAA-3'을 포함하는 것을 특징으로 한다. 이때, 상기 변형된 제5 영역의 서열은 적어도 5'-GGA-3'을 포함하는 것을 특징으로 한다.As another example, when the fourth region and the fifth region of the engineered scaffold region are modified, the modified fourth region and the fifth region are the fourth region and/or the fifth region of the scaffold region found in nature. One or more nucleotides of the region may be removed. At this time, the removal of the nucleotides is a Stem 5 (R:AR-2) structure of the fourth region and the fifth region (Takeda et al., Structure of the miniature type V-F CRISPR-Cas effector enzyme, Molecular Cell 81, 1- 13 (2021)), and nucleotides constituting a base pair with each other may be removed in pairs. In this case, the sequence of the modified fourth region is characterized in that it includes at least 5'-AACAAA-3'. In this case, the sequence of the modified fifth region is characterized in that it includes at least 5'-GGA-3'.

엔지니어링 된 싱글 가이드 RNA 예시 2Engineered Single Guide RNA Example 2

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결된 것일 수 있다.In one embodiment, the engineered Cas12f1 guide RNA may be one in which an engineered scaffold region and a spacer are connected in order from the 5' end to the 3' end.

상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가진다.The spacer has a length of 10 to 50 nucleotides and has a sequence complementary to the target sequence.

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것이다:The sequence of the engineered scaffold region is one in which the following sequences are linked in order from the 5' end to the 3' end:

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and

5'-AUGCAAC-3',5'-AUGCAAC-3',

이때, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 한다.At this time, the sequence of the engineered scaffold region is different from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAACAAUGAAGGAA sequence number 7).

엔지니어링 된 싱글 가이드 RNA 예시 3Engineered Single Guide RNA Example 3

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결된 것일 수 있다.In one embodiment, the engineered Cas12f1 guide RNA may be one in which an engineered scaffold region and a spacer are connected in order from the 5' end to the 3' end.

상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가진다.The spacer has a length of 10 to 50 nucleotides and has a sequence complementary to the target sequence.

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것이다:The sequence of the engineered scaffold region is one in which the following sequences are linked in order from the 5' end to the 3' end:

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3'(서열번호 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3'(서열번호 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGA-3'(서열번호 295), 5'-AACAAAUUCAUUUUUCCGAAAAGACGAAUGAAGGA-3'(서열번호 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3'(서열번호 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3'(서열번호 298), 5'-AACAAAUUCAUUUUUCCUCUGAAAAAUAGACGAAUGAAGGA-3'(서열번호 299), 5'-AACAAAUUCAUUUUUCCUCUCGAAAGAAUAGACGAAUGAAGGA-3'(서열번호 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3'(서열번호 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3'(서열번호 302), 5'-AACAAAUUCAUUUUUCCUCUCCAAGAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 303), 5'-AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 312), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 313), 및 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 314)로 이뤄진 군에서 선택된 서열; 및5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) No. 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3' (SEQ ID NO: 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3' (SEQ ID NO: 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAAU-GAAAAAGGAUGA (SEQ ID NO: 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 298), 5'-AACAAAUUCAUUCAUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAACUGAAUGAAUUUUUCCUCUGAAUGAAGAAUUCA 3' (SEQ ID NO: 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAGU-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAU AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308 ), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열 312), 5'-AACAAAUUCAUUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 313), and 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO:314) selected from the group consisting of; and

5'-AUGCAAC-3',5'-AUGCAAC-3',

이때, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3'(서열번호 315)과 상이한 것을 특징으로 한다.At this time, the sequence of the engineered scaffold region is different from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCUCGGAAAGUAACCCUCGAAACAAAUUCAUUCAUUGAACCUCGAAUCAUUGAACAAUUCCAACCCUCGAAACAAAUUGACAGUCAUCAUGACAUUGAACAAUUCA with the sequence that is different from the sequence.

엔지니어링 된 싱글 가이드 RNA 예시 4Engineered Single Guide RNA Example 4

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결된 것일 수 있다.In one embodiment, the engineered Cas12f1 guide RNA may be one in which an engineered scaffold region and a spacer are connected in order from the 5' end to the 3' end.

상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가진다.The spacer has a length of 10 to 50 nucleotides and has a sequence complementary to the target sequence.

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것이다:The sequence of the engineered scaffold region is one in which the following sequences are linked in order from the 5' end to the 3' end:

5'-A-3'로 표현되는 제1 서열;a first sequence represented by 5'-A-3';

5'-CCGCUUCAC-3'(서열번호 432)로 표현되는 제2 서열;a second sequence represented by 5'-CCGCUUCAC-3' (SEQ ID NO: 432);

5'-UUAG-3'로 표현되는 제3 서열;a third sequence represented by 5'-UUAG-3';

5'-UGAGUGAAGG-3'(서열번호 433)로 표현되는 제4 서열;a fourth sequence represented by 5'-UGAGUGAAGG-3' (SEQ ID NO: 433);

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11)로 표현되는 제5 서열;a fifth sequence represented by 5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAA-3'로 표현되는 제6 서열;a sixth sequence represented by 5'-AACAAA-3';

링커;linker;

5'-GGA-3'로 표현되는 제7 서열; 및a seventh sequence represented by 5'-GGA-3'; and

5'-AUGCAAC-3'로 표현되는 제8 서열.The eighth sequence represented by 5'-AUGCAAC-3'.

일 예로, 상기 링커는 5'-GAAA-3'일 수 있다.For example, the linker may be 5'-GAAA-3'.

또 다른 예로, 상기 링커는 5'-GAAA-3', 5'-UGAAAA-3', 5'-UUGAAAAA-3', 5'-UUCGAAAGAA-3'(서열번호 425), 5'-UUCAGAAAUGAA-3'(서열번호 426), 5'-UUCAUGAAAAUGAA-3'(서열번호 427), 5'-UUCAUUGAAAAAUGAA-3'(서열번호 428), 및 5'-UUCAUUUGAAAGAAUGAAGGA-3'(서열번호 429)로 이뤄진 군에서 선택된 것일 수 있다.In another example, the linker is 5'-GAAA-3', 5'-UGAAAA-3', 5'-UUGAAAAA-3', 5'-UUCGAAAGAA-3' (SEQ ID NO: 425), 5'-UUCAGAAAUGAA-3 In the group consisting of '(SEQ ID NO: 426), 5'-UUCAUGAAAAUGAA-3' (SEQ ID NO: 427), 5'-UUCAUUGAAAAAUGAA-3' (SEQ ID NO: 428), and 5'-UUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 429) may be selected.

상기 구현예의 일 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-GA-3', 5'-AGA-3', 5'-GAGA-3', 5'-GGAGA-3', 5'-UGGAGA-3', 5'-GUGGAGA-3', 5'-AGUGGAGA-3', 5'-AAGUGGAGA-3', 5'-AAAGUGGAGA-3'(서열번호 27), 5'-UAAAGUGGAGA-3'(서열번호 28), 5'-AUAAAGUGGAGA-3'(서열번호 29), 5'-GAUAAAGUGGAGA-3'(서열번호 30), 5'-UGAUAAAGUGGAGA-3'(서열번호 31), 5'-CUGAUAAAGUGGAGA-3'(서열번호 32), 5'-ACUGAUAAAGUGGAGA-3'(서열번호 33), 5'-CACUGAUAAAGUGGAGA-3'(서열번호 34), 5'-UCACUGAUAAAGUGGAGA-3'(서열번호 35), 5'-UUCACUGAUAAAGUGGAGA-3'(서열번호 36), 및 5'-CUUCACUGAUAAAGUGGAGA-3'(서열번호 37)로 이뤄진 군에서 선택된 제9 서열을 추가적으로 포함할 수 있다. 이때, 상기 제9 서열의 3'말단은 상기 제1 서열의 5'말단과 연결되어 있을 수 있다.In one embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-A-3', 5'-GA-3', 5'-AGA-3', 5'-GAGA-3', 5 '-GGAGA-3', 5'-UGGAGA-3', 5'-GUGGAGA-3', 5'-AGUGGAGA-3', 5'-AAUGGAGA-3', 5'-AAAGUGGAGA-3' (SEQ ID NO: 27 ), 5'-UAAAAGUGGAGA-3' (SEQ ID NO: 28), 5'-AUAAAGUGGAGA-3' (SEQ ID NO: 29), 5'-GAUAAAGUGGAGA-3' (SEQ ID NO: 30), 5'-UGAUAAAGUGGAGA-3' (SEQ ID NO: 30) No. 31), 5'-CUGAUAAAAGUGGAGA-3' (SEQ ID NO: 32), 5'-ACUGAUAAAGUGGAGA-3' (SEQ ID NO: 33), 5'-CACUGAUAAAAGUGGAGA-3' (SEQ ID NO: 34), 5'-UCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 35), 5'-UUCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 36), and 5'-CUUCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 37). It may further include a ninth sequence selected from the group consisting of. In this case, the 3' end of the ninth sequence may be linked to the 5' end of the first sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3'(서열번호 52), 5'-AAAGCUGUCCC-3'(서열번호 53), 5'-AAAAGCUGUCCC-3'(서열번호 440), 및 5'-CAAAAGCUGUCCC-3'(서열번호 441)로 이뤄진 군에서 선택된 제10 서열을 추가적으로 포함할 수 있다. 이때, 상기 제2 서열의 3'말단 및 상기 제3 서열의 5'말단은 상기 제10 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3' (SEQ ID NO: 52), 5'-AAAGCUGUCCC-3' (SEQ ID NO: 53), 5'-AAAAGCUGUCCC-3' (SEQ ID NO: 440), and 5'-CAAAAGCUGUCCC-3' (SEQ ID NO: 441) a tenth sequence selected from the group consisting of may additionally include. In this case, the 3' end of the second sequence and the 5' end of the third sequence may be connected through the 10th sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5'-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3'(서열번호 54), 5'-GGAUUAGAACU-3'(서열번호 55), 5'-GGGAUUAGAACU-3'(서열번호 56), 5'-GGGAUUAGAACUU-3'(서열번호 442), 및 5'-GGGAUUAGAACUUG-3'(서열번호 443)로 이뤄진 군에서 선택된 제11 서열을 추가적으로 포함할 수 있다. 이때, 상기 제3 서열의 3'말단 및 상기 제4 서열의 5'말단은 상기 제11 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5'-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3' (SEQ ID NO: 54), 5'-GGAUUAGAACU-3' (SEQ ID NO: 55), 5'-GGGAUUAGAACU-3' (SEQ ID NO: 56), 5'-GGGAUUAGAACUU-3' (SEQ ID NO: 442), and 5'-GGGAUUAGAACUUG-3' (SEQ ID NO: 443) may additionally include an 11th sequence selected from the group consisting of. In this case, the 3' end of the third sequence and the 5' end of the fourth sequence may be connected through the eleventh sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3'(서열번호 52), 5'-AAAGCUGUCCC-3'(서열번호 53), 5'-AAAAGCUGUCCC-3'(서열번호 440), 및 5'-CAAAAGCUGUCCC-3'(서열번호 441)로 이뤄진 군에서 선택된 제10 서열 및 5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5'-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3'(서열번호 54), 5'-GGAUUAGAACU-3'(서열번호 55), 5'-GGGAUUAGAACU-3'(서열번호 56), 5'-GGGAUUAGAACUU-3'(서열번호 442), 및 5'-GGGAUUAGAACUUG-3'(서열번호 443)로 이뤄진 군에서 선택된 제11 서열을 추가적으로 포함할 수 있다. 이때, 상기 제2 서열의 3'말단 및 상기 제3 서열의 5'말단은 상기 제10 서열을 통해 연결되어 있고, 상기 제3 서열의 3'말단 및 상기 제4 서열의 5'말단은 상기 제11 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3' (SEQ ID NO: 52), 5'-AAAGCUGUCCC-3' (SEQ ID NO: 53), 5'-AAAAGCUGUCCC-3' (SEQ ID NO: 440), and 5'-CAAAAGCUGUCCC-3' (SEQ ID NO: 441) a tenth sequence selected from the group consisting of and 5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5 '-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3' (SEQ ID NO: 54), 5'-GGAUUAGAACU-3' (SEQ ID NO: 55), 5 '-GGGAUUAGAACU-3' (SEQ ID NO: 56), 5'-GGGAUUAGAACUU-3' (SEQ ID NO: 442), and 5'-GGGAUUAGAACUUG-3' (SEQ ID NO: 443) further comprising an 11th sequence selected from the group consisting of can In this case, the 3' end of the second sequence and the 5' end of the third sequence are connected through the 10th sequence, and the 3' end of the third sequence and the 5' end of the fourth sequence are 11 may be linked through a sequence.

일 예로, 상기 제10 서열이 5'-A-3'인 경우, 상기 제11 서열은 5'-U-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AA-3'인 경우, 상기 제11 서열은 5'-CU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAA-3'인 경우, 상기 제11 서열은 5'-ACU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAG-3'인 경우, 상기 제11 서열은 5'-AACU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGC-3'인 경우, 상기 제11 서열은 5'-GAACU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCU-3'인 경우, 상기 제11 서열은 5'-AGAACU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUG-3'인 경우, 상기 제11 서열은 5'-UAGAACU-3' 또는 5'-UUAGAACU-3' 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUGU-3'인 경우, 상기 제11 서열은 5'-AUUAGAACU-3' 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUGUC-3'인 경우, 상기 제11 서열은 5'-GAUUAGAACU-3'(서열번호 54) 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUGUCC-3'(서열번호 52)인 경우, 상기 제11 서열은 5'-GGAUUAGAACU-3'(서열번호 55) 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUGUCCC-3'(서열번호 53)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACU-3'(서열번호 56) 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAAGCUGUCCC-3'(서열번호 440)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACUU-3'(서열번호 442) 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-CAAAAGCUGUCCC-3'(서열번호 441)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACUUG-3'(서열번호 443) 일 수 있다.For example, when the tenth sequence is 5'-A-3', the eleventh sequence may be 5'-U-3'. As another example, when the tenth sequence is 5'-AA-3', the eleventh sequence may be 5'-CU-3'. As another example, when the tenth sequence is 5'-AAA-3', the eleventh sequence may be 5'-ACU-3'. As another example, when the tenth sequence is 5'-AAAG-3', the eleventh sequence may be 5'-AACU-3'. As another example, when the tenth sequence is 5'-AAAGC-3', the eleventh sequence may be 5'-GAACU-3'. As another example, when the tenth sequence is 5'-AAAGCU-3', the eleventh sequence may be 5'-AGAACU-3'. As another example, when the tenth sequence is 5'-AAAGCUG-3', the eleventh sequence may be 5'-UAGAACU-3' or 5'-UUAGAACU-3'. As another example, when the tenth sequence is 5'-AAAGCUGU-3', the eleventh sequence may be 5'-AUUAGAACU-3'. As another example, when the tenth sequence is 5'-AAAGCUGUC-3', the eleventh sequence may be 5'-GAUUAGAACU-3' (SEQ ID NO: 54). As another example, when the tenth sequence is 5'-AAAGCUGUCC-3' (SEQ ID NO: 52), the eleventh sequence may be 5'-GGAUUAGAACU-3' (SEQ ID NO: 55). As another example, when the tenth sequence is 5'-AAAGCUGUCCC-3' (SEQ ID NO: 53), the eleventh sequence may be 5'-GGGAUUAGAACU-3' (SEQ ID NO: 56). As another example, when the tenth sequence is 5'-AAAAGCUGUCCC-3' (SEQ ID NO: 440), the eleventh sequence may be 5'-GGGAUUAGAACUU-3' (SEQ ID NO: 442). As another example, when the tenth sequence is 5'-CAAAAGCUGUCCC-3' (SEQ ID NO: 441), the eleventh sequence may be 5'-GGGAUUAGAACUUG-3' (SEQ ID NO: 443).

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 5'-UUCAU-3', 5'-UUCAUU-3', 및 5'-UUCAUUU-3'로 이뤄진 군에서 선택된 제 12서열을 추가적으로 포함할 수 있다. 이때, 상기 제6 서열의 3'말단 및 상기 링커의 5'말단은 상기 제12 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', It may additionally include a twelfth sequence selected from the group consisting of 5'-UUCAU-3', 5'-UUCAUU-3', and 5'-UUCAUUU-3'. In this case, the 3' end of the sixth sequence and the 5' end of the linker may be connected through the twelfth sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', 및 5'-GAAUGAA-3'로 이뤄진 군에서 선택된 제 13서열을 추가적으로 포함할 수 있다. 이때, 상기 링커의 3'말단 및 상기 제7 서열의 5'말단은 상기 제13 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', It may additionally include a 13th sequence selected from the group consisting of 5'-AUGAA-3', 5'-AAUGAA-3', and 5'-GAAUGAA-3'. In this case, the 3' end of the linker and the 5' end of the seventh sequence may be connected through the thirteenth sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 5'-UUCAU-3', 5'-UUCAUU-3', 및 5'-UUCAUUU-3'로 이뤄진 군에서 선택된 제 12서열, 및 5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', 및 5'-GAAUGAA-3'로 이뤄진 군에서 선택된 제 13서열을 추가적으로 포함할 수 있다. 이때, 상기 제6 서열의 3'말단 및 상기 링커의 5'말단은 상기 제12 서열을 통해 연결되고, 상기 링커의 3'말단 및 상기 제7 서열의 5'말단은 상기 제13 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 12th sequence selected from the group consisting of 5'-UUCAU-3', 5'-UUCAUU-3', and 5'-UUCAUUU-3', and 5'-A-3', 5'-AA-3', A 13th sequence selected from the group consisting of 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', and 5'-GAAUGAA-3' is additionally added may include In this case, the 3' end of the sixth sequence and the 5' end of the linker are connected through the twelfth sequence, and the 3' end of the linker and the 5' end of the seventh sequence are connected through the thirteenth sequence may have been

일 예로, 상기 제12 서열이 5'-U-3'인 경우, 상기 제13 서열은 5'-A-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UU-3'인 경우, 상기 제13 서열은 5'-AA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUC-3'인 경우, 상기 제13 서열은 5'-GAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCA-3'인 경우, 상기 제13 서열은 5'-UGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAU-3'인 경우, 상기 제13 서열은 5'-AUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUU-3'인 경우, 상기 제13 서열은 5'-AAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUU-3'인 경우, 상기 제13 서열은 5'-GAAUGAA-3'일 수 있다.For example, when the twelfth sequence is 5'-U-3', the thirteenth sequence may be 5'-A-3'. As another example, when the twelfth sequence is 5'-UU-3', the thirteenth sequence may be 5'-AA-3'. As another example, when the twelfth sequence is 5'-UUC-3', the thirteenth sequence may be 5'-GAA-3'. As another example, when the twelfth sequence is 5'-UUCA-3', the thirteenth sequence may be 5'-UGAA-3'. As another example, when the twelfth sequence is 5'-UUCAU-3', the thirteenth sequence may be 5'-AUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUU-3', the thirteenth sequence may be 5'-AAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUU-3', the thirteenth sequence may be 5'-GAAUGAA-3'.

엔지니어링 된 싱글 가이드 RNA 예시 5Example of engineered single guide RNA 5

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결된 것일 수 있다.In one embodiment, the engineered Cas12f1 guide RNA may be one in which an engineered scaffold region and a spacer are connected in order from the 5' end to the 3' end.

상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가진다.The spacer has a length of 10 to 50 nucleotides and has a sequence complementary to the target sequence.

상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것이다:The sequence of the engineered scaffold region is one in which the following sequences are linked in order from the 5' end to the 3' end:

5'-A-3'로 표현되는 제1 서열;a first sequence represented by 5'-A-3';

5'-CCGCUUCAC-3'(서열번호 432)로 표현되는 제2 서열;a second sequence represented by 5'-CCGCUUCAC-3' (SEQ ID NO: 432);

5'-UUAG-3'로 표현되는 제3 서열;a third sequence represented by 5'-UUAG-3';

5'-UGAGUGAAGG-3'(서열번호 433)로 표현되는 제4 서열;a fourth sequence represented by 5'-UGAGUGAAGG-3' (SEQ ID NO: 433);

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11)로 표현되는 제5 서열;a fifth sequence represented by 5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);

5'-AACAAA-3'로 표현되는 제6 서열;a sixth sequence represented by 5'-AACAAA-3';

링커;linker;

5'-GGA-3'로 표현되는 제7 서열; 및a seventh sequence represented by 5'-GGA-3'; and

5'-AUGCAAC-3'로 표현되는 제8 서열.The eighth sequence represented by 5'-AUGCAAC-3'.

일 예로, 상기 링커는 5'-GAAA-3'일 수 있다.For example, the linker may be 5'-GAAA-3'.

또 다른 예로, 상기 링커는 5'-GAAA-3', 5'-AGAAAG-3', 5'-AAGAAAGU-3', 5'-CAAGAAAGUU-3'(서열번호 316), 5'-ACAAGAAAGUUG-3'(서열번호 317), 5'-CACAAGAAAGUUGC-3'(서열번호 318), 5'-GCACAAGAAAGUUGCA-3'(서열번호 319), 5'-UGCACAAGAAAGUUGCAG-3'(서열번호 320), 5'-CUGCACAAGAAAGUUGCAGA-3'(서열번호 321), 5'-UCUGCACAAGAAAGUUGCAGAA-3'(서열번호 322), 5'-UUCUGCACAAGAAAGUUGCAGAAC-3'(서열번호 323), 5'-AUUCUGCACAAGAAAGUUGCAGAACC-3'(서열번호 324), 5'-AAUUCUGCACAAGAAAGUUGCAGAACCC-3'(서열번호 325), 5'-CAAUUCUGCACAAGAAAGUUGCAGAACCCG-3'(서열번호 326), 5'-CCAAUUCUGCACAAGAAAGUUGCAGAACCCGA-3'(서열번호 327), 5'-UCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAA-3'(서열번호 328), 5'-CUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAU-3'(서열번호 329), 5'-UCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUA-3'(서열번호 330), 5'-CUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAG-3'(서열번호 331), 5'-CCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGA-3'(서열번호 332), 5'-UCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGAC-3'(서열번호 333), 5'-UUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACG-3'(서열번호 334), 5'-UUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGA-3'(서열번호 335), 5'-UUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAA-3'(서열번호 336), 5'-UUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAU-3'(서열번호 337), 5'-AUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUG-3'(서열번호 338), 5'-CAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGA-3'(서열번호 339), 5'-UCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 340), 5'-UCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGA-3'(서열번호 341), 및 5'-UUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 342)로 이뤄진 군에서 선택된 것일 수 있다.In another example, the linker is 5'-GAAA-3', 5'-AGAAAG-3', 5'-AAGAAAGU-3', 5'-CAAGAAAGUU-3' (SEQ ID NO: 316), 5'-ACAAGAAAGUUG-3 '(SEQ ID NO: 317), 5'-CACAAGAAAGUUGC-3' (SEQ ID NO: 318), 5'-GCACAAGAAAGUUGCA-3' (SEQ ID NO: 319), 5'-UGCACAAGAAAGUUGCAG-3' (SEQ ID NO: 320), 5'-CUGCACAAGAAAGUUGCAGA -3' (SEQ ID NO: 321), 5'-UCUGCACAAGAAAGUUGCAGAA-3' (SEQ ID NO: 322), 5'-UUCUGCACAAAGAAAGUUGCAGAAC-3' (SEQ ID NO: 323), 5'-AUUCUGCACAAAGAAAGUUGCAGAACC-3' (SEQ ID NO: 324), 5' -AAUUCUGCACAAAGAAAGUUGCAGAACCC-3' (SEQ ID NO: 325), 5'-CAAUUCUGCACAAGAAAGUUGCAGAACCCG-3' (SEQ ID NO: 326), 5'-CCAAUUCUGCACAGAAAGUUGCAGAACCCGA-3' (SEQ ID NO: 327), 5'-UCCAAUUCUGCAGAACCCGAA (SEQ ID NO: 327), 5'-UCCAAUUCUGCAACCAAGAAA 5'-CUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAU-3' (SEQ ID NO: 329), 5'-UCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUA-3' (SEQ ID NO: 330), 5'-CUCUCCAAUUCUGCACAAUCGAAAGUUGCAGAACAAGAUAGUAG-3' (SEQ ID NO: 3CUGCAGAACAAGAAUGUAG-3' (SEQ ID NO: 331) ), 5'-UCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGAC-3' (SEQ ID NO: 333), 5'-UUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACG-3' (SEQ ID NO: 334), 5'-UUUCCUCUCCCAAUUCUGCACAAGAAAGUUGCA-3' (SEQ ID NO: 5), ACCCAGAAAGUUGCA -UUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAA-3'(서열번호 336), 5'-UUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAU-3'(서열번호 337), 5'-AUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUG-3'(서열번호 338), 5'-CAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGA-3'(서열번호 339), 5'-UCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 340), 5'-UCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAUGA-3' (SEQ ID NO: 341), and 5'-GACUAUUGAAUGA-3' (SEQ ID NO: 341).

상기 구현예의 일 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-GA-3', 5'-AGA-3', 5'-GAGA-3', 5'-GGAGA-3', 5'-UGGAGA-3', 5'-GUGGAGA-3', 5'-AGUGGAGA-3', 5'-AAGUGGAGA-3', 5'-AAAGUGGAGA-3'(서열번호 27), 5'-UAAAGUGGAGA-3'(서열번호 28), 5'-AUAAAGUGGAGA-3'(서열번호 29), 5'-GAUAAAGUGGAGA-3'(서열번호 30), 5'-UGAUAAAGUGGAGA-3'(서열번호 31), 5'-CUGAUAAAGUGGAGA-3'(서열번호 32), 5'-ACUGAUAAAGUGGAGA-3'(서열번호 33), 5'-CACUGAUAAAGUGGAGA-3'(서열번호 34), 5'-UCACUGAUAAAGUGGAGA-3'(서열번호 35), 5'-UUCACUGAUAAAGUGGAGA-3'(서열번호 36), 및 5'-CUUCACUGAUAAAGUGGAGA-3'(서열번호 37)로 이뤄진 군에서 선택된 제9 서열을 추가적으로 포함할 수 있다. 이때, 상기 제9 서열의 3'말단은 상기 제1 서열의 5'말단과 연결되어 있을 수 있다.In one embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-A-3', 5'-GA-3', 5'-AGA-3', 5'-GAGA-3', 5 '-GGAGA-3', 5'-UGGAGA-3', 5'-GUGGAGA-3', 5'-AGUGGAGA-3', 5'-AAUGGAGA-3', 5'-AAAGUGGAGA-3' (SEQ ID NO: 27 ), 5'-UAAAAGUGGAGA-3' (SEQ ID NO: 28), 5'-AUAAAGUGGAGA-3' (SEQ ID NO: 29), 5'-GAUAAAGUGGAGA-3' (SEQ ID NO: 30), 5'-UGAUAAAGUGGAGA-3' (SEQ ID NO: 30) No. 31), 5'-CUGAUAAAAGUGGAGA-3' (SEQ ID NO: 32), 5'-ACUGAUAAAGUGGAGA-3' (SEQ ID NO: 33), 5'-CACUGAUAAAAGUGGAGA-3' (SEQ ID NO: 34), 5'-UCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 35), 5'-UUCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 36), and 5'-CUUCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 37). It may further include a ninth sequence selected from the group consisting of. In this case, the 3' end of the ninth sequence may be linked to the 5' end of the first sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3'(서열번호 52), 5'-AAAGCUGUCCC-3'(서열번호 53), 5'-AAAAGCUGUCCC-3'(서열번호 440), 및 5'-CAAAAGCUGUCCC-3'(서열번호 441)로 이뤄진 군에서 선택된 제10 서열을 추가적으로 포함할 수 있다. 이때, 상기 제2 서열의 3'말단 및 상기 제3 서열의 5'말단은 상기 제10 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3' (SEQ ID NO: 52), 5'-AAAGCUGUCCC-3' (SEQ ID NO: 53), 5'-AAAAGCUGUCCC-3' (SEQ ID NO: 440), and 5'-CAAAAGCUGUCCC-3' (SEQ ID NO: 441) a tenth sequence selected from the group consisting of may additionally include. In this case, the 3' end of the second sequence and the 5' end of the third sequence may be connected through the 10th sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5'-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3'(서열번호 54), 5'-GGAUUAGAACU-3'(서열번호 55), 5'-GGGAUUAGAACU-3'(서열번호 56), 5'-GGGAUUAGAACUU-3'(서열번호 442), 및 5'-GGGAUUAGAACUUG-3'(서열번호 443)로 이뤄진 군에서 선택된 제11 서열을 추가적으로 포함할 수 있다. 이때, 상기 제3 서열의 3'말단 및 상기 제4 서열의 5'말단은 상기 제11 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5'-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3' (SEQ ID NO: 54), 5'-GGAUUAGAACU-3' (SEQ ID NO: 55), 5'-GGGAUUAGAACU-3' (SEQ ID NO: 56), 5'-GGGAUUAGAACUU-3' (SEQ ID NO: 442), and 5'-GGGAUUAGAACUUG-3' (SEQ ID NO: 443) may additionally include an 11th sequence selected from the group consisting of. In this case, the 3' end of the third sequence and the 5' end of the fourth sequence may be connected through the eleventh sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3'(서열번호 52), 5'-AAAGCUGUCCC-3'(서열번호 53), 5'-AAAAGCUGUCCC-3'(서열번호 440), 및 5'-CAAAAGCUGUCCC-3'(서열번호 441)로 이뤄진 군에서 선택된 제10 서열 및 5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5'-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3'(서열번호 54), 5'-GGAUUAGAACU-3'(서열번호 55), 5'-GGGAUUAGAACU-3'(서열번호 56), 5'-GGGAUUAGAACUU-3'(서열번호 442), 및 5'-GGGAUUAGAACUUG-3'(서열번호 443)로 이뤄진 군에서 선택된 제11 서열을 추가적으로 포함할 수 있다. 이때, 상기 제2 서열의 3'말단 및 상기 제3 서열의 5'말단은 상기 제10 서열을 통해 연결되어 있고, 상기 제3 서열의 3'말단 및 상기 제4 서열의 5'말단은 상기 제11 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3' (SEQ ID NO: 52), 5'-AAAGCUGUCCC-3' (SEQ ID NO: 53), 5'-AAAAGCUGUCCC-3' (SEQ ID NO: 440), and 5'-CAAAAGCUGUCCC-3' (SEQ ID NO: 441) a tenth sequence selected from the group consisting of and 5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5 '-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3' (SEQ ID NO: 54), 5'-GGAUUAGAACU-3' (SEQ ID NO: 55), 5 '-GGGAUUAGAACU-3' (SEQ ID NO: 56), 5'-GGGAUUAGAACUU-3' (SEQ ID NO: 442), and 5'-GGGAUUAGAACUUG-3' (SEQ ID NO: 443) further comprising an 11th sequence selected from the group consisting of can In this case, the 3' end of the second sequence and the 5' end of the third sequence are connected through the 10th sequence, and the 3' end of the third sequence and the 5' end of the fourth sequence are 11 may be linked through a sequence.

일 예로, 상기 제10 서열이 5'-A-3'인 경우, 상기 제11 서열은 5'-U-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AA-3'인 경우, 상기 제11 서열은 5'-CU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAA-3'인 경우, 상기 제11 서열은 5'-ACU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAG-3'인 경우, 상기 제11 서열은 5'-AACU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGC-3'인 경우, 상기 제11 서열은 5'-GAACU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCU-3'인 경우, 상기 제11 서열은 5'-AGAACU-3'일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUG-3'인 경우, 상기 제11 서열은 5'-UAGAACU-3' 또는 5'-UUAGAACU-3' 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUGU-3'인 경우, 상기 제11 서열은 5'-AUUAGAACU-3' 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUGUC-3'인 경우, 상기 제11 서열은 5'-GAUUAGAACU-3'(서열번호 54) 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUGUCC-3'(서열번호 52)인 경우, 상기 제11 서열은 5'-GGAUUAGAACU-3'(서열번호 55) 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAGCUGUCCC-3'(서열번호 53)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACU-3'(서열번호 56) 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-AAAAGCUGUCCC-3'(서열번호 440)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACUU-3'(서열번호 442) 일 수 있다. 또 다른 예로, 상기 제10 서열이 5'-CAAAAGCUGUCCC-3'(서열번호 441)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACUUG-3'(서열번호 443) 일 수 있다.For example, when the tenth sequence is 5'-A-3', the eleventh sequence may be 5'-U-3'. As another example, when the tenth sequence is 5'-AA-3', the eleventh sequence may be 5'-CU-3'. As another example, when the tenth sequence is 5'-AAA-3', the eleventh sequence may be 5'-ACU-3'. As another example, when the tenth sequence is 5'-AAAG-3', the eleventh sequence may be 5'-AACU-3'. As another example, when the tenth sequence is 5'-AAAGC-3', the eleventh sequence may be 5'-GAACU-3'. As another example, when the tenth sequence is 5'-AAAGCU-3', the eleventh sequence may be 5'-AGAACU-3'. As another example, when the tenth sequence is 5'-AAAGCUG-3', the eleventh sequence may be 5'-UAGAACU-3' or 5'-UUAGAACU-3'. As another example, when the tenth sequence is 5'-AAAGCUGU-3', the eleventh sequence may be 5'-AUUAGAACU-3'. As another example, when the tenth sequence is 5'-AAAGCUGUC-3', the eleventh sequence may be 5'-GAUUAGAACU-3' (SEQ ID NO: 54). As another example, when the tenth sequence is 5'-AAAGCUGUCC-3' (SEQ ID NO: 52), the eleventh sequence may be 5'-GGAUUAGAACU-3' (SEQ ID NO: 55). As another example, when the tenth sequence is 5'-AAAGCUGUCCC-3' (SEQ ID NO: 53), the eleventh sequence may be 5'-GGGAUUAGAACU-3' (SEQ ID NO: 56). As another example, when the tenth sequence is 5'-AAAAGCUGUCCC-3' (SEQ ID NO: 440), the eleventh sequence may be 5'-GGGAUUAGAACUU-3' (SEQ ID NO: 442). As another example, when the tenth sequence is 5'-CAAAAGCUGUCCC-3' (SEQ ID NO: 441), the eleventh sequence may be 5'-GGGAUUAGAACUUG-3' (SEQ ID NO: 443).

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 5'-UUCAU-3', 5'-UUCAUU-3', 5'-UUCAUUU-3', 5'-UUCAUUUU-3', 5'-UUCAUUUUU-3', 5'-UUCAUUUUUC-3'(서열번호 343), 5'-UUCAUUUUUCC-3'(서열번호 344), 5'-UUCAUUUUUCCU-3'(서열번호 345), 5'-UUCAUUUUUCCUC-3'(서열번호 346), 5'-UUCAUUUUUCCUCU-3'(서열번호 347), 5'-UUCAUUUUUCCUCUC-3'(서열번호 348), 5'-UUCAUUUUUCCUCUCC-3'(서열번호 349), 5'-UUCAUUUUUCCUCUCCA-3'(서열번호 350), 5'-UUCAUUUUUCCUCUCCAA-3'(서열번호 351), 5'-UUCAUUUUUCCUCUCCAAU-3'(서열번호 352), 5'-UUCAUUUUUCCUCUCCAAUU-3'(서열번호 353), 5'-UUCAUUUUUCCUCUCCAAUUC-3'(서열번호 354), 5'-UUCAUUUUUCCUCUCCAAUUCU-3'(서열번호 355), 5'-UUCAUUUUUCCUCUCCAAUUCUG-3'(서열번호 356), 5'-UUCAUUUUUCCUCUCCAAUUCUGC-3'(서열번호 357), 5'-UUCAUUUUUCCUCUCCAAUUCUGCA-3'(서열번호 358), 5'-UUCAUUUUUCCUCUCCAAUUCUGCAC-3'(서열번호 359), 5'-UUCAUUUUUCCUCUCCAAUUCUGCACA-3'(서열번호 360), 및 5'-UUCAUUUUUCCUCUCCAAUUCUGCACAA-3'(서열번호 361)로 이뤄진 군에서 선택된 제 12서열을 추가적으로 포함할 수 있다. 이때, 상기 제6 서열의 3'말단 및 상기 링커의 5'말단은 상기 제12 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 5'-UUCAU-3', 5'-UUCAUU-3', 5'-UUCAUUU-3', 5'-UUCAUUUU-3', 5'-UUCAUUUUU-3', 5'-UUCAUUUUUC-3' (SEQ ID NO: 343), 5'-UUCAUUUUUCC-3' (SEQ ID NO: 344), 5'-UUCAUUUUUCCU-3' (SEQ ID NO: 345), 5'-UUCAUUUUUCCUC-3' (SEQ ID NO: 346), 5'-UUCAUUUUUCCUCU-3' ( SEQ ID NO: 347), 5'-UUCAUUUUUCCUCUC-3' (SEQ ID NO: 348), 5'-UUCAUUUUUCCUCUCC-3' (SEQ ID NO: 349), 5'-UUCAUUUUUCCUCUCCA-3' (SEQ ID NO: 350), 5'-UUCAUUUUUCCUCUCCAA-3 '(SEQ ID NO: 351), 5'-UUCAUUUUUCCUCUCCAAU-3' (SEQ ID NO: 352), 5'-UUCAUUUUUCCUCUCCAAUU-3' (SEQ ID NO: 353), 5'-UUCAUUUUUCCUCUCCCAAUCUCU-3' (SEQ ID NO: 354), 5'-UUCCUCCAUUUUCCUCUCCAUUUU -3' (SEQ ID NO: 355), 5'-UUCAUUUUUCCUCCCAAUUCUG-3' (SEQ ID NO: 356), 5'-UUCAUUUUUCCUCCCAAUUCUGC-3' (SEQ ID NO: 357), 5'-UUCAUUUUUCCUCCUCCAAUUCUGCA-3' (SEQ ID NO: 358), 5' -UUCAUUUUUCCUCUCCCAAUUCUGCAC-3' (SEQ ID NO: 359), 5'-UUCAUUUUUCCUCUCCAAUUCUGCACA-3' (SEQ ID NO: 360), and 5'-UUCAUUUUUCCUCUCCCAAUUCUGCACAA-3' (SEQ ID NO: 361) may additionally include a twelfth sequence selected from the group consisting of have. In this case, the 3' end of the sixth sequence and the 5' end of the linker may be connected through the twelfth sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', 5'-GAAUGAA-3', 5'-CGAAUGAA-3', 5'-ACGAAUGAA-3', 5'-GACGAAUGAA-3'(서열번호 362), 5'-AGACGAAUGAA-3'(서열번호 363), 5'-UAGACGAAUGAA-3'(서열번호 364), 5'-AUAGACGAAUGAA-3'(서열번호 365), 5'-AAUAGACGAAUGAA-3'(서열번호 366), 5'-GAAUAGACGAAUGAA-3'(서열번호 367), 5'-CGAAUAGACGAAUGAA-3'(서열번호 368), 5'-CCGAAUAGACGAAUGAA-3'(서열번호 369), 5'-CCCGAAUAGACGAAUGAA-3'(서열번호 370), 5'-ACCCGAAUAGACGAAUGAA-3'(서열번호 371), 5'-AACCCGAAUAGACGAAUGAA-3'(서열번호 372), 5'-GAACCCGAAUAGACGAAUGAA-3'(서열번호 373), 5'-AGAACCCGAAUAGACGAAUGAA-3'(서열번호 374), 5'-CAGAACCCGAAUAGACGAAUGAA-3'(서열번호 375), 5'-GCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 376), 5'-UGCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 377), 5'-UUGCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 378), 및 5'-GUUGCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 379)로 이뤄진 군에서 선택된 제 13서열을 추가적으로 포함할 수 있다. 이때, 상기 링커의 3'말단 및 상기 제7 서열의 5'말단은 상기 제13 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', 5'-GAAUGAA-3', 5'-CGAAUGAA-3', 5'-ACGAAUGAA-3', 5'-GACGAAUGAA-3' (SEQ ID NO: 362), 5'-AGACGAAUGAA-3' (SEQ ID NO: 363), 5'-UAGACGAAUGAA-3' (SEQ ID NO: 364), 5'-AUAGACGAAUGAA-3' (SEQ ID NO: 365), 5'-AAUAGACGAAUGAA-3' ( SEQ ID NO: 366), 5'-GAAUAGACGAAUGAA-3' (SEQ ID NO: 367), 5'-CGAAUAGACGAAUGAA-3' (SEQ ID NO: 368), 5'-CCGAAUAGACGAAUGAA-3' (SEQ ID NO: 369), 5'-CCCGAAUAGACGAAUGAA-3 '(SEQ ID NO: 370), 5'-ACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 371), 5'-AACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 372), 5'-GAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 373), 5'-AGAACCCGAAUAGACGAAUGAA -3' (SEQ ID NO: 374), 5'-CAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 375), 5'-GCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 376), 5'-UGCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 377), 5' -UUGCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 378), and 5'-GUUGCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 379) may additionally include a thirteenth sequence selected from the group consisting of. In this case, the 3' end of the linker and the 5' end of the seventh sequence may be connected through the thirteenth sequence.

상기 구현예의 또 다른 구체예로, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 5'-UUCAU-3', 5'-UUCAUU-3', 5'-UUCAUUU-3', 5'-UUCAUUUU-3', 5'-UUCAUUUUU-3', 5'-UUCAUUUUUC-3'(서열번호 343), 5'-UUCAUUUUUCC-3'(서열번호 344), 5'-UUCAUUUUUCCU-3'(서열번호 345), 5'-UUCAUUUUUCCUC-3'(서열번호 346), 5'-UUCAUUUUUCCUCU-3'(서열번호 347), 5'-UUCAUUUUUCCUCUC-3'(서열번호 348), 5'-UUCAUUUUUCCUCUCC-3'(서열번호 349), 5'-UUCAUUUUUCCUCUCCA-3'(서열번호 350), 5'-UUCAUUUUUCCUCUCCAA-3'(서열번호 351), 5'-UUCAUUUUUCCUCUCCAAU-3'(서열번호 352), 5'-UUCAUUUUUCCUCUCCAAUU-3'(서열번호 353), 5'-UUCAUUUUUCCUCUCCAAUUC-3'(서열번호 354), 5'-UUCAUUUUUCCUCUCCAAUUCU-3'(서열번호 355), 5'-UUCAUUUUUCCUCUCCAAUUCUG-3'(서열번호 356), 5'-UUCAUUUUUCCUCUCCAAUUCUGC-3'(서열번호 357), 5'-UUCAUUUUUCCUCUCCAAUUCUGCA-3'(서열번호 358), 5'-UUCAUUUUUCCUCUCCAAUUCUGCAC-3'(서열번호 359), 5'-UUCAUUUUUCCUCUCCAAUUCUGCACA-3'(서열번호 360), 및 5'-UUCAUUUUUCCUCUCCAAUUCUGCACAA-3'(서열번호 361)로 이뤄진 군에서 선택된 제 12서열, 및 5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', 5'-GAAUGAA-3', 5'-CGAAUGAA-3', 5'-ACGAAUGAA-3', 5'-GACGAAUGAA-3'(서열번호 362), 5'-AGACGAAUGAA-3'(서열번호 363), 5'-UAGACGAAUGAA-3'(서열번호 364), 5'-AUAGACGAAUGAA-3'(서열번호 365), 5'-AAUAGACGAAUGAA-3'(서열번호 366), 5'-GAAUAGACGAAUGAA-3'(서열번호 367), 5'-CGAAUAGACGAAUGAA-3'(서열번호 368), 5'-CCGAAUAGACGAAUGAA-3'(서열번호 369), 5'-CCCGAAUAGACGAAUGAA-3'(서열번호 370), 5'-ACCCGAAUAGACGAAUGAA-3'(서열번호 371), 5'-AACCCGAAUAGACGAAUGAA-3'(서열번호 372), 5'-GAACCCGAAUAGACGAAUGAA-3'(서열번호 373), 5'-AGAACCCGAAUAGACGAAUGAA-3'(서열번호 374), 5'-CAGAACCCGAAUAGACGAAUGAA-3'(서열번호 375), 5'-GCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 376), 5'-UGCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 377), 5'-UUGCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 378), 및 5'-GUUGCAGAACCCGAAUAGACGAAUGAA-3'(서열번호 379)로 이뤄진 군에서 선택된 제 13서열을 추가적으로 포함할 수 있다. 이때, 상기 제6 서열의 3'말단 및 상기 링커의 5'말단은 상기 제12 서열을 통해 연결되고, 상기 링커의 3'말단 및 상기 제7 서열의 5'말단은 상기 제13 서열을 통해 연결되어 있을 수 있다.In another embodiment of the above embodiment, the sequence of the engineered scaffold region is 5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 5'-UUCAU-3', 5'-UUCAUU-3', 5'-UUCAUUU-3', 5'-UUCAUUUU-3', 5'-UUCAUUUUU-3', 5'-UUCAUUUUUC-3' (SEQ ID NO: 343), 5'-UUCAUUUUUCC-3' (SEQ ID NO: 344), 5'-UUCAUUUUUCCU-3' (SEQ ID NO: 345), 5'-UUCAUUUUUCCUC-3' (SEQ ID NO: 346), 5'-UUCAUUUUUCCUCU-3' ( SEQ ID NO: 347), 5'-UUCAUUUUUCCUCUC-3' (SEQ ID NO: 348), 5'-UUCAUUUUUCCUCUCC-3' (SEQ ID NO: 349), 5'-UUCAUUUUUCCUCUCCA-3' (SEQ ID NO: 350), 5'-UUCAUUUUUCCUCUCCAA-3 '(SEQ ID NO: 351), 5'-UUCAUUUUUCCUCUCCAAU-3' (SEQ ID NO: 352), 5'-UUCAUUUUUCCUCUCCAAUU-3' (SEQ ID NO: 353), 5'-UUCAUUUUUCCUCUCCCAAUCUCU-3' (SEQ ID NO: 354), 5'-UUCCUCCAUUUUCCUCUCCAUUUU -3' (SEQ ID NO: 355), 5'-UUCAUUUUUCCUCCCAAUUCUG-3' (SEQ ID NO: 356), 5'-UUCAUUUUUCCUCCCAAUUCUGC-3' (SEQ ID NO: 357), 5'-UUCAUUUUUCCUCCUCCAAUUCUGCA-3' (SEQ ID NO: 358), 5' -UUCAUUUUUCCUCUCCCAAUUCUGCAC-3' (SEQ ID NO: 359), 5'-UUCAUUUUUCCUCUCCAAUUCUGCACA-3' (SEQ ID NO: 360), and 5'-UUCAUUUUUCCUCUCCCAAUUCUGCACAA-3' (SEQ ID NO: 361); and A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', 5'-GAAUGAA- 3', 5'-CGAAUGAA-3', 5'-ACGAAUGAA- 3', 5'-GACGAAUGAA-3' (SEQ ID NO: 362), 5'-AGACGAAUGAA-3' (SEQ ID NO: 363), 5'-UAGACGAAUGAA-3' (SEQ ID NO: 364), 5'-AUAGACGAAUGAA-3' ( SEQ ID NO: 365), 5'-AAUAGACGAAUGAA-3' (SEQ ID NO: 366), 5'-GAAUAGACGAAUGAA-3' (SEQ ID NO: 367), 5'-CGAAUAGACGAAUGAA-3' (SEQ ID NO: 368), 5'-CCGAAUAGACGAAUGAA-3 '(SEQ ID NO: 369), 5'-CCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 370), 5'-ACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 371), 5'-AACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 372), 5'-GAACCCGAAUAGACGAAUGAA -3' (SEQ ID NO: 373), 5'-AGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 374), 5'-CAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 375), 5'-GCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 376), 5' -UGCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 377), 5'-UUGCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 378), and 5'-GUUGCAGAACCCGAAUAGACGAAUGAA-3' (SEQ ID NO: 379) may additionally include a 13th sequence selected from the group consisting of have. In this case, the 3' end of the sixth sequence and the 5' end of the linker are connected through the twelfth sequence, and the 3' end of the linker and the 5' end of the seventh sequence are connected through the thirteenth sequence may have been

일 예로, 상기 제12 서열이 5'-U-3'인 경우, 상기 제13 서열은 5'-A-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UU-3'인 경우, 상기 제13 서열은 5'-AA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUC-3'인 경우, 상기 제13 서열은 5'-GAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCA-3'인 경우, 상기 제13 서열은 5'-UGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAU-3'인 경우, 상기 제13 서열은 5'-AUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUU-3'인 경우, 상기 제13 서열은 5'-AAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUU-3'인 경우, 상기 제13 서열은 5'-GAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUU-3'인 경우, 상기 제13 서열은 5'-CGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUU-3'인 경우, 상기 제13 서열은 5'-ACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUC-3'인 경우, 상기 제13 서열은 5'-GACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCC-3'인 경우, 상기 제13 서열은 5'-AGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCU-3'인 경우, 상기 제13 서열은 5'-UAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUC-3'인 경우, 상기 제13 서열은 5'-AUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCU-3'인 경우, 상기 제13 서열은 5'-AAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUC-3'인 경우, 상기 제13 서열은 5'-GAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCC-3'인 경우, 상기 제13 서열은 5'-CGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCA-3'인 경우, 상기 제13 서열은 5'-CCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAA-3'인 경우, 상기 제13 서열은 5'-CCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAU-3'인 경우, 상기 제13 서열은 5'-ACCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAUU-3'인 경우, 상기 제13 서열은 5'-AACCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAUUC-3'인 경우, 상기 제13 서열은 5'-GAACCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAUUCU-3'인 경우, 상기 제13 서열은 5'-AGAACCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAUUCUG-3'인 경우, 상기 제13 서열은 5'-CAGAACCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAUUCUGC-3'인 경우, 상기 제13 서열은 5'-GCAGAACCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAUUCUGCA-3'인 경우, 상기 제13 서열은 5'-UGCAGAACCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAUUCUGCAC-3'인 경우, 상기 제13 서열은 5'-UUGCAGAACCCGAAUAGACGAAUGAA-3'일 수 있다. 또 다른 예로, 상기 제12 서열이 5'-UUCAUUUUUCCUCUCCAAUUCUGCACA-3', 또는 5'-UUCAUUUUUCCUCUCCAAUUCUGCACAA-3'인 경우, 상기 제13 서열은 5'-GUUGCAGAACCCGAAUAGACGAAUGAA-3'일 수 있다.For example, when the twelfth sequence is 5'-U-3', the thirteenth sequence may be 5'-A-3'. As another example, when the twelfth sequence is 5'-UU-3', the thirteenth sequence may be 5'-AA-3'. As another example, when the twelfth sequence is 5'-UUC-3', the thirteenth sequence may be 5'-GAA-3'. As another example, when the twelfth sequence is 5'-UUCA-3', the thirteenth sequence may be 5'-UGAA-3'. As another example, when the twelfth sequence is 5'-UUCAU-3', the thirteenth sequence may be 5'-AUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUU-3', the thirteenth sequence may be 5'-AAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUU-3', the thirteenth sequence may be 5'-GAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUU-3', the thirteenth sequence may be 5'-CGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUU-3', the thirteenth sequence may be 5'-ACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUUC-3', the thirteenth sequence may be 5'-GACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUUCC-3', the thirteenth sequence may be 5'-AGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCU-3', the thirteenth sequence may be 5'-UAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUC-3', the thirteenth sequence may be 5'-AUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCU-3', the thirteenth sequence may be 5'-AAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUC-3', the thirteenth sequence may be 5'-GAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCC-3', the thirteenth sequence may be 5'-CGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCA-3', the thirteenth sequence may be 5'-CCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAA-3', the thirteenth sequence may be 5'-CCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAU-3', the thirteenth sequence may be 5'-ACCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAUU-3', the thirteenth sequence may be 5'-AACCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAUUC-3', the thirteenth sequence may be 5'-GAACCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAUUCU-3', the thirteenth sequence may be 5'-AGAACCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAUUCUG-3', the thirteenth sequence may be 5'-CAGAACCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAUUCUGC-3', the thirteenth sequence may be 5'-GCAGAACCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAUUCUGCA-3', the thirteenth sequence may be 5'-UGCAGAACCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAUUCUGCAC-3', the thirteenth sequence may be 5'-UUGCAGAACCCGAAUAGACGAAUGAA-3'. As another example, when the twelfth sequence is 5'-UUCAUUUUUCCUCUCCAAUUCUGCACA-3', or 5'-UUCAUUUUUCCUCUCCAAUUCUGCACAA-3', the thirteenth sequence may be 5'-GUUGCAGAACCCGAAUAGACGAAUGAA-3'.

엔지니어링 된 싱글 가이드 RNA 서열 예시Examples of engineered single guide RNA sequences

일 구현예로, 상기 엔지니어링 된 싱글 가이드 RNA는 서열번호 210 내지 258, 서열번호 381 내지 393, 서열번호 396 내지 서열번호 407, 서열번호 409 내지 서열번호 421, 및 서열번호 436 내지 서열번호 439로 이뤄진 군에서 선택된 서열을 가질 수 있다.In one embodiment, the engineered single guide RNA consists of SEQ ID NO: 210 to 258, SEQ ID NO: 381 to 393, SEQ ID NO: 396 to SEQ ID NO: 407, SEQ ID NO: 409 to SEQ ID NO: 421, and SEQ ID NO: 436 to SEQ ID NO: 439 It may have a sequence selected from the group.

엔지니어링 된 듀얼 가이드 RNA 예시 1Engineered Dual Guide RNA Example 1

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 포함한다.In one embodiment, the engineered Cas12f1 guide RNA comprises an engineered scaffold region, and a spacer.

상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가진다.The spacer has a length of 10 to 50 nucleotides and has a sequence complementary to the target sequence.

상기 엔지니어링 된 스캐폴드 영역의 서열은 다음 서열을 포함한다:The sequence of the engineered scaffold region comprises the following sequence:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), a sequence selected from the group consisting of 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9),

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열,5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order,

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11), 및5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11), and

5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAUUCA-3'(서열번호 66), 5'-AACAAAUUCAU-3'(서열번호 67), 5'-AACAAAUUCAUU-3'(서열번호 68), 및 5'-AACAAAUUCAUUU-3'(서열번호 12)로 이뤄진 군에서 선택된 서열이 연결된 엔지니어링 된 tracrRNA; 및5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAAUUCA-3' (SEQ ID NO: 66), 5'-AACAAAUUCAU- an engineered tracrRNA to which a sequence selected from the group consisting of 3' (SEQ ID NO: 67), 5'-AACAAAUUCAUU-3' (SEQ ID NO: 68), and 5'-AACAAAUUCAUUU-3' (SEQ ID NO: 12) is linked; and

5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5'-AAUGAAGGA-3', 및 5'-GAAUGAAGGA-3'(서열번호 14)으로 이뤄진 군에서 선택된 서열, 및5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5' -AAUGAAGGA-3', and a sequence selected from the group consisting of 5'-GAAUGAAGGA-3' (SEQ ID NO: 14), and

5'-AUGCAAC-3'가 연결된 엔지니어링 된 crRNA 반복 서열 부분,5'-AUGCAAC-3' linked engineered crRNA repeat sequence portion,

이때, 상기 엔지니어링 된 crRNA 반복 서열 부분의 3'말단은 상기 스페이서의 5'말단과 연결 되어 있다.At this time, the 3' end of the engineered crRNA repeat sequence portion is connected to the 5' end of the spacer.

이때, 상기 엔지니어링 된 tracrRNA의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 1)과 상이하고, 및/또는 상기 엔지니어링 된 crRNA 반복 서열 부분은 5'-GAAUGAAGGAAUGCAAC-3'(서열번호 3)과 상이하다.At this time, the sequence of the engineered tracrRNA is 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAAUACCCUCGAAACAAAUUCAUUU-3' (sequence number 1) and different from the engineered crRNA (SEQ ID NO: 1) do.

일 예로, 상기 엔지니어링 된 tracrRNA의 서열은 서열번호 1과 동일하고, 상기 엔지니어링 된 crRNA 반복 서열 부분은 서열번호 3과 상이할 수 있다. 또 다른 일 예로, 상기 엔지니어링 된 tracrRNA의 서열은 서열번호 1과 상이하고, 상기 엔지니어링 된 crRNA 반복 서열 부분은 서열번호 3과 동일할 수 있다. 또 다른 일 예로, 상기 엔지니어링 된 tracrRNA의 서열은 서열번호 1과 상이하고, 상기 엔지니어링 된 crRNA 반복 서열 부분은 서열번호 3과 상이할 수 있다.For example, the sequence of the engineered tracrRNA may be the same as SEQ ID NO: 1, and the engineered crRNA repeat sequence portion may be different from SEQ ID NO: 3. As another example, the sequence of the engineered tracrRNA may be different from SEQ ID NO: 1, and the engineered crRNA repeat sequence portion may be the same as SEQ ID NO: 3. As another example, the sequence of the engineered tracrRNA may be different from SEQ ID NO: 1, and the engineered crRNA repeat sequence portion may be different from SEQ ID NO: 3.

일 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAA-3'를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAU-3'를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUU-3'를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUC-3'를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GAAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCA-3'(서열번호 66)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-UGAAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAU-3'(서열번호 67)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AUGAAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUU-3'(서열번호 68)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AAUGAAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUU-3'(서열번호 12)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GAAUGAAGGA-3'(서열번호 14)를 포함할 수 있다.For example, when the engineered tracrRNA includes 5'-AACAAA-3', the engineered crRNA may include 5'-GGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAU-3', the engineered crRNA may include 5'-AGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUU-3', the engineered crRNA may include 5'-AAGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUUC-3', the engineered crRNA may include 5'-GAAGGA-3'. As another example, when the engineered tracrRNA comprises 5'-AACAAAAUUCA-3' (SEQ ID NO: 66), the engineered crRNA may include 5'-UGAAGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUUCAU-3' (SEQ ID NO: 67), the engineered crRNA may include 5'-AUGAAGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUUCAUU-3' (SEQ ID NO: 68), the engineered crRNA may include 5'-AAUGAAGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUUCAUUU-3' (SEQ ID NO: 12), the engineered crRNA may include 5'-GAAUGAAGGA-3' (SEQ ID NO: 14).

엔지니어링 된 듀얼 가이드 RNA 예시 2Engineered Dual Guide RNA Example 2

일 구현예로, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 포함한다.In one embodiment, the engineered Cas12f1 guide RNA comprises an engineered scaffold region, and a spacer.

상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가진다.The spacer has a length of 10 to 50 nucleotides and has a sequence complementary to the target sequence.

상기 엔지니어링 된 스캐폴드 영역의 서열은 다음 서열을 포함한다:The sequence of the engineered scaffold region comprises the following sequence:

5'말단에서 3'말단 방향으로,5' end to 3' end direction,

5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열,5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5' -UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5' -UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), a sequence selected from the group consisting of 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9),

5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열,5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order,

5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11), 및5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11), and

5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAUUCA-3'(서열번호 66), 5'-AACAAAUUCAU-3'(서열번호 67), 5'-AACAAAUUCAUU-3'(서열번호 68), 5'-AACAAAUUCAUUU-3'(서열번호 69), 5'-AACAAAUUCAUUUU-3'(서열번호 70), 5'-AACAAAUUCAUUUUU-3'(서열번호 71), 5'-AACAAAUUCAUUUUUC-3'(서열번호 72), 5'-AACAAAUUCAUUUUUCC-3'(서열번호 73), 5'-AACAAAUUCAUUUUUCCU-3'(서열번호 74), 5'-AACAAAUUCAUUUUUCCUC-3'(서열번호 75), 5'-AACAAAUUCAUUUUUCCUCU-3'(서열번호 76), 5'-AACAAAUUCAUUUUUCCUCUC-3'(서열번호 77), 5'-AACAAAUUCAUUUUUCCUCUCC-3'(서열번호 78), 5'-AACAAAUUCAUUUUUCCUCUCCA-3'(서열번호 79), 5'-AACAAAUUCAUUUUUCCUCUCCAA-3'(서열번호 80), 5'-AACAAAUUCAUUUUUCCUCUCCAAU-3'(서열번호 81), 5'-AACAAAUUCAUUUUUCCUCUCCAAUU-3'(서열번호 82), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUC-3'(서열번호 83), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCU-3'(서열번호 84), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUG-3'(서열번호 85), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGC-3'(서열번호 86), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCA-3'(서열번호 87), 5'-AAACAAAUUCAUUUUUCCUCUCCAAUUCUGCAC-3'(서열번호 88), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACA-3'(서열번호 89), 및 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAA-3'(서열번호 13)로 이뤄진 군에서 선택된 서열이 연결된 엔지니어링 된 tracrRNA; 및5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAAUUCA-3' (SEQ ID NO: 66), 5'-AACAAAUUCAU- 3' (SEQ ID NO: 67), 5'-AACAAAUUCAUU-3' (SEQ ID NO: 68), 5'-AACAAAUUCAUUU-3' (SEQ ID NO: 69), 5'-AACAAAUUCAUUUU-3' (SEQ ID NO: 70), 5'- AACAAAUUCAUUUUU-3' (SEQ ID NO: 71), 5'-AACAAAUUCAUUUUUC-3' (SEQ ID NO: 72), 5'-AACAAAUUCAUUUUUCC-3' (SEQ ID NO: 73), 5'-AACAAAUUCAUUUUUUCCU-3' (SEQ ID NO: 74), 5 '-AACAAAUUCAUUUUUCCUC-3' (SEQ ID NO: 75), 5'-AACAAAUUCAUUUUUCCUCU-3' (SEQ ID NO: 76), 5'-AACAAAUUCAUUUUUCCUCUC-3' (SEQ ID NO: 77), 5'-AACAAAUUCAUUUUUUCCUCUCC-3' (SEQ ID NO: 78) , 5'-AACAAAUUCAUUUUUCCUCUCCA-3' (SEQ ID NO: 79), 5'-AACAAAUUCAUUUUUUCCUCUCCAA-3' (SEQ ID NO: 80), 5'-AACAAAUUCAUUUUUCCUCUCCAAU-3' (SEQ ID NO: 81), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUUUCCUCUCCAA 82), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUC-3' (SEQ ID NO: 83), 5'-AACAAAUUCAUUUUUUCCUCCCAAUUCU-3' (SEQ ID NO: 84), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCAUUCCUUCCAAUUCUG-3' (SEQ ID NO: 85), 5'-AACAA SEQ ID NO: 86), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCA-3' (SEQ ID NO: 87), 5'-AAACAAAUUCAUUUUUCCUCCCAAUUCUGCAC-3' (SEQ ID NO: 88), 5'-AAACAAAUUCAUUUUUCCUCUCCAAUUCUGCAUCA-3') (SEQ ID NO: an engineered tracrRNA to which a sequence selected from the group consisting of CAUUUUUCCUCCUCCAAUUCUGCACAA-3' (SEQ ID NO: 13) is linked; and

5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5'-AAUGAAGGA-3', 5'-GAAUGAAGGA-3'(서열번호 90), 5'-CGAAUGAAGGA-3'(서열번호 91), 5'-ACGAAUGAAGGA-3'(서열번호 92), 5'-GACGAAUGAAGGA-3'(서열번호 93), 5'-AGACGAAUGAAGGA-3'(서열번호 94), 5'-UAGACGAAUGAAGGA-3'(서열번호 95), 5'-AUAGACGAAUGAAGGA-3'(서열번호 96), 5'-AAUAGACGAAUGAAGGA-3'(서열번호 97), 5'-GAAUAGACGAAUGAAGGA-3'(서열번호 98), 5'-CGAAUAGACGAAUGAAGGA-3'(서열번호 99), 5'-CCGAAUAGACGAAUGAAGGA-3'(서열번호 100), 5'-CCCGAAUAGACGAAUGAAGGA-3'(서열번호 101), 5'-ACCCGAAUAGACGAAUGAAGGA-3'(서열번호 102), 5'-AACCCGAAUAGACGAAUGAAGGA-3'(서열번호 103), 5'-GAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 104), 5'-AGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 105), 5'-CAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 106), 5'-GCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 107), 5'-UGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 108), 5'-UUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 109), 및 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 15)로 이뤄진 군에서 선택된 서열, 및5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5' -AAUGAAGGA-3', 5'-GAAUGAAGGA-3' (SEQ ID NO: 90), 5'-CGAAUGAAGGA-3' (SEQ ID NO: 91), 5'-ACGAAUGAAGGA-3' (SEQ ID NO: 92), 5'-GACGAAUGAAGGA- 3' (SEQ ID NO: 93), 5'-AGACGAAUGAAGGA-3' (SEQ ID NO: 94), 5'-UAGACGAAUGAAGGA-3' (SEQ ID NO: 95), 5'-AUAGACGAAUGAAGGA-3' (SEQ ID NO: 96), 5'- AAUAGACGAAUGAAGGA-3' (SEQ ID NO: 97), 5'-GAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 98), 5'-CGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 99), 5'-CCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 100), 5 '-CCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 101), 5'-ACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 102), 5'-AACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 103), 5'-GAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 104) , 5'-AGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 105), 5'-CAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 106), 5'-GCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 107), 5'-UGCAGAACCCGAAUAGACGAAUGAAGGA-3' 108), 5'-UUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 109), and 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 15), and

5'-AUGCAAC-3'가 연결된 엔지니어링 된 crRNA 반복 서열 부분,5'-AUGCAAC-3' linked engineered crRNA repeat sequence portion,

이때, 상기 엔지니어링 된 crRNA 반복 서열 부분의 3'말단은 상기 스페이서의 5'말단과 연결 되어 있다.At this time, the 3' end of the engineered crRNA repeat sequence portion is connected to the 5' end of the spacer.

이때, 상기 엔지니어링 된 tracrRNA의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAA-3'(서열번호 2)과 상이하고, 및/또는 상기 엔지니어링 된 crRNA 반복 서열 부분은 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3'(서열번호 4)과 상이하다.이때, 상기 엔지니어링 된 tracrRNA의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAA-3'(서열번호 2)과 상이하고, 및/또는 상기 엔지니어링 된 crRNA 반복 서열 부분은 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3'(서열번호 4)과 상이 do.

일 예로, 상기 엔지니어링 된 tracrRNA의 서열은 서열번호 2과 동일하고, 상기 엔지니어링 된 crRNA 반복 서열 부분은 서열번호 4과 상이할 수 있다. 또 다른 일 예로, 상기 엔지니어링 된 tracrRNA의 서열은 서열번호 2과 상이하고, 상기 엔지니어링 된 crRNA 반복 서열 부분은 서열번호 4과 동일할 수 있다. 또 다른 일 예로, 상기 엔지니어링 된 tracrRNA의 서열은 서열번호 2과 상이하고, 상기 엔지니어링 된 crRNA 반복 서열 부분은 서열번호 4과 상이할 수 있다.For example, the sequence of the engineered tracrRNA may be the same as SEQ ID NO: 2, and the engineered crRNA repeat sequence portion may be different from SEQ ID NO: 4. As another example, the engineered tracrRNA sequence may be different from SEQ ID NO: 2, and the engineered crRNA repeat sequence portion may be identical to SEQ ID NO: 4. As another example, the engineered tracrRNA sequence may be different from SEQ ID NO: 2, and the engineered crRNA repeat sequence portion may be different from SEQ ID NO: 4.

일 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAA-3'를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAU-3'를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUU-3'를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUC-3'를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GAAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCA-3'(서열번호 66)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-UGAAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAU-3'(서열번호 67)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AUGAAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUU-3'(서열번호 68)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AAUGAAGGA-3'를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUU-3'(서열번호 69)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GAAUGAAGGA-3'(서열번호 90)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUU-3'(서열번호 70)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-CGAAUGAAGGA-3'(서열번호 91)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUU-3'(서열번호 71)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-ACGAAUGAAGGA-3'(서열번호 92)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUC-3'(서열번호 72)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GACGAAUGAAGGA-3'(서열번호 93)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCC-3'(서열번호 73)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AGACGAAUGAAGGA-3'(서열번호 94)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCU-3'(서열번호 74)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-UAGACGAAUGAAGGA-3'(서열번호 95)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUC-3'(서열번호 75)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AUAGACGAAUGAAGGA-3'(서열번호 96)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCU-3'(서열번호 76)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AAUAGACGAAUGAAGGA-3'(서열번호 97)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUC-3'(서열번호 77)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GAAUAGACGAAUGAAGGA-3'(서열번호 98)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCC-3'(서열번호 78)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-CGAAUAGACGAAUGAAGGA-3'(서열번호 99)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCA-3'(서열번호 79)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-CCGAAUAGACGAAUGAAGGA-3'(서열번호 100)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAA-3'(서열번호 80)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-CCCGAAUAGACGAAUGAAGGA-3'(서열번호 101)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAAU-3'(서열번호 81)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-ACCCGAAUAGACGAAUGAAGGA-3'(서열번호 102)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAAUU-3'(서열번호 82)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AACCCGAAUAGACGAAUGAAGGA-3'(서열번호 103)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAAUUC-3'(서열번호 83)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 104)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCU-3'(서열번호 84)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-AGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 105)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUG-3'(서열번호 85)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-CAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 106)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGC-3'(서열번호 86)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 107)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCA-3'(서열번호 87)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-UGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 108)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AAACAAAUUCAUUUUUCCUCUCCAAUUCUGCAC-3'(서열번호 88)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-UUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 109)를 포함할 수 있다. 또 다른 예로, 상기 엔지니어링 된 tracrRNA가 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACA-3'(서열번호 89) 또는 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAA-3'(서열번호 13)를 포함하는 경우, 상기 엔지니어링 된 crRNA는 5'-GUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 15)를 포함할 수 있다.For example, when the engineered tracrRNA includes 5'-AACAAA-3', the engineered crRNA may include 5'-GGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAU-3', the engineered crRNA may include 5'-AGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUU-3', the engineered crRNA may include 5'-AAGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUUC-3', the engineered crRNA may include 5'-GAAGGA-3'. As another example, when the engineered tracrRNA comprises 5'-AACAAAAUUCA-3' (SEQ ID NO: 66), the engineered crRNA may include 5'-UGAAGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUUCAU-3' (SEQ ID NO: 67), the engineered crRNA may include 5'-AUGAAGGA-3'. As another example, when the engineered tracrRNA includes 5'-AACAAAUUCAUU-3' (SEQ ID NO: 68), the engineered crRNA may include 5'-AAUGAAGGA-3'. As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUU-3' (SEQ ID NO: 69), the engineered crRNA may include 5'-GAAUGAAGGA-3' (SEQ ID NO: 90). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUU-3' (SEQ ID NO: 70), the engineered crRNA may include 5'-CGAAUGAAGGA-3' (SEQ ID NO: 91). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUU-3' (SEQ ID NO: 71), the engineered crRNA may include 5'-ACGAAUGAAGGA-3' (SEQ ID NO: 92). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUC-3' (SEQ ID NO: 72), the engineered crRNA may include 5'-GACGAAUGAAGGA-3' (SEQ ID NO: 93). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCC-3' (SEQ ID NO: 73), the engineered crRNA may include 5'-AGACGAAUGAAGGA-3' (SEQ ID NO: 94). As another example, if the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUUCCU-3' (SEQ ID NO: 74), the engineered crRNA may include 5'-UAGACGAAUGAAGGA-3' (SEQ ID NO: 95). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUC-3' (SEQ ID NO: 75), the engineered crRNA may include 5'-AUAGACGAAUGAAGGA-3' (SEQ ID NO: 96). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCU-3' (SEQ ID NO: 76), the engineered crRNA may include 5'-AAUAGACGAAUGAAGGA-3' (SEQ ID NO: 97). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUC-3' (SEQ ID NO: 77), the engineered crRNA may include 5'-GAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 98). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCC-3' (SEQ ID NO: 78), the engineered crRNA may include 5'-CGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 99). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCA-3' (SEQ ID NO: 79), the engineered crRNA may include 5'-CCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 100). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAA-3' (SEQ ID NO: 80), the engineered crRNA may include 5'-CCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 101). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAAU-3' (SEQ ID NO: 81), the engineered crRNA may include 5'-ACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 102). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAAUU-3' (SEQ ID NO: 82), the engineered crRNA may include 5'-AACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 103). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAAUUC-3' (SEQ ID NO: 83), the engineered crRNA may include 5'-GAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 104). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCU-3' (SEQ ID NO: 84), the engineered crRNA may include 5'-AGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 105). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUG-3' (SEQ ID NO: 85), the engineered crRNA may include 5'-CAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 106). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGC-3' (SEQ ID NO: 86), the engineered crRNA may include 5'-GCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 107). As another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCA-3' (SEQ ID NO: 87), the engineered crRNA may include 5'-UGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 108). As another example, when the engineered tracrRNA comprises 5'-AAACAAAUUCAUUUUUCCUCUCCCAAUUCUGCAC-3' (SEQ ID NO: 88), the engineered crRNA may include 5'-UUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 109). In another example, when the engineered tracrRNA comprises 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACA-3' (SEQ ID NO: 89) or 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAA-3' (SEQ ID NO: 13), the engineered crRNA is 5'-GUUGACGAAUGAACCCGAAUUGACGAAUGAACCC ' (SEQ ID NO: 15).

엔지니어링 된 CRISPR/Cas12f1 복합체Engineered CRISPR/Cas12f1 complex

CRISPR/Cas12f1 복합체 개괄Overview of the CRISPR/Cas12f1 complex

본 명세서에서는 엔지니어링 된 CRISPR/Cas12f1 복합체를 제공한다. 상기 엔지니어링 된 CRISPR/Cas12f1 복합체는 Cas12f1 단백질 및 엔지니어링 된 Cas12f1 가이드 RNA를 포함한다. 이때, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 "엔지니어링 된 Cas12f1 가이드 RNA" 단락에서 설명된 것이다.Provided herein is an engineered CRISPR/Cas12f1 complex. The engineered CRISPR/Cas12f1 complex comprises a Cas12f1 protein and an engineered Cas12f1 guide RNA. In this case, the engineered Cas12f1 guide RNA is as described in the "Engineered Cas12f1 guide RNA" section.

일 구현예로, 본 명세서에서는 Cas12f1 단백질 및 엔지니어링 된 Cas12f1 가이드 RNA를 포함하는, 표적 서열을 포함하는 핵산을 편집할 수 있는 엔지니어링 된 CRISPR/Cas12f1 복합체를 제공한다. 이때, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 "엔지니어링 된 Cas12f1 가이드 RNA" 단락에서 설명된 것 중 어느 하나일 수 있다.In one embodiment, the present specification provides an engineered CRISPR/Cas12f1 complex capable of editing a nucleic acid comprising a target sequence, including a Cas12f1 protein and an engineered Cas12f1 guide RNA. In this case, the engineered Cas12f1 guide RNA may be any one of those described in the "Engineered Cas12f1 guide RNA" paragraph.

Cas12f1 단백질 - 개괄Cas12f1 Protein - Overview

본 명세서에서 제공하는 엔지니어링 된 CRISPR/Cas12f1 복합체는 Cas12f1 단백질이 포함된다. 기본적으로, 상기 Cas12f1 단백질은 자연계에 존재하는 야생형 Cas12f1 단백질일 수 있다. 상기 Cas12f1 단백질을 암호화하는 서열은 야생형 Cas12f1 단백질에 대해 인간 코돈-최적화된 Cas12f1 서열일 수 있다. 또한, 상기 Cas12f1 단백질은 자연계에 존재하는 야생형 Cas12f1 단백질과 동일한 기능을 가질 수 있다. 하지만, 특별히 한정하지 않는 한, 본 명세서에서 "Cas12f1 단백질"이라고 할 때, 이는 야생형, 또는 코돈 최적화된 Cas12f1 단백질 뿐 아니라, 변형된 Cas12f1 단백질 내지 Cas12f1 융합 단백질도 포괄하여 의미할 수 있다. 또한, 자연계에 존재하는 야생형 Cas12f1 단백질과 동일한 기능을 가지는 것 뿐 아니라, 상기 기능의 전부 또는 일부가 변형된 것, 상기 기능의 전부 또는 일부가 상실된 것, 및/또는 추가적인 기능이 부가된 것을 통틀어 일컬을 수 있다. Cas12f1 단백질의 의미는 문맥에 따라 적절히 해석될 수 있고, 특별한 경우가 아닌 한 가장 넓은 의미로 해석된다. 이하 Cas12f1 단백질의 구성, 또는 기능에 대해 자세히 설명한다.The engineered CRISPR/Cas12f1 complex provided herein includes a Cas12f1 protein. Basically, the Cas12f1 protein may be a wild-type Cas12f1 protein existing in nature. The sequence encoding the Cas12f1 protein may be a human codon-optimized Cas12f1 sequence for the wild-type Cas12f1 protein. In addition, the Cas12f1 protein may have the same function as a wild-type Cas12f1 protein existing in nature. However, unless specifically limited, as used herein, when the term "Cas12f1 protein" is used, it may encompass not only a wild-type or codon-optimized Cas12f1 protein, but also a modified Cas12f1 protein to a Cas12f1 fusion protein. In addition, as well as having the same function as the wild-type Cas12f1 protein existing in nature, all or part of the function is modified, all or part of the function is lost, and/or additional function is added. can The meaning of the Cas12f1 protein can be appropriately interpreted according to the context, and is interpreted in the broadest sense unless there is a special case. Hereinafter, the composition or function of the Cas12f1 protein will be described in detail.

Cas12f1 단백질 - 야생형 Cas12f1 단백질Cas12f1 protein - wild-type Cas12f1 protein

본 명세서에서 제공하는 엔지니어링 된 CRISPR/Cas12f1 복합체는 Cas12f1 단백질을 포함할 수 있다.The engineered CRISPR/Cas12f1 complex provided herein may include a Cas12f1 protein.

일 구현예로, 상기 Cas12f1 단백질은 야생형의 Cas12f1 단백질일 수 있다. 일 구현예로, 상기 Cas12f1 단백질은 Cas14 패밀리(Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018))에서 유래한 것일 수 있다. 일 구현예로, 상기 Cas12f1 단백질은 Uncultured archaeon 유래의 Cas14a 단백질(Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018))일 수 있다. 일 구현예로, 상기 Cas12f1 단백질은 Cas14a1 단백질일 수 있다.In one embodiment, the Cas12f1 protein may be a wild-type Cas12f1 protein. In one embodiment, the Cas12f1 protein may be derived from the Cas14 family (Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018)). In one embodiment, the Cas12f1 protein may be an uncultured archaeon-derived Cas14a protein (Harrington et al., Programmed DNA destruction by miniature CRISPR-Cas14 enzymes, Science 362, 839-842 (2018)). In one embodiment, the Cas12f1 protein may be a Cas14a1 protein.

Cas12f1 단백질 - 변형된 Cas12f1 단백질Cas12f1 protein - modified Cas12f1 protein

본 명세서에서 제공하는 엔지니어링 된 CRISPR/Cas12f1 복합체는 변형된 Cas12f1 단백질을 포함할 수 있다. 상기 변형된 Cas12f1은 야생형, 또는 코돈 최적화된 Cas12f1 단백질 서열에서 적어도 일부 서열이 변형된 것을 의미한다. 상기 Cas12f1 단백질 변형은 개별 아미노산 단위로 이루어진 것일 수 있고, 단백질의 기능적 도메인 단위로 이루어진 것일 수 있다.The engineered CRISPR/Cas12f1 complex provided herein may comprise a modified Cas12f1 protein. The modified Cas12f1 means that at least a portion of the wild-type or codon-optimized Cas12f1 protein sequence is modified. The Cas12f1 protein modification may be composed of individual amino acid units or may be composed of functional domain units of a protein.

일 구현예로, 상기 단백질의 변형은 야생형, 또는 코돈 최적화된 Cas12f1 단백질 서열에서 하나 이상의 아미노산, 펩타이드, 폴리펩타이드, 단백질, 및/또는 도메인이 개별적으로 치환, 제거, 및/또는 부가된 것일 수 있다. 일 구현예로, 상기 Cas12f1 단백질은 야생형 Cas12f1 단백질에 포함된 RuvC 도메인 내 하나 이상의 아미노산, 펩타이드, 및/또는 폴리펩타이드가 치환, 제거, 및/또는 부가된 것일 수 있다.In one embodiment, the modification of the protein may be one or more amino acids, peptides, polypeptides, proteins, and/or domains individually substituted, removed, and/or added in the wild-type or codon-optimized Cas12f1 protein sequence. . In one embodiment, the Cas12f1 protein may be one in which one or more amino acids, peptides, and/or polypeptides in the RuvC domain included in the wild-type Cas12f1 protein are substituted, removed, and/or added.

Cas12f1 단백질 - Cas12f1 융합 단백질Cas12f1 protein - Cas12f1 fusion protein

본 명세서에서 제공하는 엔지니어링 된 CRISPR/Cas12f1 복합체는 Cas12f1 융합 단백질을 포함할 수 있다. 이때, 상기 Cas12f1 융합 단백질은 야생형, 또는 변형된 Cas12f1 단백질에 추가적인 아미노산, 펩타이드, 폴리펩타이드, 단백질, 및/또는 도메인이 융합된 단백질을 의미한다.The engineered CRISPR/Cas12f1 complex provided herein may comprise a Cas12f1 fusion protein. In this case, the Cas12f1 fusion protein refers to a protein in which an additional amino acid, peptide, polypeptide, protein, and/or domain is fused to a wild-type or modified Cas12f1 protein.

일 구현예로, Cas12f1 단백질은 야생형의 Cas12f1 단백질에 베이스 에디터, 및/또는 역전사 효소(reverse transcriptase)가 융합된 것일 수 있다. 일 구현예로, 상기 베이스 에디터는 adenosine deaminase, 및/또는 cytidine deaminase일 수 있다. 일 구현예로, 상기 역전사 효소는 Moloney Murine Leukemia Virus(M-MLV) 역전사 효소, 및/또는 그 변이체일 수 있다. 이때, 상기 역전사 효소가 융합된 Cas12f1 단백질은 프라임 에디터로 기능할 수 있다.In one embodiment, the Cas12f1 protein may be a fusion of a base editor and/or reverse transcriptase to a wild-type Cas12f1 protein. In one embodiment, the base editor may be adenosine deaminase, and/or cytidine deaminase. In one embodiment, the reverse transcriptase may be Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase, and/or a variant thereof. In this case, the Cas12f1 protein fused with the reverse transcriptase may function as a prime editor.

일 구현예로 , Cas12f1 단백질은 야생형의 Cas12f1 단백질에, 세포 내의 유전자 발현 과정에 관여할 수 있는 다양한 효소가 융합된 것일 수 있다. 이때, 상기 효소가 융합된 Cas12f1 단백질은 세포 내 유전자 발현에 다양한 양적, 질적 변화를 초래할 수 있다. 일 구현예로, 상기 효소는 VP64, DNMT, TET, KRAB, DHAC, LSD, 및/또는 p300일 수 있다.In one embodiment, the Cas12f1 protein may be a wild-type Cas12f1 protein fused with various enzymes that may be involved in the gene expression process in a cell. In this case, the Cas12f1 protein fused with the enzyme may cause various quantitative and qualitative changes in gene expression in cells. In one embodiment, the enzyme may be VP64, DNMT, TET, KRAB, DHAC, LSD, and/or p300.

Cas12f1 단백질 - 기능의 변경Cas12f1 protein - alteration of function

본 명세서에서 제공하는 엔지니어링 된 CRISPR/Cas12f1 복합체에 포함된 Cas12f1 단백질은 야생형의 Cas12f1 단백질과 동일한 기능을 가질 수 있다. 본 명세서에서 제공하는 엔지니어링 된 CRISPR/Cas12f1 복합체에 포함된 Cas12f1 단백질은 야생형의 Cas12f1 단백질과 비교할 때, 기능이 변경된 것일 수 있다. 구체적으로, 상기 변경은 전부 또는 일부 기능의 변형, 전부 또는 일부 기능의 상실, 및/또는 부가적인 기능의 추가일 수 있다. 일 구현예로, 상기 Cas12f1 단백질은 당업계 통상의 기술자가 CRISPR/Cas 시스템의 Cas 단백질에 적용할 수 있는 변경이라면, 특별히 제한되지 않는다. 이때 상기 변경은 공지의 기술을 이용한 것일 수 있다.The Cas12f1 protein included in the engineered CRISPR/Cas12f1 complex provided herein may have the same function as the wild-type Cas12f1 protein. The Cas12f1 protein included in the engineered CRISPR/Cas12f1 complex provided herein may have an altered function when compared to the wild-type Cas12f1 protein. Specifically, the alteration may be a modification of all or some functions, loss of all or some functions, and/or addition of additional functions. In one embodiment, the Cas12f1 protein is not particularly limited as long as it is a change that a person skilled in the art can apply to the Cas protein of the CRISPR/Cas system. In this case, the change may be made using a known technique.

일 구현예로, 상기 Cas12f1 단백질은 표적 핵산의 이중가닥 중 하나의 가닥만 절단하도록 변경된 것일 수 있다. 더 나아가, 상기 Cas12f1 단백질은 표적 핵산의 이중가닥 중 하나의 가닥만 절단할 수 있고, 절단하지 않는 가닥에 대해 베이스 에디팅(Base editing) 또는 프라임 에디팅(Prime editing)을 할 수 있도록 변경된 것일 수 있다. 일 구현예로, 상기 Cas12f1 단백질은 표적 핵산의 이중가닥 전부를 절단할 수 없도록 변경된 것일 수 있다. 더 나아가, 상기 Cas12f1 단백질은 표적 핵산의 이중가닥 전부를 절단할 수 없고, 표적 핵산에 대해 베이스 에디팅(Base editing), 프라임 에디팅(Prime editing), 또는 유전자 발현 조절 기능을 할 수 있도록 변경된 것일 수 있다.In one embodiment, the Cas12f1 protein may be modified to cut only one of the double strands of the target nucleic acid. Furthermore, the Cas12f1 protein may be modified so that only one strand of the double strands of the target nucleic acid can be cut, and base editing or prime editing can be performed on the uncleaved strand. In one embodiment, the Cas12f1 protein may be modified so that it cannot cut all double-stranded strands of the target nucleic acid. Furthermore, the Cas12f1 protein cannot cut all double-strands of the target nucleic acid, and may be modified to perform a function of base editing, prime editing, or gene expression regulation for the target nucleic acid. .

Cas12f1 단백질 - 기타 변형 예시Cas12f1 protein - examples of other modifications

일 구현예로, 상기 Cas12f1 단백질은 NLS(Nuclear Localization Sequence), 또는 NES(Nuclear Export Sequence)를 포함할 수 있다. 구체적으로, 상기 NLS는 "용어의 정의" 중 NLS 단락에 예시된 것 중 어느 하나일 수 있으나, 이에 제한되는 것은 아니다. 일 구현예로, 상기 Cas12f1 단백질은 태그를 포함할 수 있다. 구체적으로, 상기 태그는 "용어의 정의" 중 태그 단락에 예시된 것 중 어느 하나일 수 있으나, 이에 제한되는 것은 아니다.In one embodiment, the Cas12f1 protein may include a Nuclear Localization Sequence (NLS) or a Nuclear Export Sequence (NES). Specifically, the NLS may be any one of those exemplified in the NLS paragraph of “Definitions of Terms”, but is not limited thereto. In one embodiment, the Cas12f1 protein may include a tag. Specifically, the tag may be any one of those exemplified in the tag paragraph of “Definition of Terms”, but is not limited thereto.

Cas12f1 단백질 - PAM 서열Cas12f1 protein - PAM sequence

CRISPR/Cas12f1 복합체가 표적 유전자, 또는 표적 핵산을 절단하기 위해서는 두 가지 조건이 필요하다. 첫째로, 표적 유전자, 또는 표적 핵산 내에 Cas12f1 단백질이 인식할 수 있는 일정 길이의 염기 서열이 있어야 한다. 둘째로, 상기 일정 길이의 염기 서열 주변에 가이드 RNA에 포함된 스페이서 서열과 상보적으로 결합할 수 있는 서열이 있어야 한다. 위 두 가지 조건이 만족되어 1) Cas12f1 단백질이 상기 일정 길이의 염기 서열을 인식하고, 2) 상기 스페이서 서열 부분이 상기 일정 길이의 염기 서열 주변 서열 부분과 상보적으로 결합하는 경우, 표적 유전자, 또는 표적 핵산이 절단된다. 이때, 상기 Cas12f1 단백질에 의해 인식되는 일정 길이의 염기 서열을 Protospacer Adjacent Motif(PAM) 서열이라 한다. 상기 PAM 서열은 상기 Cas12f1 단백질에 따라 정해지는 고유한 서열이며, 상기 CRISPR/Cas12f1 복합체의 표적 서열을 결정할 때, 상기 PAM 서열과 인접한 서열 내에서 상기 표적 서열을 결정해야 한다는 제약이 따른다.Two conditions are required for the CRISPR/Cas12f1 complex to cleave a target gene or target nucleic acid. First, there must be a nucleotide sequence of a certain length that can be recognized by the Cas12f1 protein in the target gene or target nucleic acid. Second, there should be a sequence capable of complementary binding to the spacer sequence included in the guide RNA around the base sequence of the predetermined length. When the above two conditions are satisfied 1) the Cas12f1 protein recognizes the nucleotide sequence of the predetermined length, and 2) the spacer sequence portion complementarily binds to the sequence portion surrounding the nucleotide sequence of the predetermined length, a target gene, or The target nucleic acid is cleaved. In this case, the nucleotide sequence of a certain length recognized by the Cas12f1 protein is referred to as a Protospacer Adjacent Motif (PAM) sequence. The PAM sequence is a unique sequence determined according to the Cas12f1 protein, and when determining the target sequence of the CRISPR/Cas12f1 complex, there is a constraint that the target sequence must be determined within a sequence adjacent to the PAM sequence.

Cas12f1 단백질 - PAM 서열 예시Cas12f1 protein - PAM sequence example

일 구현예로, 상기 Cas12f1 단백질의 PAM 서열은 T-rich 서열일 수 있다. 일 구현예로, 상기 Cas12f1 단백질의 PAM 서열은 5'말단에서 3'말단 순서로, THTN일 수 있다. 이때, 상기 N은 디옥시티미딘(T), 디옥시아데노신(A), 디옥시사이티딘(C), 또는 디옥시구아노신(G) 중 하나이며, 상기 H는 디옥시티미딘(T), 디옥시아데노신(A), 및 디옥시사이티딘(C) 중 하나이다. 일 구현예로, 상기 Cas12f1 단백질의 PAM 서열은 5'말단에서 3'말단 순서로, TTTN일 수 있다. 이때, 상기 N은 디옥시티미딘(T), 디옥시아데노신(A), 디옥시사이티딘(C), 또는 디옥시구아노신(G) 중 하나이다. 일 구현예로, 상기 Cas12f1 단백질의 PAM 서열은 5'말단에서 3'말단 순서로, TTTA, TTTT, TTTC, 또는 TTTG일 수 있다. 일 구현예로, 상기 Cas12f1 단백질의 PAM 서열은 5'말단에서 3'말단 순서로, TATA, TATT, TATC, 또는 TATG일 수 있다. 일 구현예로, 상기 Cas12f1 단백질의 PAM 서열은 5'말단에서 3'말단 순서로, TCTA, TCTT, TCTC, 또는 TCTG일 수 있다. 일 구현예로, 상기 Cas12f1 단백질의 PAM 서열은 5' 말단에서 3' 말단 순서로, TTTA 또는 TTTG일 수 있다. 일 구현예로, 상기 Cas12f1 단백질의 PAM 서열은 야생형 Cas12f1 단백질의 PAM 서열과는 다른 것일 수 있다.In one embodiment, the PAM sequence of the Cas12f1 protein may be a T-rich sequence. In one embodiment, the PAM sequence of the Cas12f1 protein may be THTN in the order from the 5' end to the 3' end. In this case, N is one of deoxythymidine (T), deoxyadenosine (A), deoxycytidine (C), or deoxyguanosine (G), and H is deoxythymidine (T) , deoxyadenosine (A), and deoxycytidine (C). In one embodiment, the PAM sequence of the Cas12f1 protein may be TTTN in the order from the 5' end to the 3' end. In this case, N is one of deoxythymidine (T), deoxyadenosine (A), deoxycytidine (C), or deoxyguanosine (G). In one embodiment, the PAM sequence of the Cas12f1 protein may be TTTA, TTTT, TTTC, or TTTG in the order from the 5' end to the 3' end. In one embodiment, the PAM sequence of the Cas12f1 protein may be TATA, TATT, TATC, or TATG in order from the 5' end to the 3' end. In one embodiment, the PAM sequence of the Cas12f1 protein may be TCTA, TCTT, TCTC, or TCTG in the order from the 5' end to the 3' end. In one embodiment, the PAM sequence of the Cas12f1 protein may be TTTA or TTTG in order from the 5' end to the 3' end. In one embodiment, the PAM sequence of the Cas12f1 protein may be different from the PAM sequence of the wild-type Cas12f1 protein.

Cas12f1 단백질 - 서열 예시Cas12f1 protein - sequence example

일 구현예로, 상기 Cas12f1 단백질은 서열번호 259 내지 서열번호 266로 이루어진 군에서 선택된 아미노산 서열을 가질 수 있다.In one embodiment, the Cas12f1 protein may have an amino acid sequence selected from the group consisting of SEQ ID NO: 259 to SEQ ID NO: 266.

일 구현예로, 상기 Cas12f1 단백질을 암호화하는 DNA 서열은 인간 코돈-최적화된 서열일 수 있다.In one embodiment, the DNA sequence encoding the Cas12f1 protein may be a human codon-optimized sequence.

일 구현예로, 상기 Cas12f1 단백질을 암호화하는 DNA 서열은 서열번호 267 내지 서열번호 276으로 이루어진 군에서 선택된 DNA 서열일 수 있다.In one embodiment, the DNA sequence encoding the Cas12f1 protein may be a DNA sequence selected from the group consisting of SEQ ID NO: 267 to SEQ ID NO: 276.

엔지니어링 된 Cas12f1 가이드 RNAEngineered Cas12f1 guide RNA

본 명세서에서 제공되는 CRISPR/Cas12f1 복합체를 구성하는 엔지니어링 된 Cas12f1 가이드 RNA는 "엔지니어링 된 Cas12f1 가이드 RNA" 단락에서 설명된 것과 동일한 특징 및 구조를 가진다.The engineered Cas12f1 guide RNA constituting the CRISPR/Cas12f1 complex provided herein has the same characteristics and structure as described in the section "Engineered Cas12f1 guide RNA".

CRISPR/Cas12f1 복합체 - 구성 예시CRISPR/Cas12f1 complex - construction example

일 구현예로, 상기 Cas12f1 단백질은 서열번호 259 내지 서열번호 262에서 선택된 아미노산 서열을 가지고, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 서열번호 210 내지 258, 서열번호 381 내지 393, 서열번호 396 내지 서열번호 407, 및 서열번호 409 내지 서열번호 421로 이뤄진 군에서 선택된 서열을 가지며, 상기 Cas12f1 단백질 및 상기 엔지니어링 된 Cas12f1 가이드 RNA가 결합하여 CRISPR/Cas12f1 복합체를 이루고 있을 수 있다.In one embodiment, the Cas12f1 protein has an amino acid sequence selected from SEQ ID NO: 259 to SEQ ID NO: 262, and the engineered Cas12f1 guide RNA is SEQ ID NO: 210 to 258, SEQ ID NO: 381 to 393, SEQ ID NO: 396 to SEQ ID NO: 407, And it has a sequence selected from the group consisting of SEQ ID NO: 409 to SEQ ID NO: 421, and the Cas12f1 protein and the engineered Cas12f1 guide RNA bind to form a CRISPR/Cas12f1 complex.

일 구현예로, 상기 Cas12f1 단백질은 서열번호 263 내지 서열번호 266으로 이뤄진 군에서 선택된 아미노산 서열을 가지고, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 서열번호 210 내지 258, 서열번호 381 내지 393, 서열번호 396 내지 서열번호 407, 및 서열번호 409 내지 서열번호 421로 이뤄진 군에서 선택된 서열을 가지며, 상기 Cas12f1 단백질 및 상기 엔지니어링 된 Cas12f1 가이드 RNA가 결합하여 이루어진 CRISPR/Cas12f1 복합체는 베이스 에디팅 기능을 가질 수 있다.In one embodiment, the Cas12f1 protein has an amino acid sequence selected from the group consisting of SEQ ID NO: 263 to SEQ ID NO: 266, and the engineered Cas12f1 guide RNA is SEQ ID NO: 210 to 258, SEQ ID NO: 381 to 393, SEQ ID NO: 396 to sequence The CRISPR/Cas12f1 complex having a sequence selected from the group consisting of No. 407 and SEQ ID NO: 409 to SEQ ID NO: 421, wherein the Cas12f1 protein and the engineered Cas12f1 guide RNA are bound, may have a base editing function.

CRISPR/Cas12f1 시스템의 각 구성요소를 발현시키기 위한 벡터Vectors for expressing each component of the CRISPR/Cas12f1 system

벡터 개괄vector outline

본 명세서에서는 CRISPR/Cas12f1 시스템의 구성 요소를 발현시키기 위한 벡터를 제공한다. 상기 벡터는 Cas12f1 단백질, 및/또는 엔지니어링 된 Cas12f1 가이드 RNA를 발현시키도록 구성된다. 상기 벡터의 서열은 상기 CRISPR/Cas12f1 시스템의 구성요소 중 하나를 암호화하는 핵산 서열을 포함하거나, 둘 이상의 구성요소를 암호화하는 핵산 서열을 포함할 수 있다. 상기 벡터의 서열은 Cas12f1 단백질을 암호화하는 핵산 서열 및/또는 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 포함한다. 상기 벡터의 서열은 하나 이상의 프로모터 서열을 포함한다. 상기 프로모터는 Cas12f1 단백질을 암호화하는 핵산 서열 및/또는 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열과 작동적으로 연결되어, 세포 내에서 상기 핵산 서열의 전사가 촉진될 수 있도록 한다. 상기 Cas12f1 단백질은 "엔지니어링 된 CRISPR/Cas12f1 복합체" 단락에서 설명된 Cas12f1 단백질, 변형된 Cas12f1 단백질, 및/또는 Cas12f1 융합 단백질과 동일한 특징 및 구성을 가진다. 상기 엔지니어링 된 Cas12f1 가이드 RNA는 "엔지니어링 된 Cas12f1 가이드 RNA" 단락에서 설명된 엔지니어링 된 Cas12f1 가이드 RNA와 동일한 특징 및 구성을 가진다.Provided herein are vectors for expressing components of the CRISPR/Cas12f1 system. The vector is configured to express a Cas12f1 protein, and/or an engineered Cas12f1 guide RNA. The sequence of the vector may include a nucleic acid sequence encoding one of the components of the CRISPR/Cas12f1 system, or may include a nucleic acid sequence encoding two or more components. The sequence of the vector comprises a nucleic acid sequence encoding a Cas12f1 protein and/or a nucleic acid sequence encoding an engineered Cas12f1 guide RNA. The sequence of the vector includes one or more promoter sequences. The promoter is operatively linked with a nucleic acid sequence encoding a Cas12f1 protein and/or a nucleic acid sequence encoding an engineered Cas12f1 guide RNA, such that transcription of the nucleic acid sequence in a cell can be promoted. The Cas12f1 protein has the same characteristics and composition as the Cas12f1 protein, the modified Cas12f1 protein, and/or the Cas12f1 fusion protein described in the section "Engineered CRISPR/Cas12f1 complex". The engineered Cas12f1 guide RNA has the same characteristics and configuration as the engineered Cas12f1 guide RNA described in the section "Engineered Cas12f1 guide RNA".

상기 벡터의 서열은 Cas12f1 단백질을 암호화하는 핵산 서열 및 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 포함할 수 있다. 일 구현예로, 상기 벡터의 서열은 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 제1 서열, 및 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 포함하는 제2 서열을 포함할 수 있다. 상기 벡터의 서열은 Cas12f1 단백질을 암호화하는 핵산 서열을 세포 내에서 발현시키기 위한 프로모터 서열, 및 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 세포 내에서 발현시키기 위한 프로모터 서열을 포함하고, 상기 프로모터들은 각 발현 대상과 작동적으로 연결(operably linked)되어 있다. 일 구현예로, 상기 벡터의 서열은 상기 제1 서열과 작동 가능하게 연결된 제1 프로모터 서열, 및 상기 제2 서열과 작동 가능하게 연결된 제2 프로모터 서열을 포함할 수 있다.The sequence of the vector may include a nucleic acid sequence encoding a Cas12f1 protein and a nucleic acid sequence encoding an engineered Cas12f1 guide RNA. In one embodiment, the sequence of the vector may include a first sequence comprising a nucleic acid sequence encoding a Cas12f1 protein, and a second sequence comprising a nucleic acid sequence encoding an engineered Cas12f1 guide RNA. The sequence of the vector comprises a promoter sequence for expressing a nucleic acid sequence encoding a Cas12f1 protein in a cell, and a promoter sequence for expressing a nucleic acid sequence encoding an engineered Cas12f1 guide RNA in a cell, wherein the promoters are each It is operably linked to the expression target. In one embodiment, the sequence of the vector may include a first promoter sequence operably linked to the first sequence, and a second promoter sequence operably linked to the second sequence.

상기 벡터의 서열은 Cas12f1 단백질을 암호화하는 핵산 서열 및 둘 이상의 서로 다른 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 포함할 수 있다. 일 구현예로, 상기 벡터의 서열은 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 제1 서열, 제1 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 포함하는 제2 서열, 제2 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 포함하는 제3 서열을 포함할 수 있다. 더 나아가, 상기 벡터의 서열은 상기 제1 서열과 작동 가능하게 연결된 제1 프로모터 서열, 상기 제2 서열과 작동 가능하게 연결된 제2 프로모터 서열, 상기 제3 서열과 작동 가능하게 연결된 제3 프로모터 서열을 포함할 수 있다.The sequence of the vector may include a nucleic acid sequence encoding a Cas12f1 protein and a nucleic acid sequence encoding two or more different engineered Cas12f1 guide RNAs. In one embodiment, the sequence of the vector comprises a first sequence comprising a nucleic acid sequence encoding a Cas12f1 protein, a second sequence comprising a nucleic acid sequence encoding a first engineered Cas12f1 guide RNA, a second engineered Cas12f1 guide RNA It may include a third sequence comprising a nucleic acid sequence encoding Furthermore, the sequence of the vector comprises a first promoter sequence operably linked to the first sequence, a second promoter sequence operably linked to the second sequence, and a third promoter sequence operably linked to the third sequence may include

발현 대상 - Cas12f1 단백질Expression target - Cas12f1 protein

상기 벡터는 Cas12f1 단백질을 발현하도록 구성된 것일 수 있다. 이때, 상기 Cas12f1 단백질은 "엔지니어링 된 CRISPR/Cas12f1 복합체" 단락에서 설명된 것과 동일한 구성 및 특징을 가진다.The vector may be configured to express the Cas12f1 protein. In this case, the Cas12f1 protein has the same composition and characteristics as those described in the section "Engineered CRISPR/Cas12f1 complex".

일 구현예로, 상기 벡터는 야생형의 Cas12f1 단백질을 발현하도록 구성된 것일 수 있다. 이때, 상기 야생형의 Cas12f1 단백질은 Cas14a1일 수 있다. 일 구현예로, 상기 벡터는 표적 핵산의 이중가닥 중 하나의 가닥만 절단하도록 변경된 Cas12f1 단백질을 발현하도록 구성된 것일 수 있다. 더 나아가, 상기 변경된 Cas12f1 단백질은 표적 핵산의 이중가닥 중 하나의 가닥만 절단할 수 있고, 절단하지 않는 가닥에 대해 베이스 에디팅(Base editing) 또는 프라임 에디팅(Prime editing)을 할 수 있도록 변경된 것일 수 있다. 일 구현예로, 상기 Cas12f1 단백질은 표적 핵산의 이중가닥 전부를 절단할 수 없도록 변경된 것일 수 있다. 더 나아가, 상기 Cas12f1 단백질은 표적 핵산의 이중가닥 전부를 절단할 수 없고, 표적 핵산에 대해 베이스 에디팅(Base editing), 프라임 에디팅(Prime editing), 또는 유전자 발현 조절 기능을 할 수 있도록 변경된 것일 수 있다.In one embodiment, the vector may be configured to express a wild-type Cas12f1 protein. In this case, the wild-type Cas12f1 protein may be Cas14a1. In one embodiment, the vector may be configured to express the Cas12f1 protein modified to cut only one strand of the double strand of the target nucleic acid. Furthermore, the altered Cas12f1 protein can cut only one of the double strands of the target nucleic acid, and base editing or prime editing can be performed on the uncleaved strand. . In one embodiment, the Cas12f1 protein may be modified so that it cannot cut all double-stranded strands of the target nucleic acid. Furthermore, the Cas12f1 protein cannot cut all double-strands of the target nucleic acid, and may be modified to perform a function of base editing, prime editing, or gene expression regulation for the target nucleic acid. .

발현 대상 - 엔지니어링 된 Cas12f1 가이드 RNAExpression target - engineered Cas12f1 guide RNA

상기 벡터는 엔지니어링 된 Cas12f1 가이드 RNA를 발현하도록 구성된 것일 수 있다. 상기 엔지니어링 된 Cas12f1 가이드 RNA는 "엔지니어링 된 Cas12f1 가이드 RNA" 단락에서 설명한 엔지니어링 된 Cas12f1 가이드 RNA와 동일한 특징 및 구성을 가진다. 상기 벡터는 둘 이상의 서로 다른 엔지니어링 된 Cas12f1 가이드 RNA를 발현하도록 구성된 것일 수 있다.The vector may be configured to express an engineered Cas12f1 guide RNA. The engineered Cas12f1 guide RNA has the same characteristics and configuration as the engineered Cas12f1 guide RNA described in the section "Engineered Cas12f1 guide RNA". The vector may be configured to express two or more different engineered Cas12f1 guide RNAs.

발현 대상 - 부가 구성 요소Expression Target - Additional Components

상기 벡터는 전술한 발현 대상 외, NLS, 태그 단백질 등의 부가 구성 요소를 발현하도록 구성된 것일 수 있다. 일 구현예로, 상기 부가 구성 요소는 상기 Cas12f1, 변형된 Cas12f1, 및/또는 엔지니어링 된 Cas12f1 가이드 RNA와는 독립적으로 발현될 수 있다. 또 다른 구현예로, 상기 부가 구성 요소는 상기 Cas12f1, 변형된 Cas12f1, 및/또는 엔지니어링 된 Cas12f1 가이드 RNA와 연결되어 발현될 수 있다. 이때, 상기 부가 구성 요소는 CRISPR/Cas 시스템을 발현시키고자 할 때 일반적으로 발현시키는 구성 요소일 수 있으며, 공지기술을 참조할 수 있다. 상기 부가 구성 요소는 "배경기술 - CRISPR/Cas 시스템 발현 벡터 설계" 섹션에서 설명된 구성 요소 중 하나 이상일 수 있다.The vector may be configured to express additional components such as NLS and tag proteins in addition to the above-described expression target. In one embodiment, the additional component may be expressed independently of the Cas12f1, the modified Cas12f1, and/or the engineered Cas12f1 guide RNA. In another embodiment, the additional component may be expressed in connection with the Cas12f1, the modified Cas12f1, and/or the engineered Cas12f1 guide RNA. In this case, the additional component may be a component that is generally expressed when expressing the CRISPR/Cas system, and known techniques may be referred to. The additional component may be one or more of the components described in the section "Background - CRISPR/Cas system expression vector design".

벡터 구성 - Cas12f1 단백질 발현 서열Vector Construction - Cas12f1 protein expression sequence

상기 벡터 서열은 상기 Cas12f1 단백질을 암호화하는 핵산 서열을 포함할 수 있다. 이때, 상기 Cas12f1 단백질은 "엔지니어링 된 CRISPR/Cas12f1 복합체" 단락에서 설명된 것과 동일한 구성 및 특징을 가진다.The vector sequence may include a nucleic acid sequence encoding the Cas12f1 protein. In this case, the Cas12f1 protein has the same composition and characteristics as those described in the section "Engineered CRISPR/Cas12f1 complex".

일 구현예로, 상기 벡터의 서열은 야생형의 Cas12f1 단백질을 암호화하는 서열을 포함할 수 있다. 이때, 상기 야생형의 Cas12f1 단백질은 Cas14a1일 수 있다. 일 구현예로, 상기 벡터의 서열은 Cas12f1 단백질을 암호화하는 인간 코돈 최적화된 핵산 서열을 포함할 수 있다. 이때, 상기 Cas12f1 단백질을 암호화하는 인간 코돈 최적화된 핵산 서열은 Cas14a1 단백질을 암호화하는 인간 코돈 최적화된 핵산 서열일 수 있다. 일 구현예로, 상기 벡터의 서열은 변형된 Cas12f1 단백질 또는 Cas12f1 융합 단백질을 암호화하는 서열을 포함할 수 있다. 일 구현예로, 상기 벡터의 서열은 표적 핵산의 이중가닥 중 하나의 가닥만 절단할 수 있고, 절단하지 않는 가닥에 대해 베이스 에디팅(Base editing) 또는 프라임 에디팅(Prime editing)을 할 수 있도록 변경된 Cas12f1 단백질을 암호화하는 서열을 포함할 수 있다. 일 구현예로, 상기 Cas12f1 단백질은 표적 핵산의 이중가닥 전부를 절단할 수 없고, 표적 핵산에 대해 베이스 에디팅(Base editing), 프라임 에디팅(Prime editing), 또는 유전자 발현 조절 기능을 할 수 있도록 변경된 Cas12f1 단백질을 암호화하는 서열을 포함할 수 있다.In one embodiment, the sequence of the vector may include a sequence encoding a wild-type Cas12f1 protein. In this case, the wild-type Cas12f1 protein may be Cas14a1. In one embodiment, the sequence of the vector may include a human codon-optimized nucleic acid sequence encoding the Cas12f1 protein. In this case, the human codon-optimized nucleic acid sequence encoding the Cas12f1 protein may be a human codon-optimized nucleic acid sequence encoding the Cas14a1 protein. In one embodiment, the sequence of the vector may include a sequence encoding a modified Cas12f1 protein or a Cas12f1 fusion protein. In one embodiment, the sequence of the vector can cut only one strand among the double strands of the target nucleic acid, and the non-cleaved strand Cas12f1 is modified to perform base editing or prime editing. It may include a sequence encoding a protein. In one embodiment, the Cas12f1 protein cannot cut all double-strands of the target nucleic acid, and the Cas12f1 modified to function for base editing, prime editing, or gene expression regulation of the target nucleic acid. It may include a sequence encoding a protein.

벡터 구성 - 엔지니어링 된 Cas12f1 가이드 RNA 발현 서열Vector Construction - Engineered Cas12f1 Guide RNA Expression Sequence

일 구현예로, 상기 벡터의 서열은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 서열을 포함할 수 있다. 예를 들어, 상기 벡터의 서열은 서열번호 210 내지 서열번호 258, 서열번호 381 내지 서열번호 393, 서열번호 395 내지 서열번호 407, 서열번호 409 내지 서열번호 421, 및 서열번호 436 내지 서열번호 439로 이뤄진 군에서 선택된 서열을 포함할 수 있다.In one embodiment, the sequence of the vector may include a sequence encoding the engineered Cas12f1 guide RNA. For example, the sequence of the vector is SEQ ID NO: 210 to SEQ ID NO: 258, SEQ ID NO: 381 to SEQ ID NO: 393, SEQ ID NO: 395 to SEQ ID NO: 407, SEQ ID NO: 409 to SEQ ID NO: 421, and SEQ ID NO: 436 to SEQ ID NO: 439 It may comprise a sequence selected from the group consisting of.

일 구현예로, 상기 벡터의 서열은 둘 이상의 서로 다른 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 서열을 포함할 수 있다. 예를 들어, 상기 벡터의 서열은 서열번호 210 내지 서열번호 258, 서열번호 381 내지 서열번호 393, 서열번호 395 내지 서열번호 407, 서열번호 409 내지 서열번호 421, 및 서열번호 436 내지 서열번호 439로 이뤄진 군에서 각각 선택된 제1 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 서열, 및 제2 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 서열을 포함할 수 있다.In one embodiment, the sequence of the vector may include sequences encoding two or more different engineered Cas12f1 guide RNAs. For example, the sequence of the vector is SEQ ID NO: 210 to SEQ ID NO: 258, SEQ ID NO: 381 to SEQ ID NO: 393, SEQ ID NO: 395 to SEQ ID NO: 407, SEQ ID NO: 409 to SEQ ID NO: 421, and SEQ ID NO: 436 to SEQ ID NO: 439 It may include a sequence encoding a first engineered Cas12f1 guide RNA, each selected from the group consisting of, and a sequence encoding a second engineered Cas12f1 guide RNA.

벡터 구성 - 프로모터 서열Vector Construction - Promoter Sequence

상기 벡터 서열은 각 구성요소를 암호화하는 서열에 작동 가능하게 연결된 프로모터 서열을 포함한다. 구체적으로, 상기 프로모터 서열은 "배경기술 - CRISPR/Cas 시스템 발현 벡터 설계" 섹션 중 프로모터 부분에 개시된 프로모터 중 하나일 수 있으나, 이에 제한되지 않는다.The vector sequence includes a promoter sequence operably linked to a sequence encoding each element. Specifically, the promoter sequence may be, but is not limited to, one of the promoters disclosed in the promoter part of the "Background - CRISPR/Cas system expression vector design" section.

일 구현예로, 상기 벡터 서열은 Cas12f1 단백질을 암호화하는 서열, 및 프로모터 서열을 포함할 수 있다. 이때, 상기 프로모터 서열은 상기 Cas12f1 단백질을 암호화하는 서열과 작동 가능하게 연결된이다. 일 구현예로, 상기 벡터 서열은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 서열, 및 프로모터 서열을 포함할 수 있다. 이때, 상기 프로모터 서열은 상기 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 서열과 작동 가능하게 연결된이다. 일 구현예로, 상기 벡터 서열은 Cas12f1 단백질을 암호화하는 서열, 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 서열, 및 프로모터 서열을 포함할 수 있다. 이때, 상기 프로모터 서열은 상기 Cas12f1 단백질을 암호화하는 서열, 및 상기 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 서열과 작동적으로 연결되며, 상기 프로모터 서열로 인해 활성화된 전사 인자가 상기 Cas12f1 단백질 및 상기 엔지니어링 된 Cas12f1 가이드 RNA를 발현시킨다.In one embodiment, the vector sequence may include a sequence encoding a Cas12f1 protein, and a promoter sequence. In this case, the promoter sequence is operably linked to the sequence encoding the Cas12f1 protein. In one embodiment, the vector sequence may include a sequence encoding the engineered Cas12f1 guide RNA, and a promoter sequence. In this case, the promoter sequence is operably linked with the sequence encoding the engineered Cas12f1 guide RNA. In one embodiment, the vector sequence may include a sequence encoding a Cas12f1 protein, a sequence encoding an engineered Cas12f1 guide RNA, and a promoter sequence. In this case, the promoter sequence is operatively linked with the sequence encoding the Cas12f1 protein and the engineered Cas12f1 guide RNA, and the transcription factor activated due to the promoter sequence is the Cas12f1 protein and the engineered Cas12f1 Guide RNA is expressed.

벡터 구성 - 둘 이상의 프로모터 서열 포함 가능Vector construction - can contain more than one promoter sequence

일 구현예로, 상기 벡터 서열은 Cas12f1 단백질을 암호화하는 제1 서열, 제1 프로모터 서열, 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 제2 서열, 및 제2 프로모터 서열을 포함할 수 있다. 이때, 상기 제1 프로모터 서열은 상기 제1 서열과 작동적으로 연결되고, 상기 제2 프로모터 서열은 상지 제2 서열과 작동적으로 연결되며, 상기 제1 프로모터 서열에 의해 상기 제1 서열의 전사가 유도되고, 상기 제2 프로모터 서열에 의해 상기 제2 서열의 전사가 유도된다. 이때, 상기 제1 프로모터 및 상기 제2 프로모터는 동일한 종류의 프로모터일 수 있다. 이때, 상기 제1 프로모터 및 상기 제2 프로모터는 상이한 종류의 프로모터일 수 있다.In one embodiment, the vector sequence may include a first sequence encoding a Cas12f1 protein, a first promoter sequence, a second sequence encoding an engineered Cas12f1 guide RNA, and a second promoter sequence. In this case, the first promoter sequence is operably linked to the first sequence, the second promoter sequence is operatively linked to the upper limb second sequence, and transcription of the first sequence is performed by the first promoter sequence. induced, and transcription of the second sequence is induced by the second promoter sequence. In this case, the first promoter and the second promoter may be the same type of promoter. In this case, the first promoter and the second promoter may be different types of promoters.

일 구현예로, 상기 벡터 서열은 Cas12f1 단백질을 암호화하는 제1 서열, 제1 프로모터 서열, 제1 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 제2 서열, 제2 프로모터 서열, 제2 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 제3 서열, 및 제3 프로모터 서열 을 포함할 수 있다. 이때, 상기 제1 프로모터 서열은 상기 제1 서열과 작동적으로 연결되고, 상기 제2 프로모터 서열은 상기 제2 서열과 작동적으로 연결되고, 상기 제3 프로모터 서열은 상기 제3 서열과 작동적으로 연결되며, 상기 제1 프로모터 서열에 의해 상기 제1 서열의 전사가 유도되고, 상기 제2 프로모터 서열에 의해 상기 제2 서열의 전사가 유도되며, 상기 제3 프로모터 서열에 의해 상기 제3 서열의 전사가 유도된다. 이때, 상기 제2 프로모터 및 상기 제3 프로모터는 동일한 종류의 프로모터일 수 있다. 구체적으로, 상기 제2 프로모터 서열 및 상기 제3 프로모터 서열은 U6 프로모터 서열일 수 있으나, 이에 제한되는 것은 아니다. 이때, 상기 제2 프로모터 및 상기 제3 프로모터는 상이한 종류의 프로모터일 수 있다. 구체적으로, 상기 제2 프로모터는 U6 프로모터 서열, 상기 제3 프로모터는 H1 프로모터 서열일 수 있으나, 이에 제한되는 것은 아니다.In one embodiment, the vector sequence comprises a first sequence encoding a Cas12f1 protein, a first promoter sequence, a second sequence encoding a first engineered Cas12f1 guide RNA, a second promoter sequence, and a second engineered Cas12f1 guide RNA. a third sequence encoding, and a third promoter sequence. wherein the first promoter sequence is operatively linked with the first sequence, the second promoter sequence is operably linked with the second sequence, and the third promoter sequence is operably linked with the third sequence. linked, wherein transcription of the first sequence is induced by the first promoter sequence, transcription of the second sequence is induced by the second promoter sequence, and transcription of the third sequence is induced by the third promoter sequence is induced In this case, the second promoter and the third promoter may be the same type of promoter. Specifically, the second promoter sequence and the third promoter sequence may be a U6 promoter sequence, but is not limited thereto. In this case, the second promoter and the third promoter may be different types of promoters. Specifically, the second promoter may be a U6 promoter sequence, and the third promoter may be a H1 promoter sequence, but is not limited thereto.

벡터 구성 - 종결 신호Vector Construction - Termination Signal

상기 벡터는 상기 프로모터 서열과 작동 가능하게 연결된 종결 신호를 포함할 수 있다. 이때, 상기 종결 신호는 "배경기술 - CRISPR/Cas 시스템 발현 벡터 설계" 섹션의 종결 신호 부분에서 개시된 종결 신호 중 하나일 수 있으나, 이에 제한되지 않는다. 상기 종결 신호는, 프로모터 서열의 종류에 따라 달라질 수 있다.The vector may include a termination signal operably linked to the promoter sequence. In this case, the termination signal may be one of the termination signals disclosed in the termination signal portion of the "Background - CRISPR/Cas system expression vector design" section, but is not limited thereto. The termination signal may vary depending on the type of promoter sequence.

일 구현예로, 상기 벡터 서열이 U6 프로모터 서열을 포함하는 경우, 상기 U6 프로모터 서열과 작동 가능하게 연결된, 티미딘 연속 서열이 종결 신호로 작용할 수 있다. 일 구현예로, 상기 티미딘 연속 서열은 5개 이상의 티미딘이 연속으로 연결된 서열일 수 있다. 일 구현예로, 상기 벡터 서열이 H1 프로모터 서열을 포함하는 경우, 상기 H1 프로모터 서열과 작동 가능하게 연결된, 티미딘 연속 서열이 종결 신호로 작용할 수 있다. 일 구현예로, 상기 티미딘 연속 서열은 5개 이상의 티미딘이 연속으로 연결된 서열일 수 있다.In one embodiment, when the vector sequence includes a U6 promoter sequence, a thymidine sequence operably linked to the U6 promoter sequence may serve as a termination signal. In one embodiment, the thymidine sequence may be a sequence in which 5 or more thymidine is continuously linked. In one embodiment, when the vector sequence includes an H1 promoter sequence, a thymidine sequence operably linked to the H1 promoter sequence may serve as a termination signal. In one embodiment, the thymidine sequence may be a sequence in which 5 or more thymidine is continuously linked.

벡터 구성 - 기타 구성Vector Composition - Miscellaneous Composition

상기 벡터 서열은 상기 구성 외, 목적에 따라 필요한 구성요소를 포함할 수 있다.The vector sequence may include elements necessary according to the purpose in addition to the above construction.

일 구현예로, 상기 벡터 서열은 조절/제어 구성요소 서열, 및/또는 부가 구성 요소 서열을 포함할 수 있다. 일 구현예로, 상기 부가 구성 요소는 형질주입된 세포를 비형질주입 세포로부터 구별하기 위한 목적으로 부가된 것일 수 있다. 이때, 상기 조절/제어 구성요소 서열 및 부가 구성 요소는 "배경기술 - CRISPR/Cas 시스템 발현 벡터 설계" 섹션에 개시된 것 중 하나일 수 있으나, 이에 제한되는 것은 아니다.In one embodiment, the vector sequence may include regulatory/control element sequences, and/or additional element sequences. In one embodiment, the additional component may be added for the purpose of distinguishing the transfected cells from the non-transfected cells. In this case, the regulatory/control element sequence and additional element may be one of those disclosed in the "Background - CRISPR/Cas system expression vector design" section, but is not limited thereto.

벡터 종류 - 바이러스 벡터Vector Type - Virus Vector

상기 벡터는 바이러스 벡터일 수 있다.The vector may be a viral vector.

일 구현예로, 상기 바이러스 벡터는 레트로바이러스, 렌티바이러스, 아데노바이러스, 아데노-연관 바이러스, 백시니아바이러스, 폭스바이러스 및 단순포진 바이러스로 구성된 군에서 선택되는 하나 이상일 수 있다. 일 구현예로, 상기 바이러스 벡터는 아데노-연관 바이러스일 수 있다.In one embodiment, the viral vector may be one or more selected from the group consisting of retroviruses, lentiviruses, adenoviruses, adeno-associated viruses, vacciniaviruses, poxviruses, and herpes simplex viruses. In one embodiment, the viral vector may be an adeno-associated virus.

벡터 종류 - 비바이러스 벡터Vector type - non-viral vector

상기 벡터는 비바이러스 벡터일 수 있다. 일 구현예로, 상기 비바이러스 벡터는 플라스미드, 파지, 네이키드 DNA, DNA 복합체, 및 mRNA로 구성된 군에서 선택되는 1 이상일 수 있다. 일 구현예로, 상기 플라스미드는 pcDNA 시리즈, pSC101, pG1796, pACYC177, ColE1, pKT230, pME290, pBR322, pUC8/9, pUC6, pBD9, pHC79, pIJ61, pLAFR1, pHV14, pGEX 시리즈, pET 시리즈, 및 pUC19으로 이뤄진 군에서 선택된 것일 수 있다. 일 구현예로, 상기 파지는 일 구현예로, 상기 파지는 λgt4λB, λ-Charon, λΔz1, 및 M13으로 이뤄진 군에서 선택된 것일 수 있다. 일 구현예로, 상기 벡터는 PCR 앰플리콘(amplicon)일 수 있다.The vector may be a non-viral vector. In one embodiment, the non-viral vector may be one or more selected from the group consisting of plasmids, phages, naked DNA, DNA complexes, and mRNA. In one embodiment, the plasmid is a pcDNA series, pSC101, pG1796, pACYC177, ColE1, pKT230, pME290, pBR322, pUC8/9, pUC6, pBD9, pHC79, pIJ61, pLAFR1, pHV14, pGEX series, pET series, and pUC19. It may be selected from the group consisting of. In one embodiment, the phage may be selected from the group consisting of λgt4λB, λ-Charon, λΔz1, and M13. In one embodiment, the vector may be a PCR amplicon.

벡터 형태 - 원형 또는 선형 벡터Vector form - circular or linear vector

상기 벡터는 원형 또는 선형 형태일 수 있다. 상기 벡터가 선형 벡터인 경우, 상기 선형 벡터 서열이 종결 신호를 따로 포함하지 않더라도, 그 3'말단에서 RNA 전사가 종결된다. 이와 비교하여, 상기 벡터가 원형 벡터인 경우, 상기 원형 벡터 서열이 종결 신호를 따로 포함하지 않는다면, RNA 전사가 종결되지 않게 된다. 따라서, 상기 벡터로 원형 벡터를 사용하는 경우, 의도한 대상을 발현하기 위해서는 각 프로모터 서열과 관련된 전사 인자에 대응하는 종결 신호가 포함되어야 한다.The vector may have a circular or linear form. When the vector is a linear vector, RNA transcription is terminated at its 3' end even if the linear vector sequence does not separately include a termination signal. In comparison, when the vector is a circular vector, RNA transcription is not terminated unless the circular vector sequence separately includes a termination signal. Therefore, when a circular vector is used as the vector, a termination signal corresponding to a transcription factor related to each promoter sequence should be included in order to express an intended target.

일 구현예로, 상기 벡터는 선형 벡터일 수 있다. 일 구현예로, 상기 벡터는 선형의 앰플리콘일 수 있다. 일 구현예로, 상기 벡터는 서열번호 267 내지 서열번호 276에서 선택된 서열 및 서열번호 210 내지 258, 서열번호 381 내지 393, 서열번호 396 내지 서열번호 407, 및 서열번호 409 내지 서열번호 421로 이뤄진 군에서 선택된 서열을 포함하는 선형 벡터일 수 있다. 일 구현예로, 상기 벡터는 원형 벡터일 수 있다. 일 구현예로, 상기 벡터는 서열번호 267 내지 서열번호 276에서 선택된 서열, 및 서열번호 210 내지 258, 서열번호 381 내지 393, 서열번호 396 내지 서열번호 407, 및 서열번호 409 내지 서열번호 421로 이뤄진 군에서 선택된 서열을 포함하는 원형 벡터일 수 있다.In one embodiment, the vector may be a linear vector. In one embodiment, the vector may be a linear amplicon. In one embodiment, the vector comprises a sequence selected from SEQ ID NO: 267 to SEQ ID NO: 276 and SEQ ID NO: 210 to 258, SEQ ID NO: 381 to 393, SEQ ID NO: 396 to SEQ ID NO: 407, and SEQ ID NO: 409 to SEQ ID NO: 421 It may be a linear vector comprising a sequence selected from In one embodiment, the vector may be a circular vector. In one embodiment, the vector consists of a sequence selected from SEQ ID NO: 267 to SEQ ID NO: 276, and SEQ ID NO: 210 to 258, SEQ ID NO: 381 to 393, SEQ ID NO: 396 to SEQ ID NO: 407, and SEQ ID NO: 409 to SEQ ID NO: 421 It may be a circular vector comprising a sequence selected from the group.

벡터 - 서열 예시Vector - Sequence Example

일 구현예로, 상기 벡터의 서열은 서열번호 267 내지 서열번호 276에서 선택된 서열, 및 서열번호 210 내지 258, 서열번호 381 내지 393, 서열번호 396 내지 서열번호 407, 및 서열번호 409 내지 서열번호 421로 이뤄진 군에서 선택된 서열을 포함할 수 있다.In one embodiment, the sequence of the vector comprises a sequence selected from SEQ ID NO: 267 to SEQ ID NO: 276, and SEQ ID NO: 210 to 258, SEQ ID NO: 381 to 393, SEQ ID NO: 396 to SEQ ID NO: 407, and SEQ ID NO: 409 to SEQ ID NO: 421 It may include a sequence selected from the group consisting of.

핵산의 화학적인 변형chemical modification of nucleic acids

본 명세서에서는 엔지니어링 된 crRNA 또는 이를 암호화하는 핵산, 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화 하는 핵산, 및/또는 CRISPR/Cas12f1 시스템의 구성 요소를 발현시키기 위한 벡터 등 핵산을 포함하거나, 핵산으로 이뤄진 구성 요소를 제공한다. 이때, 상기 구성 요소에서 "핵산"이라 함은, 자연계에 존재하는 DNA, 또는 RNA일 수 있고, 상기 구성 핵산의 일부 또는 전부에 화학적 변형이 일어난, 변형된 핵산일 수 있다. 일 구현예로, 상기 구성 핵산은 자연계에 존재하는 DNA, 및/또는 RNA일 수 있다. 일 구현예로, 상기 구성 핵산은 하나 이상의 뉴클레오타이드가 화학적으로 변형된 것일 수 있다. 이때, 상기 화학적 변형은 당업자에게 알려진 핵산의 변형을 모두 포함한다. 구체적으로, 상기 화학적 변형은 (WO 2019/089820 A1)에 기재된 핵산의 변형을 모두 포함할 수 있으나, 이에 제한되는 것은 아니다.In the present specification, an engineered crRNA or a nucleic acid encoding the same, an engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and/or a vector for expressing a component of the CRISPR/Cas12f1 system. to provide. In this case, the term "nucleic acid" in the above component may be DNA or RNA existing in nature, and may be a modified nucleic acid in which some or all of the component nucleic acid is chemically modified. In one embodiment, the constituent nucleic acid may be DNA and/or RNA existing in nature. In one embodiment, the constituent nucleic acid may be one in which one or more nucleotides are chemically modified. In this case, the chemical modification includes all modifications of nucleic acids known to those skilled in the art. Specifically, the chemical modification may include all modifications of the nucleic acid described in (WO 2019/089820 A1), but is not limited thereto.

엔지니어링 된 Cas12f1 가이드 RNA를 이용한 유전자 편집 방법Gene editing method using engineered Cas12f1 guide RNA

유전자 편집 방법 개괄Overview of Gene Editing Methods

본 명세서에서는 엔지니어링 된 crRNA를 이용하여 대상 세포 내의 표적 유전자, 또는 표적 핵산을 편집하는 방법을 제공한다. 상기 표적 유전자, 또는 표적 핵산은 표적 서열을 포함한다. 상기 표적 핵산은 단일가닥 DNA, 이중가닥 DNA, 및/또는 RNA일 수 있다. 상기 유전자 편집 방법은, 엔지니어링 된 Cas12f1 가이드 RNA, 및 Cas12f1 단백질, 또는 각각을 암호화하는 핵산을 표적 유전자, 또는 표적 핵산을 포함하고 있는 대상 세포 내에 전달하는 것을 포함한다. 그 결과, 상기 대상 세포 내에 엔지니어링 된 CRISPR/Cas12f1 복합체가 주입되거나, 엔지니어링 된 CRISPR/Cas12f1 복합체의 형성이 유도되며, 상기 엔지니어링 된 CRISPR/Cas12f1 복합체에 의해 표적 유전자가 편집된다. 상기 엔지니어링 된 Cas12f1 가이드 RNA는 "엔지니어링 된 Cas12f1 가이드 RNA" 단락에서 설명된 것과 동일한 특징 및 구조를 가진다. 상기 Cas12f1 단백질은 "엔지니어링 된 CRISPR/Cas12f1 복합체" 섹션에서 설명된 Cas12f1 단백질, 및/또는 변형된 Cas12f1 단백질과 동일한 특징 및 구성을 가진다.The present specification provides a method for editing a target gene or target nucleic acid in a target cell using an engineered crRNA. The target gene, or target nucleic acid, includes a target sequence. The target nucleic acid may be single-stranded DNA, double-stranded DNA, and/or RNA. The gene editing method includes delivering an engineered Cas12f1 guide RNA, a Cas12f1 protein, or a nucleic acid encoding each of the target gene or target nucleic acid into a target cell. As a result, the engineered CRISPR/Cas12f1 complex is injected into the target cell, or the engineered CRISPR/Cas12f1 complex is induced, and the target gene is edited by the engineered CRISPR/Cas12f1 complex. The engineered Cas12f1 guide RNA has the same characteristics and structure as described in the section "Engineered Cas12f1 guide RNA". The Cas12f1 protein has the same characteristics and composition as the Cas12f1 protein described in the section "Engineered CRISPR/Cas12f1 complex", and/or the modified Cas12f1 protein.

일 구현예로, 상기 유전자 편집 방법은 Cas12f1 단백질 또는 이를 암호화하는 핵산, 및 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산을 대상 세포 내로 전달하는 것을 포함할 수 있다.In one embodiment, the gene editing method may include delivering a Cas12f1 protein or a nucleic acid encoding the same, and an engineered Cas12f1 guide RNA or a nucleic acid encoding the same into a target cell.

이때, 상기 엔지니어링 된 Cas12f1 가이드 RNA는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 포함한다.At this time, the engineered Cas12f1 guide RNA includes an engineered scaffold region, and a spacer.

이때, 상기 엔지니어링 된 스캐폴드 영역은 전술한 "엔지니어링 된 스캐폴드 영역" 단락 중 어느 하나에서 서술된 것과 동일한 특징 및 구조를 가진다. 일 예로, 상기 엔지니어링 된 스캐폴드 영역은 서열번호 186 내지 167으로 이뤄진 군에서 선택된 서열로 표현될 수 있다. 또 다른 예로, 상기 엔지니어링 된 스캐폴드 영역은 서열번호 187 내지 198으로 이뤄진 군에서 선택된 서열로 표현될 수 있다. 또 다른 예로, 상기 엔지니어링 된 스캐폴드 영역은 서열번호 199 내지 205으로 이뤄진 군에서 선택된 서열로 표현될 수 있다. 또 다른 예로, 상기 엔지니어링 된 스캐폴드 영역은 서열번호 206 내지 209으로 이뤄진 군에서 선택된 서열로 표현될 수 있다.In this case, the engineered scaffold region has the same characteristics and structure as described in any one of the above-mentioned “engineered scaffold region” paragraphs. For example, the engineered scaffold region may be represented by a sequence selected from the group consisting of SEQ ID NOs: 186 to 167. As another example, the engineered scaffold region may be represented by a sequence selected from the group consisting of SEQ ID NOs: 187 to 198. As another example, the engineered scaffold region may be represented by a sequence selected from the group consisting of SEQ ID NOs: 199 to 205. As another example, the engineered scaffold region may be represented by a sequence selected from the group consisting of SEQ ID NOs: 206 to 209.

이때, 상기 스페이서 서열은 상기 대상 세포 내에 포함된 표적 유전자, 또는 표적 핵산과 상보적으로 결합할 수 있다.In this case, the spacer sequence may complementarily bind to a target gene or target nucleic acid included in the target cell.

대상 세포target cell

일 구현예로, 상기 대상 세포는 원핵 세포일 수 있다. 일 구현예로, 상기 대상 세포는 진핵 세포일 수 있다. 구체적으로, 상기 진핵 세포는 식물 세포, 동물 세포, 및/또는 인간 세포일 수 있으나, 이에 제한되지 않는다.In one embodiment, the target cell may be a prokaryotic cell. In one embodiment, the target cell may be a eukaryotic cell. Specifically, the eukaryotic cell may be, but is not limited to, a plant cell, an animal cell, and/or a human cell.

표적 서열 결정target sequencing

CRISPR/Cas12f1 복합체로 편집하고자 하는 표적 유전자, 또는 표적 핵산, 및 표적 서열은 유전자 편집의 목적, 대상 세포 환경, Cas12f1 단백질이 인식하는 PAM 서열, 및/또는 기타 변수를 고려하여 결정할 수 있다. 이때, 적절한 길이의 표적 서열을 결정할 수 있다면, 그 방법은 특별히 제한되지 않으며, 공지된 기술을 활용할 수 있다.The target gene, or target nucleic acid, and target sequence to be edited with the CRISPR/Cas12f1 complex may be determined in consideration of the purpose of gene editing, the target cell environment, the PAM sequence recognized by the Cas12f1 protein, and/or other variables. In this case, if the target sequence of an appropriate length can be determined, the method is not particularly limited, and a known technique may be used.

표적 서열에 따른 스페이서 서열 결정Determination of spacer sequence according to target sequence

상기 표적 서열이 결정되고 나면, 이에 대응하는 스페이서 서열을 설계한다. 상기 스페이서 서열은 상기 표적 서열과 상보적으로 결합할 수 있는 서열로 설계된다. 일 구현예로, 상기 스페이서 서열은 상기 표적 유전자와 상보적으로 결합할 수 있는 서열로 설계된다. 일 구현예로, 상기 스페이서 서열은 상기 표적 핵산과 상보적으로 결합할 수 있도록 설계된다. 일 구현예로, 상기 스페이서 서열은 상기 표적 핵산의 표적 가닥 서열에 포함된 표적 서열과 상보적인 서열로 설계된다. 일 구현예로, 상기 스페이서 서열은 상기 표적 핵산의 비표적 가닥 서열에 포함된 프로토스페이서의 DNA 서열에 상응하는 RNA 서열로 설계된다. 구체적으로, 상기 스페이서 서열은, 상기 프로토스페이서 서열과 동일한 염기 서열을 가지되, 상기 염기 서열에 포함된 티미딘 각각이 모두 유리딘으로 치환된 서열로 설계된다.After the target sequence is determined, a corresponding spacer sequence is designed. The spacer sequence is designed as a sequence capable of complementary binding to the target sequence. In one embodiment, the spacer sequence is designed as a sequence capable of complementary binding to the target gene. In one embodiment, the spacer sequence is designed to complementarily bind to the target nucleic acid. In one embodiment, the spacer sequence is designed as a sequence complementary to a target sequence included in the target strand sequence of the target nucleic acid. In one embodiment, the spacer sequence is designed as an RNA sequence corresponding to the DNA sequence of the protospacer included in the non-target strand sequence of the target nucleic acid. Specifically, the spacer sequence has the same nucleotide sequence as the protospacer sequence, but is designed as a sequence in which all thymidine included in the nucleotide sequence are substituted with uridine.

표적 서열과 스페이서 서열의 상보성Complementarity of target sequence and spacer sequence

일 구현예로, 상기 스페이서 서열은 상기 표적 서열과 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 상보적인 서열일 수 있다. 일 구현예로, 상기 스페이서 서열은 상기 표적 서열과 바로 이전 문장에서 선택된 수치 범위 내 상보적인 서열일 수 있다. 예를 들어, 상기 스페이서 서열은 상기 표적 서열과 60% 내지 90% 상보적인 서열일 수 있다. 또 다른 예를 들어, 상기 스페이서 서열은 상기 표적 서열과 90% 내지 100% 상보적인 서열일 수 있다.In one embodiment, the spacer sequence and the target sequence 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72 %, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% complementary sequence. In one embodiment, the spacer sequence may be a sequence complementary to the target sequence within a numerical range selected in the immediately preceding sentence. For example, the spacer sequence may be a sequence that is 60% to 90% complementary to the target sequence. As another example, the spacer sequence may be a sequence that is 90% to 100% complementary to the target sequence.

표적 서열과 스페이서 서열 간 미스매치 개수Number of mismatches between target sequence and spacer sequence

일 구현예로, 상기 스페이서 서열은 상기 표적 서열과 0개, 1개, 2개, 3개, 4개, 5개, 6개, 7개, 8개, 9개, 또는 10개의 미스매치를 가지는 상보적인 서열일 수 있다. 일 구현예로, 상기 스페이서 서열은 바로 이전 문장에서 선택된 수치 범위 내의 미스매치를 가질 수 있다. 예를 들어, 상기 스페이서 서열은 상기 표적 서열과 1개 내지 5개의 미스매치를 가질 수 있다. 또 다른 예를 들어, 상기 스페이서 서열은 상기 표적 서열과 6개 내지 10개의 미스매치를 기질 수 있다.In one embodiment, the spacer sequence has 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 mismatches with the target sequence. It may be a complementary sequence. In one embodiment, the spacer sequence may have a mismatch within the numerical range selected in the immediately preceding sentence. For example, the spacer sequence may have 1 to 5 mismatches with the target sequence. As another example, the spacer sequence may substrate 6 to 10 mismatches with the target sequence.

CRISPR/Cas12f1 복합체 이용Using the CRISPR/Cas12f1 complex

본 명세서에서 제공하는 유전자 편집 방법은 엔지니어링 된 CRISPR/Cas12f1 복합체가 표적 특이적으로 유전자, 또는 핵산을 절단하는 활성을 가지는 점을 이용한다. 상기 엔지니어링 된 CRISPR/Cas12f1 복합체는 "엔지니어링 된 CRISPR/Cas12f1 복합체" 섹션에서 설명된 엔지니어링 된 CRISPR/Cas12f1 복합체와 동일한 특징 및 구성을 가진다.The gene editing method provided herein utilizes the fact that the engineered CRISPR/Cas12f1 complex has target-specific gene or nucleic acid cleavage activity. The engineered CRISPR/Cas12f1 complex has the same characteristics and composition as the engineered CRISPR/Cas12f1 complex described in the section "Engineered CRISPR/Cas12f1 complex".

CRISRP/Cas12f1 복합체 각 구성 요소의 세포 내 전달Intracellular delivery of each component of the CRISRP/Cas12f1 complex

본 명세서에서 제공하는 유전자 편집 방법은 대상 세포 내에서 엔지니어링 된 CRISPR/Cas12f1 복합체가 표적 유전자 또는 표적 핵산과 접촉하는 것을 전제로 한다. 따라서, 상기 엔지니어링 된 CRISPR/Cas12f1 복합체가 상기 표적 유전자, 또는 표적 핵산과 접촉하는 것을 유도하기 위해, 상기 유전자 편집 방법은 상기 엔지니어링 된 CRISPR/Cas12f1 복합체의 각 구성요소를 대상 세포 내에 전달하는 것을 포함한다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산 및 Cas12f1 단백질 또는 이를 암호화하는 핵산을 대상 세포 내에 전달하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 및 Cas12f1 단백질을 대상 세포 내에 전달하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 및 Cas12f1 단백질을 대상 세포 내에 전달하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 및 Cas12f1 단백질을 암호화하는 핵산을 대상 세포 내에 전달하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 및 Cas12f1 단백질을 암호화하는 핵산을 대상 세포 내에 전달하는 것을 포함할 수 있다. 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및 Cas12f1 단백질 또는 이를 암호화하는 핵산은 다양한 전달 형태로, 다양한 전달 방법을 이용하여 대상 세포 내에 전달될 수 있다.The gene editing method provided herein is based on the premise that the engineered CRISPR/Cas12f1 complex in a target cell contacts a target gene or target nucleic acid. Therefore, in order to induce the engineered CRISPR/Cas12f1 complex to contact the target gene or target nucleic acid, the gene editing method comprises delivering each component of the engineered CRISPR/Cas12f1 complex into a target cell. . In one embodiment, the gene editing method may include delivering an engineered Cas12f1 guide RNA or a nucleic acid encoding the same and a Cas12f1 protein or a nucleic acid encoding the same into a target cell. In one embodiment, the gene editing method may include delivering engineered Cas12f1 guide RNA and Cas12f1 protein into a target cell. In one embodiment, the gene editing method may include delivering a nucleic acid encoding an engineered Cas12f1 guide RNA and a Cas12f1 protein into a target cell. In one embodiment, the gene editing method may include delivering an engineered Cas12f1 guide RNA and a nucleic acid encoding a Cas12f1 protein into a target cell. In one embodiment, the gene editing method may include delivering a nucleic acid encoding an engineered Cas12f1 guide RNA and a nucleic acid encoding a Cas12f1 protein into a target cell. The engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and the Cas12f1 protein or a nucleic acid encoding the same may be delivered into a target cell in various delivery forms and using various delivery methods.

전달 형태 - RNPDelivery mode - RNP

상기 전달 형태로, 엔지니어링 된 Cas12f1 가이드 RNA 및 Cas12f1 단백질이 결합한 리보뉴클레오프로틴 입자(Ribonucleoprotein particle, RNP)를 이용할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 및 Cas12f1 단백질이 결합한 CRISPR/Cas12f1 복합체를 대상 세포 내에 주입하는 것을 포함할 수 있다.As the delivery form, engineered Cas12f1 guide RNA and ribonucleoprotein particle (RNP) to which Cas12f1 protein is bound can be used. In one embodiment, the gene editing method may include injecting a CRISPR/Cas12f1 complex to which an engineered Cas12f1 guide RNA and Cas12f1 protein are bound into a target cell.

전달 형태 - 비바이러스 벡터Mode of delivery - non-viral vector

또 다른 전달 형태로, 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열 및 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 비바이러스 벡터를 이용할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열 및 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 비바이러스 벡터를 대상 세포 내에 주입하는 것을 포함할 수 있다. 구체적으로, 상기 비바이러스 벡터는 플라스미드, 네이키드 DNA, DNA 복합체, 또는 mRNA일 수 있으나, 이에 제한되는 것은 아니다. 또 다른 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 포함하는 제1 비바이러스 벡터, 및 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 제2 비바이러스 벡터를 대상 세포 내에 주입하는 것을 포함할 수 있다. 구체적으로, 상기 제1 비바이러스 벡터 및 상기 제2 비바이러스 벡터는 각각 플라스미드, 네이키드 DNA, DNA 복합체, 및 mRNA로 이뤄진 군에서 선택된 하나일 수 있으나, 이에 제한되는 것은 아니다.As another form of delivery, a non-viral vector comprising a nucleic acid sequence encoding an engineered Cas12f1 guide RNA and a nucleic acid sequence encoding a Cas12f1 protein may be used. In one embodiment, the gene editing method may include injecting a non-viral vector comprising a nucleic acid sequence encoding an engineered Cas12f1 guide RNA and a nucleic acid sequence encoding a Cas12f1 protein into a target cell. Specifically, the non-viral vector may be a plasmid, naked DNA, DNA complex, or mRNA, but is not limited thereto. In another embodiment, the gene editing method comprises a first non-viral vector comprising a nucleic acid sequence encoding an engineered Cas12f1 guide RNA, and a second non-viral vector comprising a nucleic acid sequence encoding a Cas12f1 protein into a target cell. This may include injecting. Specifically, the first non-viral vector and the second non-viral vector may each be one selected from the group consisting of plasmid, naked DNA, DNA complex, and mRNA, but is not limited thereto.

전달 형태 - 바이러스 벡터Transmission mode - viral vector

또 다른 전달 형태로, 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열 및 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 바이러스 벡터를 이용할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열 및 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 바이러스 벡터를 대상 세포 내에 주입하는 것을 포함할 수 있다. 구체적으로, 상기 바이러스 벡터는 레트로바이러스, 렌티바이러스, 아데노바이러스, 아데노-연관 바이러스, 백시니아바이러스, 폭스바이러스 및 단순포진 바이러스로 구성된 군에서 선택된 하나일 수 있으나, 이에 제한되는 것은 아니다. 일 구현예로, 상기 바이러스 벡터는 아데노-연관 바이러스일 수 있다.As another form of delivery, a viral vector comprising a nucleic acid sequence encoding an engineered Cas12f1 guide RNA and a nucleic acid sequence encoding a Cas12f1 protein may be used. In one embodiment, the gene editing method may include injecting a viral vector comprising a nucleic acid sequence encoding an engineered Cas12f1 guide RNA and a nucleic acid sequence encoding a Cas12f1 protein into a target cell. Specifically, the viral vector may be one selected from the group consisting of retrovirus, lentivirus, adenovirus, adeno-associated virus, vaccinia virus, poxvirus and herpes simplex virus, but is not limited thereto. In one embodiment, the viral vector may be an adeno-associated virus.

또 다른 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열을 포함하는 제1 바이러스 벡터, 및 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 제2 바이러스 벡터를 대상 세포 내에 주입하는 것을 포함할 수 있다. 구체적으로, 상기 제1 바이러스 벡터 및 제2 바이러스 벡터는 각각 레트로바이러스, 렌티바이러스, 아데노바이러스, 아데노-연관 바이러스, 백시니아바이러스, 폭스바이러스 및 단순포진 바이러스로 구성된 군에서 선택된 하나일 수 있으나, 이에 제한되는 것은 아니다.In another embodiment, the gene editing method comprises injecting a first viral vector comprising a nucleic acid sequence encoding an engineered Cas12f1 guide RNA, and a second viral vector comprising a nucleic acid sequence encoding a Cas12f1 protein into a target cell. may include Specifically, the first viral vector and the second viral vector may be one selected from the group consisting of retrovirus, lentivirus, adenovirus, adeno-associated virus, vaccinia virus, poxvirus, and herpes simplex virus, respectively. It is not limited.

전달 방법 - 일반적인 전달 수단Method of delivery - common means of delivery

상기 전달 방법은, 세포 내로 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및 Cas12f1 단백질 또는 이를 암호화하는 핵산을 적절한 전달 형태로 세포 내로 전달할 수 있는 것이라면 특별히 제한되지 않는다. 일 구현예로, 상기 전달 방법은 전기천공법, 유전자총, 초음파천공법, 자기주입법(magnetofection), 및/또는 일시적인 세포 압축 또는 스퀴징일 수 있다.The delivery method is not particularly limited as long as it can deliver the Cas12f1 guide RNA engineered into a cell or a nucleic acid encoding the same, and the Cas12f1 protein or a nucleic acid encoding the same into a cell in an appropriate delivery form. In one embodiment, the delivery method may be electroporation, gene gun, sonoporation, magnetofection, and/or transient cell compression or squeezing.

전달 방법 - 나노파티클Delivery Method - Nanoparticles

상기 전달 방법은, 상기 CRISPR/Cas12f1 시스템에 포함된 적어도 하나의 구성요소를 나노파티클을 이용하여 전달하는 것일 수 있다. 이때, 상기 전달 방법은 당업계 통상의 기술자가 적절히 선택할 수 있는 공지된 방법일 수 있다. 예를 들어, 상기 나노파티클 전달 방법은 (WO 2019/089820 A1)에 개시된 방법일 수 있으나, 이에 제한되는 것은 아니다.The delivery method may be to deliver at least one component included in the CRISPR/Cas12f1 system using nanoparticles. In this case, the delivery method may be a known method that can be appropriately selected by those skilled in the art. For example, the nanoparticle delivery method may be a method disclosed in (WO 2019/089820 A1), but is not limited thereto.

일 구현예로, 상기 전달 방법은 Cas12f1 단백질 또는 이를 암호화하는 핵산 및/또는 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산을 나노파티클을 이용하여 전달하는 것일 수 있다. 일 구현예로, 상기 전달 방법은 Cas12f1 단백질 또는 이를 암호화하는 핵산, 제1 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및/또는 제2 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산을 나노파티클을 이용하여 전달하는 것일 수 있다. 이때, 상기 전달 방법은 양이온성 리포좀법, 초산 리튬-DMSO, 지질-매개 형질감염(transfection), 인산칼슘 침전법(precipitation), lipofection, PEI(Polyethyleneimine)-매개 형질감염, DEAE-dextran 매개 형질감염, 및/또는 나노파티클-매개 핵산 전달(Panyam et. , al Adv Drug Deliv Rev. 2012 Sep 13. pii: S0169-409X(12)00283-9. doi: 10.1016/j.addr.2012.09.023 참조)일 수 있으나, 이에 제한되는 것은 아니다. 이때, 상기 CRISPR/Cas12f1 시스템의 구성 요소는 RNP, 비바이러스 벡터, 및/또는 바이러스 벡터 형태일 수 있다. 예를 들어, 상기 CRISPR/Cas12f1 시스템의 구성 요소는 각 구성요소를 암호화하는 mRNA 형태일 수 있으나, 이에 제한되는 것은 아니다.In one embodiment, the delivery method may be to deliver a Cas12f1 protein or a nucleic acid encoding the same and/or an engineered Cas12f1 guide RNA or a nucleic acid encoding the same using nanoparticles. In one embodiment, the delivery method comprises a Cas12f1 protein or a nucleic acid encoding the same, a first engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and/or a second engineered Cas12f1 guide RNA or a nucleic acid encoding the same using nanoparticles. may be transmitted. At this time, the delivery method is cationic liposome method, lithium acetate-DMSO, lipid-mediated transfection, calcium phosphate precipitation (precipitation), lipofection, PEI (Polyethyleneimine)-mediated transfection, DEAE-dextran-mediated transfection , and/or nanoparticle-mediated nucleic acid delivery (see Panyam et., al Adv Drug Deliv Rev. 2012 Sep 13. pii: S0169-409X(12)00283-9. doi: 10.1016/j.addr.2012.09.023) may be, but is not limited thereto. In this case, the components of the CRISPR/Cas12f1 system may be in the form of RNPs, non-viral vectors, and/or viral vectors. For example, the components of the CRISPR/Cas12f1 system may be in the form of mRNA encoding each component, but is not limited thereto.

전달 형태 및 방법 - 조합 가능Delivery form and method - Combination possible

상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및 Cas12f1 단백질 또는 이를 암호화하는 핵산을 세포 내 전달하는 것을 포함하는데, 이때 상기 구성의 전달 형태 및/또는 전달 방법은 서로 동일하거나 상이할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산은 제1 전달 형태로 전달하고, Cas12f1 단백질 또는 이를 암호화하는 핵산은 제2 전달 형태로 전달하는 것을 포함할 수 있다. 이때, 상기 제1 전달 형태 및 상기 제2 전달 형태는 각각 전술한 전달 형태 중 어느 하나일 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산은 제1 전달 방법으로 전달하고, Cas12f1 단백질 또는 이를 암호화하는 핵산은 제2 전달 방법으로 전달하는 것을 포함할 수 있다. 이때, 상기 제1 전달 방법 및 상기 제2 전달 방법은 각각 전술한 전달 방법 중 어느 하나일 수 있다.The gene editing method includes intracellular delivery of an engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and a Cas12f1 protein or a nucleic acid encoding the same, wherein the delivery form and/or delivery method of the construct may be the same or different from each other. can In one embodiment, the gene editing method may include delivering the engineered Cas12f1 guide RNA or a nucleic acid encoding the same in a first delivery mode, and delivering the Cas12f1 protein or a nucleic acid encoding the same in a second delivery mode. In this case, each of the first delivery mode and the second delivery mode may be any one of the aforementioned delivery modes. In one embodiment, the gene editing method may include delivering the engineered Cas12f1 guide RNA or nucleic acid encoding the same by the first delivery method, and delivering the Cas12f1 protein or the nucleic acid encoding the same by the second delivery method. In this case, the first delivery method and the second delivery method may be any one of the aforementioned delivery methods, respectively.

전달 순서delivery order

상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및 Cas12f1 단백질 또는 이를 암호화하는 핵산을 세포 내 전달하는 것을 포함하는데, 이때 상기 구성이 세포 내에 동시에 전달될 수 있지만, 시간차를 두고 순차적으로 전달될 수 있다.The gene editing method includes intracellular delivery of an engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and a Cas12f1 protein or a nucleic acid encoding the same, wherein the construct can be simultaneously delivered into the cell, but sequentially with a time difference can be transmitted.

일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및 Cas12f1 단백질 또는 이를 암호화하는 핵산을 동시에 전달하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산을 세포 내로 전달한 후, 시간 차를 두고 Cas12f1 단백질 또는 이를 암호화하는 핵산을 세포 내로 전달하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 Cas12f1 단백질 또는 이를 암호화하는 핵산을 세포 내로 전달한 후, 시간 차를 두고 엔지니어링 된 Cas12f1 가이드 RNA를 세포 내로 전달하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 Cas12f1 단백질을 암호화하는 핵산을 세포 내로 전달한 후, 시간 차를 두고 엔지니어링 된 Cas12f1 가이드 RNA를 세포 내로 전달하는 것을 포함할 수 있다.In one embodiment, the gene editing method may include simultaneously delivering an engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and a Cas12f1 protein or a nucleic acid encoding the same. In one embodiment, the gene editing method may include delivering the engineered Cas12f1 guide RNA or a nucleic acid encoding the same into a cell, and then delivering the Cas12f1 protein or a nucleic acid encoding the same into the cell with a time difference. In one embodiment, the gene editing method may include delivering a Cas12f1 protein or a nucleic acid encoding the same into a cell, and then delivering an engineered Cas12f1 guide RNA into the cell with a time difference. In one embodiment, the gene editing method may include delivering the Cas12f1 protein-encoding nucleic acid into the cell, and then delivering the engineered Cas12f1 guide RNA into the cell with a time difference.

복수의 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산 전달Delivery of multiple engineered Cas12f1 guide RNAs or nucleic acids encoding them

본 명세서에서 제공하는 유전자 편집 방법은 대상 세포 내에 Cas12f1 단백질 또는 이를 암호화하는 핵산, 및 둘 이상의 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산을 전달하는 것을 포함할 수 있다. 상기 방법을 통해, 서로 다른 서열을 표적하는 둘 이상의 CRISPR/Cas12f1 복합체가 대상 세포 내에 주입되거나, 대상 세포 내에서 형성될 수 있다. 그 결과, 세포 내에 포함된 둘 이상의 표적 유전자, 또는 표적 핵산이 편집될 수 있다. 일 구현예로, 상기 유전자 편집 방법은 Cas12f1 단백질 또는 이를 암호화하는 핵산, 제1 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및 제2 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산을 표적 유전자 또는 표적 핵산을 포함하고 있는 대상 세포 내에 전달하는 것을 포함한다. 이때, 상기 각 구성요소는 전술한 전달 형태 및 전달 방법 중 하나 이상을 사용하여 세포 내로 전달될 수 있다. 이때, 둘 이상의 구성요소가 세포 내에 동시에 전달될 수 있고, 시간차를 두고 순차적으로 전달될 수 있다.The gene editing method provided herein may include delivering a Cas12f1 protein or a nucleic acid encoding the same, and two or more engineered Cas12f1 guide RNAs or a nucleic acid encoding the same into a target cell. Through the above method, two or more CRISPR/Cas12f1 complexes targeting different sequences may be injected into or formed in a target cell. As a result, two or more target genes or target nucleic acids contained in the cell can be edited. In one embodiment, the gene editing method comprises a Cas12f1 protein or a nucleic acid encoding the same, a first engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and a second engineered Cas12f1 guide RNA or a nucleic acid encoding the same into a target gene or a target nucleic acid It includes delivery into the target cell containing the. In this case, each of the components may be delivered into a cell using one or more of the above-described delivery forms and delivery methods. In this case, two or more components may be simultaneously delivered into the cell, and may be sequentially delivered with a time difference.

CRISPR/Cas12f1 복합체가 표적 핵산과 접촉CRISPR/Cas12f1 complex contacts target nucleic acid

본 명세서에서 제공하는 유전자 편집 방법에서, 표적 유전자, 또는 표적 핵산의 편집은 대상 세포 내에서 엔지니어링 된 CRISPR/Cas12f1 복합체가 표적 유전자 또는 표적 핵산과 접촉하면서 이뤄진다. 따라서, 상기 유전자 편집 방법은 엔지니어링 된 CRISPR/Cas12f1 복합체가 대상 세포 내에서 접촉하도록 하거나, 접촉을 유도하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 CRISPR/Cas12f1 복합체가 대상 세포 내에서 표적 핵산과 접촉하는 것을 포함할 수 있다. 일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 CRISPR/Cas12f1 복합체가 대상 세포 내에서 표적 핵산과 접촉하도록 유도하는 것을 포함할 수 있다. 이때, 상기 유도는 상기 엔지니어링 된 CRISPR/Cas12f1 복합체가 세포 내에서 표적 핵산과 접촉하도록 하는 방법이라면 특별히 제한되지 않는다. 일 구현예로, 상기 유도는 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및 Cas12f1 단백질 또는 이를 암호화하는 핵산을 세포 내에 전달하는 것일 수 있다.In the gene editing method provided herein, editing of a target gene or target nucleic acid is performed while an engineered CRISPR/Cas12f1 complex in a target cell contacts the target gene or target nucleic acid. Accordingly, the gene editing method may include allowing the engineered CRISPR/Cas12f1 complex to contact within a target cell or inducing contact. In one embodiment, the gene editing method may include contacting the engineered CRISPR/Cas12f1 complex with a target nucleic acid in a target cell. In one embodiment, the gene editing method may include inducing the engineered CRISPR/Cas12f1 complex to contact a target nucleic acid in a target cell. At this time, the induction is not particularly limited as long as it is a method for allowing the engineered CRISPR/Cas12f1 complex to contact a target nucleic acid in a cell. In one embodiment, the induction may be to deliver an engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and a Cas12f1 protein or a nucleic acid encoding the same into a cell.

유전자 편집 결과 - 인델(indel)Gene Editing Results - Indels

본 명세서에서 제공하는 유전자 편집 방법의 수행 결과로, 표적 유전자 또는 표적 핵산에 인델이 발생할 수 있다. 이때, 상기 인델은 표적 서열 부분 및/또는 프로토스페이서 서열 부분의 내부 및/또는 외부에서 일어날 수 있다. 상기 인델은, 유전자 편집 전 핵산의 뉴클레오타이드 배열에서 일부 뉴클레오타이드가 중간에 결실되거나, 임의의 뉴클레오타이드가 삽입되거나, 및/또는 상기 삽입과 결실이 혼입된 변이를 일컫는다. 일반적으로, 표적 유전자 또는 표적 핵산 서열 내 인델이 일어나면, 해당 유전자 또는 핵산이 불활성화된다. 일 구현예로, 상기 유전자 편집 방법의 수행 결과, 표적 유전자 또는 표적 핵산 내 하나 이상의 뉴클레오타이드가 결실 및/또는 추가될 수 있다.As a result of performing the gene editing method provided herein, an indel may occur in a target gene or target nucleic acid. In this case, the indel may occur inside and/or outside the target sequence portion and/or the protospacer sequence portion. The indel refers to a mutation in which some nucleotides are deleted in the middle, arbitrary nucleotides are inserted, and/or the insertion and deletion are incorporated in the nucleotide sequence of the nucleic acid before gene editing. In general, when an indel occurs within a target gene or target nucleic acid sequence, the gene or nucleic acid is inactivated. In one embodiment, as a result of performing the gene editing method, one or more nucleotides in the target gene or target nucleic acid may be deleted and/or added.

유전자 편집 결과 - 베이스 에디팅(base editing)Gene editing results - base editing

본 명세서에서 제공하는 유전자 편집 방법의 수행 결과로, 표적 유전자 또는 표적 핵산 내 베이스 에디팅이 일어날 수 있다. 이는 표적 유전자 또는 표적 핵산 내 임의의 염기가 결실, 또는 추가되는 인델과는 달리, 핵산 내 하나 이상의 특정 염기를 의도한 대로 변경하는 것을 의미한다. 달리 표현하면, 표적 유전자 또는 표적 핵산 내 특정 위치에서, 미리 의도한 점 돌연변이(point mutation)를 일으키는 것이다. 일 구현예로, 상기 유전자 편집 방법의 수행 결과, 표적 유전자 또는 표적 핵산 내 하나 이상의 염기가 다른 염기로 치환될 수 있다.As a result of performing the gene editing method provided herein, base editing in a target gene or target nucleic acid may occur. This means that one or more specific bases in a nucleic acid are altered as intended, unlike indels in which any bases in the target gene or target nucleic acid are deleted or added. In other words, in a specific position in the target gene or target nucleic acid, it is to cause a point mutation (point mutation) intended in advance. In one embodiment, as a result of performing the gene editing method, one or more bases in the target gene or target nucleic acid may be substituted with another base.

유전자 편집 결과 - 삽입(insertion)Gene Editing Results - Insertion

본 명세서에서 제공하는 유전자 편집 방법의 수행 결과로, 표적 유전자 또는 표적 핵산 내 넉인이 발생할 수 있다. 상기 넉인은 표적 유전자 또는 표적 핵산 서열 내에 추가적인 핵산 서열을 삽입하는 것을 의미한다. 상기 넉인이 일어나려면, CRISPR/Cas12f1 복합체 외에 상기 추가적인 핵산 서열을 포함하는 도너가 더 필요하다. 세포 내에서 CRISPR/Cas12f1 복합체가 표적 유전자 또는 표적 핵산을 절단하는 경우, 상기 절단된 표적 유전자 또는 표적 핵산의 수복이 일어나게 된다. 이때, 상기 도너가 상기 수복 과정에 관여하여 상기 추가적인 핵산 서열이 표적 유전자 또는 표적 핵산 내에 삽입될 수 있도록 한다. 일 구현예로, 상기 유전자 편집 방법은 대상 세포 내로 도너를 전달하는 것을 추가적으로 포함할 수 있다. 이때, 상기 도너는 추가 대상 핵산을 포함하며, 상기 도너에 의해 상기 표적 유전자 또는 상기 표적 핵산 내 상기 추가 대상 핵산의 삽입이 유도된다. 이때, 상기 도너를 대상 세포 내로 전달할 때, 전술한 전달 형태 및/또는 전달 방법이 사용될 수 있다.As a result of performing the gene editing method provided herein, a knock-in in the target gene or target nucleic acid may occur. The knock-in refers to inserting an additional nucleic acid sequence into a target gene or target nucleic acid sequence. In order for the knock-in to occur, a donor comprising the additional nucleic acid sequence in addition to the CRISPR/Cas12f1 complex is further required. When the CRISPR/Cas12f1 complex cleaves a target gene or target nucleic acid in a cell, repair of the cleaved target gene or target nucleic acid occurs. In this case, the donor participates in the repair process so that the additional nucleic acid sequence can be inserted into the target gene or target nucleic acid. In one embodiment, the gene editing method may further comprise delivering a donor into a target cell. In this case, the donor includes the additional target nucleic acid, and insertion of the additional target nucleic acid into the target gene or the target nucleic acid is induced by the donor. In this case, when delivering the donor into a target cell, the above-described delivery form and/or delivery method may be used.

유전자 편집 결과 - 제거(deletion)Gene Editing Results - Deletion

본 명세서에서 제공하는 유전자 편집 방법의 수행 결과로, 표적 유전자 또는 표적 핵산 서열의 전부 또는 일부를 제거할 수 있다. 상기 제거는 상기 표적 유전자 또는 상기 표적 핵산 내 일부 염기 서열을 제거하는 것을 의미한다. 일 구현예로, 상기 유전자 편집 방법은 Cas12f1 단백질 또는 이를 암호화하는 핵산, 제1 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산, 및 제2 엔지니어링 된 Cas12f1 가이드 RNA 또는 이를 암호화하는 핵산을 표적 유전자 또는 표적 핵산을 포함하는 세포내에 도입하는 것을 포함한다. 이로 인해, 상기 유전자 편집 결과 표적 유전자 또는 표적 핵산 내에 특정 서열 부분의 제거가 일어난다.As a result of performing the gene editing method provided herein, all or part of the target gene or target nucleic acid sequence may be removed. The removal means removing a partial nucleotide sequence in the target gene or the target nucleic acid. In one embodiment, the gene editing method comprises a Cas12f1 protein or a nucleic acid encoding the same, a first engineered Cas12f1 guide RNA or a nucleic acid encoding the same, and a second engineered Cas12f1 guide RNA or a nucleic acid encoding the same into a target gene or a target nucleic acid It includes introducing into a cell comprising a. Due to this, as a result of the gene editing, removal of a specific sequence portion in the target gene or target nucleic acid occurs.

유전자 편집 방법 예시Examples of gene editing methods

일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA 및 Cas12f1 단백질이 결합한 리보뉴클레오프로틴 입자 형태의 CRISPR/Cas12f1 복합체를 진핵 세포 내에 전달하는 것을 포함할 수 있다. 이때, 상기 전달은 전기천공법, 또는 lipofection을 이용한 것일 수 있다.In one embodiment, the gene editing method may include delivering the CRISPR/Cas12f1 complex in the form of ribonucleoprotin particles to which the engineered Cas12f1 guide RNA and Cas12f1 protein are bound into a eukaryotic cell. In this case, the delivery may be using electroporation or lipofection.

일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 및 Cas12f1 단백질을 암호화하는 핵산을 진핵 세포 내에 전달하는 것을 포함할 수 있다. 이때, 상기 전달은 전기천공법, 또는 lipofection을 이용한 것일 수 있다.In one embodiment, the gene editing method may include delivering a nucleic acid encoding an engineered Cas12f1 guide RNA and a nucleic acid encoding a Cas12f1 protein into a eukaryotic cell. In this case, the delivery may be using electroporation or lipofection.

일 구현예로, 상기 유전자 편집 방법은 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열 및 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 아데노-연관 바이러스(AAV) 벡터를 진핵세포 내에 전달하는 것을 포함할 수 있다.In one embodiment, the gene editing method may comprise delivering an adeno-associated virus (AAV) vector comprising a nucleic acid sequence encoding an engineered Cas12f1 guide RNA and a nucleic acid sequence encoding a Cas12f1 protein into a eukaryotic cell. .

일 구현예로, 상기 유전자 편집 방법은 제1 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열, 제2 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 핵산 서열, 및 Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 아데노-연관 바이러스(AAV) 벡터를 진핵세포 내에 전달하는 것을 포함할 수 있다.In one embodiment, the gene editing method comprises adeno-association comprising a nucleic acid sequence encoding a first engineered Cas12f1 guide RNA, a nucleic acid sequence encoding a second engineered Cas12f1 guide RNA, and a nucleic acid sequence encoding a Cas12f1 protein and delivering a viral (AAV) vector into a eukaryotic cell.

실험예Experimental example

이하, 실험예 및 실시예를 통해 본 명세서가 제공하는 발명에 대해 더욱 상세히 설명한다. 이들 실시예는 오로지 본 명세서에 의해 개시되는 내용을 예시하기 위한 것으로, 본 명세서에 의해 개시되는 내용의 범위가 이들 실시예에 의해 제한되는 것으로 해석되지 않는 것은 당업계에서 통상의 지식을 가진 자에게 있어서 자명할 것이다.Hereinafter, the invention provided by the present specification will be described in more detail through experimental examples and examples. These examples are only for illustrating the content disclosed by the present specification, and it is to those skilled in the art that the scope of the content disclosed by the present specification is not to be construed as being limited by these examples. it will be self-evident

실험예 1 실험 재료 준비Experimental Example 1 Preparation of test materials

실험예 1.1 플라스미드 벡터 설계 및 제조Experimental Example 1.1 Plasmid Vector Design and Preparation

인간 세포에서 발현하기 위해, Cas12f1 유전자를 코돈-최적화 시켰으며, 상기 최적화된 서열이 벡터 제조를 위해 합성되었다. 최종적으로 Cas12f1 단백질을 암호화하는 서열에는 chicken β-actin 프로모터, 5'- 및 3'-말단의 핵 위치 신호 서열(nuclear localization signal sequence), 자가 절단 T2A 펩타이드로 연결된 eGFP를 인코딩하는 서열이 포함되었다. 상기 Cas12f1 단백질의 아미노산 서열 및 이를 암호화하는 DNA 서열을 [표01]에 나타냈다.For expression in human cells, the Cas12f1 gene was codon-optimized, and the optimized sequence was synthesized for vector preparation. Finally, the sequence encoding the Cas12f1 protein included a chicken β-actin promoter, a nuclear localization signal sequence at the 5'- and 3'-ends, and a sequence encoding eGFP linked by a self-cleaving T2A peptide. The amino acid sequence of the Cas12f1 protein and the DNA sequence encoding it are shown in [Table 01].

[표01][Table 01]

Figure pat00001
Figure pat00001

(엔지니어링 된) Cas12f1 가이드 RNA를 암호화하는 주형 DNA가 합성되었고, pTwist Amp plasmid vector (Twist Bioscience)에 클로닝되었다. 필요한 경우, 상기 벡터는 U6-상보적 forward primer 및 protospacer-상보적 reverse primer를 사용하여, 상기 가이드 RNA 암호화 서열의 증폭을 위한 주형으로 사용되었다. Gibson assembly를 사용하여, 상기 코돈-최적화 된 Cas12f1 유전자를 포함하는 벡터에 엔지니어링 된 Cas12f1 가이드 RNA를 암호화하는 올리고뉴클레오타이드를 클로닝함으로써, 엔지니어링 된 CRISPR/Cas12f1 시스템에 대한 벡터를 제조하였다.Template DNA encoding the (engineered) Cas12f1 guide RNA was synthesized and cloned into the pTwist Amp plasmid vector (Twist Bioscience). If necessary, the vector was used as a template for amplification of the guide RNA coding sequence using a U6-complementary forward primer and a protospacer-complementary reverse primer. Using Gibson assembly, the vector for the engineered CRISPR/Cas12f1 system was prepared by cloning the oligonucleotide encoding the engineered Cas12f1 guide RNA into a vector containing the codon-optimized Cas12f1 gene.

실험예 1.2 Cas12f1 가이드 RNA 엔지니어링Experimental Example 1.2 Cas12f1 guide RNA engineering

상기 엔지니어링 된 Cas12f1 가이드 RNA의 엔지니어링 된 스캐폴드 영역 중 제2 영역, 제4 영역 및 제5 영역의 변형은 ApoI 및 BamHI 제한 효소를 사용하여 선형화된 가이드 RNA-암호화 벡터에 변형된 서열 (Macrogen)을 전달하는 합성 올리고뉴클레오타이드를 클로닝하여 수행되었다. 상기 엔지니어링 된 Cas12f1 가이드 RNA의 엔지니어링 된 스캐폴드 영역 중 제1 영역의 변형은, tracrRNA의 5'말단 부분을 표적으로 하는 forward primer 및 U6 프로모터 영역을 표적으로 하는 reverse primer를 사용하여 캐노니컬(canonical), 또는 엔지니어링 된 주형 플라스미드 벡터의 PCR 증폭에 의해 수행되었다. 상기 PCR 증폭은 Q5 Hot Start high-fidelity DNA polymerase (NEB)에 의해 수행되었으며, PCR 산물은 KLD Enzyme Mix (NEB)를 사용하여 결찰되었다(ligated). 상기 결찰된(ligated) PCR 산물은 DH5α E.coli 세포로 형질도입(transformed)되었다. Sanger 시퀀싱 분석에 의해 Mutagenesis가 확인되었다. 변형된 플라스미드 벡터는 NucleoBond ® Xtra Midi EF kit (MN)를 사용하여 정제되었다. 정제된 플라스미드 1 마이크로그램이 T7 RNA polymerase (NEB) 및 NTPs (Jena Bioscience)를 사용한 mRNA 합성의 주형으로 사용되었다. 상기 제조된 엔지니어링 된 Cas12f1 가이드 RNA를 Monarch® RNA cleanup kit (NEB)를 사용하여 정제하고, 극저온 바이알(cryogenic vials)에 분취하여 액체 질소에 보관하였다.Modifications of the second, fourth and fifth regions of the engineered scaffold region of the engineered Cas12f1 guide RNA were performed using ApoI and BamHI restriction enzymes to linearize the guide RNA-encoding vector with the modified sequence (Macrogen). This was done by cloning the synthetic oligonucleotides that delivered. Modification of the first region of the engineered scaffold region of the engineered Cas12f1 guide RNA canonical (canonical) using a forward primer targeting the 5' end of the tracrRNA and a reverse primer targeting the U6 promoter region ), or by PCR amplification of the engineered template plasmid vector. The PCR amplification was performed by Q5 Hot Start high-fidelity DNA polymerase (NEB), and the PCR product was ligated using KLD Enzyme Mix (NEB). The ligated PCR product was transformed into DH5α E. coli cells. Mutagenesis was confirmed by Sanger sequencing analysis. The modified plasmid vector was purified using NucleoBond ® Xtra Midi EF kit (MN). 1 microgram of purified plasmid was used as a template for mRNA synthesis using T7 RNA polymerase (NEB) and NTPs (Jena Bioscience). The engineered Cas12f1 guide RNA prepared above was purified using Monarch® RNA cleanup kit (NEB), aliquoted into cryogenic vials and stored in liquid nitrogen.

실험예 1.3 세포 배양 및 형질도입(transfection)Experimental Example 1.3 Cell culture and transfection

HEK293 T 세포(LentX-293T, Takara)가 10%의 열-불활성화 된 fetal bovine serum (FBS) 혈청(Corning) 및 페니실린/스트렙토마이신이 보충된 Dulbecco's Modified Eagle Medium (DMEM) 배지에서, 5% CO2 조건 하 배양되었다. 세포 형질주입은 전기천공법(electroporation) 또는 lipofection으로 수행되었다. 전기천공법의 경우, 실험예1.2 에서 제조된 Cas12f1 단백질을 암호화하는 플라스미드 벡터 및 가이드 RNA(및 엔지니어링 된 가이드 RNA)를 암호화하는 DNA 각 2-5 μg을 Neon transfection system (Invitrogen)을 사용해 4X105 HEK-293 T세포에 형질주입(transfection) 하였다. 상기 전기천공법은 1300V, 10 mA, 3 pulse 조건 하 수행하였다. lipofection을 위해, 6-15 μL FuGene 시약을 (Promega) 2-5 μg의 Cas12f1 단백질을 암호화하는 플라스미드 벡터 및 1.5-5 μg의 PCR 앰플리콘과 15분동안 혼합하였다. 상기 혼합물(300 μL)이 형질주입 1일 전에 1X106개의 세포가 플레이팅 된 1.5 ml DMEM 배지에 첨가되었다. 상기 세포들을 상기 혼합물의 존재 하 1 내지 10일 간 배양시켰다. 배양 후, 상기 세포들이 수집되었고, 상기 세포의 게놈 DNA가 PureHelixTM genomic DNA preparation kit (NanoHelix)를 사용하거나, Maxwell RSC Cultured cells DNA Kit (Promega)를 사용하여 수작업으로 분리되었다.HEK293 T cells (LentX-293T, Takara) were cultured in Dulbecco's Modified Eagle Medium (DMEM) medium supplemented with 10% heat-inactivated fetal bovine serum (FBS) serum (Corning) and penicillin/streptomycin, 5% CO Incubated under 2 conditions. Cell transfection was performed by electroporation or lipofection. In the case of electroporation, 2-5 μg of each DNA encoding the plasmid vector and guide RNA (and engineered guide RNA) encoding the Cas12f1 protein prepared in Experimental Example 1.2 were used with the Neon transfection system (Invitrogen) to 4X10 5 HEK -293 T cells were transfected. The electroporation method was performed under 1300V, 10 mA, 3 pulse conditions. For lipofection, 6-15 μL FuGene reagent (Promega) was mixed with 2-5 μg of the plasmid vector encoding the Cas12f1 protein and 1.5-5 μg of the PCR amplicon for 15 min. The mixture (300 μL) was added to 1.5 ml DMEM medium plated with 1× 10 6 cells 1 day before transfection. The cells were cultured for 1 to 10 days in the presence of the mixture. After incubation, the cells were collected, and the genomic DNA of the cells was manually isolated using the PureHelix™ genomic DNA preparation kit (NanoHelix) or the Maxwell RSC Cultured cells DNA Kit (Promega).

실험예 1.4 세포 내 인델 효율 측정Experimental Example 1.4 Measurement of Indel Efficiency in Cells

HEK-293 T세포로부터 분리된 게놈 DNA 중, 프로토스페이서를 포함하는 영역을 표적-특이적 프라이머를 사용하여 KAPA HiFi HotStart DNA polymerase (Roche)의 존재 하 PCR을 수행하였다. 상기 증폭 방법은 제조사의 지시를 따랐다. Illumina TruSeq HT dual indexes를 포함하는 상기 증폭의 결과물인 PCR 앰플리콘을 Illumina iSeq 100를 사용하여 150-bp 페어 엔드 시퀀싱을 수행하였다. 인델 빈도는 MAUND를 사용하여 계산되었다. 상기 MAUND는 https://github.com/ibscge/maund 에서 제공된다.Among the genomic DNA isolated from HEK-293 T cells, the region including the protospacer was subjected to PCR in the presence of KAPA HiFi HotStart DNA polymerase (Roche) using target-specific primers. The amplification method followed the manufacturer's instructions. The PCR amplicon resulting from the amplification containing Illumina TruSeq HT dual indexes was subjected to 150-bp pair-end sequencing using Illumina iSeq 100. Indel frequencies were calculated using MAUND. The MAUND is provided at https://github.com/ibscge/maund.

실험예 1.5 Quantitave real-time PCRExperimental Example 1.5 Quantitave real-time PCR

HEK293 T세포에 RNeasy Miniprep kit (Qiagen) 또는 Maxwell RSC miRNA Tissue Kit (Promega)또는 DNeasy Blood & Tissue Kit(Qiagen)를 사용하여 가이드 RNA(또는 엔지니어링 된 가이드 RNA) 또는 게놈 DNA를 각각 추출하였다. 가이드 RNA를 정량화하기 위해서, RNA특이적 프라이머를 ligation 하고, crRNA-특이적 프라이머를 사용해 cDNA를 합성하였다. 상기 cDNA는 주형으로써 정량적 real-time PCR에 사용되었다. real-time PCR은 KAFA SYBR FAST qPCR Master Mix(2X) Kit (KAPAbiosystems)를 사용하여 분석되었다.Guide RNA (or engineered guide RNA) or genomic DNA was extracted from HEK293 T cells using RNeasy Miniprep kit (Qiagen), Maxwell RSC miRNA Tissue Kit (Promega), or DNeasy Blood & Tissue Kit (Qiagen), respectively. To quantify guide RNA, RNA-specific primers were ligated, and cDNA was synthesized using crRNA-specific primers. The cDNA was used as a template for quantitative real-time PCR. Real-time PCR was analyzed using the KAFA SYBR FAST qPCR Master Mix (2X) Kit (KAPAbiosystems).

실험예 1.6 통계 분석Experimental Example 1.6 Statistical Analysis

각 실험예 별 실험은 3번씩 수행하였으며, 각 값의 평균값을 분석에 사용하였다.The experiment for each experimental example was performed three times, and the average value of each value was used for analysis.

실험예 2 엔지니어링 된 CRISPR/Cas12f1 시스템의 인델 효율 비교Experimental Example 2 Comparison of indel efficiency of the engineered CRISPR/Cas12f1 system

엔지니어링 된 Cas12f1 가이드 RNA를 사용한 엔지니어링 된 CRISPR/Cas12f1 시스템의 인델 효율을 측정하기 위해, 실험예 1.1 내지 1.2에 의해 각 실시예를 제조하였다. 실험에 사용한 표적 서열은 다음 [표02]에 나타냈다.In order to measure the indel efficiency of the engineered CRISPR/Cas12f1 system using the engineered Cas12f1 guide RNA, each Example was prepared by Experimental Examples 1.1 to 1.2. The target sequences used in the experiment are shown in Table 02 below.

[표02][Table 02]

Figure pat00002
Figure pat00002

각 실시예 별 엔지니어링 된 Cas12f1 가이드 RNA의 서열은 다음 [표03] 내지 [표08]에 나타냈다.The sequences of the engineered Cas12f1 guide RNAs for each Example are shown in the following [Table 03] to [Table 08].

[표03][Table 03]

Figure pat00003
Figure pat00003

[표04][Table 04]

Figure pat00004
Figure pat00004

[표05][Table 05]

Figure pat00005
Figure pat00005

[표06][Table 06]

Figure pat00006
Figure pat00006

[표07][Table 07]

Figure pat00007
Figure pat00007

[표08][Table 08]

Figure pat00008
Figure pat00008

이때, 각 표적 서열 별로,At this time, for each target sequence,

1) Comparative Example n.1은 자연계에서 발견되는 Cas12f1 tracrRNA 및 자연계에서 발견되는 Cas12f1 crRNA를 5'-GAAA-3'로 연결시킨 싱글 가이드 RNA,One) Comparative Example n.1 is a single guide RNA in which Cas12f1 tracrRNA found in nature and Cas12f1 crRNA found in nature are linked with 5'-GAAA-3';

2) Example n.1 내지 n.3은 변형된 제1 영역을 가지는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 가지는 엔지니어링 된 Cas12f1 가이드 RNA,2) Examples n.1 to n.3 are an engineered scaffold region having a modified first region, and an engineered Cas12f1 guide RNA having a spacer,

3) Example n.4 내지 n.6는 변형된 제2 영역을 가지는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 가지는 엔지니어링 된 Cas12f1 가이드 RNA,3) Examples n.4 to n.6 are an engineered scaffold region having a modified second region, and an engineered Cas12f1 guide RNA having a spacer,

4) Example n.7 내지 n.9는 변형된 제4 영역 및 제5 영역을 가지는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 가지는 엔지니어링 된 Cas12f1 가이드 RNA,4) Examples n.7 to n.9 show an engineered scaffold region having a modified fourth region and a fifth region, and an engineered Cas12f1 guide RNA having a spacer,

5) Example n.10은 변형된 제1 영역 및 제2 영역을 가지는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 가지는 엔지니어링 된 Cas12f1 가이드 RNA,5) Example n.10 is an engineered scaffold region having a modified first region and a second region, and an engineered Cas12f1 guide RNA with spacers,

6) Example n.11은 변형된 제1 영역 및 제4 영역 및 제5 영역을 가지는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 가지는 엔지니어링 된 Cas12f1 가이드 RNA,6) Example n.11 is an engineered scaffold region having a modified first region and a fourth region and a fifth region, and an engineered Cas12f1 guide RNA having a spacer,

7) Example n.12는 변형된 제2 영역 및 제4 영역 및 제5 영역을 가지는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 가지는 엔지니어링 된 Cas12f1 가이드 RNA,7) Example n.12 is an engineered scaffold region having a modified second region and a fourth region and a fifth region, and an engineered Cas12f1 guide RNA having a spacer,

8) Example n.13 및 Example n.14는 변형된 제1 영역, 제2 영역, 및 제4 영역 및 제5 영역을 가지는 엔지니어링 된 스캐폴드 영역, 및 스페이서를 가지는 엔지니어링 된 Cas12f1 가이드 RNA를 나타낸다.8) Example n.13 and Example n.14 show an engineered scaffold region having a modified first region, a second region, and a fourth region and a fifth region, and an engineered Cas12f1 guide RNA having a spacer.

이때, n은 표적 서열 별로, 1, 2, 또는 3이며, n이 1인 경우 Target 1(DY2), n이 2인 경우, Target 2(DY10), n이 3인 경우, Target3(Intergenic-22)를 나타낸다.In this case, n is 1, 2, or 3 for each target sequence, when n is 1, Target 1 (DY2), when n is 2, Target 2 (DY10), when n is 3, Target3 (Intergenic-22) ) is indicated.

상기 실시예 별 제조된 벡터를 실험예 1.3에 의해 HEK293 T세포에 형질도입(transfection)하고, 실험예 1.4 내지 1.5에 의해 인델 발생 효율을 측정하였다. 이를 실험예 1.6에 의해 분석한 결과가 도 2 내지 도 13에 나타나있다.The vectors prepared in each Example were transfected into HEK293 T cells according to Experimental Example 1.3, and indel generation efficiency was measured by Experimental Examples 1.4 to 1.5. The results of this analysis according to Experimental Example 1.6 are shown in FIGS. 2 to 13 .

도 2 내지 도 4는 DY2를 표적으로 하는 Example 1.1 내지 Example 1.13에 대한 평균 indel 효율, 도 5 내지 도 8은 DY10을 표적으로 하는 Example 2.1 내지 Example 2.13에 대한 평균 indel 효율, 도 9 내지 도 12는 Intergenic-22를 표적으로 하는 Example 3.1 내지 Example 3.13에 대한 평균 indel 효율, 도 13은, DY2를 표적으로 하는 Example 1.13 내지 Example 1.14, DY10을 표적으로 하는 Example 2.13 내지 Example 2.14, Intergenic-22를 표적으로 하는 Example 3.13 내지 Example 3.14에 대한 평균 indel 효율을 각각 나타낸다.2 to 4 are average indel efficiencies for Examples 1.1 to 1.13 targeting DY2, FIGS. 5 to 8 are average indel efficiencies for Examples 2.1 to 2.13 targeting DY10, FIGS. 9 to 12 are Average indel efficiency for Examples 3.1 to 3.13 targeting Intergenic-22, FIG. 13 is, Examples 1.13 to Example 1.14 targeting DY2, Examples 2.13 to Example 2.14 targeting DY10, Intergenic-22 targeting The average indel efficiencies for Examples 3.13 to 3.14 are respectively shown.

실험 결과, 본 명세서에서 개시하는 엔지니어링 된 스캐폴드 영역을 가지는 엔지니어링 된 Cas12f1 가이드 RNA를 유전자 편집에 사용하는 경우, 자연계에서 발견되는 스캐폴드 영역을 가지는 Cas12f1 가이드 RNA, 및 상기 자연계에서 발견되는 스캐폴드 영역의 tracrRNA, 및 crRNA 반복 서열 부분을 5'-GAAA-3' 링커로 연결한 가이드 RNA를 유전자 편집에 사용할 때보다 전반적으로 유전자 편집 효율이 향상됨을 알 수 있다. 또한, 각 영역 별 변형을 조합한 경우(e.g. Example n.10 내지 Example n.14), 각 영역 별 조합이 시너지를 일으켜 유전자 편집 효율이 더욱 더 향상됨을 알 수 있다.As a result of the experiment, when the engineered Cas12f1 guide RNA having an engineered scaffold region disclosed herein is used for gene editing, the Cas12f1 guide RNA having a scaffold region found in nature, and the scaffold region found in nature It can be seen that the overall gene editing efficiency is improved compared to when the tracrRNA of tracrRNA and the guide RNA in which the crRNA repeat sequence is linked with a 5'-GAAA-3' linker is used for gene editing. In addition, it can be seen that when the modifications for each region are combined (e.g.

<110> Genkore <120> An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof <130> CP21-068 <160> 441 <170> KoPatentIn 3.0 <210> 1 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 tracrRNA <400> 1 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu 140 <210> 2 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> wild-type of Cas12f1 tracrRNA <400> 2 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu uuccucucca auucugcaca a 161 <210> 3 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 crRNA repeat <400> 3 gaaugaagga augcaac 17 <210> 4 <211> 37 <212> RNA <213> Artificial Sequence <220> <223> wild-type of Cas12f1 crRNA repeat <400> 4 guugcagaac ccgaauagac gaaugaagga augcaac 37 <210> 5 <211> 37 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 crRNA <400> 5 gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 37 <210> 6 <211> 57 <212> RNA <213> Artificial Sequence <220> <223> wild-type of Cas12f1 crRNA <400> 6 guugcagaac ccgaauagac gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 57 <210> 7 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 tracrRNA + GAAA + mature form of Cas12f1 crRNA repeat <400> 7 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa c 161 <210> 8 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 tracrRNA + GAAA + mature form of Cas12f1 crRNA <400> 8 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa cnnnnnnnnn nnnnnnnnnn 180 n 181 <210> 9 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> 1st region of Cas12f1 guide RNA <400> 9 cuucacugau aaaguggaga a 21 <210> 10 <211> 50 <212> RNA <213> Artificial Sequence <220> <223> 2nd region of Cas12f1 guide RNA <400> 10 ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug agugaaggug 50 <210> 11 <211> 56 <212> RNA <213> Artificial Sequence <220> <223> 3rd region of Cas12f1 guide RNA <400> 11 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucga 56 <210> 12 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> 4th region of Cas12f1 guide RNA (Mature form) <400> 12 aacaaauuca uuu 13 <210> 13 <211> 34 <212> RNA <213> Artificial Sequence <220> <223> 4th region of Cas12f1 guide RNA (Wild-type) <400> 13 aacaaauuca uuuuuccucu ccaauucugc acaa 34 <210> 14 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> 5th region of Cas12f1 guide RNA (Mature form) <400> 14 gaaugaagga 10 <210> 15 <211> 30 <212> RNA <213> Artificial Sequence <220> <223> 5th region of Cas12f1 guide RNA (Wild-type) <400> 15 guugcagaac ccgaauagac gaaugaagga 30 <210> 16 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> first region, 11nt deletion <400> 16 aaguggagaa 10 <210> 17 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> first region, 10nt deletion <400> 17 aaaguggaga a 11 <210> 18 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> first region, 9nt deletion <400> 18 uaaaguggag aa 12 <210> 19 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> first region, 8nt deletion <400> 19 auaaagugga gaa 13 <210> 20 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> first region, 7nt deletion <400> 20 gauaaagugg agaa 14 <210> 21 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> first region, 6nt deletion <400> 21 ugauaaagug gagaa 15 <210> 22 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> first region, 5nt deletion <400> 22 cugauaaagu ggagaa 16 <210> 23 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> first region, 4nt deletion <400> 23 acugauaaag uggagaa 17 <210> 24 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> first region, 3nt deletion <400> 24 cacugauaaa guggagaa 18 <210> 25 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> first region, 2nt deletion <400> 25 ucacugauaa aguggagaa 19 <210> 26 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> first region, 1nt deletion <400> 26 uucacugaua aaguggagaa 20 <210> 27 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 10nt <400> 27 aaaguggaga 10 <210> 28 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 11nt <400> 28 uaaaguggag a 11 <210> 29 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 12nt <400> 29 auaaagugga ga 12 <210> 30 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 13nt <400> 30 gauaaagugg aga 13 <210> 31 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 14nt <400> 31 ugauaaagug gaga 14 <210> 32 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 15nt <400> 32 cugauaaagu ggaga 15 <210> 33 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 16nt <400> 33 acugauaaag uggaga 16 <210> 34 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 17nt <400> 34 cacugauaaa guggaga 17 <210> 35 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 18nt <400> 35 ucacugauaa aguggaga 18 <210> 36 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 19nt <400> 36 uucacugaua aaguggaga 19 <210> 37 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 20nt <400> 37 cuucacugau aaaguggaga 20 <210> 38 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> second region, 11bp deletion <400> 38 ccgcuucacc auuagugagu gaaggugg 28 <210> 39 <211> 30 <212> RNA <213> Artificial Sequence <220> <223> second region, 10bp deletion <400> 39 ccgcuucacc aauuaguuga gugaaggugg 30 <210> 40 <211> 32 <212> RNA <213> Artificial Sequence <220> <223> second region, 9bp deletion <400> 40 ccgcuucacc aaauuagcuu gagugaaggu gg 32 <210> 41 <211> 34 <212> RNA <213> Artificial Sequence <220> <223> second region, 8bp deletion <400> 41 ccgcuucacc aaaauuagac uugagugaag gugg 34 <210> 42 <211> 36 <212> RNA <213> Artificial Sequence <220> <223> second region, 7bp deletion <400> 42 ccgcuucacc aaaaguuaga acuugaguga aggugg 36 <210> 43 <211> 38 <212> RNA <213> Artificial Sequence <220> <223> second region, 6bp deletion <400> 43 ccgcuucacc aaaagcuuag gaacuugagu gaaggugg 38 <210> 44 <211> 40 <212> RNA <213> Artificial Sequence <220> <223> second region, 5bp deletion <400> 44 ccgcuucacc aaaagcuuua gagaacuuga gugaaggugg 40 <210> 45 <211> 43 <212> RNA <213> Artificial Sequence <220> <223> second region, 4bp deletion <400> 45 ccgcuucacc aaaagcuguu aguuagaacu ugagugaagg ugg 43 <210> 46 <211> 42 <212> RNA <213> Artificial Sequence <220> <223> second region, 4bp+1nt deletion <400> 46 ccgcuucacc aaaagcuguu aguagaacuu gagugaaggu gg 42 <210> 47 <211> 45 <212> RNA <213> Artificial Sequence <220> <223> second region, 3bp deletion <400> 47 ccgcuucacc aaaagcuguu uagauuagaa cuugagugaa ggugg 45 <210> 48 <211> 47 <212> RNA <213> Artificial Sequence <220> <223> second region, 2bp deletion <400> 48 ccgcuucacc aaaagcuguc uuaggauuag aacuugagug aaggugg 47 <210> 49 <211> 49 <212> RNA <213> Artificial Sequence <220> <223> second region, 1bp deletion <400> 49 ccgcuucacc aaaagcuguc cuuagggauu agaacuugag ugaaggugg 49 <210> 50 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> second region, 5' remains <400> 50 ccgcuucacc a 11 <210> 51 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> second region, 3' remains <400> 51 ugagugaagg ugg 13 <210> 52 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> upper-deleted part of the second region, 10nt <400> 52 aaagcugucc 10 <210> 53 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> upper-deleted part of the second region, 11nt <400> 53 aaagcugucc c 11 <210> 54 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> lower-deleted part of the second region, 10nt <400> 54 gauuagaacu 10 <210> 55 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> lower-deleted part of the second region, 11nt <400> 55 ggauuagaac u 11 <210> 56 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> lower-deleted part of the second region, 12nt <400> 56 gggauuagaa cu 12 <210> 57 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 10nt <400> 57 aaauuagacu 10 <210> 58 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 11nt <400> 58 aaaguuagaa cu 12 <210> 59 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 12nt <400> 59 aaagcuuagg aacu 14 <210> 60 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 13nt <400> 60 aaagcuuuag agaacu 16 <210> 61 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 14nt <400> 61 aaagcuguua guagaacu 18 <210> 62 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 15nt <400> 62 aaagcuguua guuagaacu 19 <210> 63 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 16nt <400> 63 aaagcuguuu agauuagaac u 21 <210> 64 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 17nt <400> 64 aaagcugucu uaggauuaga acu 23 <210> 65 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 18nt <400> 65 aaagcugucc uuagggauua gaacu 25 <210> 66 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(mature form), delete 3nt <400> 66 aacaaauuca 10 <210> 67 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(mature form), delete 4nt <400> 67 aacaaauuca u 11 <210> 68 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(mature form), delete 5nt <400> 68 aacaaauuca uu 12 <210> 69 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 21nt deletion <400> 69 aacaaauuca uuu 13 <210> 70 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 20nt deletion <400> 70 aacaaauuca uuuu 14 <210> 71 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 19nt deletion <400> 71 aacaaauuca uuuuu 15 <210> 72 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 18nt deletion <400> 72 aacaaauuca uuuuuc 16 <210> 73 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 17nt deletion <400> 73 aacaaauuca uuuuucc 17 <210> 74 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 16nt deletion <400> 74 aacaaauuca uuuuuccu 18 <210> 75 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 15nt deletion <400> 75 aacaaauuca uuuuuccuc 19 <210> 76 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 14nt deletion <400> 76 aacaaauuca uuuuuccucu 20 <210> 77 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 13nt deletion <400> 77 aacaaauuca uuuuuccucu c 21 <210> 78 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 12nt deletion <400> 78 aacaaauuca uuuuuccucu cc 22 <210> 79 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 11nt deletion <400> 79 aacaaauuca uuuuuccucu cca 23 <210> 80 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 10nt deletion <400> 80 aacaaauuca uuuuuccucu ccaa 24 <210> 81 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 9nt deletion <400> 81 aacaaauuca uuuuuccucu ccaau 25 <210> 82 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 8nt deletion <400> 82 aacaaauuca uuuuuccucu ccaauu 26 <210> 83 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 7nt deletion <400> 83 aacaaauuca uuuuuccucu ccaauuc 27 <210> 84 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 6nt deletion <400> 84 aacaaauuca uuuuuccucu ccaauucu 28 <210> 85 <211> 29 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 5nt deletion <400> 85 aacaaauuca uuuuuccucu ccaauucug 29 <210> 86 <211> 30 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 4nt deletion <400> 86 aacaaauuca uuuuuccucu ccaauucugc 30 <210> 87 <211> 31 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 3nt deletion <400> 87 aacaaauuca uuuuuccucu ccaauucugc a 31 <210> 88 <211> 32 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 2nt deletion <400> 88 aacaaauuca uuuuuccucu ccaauucugc ac 32 <210> 89 <211> 33 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA(wild-type), 1nt deletion <400> 89 aacaaauuca uuuuuccucu ccaauucugc aca 33 <210> 90 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 20nt deletion <400> 90 gaaugaagga 10 <210> 91 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 19nt deletion <400> 91 cgaaugaagg a 11 <210> 92 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 18nt deletion <400> 92 acgaaugaag ga 12 <210> 93 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 17nt deletion <400> 93 gacgaaugaa gga 13 <210> 94 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 16nt deletion <400> 94 agacgaauga agga 14 <210> 95 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 15nt deletion <400> 95 uagacgaaug aagga 15 <210> 96 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 14nt deletion <400> 96 auagacgaau gaagga 16 <210> 97 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 13nt deletion <400> 97 aauagacgaa ugaagga 17 <210> 98 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 12nt deletion <400> 98 gaauagacga augaagga 18 <210> 99 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 11nt deletion <400> 99 cgaauagacg aaugaagga 19 <210> 100 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 10nt deletion <400> 100 ccgaauagac gaaugaagga 20 <210> 101 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 9nt deletion <400> 101 cccgaauaga cgaaugaagg a 21 <210> 102 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 8nt deletion <400> 102 acccgaauag acgaaugaag ga 22 <210> 103 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 7nt deletion <400> 103 aacccgaaua gacgaaugaa gga 23 <210> 104 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 6nt deletion <400> 104 gaacccgaau agacgaauga agga 24 <210> 105 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 5nt deletion <400> 105 agaacccgaa uagacgaaug aagga 25 <210> 106 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 4nt deletion <400> 106 cagaacccga auagacgaau gaagga 26 <210> 107 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 3nt deletion <400> 107 gcagaacccg aauagacgaa ugaagga 27 <210> 108 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 2nt deletion <400> 108 ugcagaaccc gaauagacga augaagga 28 <210> 109 <211> 29 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA(wild-type), 1nt deletion <400> 109 uugcagaacc cgaauagacg aaugaagga 29 <210> 110 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 7bp deletion (mature form) <400> 110 aacaaagaaa gga 13 <210> 111 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 6bp deletion (mature form) <400> 111 aacaaaugaa aagga 15 <210> 112 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 5bp deletion (mature form) <400> 112 aacaaauuga aaaagga 17 <210> 113 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 4bp deletion (mature form) <400> 113 aacaaauucg aaagaagga 19 <210> 114 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 3bp deletion (mature form) <400> 114 aacaaauuca gaaaugaagg a 21 <210> 115 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 2bp deletion (mature form) <400> 115 aacaaauuca ugaaaaugaa gga 23 <210> 116 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 1bp deletion (mature form) <400> 116 aacaaauuca uugaaaaaug aagga 25 <210> 117 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region (mature form) <400> 117 aacaaauuca uuugaaagaa ugaagga 27 <210> 118 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 20nt deletion in the first region) <400> 118 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 120 <210> 119 <211> 121 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 19nt deletion in the first region) <400> 119 aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 u 121 <210> 120 <211> 122 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 18nt deletion in the first region) <400> 120 gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag gugggcugcu 60 ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa acaaauucau 120 uu 122 <210> 121 <211> 123 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 17nt deletion in the first region) <400> 121 agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuu 123 <210> 122 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 16nt deletion in the first region) <400> 122 gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga aggugggcug 60 cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg aaacaaauuc 120 auuu 124 <210> 123 <211> 125 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 15nt deletion in the first region) <400> 123 ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuu 125 <210> 124 <211> 126 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 14nt deletion in the first region) <400> 124 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuu 126 <210> 125 <211> 127 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 13nt deletion in the first region) <400> 125 guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuu 127 <210> 126 <211> 128 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 12nt deletion in the first region) <400> 126 aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga gugaaggugg 60 gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc cucgaaacaa 120 auucauuu 128 <210> 127 <211> 129 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 11nt deletion in the first region) <400> 127 aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuu 129 <210> 128 <211> 130 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 10nt deletion in the first region) <400> 128 aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu 60 gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac 120 aaauucauuu 130 <210> 129 <211> 131 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 9nt deletion in the first region) <400> 129 uaaaguggag aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu u 131 <210> 130 <211> 132 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 8nt deletion in the first region) <400> 130 auaaagugga gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uu 132 <210> 131 <211> 133 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 7nt deletion in the first region) <400> 131 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuu 133 <210> 132 <211> 134 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 6nt deletion in the first region) <400> 132 ugauaaagug gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuu 134 <210> 133 <211> 135 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 5nt deletion in the first region) <400> 133 cugauaaagu ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug 60 aaggugggcu gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc 120 gaaacaaauu cauuu 135 <210> 134 <211> 136 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 4nt deletion in the first region) <400> 134 acugauaaag uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuu 136 <210> 135 <211> 137 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 3nt deletion in the first region) <400> 135 cacugauaaa guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag 60 ugaagguggg cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc 120 ucgaaacaaa uucauuu 137 <210> 136 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 2nt deletion in the first region) <400> 136 ucacugauaa aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuu 138 <210> 137 <211> 139 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 1nt deletion in the first region) <400> 137 uucacugaua aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug 60 agugaaggug ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac 120 ccucgaaaca aauucauuu 139 <210> 138 <211> 117 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 10bp deletion in the second region) <400> 138 cuucacugau aaaguggaga accgcuucac cauuagugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa uucauuu 117 <210> 139 <211> 119 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 9bp deletion in the second region) <400> 139 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuu 119 <210> 140 <211> 122 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 8bp deletion in the second region) <400> 140 cuucacugau aaaguggaga accgcuucac caaauuagac uugagugaag gugggcugcu 60 ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa acaaauucau 120 uu 122 <210> 141 <211> 125 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 7bp deletion in the second region) <400> 141 cuucacugau aaaguggaga accgcuucac caaaaguuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuu 125 <210> 142 <211> 127 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 6bp deletion in the second region) <400> 142 cuucacugau aaaguggaga accgcuucac caaaagcuua ggaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuu 127 <210> 143 <211> 129 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 5bp deletion in the second region) <400> 143 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuu 129 <210> 144 <211> 132 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 4bp deletion in the second region) <400> 144 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uu 132 <210> 145 <211> 131 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 4bp+1nt deletion in the second region) <400> 145 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu u 131 <210> 146 <211> 134 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 3bp deletion in the second region) <400> 146 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuu 134 <210> 147 <211> 136 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 2bp deletion in the second region) <400> 147 cuucacugau aaaguggaga accgcuucac caaaagcugu cuuaggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuu 136 <210> 148 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 1bp deletion in the second region) <400> 148 cuucacugau aaaguggaga accgcuucac caaaagcugu ccuuagggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuu 138 <210> 149 <211> 133 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 7nt deletion in the fourth region) <400> 149 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaa 133 <210> 150 <211> 134 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 6nt deletion in the fourth region) <400> 150 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaau 134 <210> 151 <211> 135 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 5nt deletion in the fourth region) <400> 151 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauu 135 <210> 152 <211> 136 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 4nt deletion in the fourth region) <400> 152 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauuc 136 <210> 153 <211> 137 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 3nt deletion in the fourth region) <400> 153 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauuca 137 <210> 154 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 2nt deletion in the fourth region) <400> 154 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucau 138 <210> 155 <211> 139 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, 1nt deletion in the fourth region) <400> 155 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauu 139 <210> 156 <211> 97 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, modified first region and modified second region) <400> 156 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa uucauuu 97 <210> 157 <211> 113 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, modified first region and modified fourth and fifth region) <400> 157 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaa 113 <210> 158 <211> 110 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, modified second region and modified fourth and fifth region) <400> 158 cuucacugau aaaguggaga accgcuucac cauuagugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 110 <210> 159 <211> 90 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA(mature form, modified first region, modified second region, modified fourth and fifth region) <400> 159 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa 90 <210> 160 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat(mature form, 7nt deletion in the fifth region) <400> 160 ggaaugcaac 10 <210> 161 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat(mature form, 6nt deletion in the fifth region) <400> 161 aggaaugcaa c 11 <210> 162 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat(mature form, 5nt deletion in the fifth region) <400> 162 aaggaaugca ac 12 <210> 163 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat(mature form, 4nt deletion in the fifth region) <400> 163 gaaggaaugc aac 13 <210> 164 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat(mature form, 3nt deletion in the fifth region) <400> 164 ugaaggaaug caac 14 <210> 165 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat(mature form, 2nt deletion in the fifth region) <400> 165 augaaggaau gcaac 15 <210> 166 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat(mature form, 1nt deletion in the fifth region) <400> 166 aaugaaggaa ugcaac 16 <210> 167 <211> 141 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 20nt deletion in the first region) <400> 167 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa c 141 <210> 168 <211> 142 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 19nt deletion in the first region) <400> 168 aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 ugaaagaaug aaggaaugca ac 142 <210> 169 <211> 143 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 18nt deletion in the first region) <400> 169 gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag gugggcugcu 60 ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa acaaauucau 120 uugaaagaau gaaggaaugc aac 143 <210> 170 <211> 144 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 17nt deletion in the first region) <400> 170 agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuugaaagaa ugaaggaaug caac 144 <210> 171 <211> 145 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 16nt deletion in the first region) <400> 171 gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga aggugggcug 60 cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg aaacaaauuc 120 auuugaaaga augaaggaau gcaac 145 <210> 172 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 15nt deletion in the first region) <400> 172 ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuugaaag aaugaaggaa ugcaac 146 <210> 173 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 14nt deletion in the first region) <400> 173 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaac 147 <210> 174 <211> 148 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 13nt deletion in the first region) <400> 174 guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuugaa agaaugaagg aaugcaac 148 <210> 175 <211> 149 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 12nt deletion in the first region) <400> 175 aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga gugaaggugg 60 gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc cucgaaacaa 120 auucauuuga aagaaugaag gaaugcaac 149 <210> 176 <211> 150 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 11nt deletion in the first region) <400> 176 aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac 150 <210> 177 <211> 151 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 10nt deletion in the first region) <400> 177 aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu 60 gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac 120 aaauucauuu gaaagaauga aggaaugcaa c 151 <210> 178 <211> 152 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 9nt deletion in the first region) <400> 178 uaaaguggag aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu ugaaagaaug aaggaaugca ac 152 <210> 179 <211> 153 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 8nt deletion in the first region) <400> 179 auaaagugga gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uugaaagaau gaaggaaugc aac 153 <210> 180 <211> 154 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 7nt deletion in the first region) <400> 180 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caac 154 <210> 181 <211> 155 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 6nt deletion in the first region) <400> 181 ugauaaagug gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaac 155 <210> 182 <211> 156 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 5nt deletion in the first region) <400> 182 cugauaaagu ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug 60 aaggugggcu gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc 120 gaaacaaauu cauuugaaag aaugaaggaa ugcaac 156 <210> 183 <211> 157 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 4nt deletion in the first region) <400> 183 acugauaaag uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuugaaa gaaugaagga augcaac 157 <210> 184 <211> 158 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 3nt deletion in the first region) <400> 184 cacugauaaa guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag 60 ugaagguggg cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc 120 ucgaaacaaa uucauuugaa agaaugaagg aaugcaac 158 <210> 185 <211> 159 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 2nt deletion in the first region) <400> 185 ucacugauaa aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuuga aagaaugaag gaaugcaac 159 <210> 186 <211> 160 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 1nt deletion in the first region) <400> 186 uucacugaua aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug 60 agugaaggug ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac 120 ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac 160 <210> 187 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 11bp deletion in the second region) <400> 187 cuucacugau aaaguggaga accgcuucac cauuagugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa 120 agaaugaagg aaugcaac 138 <210> 188 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 10bp deletion in the second region) <400> 188 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug 120 aaagaaugaa ggaaugcaac 140 <210> 189 <211> 142 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 9bp deletion in the second region) <400> 189 cuucacugau aaaguggaga accgcuucac caaauuagcu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 ugaaagaaug aaggaaugca ac 142 <210> 190 <211> 144 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 8bp deletion in the second region) <400> 190 cuucacugau aaaguggaga accgcuucac caaaauuaga cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuugaaagaa ugaaggaaug caac 144 <210> 191 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 7bp deletion in the second region) <400> 191 cuucacugau aaaguggaga accgcuucac caaaaguuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuugaaag aaugaaggaa ugcaac 146 <210> 192 <211> 148 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 6bp deletion in the second region) <400> 192 cuucacugau aaaguggaga accgcuucac caaaagcuua ggaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuugaa agaaugaagg aaugcaac 148 <210> 193 <211> 150 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 5bp deletion in the second region) <400> 193 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac 150 <210> 194 <211> 152 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 4bp deletion in the second region) <400> 194 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu ugaaagaaug aaggaaugca ac 152 <210> 195 <211> 153 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 4bp+1nt deletion in the second region) <400> 195 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uugaaagaau gaaggaaugc aac 153 <210> 196 <211> 155 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 3bp deletion in the second region) <400> 196 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaac 155 <210> 197 <211> 157 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 2bp deletion in the second region) <400> 197 cuucacugau aaaguggaga accgcuucac caaaagcugu cuuaggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuugaaa gaaugaagga augcaac 157 <210> 198 <211> 159 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 1bp deletion in the second region) <400> 198 cuucacugau aaaguggaga accgcuucac caaaagcugu ccuuagggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuuga aagaaugaag gaaugcaac 159 <210> 199 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 7bp deletion in the fourth and fifth region) <400> 199 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaac 147 <210> 200 <211> 149 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 6bp deletion in the fourth and fifth region) <400> 200 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaaugaaaag gaaugcaac 149 <210> 201 <211> 151 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 5bp deletion in the fourth and fifth region) <400> 201 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauugaaaa aggaaugcaa c 151 <210> 202 <211> 153 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 4bp deletion in the fourth and fifth region) <400> 202 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aac 153 <210> 203 <211> 155 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 3bp deletion in the fourth and fifth region) <400> 203 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucagaa augaaggaau gcaac 155 <210> 204 <211> 157 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 2bp deletion in the fourth and fifth region) <400> 204 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaac 157 <210> 205 <211> 159 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 1bp deletion in the fourth and fifth region) <400> 205 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauug aaaaaugaag gaaugcaac 159 <210> 206 <211> 118 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, modified first and second region) <400> 206 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa agaaugaagg aaugcaac 118 <210> 207 <211> 127 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, modified first, fourth and fifth region) <400> 207 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaac 127 <210> 208 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, modified second, fourth and fifth region) <400> 208 cuucacugau aaaguggaga accgcuucac cauuagugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug 120 caac 124 <210> 209 <211> 104 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, modified first, second, fourth and fifth region) <400> 209 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caac 104 <210> 210 <211> 189 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 1nt deletion in the first region) <400> 210 uucacugaua aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug 60 agugaaggug ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac 120 ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn 180 uuuuauuuu 189 <210> 211 <211> 188 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 2nt deletion in the first region) <400> 211 ucacugauaa aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuuga aagaaugaag gaaugcaacn nnnnnnnnnn nnnnnnnnnu 180 uuuauuuu 188 <210> 212 <211> 187 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 3nt deletion in the first region) <400> 212 cacugauaaa guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag 60 ugaagguggg cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc 120 ucgaaacaaa uucauuugaa agaaugaagg aaugcaacnn nnnnnnnnnn nnnnnnnnuu 180 uuauuuu 187 <210> 213 <211> 186 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 4nt deletion in the first region) <400> 213 acugauaaag uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuugaaa gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnnuuu 180 uauuuu 186 <210> 214 <211> 185 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 5nt deletion in the first region) <400> 214 cugauaaagu ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug 60 aaggugggcu gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc 120 gaaacaaauu cauuugaaag aaugaaggaa ugcaacnnnn nnnnnnnnnn nnnnnnuuuu 180 auuuu 185 <210> 215 <211> 184 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 6nt deletion in the first region) <400> 215 ugauaaagug gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaacnnnnn nnnnnnnnnn nnnnnuuuua 180 uuuu 184 <210> 216 <211> 183 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 7nt deletion in the first region) <400> 216 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caacnnnnnn nnnnnnnnnn nnnnuuuuau 180 uuu 183 <210> 217 <211> 182 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 8nt deletion in the first region) <400> 217 auaaagugga gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uugaaagaau gaaggaaugc aacnnnnnnn nnnnnnnnnn nnnuuuuauu 180 uu 182 <210> 218 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 9nt deletion in the first region) <400> 218 uaaaguggag aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu ugaaagaaug aaggaaugca acnnnnnnnn nnnnnnnnnn nnuuuuauuu 180 u 181 <210> 219 <211> 180 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 10nt deletion in the first region) <400> 219 aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu 60 gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac 120 aaauucauuu gaaagaauga aggaaugcaa cnnnnnnnnn nnnnnnnnnn nuuuuauuuu 180 180 <210> 220 <211> 179 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 11nt deletion in the first region) <400> 220 aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn uuuuauuuu 179 <210> 221 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 12nt deletion in the first region) <400> 221 aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga gugaaggugg 60 gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc cucgaaacaa 120 auucauuuga aagaaugaag gaaugcaacn nnnnnnnnnn nnnnnnnnnu 170 <210> 222 <211> 168 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 13nt deletion in the first region) <400> 222 guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuugaa agaaugaagg aaugcaacnn nnnnnnnnnn nnnnnnnn 168 <210> 223 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 14nt deletion in the first region) <400> 223 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 167 <210> 224 <211> 166 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 15nt deletion in the first region) <400> 224 ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuugaaag aaugaaggaa ugcaacnnnn nnnnnnnnnn nnnnnn 166 <210> 225 <211> 165 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 16nt deletion in the first region) <400> 225 gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga aggugggcug 60 cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg aaacaaauuc 120 auuugaaaga augaaggaau gcaacnnnnn nnnnnnnnnn nnnnn 165 <210> 226 <211> 164 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 17nt deletion in the first region) <400> 226 agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuugaaagaa ugaaggaaug caacnnnnnn nnnnnnnnnn nnnn 164 <210> 227 <211> 163 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 18nt deletion in the first region) <400> 227 gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag gugggcugcu 60 ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa acaaauucau 120 uugaaagaau gaaggaaugc aacnnnnnnn nnnnnnnnnn nnn 163 <210> 228 <211> 162 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 19nt deletion in the first region) <400> 228 aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 ugaaagaaug aaggaaugca acnnnnnnnn nnnnnnnnnn nn 162 <210> 229 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 20nt deletion in the first region) <400> 229 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa cnnnnnnnnn nnnnnnnnnn n 161 <210> 230 <211> 179 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 1bp deletion in the second region) <400> 230 cuucacugau aaaguggaga accgcuucac caaaagcugu ccuuagggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuuga aagaaugaag gaaugcaacn nnnnnnnnnn nnnnnnnnn 179 <210> 231 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 2bp deletion in the second region) <400> 231 cuucacugau aaaguggaga accgcuucac caaaagcugu cuuaggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuugaaa gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 177 <210> 232 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 3bp deletion in the second region) <400> 232 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaacnnnnn nnnnnnnnnn nnnnn 175 <210> 233 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 4bp+1nt deletion in the second region) <400> 233 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uugaaagaau gaaggaaugc aacnnnnnnn nnnnnnnnnn nnn 173 <210> 234 <211> 172 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 4bp deletion in the second region) <400> 234 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu ugaaagaaug aaggaaugca acnnnnnnnn nnnnnnnnnn nn 172 <210> 235 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 5bp deletion in the second region) <400> 235 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn 170 <210> 236 <211> 168 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 6bp deletion in the second region) <400> 236 cuucacugau aaaguggaga accgcuucac caaaagcuua ggaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuugaa agaaugaagg aaugcaacnn nnnnnnnnnn nnnnnnnn 168 <210> 237 <211> 166 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 7bp deletion in the second region) <400> 237 cuucacugau aaaguggaga accgcuucac caaaaguuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuugaaag aaugaaggaa ugcaacnnnn nnnnnnnnnn nnnnnn 166 <210> 238 <211> 164 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 8bp deletion in the second region) <400> 238 cuucacugau aaaguggaga accgcuucac caaaauuaga cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuugaaagaa ugaaggaaug caacnnnnnn nnnnnnnnnn nnnn 164 <210> 239 <211> 162 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 9bp deletion in the second region) <400> 239 cuucacugau aaaguggaga accgcuucac caaauuagcu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 ugaaagaaug aaggaaugca acnnnnnnnn nnnnnnnnnn nn 162 <210> 240 <211> 160 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 10bp deletion in the second region) <400> 240 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug 120 aaagaaugaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn 160 <210> 241 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 11bp deletion in the second region) <400> 241 cuucacugau aaaguggaga accgcuucac cauuagugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa 120 agaaugaagg aaugcaacnn nnnnnnnnnn nnnnnnnnuu uuauuuu 167 <210> 242 <211> 179 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 1bp deletion in the fourth and fifth region) <400> 242 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauug aaaaaugaag gaaugcaacn nnnnnnnnnn nnnnnnnnn 179 <210> 243 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 2bp deletion in the fourth and fifth region) <400> 243 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 177 <210> 244 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 3bp deletion in the fourth and fifth region) <400> 244 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucagaa augaaggaau gcaacnnnnn nnnnnnnnnn nnnnn 175 <210> 245 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 4bp deletion in the fourth and fifth region) <400> 245 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aacnnnnnnn nnnnnnnnnn nnn 173 <210> 246 <211> 171 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 5bp deletion in the fourth and fifth region) <400> 246 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauugaaaa aggaaugcaa cnnnnnnnnn nnnnnnnnnn n 171 <210> 247 <211> 169 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 6bp deletion in the fourth and fifth region) <400> 247 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaaugaaaag gaaugcaacn nnnnnnnnnn nnnnnnnnn 169 <210> 248 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, 7bp deletion in the fourth and fifth region) <400> 248 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaacnnn nnnnnnnnnn nnnnnnn 167 <210> 249 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first and second region) <400> 249 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa agaaugaagg aaugcaacnn 120 nnnnnnnnnn nnnnnnnn 138 <210> 250 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first, fourth and fifth region) <400> 250 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaacnnn nnnnnnnnnn nnnnnnn 147 <210> 251 <211> 144 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified second, fourth and fifth region) <400> 251 cuucacugau aaaguggaga accgcuucac cauuagugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug 120 caacnnnnnn nnnnnnnnnn nnnn 144 <210> 252 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first, second, fourth and fifth region) <400> 252 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 253 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first, second, fourth and fifth region) <400> 253 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 254 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first, second, fourth and fifth region) <400> 254 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 255 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first, second, fourth and fifth region) <400> 255 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 256 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first, second, fourth and fifth region) <400> 256 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 257 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first, second, fourth and fifth region) <400> 257 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 258 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA(mature form, modified first, second, fourth and fifth region) <400> 258 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 259 <211> 529 <212> PRT <213> Artificial Sequence <220> <223> Cas14a1 amino acid sequence <400> 259 Met Ala Lys Asn Thr Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg 1 5 10 15 Pro Tyr Asn Ser Ala Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn 20 25 30 Asn Arg Glu Lys Ile Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu 35 40 45 Ala Cys Ser Lys His Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val 50 55 60 Glu Arg Asn Ala Cys Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys 65 70 75 80 Phe Tyr Gln Lys Leu Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln 85 90 95 Glu Ile Ser Glu Ile Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile 100 105 110 Tyr Asn Gln Ser Leu Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly 115 120 125 Lys Gly Ile Ala Asn Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val 130 135 140 Cys Tyr Thr Arg Ala Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser 145 150 155 160 Gly Leu Arg Ser Lys Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys 165 170 175 Asn Met Lys Ser Gly Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile 180 185 190 Pro Leu Val Lys Gln Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser 195 200 205 Asn His Asn Ser Asp Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln 210 215 220 Val Lys Lys Glu Ile Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe 225 230 235 240 Glu Gln Val Gln Lys Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr 245 250 255 Gln Arg Arg Lys Arg Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu 260 265 270 Ala Glu Ile Lys Lys Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile 275 280 285 Glu Val Lys Arg Gly Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu 290 295 300 Asn Leu Ser Ile Asp Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser 305 310 315 320 Ile Ile Gly Gly Ile Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala 325 330 335 Ile Asn Asn Ala Phe Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe 340 345 350 His Phe Asn Lys Lys Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys 355 360 365 Asn Arg His Lys Arg Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro 370 375 380 Ile Thr Ile Leu Thr Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile 385 390 395 400 Glu Arg Trp Ala Cys Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val 405 410 415 Gly Thr Val Gln Met Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp 420 425 430 Ser Tyr Phe Asn Ile Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met 435 440 445 Gln Asn Lys Ile Glu Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg 450 455 460 Lys Val Ala Pro Asn Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His 465 470 475 480 Leu Asn Asn Tyr Phe Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro 485 490 495 His Phe Lys Cys Glu Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn 500 505 510 Ala Ala Leu Asn Ile Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu 515 520 525 Pro <210> 260 <211> 536 <212> PRT <213> Artificial Sequence <220> <223> N-terminal NLS + Cas14a1 amino acid sequence <400> 260 Met Pro Lys Lys Lys Arg Lys Val Ala Lys Asn Thr Ile Thr Lys Thr 1 5 10 15 Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser Ala Glu Val Glu Lys 20 25 30 Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys Ile Ala Leu Glu Lys 35 40 45 Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys His Leu Lys Val Ala 50 55 60 Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala Cys Leu Phe Cys Lys 65 70 75 80 Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys Leu Arg Gly Gln Phe 85 90 95 Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu Ile Phe Arg Gln Leu 100 105 110 Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser Leu Ile Glu Leu Tyr 115 120 125 Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala Asn Ala Ser Ser Val 130 135 140 Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg Ala Ala Glu Leu Phe 145 150 155 160 Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser Lys Ile Lys Ser Asn 165 170 175 Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser Gly Leu Pro Thr Thr 180 185 190 Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys Gln Lys Gly Gly Gln 195 200 205 Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser Asp Phe Ile Ile Lys 210 215 220 Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu Ile Asp Lys Tyr Arg 225 230 235 240 Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln Lys Ser Pro Lys Pro 245 250 255 Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys Arg Asn Lys Gly Trp 260 265 270 Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys Lys Val Met Asn Gly 275 280 285 Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg Gly Ser Lys Ile Gly 290 295 300 Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile Asp Val Pro Lys Ile 305 310 315 320 Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly Ile Asp Val Gly Val 325 330 335 Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala Phe Ser Arg Tyr Ser 340 345 350 Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys Lys Met Phe Ala Arg 355 360 365 Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys Arg Ala Gly His Gly 370 375 380 Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu Thr Glu Lys Ser Glu 385 390 395 400 Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala Cys Glu Ile Ala Asp 405 410 415 Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln Met Glu Asn Leu Glu 420 425 430 Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn Ile Arg Leu Arg Gly 435 440 445 Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile Glu Phe Lys Leu Lys 450 455 460 Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro Asn Asn Thr Ser Lys 465 470 475 480 Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr Phe Asn Phe Glu Tyr 485 490 495 Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys Glu Lys Cys Asn Phe 500 505 510 Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn Ile Ser Asn Pro Lys 515 520 525 Leu Lys Ser Thr Lys Glu Glu Pro 530 535 <210> 261 <211> 536 <212> PRT <213> Artificial Sequence <220> <223> C-terminal NLS + Cas14a1 amino acid sequence <400> 261 Met Ala Lys Asn Thr Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg 1 5 10 15 Pro Tyr Asn Ser Ala Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn 20 25 30 Asn Arg Glu Lys Ile Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu 35 40 45 Ala Cys Ser Lys His Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val 50 55 60 Glu Arg Asn Ala Cys Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys 65 70 75 80 Phe Tyr Gln Lys Leu Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln 85 90 95 Glu Ile Ser Glu Ile Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile 100 105 110 Tyr Asn Gln Ser Leu Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly 115 120 125 Lys Gly Ile Ala Asn Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val 130 135 140 Cys Tyr Thr Arg Ala Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser 145 150 155 160 Gly Leu Arg Ser Lys Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys 165 170 175 Asn Met Lys Ser Gly Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile 180 185 190 Pro Leu Val Lys Gln Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser 195 200 205 Asn His Asn Ser Asp Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln 210 215 220 Val Lys Lys Glu Ile Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe 225 230 235 240 Glu Gln Val Gln Lys Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr 245 250 255 Gln Arg Arg Lys Arg Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu 260 265 270 Ala Glu Ile Lys Lys Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile 275 280 285 Glu Val Lys Arg Gly Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu 290 295 300 Asn Leu Ser Ile Asp Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser 305 310 315 320 Ile Ile Gly Gly Ile Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala 325 330 335 Ile Asn Asn Ala Phe Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe 340 345 350 His Phe Asn Lys Lys Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys 355 360 365 Asn Arg His Lys Arg Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro 370 375 380 Ile Thr Ile Leu Thr Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile 385 390 395 400 Glu Arg Trp Ala Cys Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val 405 410 415 Gly Thr Val Gln Met Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp 420 425 430 Ser Tyr Phe Asn Ile Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met 435 440 445 Gln Asn Lys Ile Glu Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg 450 455 460 Lys Val Ala Pro Asn Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His 465 470 475 480 Leu Asn Asn Tyr Phe Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro 485 490 495 His Phe Lys Cys Glu Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn 500 505 510 Ala Ala Leu Asn Ile Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu 515 520 525 Pro Pro Lys Lys Lys Arg Lys Val 530 535 <210> 262 <211> 543 <212> PRT <213> Artificial Sequence <220> <223> N/C-terminal NLS + Cas14a1 amino acid sequence <400> 262 Met Pro Lys Lys Lys Arg Lys Val Ala Lys Asn Thr Ile Thr Lys Thr 1 5 10 15 Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser Ala Glu Val Glu Lys 20 25 30 Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys Ile Ala Leu Glu Lys 35 40 45 Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys His Leu Lys Val Ala 50 55 60 Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala Cys Leu Phe Cys Lys 65 70 75 80 Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys Leu Arg Gly Gln Phe 85 90 95 Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu Ile Phe Arg Gln Leu 100 105 110 Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser Leu Ile Glu Leu Tyr 115 120 125 Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala Asn Ala Ser Ser Val 130 135 140 Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg Ala Ala Glu Leu Phe 145 150 155 160 Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser Lys Ile Lys Ser Asn 165 170 175 Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser Gly Leu Pro Thr Thr 180 185 190 Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys Gln Lys Gly Gly Gln 195 200 205 Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser Asp Phe Ile Ile Lys 210 215 220 Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu Ile Asp Lys Tyr Arg 225 230 235 240 Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln Lys Ser Pro Lys Pro 245 250 255 Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys Arg Asn Lys Gly Trp 260 265 270 Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys Lys Val Met Asn Gly 275 280 285 Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg Gly Ser Lys Ile Gly 290 295 300 Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile Asp Val Pro Lys Ile 305 310 315 320 Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly Ile Asp Val Gly Val 325 330 335 Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala Phe Ser Arg Tyr Ser 340 345 350 Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys Lys Met Phe Ala Arg 355 360 365 Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys Arg Ala Gly His Gly 370 375 380 Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu Thr Glu Lys Ser Glu 385 390 395 400 Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala Cys Glu Ile Ala Asp 405 410 415 Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln Met Glu Asn Leu Glu 420 425 430 Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn Ile Arg Leu Arg Gly 435 440 445 Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile Glu Phe Lys Leu Lys 450 455 460 Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro Asn Asn Thr Ser Lys 465 470 475 480 Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr Phe Asn Phe Glu Tyr 485 490 495 Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys Glu Lys Cys Asn Phe 500 505 510 Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn Ile Ser Asn Pro Lys 515 520 525 Leu Lys Ser Thr Lys Glu Glu Pro Pro Lys Lys Lys Arg Lys Val 530 535 540 <210> 263 <211> 1003 <212> PRT <213> Artificial Sequence <220> <223> Amino acid sequence (N terminal -Cytidine deaminase) <400> 263 Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala 1 5 10 15 Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val 20 25 30 Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu 35 40 45 Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn 50 55 60 Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu 65 70 75 80 Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser 85 90 95 Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser 100 105 110 Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His 115 120 125 His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser 130 135 140 Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp 145 150 155 160 Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg 165 170 175 Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile 180 185 190 Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro 195 200 205 Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg 210 215 220 Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Gly Ser 225 230 235 240 Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala 245 250 255 Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser Ala Lys Asn Thr 260 265 270 Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser Ala 275 280 285 Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys Ile 290 295 300 Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys His 305 310 315 320 Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala Cys 325 330 335 Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys Leu 340 345 350 Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu Ile 355 360 365 Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser Leu 370 375 380 Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala Asn 385 390 395 400 Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg Ala 405 410 415 Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser Lys 420 425 430 Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser Gly 435 440 445 Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys Gln 450 455 460 Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser Asp 465 470 475 480 Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu Ile 485 490 495 Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln Lys 500 505 510 Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys Arg 515 520 525 Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys Lys 530 535 540 Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg Gly 545 550 555 560 Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile Asp 565 570 575 Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly Ile 580 585 590 Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala Phe 595 600 605 Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys Lys 610 615 620 Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys Arg 625 630 635 640 Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu Thr 645 650 655 Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala Cys 660 665 670 Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln Met 675 680 685 Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn Ile 690 695 700 Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile Glu 705 710 715 720 Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro Asn 725 730 735 Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr Phe 740 745 750 Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys Glu 755 760 765 Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn Ile 770 775 780 Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu Pro Ser Gly Gly Ser 785 790 795 800 Gly Gly Ser Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu 805 810 815 Thr Gly Lys Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu 820 825 830 Glu Val Glu Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val 835 840 845 His Thr Ala Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr 850 855 860 Ser Asp Ala Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser 865 870 875 880 Asn Gly Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Gly Gly Ser 885 890 895 Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys 900 905 910 Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu 915 920 925 Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala 930 935 940 Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala 945 950 955 960 Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu 965 970 975 Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Lys Arg Thr Ala Asp Gly 980 985 990 Ser Glu Phe Glu Pro Lys Lys Lys Arg Lys Val 995 1000 <210> 264 <211> 1003 <212> PRT <213> Artificial Sequence <220> <223> Amino acid sequence (C terminal -Cytidine deaminase) <400> 264 Met Pro Lys Lys Lys Arg Lys Val Ala Lys Asn Thr Ile Thr Lys Thr 1 5 10 15 Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser Ala Glu Val Glu Lys 20 25 30 Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys Ile Ala Leu Glu Lys 35 40 45 Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys His Leu Lys Val Ala 50 55 60 Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala Cys Leu Phe Cys Lys 65 70 75 80 Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys Leu Arg Gly Gln Phe 85 90 95 Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu Ile Phe Arg Gln Leu 100 105 110 Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser Leu Ile Glu Leu Tyr 115 120 125 Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala Asn Ala Ser Ser Val 130 135 140 Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg Ala Ala Glu Leu Phe 145 150 155 160 Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser Lys Ile Lys Ser Asn 165 170 175 Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser Gly Leu Pro Thr Thr 180 185 190 Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys Gln Lys Gly Gly Gln 195 200 205 Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser Asp Phe Ile Ile Lys 210 215 220 Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu Ile Asp Lys Tyr Arg 225 230 235 240 Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln Lys Ser Pro Lys Pro 245 250 255 Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys Arg Asn Lys Gly Trp 260 265 270 Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys Lys Val Met Asn Gly 275 280 285 Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg Gly Ser Lys Ile Gly 290 295 300 Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile Asp Val Pro Lys Ile 305 310 315 320 Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly Ile Asp Val Gly Val 325 330 335 Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala Phe Ser Arg Tyr Ser 340 345 350 Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys Lys Met Phe Ala Arg 355 360 365 Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys Arg Ala Gly His Gly 370 375 380 Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu Thr Glu Lys Ser Glu 385 390 395 400 Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala Cys Glu Ile Ala Asp 405 410 415 Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln Met Glu Asn Leu Glu 420 425 430 Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn Ile Arg Leu Arg Gly 435 440 445 Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile Glu Phe Lys Leu Lys 450 455 460 Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro Asn Asn Thr Ser Lys 465 470 475 480 Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr Phe Asn Phe Glu Tyr 485 490 495 Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys Glu Lys Cys Asn Phe 500 505 510 Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn Ile Ser Asn Pro Lys 515 520 525 Leu Lys Ser Thr Lys Glu Glu Pro Ser Gly Gly Ser Ser Gly Gly Ser 530 535 540 Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser 545 550 555 560 Ser Gly Gly Ser Ser Gly Gly Ser Ser Ser Glu Thr Gly Pro Val Ala 565 570 575 Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val 580 585 590 Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu 595 600 605 Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn 610 615 620 Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu 625 630 635 640 Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser 645 650 655 Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser 660 665 670 Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His 675 680 685 His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser 690 695 700 Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp 705 710 715 720 Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg 725 730 735 Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile 740 745 750 Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro 755 760 765 Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg 770 775 780 Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Gly Ser 785 790 795 800 Gly Gly Ser Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu 805 810 815 Thr Gly Lys Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu 820 825 830 Glu Val Glu Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val 835 840 845 His Thr Ala Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr 850 855 860 Ser Asp Ala Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser 865 870 875 880 Asn Gly Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Gly Gly Ser 885 890 895 Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys 900 905 910 Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu 915 920 925 Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala 930 935 940 Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala 945 950 955 960 Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu 965 970 975 Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Lys Arg Thr Ala Asp Gly 980 985 990 Ser Glu Phe Glu Pro Lys Lys Lys Arg Lys Val 995 1000 <210> 265 <211> 941 <212> PRT <213> Artificial Sequence <220> <223> DNA sequence (N terminal -Adenine deaminase) <400> 265 Met Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu 1 5 10 15 Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala 20 25 30 Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro 35 40 45 Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg 50 55 60 Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu 65 70 75 80 Tyr Val Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His 85 90 95 Ser Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly 100 105 110 Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn His 115 120 125 Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu 130 135 140 Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys 145 150 155 160 Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser 165 170 175 Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser 180 185 190 Gly Gly Ser Ser Gly Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr 195 200 205 Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg 210 215 220 Glu Val Pro Val Gly Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly 225 230 235 240 Glu Gly Trp Asn Arg Ala Ile Gly Leu His Asp Pro Thr Ala His Ala 245 250 255 Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg 260 265 270 Leu Ile Asp Ala Thr Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys 275 280 285 Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val Val Phe Gly Val 290 295 300 Arg Asn Ala Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val Leu His 305 310 315 320 Tyr Pro Gly Met Asn His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala 325 330 335 Asp Glu Cys Ala Ala Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln 340 345 350 Val Phe Asn Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly 355 360 365 Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser 370 375 380 Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser Ala Lys Asn 385 390 395 400 Thr Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser 405 410 415 Ala Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys 420 425 430 Ile Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys 435 440 445 His Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala 450 455 460 Cys Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys 465 470 475 480 Leu Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu 485 490 495 Ile Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser 500 505 510 Leu Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala 515 520 525 Asn Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg 530 535 540 Ala Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser 545 550 555 560 Lys Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser 565 570 575 Gly Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys 580 585 590 Gln Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser 595 600 605 Asp Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu 610 615 620 Ile Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln 625 630 635 640 Lys Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys 645 650 655 Arg Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys 660 665 670 Lys Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg 675 680 685 Gly Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile 690 695 700 Asp Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly 705 710 715 720 Ile Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala 725 730 735 Phe Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys 740 745 750 Lys Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys 755 760 765 Arg Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu 770 775 780 Thr Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala 785 790 795 800 Cys Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln 805 810 815 Met Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn 820 825 830 Ile Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile 835 840 845 Glu Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro 850 855 860 Asn Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr 865 870 875 880 Phe Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys 885 890 895 Glu Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn 900 905 910 Ile Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu Pro Lys Arg Pro 915 920 925 Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 930 935 940 <210> 266 <211> 941 <212> PRT <213> Artificial Sequence <220> <223> DNA sequence (C terminal -Adenine deaminase) <400> 266 Met Ala Lys Asn Thr Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg 1 5 10 15 Pro Tyr Asn Ser Ala Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn 20 25 30 Asn Arg Glu Lys Ile Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu 35 40 45 Ala Cys Ser Lys His Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val 50 55 60 Glu Arg Asn Ala Cys Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys 65 70 75 80 Phe Tyr Gln Lys Leu Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln 85 90 95 Glu Ile Ser Glu Ile Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile 100 105 110 Tyr Asn Gln Ser Leu Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly 115 120 125 Lys Gly Ile Ala Asn Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val 130 135 140 Cys Tyr Thr Arg Ala Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser 145 150 155 160 Gly Leu Arg Ser Lys Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys 165 170 175 Asn Met Lys Ser Gly Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile 180 185 190 Pro Leu Val Lys Gln Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser 195 200 205 Asn His Asn Ser Asp Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln 210 215 220 Val Lys Lys Glu Ile Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe 225 230 235 240 Glu Gln Val Gln Lys Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr 245 250 255 Gln Arg Arg Lys Arg Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu 260 265 270 Ala Glu Ile Lys Lys Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile 275 280 285 Glu Val Lys Arg Gly Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu 290 295 300 Asn Leu Ser Ile Asp Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser 305 310 315 320 Ile Ile Gly Gly Ile Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala 325 330 335 Ile Asn Asn Ala Phe Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe 340 345 350 His Phe Asn Lys Lys Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys 355 360 365 Asn Arg His Lys Arg Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro 370 375 380 Ile Thr Ile Leu Thr Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile 385 390 395 400 Glu Arg Trp Ala Cys Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val 405 410 415 Gly Thr Val Gln Met Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp 420 425 430 Ser Tyr Phe Asn Ile Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met 435 440 445 Gln Asn Lys Ile Glu Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg 450 455 460 Lys Val Ala Pro Asn Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His 465 470 475 480 Leu Asn Asn Tyr Phe Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro 485 490 495 His Phe Lys Cys Glu Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn 500 505 510 Ala Ala Leu Asn Ile Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu 515 520 525 Pro Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly 530 535 540 Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly 545 550 555 560 Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu 565 570 575 Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala 580 585 590 Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro 595 600 605 Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg 610 615 620 Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu 625 630 635 640 Tyr Val Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His 645 650 655 Ser Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly 660 665 670 Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn His 675 680 685 Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu 690 695 700 Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys 705 710 715 720 Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser 725 730 735 Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser 740 745 750 Gly Gly Ser Ser Gly Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr 755 760 765 Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg 770 775 780 Glu Val Pro Val Gly Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly 785 790 795 800 Glu Gly Trp Asn Arg Ala Ile Gly Leu His Asp Pro Thr Ala His Ala 805 810 815 Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg 820 825 830 Leu Ile Asp Ala Thr Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys 835 840 845 Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val Val Phe Gly Val 850 855 860 Arg Asn Ala Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val Leu His 865 870 875 880 Tyr Pro Gly Met Asn His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala 885 890 895 Asp Glu Cys Ala Ala Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln 900 905 910 Val Phe Asn Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Lys Arg Pro 915 920 925 Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 930 935 940 <210> 267 <211> 1680 <212> DNA <213> Artificial Sequence <220> <223> Human codon-optimized Cas14a1 with NLS <400> 267 atgccaaaga agaagcggaa ggtcggtatc cacggagtcc cagcagccgc caagaacaca 60 attacaaaga cactgaagct gaggatcgtg agaccataca acagcgctga ggtcgagaag 120 attgtggctg atgaaaagaa caacagggaa aagatcgccc tcgagaagaa caaggataag 180 gtgaaggagg cctgctctaa gcacctgaaa gtggccgcct actgcaccac acaggtggag 240 aggaacgcct gtctgttttg taaagctcgg aagctggatg ataagtttta ccagaagctg 300 cggggccagt tccccgatgc cgtcttttgg caggagatta gcgagatctt cagacagctg 360 cagaagcagg ccgccgagat ctacaaccag agcctgatcg agctctacta cgagatcttc 420 atcaagggca agggcattgc caacgcctcc tccgtggagc actacctgag cgacgtgtgc 480 tacacaagag ccgccgagct ctttaagaac gccgctatcg cttccgggct gaggagcaag 540 attaagagta acttccggct caaggagctg aagaacatga agagcggcct gcccactaca 600 aagagcgaca acttcccaat tccactggtg aagcagaagg ggggccagta cacagggttc 660 gagatttcca accacaacag cgactttatt attaagatcc cctttggcag gtggcaggtc 720 aagaaggaga ttgacaagta caggccctgg gagaagtttg atttcgagca ggtgcagaag 780 agccccaagc ctatttccct gctgctgtcc acacagcggc ggaagaggaa caaggggtgg 840 tctaaggatg aggggaccga ggccgagatt aagaaagtga tgaacggcga ctaccagaca 900 agctacatcg aggtcaagcg gggcagtaag attggcgaga agagcgcctg gatgctgaac 960 ctgagcattg acgtgccaaa gattgataag ggcgtggatc ccagcatcat cggagggatc 1020 gatgtggggg tcaagagccc cctcgtgtgc gccatcaaca acgccttcag caggtacagc 1080 atctccgata acgacctgtt ccactttaac aagaagatgt tcgcccggcg gaggattttg 1140 ctcaagaaga accggcacaa gcgggccgga cacggggcca agaacaagct caagcccatc 1200 actatcctga ccgagaagag cgagaggttc aggaagaagc tcatcgagag atgggcctgc 1260 gagatcgccg atttctttat taagaacaag gtcggaacag tgcagatgga gaacctcgag 1320 agcatgaaga ggaaggagga ttcctacttc aacattcggc tgagggggtt ctggccctac 1380 gctgagatgc agaacaagat tgagtttaag ctgaagcagt acgggattga gatccggaag 1440 gtggccccca acaacaccag caagacctgc agcaagtgcg ggcacctcaa caactacttc 1500 aacttcgagt accggaagaa gaacaagttc ccacacttca agtgcgagaa gtgcaacttt 1560 aaggagaacg ccgattacaa cgccgccctg aacatcagca accctaagct gaagagcact 1620 aaggaggagc ccaaaaggcc ggcggccacg aaaaaggccg gccaggcaaa aaagaaaaag 1680 1680 <210> 268 <211> 3009 <212> DNA <213> Artificial Sequence <220> <223> Deaminase fusion Cas14a1 <400> 268 atgccaaaga agaagcggaa agtctcctca gagactgggc ctgtcgccgt cgatccaacc 60 ctgcgccgcc ggattgaacc tcacgagttt gaagtgttct ttgacccccg ggagctgaga 120 aaggagacat gcctgctgta cgagatcaac tggggaggca ggcactccat ctggaggcac 180 acctctcaga acacaaataa gcacgtggag gtgaacttca tcgagaagtt taccacagag 240 cggtacttct gccccaatac cagatgtagc atcacatggt ttctgagctg gtccccttgc 300 ggagagtgta gcagggccat caccgagttc ctgtccagat atccacacgt gacactgttt 360 atctacatcg ccaggctgta tcaccacgca gacccaagga ataggcaggg cctgcgcgat 420 ctgatcagct ccggcgtgac catccagatc atgacagagc aggagtccgg ctactgctgg 480 cggaacttcg tgaattattc tcctagcaac gaggcccact ggcctaggta cccacacctg 540 tgggtgcgcc tgtacgtgct ggagctgtat tgcatcatcc tgggcctgcc cccttgtctg 600 aatatcctgc ggagaaagca gccccagctg accttcttta caatcgccct gcagtcttgt 660 cactatcaga ggctgccacc ccacatcctg tgggccacag gcctgaagtc tggaggatct 720 agcggaggat cctctggcag cgagacacca ggaacaagcg agtcagcaac accagagagc 780 agtggcggca gcagcggcgg cagcgccaag aacacaatta caaagacact gaagctgagg 840 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 900 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 960 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 1020 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 1080 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 1140 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 1200 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 1260 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 1320 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 1380 ctggtgaagc agaagggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 1440 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 1500 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 1560 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 1620 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 1680 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 1740 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1800 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1860 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1920 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1980 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 2040 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 2100 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagattgag 2160 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 2220 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 2280 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 2340 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagcccag cggcgggagc 2400 ggcgggagcg gggggagcac taatctgagc gacatcattg agaaggagac tgggaaacag 2460 ctggtcattc aggagtccat cctgatgctg cctgaggagg tggaggaagt gatcggcaac 2520 aagccagagt ctgacatcct ggtgcacacc gcctacgacg agtccacaga tgagaatgtg 2580 atgctgctga cctctgacgc ccccgagtat aagccttggg ccctggtcat ccaggattct 2640 aacggcgaga ataagatcaa gatgctgagc ggaggatccg gaggatctgg aggcagcacc 2700 aacctgtctg acatcatcga gaaggagaca ggcaagcagc tggtcatcca ggagagcatc 2760 ctgatgctgc ccgaagaagt cgaagaagtg atcggaaaca agcctgagag cgatatcctg 2820 gtccataccg cctacgacga gagtaccgac gaaaatgtga tgctgctgac atccgacgcc 2880 ccagagtata agccctgggc tctggtcatc caggattcca acggagagaa caaaatcaaa 2940 atgctgtctg gcggctcaaa aagaaccgcc gacggcagcg aattcgagcc caagaagaag 3000 aggaaagtc 3009 <210> 269 <211> 1586 <212> DNA <213> Artificial Sequence <220> <223> human codon-optimized Cas14a1 <400> 269 atggccaaga acacaattac aaagacactg aagctgagga tcgtgagacc atacaacagc 60 gctgaggtcg agaagattgt ggctgatgaa aagaacaaca gggaaaagat cgccctcgag 120 aagaacaagg ataaggtgaa ggaggcctgc tctaagcacc tgaaagtggc cgcctactgc 180 accacacagg tggagaggaa cgcctgtctg ttttgtaaag ctcggaagct ggatgataag 240 ttttaccaga agctgcgggg ccagttcccc gatgccgtct tttggcagga gattagcgag 300 atcttcagac agctgcagaa gcaggccgcc gagatctaca accagagcct gatcgagctc 360 tactacgaga tcttcatcaa gggcaagggc attgccaacg cctcctccgt ggagcactac 420 ctgagcgacg tgtgctacac aagagccgcc gagctcttta agaacgccgc tatcgcttcc 480 gggctgagga gcaagattaa gagtaacttc cggctcaagg agctgaagaa catgaagagc 540 ggcctgccca ctacaaagag cgacaacttc ccaattccac tggtgaagca gaaggggggc 600 cagtacacag ggttcgagat ttccaaccac aacagcgact ttattattaa gatccccttt 660 ggcaggtggc aggtcaagaa ggagattgac aagtacaggc cctgggagaa gtttgatttc 720 gagcaggtgc agaagagccc caagcctatt tccctgctgc tgtccacaca gcggcggaag 780 aggaacaagg ggtggtctaa ggatgagggg accgaggccg agattaagaa agtgatgaac 840 ggcgactacc agacaagcta catcgaggtc aagcggggca gtaagattgg cgagaagagc 900 gcctggatgc tgaacctgag cattgacgtg ccaaagattg ataagggcgt ggatcccagc 960 atcatcggag ggatcgatgt gggggtcaag agccccctcg tgtgcgccat caacaacgcc 1020 ttcagcaggt acagcatctc cgataacgac ctgttccact ttaacaagaa gatgttcgcc 1080 cggcggagga ttttgctcaa gaagaaccgg cacaagcggg ccggacacgg ggccaagaac 1140 aagctcaagc ccatcactat cctgaccgag aagagcgaga ggttcaggaa gaagctcatc 1200 gagagatggg cctgcgagat cgccgatttc tttattaaga acaaggtcgg aacagtgcag 1260 atggagaacc tcgagagcat gaagaggaag gaggattcct acttcaacat tcggctgagg 1320 gggttctggc cctacgctga gatgcagaac aagattgagt ttaagctgaa gcagtacggg 1380 attgagatcc ggaaggtggc ccccaacaac accagcaaga cctgcagcaa gtgcgggcac 1440 ctcaacaact acttcaactt cgagtaccgg aagaagaaca agttcccaca cttcaagtgc 1500 gagaagtgca actttaagga gaacgccgat tacaacgccg ccctgaacat cagcaaccct 1560 aagctgaaga gcactaagga ggagcc 1586 <210> 270 <211> 1607 <212> DNA <213> Artificial Sequence <220> <223> N-terminal NLS + Cas14a1 amino acid sequence <400> 270 atgccaaaga agaagcggaa agtcgccaag aacacaatta caaagacact gaagctgagg 60 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 120 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 180 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 240 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 300 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 360 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 420 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 480 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 540 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 600 ctggtgaagc agaagggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 660 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 720 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 780 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 840 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 900 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 960 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1020 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1080 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1140 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1200 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 1260 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 1320 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagattgag 1380 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 1440 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 1500 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 1560 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagcc 1607 <210> 271 <211> 1607 <212> DNA <213> Artificial Sequence <220> <223> C-terminal NLS + Cas14a1 amino acid sequence <400> 271 atggccaaga acacaattac aaagacactg aagctgagga tcgtgagacc atacaacagc 60 gctgaggtcg agaagattgt ggctgatgaa aagaacaaca gggaaaagat cgccctcgag 120 aagaacaagg ataaggtgaa ggaggcctgc tctaagcacc tgaaagtggc cgcctactgc 180 accacacagg tggagaggaa cgcctgtctg ttttgtaaag ctcggaagct ggatgataag 240 ttttaccaga agctgcgggg ccagttcccc gatgccgtct tttggcagga gattagcgag 300 atcttcagac agctgcagaa gcaggccgcc gagatctaca accagagcct gatcgagctc 360 tactacgaga tcttcatcaa gggcaagggc attgccaacg cctcctccgt ggagcactac 420 ctgagcgacg tgtgctacac aagagccgcc gagctcttta agaacgccgc tatcgcttcc 480 gggctgagga gcaagattaa gagtaacttc cggctcaagg agctgaagaa catgaagagc 540 ggcctgccca ctacaaagag cgacaacttc ccaattccac tggtgaagca gaaggggggc 600 cagtacacag ggttcgagat ttccaaccac aacagcgact ttattattaa gatccccttt 660 ggcaggtggc aggtcaagaa ggagattgac aagtacaggc cctgggagaa gtttgatttc 720 gagcaggtgc agaagagccc caagcctatt tccctgctgc tgtccacaca gcggcggaag 780 aggaacaagg ggtggtctaa ggatgagggg accgaggccg agattaagaa agtgatgaac 840 ggcgactacc agacaagcta catcgaggtc aagcggggca gtaagattgg cgagaagagc 900 gcctggatgc tgaacctgag cattgacgtg ccaaagattg ataagggcgt ggatcccagc 960 atcatcggag ggatcgatgt gggggtcaag agccccctcg tgtgcgccat caacaacgcc 1020 ttcagcaggt acagcatctc cgataacgac ctgttccact ttaacaagaa gatgttcgcc 1080 cggcggagga ttttgctcaa gaagaaccgg cacaagcggg ccggacacgg ggccaagaac 1140 aagctcaagc ccatcactat cctgaccgag aagagcgaga ggttcaggaa gaagctcatc 1200 gagagatggg cctgcgagat cgccgatttc tttattaaga acaaggtcgg aacagtgcag 1260 atggagaacc tcgagagcat gaagaggaag gaggattcct acttcaacat tcggctgagg 1320 gggttctggc cctacgctga gatgcagaac aagattgagt ttaagctgaa gcagtacggg 1380 attgagatcc ggaaggtggc ccccaacaac accagcaaga cctgcagcaa gtgcgggcac 1440 ctcaacaact acttcaactt cgagtaccgg aagaagaaca agttcccaca cttcaagtgc 1500 gagaagtgca actttaagga gaacgccgat tacaacgccg ccctgaacat cagcaaccct 1560 aagctgaaga gcactaagga ggagccccaa agaagaagcg gaaagtc 1607 <210> 272 <211> 1628 <212> DNA <213> Artificial Sequence <220> <223> N/C-terminal NLS + Cas14a1 amino acid sequence <400> 272 atgccaaaga agaagcggaa agtcgccaag aacacaatta caaagacact gaagctgagg 60 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 120 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 180 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 240 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 300 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 360 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 420 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 480 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 540 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 600 ctggtgaagc agaagggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 660 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 720 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 780 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 840 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 900 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 960 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1020 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1080 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1140 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1200 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 1260 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 1320 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagattgag 1380 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 1440 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 1500 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 1560 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagcccca aagaagaagc 1620 ggaaagtc 1628 <210> 273 <211> 3009 <212> DNA <213> Artificial Sequence <220> <223> Amino acid sequence (N terminal -Cytidine deaminase) <400> 273 atgccaaaga agaagcggaa agtctcctca gagactgggc ctgtcgccgt cgatccaacc 60 ctgcgccgcc ggattgaacc tcacgagttt gaagtgttct ttgacccccg ggagctgaga 120 aaggagacat gcctgctgta cgagatcaac tggggaggca ggcactccat ctggaggcac 180 acctctcaga acacaaataa gcacgtggag gtgaacttca tcgagaagtt taccacagag 240 cggtacttct gccccaatac cagatgtagc atcacatggt ttctgagctg gtccccttgc 300 ggagagtgta gcagggccat caccgagttc ctgtccagat atccacacgt gacactgttt 360 atctacatcg ccaggctgta tcaccacgca gacccaagga ataggcaggg cctgcgcgat 420 ctgatcagct ccggcgtgac catccagatc atgacagagc aggagtccgg ctactgctgg 480 cggaacttcg tgaattattc tcctagcaac gaggcccact ggcctaggta cccacacctg 540 tgggtgcgcc tgtacgtgct ggagctgtat tgcatcatcc tgggcctgcc cccttgtctg 600 aatatcctgc ggagaaagca gccccagctg accttcttta caatcgccct gcagtcttgt 660 cactatcaga ggctgccacc ccacatcctg tgggccacag gcctgaagtc tggaggatct 720 agcggaggat cctctggcag cgagacacca ggaacaagcg agtcagcaac accagagagc 780 agtggcggca gcagcggcgg cagcgccaag aacacaatta caaagacact gaagctgagg 840 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 900 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 960 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 1020 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 1080 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 1140 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 1200 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 1260 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 1320 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 1380 ctggtgaagc agaagggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 1440 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 1500 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 1560 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 1620 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 1680 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 1740 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1800 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1860 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1920 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1980 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 2040 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 2100 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagattgag 2160 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 2220 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 2280 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 2340 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagcccag cggcgggagc 2400 ggcgggagcg gggggagcac taatctgagc gacatcattg agaaggagac tgggaaacag 2460 ctggtcattc aggagtccat cctgatgctg cctgaggagg tggaggaagt gatcggcaac 2520 aagccagagt ctgacatcct ggtgcacacc gcctacgacg agtccacaga tgagaatgtg 2580 atgctgctga cctctgacgc ccccgagtat aagccttggg ccctggtcat ccaggattct 2640 aacggcgaga ataagatcaa gatgctgagc ggaggatccg gaggatctgg aggcagcacc 2700 aacctgtctg acatcatcga gaaggagaca ggcaagcagc tggtcatcca ggagagcatc 2760 ctgatgctgc ccgaagaagt cgaagaagtg atcggaaaca agcctgagag cgatatcctg 2820 gtccataccg cctacgacga gagtaccgac gaaaatgtga tgctgctgac atccgacgcc 2880 ccagagtata agccctgggc tctggtcatc caggattcca acggagagaa caaaatcaaa 2940 atgctgtctg gcggctcaaa aagaaccgcc gacggcagcg aattcgagcc caagaagaag 3000 aggaaagtc 3009 <210> 274 <211> 3009 <212> DNA <213> Artificial Sequence <220> <223> Amino acid sequence (C terminal -Cytidine deaminase) <400> 274 atgccaaaga agaagcggaa agtcgccaag aacacaatta caaagacact gaagctgagg 60 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 120 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 180 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 240 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 300 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 360 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 420 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 480 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 540 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 600 ctggtgaagc agaagggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 660 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 720 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 780 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 840 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 900 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 960 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1020 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1080 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1140 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1200 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 1260 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 1320 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagattgag 1380 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 1440 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 1500 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 1560 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagccctc tggaggatct 1620 agcggaggat cctctggcag cgagacacca ggaacaagcg agtcagcaac accagagagc 1680 agtggcggca gcagcggcgg cagctcctca gagactgggc ctgtcgccgt cgatccaacc 1740 ctgcgccgcc ggattgaacc tcacgagttt gaagtgttct ttgacccccg ggagctgaga 1800 aaggagacat gcctgctgta cgagatcaac tggggaggca ggcactccat ctggaggcac 1860 acctctcaga acacaaataa gcacgtggag gtgaacttca tcgagaagtt taccacagag 1920 cggtacttct gccccaatac cagatgtagc atcacatggt ttctgagctg gtccccttgc 1980 ggagagtgta gcagggccat caccgagttc ctgtccagat atccacacgt gacactgttt 2040 atctacatcg ccaggctgta tcaccacgca gacccaagga ataggcaggg cctgcgcgat 2100 ctgatcagct ccggcgtgac catccagatc atgacagagc aggagtccgg ctactgctgg 2160 cggaacttcg tgaattattc tcctagcaac gaggcccact ggcctaggta cccacacctg 2220 tgggtgcgcc tgtacgtgct ggagctgtat tgcatcatcc tgggcctgcc cccttgtctg 2280 aatatcctgc ggagaaagca gccccagctg accttcttta caatcgccct gcagtcttgt 2340 cactatcaga ggctgccacc ccacatcctg tgggccacag gcctgaagag cggcgggagc 2400 ggcgggagcg gggggagcac taatctgagc gacatcattg agaaggagac tgggaaacag 2460 ctggtcattc aggagtccat cctgatgctg cctgaggagg tggaggaagt gatcggcaac 2520 aagccagagt ctgacatcct ggtgcacacc gcctacgacg agtccacaga tgagaatgtg 2580 atgctgctga cctctgacgc ccccgagtat aagccttggg ccctggtcat ccaggattct 2640 aacggcgaga ataagatcaa gatgctgagc ggaggatccg gaggatctgg aggcagcacc 2700 aacctgtctg acatcatcga gaaggagaca ggcaagcagc tggtcatcca ggagagcatc 2760 ctgatgctgc ccgaagaagt cgaagaagtg atcggaaaca agcctgagag cgatatcctg 2820 gtccataccg cctacgacga gagtaccgac gaaaatgtga tgctgctgac atccgacgcc 2880 ccagagtata agccctgggc tctggtcatc caggattcca acggagagaa caaaatcaaa 2940 atgctgtctg gcggctcaaa aagaaccgcc gacggcagcg aattcgagcc caagaagaag 3000 aggaaagtc 3009 <210> 275 <211> 2823 <212> DNA <213> Artificial Sequence <220> <223> DNA sequence (N terminal -Adenine deaminase) <400> 275 atgtccgaag tcgagttttc ccatgagtac tggatgagac acgcattgac tctcgcaaag 60 agggcttggg atgaacgcga ggtgcccgtg ggggcagtac tcgtgcataa caatcgcgta 120 atcggcgaag gttggaatag gccgatcgga cgccacgacc ccactgcaca tgcggaaatc 180 atggcccttc gacagggagg gcttgtgatg cagaattatc gacttatcga tgcgacgctg 240 tacgtcacgc ttgaaccttg cgtaatgtgc gcgggagcta tgattcactc ccgcattgga 300 cgagttgtat tcggtgcccg cgacgccaag acgggtgccg caggttcact gatggacgtg 360 ctgcatcacc caggcatgaa ccaccgggta gaaatcacag aaggcatatt ggcggacgaa 420 tgtgcggcgc tgttgtccga cttttttcgc atgcggaggc aggagatcaa ggcccagaaa 480 aaagcacaat cctctactga ctctggtggt tcttctggtg gttctagcgg cagcgagact 540 cccgggacct cagagtccgc cacacccgaa agttctggtg gttcttctgg tggttcttcc 600 gaagtcgagt tttcccatga gtactggatg agacacgcat tgactctcgc aaagagggct 660 cgagatgaac gcgaggtgcc cgtgggggca gtactcgtgc tcaacaatcg cgtaatcggc 720 gaaggttgga atagggcaat cggactccac gaccccactg cacatgcgga aatcatggcc 780 cttcgacagg gagggcttgt gatgcagaat tatcgactta tcgatgcgac gctgtacgtc 840 acgtttgaac cttgcgtaat gtgcgcggga gctatgattc actcccgcat tggacgagtt 900 gtattcggtg ttcgcaacgc caagacgggt gccgcaggtt cactgatgga cgtgctgcat 960 tacccaggca tgaaccaccg ggtagaaatc acagaaggca tattggcgga cgaatgtgcg 1020 gcgctgttgt gttacttttt tcgcatgccc aggcaggtct ttaacgccca gaaaaaagca 1080 caatcctcta ctgactctgg tggttcttct ggtggttcta gcggcagcga gactcccggg 1140 acctcagagt ccgccacacc cgaaagttct ggtggttctt ctggtggttc tgccaagaac 1200 acaattacaa agacactgaa gctgaggatc gtgagaccat acaacagcgc tgaggtcgag 1260 aagattgtgg ctgatgaaaa gaacaacagg gaaaagatcg ccctcgagaa gaacaaggat 1320 aaggtgaagg aggcctgctc taagcacctg aaagtggccg cctactgcac cacacaggtg 1380 gagaggaacg cctgtctgtt ttgtaaagct cggaagctgg atgataagtt ttaccagaag 1440 ctgcggggcc agttccccga tgccgtcttt tggcaggaga ttagcgagat cttcagacag 1500 ctgcagaagc aggccgccga gatctacaac cagagcctga tcgagctcta ctacgagatc 1560 ttcatcaagg gcaagggcat tgccaacgcc tcctccgtgg agcactacct gagcgacgtg 1620 tgctacacaa gagccgccga gctctttaag aacgccgcta tcgcttccgg gctgaggagc 1680 aagattaaga gtaacttccg gctcaaggag ctgaagaaca tgaagagcgg cctgcccact 1740 acaaagagcg acaacttccc aattccactg gtgaagcaga aggggggcca gtacacaggg 1800 ttcgagattt ccaaccacaa cagcgacttt attattaaga tcccctttgg caggtggcag 1860 gtcaagaagg agattgacaa gtacaggccc tgggagaagt ttgatttcga gcaggtgcag 1920 aagagcccca agcctatttc cctgctgctg tccacacagc ggcggaagag gaacaagggg 1980 tggtctaagg atgaggggac cgaggccgag attaagaaag tgatgaacgg cgactaccag 2040 acaagctaca tcgaggtcaa gcggggcagt aagattggcg agaagagcgc ctggatgctg 2100 aacctgagca ttgacgtgcc aaagattgat aagggcgtgg atcccagcat catcggaggg 2160 atcgatgtgg gggtcaagag ccccctcgtg tgcgccatca acaacgcctt cagcaggtac 2220 agcatctccg ataacgacct gttccacttt aacaagaaga tgttcgcccg gcggaggatt 2280 ttgctcaaga agaaccggca caagcgggcc ggacacgggg ccaagaacaa gctcaagccc 2340 atcactatcc tgaccgagaa gagcgagagg ttcaggaaga agctcatcga gagatgggcc 2400 tgcgagatcg ccgatttctt tattaagaac aaggtcggaa cagtgcagat ggagaacctc 2460 gagagcatga agaggaagga ggattcctac ttcaacattc ggctgagggg gttctggccc 2520 tacgctgaga tgcagaacaa gattgagttt aagctgaagc agtacgggat tgagatccgg 2580 aaggtggccc ccaacaacac cagcaagacc tgcagcaagt gcgggcacct caacaactac 2640 ttcaacttcg agtaccggaa gaagaacaag ttcccacact tcaagtgcga gaagtgcaac 2700 tttaaggaga acgccgatta caacgccgcc ctgaacatca gcaaccctaa gctgaagagc 2760 actaaggagg agcccaaaag gccggcggcc acgaaaaagg ccggccaggc aaaaaagaaa 2820 aag 2823 <210> 276 <211> 2823 <212> DNA <213> Artificial Sequence <220> <223> DNA sequence (C terminal -Adenine deaminase) <400> 276 atggccaaga acacaattac aaagacactg aagctgagga tcgtgagacc atacaacagc 60 gctgaggtcg agaagattgt ggctgatgaa aagaacaaca gggaaaagat cgccctcgag 120 aagaacaagg ataaggtgaa ggaggcctgc tctaagcacc tgaaagtggc cgcctactgc 180 accacacagg tggagaggaa cgcctgtctg ttttgtaaag ctcggaagct ggatgataag 240 ttttaccaga agctgcgggg ccagttcccc gatgccgtct tttggcagga gattagcgag 300 atcttcagac agctgcagaa gcaggccgcc gagatctaca accagagcct gatcgagctc 360 tactacgaga tcttcatcaa gggcaagggc attgccaacg cctcctccgt ggagcactac 420 ctgagcgacg tgtgctacac aagagccgcc gagctcttta agaacgccgc tatcgcttcc 480 gggctgagga gcaagattaa gagtaacttc cggctcaagg agctgaagaa catgaagagc 540 ggcctgccca ctacaaagag cgacaacttc ccaattccac tggtgaagca gaaggggggc 600 cagtacacag ggttcgagat ttccaaccac aacagcgact ttattattaa gatccccttt 660 ggcaggtggc aggtcaagaa ggagattgac aagtacaggc cctgggagaa gtttgatttc 720 gagcaggtgc agaagagccc caagcctatt tccctgctgc tgtccacaca gcggcggaag 780 aggaacaagg ggtggtctaa ggatgagggg accgaggccg agattaagaa agtgatgaac 840 ggcgactacc agacaagcta catcgaggtc aagcggggca gtaagattgg cgagaagagc 900 gcctggatgc tgaacctgag cattgacgtg ccaaagattg ataagggcgt ggatcccagc 960 atcatcggag ggatcgatgt gggggtcaag agccccctcg tgtgcgccat caacaacgcc 1020 ttcagcaggt acagcatctc cgataacgac ctgttccact ttaacaagaa gatgttcgcc 1080 cggcggagga ttttgctcaa gaagaaccgg cacaagcggg ccggacacgg ggccaagaac 1140 aagctcaagc ccatcactat cctgaccgag aagagcgaga ggttcaggaa gaagctcatc 1200 gagagatggg cctgcgagat cgccgatttc tttattaaga acaaggtcgg aacagtgcag 1260 atggagaacc tcgagagcat gaagaggaag gaggattcct acttcaacat tcggctgagg 1320 gggttctggc cctacgctga gatgcagaac aagattgagt ttaagctgaa gcagtacggg 1380 attgagatcc ggaaggtggc ccccaacaac accagcaaga cctgcagcaa gtgcgggcac 1440 ctcaacaact acttcaactt cgagtaccgg aagaagaaca agttcccaca cttcaagtgc 1500 gagaagtgca actttaagga gaacgccgat tacaacgccg ccctgaacat cagcaaccct 1560 aagctgaaga gcactaagga ggagccctct ggaggatcta gcggaggatc ctctggcagc 1620 gagacaccag gaacaagcga gtcagcaaca ccagagagca gtggcggcag cagcggcggc 1680 agctccgaag tcgagttttc ccatgagtac tggatgagac acgcattgac tctcgcaaag 1740 agggcttggg atgaacgcga ggtgcccgtg ggggcagtac tcgtgcataa caatcgcgta 1800 atcggcgaag gttggaatag gccgatcgga cgccacgacc ccactgcaca tgcggaaatc 1860 atggcccttc gacagggagg gcttgtgatg cagaattatc gacttatcga tgcgacgctg 1920 tacgtcacgc ttgaaccttg cgtaatgtgc gcgggagcta tgattcactc ccgcattgga 1980 cgagttgtat tcggtgcccg cgacgccaag acgggtgccg caggttcact gatggacgtg 2040 ctgcatcacc caggcatgaa ccaccgggta gaaatcacag aaggcatatt ggcggacgaa 2100 tgtgcggcgc tgttgtccga cttttttcgc atgcggaggc aggagatcaa ggcccagaaa 2160 aaagcacaat cctctactga ctctggtggt tcttctggtg gttctagcgg cagcgagact 2220 cccgggacct cagagtccgc cacacccgaa agttcaggtg gatcttcagg tggatcttcg 2280 gaagtggaat tttcgcacga gtattggatg aggcacgctt taactctcgc taagagagca 2340 cgagacgaac gggaagtgcc ggttggggct gtcctcgtac tcaataatcg agttatcgga 2400 gaaggctgga acagggcaat cggactccac gatcccacag ctcatgccga gataatggcg 2460 cttcgacaag gaggcctagt catgcaaaat tatcgtctta ttgacgcgac cctctacgtg 2520 acctttgagc catgcgttat gtgtgcgggt gcaatgatac attcccggat aggacgtgta 2580 gtatttggag ttcgcaacgc gaagaccggt gcggctggtt ctctcatgga tgtcctgcac 2640 taccctggga tgaatcaccg cgttgaaatc actgaaggca ttttggccga tgaatgcgcg 2700 gccctgttat gttacttttt tcgcatgccc aggcaggtct ttaacgcaca gaagaaagcc 2760 caatcgtcca ctgataaaag gccggcggcc acgaaaaagg ccggccaggc aaaaaagaaa 2820 aag 2823 <210> 277 <211> 7 <212> PRT <213> simian virus 40 <400> 277 Pro Lys Lys Lys Arg Lys Val 1 5 <210> 278 <211> 16 <212> PRT <213> mus musculus <400> 278 Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 1 5 10 15 <210> 279 <211> 9 <212> PRT <213> homo sapiens <400> 279 Pro Ala Ala Lys Arg Val Lys Leu Asp 1 5 <210> 280 <211> 11 <212> PRT <213> homo sapiens <400> 280 Arg Gln Arg Arg Asn Glu Leu Lys Arg Ser Pro 1 5 10 <210> 281 <211> 38 <212> PRT <213> mus musculus <400> 281 Asn Gln Ser Ser Asn Phe Gly Pro Met Lys Gly Gly Asn Phe Gly Gly 1 5 10 15 Arg Ser Ser Gly Pro Tyr Gly Gly Gly Gly Gln Tyr Phe Ala Lys Pro 20 25 30 Arg Asn Gln Gly Gly Tyr 35 <210> 282 <211> 42 <212> PRT <213> Homo sapiens <400> 282 Arg Met Arg Ile Glx Phe Lys Asn Lys Gly Lys Asp Thr Ala Glu Leu 1 5 10 15 Arg Arg Arg Arg Val Glu Val Ser Val Glu Leu Arg Lys Ala Lys Lys 20 25 30 Asp Glu Gln Ile Leu Lys Arg Arg Asn Val 35 40 <210> 283 <211> 8 <212> PRT <213> Homo sapiens <400> 283 Val Ser Arg Lys Arg Pro Arg Pro 1 5 <210> 284 <211> 8 <212> PRT <213> homo sapiens <400> 284 Pro Pro Lys Lys Ala Arg Glu Asp 1 5 <210> 285 <211> 8 <212> PRT <213> homo sapiens <400> 285 Pro Gln Pro Lys Lys Lys Pro Leu 1 5 <210> 286 <211> 12 <212> PRT <213> mus musculus <400> 286 Ser Ala Leu Ile Lys Lys Lys Lys Lys Met Ala Pro 1 5 10 <210> 287 <211> 5 <212> PRT <213> Influenza virus <400> 287 Asp Arg Leu Arg Arg 1 5 <210> 288 <211> 7 <212> PRT <213> influenza virus <400> 288 Pro Lys Gln Lys Lys Arg Lys 1 5 <210> 289 <211> 10 <212> PRT <213> Hepatitis D virus <400> 289 Arg Lys Leu Lys Lys Lys Ile Lys Lys Leu 1 5 10 <210> 290 <211> 10 <212> PRT <213> mus musculus <400> 290 Arg Glu Lys Lys Lys Phe Leu Lys Arg Arg 1 5 10 <210> 291 <211> 20 <212> PRT <213> homo sapiens <400> 291 Lys Arg Lys Gly Asp Glu Val Asp Gly Val Asp Glu Val Ala Lys Lys 1 5 10 15 Lys Ser Lys Lys 20 <210> 292 <211> 17 <212> PRT <213> Homo sapiens <400> 292 Arg Lys Cys Leu Gln Ala Gly Met Asn Leu Glu Ala Arg Lys Thr Lys 1 5 10 15 Lys <210> 293 <211> 29 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 293 aacaaauuca uuuugaaacg aaugaagga 29 <210> 294 <211> 31 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 294 aacaaauuca uuuuugaaaa cgaaugaagg a 31 <210> 295 <211> 33 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 295 aacaaauuca uuuuucgaaa gacgaaugaa gga 33 <210> 296 <211> 35 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 296 aacaaauuca uuuuuccgaa aagacgaaug aagga 35 <210> 297 <211> 37 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 297 aacaaauuca uuuuuccuga aauagacgaa ugaagga 37 <210> 298 <211> 39 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 298 aacaaauuca uuuuuccucg aaaauagacg aaugaagga 39 <210> 299 <211> 41 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 299 aacaaauuca uuuuuccucu gaaaaauaga cgaaugaagg a 41 <210> 300 <211> 43 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 300 aacaaauuca uuuuuccucu cgaaagaaua gacgaaugaa gga 43 <210> 301 <211> 45 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 301 aacaaauuca uuuuuccucu ccgaaacgaa uagacgaaug aagga 45 <210> 302 <211> 47 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 302 aacaaauuca uuuuuccucu ccagaaaccg aauagacgaa ugaagga 47 <210> 303 <211> 49 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 303 aacaaauuca uuuuuccucu ccaagaaacc cgaauagacg aaugaagga 49 <210> 304 <211> 51 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 304 aacaaauuca uuuuuccucu ccaaugaaaa cccgaauaga cgaaugaagg a 51 <210> 305 <211> 53 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 305 aacaaauuca uuuuuccucu ccaauugaaa aacccgaaua gacgaaugaa gga 53 <210> 306 <211> 55 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 306 aacaaauuca uuuuuccucu ccaauucgaa agaacccgaa uagacgaaug aagga 55 <210> 307 <211> 57 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 307 aacaaauuca uuuuuccucu ccaauucuga aaagaacccg aauagacgaa ugaagga 57 <210> 308 <211> 59 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 308 aacaaauuca uuuuuccucu ccaauucugg aaacagaacc cgaauagacg aaugaagga 59 <210> 309 <211> 61 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 309 aacaaauuca uuuuuccucu ccaauucugc gaaagcagaa cccgaauaga cgaaugaagg 60 a 61 <210> 310 <211> 63 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 310 aacaaauuca uuuuuccucu ccaauucugc agaaaugcag aacccgaaua gacgaaugaa 60 gga 63 <210> 311 <211> 65 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 311 aacaaauuca uuuuuccucu ccaauucugc acgaaauugc agaacccgaa uagacgaaug 60 aagga 65 <210> 312 <211> 67 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 312 aacaaauuca uuuuuccucu ccaauucugc acagaaaguu gcagaacccg aauagacgaa 60 ugaagga 67 <210> 313 <211> 66 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 313 aacaaauuca uuuuuccucu ccaauucugc acagaaauug cagaacccga auagacgaau 60 gaagga 66 <210> 314 <211> 68 <212> RNA <213> Artificial Sequence <220> <223> (4th region + Linker + 5th region) of WT gRNA <400> 314 aacaaauuca uuuuuccucu ccaauucugc acaagaaagu ugcagaaccc gaauagacga 60 augaagga 68 <210> 315 <211> 202 <212> RNA <213> Artificial Sequence <220> <223> wild-type of Cas12f1 tracrRNA + GAAA + wild-type of Cas12f1 crRNA repeat <400> 315 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu uuccucucca auucugcaca agaaaguugc agaacccgaa 180 uagacgaaug aaggaaugca ac 202 <210> 316 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 316 caagaaaguu 10 <210> 317 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 317 acaagaaagu ug 12 <210> 318 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 318 cacaagaaag uugc 14 <210> 319 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 319 gcacaagaaa guugca 16 <210> 320 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 320 ugcacaagaa aguugcag 18 <210> 321 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 321 cugcacaaga aaguugcaga 20 <210> 322 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 322 ucugcacaag aaaguugcag aa 22 <210> 323 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 323 uucugcacaa gaaaguugca gaac 24 <210> 324 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 324 auucugcaca agaaaguugc agaacc 26 <210> 325 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 325 aauucugcac aagaaaguug cagaaccc 28 <210> 326 <211> 30 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 326 caauucugca caagaaaguu gcagaacccg 30 <210> 327 <211> 32 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 327 ccaauucugc acaagaaagu ugcagaaccc ga 32 <210> 328 <211> 34 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 328 uccaauucug cacaagaaag uugcagaacc cgaa 34 <210> 329 <211> 36 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 329 cuccaauucu gcacaagaaa guugcagaac ccgaau 36 <210> 330 <211> 38 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 330 ucuccaauuc ugcacaagaa aguugcagaa cccgaaua 38 <210> 331 <211> 40 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 331 cucuccaauu cugcacaaga aaguugcaga acccgaauag 40 <210> 332 <211> 42 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 332 ccucuccaau ucugcacaag aaaguugcag aacccgaaua ga 42 <210> 333 <211> 44 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 333 uccucuccaa uucugcacaa gaaaguugca gaacccgaau agac 44 <210> 334 <211> 46 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 334 uuccucucca auucugcaca agaaaguugc agaacccgaa uagacg 46 <210> 335 <211> 48 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 335 uuuccucucc aauucugcac aagaaaguug cagaacccga auagacga 48 <210> 336 <211> 50 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 336 uuuuccucuc caauucugca caagaaaguu gcagaacccg aauagacgaa 50 <210> 337 <211> 52 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 337 uuuuuccucu ccaauucugc acaagaaagu ugcagaaccc gaauagacga au 52 <210> 338 <211> 54 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 338 auuuuuccuc uccaauucug cacaagaaag uugcagaacc cgaauagacg aaug 54 <210> 339 <211> 56 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 339 cauuuuuccu cuccaauucu gcacaagaaa guugcagaac ccgaauagac gaauga 56 <210> 340 <211> 58 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 340 ucauuuuucc ucuccaauuc ugcacaagaa aguugcagaa cccgaauaga cgaaugaa 58 <210> 341 <211> 57 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 341 ucauuuuucc ucuccaauuc ugcacaagaa aguugcagaa cccgaauaga cgaauga 57 <210> 342 <211> 59 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 342 uucauuuuuc cucuccaauu cugcacaaga aaguugcaga acccgaauag acgaaugaa 59 <210> 343 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 343 uucauuuuuc 10 <210> 344 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 344 uucauuuuuc c 11 <210> 345 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 345 uucauuuuuc cu 12 <210> 346 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 346 uucauuuuuc cuc 13 <210> 347 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 347 uucauuuuuc cucu 14 <210> 348 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 348 uucauuuuuc cucuc 15 <210> 349 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 349 uucauuuuuc cucucc 16 <210> 350 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 350 uucauuuuuc cucucca 17 <210> 351 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 351 uucauuuuuc cucuccaa 18 <210> 352 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 352 uucauuuuuc cucuccaau 19 <210> 353 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 353 uucauuuuuc cucuccaauu 20 <210> 354 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 354 uucauuuuuc cucuccaauu c 21 <210> 355 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 355 uucauuuuuc cucuccaauu cu 22 <210> 356 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 356 uucauuuuuc cucuccaauu cug 23 <210> 357 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 357 uucauuuuuc cucuccaauu cugc 24 <210> 358 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 358 uucauuuuuc cucuccaauu cugca 25 <210> 359 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 359 uucauuuuuc cucuccaauu cugcac 26 <210> 360 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 360 uucauuuuuc cucuccaauu cugcaca 27 <210> 361 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 361 uucauuuuuc cucuccaauu cugcacaa 28 <210> 362 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 362 gacgaaugaa 10 <210> 363 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 363 agacgaauga a 11 <210> 364 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 364 uagacgaaug aa 12 <210> 365 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 365 auagacgaau gaa 13 <210> 366 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 366 aauagacgaa ugaa 14 <210> 367 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 367 gaauagacga augaa 15 <210> 368 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 368 cgaauagacg aaugaa 16 <210> 369 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 369 ccgaauagac gaaugaa 17 <210> 370 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 370 cccgaauaga cgaaugaa 18 <210> 371 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 371 acccgaauag acgaaugaa 19 <210> 372 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 372 aacccgaaua gacgaaugaa 20 <210> 373 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 373 gaacccgaau agacgaauga a 21 <210> 374 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 374 agaacccgaa uagacgaaug aa 22 <210> 375 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 375 cagaacccga auagacgaau gaa 23 <210> 376 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 376 gcagaacccg aauagacgaa ugaa 24 <210> 377 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 377 ugcagaaccc gaauagacga augaa 25 <210> 378 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 378 uugcagaacc cgaauagacg aaugaa 26 <210> 379 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 379 guugcagaac ccgaauagac gaaugaa 27 <210> 380 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> Comparative Example 1.1 <400> 380 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa ccacacacac agugggcuac 180 c 181 <210> 381 <211> 174 <212> RNA <213> Artificial Sequence <220> <223> Example 1.1 <400> 381 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caaccacaca cacagugggc uacc 174 <210> 382 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 1.2 <400> 382 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaaccac acacacagug ggcuacc 167 <210> 383 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> Example 1.3 <400> 383 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa ccacacacac agugggcuac c 161 <210> 384 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> Example 1.4 <400> 384 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaaccacac acacaguggg cuacc 175 <210> 385 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> Example 1.5 <400> 385 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac cacacacaca gugggcuacc 170 <210> 386 <211> 158 <212> RNA <213> Artificial Sequence <220> <223> Example 1.6 <400> 386 cuucacugau aaaguggaga accgcuucac cauuagugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa 120 agaaugaagg aaugcaacca cacacacagu gggcuacc 158 <210> 387 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> Example 1.7 <400> 387 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaaccac acacacagug ggcuacc 177 <210> 388 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> Example 1.8 <400> 388 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aaccacacac acagugggcu acc 173 <210> 389 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 1.9 <400> 389 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaaccac acacacagug ggcuacc 167 <210> 390 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> Example 1.10 <400> 390 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac 120 cacacacaca gugggcuacc 140 <210> 391 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> Example 1.11 <400> 391 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaaccac acacacagug ggcuacc 147 <210> 392 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> Example 1.12 <400> 392 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa 120 ugcaaccaca cacacagugg gcuacc 146 <210> 393 <211> 126 <212> RNA <213> Artificial Sequence <220> <223> Example 1.13 <400> 393 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa ugcaaccaca cacacagugg 120 gcuacc 126 <210> 394 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> Comparative Example 2.1 <400> 394 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa ccauccccag gacacacaca 180 c 181 <210> 395 <211> 174 <212> RNA <213> Artificial Sequence <220> <223> Example 2.1 <400> 395 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caaccauccc caggacacac acac 174 <210> 396 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 2.2 <400> 396 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaaccau ccccaggaca cacacac 167 <210> 397 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> Example 2.3 <400> 397 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa ccauccccag gacacacaca c 161 <210> 398 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> Example 2.4 <400> 398 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaaccaucc ccaggacaca cacac 175 <210> 399 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> Example 2.5 <400> 399 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac cauccccagg acacacacac 170 <210> 400 <211> 160 <212> RNA <213> Artificial Sequence <220> <223> Example 2.6 <400> 400 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug 120 aaagaaugaa ggaaugcaac cauccccagg acacacacac 160 <210> 401 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> Example 2.7 <400> 401 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaaccau ccccaggaca cacacac 177 <210> 402 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> Example 2.8 <400> 402 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aaccaucccc aggacacaca cac 173 <210> 403 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 2.9 <400> 403 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaaccau ccccaggaca cacacac 167 <210> 404 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> Example 2.10 <400> 404 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac 120 cauccccagg acacacacac 140 <210> 405 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> Example 2.11 <400> 405 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaaccau ccccaggaca cacacac 147 <210> 406 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> Example 2.12 <400> 406 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa 120 ugcaaccauc cccaggacac acacac 146 <210> 407 <211> 126 <212> RNA <213> Artificial Sequence <220> <223> Example 2.13 <400> 407 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa ugcaaccauc cccaggacac 120 acacac 126 <210> 408 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> Comparative Example 3.1 <400> 408 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa cagaacacau accccugggc 180 c 181 <210> 409 <211> 174 <212> RNA <213> Artificial Sequence <220> <223> Example 3.1 <400> 409 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caacagaaca cauaccccug ggcc 174 <210> 410 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 3.2 <400> 410 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaacaga acacauaccc cugggcc 167 <210> 411 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> Example 3.3 <400> 411 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa cagaacacau accccugggc c 161 <210> 412 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> Example 3.4 <400> 412 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaacagaac acauaccccu gggcc 175 <210> 413 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> Example 3.5 <400> 413 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac agaacacaua ccccugggcc 170 <210> 414 <211> 160 <212> RNA <213> Artificial Sequence <220> <223> Example 3.6 <400> 414 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug 120 aaagaaugaa ggaaugcaac agaacacaua ccccugggcc 160 <210> 415 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> Example 3.7 <400> 415 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaacaga acacauaccc cugggcc 177 <210> 416 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> Example 3.8 <400> 416 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aacagaacac auaccccugg gcc 173 <210> 417 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 3.9 <400> 417 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaacaga acacauaccc cugggcc 167 <210> 418 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> Example 3.10 <400> 418 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac 120 agaacacaua ccccugggcc 140 <210> 419 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> Example 3.11 <400> 419 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaacaga acacauaccc cugggcc 147 <210> 420 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> Example 3.12 <400> 420 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa 120 ugcaacagaa cacauacccc ugggcc 146 <210> 421 <211> 126 <212> RNA <213> Artificial Sequence <220> <223> Example 3.13 <400> 421 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa ugcaacagaa cacauacccc 120 ugggcc 126 <210> 422 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> protospacer sequence for targeting DY2 <400> 422 cacacacaca gtgggctacc 20 <210> 423 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> protospacer sequence for targeting DY10 <400> 423 catccccagg acacacacac 20 <210> 424 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> protospacer sequence for targeting Intergenic-22 <400> 424 agaacacata cccctgggcc 20 <210> 425 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 425 uucgaaagaa 10 <210> 426 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 426 uucagaaaug aa 12 <210> 427 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 427 uucaugaaaa ugaa 14 <210> 428 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 428 uucauugaaa aaugaa 16 <210> 429 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 429 uucauuugaa agaaugaa 18 <210> 430 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> Second region, 13bp deletion <400> 430 ccgcuucacu uagagugaag gugg 24 <210> 431 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> Second region, 12bp deletion <400> 431 ccgcuucacc uuaggaguga aggugg 26 <210> 432 <211> 9 <212> RNA <213> Artificial Sequence <220> <223> second region, 5' remains <400> 432 ccgcuucac 9 <210> 433 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> second region, 3' remains <400> 433 agugaaggug g 11 <210> 434 <211> 29 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region <400> 434 aaaagcuguc ccuuagggga uuagaacuu 29 <210> 435 <211> 31 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region <400> 435 caaaagcugu cccuuagggg auuagaacuu g 31 <210> 436 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> Example 1.14 <400> 436 accgcuucac uuagagugaa ggugggcugc uugcaucagc cuaaugucga gaagugcuuu 60 cuucggaaag uaacccucga aacaaagaaa ggaaugcaac cacacacaca gugggcuacc 120 120 <210> 437 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> Example 2.14 <400> 437 accgcuucac uuagagugaa ggugggcugc uugcaucagc cuaaugucga gaagugcuuu 60 cuucggaaag uaacccucga aacaaagaaa ggaaugcaac cauccccagg acacacacac 120 120 <210> 438 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> Example 3.14 <400> 438 accgcuucac uuagagugaa ggugggcugc uugcaucagc cuaaugucga gaagugcuuu 60 cuucggaaag uaacccucga aacaaagaaa ggaaugcaac agaacacaua ccccugggcc 120 120 <210> 439 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> engineered sgRNA <400> 439 accgcuucac uuagagugaa ggugggcugc uugcaucagc cuaaugucga gaagugcuuu 60 cuucggaaag uaacccucga aacaaagaaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn 120 120 <210> 440 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> Upper deleted part of second region <400> 440 aaaagcuguc cc 12 <210> 441 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> Lower deleted part of second region <400> 441 caaaagcugu ccc 13 <110> Genkore <120> An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use it <130> CP21-068 <160> 441 <170> KoPatentIn 3.0 <210> 1 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 tracrRNA <400> 1 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu 140 <210> 2 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> wild-type of Cas12f1 tracrRNA <400> 2 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu uuccuucca auucugcaca a 161 <210> 3 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 crRNA repeat <400> 3 gaaugaagga augcaac 17 <210> 4 <211> 37 <212> RNA <213> Artificial Sequence <220> <223> wild-type of Cas12f1 crRNA repeat <400> 4 guugcagaac ccgaauagac gaaugaagga augcaac 37 <210> 5 <211> 37 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 crRNA <400> 5 gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 37 <210> 6 <211> 57 <212> RNA <213> Artificial Sequence <220> <223> wild-type of Cas12f1 crRNA <400> 6 guugcagaac ccgaauagac gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 57 <210> 7 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 tracrRNA + GAAA + mature form of Cas12f1 crRNA repeat <400> 7 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa c 161 <210> 8 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> mature form of Cas12f1 tracrRNA + GAAA + mature form of Cas12f1 crRNA <400> 8 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa cnnnnnnnnn nnnnnnnnnn 180 n 181 <210> 9 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> 1st region of Cas12f1 guide RNA <400> 9 cuucacugau aaaguggaga a 21 <210> 10 <211> 50 <212> RNA <213> Artificial Sequence <220> <223> 2nd region of Cas12f1 guide RNA <400> 10 ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug agugaaggug 50 <210> 11 <211> 56 <212> RNA <213> Artificial Sequence <220> <223> 3rd region of Cas12f1 guide RNA <400> 11 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucga 56 <210> 12 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> 4th region of Cas12f1 guide RNA (Mature form) <400> 12 aacaaauuca uuu 13 <210> 13 <211> 34 <212> RNA <213> Artificial Sequence <220> <223> 4th region of Cas12f1 guide RNA (Wild-type) <400> 13 aacaaauuca uuuuuccucu ccaauucugc acaa 34 <210> 14 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> 5th region of Cas12f1 guide RNA (Mature form) <400> 14 gaaugaagga 10 <210> 15 <211> 30 <212> RNA <213> Artificial Sequence <220> <223> 5th region of Cas12f1 guide RNA (Wild-type) <400> 15 guugcagaac ccgaauagac gaaugaagga 30 <210> 16 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> first region, 11nt deletion <400> 16 aaguggagaa 10 <210> 17 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> first region, 10nt deletion <400> 17 aaaguggaga a 11 <210> 18 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> first region, 9nt deletion <400> 18 uaaaguggag aa 12 <210> 19 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> first region, 8nt deletion <400> 19 auaaagugga gaa 13 <210> 20 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> first region, 7nt deletion <400> 20 gauaaagugg agaa 14 <210> 21 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> first region, 6nt deletion <400> 21 ugauaaagug gagaa 15 <210> 22 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> first region, 5nt deletion <400> 22 cugauaaagu ggagaa 16 <210> 23 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> first region, 4nt deletion <400> 23 acugauaaag uggagaa 17 <210> 24 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> first region, 3nt deletion <400> 24 cacugauaaa guggagaa 18 <210> 25 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> first region, 2nt deletion <400> 25 ucacugauaa aguggagaa 19 <210> 26 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> first region, 1nt deletion <400> 26 uucacugaua aaguggagaa 20 <210> 27 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 10nt <400> 27 aaaguggaga 10 <210> 28 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 11nt <400> 28 uaaaguggag a 11 <210> 29 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 12nt <400> 29 auaaagugga ga 12 <210> 30 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 13nt <400> 30 gauaaagugg aga 13 <210> 31 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 14nt <400> 31 ugauaaagug gaga 14 <210> 32 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 15nt <400> 32 cugauaaagu ggaga 15 <210> 33 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 16nt <400> 33 acugauaaag uggaga 16 <210> 34 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 17nt <400> 34 cacugauaaa guggaga 17 <210> 35 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 18nt <400> 35 ucacugauaa aguggaga 18 <210> 36 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 19nt <400> 36 uucacugaua aaguggaga 19 <210> 37 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> deleted part of the first region, 20nt <400> 37 cuucacugau aaaguggaga 20 <210> 38 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> second region, 11bp deletion <400> 38 ccgcuucacc auuagugagu gaaggugg 28 <210> 39 <211> 30 <212> RNA <213> Artificial Sequence <220> <223> second region, 10bp deletion <400> 39 ccgcuucacc aauuaguuga gugaaggugg 30 <210> 40 <211> 32 <212> RNA <213> Artificial Sequence <220> <223> second region, 9bp deletion <400> 40 ccgcuucacc aaauuagcuu gagugaaggu gg 32 <210> 41 <211> 34 <212> RNA <213> Artificial Sequence <220> <223> second region, 8bp deletion <400> 41 ccgcuucacc aaaauuagac uugagugaag gugg 34 <210> 42 <211> 36 <212> RNA <213> Artificial Sequence <220> <223> second region, 7bp deletion <400> 42 ccgcuucacc aaaaguuaga acuugaguga aggugg 36 <210> 43 <211> 38 <212> RNA <213> Artificial Sequence <220> <223> second region, 6bp deletion <400> 43 ccgcuucacc aaaagcuuag gaacuugagu gaaggugg 38 <210> 44 <211> 40 <212> RNA <213> Artificial Sequence <220> <223> second region, 5bp deletion <400> 44 ccgcuucacc aaaagcuuua gagaacuuga gugaaggugg 40 <210> 45 <211> 43 <212> RNA <213> Artificial Sequence <220> <223> second region, 4bp deletion <400> 45 ccgcuucacc aaaagcuguu aguuagaacu ugagugaagg ugg 43 <210> 46 <211> 42 <212> RNA <213> Artificial Sequence <220> <223> second region, 4bp+1nt deletion <400> 46 ccgcuucacc aaaagcuguu aguagaacuu gagugaaggu gg 42 <210> 47 <211> 45 <212> RNA <213> Artificial Sequence <220> <223> second region, 3bp deletion <400> 47 ccgcuucacc aaaagcuguu uagauuagaa cuugagugaa ggugg 45 <210> 48 <211> 47 <212> RNA <213> Artificial Sequence <220> <223> second region, 2bp deletion <400> 48 ccgcuucacc aaaagcuguc uuaggauuag aacuugagug aaggugg 47 <210> 49 <211> 49 <212> RNA <213> Artificial Sequence <220> <223> second region, 1bp deletion <400> 49 ccgcuucacc aaaagcuguc cuuagggauu agaacuugag ugaaggugg 49 <210> 50 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> second region, 5' remains <400> 50 ccgcuucacc a 11 <210> 51 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> second region, 3' remains <400> 51 ugagugaagg ugg 13 <210> 52 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> upper-deleted part of the second region, 10nt <400> 52 aaagcugucc 10 <210> 53 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> upper-deleted part of the second region, 11nt <400> 53 aaagcugucc c 11 <210> 54 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> lower-deleted part of the second region, 10nt <400> 54 gauuagaacu 10 <210> 55 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> lower-deleted part of the second region, 11nt <400> 55 ggauuagaac u 11 <210> 56 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> lower-deleted part of the second region, 12nt <400> 56 gggauuagaa cu 12 <210> 57 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 10nt <400> 57 aaauuagacu 10 <210> 58 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 11nt <400> 58 aaaguuagaa cu 12 <210> 59 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 12nt <400> 59 aaagcuuagg aacu 14 <210> 60 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 13nt <400> 60 aaagcuuuag agaacu 16 <210> 61 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 14nt <400> 61 aaagcuguua guagaacu 18 <210> 62 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 15nt <400> 62 aaagcuguua guuagaacu 19 <210> 63 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 16nt <400> 63 aaagcuguuu agauuagaac u 21 <210> 64 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 17nt <400> 64 aaagcugucu uaggauuaga acu 23 <210> 65 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region, 18nt <400> 65 aaagcugucc uuagggauua gaacu 25 <210> 66 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (mature form), delete 3nt <400> 66 aacaaauuca 10 <210> 67 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (mature form), delete 4nt <400> 67 aacaaauuca u 11 <210> 68 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (mature form), delete 5nt <400> 68 aacaaauuca uu 12 <210> 69 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 21nt deletion <400> 69 aacaaauuca uuu 13 <210> 70 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 20nt deletion <400> 70 aacaaauuca uuuu 14 <210> 71 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 19nt deletion <400> 71 aacaaauuca uuuuu 15 <210> 72 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 18nt deletion <400> 72 aacaaauuca uuuuuc 16 <210> 73 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 17nt deletion <400> 73 aacaaauuca uuuuucc 17 <210> 74 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 16nt deletion <400> 74 aacaaauuca uuuuuccu 18 <210> 75 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 15nt deletion <400> 75 aacaaauuca uuuuuccuc 19 <210> 76 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 14nt deletion <400> 76 aacaaauuca uuuuuccucu 20 <210> 77 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 13nt deletion <400> 77 aacaaauuca uuuuuccucu c 21 <210> 78 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 12nt deletion <400> 78 aacaaauuca uuuuuccucu cc 22 <210> 79 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 11nt deletion <400> 79 aacaaauuca uuuuuccucu cca 23 <210> 80 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 10nt deletion <400> 80 aacaaauuca uuuuuccucu ccaa 24 <210> 81 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 9nt deletion <400> 81 aacaaauuca uuuuuccucu ccaau 25 <210> 82 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 8nt deletion <400> 82 aacaaauuca uuuuuccucu ccaauu 26 <210> 83 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 7nt deletion <400> 83 aacaaauuca uuuuuccucu ccaauuc 27 <210> 84 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 6nt deletion <400> 84 aacaaauuca uuuuuccucu ccaauucu 28 <210> 85 <211> 29 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 5nt deletion <400> 85 aacaaauuca uuuuuccucu ccaauucug 29 <210> 86 <211> 30 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 4nt deletion <400> 86 aacaaauuca uuuuuccucu ccaauucugc 30 <210> 87 <211> 31 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 3nt deletion <400> 87 aacaaauuca uuuuuccucu ccaauucugc a 31 <210> 88 <211> 32 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 2nt deletion <400> 88 aacaaauuca uuuuuccucu ccaauucugc ac 32 <210> 89 <211> 33 <212> RNA <213> Artificial Sequence <220> <223> fourth region of the tracrRNA (wild-type), 1nt deletion <400> 89 aacaaauuca uuuuuccucu ccaauucugc aca 33 <210> 90 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 20nt deletion <400> 90 gaaugaagga 10 <210> 91 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 19nt deletion <400> 91 cgaaugaagg a 11 <210> 92 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 18nt deletion <400> 92 acgaaugaag ga 12 <210> 93 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 17nt deletion <400> 93 gacgaaugaa gga 13 <210> 94 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 16nt deletion <400> 94 agacgaauga agga 14 <210> 95 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 15nt deletion <400> 95 uagacgaaug aagga 15 <210> 96 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 14nt deletion <400> 96 auagacgaau gaagga 16 <210> 97 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 13nt deletion <400> 97 aauagacgaa ugaagga 17 <210> 98 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 12nt deletion <400> 98 gaauagacga augaagga 18 <210> 99 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 11nt deletion <400> 99 cgaauagacg aaugaagga 19 <210> 100 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 10nt deletion <400> 100 ccgaauagac gaaugaagga 20 <210> 101 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 9nt deletion <400> 101 cccgaauaga cgaaugaagg a 21 <210> 102 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 8nt deletion <400> 102 acccgaauag acgaaugaag ga 22 <210> 103 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 7nt deletion <400> 103 aacccgaaua gacgaaugaa gga 23 <210> 104 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 6nt deletion <400> 104 gaacccgaau agacgaauga agga 24 <210> 105 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 5nt deletion <400> 105 agaacccgaa uagacgaaug aagga 25 <210> 106 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 4nt deletion <400> 106 cagaacccga auagacgaau gaagga 26 <210> 107 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 3nt deletion <400> 107 gcagaacccg aauagacgaa ugaagga 27 <210> 108 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 2nt deletion <400> 108 ugcagaaccc gaauagacga augaagga 28 <210> 109 <211> 29 <212> RNA <213> Artificial Sequence <220> <223> fifth region of the crRNA (wild-type), 1nt deletion <400> 109 uugcagaacc cgaauagacg aaugaagga 29 <210> 110 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 7bp deletion (mature form) <400> 110 aacaaagaaa gga 13 <210> 111 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 6bp deletion (mature form) <400> 111 aacaaaugaa aagga 15 <210> 112 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 5bp deletion (mature form) <400> 112 aacaaauuga aaaagga 17 <210> 113 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 4bp deletion (mature form) <400> 113 aacaaauucg aaagaagga 19 <210> 114 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 3bp deletion (mature form) <400> 114 aacaaauuca gaaaugaagg a 21 <210> 115 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 2bp deletion (mature form) <400> 115 aacaaauuca ugaaaaugaa gga 23 <210> 116 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region, 1bp deletion (mature form) <400> 116 aacaaauuca uugaaaaaug aagga 25 <210> 117 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> fourth region + Linker + fifth region (mature form) <400> 117 aacaaauuca uuugaaagaa ugaagga 27 <210> 118 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 20nt deletion in the first region) <400> 118 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 120 <210> 119 <211> 121 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 19nt deletion in the first region) <400> 119 aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 u 121 <210> 120 <211> 122 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 18nt deletion in the first region) <400> 120 gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag gugggcugcu 60 ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa acaaauucau 120 uu 122 <210> 121 <211> 123 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 17nt deletion in the first region) <400> 121 agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuu 123 <210> 122 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 16nt deletion in the first region) <400> 122 gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga aggugggcug 60 cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg aaacaaauuc 120 auuu 124 <210> 123 <211> 125 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 15nt deletion in the first region) <400> 123 ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuu 125 <210> 124 <211> 126 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 14nt deletion in the first region) <400> 124 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuu 126 <210> 125 <211> 127 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 13nt deletion in the first region) <400> 125 guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 127 <210> 126 <211> 128 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 12nt deletion in the first region) <400> 126 aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga gugaaggugg 60 gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc cucgaaacaa 120 128 <210> 127 <211> 129 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 11nt deletion in the first region) <400> 127 aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuu 129 <210> 128 <211> 130 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 10nt deletion in the first region) <400> 128 aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu 60 gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac 120 aaauucauuu 130 <210> 129 <211> 131 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 9nt deletion in the first region) <400> 129 uaaaguggag aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu u 131 <210> 130 <211> 132 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 8nt deletion in the first region) <400> 130 auaaagugga gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 caaauucau uu 132 <210> 131 <211> 133 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 7nt deletion in the first region) <400> 131 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuu 133 <210> 132 <211> 134 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 6nt deletion in the first region) <400> 132 ugauaaagug gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuu 134 <210> 133 <211> 135 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 5nt deletion in the first region) <400> 133 cugauaaagu ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug 60 aaggugggcu gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc 120 gaaacaaauu cauuu 135 <210> 134 <211> 136 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 4nt deletion in the first region) <400> 134 acugauaaag uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuu 136 <210> 135 <211> 137 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 3nt deletion in the first region) <400> 135 cacugauaaa guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag 60 ugaagguggg cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc 120 ucgaaacaaa uucauuu 137 <210> 136 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 2nt deletion in the first region) <400> 136 ucacugauaa aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuu 138 <210> 137 <211> 139 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 1nt deletion in the first region) <400> 137 uucacugaua aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug 60 agugaaggug ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac 120 ccucgaaaca aauucauuu 139 <210> 138 <211> 117 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 10bp deletion in the second) region) <400> 138 cuucacugau aaaguggaga accgcuucac cauuaugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa uucauuu 117 <210> 139 <211> 119 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 9bp deletion in the second) region) <400> 139 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuu 119 <210> 140 <211> 122 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 8bp deletion in the second) region) <400> 140 cuucacugau aaaguggaga accgcuucac caaauuagac uugagugaag gugggcugcu 60 ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa acaaauucau 120 uu 122 <210> 141 <211> 125 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 7bp deletion in the second) region) <400> 141 cuucacugau aaaguggaga accgcuucac caaaaguuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuu 125 <210> 142 <211> 127 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 6bp deletion in the second) region) <400> 142 cuucacugau aaaguggaga accgcuucac caaaagcuua ggaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 127 <210> 143 <211> 129 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 5bp deletion in the second) region) <400> 143 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuu 129 <210> 144 <211> 132 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 4bp deletion in the second) region) <400> 144 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 caaauucau uu 132 <210> 145 <211> 131 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 4bp+1nt deletion in the second) region) <400> 145 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu u 131 <210> 146 <211> 134 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 3bp deletion in the second) region) <400> 146 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuu 134 <210> 147 <211> 136 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 2bp deletion in the second) region) <400> 147 cuucacugau aaaguggaga accgcuucac caaaagcugu cuuaggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuu 136 <210> 148 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 1bp deletion in the second) region) <400> 148 cuucacugau aaaguggaga accgcuucac caaaagcugu ccuuagggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuu 138 <210> 149 <211> 133 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 7nt deletion in the fourth region) <400> 149 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaa 133 <210> 150 <211> 134 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 6nt deletion in the fourth region) <400> 150 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaau 134 <210> 151 <211> 135 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 5nt deletion in the fourth region) <400> 151 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauu 135 <210> 152 <211> 136 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 4nt deletion in the fourth region) <400> 152 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauuc 136 <210> 153 <211> 137 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 3nt deletion in the fourth region) <400> 153 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauuca 137 <210> 154 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 2nt deletion in the fourth region) <400> 154 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucau 138 <210> 155 <211> 139 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, 1nt deletion in the fourth region) <400> 155 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauu 139 <210> 156 <211> 97 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, modified first region and modified second region) <400> 156 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa uucauuu 97 <210> 157 <211> 113 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, modified first region and modified fourth and fifth region) <400> 157 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaa 113 <210> 158 <211> 110 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, modified second region and modified fourth and fifth region) <400> 158 cuucacugau aaaguggaga accgcuucac cauuaugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 110 <210> 159 <211> 90 <212> RNA <213> Artificial Sequence <220> <223> engineered tracrRNA (mature form, modified first region, modified second region, modified fourth and fifth region) <400> 159 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa 90 <210> 160 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat (mature form, 7nt deletion in the fifth region) <400> 160 ggaaugcaac 10 <210> 161 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat (mature form, 6nt deletion in the fifth region) <400> 161 aggaaugcaa c 11 <210> 162 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat (mature form, 5nt deletion in the fifth region) <400> 162 aaggaaugca ac 12 <210> 163 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat (mature form, 4nt deletion in the fifth region) <400> 163 gaaggaaugc aac 13 <210> 164 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat (mature form, 3nt deletion in the fifth region) <400> 164 ugaaggaaug caac 14 <210> 165 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat (mature form, 2nt deletion in the fifth region) <400> 165 augaaggaau gcaac 15 <210> 166 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> engineered crRNA repeat (mature form, 1nt deletion in the fifth region) <400> 166 aaugaaggaa ugcaac 16 <210> 167 <211> 141 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 20nt deletion in the first region) <400> 167 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa c 141 <210> 168 <211> 142 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 19nt deletion in the first region) <400> 168 aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 ugaaagaaug aaggaaugca ac 142 <210> 169 <211> 143 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 18nt deletion in the first region) <400> 169 gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag gugggcugcu 60 ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa acaaauucau 120 uugaaagaau gaaggaaugc aac 143 <210> 170 <211> 144 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 17nt deletion in the first region) <400> 170 agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuugaaagaa ugaaggaaug caac 144 <210> 171 <211> 145 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 16nt deletion in the first region) <400> 171 gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga aggugggcug 60 cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg aaacaaauuc 120 auuugaaaga augaaggaau gcaac 145 <210> 172 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 15nt deletion in the first region) <400> 172 ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuugaaag aaugaaggaa ugcaac 146 <210> 173 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 14nt deletion in the first region) <400> 173 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaac 147 <210> 174 <211> 148 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 13nt deletion in the first region) <400> 174 guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuugaa agaaugaagg aaugcaac 148 <210> 175 <211> 149 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 12nt deletion in the first region) <400> 175 aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga gugaaggugg 60 gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc cucgaaacaa 120 auucauuuga aagaaugaag gaaugcaac 149 <210> 176 <211> 150 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 11nt deletion in the first region) <400> 176 aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac 150 <210> 177 <211> 151 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 10nt deletion in the first region) <400> 177 aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu 60 gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac 120 aaauucauuu gaaagaauga aggaaugcaa c 151 <210> 178 <211> 152 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 9nt deletion in the first region) <400> 178 uaaaguggag aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu ugaaagaaug aaggaaugca ac 152 <210> 179 <211> 153 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 8nt deletion in the first region) <400> 179 auaaagugga gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uugaaagaau gaaggaaugc aac 153 <210> 180 <211> 154 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 7nt deletion in the first region) <400> 180 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caac 154 <210> 181 <211> 155 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 6nt deletion in the first region) <400> 181 ugauaaagug gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaac 155 <210> 182 <211> 156 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 5nt deletion in the first region) <400> 182 cugauaaagu ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug 60 aaggugggcu gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc 120 gaaacaaauu cauuugaaag aaugaaggaa ugcaac 156 <210> 183 <211> 157 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 4nt deletion in the first region) <400> 183 acugauaaag uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuugaaa gaaugaagga augcaac 157 <210> 184 <211> 158 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 3nt deletion in the first region) <400> 184 cacugauaaa guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag 60 ugaagguggg cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc 120 ucgaaacaaa uucauuugaa agaaugaagg aaugcaac 158 <210> 185 <211> 159 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 2nt deletion in the first region) <400> 185 ucacugauaa aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuuga aagaaugaag gaaugcaac 159 <210> 186 <211> 160 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 1nt deletion in the first region) <400> 186 uucacugaua aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug 60 agugaaggug ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac 120 ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac 160 <210> 187 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 11bp deletion in the second) region) <400> 187 cuucacugau aaaguggaga accgcuucac cauuaugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa 120 agaaugaagg aaugcaac 138 <210> 188 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 10bp deletion in the second region) <400> 188 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug 120 aaagaaugaa ggaaugcaac 140 <210> 189 <211> 142 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 9bp deletion in the second) region) <400> 189 cuucacugau aaaguggaga accgcuucac caaauuagcu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 ugaaagaaug aaggaaugca ac 142 <210> 190 <211> 144 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 8bp deletion in the second region) <400> 190 cuucacugau aaaguggaga accgcuucac caaaauuaga cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuugaaagaa ugaaggaaug caac 144 <210> 191 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 7bp deletion in the second) region) <400> 191 cuucacugau aaaguggaga accgcuucac caaaaguuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuugaaag aaugaaggaa ugcaac 146 <210> 192 <211> 148 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 6bp deletion in the second) region) <400> 192 cuucacugau aaaguggaga accgcuucac caaaagcuua ggaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuugaa agaaugaagg aaugcaac 148 <210> 193 <211> 150 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 5bp deletion in the second region) <400> 193 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac 150 <210> 194 <211> 152 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 4bp deletion in the second region) <400> 194 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu ugaaagaaug aaggaaugca ac 152 <210> 195 <211> 153 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 4bp+1nt deletion in the second region) <400> 195 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uugaaagaau gaaggaaugc aac 153 <210> 196 <211> 155 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 3bp deletion in the second) region) <400> 196 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaac 155 <210> 197 <211> 157 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 2bp deletion in the second) region) <400> 197 cuucacugau aaaguggaga accgcuucac caaaagcugu cuuaggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuugaaa gaaugaagga augcaac 157 <210> 198 <211> 159 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold(mature form, 1bp deletion in the second region) <400> 198 cuucacugau aaaguggaga accgcuucac caaaagcugu ccuuagggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuuga aagaaugaag gaaugcaac 159 <210> 199 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 7bp deletion in the fourth and fifth region) <400> 199 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaac 147 <210> 200 <211> 149 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 6bp deletion in the fourth and fifth region) <400> 200 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaaugaaaag gaaugcaac 149 <210> 201 <211> 151 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 5bp deletion in the fourth and fifth region) <400> 201 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauugaaaa aggaaugcaa c 151 <210> 202 <211> 153 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 4bp deletion in the fourth and fifth region) <400> 202 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aac 153 <210> 203 <211> 155 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 3bp deletion in the fourth and fifth region) <400> 203 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucagaa augaaggaau gcaac 155 <210> 204 <211> 157 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 2bp deletion in the fourth and fifth region) <400> 204 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaac 157 <210> 205 <211> 159 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, 1bp deletion in the fourth and fifth region) <400> 205 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauug aaaaaugaag gaaugcaac 159 <210> 206 <211> 118 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, modified first and second region) <400> 206 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa agaaugaagg aaugcaac 118 <210> 207 <211> 127 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, modified first, fourth and fifth region) <400> 207 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaac 127 <210> 208 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, modified second, fourth and fifth region) <400> 208 cuucacugau aaaguggaga accgcuucac cauuaugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug 120 caac 124 <210> 209 <211> 104 <212> RNA <213> Artificial Sequence <220> <223> engineered scaffold (mature form, modified first, second, fourth and fifth region) <400> 209 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caac 104 <210> 210 <211> 189 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 1nt deletion in the first region) <400> 210 uucacugaua aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug 60 agugaaggug ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac 120 ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn 180 189 <210> 211 <211> 188 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 2nt deletion in the first region) <400> 211 ucacugauaa aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuuga aagaaugaag gaaugcaacn nnnnnnnnnn nnnnnnnnnu 180 188 <210> 212 <211> 187 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 3nt deletion in the first region) <400> 212 cacugauaaa guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag 60 ugaagguggg cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc 120 ucgaaacaaa uucauuugaa agaaugaagg aaugcaacnn nnnnnnnnnn nnnnnnnnuu 180 187 <210> 213 <211> 186 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 4nt deletion in the first region) <400> 213 acugauaaag uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuugaaa gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnnuuu 180 186 <210> 214 <211> 185 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 5nt deletion in the first region) <400> 214 cugauaaagu ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug 60 aaggugggcu gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc 120 gaaacaaauu cauuugaaag aaugaaggaa ugcaacnnnn nnnnnnnnnn nnnnnnuuuu 180 185 <210> 215 <211> 184 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 6nt deletion in the first region) <400> 215 ugauaaagug gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaacnnnnn nnnnnnnnnn nnnnnuuuua 180 uuuu 184 <210> 216 <211> 183 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 7nt deletion in the first region) <400> 216 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caacnnnnnn nnnnnnnnnn nnnnuuuuau 180 uuu 183 <210> 217 <211> 182 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 8nt deletion in the first region) <400> 217 auaaagugga gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uugaaagaau gaaggaaugc aacnnnnnnn nnnnnnnnnn nnnuuuuauu 180 uu 182 <210> 218 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 9nt deletion in the first region) <400> 218 uaaaguggag aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu ugaaagaaug aaggaaugca acnnnnnnnn nnnnnnnnnn nnuuuuauuu 180 u 181 <210> 219 <211> 180 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 10nt deletion in the first region) <400> 219 aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu 60 gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac 120 aaauucauuu gaaagaauga aggaaugcaa cnnnnnnnnn nnnnnnnnnn nuuuuauuuu 180 180 <210> 220 <211> 179 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 11nt deletion in the first region) <400> 220 aaguggagaa ccgcuucacc aaaagcuguc ccuuagggga uuagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn uuuuauuuu 179 <210> 221 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 12nt deletion in the first region) <400> 221 aguggagaac cgcuucacca aaagcugucc cuuaggggau uagaacuuga gugaaggugg 60 gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc cucgaaacaa 120 auucauuuga aagaaugaag gaaugcaacn nnnnnnnnnn nnnnnnnnnu 170 <210> 222 <211> 168 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 13nt deletion in the first region) <400> 222 guggagaacc gcuucaccaa aagcuguccc uuaggggauu agaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuugaa agaaugaagg aaugcaacnn nnnnnnnnnn nnnnnnnn 168 <210> 223 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 14nt deletion in the first region) <400> 223 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 167 <210> 224 <211> 166 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 15nt deletion in the first region) <400> 224 ggagaaccgc uucaccaaaa gcugucccuu aggggauuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuugaaag aaugaaggaa ugcaacnnnn nnnnnnnnnn nnnnnn 166 <210> 225 <211> 165 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 16nt deletion in the first region) <400> 225 gagaaccgcu ucaccaaaag cugucccuua ggggauuaga acuugaguga aggugggcug 60 cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg aaacaaauuc 120 auuugaaaga augaaggaau gcaacnnnnn nnnnnnnnnn nnnnn 165 <210> 226 <211> 164 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 17nt deletion in the first region) <400> 226 agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuugaaagaa ugaaggaaug caacnnnnnn nnnnnnnnnn nnnn 164 <210> 227 <211> 163 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 18nt deletion in the first region) <400> 227 gaaccgcuuc accaaaagcu gucccuuagg ggauuagaac uugagugaag gugggcugcu 60 ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa acaaauucau 120 uugaaagaau gaaggaaugc aacnnnnnnn nnnnnnnnnn nnn 163 <210> 228 <211> 162 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 19nt deletion in the first region) <400> 228 aaccgcuuca ccaaaagcug ucccuuaggg gauuagaacu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 ugaaagaaug aaggaaugca acnnnnnnnn nnnnnnnnnn nn 162 <210> 229 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 20nt deletion in the first region) <400> 229 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa cnnnnnnnnn nnnnnnnnnn n 161 <210> 230 <211> 179 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 1bp deletion in the second region) <400> 230 cuucacugau aaaguggaga accgcuucac caaaagcugu ccuuagggau uagaacuuga 60 gugaaggugg gcugcuugca ucagccuaau gucgagaagu gcuuucuucg gaaaguaacc 120 cucgaaacaa auucauuuga aagaaugaag gaaugcaacn nnnnnnnnnn nnnnnnnnnn 179 <210> 231 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 2bp deletion in the second region) <400> 231 cuucacugau aaaguggaga accgcuucac caaaagcugu cuuaggauua gaacuugagu 60 gaaggugggc ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu 120 cgaaacaaau ucauuugaaa gaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 177 <210> 232 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 3bp deletion in the second region) <400> 232 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaacnnnnn nnnnnnnnnn nnnnn 175 <210> 233 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 4bp+1nt deletion in the second region) <400> 233 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguuagaac uugagugaag 60 gugggcugcu ugcaucagcc uaaugucgag aagugcuuuc uucggaaagu aacccucgaa 120 acaaauucau uugaaagaau gaaggaaugc aacnnnnnnn nnnnnnnnnn nnn 173 <210> 234 <211> 172 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 4bp deletion in the second region) <400> 234 cuucacugau aaaguggaga accgcuucac caaaagcugu uaguagaacu ugagugaagg 60 ugggcugcuu gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa 120 caaauucauu ugaaagaaug aaggaaugca acnnnnnnnn nnnnnnnnnn nn 172 <210> 235 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 5bp deletion in the second region) <400> 235 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn 170 <210> 236 <211> 168 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 6bp deletion in the second region) <400> 236 cuucacugau aaaguggaga accgcuucac caaaagcuua ggaacuugag ugaagguggg 60 cugcuugcau cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa 120 uucauuugaa agaaugaagg aaugcaacnn nnnnnnnnnn nnnnnnnn 168 <210> 237 <211> 166 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 7bp deletion in the second region) <400> 237 cuucacugau aaaguggaga accgcuucac caaaaguuag aacuugagug aaggugggcu 60 gcuugcauca gccuaauguc gagaagugcu uucuucggaa aguaacccuc gaaacaaauu 120 cauuugaaag aaugaaggaa ugcaacnnnn nnnnnnnnnn nnnnnn 166 <210> 238 <211> 164 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 8bp deletion in the second region) <400> 238 cuucacugau aaaguggaga accgcuucac caaaauuaga cuugagugaa ggugggcugc 60 uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga aacaaauuca 120 uuugaaagaa ugaaggaaug caacnnnnnn nnnnnnnnnn nnnn 164 <210> 239 <211> 162 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 9bp deletion in the second region) <400> 239 cuucacugau aaaguggaga accgcuucac caaauuagcu ugagugaagg ugggcugcuu 60 gcaucagccu aaugucgaga agugcuuucu ucggaaagua acccucgaaa caaauucauu 120 ugaaagaaug aaggaaugca acnnnnnnnn nnnnnnnnnn nn 162 <210> 240 <211> 160 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 10bp deletion in the second region) <400> 240 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug 120 aaagaaugaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn 160 <210> 241 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 11bp deletion in the second region) <400> 241 cuucacugau aaaguggaga accgcuucac cauuaugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa 120 agaaugaagg aaugcaacnn nnnnnnnnnn nnnnnnnnuu uuauuuu 167 <210> 242 <211> 179 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 1bp deletion in the fourth and fifth region) <400> 242 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauug aaaaaugaag gaaugcaacn nnnnnnnnnn nnnnnnnnn 179 <210> 243 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 2bp deletion in the fourth and fifth region) <400> 243 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaacnnn nnnnnnnnnn nnnnnnn 177 <210> 244 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 3bp deletion in the fourth and fifth region) <400> 244 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucagaa augaaggaau gcaacnnnnn nnnnnnnnnn nnnnn 175 <210> 245 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 4bp deletion in the fourth and fifth region) <400> 245 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aacnnnnnnn nnnnnnnnnn nnn 173 <210> 246 <211> 171 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 5bp deletion in the fourth and fifth region) <400> 246 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauugaaaa aggaaugcaa cnnnnnnnnn nnnnnnnnnn n 171 <210> 247 <211> 169 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 6bp deletion in the fourth and fifth region) <400> 247 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaaugaaaag gaaugcaacn nnnnnnnnnn nnnnnnnnn 169 <210> 248 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, 7bp deletion in the fourth and fifth region) <400> 248 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaacnnn nnnnnnnnnn nnnnnnn 167 <210> 249 <211> 138 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first and second region) <400> 249 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa agaaugaagg aaugcaacnn 120 nnnnnnnnnn nnnnnnnn 138 <210> 250 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first, fourth and fifth region) <400> 250 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaacnnn nnnnnnnnnn nnnnnnn 147 <210> 251 <211> 144 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified second, fourth and fifth region) <400> 251 cuucacugau aaaguggaga accgcuucac cauuaugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug 120 caacnnnnnn nnnnnnnnnn nnnn 144 <210> 252 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first, second, fourth and fifth region) <400> 252 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 253 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first, second, fourth and fifth region) <400> 253 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 254 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first, second, fourth and fifth region) <400> 254 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 255 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first, second, fourth and fifth region) <400> 255 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 256 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first, second, fourth and fifth region) <400> 256 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 257 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first, second, fourth and fifth region) <400> 257 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 258 <211> 124 <212> RNA <213> Artificial Sequence <220> <223> sgRNA (mature form, modified first, second, fourth and fifth region) <400> 258 accgcuucac cauuagugag ugaagguggg cugcuugcau cagccuaaug ucgagaagug 60 cuuucuucgg aaaguaaccc ucgaaacaaa gaaaggaaug caacnnnnnn nnnnnnnnnn 120 nnnn 124 <210> 259 <211> 529 <212> PRT <213> Artificial Sequence <220> <223> Cas14a1 amino acid sequence <400> 259 Met Ala Lys Asn Thr Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg 1 5 10 15 Pro Tyr Asn Ser Ala Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn 20 25 30 Asn Arg Glu Lys Ile Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu 35 40 45 Ala Cys Ser Lys His Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val 50 55 60 Glu Arg Asn Ala Cys Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys 65 70 75 80 Phe Tyr Gln Lys Leu Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln 85 90 95 Glu Ile Ser Glu Ile Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile 100 105 110 Tyr Asn Gln Ser Leu Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly 115 120 125 Lys Gly Ile Ala Asn Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val 130 135 140 Cys Tyr Thr Arg Ala Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser 145 150 155 160 Gly Leu Arg Ser Lys Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys 165 170 175 Asn Met Lys Ser Gly Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile 180 185 190 Pro Leu Val Lys Gln Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser 195 200 205 Asn His Asn Ser Asp Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln 210 215 220 Val Lys Lys Glu Ile Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe 225 230 235 240 Glu Gln Val Gln Lys Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr 245 250 255 Gln Arg Arg Lys Arg Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu 260 265 270 Ala Glu Ile Lys Lys Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile 275 280 285 Glu Val Lys Arg Gly Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu 290 295 300 Asn Leu Ser Ile Asp Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser 305 310 315 320 Ile Ile Gly Gly Ile Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala 325 330 335 Ile Asn Asn Ala Phe Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe 340 345 350 His Phe Asn Lys Lys Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys 355 360 365 Asn Arg His Lys Arg Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro 370 375 380 Ile Thr Ile Leu Thr Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile 385 390 395 400 Glu Arg Trp Ala Cys Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val 405 410 415 Gly Thr Val Gln Met Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp 420 425 430 Ser Tyr Phe Asn Ile Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met 435 440 445 Gln Asn Lys Ile Glu Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg 450 455 460 Lys Val Ala Pro Asn Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His 465 470 475 480 Leu Asn Asn Tyr Phe Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro 485 490 495 His Phe Lys Cys Glu Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn 500 505 510 Ala Ala Leu Asn Ile Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu 515 520 525 Pro <210> 260 <211> 536 <212> PRT <213> Artificial Sequence <220> <223> N-terminal NLS + Cas14a1 amino acid sequence <400> 260 Met Pro Lys Lys Lys Arg Lys Val Ala Lys Asn Thr Ile Thr Lys Thr 1 5 10 15 Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser Ala Glu Val Glu Lys 20 25 30 Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys Ile Ala Leu Glu Lys 35 40 45 Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys His Leu Lys Val Ala 50 55 60 Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala Cys Leu Phe Cys Lys 65 70 75 80 Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys Leu Arg Gly Gln Phe 85 90 95 Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu Ile Phe Arg Gln Leu 100 105 110 Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser Leu Ile Glu Leu Tyr 115 120 125 Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala Asn Ala Ser Ser Val 130 135 140 Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg Ala Ala Glu Leu Phe 145 150 155 160 Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser Lys Ile Lys Ser Asn 165 170 175 Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser Gly Leu Pro Thr Thr 180 185 190 Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys Gln Lys Gly Gly Gln 195 200 205 Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser Asp Phe Ile Ile Lys 210 215 220 Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu Ile Asp Lys Tyr Arg 225 230 235 240 Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln Lys Ser Pro Lys Pro 245 250 255 Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys Arg Asn Lys Gly Trp 260 265 270 Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys Lys Val Met Asn Gly 275 280 285 Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg Gly Ser Lys Ile Gly 290 295 300 Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile Asp Val Pro Lys Ile 305 310 315 320 Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly Ile Asp Val Gly Val 325 330 335 Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala Phe Ser Arg Tyr Ser 340 345 350 Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys Lys Met Phe Ala Arg 355 360 365 Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys Arg Ala Gly His Gly 370 375 380 Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu Thr Glu Lys Ser Glu 385 390 395 400 Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala Cys Glu Ile Ala Asp 405 410 415 Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln Met Glu Asn Leu Glu 420 425 430 Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn Ile Arg Leu Arg Gly 435 440 445 Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile Glu Phe Lys Leu Lys 450 455 460 Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro Asn Asn Thr Ser Lys 465 470 475 480 Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr Phe Asn Phe Glu Tyr 485 490 495 Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys Glu Lys Cys Asn Phe 500 505 510 Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn Ile Ser Asn Pro Lys 515 520 525 Leu Lys Ser Thr Lys Glu Glu Pro 530 535 <210> 261 <211> 536 <212> PRT <213> Artificial Sequence <220> <223> C-terminal NLS + Cas14a1 amino acid sequence <400> 261 Met Ala Lys Asn Thr Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg 1 5 10 15 Pro Tyr Asn Ser Ala Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn 20 25 30 Asn Arg Glu Lys Ile Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu 35 40 45 Ala Cys Ser Lys His Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val 50 55 60 Glu Arg Asn Ala Cys Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys 65 70 75 80 Phe Tyr Gln Lys Leu Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln 85 90 95 Glu Ile Ser Glu Ile Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile 100 105 110 Tyr Asn Gln Ser Leu Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly 115 120 125 Lys Gly Ile Ala Asn Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val 130 135 140 Cys Tyr Thr Arg Ala Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser 145 150 155 160 Gly Leu Arg Ser Lys Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys 165 170 175 Asn Met Lys Ser Gly Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile 180 185 190 Pro Leu Val Lys Gln Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser 195 200 205 Asn His Asn Ser Asp Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln 210 215 220 Val Lys Lys Glu Ile Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe 225 230 235 240 Glu Gln Val Gln Lys Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr 245 250 255 Gln Arg Arg Lys Arg Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu 260 265 270 Ala Glu Ile Lys Lys Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile 275 280 285 Glu Val Lys Arg Gly Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu 290 295 300 Asn Leu Ser Ile Asp Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser 305 310 315 320 Ile Ile Gly Gly Ile Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala 325 330 335 Ile Asn Asn Ala Phe Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe 340 345 350 His Phe Asn Lys Lys Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys 355 360 365 Asn Arg His Lys Arg Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro 370 375 380 Ile Thr Ile Leu Thr Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile 385 390 395 400 Glu Arg Trp Ala Cys Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val 405 410 415 Gly Thr Val Gln Met Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp 420 425 430 Ser Tyr Phe Asn Ile Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met 435 440 445 Gln Asn Lys Ile Glu Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg 450 455 460 Lys Val Ala Pro Asn Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His 465 470 475 480 Leu Asn Asn Tyr Phe Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro 485 490 495 His Phe Lys Cys Glu Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn 500 505 510 Ala Ala Leu Asn Ile Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu 515 520 525 Pro Pro Lys Lys Lys Arg Lys Val 530 535 <210> 262 <211> 543 <212> PRT <213> Artificial Sequence <220> <223> N/C-terminal NLS + Cas14a1 amino acid sequence <400> 262 Met Pro Lys Lys Lys Arg Lys Val Ala Lys Asn Thr Ile Thr Lys Thr 1 5 10 15 Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser Ala Glu Val Glu Lys 20 25 30 Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys Ile Ala Leu Glu Lys 35 40 45 Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys His Leu Lys Val Ala 50 55 60 Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala Cys Leu Phe Cys Lys 65 70 75 80 Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys Leu Arg Gly Gln Phe 85 90 95 Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu Ile Phe Arg Gln Leu 100 105 110 Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser Leu Ile Glu Leu Tyr 115 120 125 Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala Asn Ala Ser Ser Val 130 135 140 Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg Ala Ala Glu Leu Phe 145 150 155 160 Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser Lys Ile Lys Ser Asn 165 170 175 Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser Gly Leu Pro Thr Thr 180 185 190 Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys Gln Lys Gly Gly Gln 195 200 205 Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser Asp Phe Ile Ile Lys 210 215 220 Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu Ile Asp Lys Tyr Arg 225 230 235 240 Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln Lys Ser Pro Lys Pro 245 250 255 Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys Arg Asn Lys Gly Trp 260 265 270 Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys Lys Val Met Asn Gly 275 280 285 Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg Gly Ser Lys Ile Gly 290 295 300 Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile Asp Val Pro Lys Ile 305 310 315 320 Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly Ile Asp Val Gly Val 325 330 335 Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala Phe Ser Arg Tyr Ser 340 345 350 Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys Lys Met Phe Ala Arg 355 360 365 Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys Arg Ala Gly His Gly 370 375 380 Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu Thr Glu Lys Ser Glu 385 390 395 400 Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala Cys Glu Ile Ala Asp 405 410 415 Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln Met Glu Asn Leu Glu 420 425 430 Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn Ile Arg Leu Arg Gly 435 440 445 Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile Glu Phe Lys Leu Lys 450 455 460 Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro Asn Asn Thr Ser Lys 465 470 475 480 Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr Phe Asn Phe Glu Tyr 485 490 495 Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys Glu Lys Cys Asn Phe 500 505 510 Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn Ile Ser Asn Pro Lys 515 520 525 Leu Lys Ser Thr Lys Glu Glu Pro Pro Lys Lys Lys Arg Lys Val 530 535 540 <210> 263 <211> 1003 <212> PRT <213> Artificial Sequence <220> <223> Amino acid sequence (N terminal -Cytidine deaminase) <400> 263 Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala 1 5 10 15 Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val 20 25 30 Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu 35 40 45 Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn 50 55 60 Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu 65 70 75 80 Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser 85 90 95 Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser 100 105 110 Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His 115 120 125 His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser 130 135 140 Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp 145 150 155 160 Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg 165 170 175 Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile 180 185 190 Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro 195 200 205 Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg 210 215 220 Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Gly Ser 225 230 235 240 Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala 245 250 255 Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser Ala Lys Asn Thr 260 265 270 Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser Ala 275 280 285 Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys Ile 290 295 300 Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys His 305 310 315 320 Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala Cys 325 330 335 Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys Leu 340 345 350 Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu Ile 355 360 365 Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser Leu 370 375 380 Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala Asn 385 390 395 400 Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg Ala 405 410 415 Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser Lys 420 425 430 Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser Gly 435 440 445 Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys Gln 450 455 460 Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser Asp 465 470 475 480 Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu Ile 485 490 495 Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln Lys 500 505 510 Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys Arg 515 520 525 Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys Lys 530 535 540 Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg Gly 545 550 555 560 Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile Asp 565 570 575 Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly Ile 580 585 590 Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala Phe 595 600 605 Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys Lys 610 615 620 Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys Arg 625 630 635 640 Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu Thr 645 650 655 Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala Cys 660 665 670 Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln Met 675 680 685 Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn Ile 690 695 700 Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile Glu 705 710 715 720 Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro Asn 725 730 735 Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr Phe 740 745 750 Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys Glu 755 760 765 Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn Ile 770 775 780 Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu Pro Ser Gly Gly Ser 785 790 795 800 Gly Gly Ser Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu 805 810 815 Thr Gly Lys Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu 820 825 830 Glu Val Glu Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val 835 840 845 His Thr Ala Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr 850 855 860 Ser Asp Ala Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser 865 870 875 880 Asn Gly Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Gly Gly Ser 885 890 895 Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys 900 905 910 Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu 915 920 925 Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala 930 935 940 Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala 945 950 955 960 Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu 965 970 975 Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Lys Arg Thr Ala Asp Gly 980 985 990 Ser Glu Phe Glu Pro Lys Lys Lys Arg Lys Val 995 1000 <210> 264 <211> 1003 <212> PRT <213> Artificial Sequence <220> <223> Amino acid sequence (C terminal -Cytidine deaminase) <400> 264 Met Pro Lys Lys Lys Arg Lys Val Ala Lys Asn Thr Ile Thr Lys Thr 1 5 10 15 Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser Ala Glu Val Glu Lys 20 25 30 Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys Ile Ala Leu Glu Lys 35 40 45 Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys His Leu Lys Val Ala 50 55 60 Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala Cys Leu Phe Cys Lys 65 70 75 80 Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys Leu Arg Gly Gln Phe 85 90 95 Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu Ile Phe Arg Gln Leu 100 105 110 Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser Leu Ile Glu Leu Tyr 115 120 125 Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala Asn Ala Ser Ser Val 130 135 140 Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg Ala Ala Glu Leu Phe 145 150 155 160 Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser Lys Ile Lys Ser Asn 165 170 175 Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser Gly Leu Pro Thr Thr 180 185 190 Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys Gln Lys Gly Gly Gln 195 200 205 Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser Asp Phe Ile Ile Lys 210 215 220 Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu Ile Asp Lys Tyr Arg 225 230 235 240 Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln Lys Ser Pro Lys Pro 245 250 255 Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys Arg Asn Lys Gly Trp 260 265 270 Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys Lys Val Met Asn Gly 275 280 285 Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg Gly Ser Lys Ile Gly 290 295 300 Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile Asp Val Pro Lys Ile 305 310 315 320 Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly Ile Asp Val Gly Val 325 330 335 Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala Phe Ser Arg Tyr Ser 340 345 350 Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys Lys Met Phe Ala Arg 355 360 365 Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys Arg Ala Gly His Gly 370 375 380 Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu Thr Glu Lys Ser Glu 385 390 395 400 Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala Cys Glu Ile Ala Asp 405 410 415 Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln Met Glu Asn Leu Glu 420 425 430 Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn Ile Arg Leu Arg Gly 435 440 445 Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile Glu Phe Lys Leu Lys 450 455 460 Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro Asn Asn Thr Ser Lys 465 470 475 480 Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr Phe Asn Phe Glu Tyr 485 490 495 Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys Glu Lys Cys Asn Phe 500 505 510 Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn Ile Ser Asn Pro Lys 515 520 525 Leu Lys Ser Thr Lys Glu Glu Pro Ser Gly Gly Ser Ser Gly Gly Ser 530 535 540 Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser 545 550 555 560 Ser Gly Gly Ser Ser Gly Gly Ser Ser Ser Glu Thr Gly Pro Val Ala 565 570 575 Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val 580 585 590 Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu 595 600 605 Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn 610 615 620 Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu 625 630 635 640 Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser 645 650 655 Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser 660 665 670 Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His 675 680 685 His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser 690 695 700 Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp 705 710 715 720 Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg 725 730 735 Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile 740 745 750 Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro 755 760 765 Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg 770 775 780 Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Gly Ser 785 790 795 800 Gly Gly Ser Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu 805 810 815 Thr Gly Lys Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu 820 825 830 Glu Val Glu Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val 835 840 845 His Thr Ala Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr 850 855 860 Ser Asp Ala Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser 865 870 875 880 Asn Gly Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Gly Gly Ser 885 890 895 Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys 900 905 910 Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu 915 920 925 Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala 930 935 940 Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala 945 950 955 960 Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu 965 970 975 Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Lys Arg Thr Ala Asp Gly 980 985 990 Ser Glu Phe Glu Pro Lys Lys Lys Arg Lys Val 995 1000 <210> 265 <211> 941 <212> PRT <213> Artificial Sequence <220> <223> DNA sequence (N terminal -Adenine deaminase) <400> 265 Met Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu 1 5 10 15 Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala 20 25 30 Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro 35 40 45 Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg 50 55 60 Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu 65 70 75 80 Tyr Val Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His 85 90 95 Ser Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly 100 105 110 Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn His 115 120 125 Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu 130 135 140 Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys 145 150 155 160 Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser 165 170 175 Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser 180 185 190 Gly Gly Ser Ser Gly Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr 195 200 205 Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg 210 215 220 Glu Val Pro Val Gly Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly 225 230 235 240 Glu Gly Trp Asn Arg Ala Ile Gly Leu His Asp Pro Thr Ala His Ala 245 250 255 Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg 260 265 270 Leu Ile Asp Ala Thr Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys 275 280 285 Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val Val Phe Gly Val 290 295 300 Arg Asn Ala Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val Leu His 305 310 315 320 Tyr Pro Gly Met Asn His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala 325 330 335 Asp Glu Cys Ala Ala Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln 340 345 350 Val Phe Asn Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly 355 360 365 Ser Ser Gly Gly Ser Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser 370 375 380 Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser Ala Lys Asn 385 390 395 400 Thr Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg Pro Tyr Asn Ser 405 410 415 Ala Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn Asn Arg Glu Lys 420 425 430 Ile Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu Ala Cys Ser Lys 435 440 445 His Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val Glu Arg Asn Ala 450 455 460 Cys Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys Phe Tyr Gln Lys 465 470 475 480 Leu Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln Glu Ile Ser Glu 485 490 495 Ile Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile Tyr Asn Gln Ser 500 505 510 Leu Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly Lys Gly Ile Ala 515 520 525 Asn Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val Cys Tyr Thr Arg 530 535 540 Ala Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser Gly Leu Arg Ser 545 550 555 560 Lys Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys Asn Met Lys Ser 565 570 575 Gly Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile Pro Leu Val Lys 580 585 590 Gln Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser Asn His Asn Ser 595 600 605 Asp Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln Val Lys Lys Glu 610 615 620 Ile Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe Glu Gln Val Gln 625 630 635 640 Lys Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr Gln Arg Arg Lys 645 650 655 Arg Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu Ala Glu Ile Lys 660 665 670 Lys Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile Glu Val Lys Arg 675 680 685 Gly Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu Asn Leu Ser Ile 690 695 700 Asp Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser Ile Ile Gly Gly 705 710 715 720 Ile Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala Ile Asn Asn Ala 725 730 735 Phe Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe His Phe Asn Lys 740 745 750 Lys Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys Asn Arg His Lys 755 760 765 Arg Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro Ile Thr Ile Leu 770 775 780 Thr Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile Glu Arg Trp Ala 785 790 795 800 Cys Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val Gly Thr Val Gln 805 810 815 Met Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp Ser Tyr Phe Asn 820 825 830 Ile Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met Gln Asn Lys Ile 835 840 845 Glu Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg Lys Val Ala Pro 850 855 860 Asn Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His Leu Asn Asn Tyr 865 870 875 880 Phe Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro His Phe Lys Cys 885 890 895 Glu Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn Ala Ala Leu Asn 900 905 910 Ile Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu Pro Lys Arg Pro 915 920 925 Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 930 935 940 <210> 266 <211> 941 <212> PRT <213> Artificial Sequence <220> <223> DNA sequence (C terminal -Adenine deaminase) <400> 266 Met Ala Lys Asn Thr Ile Thr Lys Thr Leu Lys Leu Arg Ile Val Arg 1 5 10 15 Pro Tyr Asn Ser Ala Glu Val Glu Lys Ile Val Ala Asp Glu Lys Asn 20 25 30 Asn Arg Glu Lys Ile Ala Leu Glu Lys Asn Lys Asp Lys Val Lys Glu 35 40 45 Ala Cys Ser Lys His Leu Lys Val Ala Ala Tyr Cys Thr Thr Gln Val 50 55 60 Glu Arg Asn Ala Cys Leu Phe Cys Lys Ala Arg Lys Leu Asp Asp Lys 65 70 75 80 Phe Tyr Gln Lys Leu Arg Gly Gln Phe Pro Asp Ala Val Phe Trp Gln 85 90 95 Glu Ile Ser Glu Ile Phe Arg Gln Leu Gln Lys Gln Ala Ala Glu Ile 100 105 110 Tyr Asn Gln Ser Leu Ile Glu Leu Tyr Tyr Glu Ile Phe Ile Lys Gly 115 120 125 Lys Gly Ile Ala Asn Ala Ser Ser Val Glu His Tyr Leu Ser Asp Val 130 135 140 Cys Tyr Thr Arg Ala Ala Glu Leu Phe Lys Asn Ala Ala Ile Ala Ser 145 150 155 160 Gly Leu Arg Ser Lys Ile Lys Ser Asn Phe Arg Leu Lys Glu Leu Lys 165 170 175 Asn Met Lys Ser Gly Leu Pro Thr Thr Lys Ser Asp Asn Phe Pro Ile 180 185 190 Pro Leu Val Lys Gln Lys Gly Gly Gln Tyr Thr Gly Phe Glu Ile Ser 195 200 205 Asn His Asn Ser Asp Phe Ile Ile Lys Ile Pro Phe Gly Arg Trp Gln 210 215 220 Val Lys Lys Glu Ile Asp Lys Tyr Arg Pro Trp Glu Lys Phe Asp Phe 225 230 235 240 Glu Gln Val Gln Lys Ser Pro Lys Pro Ile Ser Leu Leu Leu Ser Thr 245 250 255 Gln Arg Arg Lys Arg Asn Lys Gly Trp Ser Lys Asp Glu Gly Thr Glu 260 265 270 Ala Glu Ile Lys Lys Val Met Asn Gly Asp Tyr Gln Thr Ser Tyr Ile 275 280 285 Glu Val Lys Arg Gly Ser Lys Ile Gly Glu Lys Ser Ala Trp Met Leu 290 295 300 Asn Leu Ser Ile Asp Val Pro Lys Ile Asp Lys Gly Val Asp Pro Ser 305 310 315 320 Ile Ile Gly Gly Ile Asp Val Gly Val Lys Ser Pro Leu Val Cys Ala 325 330 335 Ile Asn Asn Ala Phe Ser Arg Tyr Ser Ile Ser Asp Asn Asp Leu Phe 340 345 350 His Phe Asn Lys Lys Met Phe Ala Arg Arg Arg Ile Leu Leu Lys Lys 355 360 365 Asn Arg His Lys Arg Ala Gly His Gly Ala Lys Asn Lys Leu Lys Pro 370 375 380 Ile Thr Ile Leu Thr Glu Lys Ser Glu Arg Phe Arg Lys Lys Leu Ile 385 390 395 400 Glu Arg Trp Ala Cys Glu Ile Ala Asp Phe Phe Ile Lys Asn Lys Val 405 410 415 Gly Thr Val Gln Met Glu Asn Leu Glu Ser Met Lys Arg Lys Glu Asp 420 425 430 Ser Tyr Phe Asn Ile Arg Leu Arg Gly Phe Trp Pro Tyr Ala Glu Met 435 440 445 Gln Asn Lys Ile Glu Phe Lys Leu Lys Gln Tyr Gly Ile Glu Ile Arg 450 455 460 Lys Val Ala Pro Asn Asn Thr Ser Lys Thr Cys Ser Lys Cys Gly His 465 470 475 480 Leu Asn Asn Tyr Phe Asn Phe Glu Tyr Arg Lys Lys Asn Lys Phe Pro 485 490 495 His Phe Lys Cys Glu Lys Cys Asn Phe Lys Glu Asn Ala Asp Tyr Asn 500 505 510 Ala Ala Leu Asn Ile Ser Asn Pro Lys Leu Lys Ser Thr Lys Glu Glu 515 520 525 Pro Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly 530 535 540 Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly 545 550 555 560 Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu 565 570 575 Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala 580 585 590 Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro 595 600 605 Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg 610 615 620 Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu 625 630 635 640 Tyr Val Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His 645 650 655 Ser Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly 660 665 670 Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn His 675 680 685 Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu 690 695 700 Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys 705 710 715 720 Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser 725 730 735 Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser 740 745 750 Gly Gly Ser Ser Gly Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr 755 760 765 Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg 770 775 780 Glu Val Pro Val Gly Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly 785 790 795 800 Glu Gly Trp Asn Arg Ala Ile Gly Leu His Asp Pro Thr Ala His Ala 805 810 815 Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg 820 825 830 Leu Ile Asp Ala Thr Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys 835 840 845 Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val Val Phe Gly Val 850 855 860 Arg Asn Ala Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val Leu His 865 870 875 880 Tyr Pro Gly Met Asn His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala 885 890 895 Asp Glu Cys Ala Ala Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln 900 905 910 Val Phe Asn Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Lys Arg Pro 915 920 925 Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 930 935 940 <210> 267 <211> 1680 <212> DNA <213> Artificial Sequence <220> <223> Human codon-optimized Cas14a1 with NLS <400> 267 atgccaaaga agaagcggaa ggtcggtatc cacggagtcc cagcagccgc caagaacaca 60 attacaaaga cactgaagct gaggatcgtg agaccataca acagcgctga ggtcgagaag 120 attgtggctg atgaaaagaa caacagggaa aagatcgccc tcgagaagaa caaggataag 180 gtgaaggagg cctgctctaa gcacctgaaa gtggccgcct actgcaccac acaggtggag 240 aggaacgcct gtctgttttg taaagctcgg aagctggatg ataagtttta ccagaagctg 300 cggggccagt tccccgatgc cgtcttttgg caggagatta gcgagatctt cagacagctg 360 cagaagcagg ccgccgagat ctacaaccag agcctgatcg agctctacta cgagatcttc 420 atcaagggca agggcattgc caacgcctcc tccgtggagc actacctgag cgacgtgtgc 480 tacacaagag ccgccgagct ctttaagaac gccgctatcg cttccgggct gaggagcaag 540 attaagagta acttccggct caaggagctg aagaacatga agagcggcct gcccactaca 600 aagagcgaca acttcccaat tccactggtg aagcagaagg ggggccagta cacagggttc 660 gagattcca accacaacag cgactttatt attaagatcc cctttggcag gtggcaggtc 720 aagaaggaga ttgacaagta caggccctgg gagaagtttg atttcgagca ggtgcagaag 780 agccccaagc ctatttccct gctgctgtcc acacagcggc ggaagaggaa caaggggtgg 840 tctaaggatg aggggaccga ggccgagatt aagaaagtga tgaacggcga ctaccagaca 900 agctacatcg aggtcaagcg gggcagtaag attggcgaga agagcgcctg gatgctgaac 960 ctgagcattg acgtgccaaa gattgataag ggcgtggatc ccagcatcat cggagggatc 1020 gatgtggggg tcaagagccc cctcgtgtgc gccatcaaca acgccttcag caggtacagc 1080 atctccgata acgacctgtt ccactttaac aagaagatgt tcgcccggcg gaggattttg 1140 ctcaagaaga accggcacaa gcgggccgga cacggggcca agaacaagct caagcccatc 1200 actatcctga ccgagaagag cgagaggttc aggaagaagc tcatcgagag atgggcctgc 1260 gagatcgccg atttctttat taagaacaag gtcggaacag tgcagatgga gaacctcgag 1320 agcatgaaga ggaaggagga ttcctacttc aacattcggc tgagggggtt ctggccctac 1380 gctgagatgc agaacaagat tgagtttaag ctgaagcagt acgggattga gatccggaag 1440 gtggccccca acaacaccag caagacctgc agcaagtgcg ggcacctcaa caactacttc 1500 aacttcgagt accggaagaa gaacaagttc ccacacttca agtgcgagaa gtgcaacttt 1560 aaggagaacg ccgattacaa cgccgccctg aacatcagca accctaagct gaagagcact 1620 aaggaggagc ccaaaaggcc ggcggccacg aaaaaggccg gccaggcaaa aaagaaaaag 1680 1680 <210> 268 <211> 3009 <212> DNA <213> Artificial Sequence <220> <223> Deaminase fusion Cas14a1 <400> 268 atgccaaaga agaagcggaa agtctcctca gagactgggc ctgtcgccgt cgatccaacc 60 ctgcgccgcc ggattgaacc tcacgagttt gaagtgttct ttgacccccg ggagctgaga 120 aaggagacat gcctgctgta cgagatcaac tggggaggca ggcactccat ctggaggcac 180 acctctcaga acacaaataa gcacgtggag gtgaacttca tcgagaagtt taccacagag 240 cggtacttct gccccaatac cagatgtagc atcacatggt ttctgagctg gtccccttgc 300 ggagagtgta gcagggccat caccgagttc ctgtccagat atccacacgt gacactgttt 360 atctacatcg ccaggctgta tcaccacgca gacccaagga ataggcaggg cctgcgcgat 420 ctgatcagct ccggcgtgac catccagatc atgacagagc aggagtccgg ctactgctgg 480 cggaacttcg tgaattattc tcctagcaac gaggcccact ggcctaggta cccacacctg 540 tgggtgcgcc tgtacgtgct ggagctgtat tgcatcatcc tgggcctgcc cccttgtctg 600 aatatcctgc ggagaaagca gccccagctg accttcttta caatcgccct gcagtcttgt 660 cactatcaga ggctgccacc ccacatcctg tgggccacag gcctgaagtc tggaggatct 720 agcggaggat cctctggcag cgagacacca ggaacaagcg agtcagcaac accagagagc 780 agtggcggca gcagcggcgg cagcgccaag aacacaatta caaagacact gaagctgagg 840 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 900 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 960 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 1020 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 1080 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 1140 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 1200 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 1260 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 1320 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 1380 ctggtgaagc agaaggggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 1440 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 1500 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 1560 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 1620 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 1680 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 1740 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1800 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1860 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1920 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1980 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 2040 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 2100 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagatgag 2160 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 2220 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 2280 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 2340 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagcccag cggcgggagc 2400 ggcgggagcg gggggagcac taatctgagc gacatcattg agaaggagac tgggaaacag 2460 ctggtcattc aggagtccat cctgatgctg cctgaggagg tggaggaagt gatcggcaac 2520 aagccagagt ctgacatcct ggtgcacacc gcctacgacg agtccacaga tgagaatgtg 2580 atgctgctga cctctgacgc ccccgagtat aagccttggg ccctggtcat ccaggattct 2640 aacggcgaga ataagatcaa gatgctgagc ggaggatccg gaggatctgg aggcagcacc 2700 aacctgtctg acatcatcga gaaggagaca ggcaagcagc tggtcatcca ggagagcatc 2760 ctgatgctgc ccgaagaagt cgaagaagtg atcggaaaca agcctgagag cgatatcctg 2820 gtccataccg cctacgacga gagtaccgac gaaaatgtga tgctgctgac atccgacgcc 2880 ccagagtata agccctgggc tctggtcatc caggattcca acggagagaa caaaatcaaa 2940 atgctgtctg gcggctcaaa aagaaccgcc gacggcagcg aattcgagcc caagaagaag 3000 aggaaagtc 3009 <210> 269 <211> 1586 <212> DNA <213> Artificial Sequence <220> <223> human codon-optimized Cas14a1 <400> 269 atggccaaga acacaattac aaagacactg aagctgagga tcgtgagacc atacaacagc 60 gctgaggtcg agaagattgt ggctgatgaa aagaacaaca gggaaaagat cgccctcgag 120 aagaacaagg ataaggtgaa ggaggcctgc tctaagcacc tgaaagtggc cgcctactgc 180 accacacagg tggagaggaa cgcctgtctg ttttgtaaag ctcggaagct ggatgataag 240 ttttaccaga agctgcgggg ccagttcccc gatgccgtct tttggcagga gattagcgag 300 atcttcagac agctgcagaa gcaggccgcc gagatctaca accagagcct gatcgagctc 360 tactacgaga tcttcatcaa gggcaagggc attgccaacg cctcctccgt ggagcactac 420 ctgagcgacg tgtgctacac aagagccgcc gagctcttta agaacgccgc tatcgcttcc 480 gggctgagga gcaagattaa gagtaacttc cggctcaagg agctgaagaa catgaagagc 540 ggcctgccca ctacaaagag cgacaacttc ccaattccac tggtgaagca gaaggggggc 600 cagtacacag ggttcgagat ttccaaccac aacagcgact ttattattaa gatccccttt 660 ggcaggtggc aggtcaagaa ggagattgac aagtacaggc cctgggagaa gtttgatttc 720 gagcaggtgc agaagagccc caagcctatt tccctgctgc tgtccacaca gcggcggaag 780 aggaacaagg ggtggtctaa ggatgagggg accgaggccg agattaagaa agtgatgaac 840 ggcgactacc agacaagcta catcgaggtc aagcggggca gtaagattgg cgagaagagc 900 gcctggatgc tgaacctgag cattgacgtg ccaaagattg ataagggcgt ggatcccagc 960 atcatcggag ggatcgatgt gggggtcaag agccccctcg tgtgcgccat caacaacgcc 1020 ttcagcaggt acagcatctc cgataacgac ctgttccact ttaacaagaa gatgttcgcc 1080 cggcggagga ttttgctcaa gaagaaccgg cacaagcggg ccggacacgg ggccaagaac 1140 aagctcaagc ccatcactat cctgaccgag aagagcgaga ggttcaggaa gaagctcatc 1200 gagagatggg cctgcgagat cgccgatttc tttattaaga acaaggtcgg aacagtgcag 1260 atggagaacc tcgagagcat gaagaggaag gaggattcct acttcaacat tcggctgagg 1320 gggttctggc cctacgctga gatgcagaac aagattgagt ttaagctgaa gcagtacggg 1380 attgagatcc ggaaggtggc ccccaacaac accagcaaga cctgcagcaa gtgcgggcac 1440 ctcaacaact acttcaactt cgagtaccgg aagaagaaca agttcccaca cttcaagtgc 1500 gagaagtgca actttaagga gaacgccgat tacaacgccg ccctgaacat cagcaaccct 1560 aagctgaaga gcactaagga ggagcc 1586 <210> 270 <211> 1607 <212> DNA <213> Artificial Sequence <220> <223> N-terminal NLS + Cas14a1 amino acid sequence <400> 270 atgccaaaga agaagcggaa agtcgccaag aacacaatta caaagacact gaagctgagg 60 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 120 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 180 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 240 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 300 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 360 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 420 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 480 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 540 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 600 ctggtgaagc agaaggggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 660 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 720 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 780 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 840 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 900 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 960 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1020 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1080 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1140 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1200 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 1260 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 1320 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagatgag 1380 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 1440 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 1500 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 1560 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagcc 1607 <210> 271 <211> 1607 <212> DNA <213> Artificial Sequence <220> <223> C-terminal NLS + Cas14a1 amino acid sequence <400> 271 atggccaaga acacaattac aaagacactg aagctgagga tcgtgagacc atacaacagc 60 gctgaggtcg agaagattgt ggctgatgaa aagaacaaca gggaaaagat cgccctcgag 120 aagaacaagg ataaggtgaa ggaggcctgc tctaagcacc tgaaagtggc cgcctactgc 180 accacacagg tggagaggaa cgcctgtctg ttttgtaaag ctcggaagct ggatgataag 240 ttttaccaga agctgcgggg ccagttcccc gatgccgtct tttggcagga gattagcgag 300 atcttcagac agctgcagaa gcaggccgcc gagatctaca accagagcct gatcgagctc 360 tactacgaga tcttcatcaa gggcaagggc attgccaacg cctcctccgt ggagcactac 420 ctgagcgacg tgtgctacac aagagccgcc gagctcttta agaacgccgc tatcgcttcc 480 gggctgagga gcaagattaa gagtaacttc cggctcaagg agctgaagaa catgaagagc 540 ggcctgccca ctacaaagag cgacaacttc ccaattccac tggtgaagca gaaggggggc 600 cagtacacag ggttcgagat ttccaaccac aacagcgact ttattattaa gatccccttt 660 ggcaggtggc aggtcaagaa ggagattgac aagtacaggc cctgggagaa gtttgatttc 720 gagcaggtgc agaagagccc caagcctatt tccctgctgc tgtccacaca gcggcggaag 780 aggaacaagg ggtggtctaa ggatgagggg accgaggccg agattaagaa agtgatgaac 840 ggcgactacc agacaagcta catcgaggtc aagcggggca gtaagattgg cgagaagagc 900 gcctggatgc tgaacctgag cattgacgtg ccaaagattg ataagggcgt ggatcccagc 960 atcatcggag ggatcgatgt gggggtcaag agccccctcg tgtgcgccat caacaacgcc 1020 ttcagcaggt acagcatctc cgataacgac ctgttccact ttaacaagaa gatgttcgcc 1080 cggcggagga ttttgctcaa gaagaaccgg cacaagcggg ccggacacgg ggccaagaac 1140 aagctcaagc ccatcactat cctgaccgag aagagcgaga ggttcaggaa gaagctcatc 1200 gagagatggg cctgcgagat cgccgatttc tttattaaga acaaggtcgg aacagtgcag 1260 atggagaacc tcgagagcat gaagaggaag gaggattcct acttcaacat tcggctgagg 1320 gggttctggc cctacgctga gatgcagaac aagattgagt ttaagctgaa gcagtacggg 1380 attgagatcc ggaaggtggc ccccaacaac accagcaaga cctgcagcaa gtgcgggcac 1440 ctcaacaact acttcaactt cgagtaccgg aagaagaaca agttcccaca cttcaagtgc 1500 gagaagtgca actttaagga gaacgccgat tacaacgccg ccctgaacat cagcaaccct 1560 aagctgaaga gcactaagga ggagccccaa agaagaagcg gaaagtc 1607 <210> 272 <211> 1628 <212> DNA <213> Artificial Sequence <220> <223> N/C-terminal NLS + Cas14a1 amino acid sequence <400> 272 atgccaaaga agaagcggaa agtcgccaag aacacaatta caaagacact gaagctgagg 60 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 120 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 180 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 240 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 300 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 360 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 420 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 480 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 540 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 600 ctggtgaagc agaaggggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 660 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 720 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 780 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 840 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 900 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 960 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1020 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1080 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1140 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1200 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 1260 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 1320 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagatgag 1380 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 1440 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 1500 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 1560 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagcccca aagaagaagc 1620 ggaaagtc 1628 <210> 273 <211> 3009 <212> DNA <213> Artificial Sequence <220> <223> Amino acid sequence (N terminal -Cytidine deaminase) <400> 273 atgccaaaga agaagcggaa agtctcctca gagactgggc ctgtcgccgt cgatccaacc 60 ctgcgccgcc ggattgaacc tcacgagttt gaagtgttct ttgacccccg ggagctgaga 120 aaggagacat gcctgctgta cgagatcaac tggggaggca ggcactccat ctggaggcac 180 acctctcaga acacaaataa gcacgtggag gtgaacttca tcgagaagtt taccacagag 240 cggtacttct gccccaatac cagatgtagc atcacatggt ttctgagctg gtccccttgc 300 ggagagtgta gcagggccat caccgagttc ctgtccagat atccacacgt gacactgttt 360 atctacatcg ccaggctgta tcaccacgca gacccaagga ataggcaggg cctgcgcgat 420 ctgatcagct ccggcgtgac catccagatc atgacagagc aggagtccgg ctactgctgg 480 cggaacttcg tgaattattc tcctagcaac gaggcccact ggcctaggta cccacacctg 540 tgggtgcgcc tgtacgtgct ggagctgtat tgcatcatcc tgggcctgcc cccttgtctg 600 aatatcctgc ggagaaagca gccccagctg accttcttta caatcgccct gcagtcttgt 660 cactatcaga ggctgccacc ccacatcctg tgggccacag gcctgaagtc tggaggatct 720 agcggaggat cctctggcag cgagacacca ggaacaagcg agtcagcaac accagagagc 780 agtggcggca gcagcggcgg cagcgccaag aacacaatta caaagacact gaagctgagg 840 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 900 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 960 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 1020 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 1080 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 1140 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 1200 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 1260 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 1320 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 1380 ctggtgaagc agaaggggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 1440 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 1500 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 1560 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 1620 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 1680 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 1740 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1800 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1860 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1920 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1980 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 2040 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 2100 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagatgag 2160 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 2220 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 2280 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 2340 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagcccag cggcgggagc 2400 ggcgggagcg gggggagcac taatctgagc gacatcattg agaaggagac tgggaaacag 2460 ctggtcattc aggagtccat cctgatgctg cctgaggagg tggaggaagt gatcggcaac 2520 aagccagagt ctgacatcct ggtgcacacc gcctacgacg agtccacaga tgagaatgtg 2580 atgctgctga cctctgacgc ccccgagtat aagccttggg ccctggtcat ccaggattct 2640 aacggcgaga ataagatcaa gatgctgagc ggaggatccg gaggatctgg aggcagcacc 2700 aacctgtctg acatcatcga gaaggagaca ggcaagcagc tggtcatcca ggagagcatc 2760 ctgatgctgc ccgaagaagt cgaagaagtg atcggaaaca agcctgagag cgatatcctg 2820 gtccataccg cctacgacga gagtaccgac gaaaatgtga tgctgctgac atccgacgcc 2880 ccagagtata agccctgggc tctggtcatc caggattcca acggagagaa caaaatcaaa 2940 atgctgtctg gcggctcaaa aagaaccgcc gacggcagcg aattcgagcc caagaagaag 3000 aggaaagtc 3009 <210> 274 <211> 3009 <212> DNA <213> Artificial Sequence <220> <223> Amino acid sequence (C terminal -Cytidine deaminase) <400> 274 atgccaaaga agaagcggaa agtcgccaag aacacaatta caaagacact gaagctgagg 60 atcgtgagac catacaacag cgctgaggtc gagaagattg tggctgatga aaagaacaac 120 agggaaaaga tcgccctcga gaagaacaag gataaggtga aggaggcctg ctctaagcac 180 ctgaaagtgg ccgcctactg caccacacag gtggagagga acgcctgtct gttttgtaaa 240 gctcggaagc tggatgataa gttttaccag aagctgcggg gccagttccc cgatgccgtc 300 ttttggcagg agattagcga gatcttcaga cagctgcaga agcaggccgc cgagatctac 360 aaccagagcc tgatcgagct ctactacgag atcttcatca agggcaaggg cattgccaac 420 gcctcctccg tggagcacta cctgagcgac gtgtgctaca caagagccgc cgagctcttt 480 aagaacgccg ctatcgcttc cgggctgagg agcaagatta agagtaactt ccggctcaag 540 gagctgaaga acatgaagag cggcctgccc actacaaaga gcgacaactt cccaattcca 600 ctggtgaagc agaaggggggg ccagtacaca gggttcgaga tttccaacca caacagcgac 660 tttattatta agatcccctt tggcaggtgg caggtcaaga aggagattga caagtacagg 720 ccctgggaga agtttgattt cgagcaggtg cagaagagcc ccaagcctat ttccctgctg 780 ctgtccacac agcggcggaa gaggaacaag gggtggtcta aggatgaggg gaccgaggcc 840 gagattaaga aagtgatgaa cggcgactac cagacaagct acatcgaggt caagcggggc 900 agtaagattg gcgagaagag cgcctggatg ctgaacctga gcattgacgt gccaaagatt 960 gataagggcg tggatcccag catcatcgga gggatcgatg tgggggtcaa gagccccctc 1020 gtgtgcgcca tcaacaacgc cttcagcagg tacagcatct ccgataacga cctgttccac 1080 tttaacaaga agatgttcgc ccggcggagg attttgctca agaagaaccg gcacaagcgg 1140 gccggacacg gggccaagaa caagctcaag cccatcacta tcctgaccga gaagagcgag 1200 aggttcagga agaagctcat cgagagatgg gcctgcgaga tcgccgattt ctttattaag 1260 aacaaggtcg gaacagtgca gatggagaac ctcgagagca tgaagaggaa ggaggattcc 1320 tacttcaaca ttcggctgag ggggttctgg ccctacgctg agatgcagaa caagatgag 1380 tttaagctga agcagtacgg gattgagatc cggaaggtgg cccccaacaa caccagcaag 1440 acctgcagca agtgcgggca cctcaacaac tacttcaact tcgagtaccg gaagaagaac 1500 aagttcccac acttcaagtg cgagaagtgc aactttaagg agaacgccga ttacaacgcc 1560 gccctgaaca tcagcaaccc taagctgaag agcactaagg aggagccctc tggaggatct 1620 agcggaggat cctctggcag cgagacacca ggaacaagcg agtcagcaac accagagagc 1680 agtggcggca gcagcggcgg cagctcctca gagactgggc ctgtcgccgt cgatccaacc 1740 ctgcgccgcc ggattgaacc tcacgagttt gaagtgttct ttgacccccg ggagctgaga 1800 aaggagacat gcctgctgta cgagatcaac tggggaggca ggcactccat ctggaggcac 1860 acctctcaga acacaaataa gcacgtggag gtgaacttca tcgagaagtt taccacagag 1920 cggtacttct gccccaatac cagatgtagc atcacatggt ttctgagctg gtccccttgc 1980 ggagagtgta gcagggccat caccgagttc ctgtccagat atccacacgt gacactgttt 2040 atctacatcg ccaggctgta tcaccacgca gacccaagga ataggcaggg cctgcgcgat 2100 ctgatcagct ccggcgtgac catccagatc atgacagagc aggagtccgg ctactgctgg 2160 cggaacttcg tgaattattc tcctagcaac gaggcccact ggcctaggta cccacacctg 2220 tgggtgcgcc tgtacgtgct ggagctgtat tgcatcatcc tgggcctgcc cccttgtctg 2280 aatatcctgc ggagaaagca gccccagctg accttcttta caatcgccct gcagtcttgt 2340 cactatcaga ggctgccacc ccacatcctg tgggccacag gcctgaagag cggcgggagc 2400 ggcgggagcg gggggagcac taatctgagc gacatcattg agaaggagac tgggaaacag 2460 ctggtcattc aggagtccat cctgatgctg cctgaggagg tggaggaagt gatcggcaac 2520 aagccagagt ctgacatcct ggtgcacacc gcctacgacg agtccacaga tgagaatgtg 2580 atgctgctga cctctgacgc ccccgagtat aagccttggg ccctggtcat ccaggattct 2640 aacggcgaga ataagatcaa gatgctgagc ggaggatccg gaggatctgg aggcagcacc 2700 aacctgtctg acatcatcga gaaggagaca ggcaagcagc tggtcatcca ggagagcatc 2760 ctgatgctgc ccgaagaagt cgaagaagtg atcggaaaca agcctgagag cgatatcctg 2820 gtccataccg cctacgacga gagtaccgac gaaaatgtga tgctgctgac atccgacgcc 2880 ccagagtata agccctgggc tctggtcatc caggattcca acggagagaa caaaatcaaa 2940 atgctgtctg gcggctcaaa aagaaccgcc gacggcagcg aattcgagcc caagaagaag 3000 aggaaagtc 3009 <210> 275 <211> 2823 <212> DNA <213> Artificial Sequence <220> <223> DNA sequence (N terminal -Adenine deaminase) <400> 275 atgtccgaag tcgagttttc ccatgagtac tggatgagac acgcattgac tctcgcaaag 60 agggcttggg atgaacgcga ggtgcccgtg ggggcagtac tcgtgcataa caatcgcgta 120 atcggcgaag gttggaatag gccgatcgga cgccacgacc ccactgcaca tgcggaaatc 180 atggcccttc gacagggagg gcttgtgatg cagaattatc gacttatcga tgcgacgctg 240 tacgtcacgc ttgaaccttg cgtaatgtgc gcgggagcta tgattcactc ccgcattgga 300 cgagttgtat tcggtgcccg cgacgccaag acgggtgccg caggttcact gatggacgtg 360 ctgcatcacc caggcatgaa ccaccgggta gaaatcacag aaggcatatt ggcggacgaa 420 tgtgcggcgc tgttgtccga cttttttcgc atgcggaggc aggagatcaa ggcccagaaa 480 aaagcacaat cctctactga ctctggtggt tcttctggtg gttctagcgg cagcgagact 540 cccgggacct cagagtccgc cacacccgaa agttctggtg gttcttctgg tggttcttcc 600 gaagtcgagt tttcccatga gtactggatg agacacgcat tgactctcgc aaagagggct 660 cgagatgaac gcgaggtgcc cgtgggggca gtactcgtgc tcaacaatcg cgtaatcggc 720 gaaggttgga atagggcaat cggactccac gaccccactg cacatgcgga aatcatggcc 780 cttcgacagg gagggcttgt gatgcagaat tatcgactta tcgatgcgac gctgtacgtc 840 acgtttgaac cttgcgtaat gtgcgcggga gctatgattc actcccgcat tggacgagtt 900 gtattcggtg ttcgcaacgc caagacgggt gccgcaggtt cactgatgga cgtgctgcat 960 tacccaggca tgaaccaccg ggtagaaatc acagaaggca tattggcgga cgaatgtgcg 1020 gcgctgttgt gttacttttt tcgcatgccc aggcaggtct ttaacgccca gaaaaaagca 1080 caatcctcta ctgactctgg tggttcttct ggtggttcta gcggcagcga gactcccggg 1140 acctcagagt ccgccacacc cgaaagttct ggtggttctt ctggtggttc tgccaagaac 1200 acaattacaa agacactgaa gctgaggatc gtgagaccat acaacagcgc tgaggtcgag 1260 aagattgtgg ctgatgaaaa gaacaacagg gaaaagatcg ccctcgagaa gaacaaggat 1320 aaggtgaagg aggcctgctc taagcacctg aaagtggccg cctactgcac cacacaggtg 1380 gagaggaacg cctgtctgtt ttgtaaagct cggaagctgg atgataagtt ttaccagaag 1440 ctgcggggcc agttccccga tgccgtcttt tggcaggaga ttagcgagat cttcagacag 1500 ctgcagaagc aggccgccga gatctacaac cagagcctga tcgagctcta ctacgagatc 1560 ttcatcaagg gcaagggcat tgccaacgcc tcctccgtgg agcactacct gagcgacgtg 1620 tgctacacaa gagccgccga gctctttaag aacgccgcta tcgcttccgg gctgaggagc 1680 aagattaaga gtaacttccg gctcaaggag ctgaagaaca tgaagagcgg cctgcccact 1740 acaaagagcg acaacttccc aattccactg gtgaagcaga aggggggcca gtacacaggg 1800 ttcgagattt ccaaccacaa cagcgacttt attattaaga tcccctttgg caggtggcag 1860 gtcaagaagg agattgacaa gtacaggccc tgggagaagt ttgatttcga gcaggtgcag 1920 aagagcccca agcctatttc cctgctgctg tccacacagc ggcggaagag gaacaagggg 1980 tggtctaagg atgaggggac cgaggccgag attaagaaag tgatgaacgg cgactaccag 2040 acaagctaca tcgaggtcaa gcggggcagt aagattggcg agaagagcgc ctggatgctg 2100 aacctgagca ttgacgtgcc aaagattgat aagggcgtgg atcccagcat catcggaggg 2160 atcgatgtgg gggtcaagag ccccctcgtg tgcgccatca acaacgcctt cagcaggtac 2220 agcatctccg ataacgacct gttccacttt aacaagaaga tgttcgcccg gcggaggatt 2280 ttgctcaaga agaaccggca caagcgggcc ggacacgggg ccaagaacaa gctcaagccc 2340 atcactatcc tgaccgagaa gagcgagagg ttcaggaaga agctcatcga gagatgggcc 2400 tgcgagatcg ccgatttctt tattaagaac aaggtcggaa cagtgcagat ggagaacctc 2460 gagagcatga agaggaagga ggattcctac ttcaacattc ggctgagggg gttctggccc 2520 tacgctgaga tgcagaacaa gattgagttt aagctgaagc agtacgggat tgagatccgg 2580 aaggtggccc ccaacaacac cagcaagacc tgcagcaagt gcgggcacct caacaactac 2640 ttcaacttcg agtaccggaa gaagaacaag ttcccacact tcaagtgcga gaagtgcaac 2700 tttaaggaga acgccgatta caacgccgcc ctgaacatca gcaaccctaa gctgaagagc 2760 actaaggagg agcccaaaag gccggcggcc acgaaaaagg ccggccaggc aaaaaagaaa 2820 aag 2823 <210> 276 <211> 2823 <212> DNA <213> Artificial Sequence <220> <223> DNA sequence (C terminal -Adenine deaminase) <400> 276 atggccaaga acacaattac aaagacactg aagctgagga tcgtgagacc atacaacagc 60 gctgaggtcg agaagattgt ggctgatgaa aagaacaaca gggaaaagat cgccctcgag 120 aagaacaagg ataaggtgaa ggaggcctgc tctaagcacc tgaaagtggc cgcctactgc 180 accacacagg tggagaggaa cgcctgtctg ttttgtaaag ctcggaagct ggatgataag 240 ttttaccaga agctgcgggg ccagttcccc gatgccgtct tttggcagga gattagcgag 300 atcttcagac agctgcagaa gcaggccgcc gagatctaca accagagcct gatcgagctc 360 tactacgaga tcttcatcaa gggcaagggc attgccaacg cctcctccgt ggagcactac 420 ctgagcgacg tgtgctacac aagagccgcc gagctcttta agaacgccgc tatcgcttcc 480 gggctgagga gcaagattaa gagtaacttc cggctcaagg agctgaagaa catgaagagc 540 ggcctgccca ctacaaagag cgacaacttc ccaattccac tggtgaagca gaaggggggc 600 cagtacacag ggttcgagat ttccaaccac aacagcgact ttattattaa gatccccttt 660 ggcaggtggc aggtcaagaa ggagattgac aagtacaggc cctgggagaa gtttgatttc 720 gagcaggtgc agaagagccc caagcctatt tccctgctgc tgtccacaca gcggcggaag 780 aggaacaagg ggtggtctaa ggatgagggg accgaggccg agattaagaa agtgatgaac 840 ggcgactacc agacaagcta catcgaggtc aagcggggca gtaagattgg cgagaagagc 900 gcctggatgc tgaacctgag cattgacgtg ccaaagattg ataagggcgt ggatcccagc 960 atcatcggag ggatcgatgt gggggtcaag agccccctcg tgtgcgccat caacaacgcc 1020 ttcagcaggt acagcatctc cgataacgac ctgttccact ttaacaagaa gatgttcgcc 1080 cggcggagga ttttgctcaa gaagaaccgg cacaagcggg ccggacacgg ggccaagaac 1140 aagctcaagc ccatcactat cctgaccgag aagagcgaga ggttcaggaa gaagctcatc 1200 gagagatggg cctgcgagat cgccgatttc tttattaaga acaaggtcgg aacagtgcag 1260 atggagaacc tcgagagcat gaagaggaag gaggattcct acttcaacat tcggctgagg 1320 gggttctggc cctacgctga gatgcagaac aagattgagt ttaagctgaa gcagtacggg 1380 attgagatcc ggaaggtggc ccccaacaac accagcaaga cctgcagcaa gtgcgggcac 1440 ctcaacaact acttcaactt cgagtaccgg aagaagaaca agttcccaca cttcaagtgc 1500 gagaagtgca actttaagga gaacgccgat tacaacgccg ccctgaacat cagcaaccct 1560 aagctgaaga gcactaagga ggagccctct ggaggatcta gcggaggatc ctctggcagc 1620 gagacaccag gaacaagcga gtcagcaaca ccagagagca gtggcggcag cagcggcggc 1680 agctccgaag tcgagttttc ccatgagtac tggatgagac acgcattgac tctcgcaaag 1740 agggcttggg atgaacgcga ggtgcccgtg ggggcagtac tcgtgcataa caatcgcgta 1800 atcggcgaag gttggaatag gccgatcgga cgccacgacc ccactgcaca tgcggaaatc 1860 atggcccttc gacagggagg gcttgtgatg cagaattatc gacttatcga tgcgacgctg 1920 tacgtcacgc ttgaaccttg cgtaatgtgc gcgggagcta tgattcactc ccgcattgga 1980 cgagttgtat tcggtgcccg cgacgccaag acgggtgccg caggttcact gatggacgtg 2040 ctgcatcacc caggcatgaa ccaccgggta gaaatcacag aaggcatatt ggcggacgaa 2100 tgtgcggcgc tgttgtccga cttttttcgc atgcggaggc aggagatcaa ggcccagaaa 2160 aaagcacaat cctctactga ctctggtggt tcttctggtg gttctagcgg cagcgagact 2220 cccgggacct cagagtccgc cacacccgaa agttcaggtg gatcttcagg tggatcttcg 2280 gaagtggaat tttcgcacga gtattggatg aggcacgctt taactctcgc taagagagca 2340 cgagacgaac gggaagtgcc ggttggggct gtcctcgtac tcaataatcg agttatcgga 2400 gaaggctgga acagggcaat cggactccac gatcccacag ctcatgccga gataatggcg 2460 cttcgacaag gaggcctagt catgcaaaat tatcgtctta ttgacgcgac cctctacgtg 2520 acctttgagc catgcgttat gtgtgcgggt gcaatgatac attcccggat aggacgtgta 2580 gtatttggag ttcgcaacgc gaagaccggt gcggctggtt ctctcatgga tgtcctgcac 2640 taccctggga tgaatcaccg cgttgaaatc actgaaggca ttttggccga tgaatgcgcg 2700 gccctgttat gttacttttt tcgcatgccc aggcaggtct ttaacgcaca gaagaaagcc 2760 caatcgtcca ctgataaaag gccggcggcc acgaaaaagg ccggccaggc aaaaaagaaa 2820 aag 2823 <210> 277 <211> 7 <212> PRT <213> simian virus 40 <400> 277 Pro Lys Lys Lys Arg Lys Val 1 5 <210> 278 <211> 16 <212> PRT <213> mus musculus <400> 278 Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 1 5 10 15 <210> 279 <211> 9 <212> PRT <213> homo sapiens <400> 279 Pro Ala Ala Lys Arg Val Lys Leu Asp 1 5 <210> 280 <211> 11 <212> PRT <213> homo sapiens <400> 280 Arg Gln Arg Arg Asn Glu Leu Lys Arg Ser Pro 1 5 10 <210> 281 <211> 38 <212> PRT <213> mus musculus <400> 281 Asn Gln Ser Ser Asn Phe Gly Pro Met Lys Gly Gly Asn Phe Gly Gly 1 5 10 15 Arg Ser Ser Gly Pro Tyr Gly Gly Gly Gly Gln Tyr Phe Ala Lys Pro 20 25 30 Arg Asn Gln Gly Gly Tyr 35 <210> 282 <211> 42 <212> PRT <213> Homo sapiens <400> 282 Arg Met Arg Ile Glx Phe Lys Asn Lys Gly Lys Asp Thr Ala Glu Leu 1 5 10 15 Arg Arg Arg Arg Val Glu Val Ser Val Glu Leu Arg Lys Ala Lys Lys 20 25 30 Asp Glu Gln Ile Leu Lys Arg Arg Asn Val 35 40 <210> 283 <211> 8 <212> PRT <213> Homo sapiens <400> 283 Val Ser Arg Lys Arg Pro Arg Pro 1 5 <210> 284 <211> 8 <212> PRT <213> homo sapiens <400> 284 Pro Pro Lys Lys Ala Arg Glu Asp 1 5 <210> 285 <211> 8 <212> PRT <213> homo sapiens <400> 285 Pro Gln Pro Lys Lys Lys Pro Leu 1 5 <210> 286 <211> 12 <212> PRT <213> mus musculus <400> 286 Ser Ala Leu Ile Lys Lys Lys Lys Lys Met Ala Pro 1 5 10 <210> 287 <211> 5 <212> PRT <213> Influenza virus <400> 287 Asp Arg Leu Arg Arg 1 5 <210> 288 <211> 7 <212> PRT <213> influenza virus <400> 288 Pro Lys Gln Lys Lys Arg Lys 1 5 <210> 289 <211> 10 <212> PRT <213> Hepatitis D virus <400> 289 Arg Lys Leu Lys Lys Lys Ile Lys Lys Leu 1 5 10 <210> 290 <211> 10 <212> PRT <213> mus musculus <400> 290 Arg Glu Lys Lys Lys Phe Leu Lys Arg Arg 1 5 10 <210> 291 <211> 20 <212> PRT <213> homo sapiens <400> 291 Lys Arg Lys Gly Asp Glu Val Asp Gly Val Asp Glu Val Ala Lys Lys 1 5 10 15 Lys Ser Lys Lys 20 <210> 292 <211> 17 <212> PRT <213> Homo sapiens <400> 292 Arg Lys Cys Leu Gln Ala Gly Met Asn Leu Glu Ala Arg Lys Thr Lys 1 5 10 15 Lys <210> 293 <211> 29 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 293 aacaaauuca uuuugaaacg aaugaagga 29 <210> 294 <211> 31 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 294 aacaaauuca uuuuugaaaa cgaaugaagg a 31 <210> 295 <211> 33 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 295 aacaaauuca uuuuucgaaa gacgaaugaa gga 33 <210> 296 <211> 35 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 296 aacaaauuca uuuuuccgaa aagacgaaug aagga 35 <210> 297 <211> 37 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 297 aacaaauuca uuuuuccuga aauagacgaa ugaagga 37 <210> 298 <211> 39 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 298 aacaaauuca uuuuuccucg aaaauagacg aaugaagga 39 <210> 299 <211> 41 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 299 aacaaauuca uuuuuccucu gaaaaauaga cgaaugaagg a 41 <210> 300 <211> 43 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 300 aacaaauuca uuuuuccucu cgaaagaaua gacgaaugaa gga 43 <210> 301 <211> 45 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 301 aacaaauuca uuuuuccucu ccgaaacgaa uagacgaaug aagga 45 <210> 302 <211> 47 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 302 aacaaauuca uuuuuccucu ccagaaaccg aauagacgaa ugaagga 47 <210> 303 <211> 49 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 303 aacaaauuca uuuuuccucu ccaagaaacc cgaauagacg aaugaagga 49 <210> 304 <211> 51 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 304 aacaaauuca uuuuuccucu ccaaugaaaa cccgaauaga cgaaugaagg a 51 <210> 305 <211> 53 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 305 aacaaauuca uuuuuccucu ccaauugaaa aacccgaaua gacgaaugaa gga 53 <210> 306 <211> 55 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 306 aacaaauuca uuuuuccucu ccaauucgaa agaacccgaa uagacgaaug aagga 55 <210> 307 <211> 57 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 307 aacaaauuca uuuuuccucu ccaauucuga aaagaacccg aauagacgaa ugaagga 57 <210> 308 <211> 59 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 308 aacaaauuca uuuuuccucu ccaauucugg aaacagaacc cgaauagacg aaugaagga 59 <210> 309 <211> 61 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 309 aacaaauuca uuuuuccucu ccaauucugc gaaagcagaa cccgaauaga cgaaugaagg 60 a 61 <210> 310 <211> 63 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 310 aacaaauuca uuuuuccucu ccaauucugc agaaaugcag aacccgaaua gacgaaugaa 60 gga 63 <210> 311 <211> 65 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 311 aacaaauuca uuuuuccucu ccaauucugc acgaaauugc agaacccgaa uagacgaaug 60 aagga 65 <210> 312 <211> 67 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 312 aacaaauuca uuuuuccucu ccaauucugc acagaaaguu gcagaacccg aauagacgaa 60 ugaagga 67 <210> 313 <211> 66 <212> RNA <213> Artificial Sequence <220> <223> engineered (4th region + Linker + 5th region) of WT gRNA <400> 313 aacaaauuca uuuuuccucu ccaauucugc acagaaauug cagaacccga auagacgaau 60 gaagga 66 <210> 314 <211> 68 <212> RNA <213> Artificial Sequence <220> <223> (4th region + Linker + 5th region) of WT gRNA <400> 314 aacaaauuca uuuuuccucu ccaauucugc acaagaaagu ugcagaaccc gaauagacga 60 augaagga 68 <210> 315 <211> 202 <212> RNA <213> Artificial Sequence <220> <223> wild-type of Cas12f1 tracrRNA + GAAA + wild-type of Cas12f1 crRNA repeat <400> 315 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu uuccucucca auucugcaca agaaaguugc agaacccgaa 180 uagacgaaug aaggaaugca ac 202 <210> 316 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 316 caagaaaguu 10 <210> 317 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 317 acaagaaagu ug 12 <210> 318 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 318 cacaagaaag uugc 14 <210> 319 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 319 gcacaagaaa guugca 16 <210> 320 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 320 ugcacaagaa aguugcag 18 <210> 321 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 321 cugcacaaga aaguugcaga 20 <210> 322 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 322 ucugcacaag aaaguugcag aa 22 <210> 323 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 323 uucugcacaa gaaaguugca gaac 24 <210> 324 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 324 auucugcaca agaaaguugc agaacc 26 <210> 325 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 325 aauucugcac aagaaaguug cagaaccc 28 <210> 326 <211> 30 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 326 caauucugca caagaaaguu gcagaacccg 30 <210> 327 <211> 32 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 327 ccaauucugc acaagaaagu ugcagaaccc ga 32 <210> 328 <211> 34 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 328 uccaauucug cacaagaaag uugcagaacc cgaa 34 <210> 329 <211> 36 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 329 cuccaauucu gcacaagaaa guugcagaac ccgaau 36 <210> 330 <211> 38 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 330 ucuccaauuc ugcacaagaa aguugcagaa cccgaaua 38 <210> 331 <211> 40 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 331 cucuccaauu cugcacaaga aaguugcaga acccgaauag 40 <210> 332 <211> 42 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 332 ccucuccaau ucugcacaag aaaguugcag aacccgaaua ga 42 <210> 333 <211> 44 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 333 ucucuccaa uucugcacaa gaaaguugca gaacccgaau agac 44 <210> 334 <211> 46 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 334 uuccucucca auucugcaca agaaaguugc agaacccgaa uagacg 46 <210> 335 <211> 48 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 335 uuuccucucc aauucugcac aagaaaguug cagaacccga auagacga 48 <210> 336 <211> 50 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 336 uuuuccucuc caauucugca caagaaaguu gcagaacccg aauagacgaa 50 <210> 337 <211> 52 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 337 uuuuuccucu ccaauucugc acaagaaagu ugcagaaccc gaauagacga au 52 <210> 338 <211> 54 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 338 auuuuuccuc uccaauucug cacaagaaag uugcagaacc cgaauagacg aaug 54 <210> 339 <211> 56 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 339 cauuuuuccu cuccaauucu gcacaagaaa guugcagaac ccgaauagac gaauga 56 <210> 340 <211> 58 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 340 ucauuuuucc ucuccaauuc ugcacaagaa aguugcagaa cccgaauaga cgaaugaa 58 <210> 341 <211> 57 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 341 57 ucauuuuucc ucuccaauuc ugcacaagaa aguugcagaa cccgaauaga cgaauga 57 <210> 342 <211> 59 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, WT <400> 342 uucauuuuuc cucuccaauu cugcacaaga aaguugcaga acccgaauag acgaaugaa 59 <210> 343 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 343 uucauuuuuc 10 <210> 344 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 344 uucauuuuuc c 11 <210> 345 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 345 uucauuuuuc cu 12 <210> 346 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 346 cuc 13 <210> 347 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 347 cucu 14 <210> 348 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 348 uucauuuuuc cucuc 15 <210> 349 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 349 uucauuuuuc cucucc 16 <210> 350 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 350 uucauuuuuc cucucca 17 <210> 351 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 351 uucauuuuuc cucuccaa 18 <210> 352 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 352 uucauuuuuc cucuccaau 19 <210> 353 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 353 uucauuuuuc cucuccaauu 20 <210> 354 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 354 uucauuuuuc cucuccaauu c 21 <210> 355 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 355 uucauuuuuc cucuccaauu cu 22 <210> 356 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 356 uucauuuuuc cucuccaauu cug 23 <210> 357 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 357 uucauuuuuc cucuccaauu cugc 24 <210> 358 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 358 uucauuuuuc cucuccaauu cugca 25 <210> 359 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 359 uucauuuuuc cucuccaauu cugcac 26 <210> 360 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 360 uucauuuuuc cucuccaauu cugcaca 27 <210> 361 <211> 28 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 4th region, WT <400> 361 uucauuuuuc cucuccaauu cugcacaa 28 <210> 362 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 362 gacgaaugaa 10 <210> 363 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 363 agacgaauga a 11 <210> 364 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 364 uagacgaaug aa 12 <210> 365 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 365 auagacgaau gaa 13 <210> 366 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 366 aauagacgaa ugaa 14 <210> 367 <211> 15 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 367 gaauagacga augaa 15 <210> 368 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 368 cgaauagacg aaugaa 16 <210> 369 <211> 17 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 369 ccgaauagac gaaugaa 17 <210> 370 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 370 cccgaauaga cgaaugaa 18 <210> 371 <211> 19 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 371 acccgaauag acgaaugaa 19 <210> 372 <211> 20 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 372 aacccgaaua gacgaaugaa 20 <210> 373 <211> 21 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 373 gaacccgaau agacgaauga a 21 <210> 374 <211> 22 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 374 agaacccgaa uagacgaaug aa 22 <210> 375 <211> 23 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 375 cagaacccga auagacgaau gaa 23 <210> 376 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 376 gcagaacccg aauagacgaa ugaa 24 <210> 377 <211> 25 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 377 ugcagaaccc gaauagacga augaa 25 <210> 378 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 378 uugcagaacc cgaauagacg aaugaa 26 <210> 379 <211> 27 <212> RNA <213> Artificial Sequence <220> <223> deleted part of 5th region, WT <400> 379 guugcagaac ccgaauagac gaaugaa 27 <210> 380 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> Comparative Example 1.1 <400> 380 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa ccacacacac agugggcuac 180 c 181 <210> 381 <211> 174 <212> RNA <213> Artificial Sequence <220> <223> Example 1.1 <400> 381 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caaccacaca cacagugggc uacc 174 <210> 382 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 1.2 <400> 382 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaaccac acacacagug ggcuacc 167 <210> 383 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> Example 1.3 <400> 383 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa ccacacacac agugggcuac c 161 <210> 384 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> Example 1.4 <400> 384 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaaccacac acacaguggg cuacc 175 <210> 385 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> Example 1.5 <400> 385 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac cacacacaca gugggcuacc 170 <210> 386 <211> 158 <212> RNA <213> Artificial Sequence <220> <223> Example 1.6 <400> 386 cuucacugau aaaguggaga accgcuucac cauuaugag ugaagguggg cugcuugcau 60 cagccuaaug ucgagaagug cuuucuucgg aaaguaaccc ucgaaacaaa uucauuugaa 120 agaaugaagg aaugcaacca cacacacagu gggcuacc 158 <210> 387 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> Example 1.7 <400> 387 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaaccac acacacagug ggcuacc 177 <210> 388 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> Example 1.8 <400> 388 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aaccacacac acagugggcu acc 173 <210> 389 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 1.9 <400> 389 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaaccac acacacagug ggcuacc 167 <210> 390 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> Example 1.10 <400> 390 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac 120 cacacacaca gugggcuacc 140 <210> 391 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> Example 1.11 <400> 391 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaaccac acacacagug ggcuacc 147 <210> 392 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> Example 1.12 <400> 392 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa 120 ugcaaccaca cacacagugg gcuacc 146 <210> 393 <211> 126 <212> RNA <213> Artificial Sequence <220> <223> Example 1.13 <400> 393 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa ugcaaccaca cacacagugg 120 gcuacc 126 <210> 394 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> Comparative Example 2.1 <400> 394 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa ccauccccag gacacacaca 180 c 181 <210> 395 <211> 174 <212> RNA <213> Artificial Sequence <220> <223> Example 2.1 <400> 395 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caaccauccc caggacacac acac 174 <210> 396 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 2.2 <400> 396 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaaccau ccccaggaca cacacac 167 <210> 397 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> Example 2.3 <400> 397 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa ccauccccag gacacacaca c 161 <210> 398 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> Example 2.4 <400> 398 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaaccaucc ccaggacaca cacac 175 <210> 399 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> Example 2.5 <400> 399 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac cauccccagg acacacacac 170 <210> 400 <211> 160 <212> RNA <213> Artificial Sequence <220> <223> Example 2.6 <400> 400 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug 120 aaagaaugaa ggaaugcaac cauccccagg acacacacac 160 <210> 401 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> Example 2.7 <400> 401 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaaccau ccccaggaca cacacac 177 <210> 402 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> Example 2.8 <400> 402 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aaccaucccc aggacacaca cac 173 <210> 403 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 2.9 <400> 403 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaaccau ccccaggaca cacacac 167 <210> 404 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> Example 2.10 <400> 404 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac 120 cauccccagg acacacacac 140 <210> 405 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> Example 2.11 <400> 405 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaaccau ccccaggaca cacacac 147 <210> 406 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> Example 2.12 <400> 406 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa 120 ugcaaccauc cccaggacac acacac 146 <210> 407 <211> 126 <212> RNA <213> Artificial Sequence <220> <223> Example 2.13 <400> 407 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa ugcaaccauc cccaggacac 120 acacac 126 <210> 408 <211> 181 <212> RNA <213> Artificial Sequence <220> <223> Comparative Example 3.1 <400> 408 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauuu gaaagaauga aggaaugcaa cagaacacau accccugggc 180 c 181 <210> 409 <211> 174 <212> RNA <213> Artificial Sequence <220> <223> Example 3.1 <400> 409 gauaaagugg agaaccgcuu caccaaaagc ugucccuuag gggauuagaa cuugagugaa 60 ggugggcugc uugcaucagc cuaaugucga gaagugcuuu cuucggaaag uaacccucga 120 aacaaauuca uuugaaagaa ugaaggaaug caacagaaca cauaccccug ggcc 174 <210> 410 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 3.2 <400> 410 uggagaaccg cuucaccaaa agcugucccu uaggggauua gaacuugagu gaaggugggc 60 ugcuugcauc agccuaaugu cgagaagugc uuucuucgga aaguaacccu cgaaacaaau 120 ucauuugaaa gaaugaagga augcaacaga acacauaccc cugggcc 167 <210> 411 <211> 161 <212> RNA <213> Artificial Sequence <220> <223> Example 3.3 <400> 411 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaauucauuu 120 gaaagaauga aggaaugcaa cagaacacau accccugggc c 161 <210> 412 <211> 175 <212> RNA <213> Artificial Sequence <220> <223> Example 3.4 <400> 412 cuucacugau aaaguggaga accgcuucac caaaagcugu uuagauuaga acuugaguga 60 aggugggcug cuugcaucag ccuaaugucg agaagugcuu ucuucggaaa guaacccucg 120 aaacaaauuc auuugaaaga augaaggaau gcaacagaac acauaccccu gggcc 175 <210> 413 <211> 170 <212> RNA <213> Artificial Sequence <220> <223> Example 3.5 <400> 413 cuucacugau aaaguggaga accgcuucac caaaagcuuu agagaacuug agugaaggug 60 ggcugcuugc aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca 120 aauucauuug aaagaaugaa ggaaugcaac agaacacaua ccccugggcc 170 <210> 414 <211> 160 <212> RNA <213> Artificial Sequence <220> <223> Example 3.6 <400> 414 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug 120 aaagaaugaa ggaaugcaac agaacacaua ccccugggcc 160 <210> 415 <211> 177 <212> RNA <213> Artificial Sequence <220> <223> Example 3.7 <400> 415 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucauga aaaugaagga augcaacaga acacauaccc cugggcc 177 <210> 416 <211> 173 <212> RNA <213> Artificial Sequence <220> <223> Example 3.8 <400> 416 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaauucgaaa gaaggaaugc aacagaacac auaccccugg gcc 173 <210> 417 <211> 167 <212> RNA <213> Artificial Sequence <220> <223> Example 3.9 <400> 417 cuucacugau aaaguggaga accgcuucac caaaagcugu cccuuagggg auuagaacuu 60 gagugaaggu gggcugcuug caucagccua augucgagaa gugcuuucuu cggaaaguaa 120 cccucgaaac aaagaaagga augcaacaga acacauaccc cugggcc 167 <210> 418 <211> 140 <212> RNA <213> Artificial Sequence <220> <223> Example 3.10 <400> 418 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aauucauuug aaagaaugaa ggaaugcaac 120 agaacacaua ccccugggcc 140 <210> 419 <211> 147 <212> RNA <213> Artificial Sequence <220> <223> Example 3.11 <400> 419 accgcuucac caaaagcugu cccuuagggg auuagaacuu gagugaaggu gggcugcuug 60 caucagccua augucgagaa gugcuuucuu cggaaaguaa cccucgaaac aaagaaagga 120 augcaacaga acacauaccc cugggcc 147 <210> 420 <211> 146 <212> RNA <213> Artificial Sequence <220> <223> Example 3.12 <400> 420 cuucacugau aaaguggaga accgcuucac caauuaguug agugaaggug ggcugcuugc 60 aucagccuaa ugucgagaag ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa 120 ugcaacagaa cacauacccc ugggcc 146 <210> 421 <211> 126 <212> RNA <213> Artificial Sequence <220> <223> Example 3.13 <400> 421 accgcuucac caauuaguug agugaaggug ggcugcuugc aucagccuaa ugucgagaag 60 ugcuuucuuc ggaaaguaac ccucgaaaca aagaaaggaa ugcaacagaa cacauacccc 120 ugggcc 126 <210> 422 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> protospacer sequence for targeting DY2 <400> 422 cacacacaca gtgggctacc 20 <210> 423 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> protospacer sequence for targeting DY10 <400> 423 catccccagg acacacacac 20 <210> 424 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> protospacer sequence for targeting Intergenic-22 <400> 424 agaacacata cccctgggcc 20 <210> 425 <211> 10 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 425 uucgaaagaa 10 <210> 426 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 426 uucagaaaug aa 12 <210> 427 <211> 14 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 427 uucaugaaaa ugaa 14 <210> 428 <211> 16 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 428 uucauugaaa aaugaa 16 <210> 429 <211> 18 <212> RNA <213> Artificial Sequence <220> <223> Linker variation, MF <400> 429 uucauuugaa agaaugaa 18 <210> 430 <211> 24 <212> RNA <213> Artificial Sequence <220> <223> Second region, 13bp deletion <400> 430 ccgcuucacu uagagugaag gugg 24 <210> 431 <211> 26 <212> RNA <213> Artificial Sequence <220> <223> Second region, 12bp deletion <400> 431 ccgcuucacc uuaggaguga aggugg 26 <210> 432 <211> 9 <212> RNA <213> Artificial Sequence <220> <223> second region, 5' remains <400> 432 ccgcuucac 9 <210> 433 <211> 11 <212> RNA <213> Artificial Sequence <220> <223> second region, 3' remains <400> 433 agugaaggug g 11 <210> 434 <211> 29 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region <400> 434 aaaagcuguc ccuuagggga uuagaacuu 29 <210> 435 <211> 31 <212> RNA <213> Artificial Sequence <220> <223> intermediate sequence of the second region <400> 435 caaaagcugu cccuuagggg auuagaacuu g 31 <210> 436 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> Example 1.14 <400> 436 accgcuucac uuagagugaa ggugggcugc uugcaucagc cuaaugucga gaagugcuuu 60 cuucggaaag uaacccucga aacaaagaaa ggaaugcaac cacacacaca gugggcuacc 120 120 <210> 437 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> Example 2.14 <400> 437 accgcuucac uuagagugaa ggugggcugc uugcaucagc cuaaugucga gaagugcuuu 60 cuucggaaag uaacccucga aacaaagaaa ggaaugcaac cauccccagg acacacacac 120 120 <210> 438 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> Example 3.14 <400> 438 accgcuucac uuagagugaa ggugggcugc uugcaucagc cuaaugucga gaagugcuuu 60 cuucggaaag uaacccucga aacaaagaaa ggaaugcaac agaacacaua ccccugggcc 120 120 <210> 439 <211> 120 <212> RNA <213> Artificial Sequence <220> <223> engineered sgRNA <400> 439 accgcuucac uuagagugaa ggugggcugc uugcaucagc cuaaugucga gaagugcuuu 60 cuucggaaag uaacccucga aacaaagaaa ggaaugcaac nnnnnnnnnn nnnnnnnnnn 120 120 <210> 440 <211> 12 <212> RNA <213> Artificial Sequence <220> <223> Upper deleted part of second region <400> 440 aaaagcuguc cc 12 <210> 441 <211> 13 <212> RNA <213> Artificial Sequence <220> <223> Lower deleted part of second region <400> 441 caaaagcugu ccc 13

Claims (33)

다음을 포함하는, CRISPR/Cas12f1 시스템을 위한 엔지니어링 된 가이드 RNA:
엔지니어링 된 스캐폴드 영역; 및
스페이서;
이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,
상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 하며,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및
5'-AUGCAAC-3'.
Engineered guide RNAs for the CRISPR/Cas12f1 system, including:
engineered scaffold area; and
spacer;
At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,
The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,
The sequence of the engineered scaffold region is characterized in that it differs from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 7),
The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3',5'-GUGGAGAA-3',5'-AGUGGAGAA-3',5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5'-UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and
5'-AUGCAAC-3'.
다음을 포함하는, CRISPR/Cas12f1 시스템을 위한 엔지니어링 된 가이드 RNA:
엔지니어링 된 스캐폴드 영역; 및
스페이서; 및
이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,
상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3'(서열번호 315)과 상이한 것을 특징으로 하며,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3'(서열번호 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3'(서열번호 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGA-3'(서열번호 295), 5'-AACAAAUUCAUUUUUCCGAAAAGACGAAUGAAGGA-3'(서열번호 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3'(서열번호 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3'(서열번호 298), 5'-AACAAAUUCAUUUUUCCUCUGAAAAAUAGACGAAUGAAGGA-3'(서열번호 299), 5'-AACAAAUUCAUUUUUCCUCUCGAAAGAAUAGACGAAUGAAGGA-3'(서열번호 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3'(서열번호 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3'(서열번호 302), 5'-AACAAAUUCAUUUUUCCUCUCCAAGAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 303), 5'-AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 312), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 313), 및 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 314)로 이뤄진 군에서 선택된 서열; 및
5'-AUGCAAC-3'.
Engineered guide RNAs for the CRISPR/Cas12f1 system, including:
engineered scaffold area; and
spacer; and
At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,
The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,
The sequence of the engineered scaffold region is 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUUCCUCUCCCAAAAUUCGAGAACCAAUGAAUGAAUUCGAUGAAGAAUGAAUGAAUGAUGAAGUGAGAAUGAAUGAAUGAUGAUGAGAAUGAAUGAAUGAUGAAUGAAGAAU characterized by that the sequence differs by the sequence
The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3',5'-GUGGAGAA-3',5'-AGUGGAGAA-3',5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5'-UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) No. 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3' (SEQ ID NO: 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3' (SEQ ID NO: 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAAU-GAAAAAGGAUGA (SEQ ID NO: 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 298), 5'-AACAAAUUCAUUCAUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAACUGAAUGAAUUUUUCCUCUGAAUGAAGAAUUCA 3' (SEQ ID NO: 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAGU-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAU AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308 ), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열 312), 5'-AACAAAUUCAUUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 313), and 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO:314) selected from the group consisting of; and
5'-AUGCAAC-3'.
제 1항에 있어서, 상기 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것을 특징으로 하는 엔지니어링 된 가이드 RNA:
5'-A-3';
5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110); 및
5'-AUGCAAC-3'.
According to claim 1, wherein the sequence of the scaffold region is engineered guide RNA, characterized in that the following sequences are linked in order from the 5' end to the 3' end:
5'-A-3';
5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 10);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110); and
5'-AUGCAAC-3'.
제 1항에 있어서, 상기 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것을 특징으로 하는 엔지니어링 된 가이드 RNA:
5'-A-3';
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110); 및
5'-AUGCAAC-3'.
According to claim 1, wherein the sequence of the scaffold region is engineered guide RNA, characterized in that the following sequences are linked in order from the 5' end to the 3' end:
5'-A-3';
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110); and
5'-AUGCAAC-3'.
다음을 포함하는, CRISPR/Cas12f1 시스템을 위한 엔지니어링 된 가이드 RNA:
엔지니어링 된 스캐폴드; 및
스페이서,
이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,
상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가지며,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:
5'-A-3'로 표현되는 제1 서열;
5'-CCGCUUCAC-3'(서열번호 432)로 표현되는 제2 서열;
5'-UUAG-3'로 표현되는 제3 서열;
5'-AGUGAAGGUGG-3'(서열번호 433)로 표현되는 제4 서열;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11)로 표현되는 제5 서열;
5'-AACAAA-3'로 표현되는 제6 서열;
링커;
5'-GGA-3'로 표현되는 제7 서열; 및
5'-AUGCAAC-3'로 표현되는 제8 서열.
Engineered guide RNAs for the CRISPR/Cas12f1 system, including:
engineered scaffolds; and
spacer,
At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,
The spacer has a length of 10 to 50 nucleotides and has a sequence complementary to the target sequence,
The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:
a first sequence represented by 5'-A-3';
a second sequence represented by 5'-CCGCUUCAC-3' (SEQ ID NO: 432);
a third sequence represented by 5'-UUAG-3';
a fourth sequence represented by 5'-AGUGAAGGUGG-3' (SEQ ID NO: 433);
a fifth sequence represented by 5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
a sixth sequence represented by 5'-AACAAA-3';
linker;
a seventh sequence represented by 5'-GGA-3'; and
The eighth sequence represented by 5'-AUGCAAC-3'.
제5항에 있어서, 상기 링커는 5'-GAAA-3'인 것을 특징으로 하는 엔지니어링 된 가이드 RNA.The engineered guide RNA according to claim 5, wherein the linker is 5'-GAAA-3'. 제5항에 있어서, 상기 링커는 5'-GAAA-3', 5'-UGAAAA-3', 5'-UUGAAAAA-3', 5'-UUCGAAAGAA-3'(서열번호 425), 5'-UUCAGAAAUGAA-3'(서열번호 426), 5'-UUCAUGAAAAUGAA-3'(서열번호 427), 및 5'-UUCAUUGAAAAAUGAA-3'(서열번호 428)로 이뤄진 군에서 선택된 것을 특징으로 하는 엔지니어링 된 가이드 RNA.6. The method of claim 5, wherein the linker is 5'-GAAA-3', 5'-UGAAAA-3', 5'-UUGAAAAA-3', 5'-UUCGAAAGAA-3' (SEQ ID NO: 425), 5'-UUCAGAAAUGAA An engineered guide RNA, characterized in that it is selected from the group consisting of -3' (SEQ ID NO: 426), 5'-UUCAUGAAAAUGAA-3' (SEQ ID NO: 427), and 5'-UUCAUUGAAAAAUGAA-3' (SEQ ID NO: 428). 제5항에 있어서, 상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-A-3', 5'-GA-3', 5'-AGA-3', 5'-GAGA-3', 5'-GGAGA-3', 5'-UGGAGA-3', 5'-GUGGAGA-3', 5'-AGUGGAGA-3', 5'-AAGUGGAGA-3', 5'-AAAGUGGAGA-3'(서열번호 27), 5'-UAAAGUGGAGA-3'(서열번호 28), 5'-AUAAAGUGGAGA-3'(서열번호 29), 5'-GAUAAAGUGGAGA-3'(서열번호 30), 5'-UGAUAAAGUGGAGA-3'(서열번호 31), 5'-CUGAUAAAGUGGAGA-3'(서열번호 32), 5'-ACUGAUAAAGUGGAGA-3'(서열번호 33), 5'-CACUGAUAAAGUGGAGA-3'(서열번호 34), 5'-UCACUGAUAAAGUGGAGA-3'(서열번호 35), 5'-UUCACUGAUAAAGUGGAGA-3'(서열번호 36), 및 5'-CUUCACUGAUAAAGUGGAGA-3'(서열번호 37)로 이뤄진 군에서 선택된 제9 서열을 추가적으로 포함하며, 상기 제9 서열의 3'말단은 상기 제1 서열의 5'말단과 연결되어 있는 것을 특징으로 하는 엔지니어링 된 가이드 RNA.6. The method of claim 5, wherein the sequence of the engineered scaffold region is 5'-A-3', 5'-GA-3', 5'-AGA-3', 5'-GAGA-3', 5'- GGAGA-3', 5'-UGGAGA-3', 5'-GUGGAGA-3', 5'-AGUGGAGA-3', 5'-AAUGGAGA-3', 5'-AAAGUGGAGA-3' (SEQ ID NO: 27), 5'-UAAAAGUGGAGA-3' (SEQ ID NO: 28), 5'-AUAAAGUGGAGA-3' (SEQ ID NO: 29), 5'-GAUAAAAGUGGAGA-3' (SEQ ID NO: 30), 5'-UGAUAAAGUGGAGA-3' (SEQ ID NO: 31) ), 5'-CUGAUAAAAGUGGAGA-3' (SEQ ID NO: 32), 5'-ACUGAUAAAGUGGAGA-3' (SEQ ID NO: 33), 5'-CACUGAUAAAAGUGGAGA-3' (SEQ ID NO: 34), 5'-UCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 34), 5'-UCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 33) 35), 5'-UUCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 36), and 5'-CUUCACUGAUAAAGUGGAGA-3' (SEQ ID NO: 37), further comprising a ninth sequence selected from the group consisting of, 3' of the ninth sequence The engineered guide RNA, characterized in that the end is connected to the 5' end of the first sequence. 제5항에 있어서, 상기 엔지니어링 된 스캐폴드 영역의 서열은,
5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5'-AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3'(서열번호 52), 및 5'-AAAGCUGUCCC-3'(서열번호 53)로 이뤄진 군에서 선택된 제10 서열, 및
5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5'-UAGAACU-3', 5'-UUAGAACU-3', 5'-AUUAGAACU-3', 5'-GAUUAGAACU-3'(서열번호 54), 5'-GGAUUAGAACU-3'(서열번호 55), 및 5'-GGGAUUAGAACU-3'(서열번호 56)로 이뤄진 군에서 선택된 제11 서열을 추가적으로 포함하며,
상기 제2 서열의 3'말단 및 상기 제3 서열의 5'말단은 상기 제10 서열을 통해 연결되어 있고,
상기 제3 서열의 3'말단 및 상기 제4 서열의 5'말단은 상기 제11 서열을 통해 연결되어 있는 것을 특징으로 하는 엔지니어링 된 가이드 RNA.
The method of claim 5, wherein the sequence of the engineered scaffold region comprises:
5'-A-3', 5'-AA-3', 5'-AAA-3', 5'-AAAG-3', 5'-AAAGC-3', 5'-AAAGCU-3', 5' consisting of -AAAGCUG-3', 5'-AAAGCUGU-3', 5'-AAAGCUGUC-3', 5'-AAAGCUGUCC-3' (SEQ ID NO: 52), and 5'-AAAGCUGUCCC-3' (SEQ ID NO: 53) a tenth sequence selected from the group, and
5'-U-3', 5'-CU-3', 5'-ACU-3', 5'-AACU-3', 5'-GAACU-3', 5'-AGAACU-3', 5'-UAGAACU-3',5'-UUAGAACU-3',5'-AUUAGAACU-3',5'-GAUUAGAACU-3' (SEQ ID NO: 54), 5'-GGAUUAGAACU-3' (SEQ ID NO: 55), and 5 It additionally comprises an 11th sequence selected from the group consisting of '-GGGAUUAGAACU-3' (SEQ ID NO: 56),
The 3' end of the second sequence and the 5' end of the third sequence are linked through the 10th sequence,
The engineered guide RNA, characterized in that the 3' end of the third sequence and the 5' end of the fourth sequence are linked through the eleventh sequence.
제9항에 있어서,
상기 제10 서열이 5'-A-3'인 경우, 상기 제11 서열은 5'-U-3'이고,
상기 제10 서열이 5'-AA-3'인 경우, 상기 제11 서열은 5'-CU-3'이고,
상기 제10 서열이 5'-AAA-3'인 경우, 상기 제11 서열은 5'-ACU-3'이고,
상기 제10 서열이 5'-AAAG-3'인 경우, 상기 제11 서열은 5'-AACU-3'이고,
상기 제10 서열이 5'-AAAGC-3'인 경우, 상기 제11 서열은 5'-GAACU-3'이고,
상기 제10 서열이 5'-AAAGCU-3'인 경우, 상기 제11 서열은 5'-AGAACU-3'이고,
상기 제10 서열이 5'-AAAGCUG-3'인 경우, 상기 제11 서열은 5'-UAGAACU-3' 또는 5'-UUAGAACU-3'이고,
상기 제10 서열이 5'-AAAGCUGU-3'인 경우, 상기 제11 서열은 5'-AUUAGAACU-3'이고,
상기 제10 서열이 5'-AAAGCUGUC-3'인 경우, 상기 제11 서열은 5'-GAUUAGAACU-3'(서열번호 54)이고,
상기 제10 서열이 5'-AAAGCUGUCC-3'(서열번호 52)인 경우, 상기 제11 서열은 5'-GGAUUAGAACU-3'(서열번호 55)이고,
상기 제10 서열이 5'-AAAGCUGUCCC-3'(서열번호 53)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACU-3'(서열번호 56)이고,
상기 제10 서열이 5'-AAAAGCUGUCCC-3'(서열번호 440)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACUU-3'(서열번호 442) 일 수 있다.
상기 제10 서열이 5'-CAAAAGCUGUCCC-3'(서열번호 441)인 경우, 상기 제11 서열은 5'-GGGAUUAGAACUUG-3'(서열번호 443) 인 것을 특징으로 하는 엔지니어링 된 가이드 RNA.
10. The method of claim 9,
When the tenth sequence is 5'-A-3', the eleventh sequence is 5'-U-3',
When the tenth sequence is 5'-AA-3', the eleventh sequence is 5'-CU-3',
When the tenth sequence is 5'-AAA-3', the eleventh sequence is 5'-ACU-3',
When the tenth sequence is 5'-AAAG-3', the eleventh sequence is 5'-AACU-3',
When the tenth sequence is 5'-AAAGC-3', the eleventh sequence is 5'-GAACU-3',
When the tenth sequence is 5'-AAAGCU-3', the eleventh sequence is 5'-AGAACU-3',
When the tenth sequence is 5'-AAAGCUG-3', the eleventh sequence is 5'-UAGAACU-3' or 5'-UUAGAACU-3',
When the tenth sequence is 5'-AAAGCUGU-3', the eleventh sequence is 5'-AUUAGAACU-3',
When the tenth sequence is 5'-AAAGCUGUC-3', the eleventh sequence is 5'-GAUUAGAACU-3' (SEQ ID NO: 54),
When the tenth sequence is 5'-AAAGCUGUCC-3' (SEQ ID NO: 52), the eleventh sequence is 5'-GGAUUAGAACU-3' (SEQ ID NO: 55),
When the tenth sequence is 5'-AAAGCUGUCCC-3' (SEQ ID NO: 53), the eleventh sequence is 5'-GGGAUUAGAACU-3' (SEQ ID NO: 56),
When the tenth sequence is 5'-AAAAGCUGUCCC-3' (SEQ ID NO: 440), the eleventh sequence may be 5'-GGGAUUAGAACUU-3' (SEQ ID NO: 442).
Where the tenth sequence is 5'-CAAAAGCUGUCCC-3' (SEQ ID NO: 441), the eleventh sequence is 5'-GGGAUUAGAACUUG-3' (SEQ ID NO: 443).
제10항에 있어서, 상기 제10 서열은 5'-CAAAAGCUGUCCC-3'(서열번호 441)이고, 상기 제11 서열은 5'-GGGAUUAGAACUUG-3'(서열번호 443)인 것을 특징으로 하는 엔지니어링 된 가이드 RNA.11. The engineered guide according to claim 10, wherein the tenth sequence is 5'-CAAAAGCUGUCCC-3' (SEQ ID NO: 441) and the eleventh sequence is 5'-GGGAUUAGAACUUG-3' (SEQ ID NO: 443). RNA. 제5항에 있어서, 상기 엔지니어링 된 스캐폴드 영역의 서열은,
5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 5'-UUCAU-3', 5'-UUCAUU-3', 및 5'-UUCAUUU-3'로 이뤄진 군에서 선택된 제 12서열, 및
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', 및 5'-GAAUGAA-3'로 이뤄진 군에서 선택된 제 13서열을 추가적으로 포함하며,
이때, 상기 제6 서열의 3'말단 및 상기 링커의 5'말단은 상기 제12 서열을 통해 연결되고,
상기 링커의 3'말단 및 상기 제7 서열의 5'말단은 상기 제13 서열을 통해 연결되어 있는 것을 특징으로 하는 엔지니어링 된 가이드 RNA.
The method of claim 5, wherein the sequence of the engineered scaffold region comprises:
5'-U-3', 5'-UU-3', 5'-UUC-3', 5'-UUCA-3', 5'-UUCAU-3', 5'-UUCAUU-3', and 5 12th sequence selected from the group consisting of '-UUCAUUU-3', and
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-UGAA-3', 5'-AUGAA-3', 5'-AAUGAA-3', and 5 It additionally includes a 13th sequence selected from the group consisting of '-GAAUGAA-3',
In this case, the 3' end of the sixth sequence and the 5' end of the linker are connected through the 12th sequence,
The engineered guide RNA, characterized in that the 3' end of the linker and the 5' end of the 7th sequence are linked through the 13th sequence.
제12항에 있어서,
상기 제12 서열이 5'-U-3'인 경우, 상기 제13 서열은 5'-A-3'이고,
상기 제12 서열이 5'-UU-3'인 경우, 상기 제13 서열은 5'-AA-3'이고,
상기 제12 서열이 5'-UUC-3'인 경우, 상기 제13 서열은 5'-GAA-3'이고,
상기 제12 서열이 5'-UUCA-3'인 경우, 상기 제13 서열은 5'-UGAA-3'이고,
상기 제12 서열이 5'-UUCAU-3'인 경우, 상기 제13 서열은 5'-AUGAA-3'이고,
상기 제12 서열이 5'-UUCAUU-3'인 경우, 상기 제13 서열은 5'-AAUGAA-3'이고,
상기 제12 서열이 5'-UUCAUUU-3'인 경우, 상기 제13 서열은 5'-GAAUGAA-3'인 것을 특징으로 하는 엔지니어링 된 가이드 RNA.
13. The method of claim 12,
when the twelfth sequence is 5'-U-3', the thirteenth sequence is 5'-A-3',
when the twelfth sequence is 5'-UU-3', the thirteenth sequence is 5'-AA-3',
when the twelfth sequence is 5'-UUC-3', the thirteenth sequence is 5'-GAA-3',
when the twelfth sequence is 5'-UUCA-3', the thirteenth sequence is 5'-UGAA-3',
When the twelfth sequence is 5'-UUCAU-3', the thirteenth sequence is 5'-AUGAA-3',
when the twelfth sequence is 5'-UUCAUU-3', the thirteenth sequence is 5'-AAUGAA-3',
When the twelfth sequence is 5'-UUCAUUU-3', the thirteenth sequence is 5'-GAAUGAA-3'.
다음을 포함하는 CRISPR/Cas12f1 시스템을 위한 엔지니어링 된 가이드 RNA:
엔지니어링 된 스캐폴드 영역; 및
스페이서;
이때, 상기 스페이서는 10 내지 50 뉴클레오타이드 길이를 가지며, 표적 서열과 상보적인 서열을 가지며,
상기 엔지니어링 된 스캐폴드 영역의 서열은 다음 서열을 포함함:
5'말단에서 3'말단 방향으로,
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열,
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열,
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11), 및
5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAUUCA-3'(서열번호 66), 5'-AACAAAUUCAU-3'(서열번호 67), 5'-AACAAAUUCAUU-3'(서열번호 68), 및 5'-AACAAAUUCAUUU-3'(서열번호 12)로 이뤄진 군에서 선택된 서열이 연결된 엔지니어링 된 tracrRNA; 및
5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5'-AAUGAAGGA-3', 및 5'-GAAUGAAGGA-3'(서열번호 14)으로 이뤄진 군에서 선택된 서열, 및
5'-AUGCAAC-3'가 연결된 엔지니어링 된 crRNA 반복 서열 부분,
이때, 상기 엔지니어링 된 crRNA 반복 서열 부분의 3'말단은 상기 스페이서의 5'말단과 연결 되어 있으며,
상기 엔지니어링 된 tracrRNA의 서열이 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3'(서열번호 1)과 동일하고, 및 상기 엔지니어링 된 crRNA 반복 서열이 5'-GAAUGAAGGAAUGCAAC-3'(서열번호 3)과 동일한 경우는 제외되는 것을 특징으로 함.
Engineered guide RNAs for the CRISPR/Cas12f1 system, including:
engineered scaffold area; and
spacer;
In this case, the spacer has a length of 10 to 50 nucleotides, and has a sequence complementary to the target sequence,
The sequence of the engineered scaffold region comprises the following sequence:
5' end to 3' end direction,
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3',5'-GUGGAGAA-3',5'-AGUGGAGAA-3',5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5'-UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), a sequence selected from the group consisting of 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9),
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order,
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11), and
5'-AACAAA-3', 5'-AACAAAU-3', 5'-AACAAAUU-3', 5'-AACAAAUUC-3', 5'-AACAAAAUUCA-3' (SEQ ID NO: 66), 5'-AACAAAUUCAU- an engineered tracrRNA to which a sequence selected from the group consisting of 3' (SEQ ID NO: 67), 5'-AACAAAUUCAUU-3' (SEQ ID NO: 68), and 5'-AACAAAUUCAUUU-3' (SEQ ID NO: 12) is linked; and
5'-GGA-3', 5'-AGGA-3', 5'-AAGGA-3', 5'-GAAGGA-3', 5'-UGAAGGA-3', 5'-AUGAAGGA-3', 5'-AAUGAAGGA-3', and a sequence selected from the group consisting of 5'-GAAUGAAGGA-3' (SEQ ID NO: 14), and
5'-AUGCAAC-3' linked engineered crRNA repeat sequence portion,
At this time, the 3' end of the engineered crRNA repeat sequence portion is connected to the 5' end of the spacer,
If the sequence of the engineered tracrRNA is identical to 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3' (SEQ ID NO: 1), except that the sequence is identical to that of the engineered tracrRNA repeats that the sequence is identical to 5'-CUUCACUGAUAAAGUGGAGAGAACCGCUUCACCAAAAGCUGUCCCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUU-3' (SEQ ID NO: 1). characterized.
다음을 포함하는, 표적 서열을 포함하는 핵산을 편집할 수 있는 엔지니어링 된 CRISPR/Cas12f1 복합체:
Cas12f1 단백질; 및
엔지니어링 된 가이드 RNA,
이때, 상기 엔지니어링 된 가이드 RNA는 다음을 포함함:
엔지니어링 된 스캐폴드 영역; 및
스페이서;
이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,
상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 하며,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및
5'-AUGCAAC-3'.
An engineered CRISPR/Cas12f1 complex capable of editing a nucleic acid comprising a target sequence, comprising:
Cas12f1 protein; and
engineered guide RNA,
In this case, the engineered guide RNA comprises:
engineered scaffold area; and
spacer;
At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,
The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,
The sequence of the engineered scaffold region is characterized in that it differs from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 7),
The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3',5'-GUGGAGAA-3',5'-AGUGGAGAA-3',5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5'-UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and
5'-AUGCAAC-3'.
제15 항에 있어서, 상기 엔지니어링 된 가이드 RNA에 포함된 상기 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것을 특징으로 하는 엔지니어링 된 CRISPR/Cas12f1 복합체:
5'-A-3';
5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110); 및
5'-AUGCAAC-3'.
The engineered CRISPR/Cas12f1 complex according to claim 15, wherein the sequence of the scaffold region included in the engineered guide RNA is sequentially linked from the 5' end to the 3' end:
5'-A-3';
5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 10);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110); and
5'-AUGCAAC-3'.
제15 항에 있어서, 상기 엔지니어링 된 가이드 RNA에 포함된 상기 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것을 특징으로 하는 엔지니어링 된 CRISPR/Cas12f1 복합체:
5'-A-3';
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110); 및
5'-AUGCAAC-3'.
The engineered CRISPR/Cas12f1 complex according to claim 15, wherein the sequence of the scaffold region included in the engineered guide RNA is sequentially linked from the 5' end to the 3' end:
5'-A-3';
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110); and
5'-AUGCAAC-3'.
다음을 포함하는, 표적 서열을 포함하는 핵산을 편집할 수 있는 엔지니어링 된 CRISPR/Cas12f1 복합체:
Cas12f1 단백질; 및
제5항 내지 제13항 중 어느 한 항에 따른 엔지니어링 된 가이드 RNA.
An engineered CRISPR/Cas12f1 complex capable of editing a nucleic acid comprising a target sequence, comprising:
Cas12f1 protein; and
An engineered guide RNA according to any one of claims 5 to 13.
다음을 포함하는, CRISPR/Cas12f1 시스템의 각 구성요소를 발현할 수 있는 벡터:
Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 제1 서열;
상기 제1 서열과 작동 가능하게 연결된 제1 프로모터 서열;
엔지니어링 된 가이드 RNA를 암호화하는 핵산 서열을 포함하는 제2 서열; 및
상기 제2 서열과 작동 가능하게 연결된 제2 프로모터 서열,
이때, 상기 엔지니어링 된 가이드 RNA는 다음을 포함함:
엔지니어링 된 스캐폴드 영역; 및
스페이서;
이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,
상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 하며,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및
5'-AUGCAAC-3'.
A vector capable of expressing each component of the CRISPR/Cas12f1 system, comprising:
a first sequence comprising a nucleic acid sequence encoding a Cas12f1 protein;
a first promoter sequence operably linked to said first sequence;
a second sequence comprising a nucleic acid sequence encoding the engineered guide RNA; and
a second promoter sequence operably linked to said second sequence;
In this case, the engineered guide RNA comprises:
engineered scaffold area; and
spacer;
At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,
The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,
The sequence of the engineered scaffold region is characterized in that it differs from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 7),
The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3',5'-GUGGAGAA-3',5'-AGUGGAGAA-3',5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5'-UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and
5'-AUGCAAC-3'.
제19항에 있어서, 상기 엔지니어링 된 가이드 RNA에 포함된 상기 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것을 특징으로 하는 벡터:
5'-A-3';
5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110); 및
5'-AUGCAAC-3'.
20. The vector according to claim 19, wherein the sequence of the scaffold region included in the engineered guide RNA is sequentially linked from the 5' end to the 3' end:
5'-A-3';
5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 10);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110); and
5'-AUGCAAC-3'.
제19항에 있어서, 상기 엔지니어링 된 가이드 RNA에 포함된 상기 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것을 특징으로 하는 벡터:
5'-A-3';
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110); 및
5'-AUGCAAC-3'.
20. The vector according to claim 19, wherein the sequence of the scaffold region included in the engineered guide RNA is sequentially linked from the 5' end to the 3' end:
5'-A-3';
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430);
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110); and
5'-AUGCAAC-3'.
제19항에 있어서, 상기 벡터는 플라스미드, 레트로바이러스, 렌티바이러스, 아데노바이러스, 아데노-연관 바이러스, 백시니아바이러스, 폭스바이러스 및 단순포진 바이러스로 구성된 군에서 선택되는 하나 이상인 것을 특징으로 하는 벡터.The vector according to claim 19, wherein the vector is at least one selected from the group consisting of a plasmid, a retrovirus, a lentivirus, an adenovirus, an adeno-associated virus, a vaccinia virus, a poxvirus, and a herpes simplex virus. 다음을 포함하는, CRISPR/Cas12f1 시스템의 각 구성요소를 발현할 수 있는 벡터:
Cas12f1 단백질을 암호화하는 핵산 서열을 포함하는 제1 서열;
상기 제1 서열과 작동 가능하게 연결된 제1 프로모터 서열;
제5항 내지 제13항 중 어느 한 항에 따른 엔지니어링 된 가이드 RNA를 암호화하는 핵산 서열을 포함하는 제2 서열; 및
상기 제2 서열과 작동 가능하게 연결된 제2 프로모터 서열.
A vector capable of expressing each component of the CRISPR/Cas12f1 system, comprising:
a first sequence comprising a nucleic acid sequence encoding a Cas12f1 protein;
a first promoter sequence operably linked to said first sequence;
a second sequence comprising a nucleic acid sequence encoding the engineered guide RNA according to any one of claims 5 to 13; and
a second promoter sequence operably linked to said second sequence.
다음을 포함하는, 세포 내에서 표적 서열을 포함하는 핵산을 편집하는 방법:
Cas12f1 단백질 또는 이를 암호화하는 핵산, 및 엔지니어링 된 가이드 RNA 또는 이름 암호화하는 핵산을 세포 내로 전달하는 것,
이로 인해 상기 세포 내에서 CRISPR/Cas12f1 복합체가 형성될 수 있으며,
이로 인해 상기 표적 서열을 포함하는 핵산이 CRISPR/Cas12f1 복합체에 의해 편집될 수 있고,
상기 엔지니어링 된 가이드 RNA는 다음을 포함하는 것을 특징으로 함:
엔지니어링 된 스캐폴드 영역; 및
스페이서;
이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,
상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3'(서열번호 7)과 상이한 것을 특징으로 하며,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 및 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117)에서 선택된 서열; 및
5'-AUGCAAC-3'.
A method of editing a nucleic acid comprising a target sequence in a cell, comprising:
delivering the Cas12f1 protein or a nucleic acid encoding the same, and an engineered guide RNA or a nucleic acid encoding the name into a cell;
Due to this, the CRISPR/Cas12f1 complex may be formed in the cell,
Due to this, the nucleic acid comprising the target sequence can be edited by the CRISPR/Cas12f1 complex,
The engineered guide RNA is characterized in that it comprises:
engineered scaffold area; and
spacer;
At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,
The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,
The sequence of the engineered scaffold region is characterized in that it differs from 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUGAAAGAAUGAAGGAAUGCAAC-3' (SEQ ID NO: 7),
The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3',5'-GUGGAGAA-3',5'-AGUGGAGAA-3',5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5'-UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), and 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) a sequence selected from SEQ ID NO: 117); and
5'-AUGCAAC-3'.
다음을 포함하는, 세포 내에서 표적 서열을 포함하는 핵산을 편집하는 방법:
Cas12f1 단백질 또는 이를 암호화하는 핵산, 및 엔지니어링 된 가이드 RNA 또는 이름 암호화하는 핵산을 세포 내로 전달하는 것,
이로 인해 상기 세포 내에서 CRISPR/Cas12f1 복합체가 형성될 수 있으며,
이로 인해 상기 표적 서열을 포함하는 핵산이 CRISPR/Cas12f1 복합체에 의해 편집될 수 있고,
상기 엔지니어링 된 가이드 RNA는 다음을 포함하는 것을 특징으로 함:
엔지니어링 된 스캐폴드 영역; 및
스페이서; 및
이때, 상기 엔지니어링 된 가이드 RNA의 5'말단에서 3'말단 방향으로, 엔지니어링 된 스캐폴드 영역, 및 스페이서가 순서대로 연결되어 있고,
상기 스페이서는 10개 이상 50개 이하의 뉴클레오타이드를 포함하며, 표적 서열과 상보적인 서열을 가지고,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGAAUGCAAC-3'(서열번호 315)과 상이한 것을 특징으로 하며,
상기 엔지니어링 된 스캐폴드 영역의 서열은 5'말단에서 3'말단 방향으로 다음 서열이 순서대로 연결된 것임:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3', 5'-GUGGAGAA-3', 5'-AGUGGAGAA-3', 5'-AAGUGGAGAA-3'(서열번호 16), 5'-AAAGUGGAGAA-3'(서열번호 17), 5'-UAAAGUGGAGAA-3'(서열번호 18), 5'-AUAAAGUGGAGAA-3'(서열번호 19), 5'-GAUAAAGUGGAGAA-3'(서열번호 20), 5'-UGAUAAAGUGGAGAA-3'(서열번호 21), 5'-CUGAUAAAGUGGAGAA-3'(서열번호 22), 5'-ACUGAUAAAGUGGAGAA-3'(서열번호 23), 5'-CACUGAUAAAGUGGAGAA-3'(서열번호 24), 5'-UCACUGAUAAAGUGGAGAA-3'(서열번호 25), 5'-UUCACUGAUAAAGUGGAGAA-3'(서열번호 26), 및 5'-CUUCACUGAUAAAGUGGAGAA-3'(서열번호 9)로 이뤄진 군에서 선택된 서열;
5'-CCGCUUCACUUAGAGUGAAGGUGG-3'(서열번호 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3'(서열번호 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3'(서열번호 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGUGG-3'(서열번호 39), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3'(서열번호 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3'(서열번호 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 42), 5'-CCGCUUCACCAAAAGCUUAGGAACUUGAGUGAAGGUGG-3'(서열번호 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3'(서열번호 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3'(서열번호 45), 5'-CCGCUUCACCAAAAGCUGUUAGUAGAACUUGAGUGAAGGUGG-3'(서열번호 46), 5'-CCGCUUCACCAAAAGCUGUUUAGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 49), 및 5'-CCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGG-3'(서열번호 10)로 이뤄진 군에서 선택된 서열;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3'(서열번호 11);
5'-AACAAAGAAAGGA-3'(서열번호 110), 5'-AACAAAUGAAAAGGA-3'(서열번호 111), 5'-AACAAAUUGAAAAAGGA-3'(서열번호 112), 5'-AACAAAUUCGAAAGAAGGA-3'(서열번호 113), 5'-AACAAAUUCAGAAAUGAAGGA-3'(서열번호 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3'(서열번호 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3'(서열번호 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3'(서열번호 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3'(서열번호 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3'(서열번호 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGA-3'(서열번호 295), 5'-AACAAAUUCAUUUUUCCGAAAAGACGAAUGAAGGA-3'(서열번호 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3'(서열번호 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3'(서열번호 298), 5'-AACAAAUUCAUUUUUCCUCUGAAAAAUAGACGAAUGAAGGA-3'(서열번호 299), 5'-AACAAAUUCAUUUUUCCUCUCGAAAGAAUAGACGAAUGAAGGA-3'(서열번호 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3'(서열번호 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3'(서열번호 302), 5'-AACAAAUUCAUUUUUCCUCUCCAAGAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 303), 5'-AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 312), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 313), 및 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 314)로 이뤄진 군에서 선택된 서열; 및
5'-AUGCAAC-3'.
A method of editing a nucleic acid comprising a target sequence in a cell, comprising:
delivering the Cas12f1 protein or a nucleic acid encoding the same, and an engineered guide RNA or a nucleic acid encoding the name into a cell;
Due to this, the CRISPR/Cas12f1 complex may be formed in the cell,
Due to this, the nucleic acid comprising the target sequence can be edited by the CRISPR/Cas12f1 complex,
The engineered guide RNA is characterized in that it comprises:
engineered scaffold area; and
spacer; and
At this time, in the direction from the 5' end to the 3' end of the engineered guide RNA, the engineered scaffold region, and the spacer are connected in order,
The spacer comprises 10 or more and 50 or less nucleotides and has a sequence complementary to the target sequence,
The sequence of the engineered scaffold region is 5'-CUUCACUGAUAAAGUGGAGAACCGCUUCACCAAAAGCUGUCCCUUAGGGGAUUAGAACUUGAGUGAAGGUGGGCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUUCUUCGGAAAGUAACCCUCGAAACAAAUUCAUUUUUUCCUCUCCCAAAAUUCGAGAACCAAUGAAUGAAUUCGAUGAAGAAUGAAUGAAUGAUGAAGUGAGAAUGAAUGAAUGAUGAUGAGAAUGAAUGAAUGAUGAAUGAAGAAU characterized by that the sequence differs by the sequence
The sequence of the engineered scaffold region consists of the following sequences linked in order from the 5' end to the 3' end:
5'-A-3', 5'-AA-3', 5'-GAA-3', 5'-AGAA-3', 5'-GAGAA-3', 5'-GGAGAA-3', 5'-UGGAGAA-3',5'-GUGGAGAA-3',5'-AGUGGAGAA-3',5'-AAGUGGAGAA-3' (SEQ ID NO: 16), 5'-AAAGUGGAGAA-3' (SEQ ID NO: 17), 5'-UAAAGUGGAGAA-3' (SEQ ID NO: 18), 5'-AUAAAGUGGAGAA-3' (SEQ ID NO: 19), 5'-GAUAAAGUGGAGAA-3' (SEQ ID NO: 20), 5'-UGAUAAAGUGGAGAA-3' (SEQ ID NO: 21), 5'-CUGAUAAAGUGGAGAA-3' (SEQ ID NO: 22), 5'-ACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 23), 5'-CACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 24), 5'-UCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 25 ), 5'-UUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 26), and 5'-CUUCACUGAUAAAGUGGAGAA-3' (SEQ ID NO: 9);
5'-CCGCUUCACUUAGAGUGAAGGUGG-3' (SEQ ID NO: 430), 5'-CCGCUUCACCUUAGGAGUGAAGGUGG-3' (SEQ ID NO: 431), 5'-CCGCUUCACCAUUAGUGAGUGAAGGUGG-3' (SEQ ID NO: 38), 5'-CCGCUUCACCAAUUAGUUGAGUGAAGGG-3' (SEQ ID NO: 39) ), 5'-CCGCUUCACCAAAUUAGCUUGAGUGAAGGUGG-3' (SEQ ID NO: 40), 5'-CCGCUUCACCAAAAUUAGACUUGAGUGAAGGUGG-3' (SEQ ID NO: 41), 5'-CCGCUUCACCAAAAGUUAGAACUUGAGUGAAGCUAAGUGG-3' (SEQ ID NO: 42), 5'-AGAGGUCCG'UGAAGGUGG-3' (SEQ ID NO: 42), 5'-AGAG No. 43), 5'-CCGCUUCACCAAAAGCUUUAGAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 44), 5'-CCGCUUCACCAAAAGCUGUUAGUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 45), 5'-CCGCUUAUCACCAAAAGCUGUGUUAGUAAGAAGGUAGGUAGGUAAGAUAAGAUGAGUUAGUAAGUAACUUGACUGAGUAGGUA-3' (SEQ ID NO: 47), 5'-CCGCUUCACCAAAAGCUGUCUUAGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 48), 5'-CCGCUUCACCAAAAGCUGUCCUUAGGGAUUAGAACUUGAGUGAAGGUGG-3' (SEQ ID NO: 49), and 5'-GUCCUGAGUUCACCAAAAGGAUGGAUGA selected from the group consisting of: order;
5'-GCUGCUUGCAUCAGCCUAAUGUCGAGAAGUGCUUUCUUCGGAAAGUAACCCUCGA-3' (SEQ ID NO: 11);
5'-AACAAAGAAAGGA-3' (SEQ ID NO: 110), 5'-AACAAAUGAAAAGGA-3' (SEQ ID NO: 111), 5'-AACAAAUUGAAAAAGGA-3' (SEQ ID NO: 112), 5'-AACAAAUUCGAAAGAAGGA-3' (SEQ ID NO: 113) ), 5'-AACAAAUUCAGAAAUGAAGGA-3' (SEQ ID NO: 114), 5'-AACAAAUUCAUGAAAAUGAAGGA-3' (SEQ ID NO: 115), 5'-AACAAAUUCAUUGAAAAAUGAAGGA-3' (SEQ ID NO: 116), 5'-AACAAAUUCAUUUGAAAGAAUGAAGGA-3' (SEQ ID NO: 115) No. 117), 5'-AACAAAUUCAUUUUGAAACGAAUGAAGGA-3' (SEQ ID NO: 293), 5'-AACAAAUUCAUUUUUGAAAACGAAUGAAGGA-3' (SEQ ID NO: 294), 5'-AACAAAUUCAUUUUUCGAAAGACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAACGAAUGAAGGAUUGAAUGAAU-GAAAAAGGAUGA (SEQ ID NO: 296), 5'-AACAAAUUCAUUUUUCCUGAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 297), 5'-AACAAAUUCAUUUUUCCUCGAAAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 298), 5'-AACAAAUUCAUUCAUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAUGAAUUCAUUUUUUCCUACAACUGAAUGAAUUUUUCCUCUGAAUGAAGAAUUCA 3' (SEQ ID NO: 300), 5'-AACAAAUUCAUUUUUCCUCUCCGAAACGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 301), 5'-AACAAAUUCAUUUUUCCUCUCCAGAAACCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAGU-3' (SEQ ID NO: 302), 5'-AACAAAUCUCCAUGAAUGAU AACAAAUUCAUUUUUCCUCUCCAAUGAAAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 304), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUGAAAAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 305), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCGAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 306), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGAAAAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 307), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGGAAACAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 308 ), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCGAAAGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 309), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCAGAAAUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 310), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열번호 311), 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3'(서열 312), 5'-AACAAAUUCAUUUUUUCCUCUCCAAUUCUGCACAGAAAUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO: 313), and 5'-AACAAAUUCAUUUUUCCUCUCCAAUUCUGCACAAGAAAGUUGCAGAACCCGAAUAGACGAAUGAAGGA-3' (SEQ ID NO:314) selected from the group consisting of; and
5'-AUGCAAC-3'.
제24항 또는 제25항 중 어느 한 항에 있어서, 상기 전달은 상기 Cas12f1 단백질 및 상기 엔지니어링 된 가이드 RNA를 CRISPR/Cas12f1 복합체로써 세포 내에 주입하는 것인 방법.26. The method of any one of claims 24 or 25, wherein said delivery is intracellular injection of said Cas12f1 protein and said engineered guide RNA as a CRISPR/Cas12f1 complex. 제24항에 있어서, 상기 전달은 상기 Cas12f1 단백질을 암호화하는 핵산 및 상기 엔지니어링 된 가이드 RNA를 암호화하는 핵산을 포함하는 벡터를 세포 내에 주입하는 것인 방법.25. The method of claim 24, wherein said delivering is intracellular injection of a vector comprising a nucleic acid encoding said Cas12f1 protein and a nucleic acid encoding said engineered guide RNA. 제24항 또는 제25항 중 어느 한 항에 있어서, 상기 세포는 진핵세포인 것을 특징으로 하는 방법.26. The method of any one of claims 24 or 25, wherein the cell is a eukaryotic cell. 제27항에 있어서, 상기 벡터는 플라스미드, 레트로바이러스, 렌티바이러스, 아데노바이러스, 아데노-연관 바이러스, 백시니아바이러스, 폭스바이러스 및 단순포진 바이러스로 구성된 군에서 선택되는 하나 이상인 것을 특징으로 하는 방법.The method of claim 27, wherein the vector is one or more selected from the group consisting of plasmid, retrovirus, lentivirus, adenovirus, adeno-associated virus, vacciniavirus, poxvirus and herpes simplex virus. 다음을 포함하는, 세포 내에서 표적 서열을 포함하는 핵산을 편집하는 방법:
Cas12f1 단백질 또는 이를 암호화하는 핵산, 및 제5항 내지 제13항 중 어느 한 항에 따른 엔지니어링 된 가이드 RNA 또는 이름 암호화하는 핵산을 세포 내로 전달하는 것,
이로 인해 상기 세포 내에서 CRISPR/Cas12f1 복합체가 형성될 수 있으며,
이로 인해 상기 표적 서열을 포함하는 핵산이 CRISPR/Cas12f1 복합체에 의해 편집될 수 있음.
A method of editing a nucleic acid comprising a target sequence in a cell, comprising:
Delivering the Cas12f1 protein or a nucleic acid encoding the same, and the engineered guide RNA according to any one of claims 5 to 13 or a nucleic acid encoding the name into a cell,
Due to this, the CRISPR/Cas12f1 complex may be formed in the cell,
This allows the nucleic acid containing the target sequence to be edited by the CRISPR/Cas12f1 complex.
제1항 또는 제2항 중 어느 한 항의 엔지니어링 된 가이드 RNA를 암호화하는 DNA.A DNA encoding the engineered guide RNA of any one of claims 1 or 2. 제5항 내지 제13항 중 어느 한 항의 엔지니어링 된 가이드 RNA를 암호화하는 DNA.A DNA encoding the engineered guide RNA of any one of claims 5-13. 제14항의 엔지니어링 된 가이드 RNA를 암호화하는 DNA.



A DNA encoding the engineered guide RNA of claim 14 .



KR1020210051552A 2020-10-08 2021-04-21 An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof KR20220145438A (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
KR1020210051552A KR20220145438A (en) 2021-04-21 2021-04-21 An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof
US18/030,624 US20240254479A1 (en) 2020-10-08 2021-10-08 Engineered guide rna for optimized crispr/cas12f1 system and use thereof
JP2023521464A JP2023544817A (en) 2020-10-08 2021-10-08 Engineered guide RNA and its applications to improve CRISPR/Cas12f1 system efficiency
EP21878063.3A EP4227411A1 (en) 2020-10-08 2021-10-08 Engineered guide rna for increasing efficiency of crispr/cas12f1 system, and use of same
AU2021357377A AU2021357377A1 (en) 2020-10-08 2021-10-08 Engineered guide RNA for optimized CRISPR/Cas12f1 (Cas14a1) system and use thereof
PCT/KR2021/013923 WO2022075813A1 (en) 2020-10-08 2021-10-08 Engineered guide rna for increasing efficiency of crispr/cas12f1 system, and use of same
CN202180082426.4A CN116806261A (en) 2020-10-08 2021-10-08 Engineered guide RNAs for increasing efficiency of CRISPR/Cas12f1 systems and uses thereof
CA3198429A CA3198429A1 (en) 2020-10-08 2021-10-08 Engineered guide rna for optimized crispr/cas12f1 system and use thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020210051552A KR20220145438A (en) 2021-04-21 2021-04-21 An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof

Publications (1)

Publication Number Publication Date
KR20220145438A true KR20220145438A (en) 2022-10-31

Family

ID=83803041

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020210051552A KR20220145438A (en) 2020-10-08 2021-04-21 An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof

Country Status (2)

Country Link
KR (1) KR20220145438A (en)
CN (1) CN116806261A (en)

Also Published As

Publication number Publication date
CN116806261A (en) 2023-09-26

Similar Documents

Publication Publication Date Title
KR102455623B1 (en) An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof
KR102690083B1 (en) An engineered guide RNA including a U-rich tail for the optimized CRISPR/Cas12f1 system and use thereof
US20240254479A1 (en) Engineered guide rna for optimized crispr/cas12f1 system and use thereof
US20040005593A1 (en) Novel method for delivery and intracellular synthesis of siRNA molecules
US20230416784A1 (en) Engineered guide rna for optimized crispr/cas12f1 (cas14a1) system and use thereof
US20230374500A1 (en) Engineered guide rna comprising u-rich tail for optimized crispr/cas12f1 system and use thereof
KR20200135225A (en) Single base editing proteins and composition comprising the same
KR102638799B1 (en) An engineered guide RNA for the optimized CRISPR/Cas12f1(Cas14a1) system and use thereof
KR20230007218A (en) Hypercompact base editing systems and use thereof
CN116162609A (en) Cas13 protein, CRISPR-Cas system and application thereof
KR20230051095A (en) Novel genome editing TaRGET system and uses thereof
KR20230121569A (en) TaRGET system for homology-directed repair and gene editing method using the same
US20230023791A1 (en) Gene editing systems comprising a crispr nuclease and uses thereof
KR20220145438A (en) An engineered guide RNA for the optimized CRISPR/Cas12f1 system and use thereof
KR20240034661A (en) An improved Campylobacter jejuni derived CRISPR/Cas9 gene-editing system by structure modification of a guide RNA
US20070122798A1 (en) Methods and tools for screening active rna in cellulo
CN116568806A (en) Engineered guide RNAs for increasing efficiency of CRISPR/CAS12F1 (CAS 14 A1) systems and uses thereof
CN117916372A (en) Cleavage-free CAS12F1, fusion protein based on cleavage-free CAS12F1, CRISPR gene editing system comprising same, and preparation method and application thereof
CN117813379A (en) Gene editing system comprising CRISPR nucleases and uses thereof
CN116648505A (en) Compositions comprising B2M-targeted RNA guides and uses thereof

Legal Events

Date Code Title Description
E902 Notification of reason for refusal
E90F Notification of reason for final refusal