WO2023206871A1 - Optimized crispr/spcas12f1 system, engineered guide rna and use thereof - Google Patents

Optimized crispr/spcas12f1 system, engineered guide rna and use thereof Download PDF

Info

Publication number
WO2023206871A1
WO2023206871A1 PCT/CN2022/113354 CN2022113354W WO2023206871A1 WO 2023206871 A1 WO2023206871 A1 WO 2023206871A1 CN 2022113354 W CN2022113354 W CN 2022113354W WO 2023206871 A1 WO2023206871 A1 WO 2023206871A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
seq
spcas12f1
guide rna
gene
Prior art date
Application number
PCT/CN2022/113354
Other languages
French (fr)
Chinese (zh)
Inventor
季泉江
王玉珏
吴兆韡
Original Assignee
上海科技大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海科技大学 filed Critical 上海科技大学
Publication of WO2023206871A1 publication Critical patent/WO2023206871A1/en

Links

Images

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/43Enzymes; Proenzymes; Derivatives thereof
    • A61K38/46Hydrolases (3)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/43Enzymes; Proenzymes; Derivatives thereof
    • A61K38/46Hydrolases (3)
    • A61K38/465Hydrolases (3) acting on ester bonds (3.1), e.g. lipases, ribonucleases
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/0008Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/005Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/65Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression using markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0684Cells of the urinary tract or kidneys
    • C12N5/0686Kidney cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/10Cells modified by introduction of foreign genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/22Vectors comprising a coding region that has been codon optimised for expression in a respective host

Definitions

  • the invention belongs to the field of biotechnology and relates to an optimized CRISPR/SpCas12f1 system, engineered guide RNA and its application. Specifically, it relates to an engineered guide RNA and a gene editing method based on an extremely small CRISPR/SpCas12f1 system.
  • CRISPR Clustered regularly interspaced short palindromic repeats
  • Cas Clustered regularly interspaced short palindromic repeats
  • Cas Clustered regularly interspaced short palindromic repeats
  • Cas Clustered regularly interspaced short palindromic repeats
  • Cas Clustered regularly interspaced short palindromic repeats
  • Cas Clustered regularly interspaced short palindromic repeats
  • Cas its associated genes
  • the CRISPR-Cas system uses three steps to protect microorganisms from foreign invasion: (1) adaptation, in which Cas proteins capture foreign nucleic acid fragments (spacers) and insert them into the CRISPR array; (2) processing of transcripts from the CRISPR array to produce mature CRISPR RNA (crRNAs); (3) Interference, crRNAs guide the Cas protein to target and cut the homologous sequence of the new invader.
  • the use of genome editing technology can promote cell engineering, the generation of model animals, the development of new plant varieties and genetic screening, etc. It
  • CRISPR/Cas9 and CRISPR/Cas12a are the most widely used genome editing systems, but the large size of these two systems limits their use in fields such as gene therapy.
  • V-F subtype proteins - Cas12f which are composed of approximately 400–700 amino acid residues and contain a RuvC nuclease domain, have been discovered. These small-sized proteins can help solve the problem of adeno-associated virus vector packaging.
  • SyntrophomonaspalmitaticaCas12f1 SpCas12f1, 497 amino acids
  • SpCas12f1 can also target target genes in eukaryotic cells and trigger gene insertion and deletion, but its efficiency is only 0.1%-3.6%. Therefore, it is particularly urgent to optimize small molecular weight or small size gene editing systems to make them efficient and precise gene editing tools.
  • the purpose of the present invention is to solve the shortcomings of small nuclease gene editing within the framework of the existing technology and provide an optimized CRISPR/SpCas12f1 system and its efficient gene editing method.
  • the optimized gene editing system and method provided by the present invention can achieve efficient and precise editing of target genes or target genomes in vitro (including cells) or in vivo, greatly improving the gene editing efficiency of the small CRISPR/SpCas12f1 system.
  • a guide RNA which is characterized in that it includes a tracr RNA sequence and a crRNA sequence; the crRNA sequence includes a gene targeting segment capable of hybridizing with a target sequence and a tracr partner sequence; the The tracr RNA sequence and the tracr partner sequence constitute the backbone sequence of the guide RNA; the gene targeting segment that hybridizes to the target sequence can be determined based on the target gene for editing;
  • the tracrRNA includes the nucleotide sequence shown in SEQ ID NO. 111 or a variant sequence having at least 50% identity with it;
  • the tracr partner sequence includes the nucleotide sequence shown in SEQ ID NO. 112 or a variant sequence having at least 50% identity with it.
  • Another object of the present invention is to provide an isolated polynucleotide encoding the guide RNA as described above.
  • Another object of the present invention is to provide a construct containing an isolated polynucleotide as described above.
  • Another object of the present invention is to provide an expression system, which contains the construct as described above or an exogenous polynucleotide as described above integrated into the genome.
  • Another object of the present invention is to provide a gene editing system, which includes the guide RNA or its encoding polynucleotide as described above; the editing system further includes a nuclease or its encoding polynucleotide.
  • Another object of the present invention is to provide a gene editing method that contacts the target gene with the gene editing system as described above to achieve editing of the target gene.
  • Another object of the present invention is to provide the guide RNA, isolated polynucleotide, construct, expression system, gene editing system, pharmaceutical composition or method described above in vivo, ex vivo cell or cell-free environment. Applications in gene editing of target genes and/or their related polypeptides.
  • Another object of the present invention is to provide a genetically modified cell, which is obtained by gene editing using the gene editing system or method described above.
  • the present invention conducted in-depth exploration of the gRNA of SpCas12f1 through multiple rounds of gRNA transformation, thereby determining the gRNA with the highest editing efficiency and transforming the CRISPR/SpCas12f1 system into an efficient and accurate genome editing tool.
  • the optimal version of gRNA can increase the efficiency of gene insertion and gene deletion in all tested targets by approximately 2-9 times.
  • gRNA remodeling in the present invention significantly reduces the size of gRNA, and the engineered SpCas12f1 system is smaller, providing more possibilities for gene editing and gene therapy for AAV delivery.
  • This compact CRISPR system has broad application prospects in genome editing.
  • the present invention tested the editing efficiency of a total of 32 targeted sequences on 9 endogenous genes in a mammalian cell genome editing experiment, thereby improving the applicability of the small CRISPR system in cell gene editing.
  • Figure 1 is a schematic diagram of the initial version of gRNA_MS1 of SpCas12f1 nuclease in Example 1.
  • Figure 2 is a diagram showing the gene editing results in human cells mediated by SpCas12f1 nuclease and the corresponding initial version of gRNA_MS1 in Example 1. The results showed that SpCas12f1 nuclease successfully achieved gene editing of multiple genes on the mammalian cell genome.
  • Figure 3 is a schematic diagram of the specific modification sites and corresponding base lengths of SpCas12f1 nuclease gRNA engineering in Example 2.
  • Figure 4 is a diagram showing the gene editing results in mammalian cells mediated by SpCas12f1 nuclease and corresponding different versions of gRNA in Example 2. The results showed that the editing efficiency of the partially engineered gRNA was higher than that of the initial version gRNA_MS1, with the highest efficiency being gRNA_MS13.
  • Figure 5 is a diagram showing the comparison results of gene editing in mammalian cells mediated by SpCas12f1 nuclease and the corresponding initial version (MS1) and optimal version (MS13) of gRNA in Example 3. The results showed that gRNA_MS13 greatly improved the gene editing efficiency of nuclease in mammalian cells.
  • a single base editing system based on SpCas12f1 was developed by fusing inactivated SpCas12f1 and base deaminase;
  • a Prime editing system based on SpCas12f1 was developed by fusing inactivated SpCas12f1 and reverse transcriptase;
  • a SpCas12f1-based Prime editing system was developed by fusing inactivated SpCas12f1 and transcription activator to develop a transcription activation system based on SpCas12f1; by fusing inactivated SpCas12f1 and a nucleic acid epigenetic modification enzyme, an epigenetic modification system based on SpCas12f1 was developed; using inactivated SpCas12f1, a transcription repression system based on SpCas12f1 was developed.
  • the CRISPR system mainly used in the present invention is the V-F type CRISPR system, in which the effector protein is mainly Syntrophomonaspalmitatica Cas12f (SpCas12f1) nuclease.
  • SpCas12f1 nuclease can accurately locate the target gene under the guidance of the corresponding guide RNA, cut the genomic DNA, and achieve double-stranded breaks in the genomic DNA. Utilizing the host cell's own or exogenous repair mechanisms, this system can efficiently and accurately achieve gene editing in living cells.
  • the invention provides a guide RNA (gRNA), which includes a trans-activating CRISPR RNA (tracrRNA) sequence and a crRNA sequence; the crRNA sequence includes a gene targeting segment capable of hybridizing with a target sequence and a tracr partner sequence (gene The targeting segment and the tracr partner sequence are connected to obtain the RNA sequence (crRNA));
  • gRNA guide RNA
  • tracrRNA trans-activating CRISPR RNA
  • crRNA sequence includes a gene targeting segment capable of hybridizing with a target sequence and a tracr partner sequence (gene The targeting segment and the tracr partner sequence are connected to obtain the RNA sequence (crRNA));
  • the tracrRNA includes the nucleotide sequence shown in SEQ ID NO. 111 or a variant sequence having at least 50% identity with it;
  • the tracr partner sequence includes the nucleotide sequence shown in SEQ ID NO. 112 or a variant sequence having at least 50% identity with it.
  • the guide RNA of the present invention includes one or more of the complementary base pairs in the tracrRNA, and the tracrRNA is obtained by exchanging positions of the paired bases.
  • the base A at position 13 of tracrRNA is complementary to the base U at position 50.
  • the tracrRNA obtained after setting position 13 to U and position 50 to A can also achieve the technical effects of the present invention.
  • the guide RNA of the present invention includes one or more base pairs of complementary pairs between tracrRNA and tracr partner sequence, and the tracrRNA and tracr partner sequence are obtained by exchanging positions of the paired bases.
  • base A at position 70 of tracrRNA is complementary to base U at position 10 of the tracr partner.
  • the counterpart sequence can also achieve the technical effects of the present invention.
  • the gene targeting segment is a nucleotide sequence complementary to the target sequence in the target gene and is located at the 3' end of the crRNA sequence; the gene targeting segment identifies the target The PAM sequence on the sequence; the preferred PAM sequence is at least one of 5'-NTTC and 5'-GTTT, where N is A, T, C, G; further preferably the PAM sequence is 5'-GTTC, 5'-TTTC and 5'-ATTC at least either.
  • the gene targeting segment targets a nucleic acid fragment with a length of 12 to 40 bp after the PAM sequence; for example, it can be 13-20, 18-25, 22-32, 26-37, 30-38, 32-40 nucleotides
  • the length of the acid is preferably 20 bp.
  • the gene targeting segment is preferably an RNA sequence corresponding to a 20 bp length nucleic acid fragment following the PAM sequence.
  • the gene targeting segment is selected from SEQ ID NO. 168.
  • the complementarity percentage between the targeting segment of the guide RNA and the target sequence of the target gene can be at least 50% (for example, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98%, at least 99% or 100%).
  • the gRNA further comprises a transcription terminator.
  • the guide RNA can be two strands, one strand includes a tracrRNA sequence, and the other strand includes a crRNA sequence, wherein the tracr partner sequence hybridizes with the tracrRNA sequence and forms a stem-loop structure.
  • the crRNA sequence and the tracrRNA sequence can be connected together to form a single guide RNA skeleton sequence, that is, the guide RNA is a strand, which sequentially contains from the 5' end to the 3' end.
  • the tracr RNA sequence and crRNA sequence can be connected together to form a single guide RNA skeleton sequence, that is, the guide RNA is a strand, which sequentially contains from the 5' end to the 3' end.
  • the 3' end of the tracr RNA sequence is directly connected to the 5' end of the crRNA sequence or connected through a connecting strand; preferably, the connecting strand is connected, and the connecting strand is selected from oligonucleotides ; Preferably, the number of oligonucleotides in the connecting chain is 3 to 18 nt; further preferably, the connecting chain is AAGG.
  • the crRNA and tracrRNA sequences are two separate RNA sequences and can mediate the activity of nuclease such as SpCas12f1 endonuclease when they exist simultaneously.
  • nuclease such as SpCas12f1 endonuclease
  • the complete guide RNA expression construct for the target sequence obtained by connecting the guide RNA backbone sequence and the DNA-targeting segment that hybridizes to the target sequence can also mediate the activity of nucleases such as SpCas12f1 endonuclease.
  • the gene targeting segment includes a nucleotide sequence complementary to a sequence in the target gene, and the gene targeting sequence interacts with the target gene in a sequence-specific manner through hybridization (i.e., base pairing). interaction.
  • the gene targeting sequence of the gRNA can be modified, for example, by genetic engineering, so that the gRNA hybridizes to any desired sequence within the target gene.
  • gRNA guides the bound polypeptide to a specific nucleotide sequence within the target gene through the above-mentioned gene targeting sequence.
  • the guide RNA of the present invention also contains a stem-loop structure, which forms a protein-binding structure that interacts with nucleases such as SpCas12f1.
  • the protein binding structure of the gRNA includes 4 stem-loop structures, including stem-loop 1, 2, 3, and 4.
  • the target gene is a DNA sequence. In some embodiments, the target gene is an RNA sequence.
  • the invention also provides methods for modifying the guide RNA, including but not limited to individual modifications and combined modifications of trans-activating CRISPR RNA (tracrRNA) and CRISPR RNA (crRNA).
  • the transformation methods include but are not limited to truncation or extension of tracrRNA or crRNA, and the connection between tracrRNA and crRNA.
  • tracrRNA and crRNA are connected with AAGG.
  • the variant sequence of tracrRNA refers to adding, reducing or replacing part of the nucleosides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 111 Sequence obtained after acid.
  • the variant sequence of tracrRNA refers to a sequence obtained by reducing nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO.
  • the variant sequence of tracrRNA refers to a sequence obtained by reducing 1 to 30 nt nucleotides from the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 111.
  • the variant sequence of tracrRNA refers to a reduction of 4 to 16 nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111, and/or, in the SEQ ID NO.
  • the 3' end of the nucleotide sequence shown in NO. 111 is reduced by 3 to 28 nt nucleotides.
  • the variant sequence of the tracr partner sequence refers to a sequence obtained by adding, reducing or replacing part of the nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 112 ;
  • the variant sequence of the tracr partner sequence refers to a sequence obtained by reducing nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 112.
  • the variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 3 to 28 nt nucleotides from the 5' end of the nucleotide sequence shown in SEQ ID NO. 112. More preferably, the variant sequence of the tracr partner sequence refers to a reduction of 8 to 21 nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 112.
  • the variant sequence of tracrRNA refers to a sequence obtained by reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111; preferably, the tracrRNA The variant sequence is shown in SEQ ID NO.114;
  • the variant sequence of tracrRNA refers to a sequence obtained by reducing 6nt nucleotides at the 3' end of the nucleotide sequence shown in SEQ ID NO. 111; preferably, the The variant sequence of tracrRNA is shown in SEQ ID NO.120;
  • the variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 8 nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 112; preferably , the variant sequence of the tracr partner sequence is shown in SEQ ID NO.130;
  • the variant sequence of tracrRNA refers to reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111 and reducing 6nt nucleotides at the 3' end.
  • the variant sequence of tracrRNA is as shown in SEQ ID NO. 138;
  • the variant sequence of tracrRNA refers to reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111 and reducing 6nt nucleotides at the 3' end.
  • the variant sequence of tracrRNA is as shown in SEQ ID NO. 144;
  • the variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 21nt nucleotides at the 3' end of the nucleotide sequence shown in SEQ ID NO. 112; preferably , the variant sequence of the tracr partner sequence is shown in SEQ ID NO. 148.
  • nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 116. In other preferred embodiments, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 122. In other preferred embodiments, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 131. In other preferred embodiments, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 140.
  • the variant sequence of the tracrRNA is shown in SEQ ID NO.144, and the variant sequence of the tracr partner sequence is shown in SEQ ID NO.145; preferably, the tracrRNA and The tracr partner sequences are connected through connecting strands; further preferably, the nucleotide sequence of the backbone sequence of the guide RNA is as shown in SEQ ID NO. 146.
  • the variant sequence of the tracrRNA is shown in SEQ ID NO. 147, and the variant sequence of the tracr partner sequence is shown in SEQ ID NO.
  • the tracrRNA and The tracr partner sequences are connected through connecting strands; further preferably, the nucleotide sequence of the backbone sequence of the guide RNA is as shown in SEQ ID NO. 149.
  • the preferred guide RNA targets the same target sequence in mammalian cell genome editing. Show higher gene deletion and/or gene cleavage efficiency.
  • the present invention also provides modified gRNA, which can be used to achieve hybridization with any desired sequence within the target gene through modification; or, by modifying the gRNA to change the characteristics of the gRNA itself, such as enhancing the stability of the gRNA through modification, including But it is not limited to increasing its resistance to ribonuclease (RNase) degradation present in the cell, thereby extending its half-life in the cell; alternatively, it can be modified to enhance the content of gRNA and endonuclease (such as SpCas12f1 The formation of the CRISPR-SpCas12f1 genome editing complex or its stability; alternatively, it can be used to enhance the specificity of the genome editing complex through modification; alternatively, it can be used to enhance the interaction between the genome editing complex and the genome The initiation site, stability or kinetics of the interaction between the target sequences; alternatively, it can be modified to reduce the likelihood or extent of the RNA introduced into the cell to trigger an innate immune response, etc.
  • RNase
  • RNA can be modified using modification methods known in the art, including but not limited to 2'-fluoro and 2'-amino modifications on the ribose and base residues of the pyrimidine or the reverse base at the 3' end of the RNA. .
  • any modification or combination of modifications can be used to modify gRNA.
  • the sgRNA introduced into the cell is modified to edit any one or more genomic loci.
  • the invention also provides an isolated polynucleotide encoding the guide RNA as described in any one of the above.
  • the invention also provides a construct comprising an isolated polynucleotide as described above.
  • the construct can usually be constructed by inserting the isolated polynucleotide into a suitable expression vector, and those skilled in the art can select a suitable expression vector.
  • the construct may, for example, be a recombinant expression vector, and any suitable expression vector may be used as long as it is compatible with the host cell, including, but not limited to, viral vectors (e.g., vaccinia virus-based viral vectors; poliovirus; Adenovirus; adeno-associated virus; SV40; herpes simplex virus; human immunodeficiency virus; retroviral vectors (e.g., murine leukemia virus, spleen necrosis virus, and vectors derived from retroviruses, such as Rous sarcoma virus, Harvey Sarcoma virus, avian leukemia virus, lentivirus, human immunodeficiency virus, myeloproliferative sarcoma
  • multiple gRNAs are used simultaneously in the same cell to simultaneously regulate transcription of different locations on the same target gene or different target genes.
  • they can exist on the same expression vector or on different vectors, or they can be expressed simultaneously; when they exist on the same vector, they can be expressed under the same control element.
  • a nucleotide sequence encoding a gRNA is operably linked to a control element, such as a transcription control element, such as a promoter.
  • a nucleotide sequence encoding a gRNA is operably linked to an inducible promoter.
  • a nucleotide sequence encoding a gRNA is operably linked to a constitutive promoter.
  • Transcriptional control elements may function in eukaryotic cells, such as mammalian cells (HEK293T cells); or in prokaryotic cells, such as bacterial or archaeal cells.
  • a nucleotide sequence encoding a gRNA is operably linked to a plurality of control elements that permit expression of the nucleotide sequence encoding the gRNA in both prokaryotic and eukaryotic cells.
  • the gRNA can be synthesized by artificial synthesis, for example, by chemical methods, so that it can be easily modified in various ways.
  • the modification can adopt any modification method known in the art, for example, using a polyA tail, adding a 5' cap analog, a 5' or 3' untranslated region (UTR), and the 5' or 3' end including phosphorothioate 2 '-O-methyl nucleotide or treated with phosphatase to remove the 5' terminal phosphate, etc.
  • the nucleotide sequence encoding a gRNA contains one or more modifications that may be used, for example, to enhance activity, stability or specificity, alter delivery, reduce the innate immune response in the host cell, or for Other enhancements.
  • one or more targeting moieties or conjugates that enhance the activity, cellular distribution, or cellular uptake of the nucleotide sequence encoding the gRNA are chemically linked to the gRNA.
  • the targeting moiety or conjugate may include a conjugate group covalently bonded to a functional group; the conjugate group includes reporter molecules, polyamines, polyethylene glycol.
  • a group that enhances pharmacodynamic properties is attached to the gRNA, including groups that improve uptake, enhance resistance to degradation, and/or enhance sequence-specific hybridization to the target nucleic acid. group.
  • the nucleic acid containing the polynucleotide encoding gRNA may be a nucleic acid mimetic.
  • polynucleotide mimetic peptide nucleic acids with excellent hybridization properties may be used.
  • the gRNA, or polynucleotide encoding gRNA is applicable to any biological or in vitro environment, including but not limited to bacteria, archaea, fungi, protists, plants or animals.
  • suitable target cells include, but are not limited to, bacterial cells, archaeal cells, fungal cells, protist cells, plant cells or animal cells.
  • Applicable target cells can be any type of cells, including stem cells, somatic cells, etc.
  • the present invention also provides an expression system, which contains the construct as described above or an exogenous polynucleotide as described above integrated into the genome.
  • the host cell of the expression system is selected from eukaryotic cells or prokaryotic cells; preferably, the host cell is selected from mouse cells and human cells.
  • the present invention also provides a gene editing system, which includes the guide RNA or its encoding polynucleotide as described in any one of the above; further, it may also include a nuclease or its encoding polynucleotide.
  • the polynucleotide encoding the nuclease includes: a coding sequence encoding only a nuclease; a nuclease coding sequence and various additional coding sequences; a nuclease coding sequence (and optional additional coding sequences) and non-coding sequences.
  • the polynucleotide encoding the guide RNA includes: a coding sequence encoding only the guide RNA; a coding sequence of the guide RNA and various additional coding sequences; a coding sequence of the guide RNA (and optional additional coding sequences) and non-coding sequences.
  • the base editing system includes one or more vectors; the one or more vectors include (i) a first regulatory element operably linked to the nuclease The coding polynucleotide; and (ii) a second regulatory element operably linked to the coding polynucleotide of the guide RNA nucleotide sequence; the (i) and (ii) are located on the same or different carriers.
  • the base editing system comprises (i) a nuclease or a variant thereof, and (ii) a vector comprising the coding sequence of the guide RNA.
  • the system includes a gRNA and nuclease complex.
  • the first regulatory element may regulate the transcription of a polynucleotide encoding the nuclease or a variant thereof.
  • the polynucleotide encoding the nuclease or its variant may be one or more, and the first regulatory element may be one or more.
  • the second regulatory element can regulate the transcription of the polynucleotide encoding the guide RNA.
  • the guide RNA encoding polynucleotide may be one or more, and the second regulatory element may be one or more.
  • the system of the present invention may contain one gRNA or multiple gRNAs simultaneously.
  • the system includes multiple gRNAs simultaneously to simultaneously modify different positions on the same target DNA or different target DNAs.
  • two or more guide RNAs target the same gene or transcript or locus.
  • two or more guide RNAs target different unrelated loci.
  • two or more guide RNAs target different but related loci.
  • the nuclease is a CRISPR nuclease; preferably, the nuclease is selected from Cas9, Cas12, Cas13 protein family or variants thereof; further preferably, the Cas nuclease is selected from nSpCas9 and its mutants, SaCas9 and its mutants, Cas12a and its mutants, or Cas12f and its mutants; more preferably, it is SpCas12f1 nuclease or its mutants.
  • SpCas12f1 nuclease is provided directly as a protein; for example, spheroplast transformation can be used to transform fungi with exogenous proteins and/or nucleic acids.
  • SpCas12f1 nuclease can be introduced into cells by any suitable method, such as injection.
  • the gene editing system of the present invention recognizes the PAM sequence on the target sequence; preferably the PAM sequence is at least one of 5'-NTTC and 5'-GTTT, where N is A, T, C, G; further preferably the PAM sequence Be at least any one of 5'-GTTC, 5'-TTTC and 5'-ATTC.
  • the gene editing system targets nucleic acid fragments with a length of 12 to 40 bp after the PAM sequence, and the preferred length is 20 bp.
  • the gene editing system targets at least one target sequence in the genome of the cell.
  • the nucleic acid encoding SpCas12f1 nuclease is DNA. In certain embodiments, the nucleic acid encoding SpCas12f1 nuclease is RNA. In certain embodiments, the nucleic acid encoding SpCas12f1 nuclease is an expression vector, such as a recombinant expression vector.
  • Any suitable expression vector may be used so long as it is compatible with the host cell, including, but not limited to, viral vectors (e.g., vaccinia virus-based viral vectors; poliovirus; adenovirus; adeno-associated virus; SV40; herpes simplex Viruses; human immunodeficiency virus; retroviral vectors (e.g., murine leukemia virus, spleen necrosis virus, and vectors derived from retroviruses, such as Rous sarcoma virus, Harvey sarcoma virus, avian leukemia virus, lentivirus, human immunodeficiency virus, myeloproliferative sarcoma virus and breast tumor virus), etc.
  • viral vectors e.g., vaccinia virus-based viral vectors; poliovirus; adenovirus; adeno-associated virus; SV40; herpes simplex Viruses; human immunodeficiency virus; retroviral vectors (e
  • the nucleic acid encoding the SpCas12f1 nuclease is shown in SEQ ID NO. 40.
  • the invention provides a SpCas12f1 nuclease codon-optimized polynucleotide sequence that is at least 90%, 92%, 93%, 94%, 95%, 96%, 97% identical to SEQ ID NO: 42 , 98%, 99%, 99.2%, 99.5%, 99.8%, 99.9% or 100% sequence homology.
  • the better coding sequence of the SpCas12f1 nuclease optimized for human codons is shown in SEQ ID NO. 42 in the sequence listing, which encodes one or more functional SpCas12f1 domains, or encodes A polypeptide that has the same function as the polypeptide encoded by the original native nucleotide sequence.
  • a nucleotide sequence encoding a SpCas12f1 nuclease is operably linked to a control element, such as a transcription control element, such as a promoter.
  • a control element such as a transcription control element, such as a promoter.
  • the nucleotide sequence encoding SpCas12f1 nuclease is operably linked to an inducible promoter.
  • the nucleotide sequence encoding SpCas12f1 nuclease is operably linked to a constitutive promoter.
  • Transcriptional control elements may function in eukaryotic cells, such as mammalian cells (HEK293T cells); or in prokaryotic cells, such as bacterial or archaeal cells.
  • the nucleotide sequence encoding the SpCas12f1 nuclease is operably linked to a plurality of control elements that allow expression of the nucleotide sequence encoding the SpCas12f1 nuclease in both prokaryotic and eukaryotic cells.
  • the polynucleotide sequence encoding SpCas12fl nuclease is operably linked to a suitable nuclear localization signal for expression in a cellular or in vitro environment.
  • the polynucleotide encoding SpCas12f1 nuclease can be synthesized artificially, for example, chemically, so that it can be easily modified in various ways.
  • the modification may adopt any modification method known in the art.
  • the polynucleotide encoding SpCas12f1 nuclease contains one or more modifications, thereby enabling easy incorporation of a number of modifications, such as enhancing transcriptional activity, altering enzymatic activity, improving its translation or stability ( For example, increasing its resistance to proteolysis, degradation) or specificity, changing solubility, changing delivery, and reducing the innate immune response in host cells.
  • the modification may adopt any modification method known in the art.
  • any one or more genomic loci are edited by modifying the DNA or RNA encoding the SpCas12f1 nuclease introduced into the cell.
  • the nucleic acid sequence encoding SpCas12f1 nuclease is a modified nucleic acid, eg, codon optimized.
  • the modification may be a single modification or a combination of modifications.
  • the nucleic acid comprising the polynucleotide encoding SpCas12f1 nuclease may be a nucleic acid mimetic.
  • polynucleotide mimetic peptide nucleic acids with excellent hybridization properties may be used.
  • the SpCas12f1 nuclease or the polynucleotide encoding SpCas12f1 nuclease is suitable for use in any biological or in vitro environment, including but not limited to bacteria, archaea, fungi, protists, plants or animals.
  • suitable target cells include but are not limited to eukaryotic cells and prokaryotic cells, such as bacterial cells, archaeal cells, fungal cells, protist cells, plant cells or animal cells; the eukaryotic cells include mammalian cells and Plant cells, the prokaryotic cells include Escherichia coli and Klebsiella pneumoniae.
  • Applicable target cells can be any type of cells, including stem cells, somatic cells, etc.
  • the present invention is preferably used for mammalian cells HEK293T cells.
  • the cells may be in vivo or ex vivo.
  • the SpCas12f1 nuclease or nucleic acid encoding the SpCas12f1 nuclease is formulated in liposomes or lipid nanoparticles.
  • SpCas12f1 nuclease and gRNA can form a complex in the host cell to identify the PAM sequence on the target gene (such as target DNA) sequence;
  • the target sequence of the CRISPR/SpCas12f1 gene editing system is A nucleic acid fragment (such as a DNA fragment) of 20 bp in length following the PAM sequence.
  • the complex can selectively modulate the transcription of target DNA in a host cell.
  • the CRISPR/SpCas12f1 gene editing system can cut the double strands of targeted DNA, causing DNA breaks.
  • the system includes a recombinant expression vector.
  • the system comprises a recombinant expression vector comprising (i) a nucleotide sequence encoding a gRNA, wherein the gRNA comprises: (a) a core comprising a sequence complementary to a sequence in the target DNA and (b) a second segment that interacts with SpCas12f1 nuclease; and (ii) a nucleotide sequence encoding SpCas12f1 nuclease, wherein said SpCas12f1 nuclease comprises: (a) an RNA-binding moiety that interacts with the gRNA; and (b) an active moiety that modulates transcription within the target DNA, wherein the site of regulated transcription within the target DNA is determined by the gRNA.
  • SpCas12f1 nuclease variants can also be formed through modification, mutation, DNA shuffling, etc., so that the SpCas12f1 nuclease variants have improved desired characteristics, such as function, activity, kinetics, half-life, etc.
  • the modification may be, for example, the deletion, insertion or substitution of amino acids, or may be, for example, the replacement of the "" of the SpCas12f1 nuclease with a homologous or heterologous cleavage domain from a different nuclease (for example, the HNH domain of a CRISPR-related nuclease).
  • the DNA target of SpCas12f1 nuclease can be changed by any modification method of DNA binding and/or DNA modifying proteins known in the art, such as methylation, demethylation, acetylation, etc. tropism.
  • the DNA shuffling refers to the exchange of sequence fragments between DNA sequences of SpCas12f1 nucleases from different sources to generate chimeric DNA sequences encoding synthetic proteins with RNA-guided endonuclease activity.
  • the modification, mutation, DNA shuffling, etc. may be used singly or in combination.
  • the SpCas12f1 nuclease of the present invention can be:
  • (II) A variant having at least 50% sequence homology with the amino acid sequence of (I) and having RNA-guided nucleic acid binding activity;
  • the SpCas12f1 nuclease has endonuclease activity.
  • the SpCas12f1 can be used in combination with other enzyme components or other components to further develop various potential applications of the SpCas12f1 nuclease.
  • SpCas12f1 nuclease variants under (IV) for example, a single base editing system based on SpCas12f1 nuclease is developed by fusing inactivated SpCas12f1 and a base deaminase; by fusing inactivated SpCas12f1 and Reverse transcriptase develops a Prime editing system based on SpCas12f1 nuclease; by fusing inactivated SpCas12f1 and a transcription activator, a transcription activation system based on SpCas12f1 nuclease is developed; by fusing inactivated SpCas12f1 and a nucleic acid epigenetic modification enzyme, a SpCas12f1-based transcription activation system is developed Nu
  • SpCas12f1 nuclease variants may have the following specific properties, including but not limited to:
  • deaminase activity which can act on cytosine, guanine or adenine bases and then replicate and repair them within the cell through the deamination site to produce guanine, thymine and guanine respectively;
  • the enzymatic activity can be methyltransferase activity, demethylase activity, acetyltransferase activity, deacetylase activity, kinase activity, phosphatase activity, ubiquitin activity, etc.
  • peptide ligase activity deubiquitinating activity, ribosylation activity, etc.
  • the covalent modification of proteins is catalyzed by these enzyme activities; for example, SpCas12f1 nuclease variants catalyze covalent modification of proteins through methylation, acetylation function, ubiquitination, phosphorylation, etc., to modify histones to cause structural changes in histone-related DNA, thereby controlling the structure and properties of DNA).
  • the SpCas12f1 nuclease variant has no cleavage activity. In some embodiments, SpCas12f1 nuclease variants have single-strand cleavage activity. In some embodiments, the SpCas12f1 nuclease variant has double-stranded cleavage activity.
  • Having enhanced activity or ability means having an activity or ability that is increased by at least 1%, 5%, 10%, 20%, 30%, 40%, or 50% compared to wild-type SpCas12f1 nuclease.
  • Having reduced activity and ability refers to having an activity or ability of less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, or less than 1% relative to wild-type SpCas12f1 nuclease.
  • SpCas12f1 and SpCas12f1 nuclease include wild-type SpCas12f1 nuclease and all variants thereof. Those skilled in the art can determine the type of SpCas12f1 nuclease variant by routine means without limitation. Those cited above.
  • Each component in the system of the present invention can be transported by means of carriers.
  • methods that can be used include, but are not limited to, nanoparticles, liposomes, ribonucleoproteins, small molecule RNA-conjugates, chimeras, and RNA-fusion protein complexes, etc.
  • the system of the present invention may further include one or more donor templates.
  • the donor template includes donor sequences for insertion of a target gene.
  • the system of the present invention may further comprise a dimeric FOK1 nuclease, a complete or partially or completely defective SpCas12f1 nuclease or a gRNA connected to the dimeric FOK1 nuclease to guide the process through one or more gRNA molecules. Directs endonuclease cleavage when reaching one or more specific DNA target sites.
  • the system of the present invention can edit or modify DNA at multiple locations in cells for gene therapy, including but not limited to gene therapy for diseases, biological research, and improvement or enhancement of crop resistance. Yield etc.
  • the present invention also provides a composition comprising one or more of the SpCas12f1 nuclease or the polynucleotide encoding the same, gRNA or the polynucleotide encoding the same, a recombinant expression vector, and a system as described above, and also Acceptable carriers, media, etc. may be included.
  • the acceptable carriers and media such as sterile water or physiological saline, stabilizers, excipients, antioxidants (ascorbic acid, etc.), buffers (phosphoric acid, citric acid, other organic acids, etc.), preservatives, surface Active agents (PEG, Tween, etc.), chelating agents (EDTA, etc.), adhesives, etc.
  • polypeptides such as serum albumin, gelatin or immunoglobulin; amino acids such as glycine, glutamine, asparagine, arginine and lysine; sugars such as polysaccharides and monosaccharides or Carbohydrates; sugar alcohols such as mannitol or sorbitol.
  • aqueous solutions for injection such as physiological saline, isotonic solutions containing glucose or other auxiliary drugs, such as D-sorbitol, D-mannose, D-mannitol, and sodium chloride
  • appropriate Solubilizers include alcohols (ethanol, etc.), polyols (propylene glycol, PEG, etc.), nonionic surfactants (Tween 80, HCO-50), etc.
  • the composition includes gRNA and a buffer for stabilizing the nucleic acid.
  • the present invention also provides a kit comprising the system or composition as described above.
  • the kit may further include one or more additional reagents, for example selected from: dilution buffer; wash buffer; control reagents, etc.
  • the kit includes (a) a SpCas12f1 nuclease or a nucleic acid encoding a SpCas12f1 nuclease as described above; and (b) a gRNA or a nucleic acid encoding the gRNA, wherein the gRNA is capable of converting The SpCas12f1 nuclease or variant thereof is directed to a target polynucleotide sequence.
  • the kit further contains a donor template comprising a heterologous polynucleotide sequence capable of being inserted into the target polynucleotide sequence.
  • the present invention also provides a gene editing method, which involves contacting the target gene with the gene editing system as described above to achieve editing of the target gene.
  • the methods of the present invention can be used to target, edit, modify or manipulate target genes (such as target DNA) in cells or in vivo, ex vivo cells or cell-free systems, and the methods include: adding the SpCas12f1 nuclease as described above Or the polynucleotide encoding it, gRNA or the polynucleotide encoding it, recombinant expression vector, system, composition, etc. are introduced into the kit in vivo, in vitro cells or cell-free systems to target, edit and modify the target gene. or manipulation.
  • the method includes the following:
  • sgRNA gRNA
  • a nucleic acid e.g., DNA
  • the gene editing method of the present invention includes the following steps:
  • each parameter condition in the gene editing method of the present invention can be adjusted according to common knowledge in the field.
  • the concentration of the expression vector including nuclease and guide RNA is preferably 1 ⁇ g; the cells are transfected and then The time for editing is preferably 72 hours.
  • the SpCas12f1 nuclease is guided to the target gene through a processed or unprocessed form of guide RNA.
  • the SpCas12f1 nuclease and guide RNA form a complex to recognize the PAM sequence on the target gene.
  • the method further includes the step of introducing a donor template comprising a heterologous polynucleotide sequence into the cell.
  • the present invention also provides the above-mentioned guide RNA, isolated polynucleotide, construct, expression system, gene editing system, pharmaceutical composition or said method for targeting genes and genes in vivo, ex vivo cells or cell-free environment. /or its related peptides for use in gene editing.
  • the in vitro cells are selected from at least one of bacterial cells, archaeal cells, fungal cells, protist cells, viral cells, plant cells and animal cells.
  • the gene editing is selected from the group consisting of: gene cutting, gene deletion, gene insertion, point mutation, transcription inhibition, transcription activation, base editing and guided editing, including but not limited to:
  • SpCas12f1 nuclease has an enzymatic activity that modifies the target gene in a manner other than introducing a double-strand break; the enzymatic activity may be possessed by SpCas12f1 itself, or by, for example, adding an enzyme with The active heterologous polypeptide is fused to SpCas12f1 nuclease to form a chimeric SpCas12f1 nuclease.
  • the enzyme activities include but are not limited to methyltransferase activity, deamination activity, dismutase activity, alkylation activity, and demethylation. Enzyme activity, DNA repair activity, transposase activity, recombinase activity, DNA damage activity, depurination activity, oxidation activity, pyrimidine dimer formation activity, etc.).
  • the gene editing is gene deletion or gene cutting; the gene editing can be used to achieve one or more of the following including but not limited to the correction of pathogenic sites, gene function research, enhancement of cell function, cell therapy, etc. .
  • the SpCas12f1 nuclease of the present invention or the polynucleotide encoding the same, gRNA or the polynucleotide encoding the same, recombinant expression vector, system, composition and kit can be applied in the research field, diagnostic field, industrial field (such as microbial engineering), Drug discovery (such as high-throughput screening), target identification, imaging fields, and therapeutic areas, etc.
  • the target gene is target DNA.
  • the target DNA can be in vitro naked DNA that is not bound to DNA-associated proteins.
  • the target DNA is chromosomal DNA in cells in vitro.
  • the target gene is target RNA.
  • the target DNA is contacted with a targeting complex comprising the SpCas12f1 nuclease and a gRNA, the gRNA providing target specificity to the targeting complex by comprising a nucleotide sequence complementary to the target DNA; SpCas12f1 nucleic acid Enzymes provide site-specific activity.
  • the targeting complex modifies the target DNA, resulting in, for example, DNA cleavage, DNA methylation, DNA damage, DNA repair, and the like.
  • the targeting complex modifies a polypeptide associated with the target DNA (e.g., histones, DNA-binding proteins, etc.), resulting in, for example, methylation of the target DNA-associated polypeptide-histone, histone acetylation, histone acetylation, etc. Protein ubiquitination, etc.
  • SpCas12f1 nuclease or a nucleic acid containing a nucleotide sequence encoding a polypeptide of SpCas12f1 nuclease can be introduced into cells by known methods.
  • gRNA or a nucleic acid comprising a nucleotide sequence encoding gRNA can be introduced into a cell by well-known methods.
  • Well-known methods include DEAE-dextran-mediated transfection, liposome-mediated transfection, virus or phage infection, lipofection, transfection, conjugation, protoplast fusion, polyethylenimine-mediated guided transfection, electroporation, calcium phosphate precipitation, gene gun, calcium phosphate precipitation, microinjection, nanoparticle-mediated nucleic acid delivery, etc. Plasmids are delivered, for example, by electroporation, calcium chloride transfection, microinjection, and lipofection.
  • cells are contacted with viral particles comprising nucleic acid encoding gRNA and/or SpCas12f1 nuclease and/or chimeric SpCas12f1 nuclease and/or donor polynucleotide.
  • a nuclease cleaves target DNA in a cell to create a double-stranded break, which is then repaired by the cell, typically through non-homologous end joining (NHEJ) and homology-directed of repair.
  • NHEJ non-homologous end joining
  • the present invention also provides a cell, including a host cell that has been genetically modified with the above-mentioned SpCas12f1 nuclease or polynucleotide encoding the same, gRNA or polynucleotide encoding the same, recombinant expression vector, system, and composition.
  • the effective dosage of gRNA and/or SpCas12f1 nuclease and/or recombinant expression vector and/or donor polynucleotide is routine for those skilled in the art. This may vary depending on the route of administration and the nature of the condition being treated.
  • the bacteria or prokaryotic bacteria may be Escherichia coli, Klebsiella pneumoniae, Bacteroides ovatus, Campylobacter jejuni, Staphylococcus saprophyticus, Enterococcus faecalis, Bacteroides thetaiotaomicron, Bacteroides vulgaris, Bacteroides monomorpha, Bacteroides, Lactobacillus casei, Bacteroides fragilis, Acinetobacter reuteri, Fusobacterium nucleatum, Bacteroides johnsonii, Bacteroides arabidopsis, Lactobacillus rhamnosus, Bacteroides marseillei, Parabacteroides faecalis, Clostridium death Bacillus and Bifidobacterium breve, etc.
  • the eukaryotic cells include but are not limited to mammalian cells, fungi and other eukaryotic cells.
  • the fungi include yeast and Aspergillus, such as Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis, Kluyveromyces lactis, Schizosaccharomyces pombe and Candida albicans.
  • a novel genome editing method based on extremely small CRISPR/SpCas12f1 nuclease is disclosed.
  • the invention shows that through the guidance and positioning functions of guide RNA, SpCas12f1 can accurately cut genomic DNA and achieve double-stranded breaks in genomic DNA. Utilizing the host cell's own or exogenous repair mechanisms, this system can efficiently and accurately achieve gene editing in living cells.
  • the present invention also provides a method for preparing guide RNA as described above, which method includes individually transforming tracrRNA and crRNA of the basic guide RNA and transforming them in combination, and the transformation is selected from truncation or extension of tracrRNA or crRNA, or TracrRNA and crRNA are connected through connecting strands to prepare engineered guide RNA.
  • the complete sequence of the wild-type guide RNA containing the targeting sequence contains the following sequence:
  • the underlined part is the targeting sequence: 5'- acacagaggaccccuaguaa -3' (SEQ ID NO. 168); which is the 20 bp fragment after the preferred PAM sequence.
  • SpCas12f1 nuclease described in the present invention is Syntrophomonas palmmitatica Cas12f (SpCas12f1), and its amino acid sequence preferably includes the sequence shown below:
  • the codon-optimized nucleotide sequence encoding the SpCas12f1 nuclease E. coli includes the following sequence:
  • the guide RNA sequences of different versions corresponding to the SpCas12f1 nuclease described in the present invention are as follows, where light gray is the tracrRNA sequence; dark gray is the tracr partner sequence; when there is no connecting chain, the tracrRNA sequence + tracr partner sequence is Backbone; when there is a connecting strand, the tracrRNA sequence + connecting strand + tracr partner sequence is the backbone.
  • FIG. 1 Its schematic diagram is shown in Figure 1; its tracrRNA sequence is shown in SEQ ID NO.111, and its tracr partner sequence is shown in SEQ ID NO.112; the guide RNA backbone sequence is SEQ ID NO.1 except for the gene targeting segment.
  • the sequence is shown in SEQ ID NO.113; the gene targeting segment that hybridizes to the target sequence is shown in SEQ ID NO.168 ( acacagaggaccccuaguaa );
  • the tracrRNA sequence is shown in SEQ ID NO.114, the tracr partner sequence is shown in SEQ ID NO.115; the guide RNA backbone sequence is the sequence of SEQ ID NO.2 except for the gene targeting segment, such as SEQ ID NO.116 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.117, the tracr partner sequence is shown in SEQ ID NO.118; the guide RNA backbone sequence is the sequence of SEQ ID NO.3 except the gene targeting segment, such as SEQ ID NO.119 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.120, the tracr partner sequence is shown in SEQ ID NO.121; the guide RNA backbone sequence is the sequence of SEQ ID NO.4 except the gene targeting segment, such as SEQ ID NO.122 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.123, the tracr partner sequence is shown in SEQ ID NO.124; the guide RNA backbone sequence is the sequence of SEQ ID NO.5 except the gene targeting segment, such as SEQ ID NO.125 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.126, the tracr partner sequence is shown in SEQ ID NO.127; the guide RNA backbone sequence is the sequence of SEQ ID NO.6 except for the gene targeting segment, such as SEQ ID NO.128 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.129, the tracr partner sequence is shown in SEQ ID NO.130; the guide RNA backbone sequence is the sequence of SEQ ID NO.7 except the gene targeting segment, such as SEQ ID NO.131 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.132, the tracr partner sequence is shown in SEQ ID NO.133; the guide RNA backbone sequence is the sequence of SEQ ID NO.8 except the gene targeting segment, such as SEQ ID NO.134 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.135, the tracr partner sequence is shown in SEQ ID NO.136; the guide RNA backbone sequence is the sequence of SEQ ID NO.9 except the gene targeting segment, such as SEQ ID NO.137 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.138, the tracr partner sequence is shown in SEQ ID NO.139; the guide RNA backbone sequence is the sequence of SEQ ID NO.10 except for the gene targeting segment, such as SEQ ID NO.140 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.141, the tracr partner sequence is shown in SEQ ID NO.142; the guide RNA backbone sequence is the sequence of SEQ ID NO.11 except the gene targeting segment, such as SEQ ID NO.143 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.144, the tracr partner sequence is shown in SEQ ID NO.145; the guide RNA backbone sequence is the sequence of SEQ ID NO.12 except for the gene targeting segment, such as SEQ ID NO.146 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.147, the tracr partner sequence is shown in SEQ ID NO.148; the guide RNA backbone sequence is the sequence of SEQ ID NO.13 except the gene targeting segment, such as SEQ ID NO.149 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.150, the tracr partner sequence is shown in SEQ ID NO.151; the guide RNA backbone sequence is the sequence of SEQ ID NO.14 except for the gene targeting segment, such as SEQ ID NO.152 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.153, the tracr partner sequence is shown in SEQ ID NO.154; the guide RNA backbone sequence is the sequence of SEQ ID NO.15 except the gene targeting segment, such as SEQ ID NO.155 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.156, the tracr partner sequence is shown in SEQ ID NO.157; the guide RNA backbone sequence is the sequence of SEQ ID NO.16 except the gene targeting segment, such as SEQ ID NO.158 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.159, the tracr partner sequence is shown in SEQ ID NO.160; the guide RNA backbone sequence is the sequence of SEQ ID NO.17 except for the gene targeting segment, such as SEQ ID NO.161 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.162, the tracr partner sequence is shown in SEQ ID NO.163; the guide RNA backbone sequence is the sequence of SEQ ID NO.18 except for the gene targeting segment, such as SEQ ID NO.164 shown;
  • the tracrRNA sequence is shown in SEQ ID NO.165, the tracr partner sequence is shown in SEQ ID NO.166; the guide RNA backbone sequence is the sequence of SEQ ID NO.19 except for the gene targeting segment, such as SEQ ID NO.167 shown.
  • the underlined part is the targeting sequence, which is preferably the 20 bp fragment after the PAM sequence, which can be replaced by other qualifying targeting sequences.
  • SpCas12f1 The terms “SpCas12f1”, “SpCas12f1 nuclease”, “SpCas12f1 polypeptide”, “SpCas12f1 protein” and “SpCas12f1 protein” are used interchangeably.
  • guide RNA guide RNA
  • gRNA single gRNA
  • chimeric gRNA chimeric gRNA
  • homology refers to the sequence similarity between two peptides or between two nucleic acid molecules. Homology can be determined by comparing corresponding positions in different polypeptides or nucleic acid molecules. When the same position in the sequence of the molecule being compared is occupied by the same base or amino acid in different sequences, then the molecule is at that position. Homogenous. The degree of homology between sequences is determined as a function of the number of matches or homologous positions shared by the sequences. An "unrelated" or “non-homologous" sequence should have less than 20% homology to one of the sequences disclosed herein.
  • a polynucleotide or polynucleotide region has a certain percentage of sequence homology (e.g., 20%, 30%) with another polynucleotide or polynucleotide region (or polypeptide or polypeptide region). , 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98% or 99%) refers to the percentage of bases (or amino acids) are the same.
  • sequence homology e.g. 20%, 30%
  • polynucleotide and “oligonucleotide” are used interchangeably and they refer to a polymeric form of nucleotides of any length, whether deoxyribonucleotides or ribonucleotides or its analogues. Polynucleotides can have any three-dimensional structure and can perform any function, known or unknown.
  • polynucleotides include, but are not limited to, the following: genes or gene fragments (including probes, primers, EST or SAGE tags), exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, Ribozymes, cDNA, dsRNA, siRNA, miRNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes and primers.
  • Polynucleotides also include modified nucleotides, such as methylated nucleotides and nucleotide analogs.
  • any embodiment of a polynucleotide disclosed herein includes its double-stranded form and any of the two complementary single-stranded forms known or predicted to constitute the double-stranded form.
  • the term “encoding” means that the polynucleotide "encodes" a polypeptide, meaning that in its native state or when manipulated by methods well known to those skilled in the art, it can be transcribed and/or Or translated to produce a polypeptide of interest and/or a fragment thereof, or to produce an mRNA capable of encoding the polypeptide of interest and/or a fragment thereof.
  • the antisense strand refers to the sequence that is complementary to the polynucleotide and from which the coding sequence can be deduced.
  • genomic DNA refers to the DNA of the genome of an organism, including the DNA of the genome of bacteria, archaea, fungi, protists, viruses, plants or animals.
  • Manipulating DNA includes binding, nicking one strand, or cleaving both strands of DNA, or includes modifying or editing DNA or polypeptides that bind to DNA.
  • Manipulating DNA can silence, activate or regulate the expression of RNA or polypeptide encoded by the DNA (so that it is not transcribed, or reduces the transcription activity, or makes it not translated, or reduces the level of translation), or prevents or enhances the interaction of the polypeptide with the DNA. combine.
  • Cleavage can be performed by a variety of methods, such as enzymatic or chemical hydrolysis of phosphodiester bonds; cleavage can be single-stranded or double-stranded; DNA cleavage can result in blunt or staggered ends.
  • hybridizable or “complementary” or “substantially complementary” means that a nucleic acid (e.g., RNA) contains a nucleotide sequence that enables it to react at appropriate in vitro and/or in vivo temperatures and solution ionic strengths. Non-covalently bind to another nucleic acid under conditions in a sequence-specific, antiparallel manner, i.e., form Watson-Crick base pairs and/or G/U base pairs, “anneal” or “hybridize” ".
  • sequence of a polynucleotide need not be 100% complementary to the sequence of a target nucleic acid to which it specifically hybridizes.
  • Polynucleotides can hybridize on one or more segments.
  • the polynucleotide may comprise at least 70%, at least 80%, at least 90%, at least 95%, at least 99%, or 100% sequence complementarity to a target region within the target nucleic acid sequence to which it is targeted.
  • peptide refers to polymeric forms of amino acids of any length, which may include both coded and non-coded amino acids, chemically or biochemically modified or derivatized Amino acids, and polypeptides with modified peptide backbones.
  • RNA RNA that is translated into a protein
  • RNA RNA that is not translated into a protein
  • a "protein coding sequence” or a sequence encoding a specific protein or polypeptide is a nucleic acid sequence that is transcribed into mRNA (in the case of DNA) and translated (in the case of mRNA) into a polypeptide in vivo or in vitro under the control of appropriate regulatory sequences .
  • vector or "expression vector” is a replicon, such as a plasmid, phage, virus or cosmid, to which another DNA segment, i.e., an "insert” can be attached so that the attached segment can be expressed in a cell copy in.
  • expression cassette encompasses a DNA coding sequence operably linked to a promoter.
  • "Operably connected” means connected in parallel, with the components in a relationship that allows them to function in their intended manner.
  • recombinant expression vector or "DNA construct” are used interchangeably in the present invention to refer to a DNA molecule comprising a vector and at least one insert. Recombinant expression vectors are typically produced for the purpose of expressing and/or amplifying inserts or for the construction of other recombinant nucleotide sequences.
  • the cell When foreign DNA, such as a recombinant expression vector, has been introduced into a cell, the cell has been "genetically modified” or “transformed” or “transfected” by the DNA. The presence of foreign DNA results in permanent or transient genetic changes.
  • the transforming DNA may or may not be integrated into the cell's genome.
  • target DNA is a DNA polynucleotide comprising a "target site” or “target sequence”.
  • target site a DNA polynucleotide comprising a "target site” or "target sequence”.
  • target site a DNA polynucleotide comprising a "target site” or "target sequence”.
  • target site a DNA polynucleotide comprising a "target site” or "target sequence”.
  • target site RNA molecules contain sequences that bind, hybridize, or are complementary to target sequences within target DNA, thereby targeting the bound polypeptide to a specific location within the target DNA (target sequence).
  • “Cleaving” refers to the break of the covalent backbone of the DNA molecule.
  • nuclease and “endonuclease” are used interchangeably to refer to enzymes having endonucleic acid degrading catalytic activity for polynucleotide cleavage.
  • the "cleavage domain” or “active domain” or “nuclease domain” of a nuclease refers to a polypeptide sequence or domain within a nuclease that has catalytic activity for DNA cleavage.
  • the cleavage domain may be contained in a single polypeptide chain, or the cleavage activity may result from the association of two or more polypeptides.
  • targeting polypeptide or "RNA-binding site directed polypeptide” refers to a polypeptide that binds RNA and is targeted to a specific DNA sequence.
  • leader sequence or DNA-targeting segment (or “DNA-targeting sequence”) encompasses a nucleotide sequence complementary to a specific sequence within the target DNA, referred to in the present invention as a “protospacer-like” sequence (Complementary strand of target DNA).
  • HDR Homology-directed repair
  • homology-directed repair may result in changes to the target molecule sequence (e.g., insertions, deletions, mutation).
  • nonhomologous end joining refers to the repair of double-stranded breaks in DNA by directly joining the broken ends to each other without the need for homologous templates. NHEJ often results in deletions of nucleotide sequences near the double-strand break site.
  • treatment includes preventing the occurrence of a disease or symptom; inhibiting a disease or symptom or alleviating a disease.
  • the terms "individual”, “subject”, “host” and “patient” are used interchangeably in the present invention and refer to any mammalian subject, particularly a human, for whom diagnosis, treatment or therapy is desired. .
  • the primers used in the examples were all synthesized by Shanghai Sangon Bioengineering Co., Ltd. and Suzhou Jinweizhi Biotechnology Co., Ltd. If the manufacturer of the reagents or instruments used is not indicated, they are regarded as conventional products that can be purchased through regular channels.
  • human embryonic kidney cell HEK293T was used as the cell used in the experiment.
  • Phosphorylation and annealing of target sequence DNA Prepare the annealing system according to Table 1.
  • the phosphorylation procedure is: react at 37°C for 30 minutes. Then add a final concentration of 50mM NaCl to the 10 ⁇ l reaction system and anneal slowly to obtain the annealed target sequence DNA.
  • the target sequence DNA was inserted into the pCMV-SpCas12f1-gRNA_MS1 plasmid through the Golden gate assembly to construct the pCMV-SpCas12f1-gRNA_MS1-G series plasmids (including pCMV-SpCas12f1-gRNA_MS1-G1 to pCMV-SpCas12f1-gRNA_MS1-G32 plasmids). A total of 32 plasmids).
  • the Golden gate assembly system is shown in Table 2 below.
  • Connection procedure 37°C for 2min, 16°C for 3min, repeat this step for a total of 25 cycles, finally 50°C for 10min, 80°C for 10min.
  • the ligation product was transformed into E. coli DH5 ⁇ cells, and single clones were selected for sequencing to obtain pCMV-SpCas12f1-gRNA_MS1-G series plasmids (transient expression plasmids) containing the targeting sequence.
  • HEK293T cells Culture the activated HEK293T cells in DMEM medium containing 10% FBS by volume. When the cell growth density reaches about 90%, they are passaged into a 24-well plate. The number of cells in each well is about 1.0 ⁇ 10 5 . 16-18 hours later, 1000ng of pCMV-SpCas12f1-gRNA_MS1-G plasmid editing different genes was transfected into the cells using 1.5 ⁇ L lipofectamine3000 (Invitrogen) in each well. After 24 hours, fresh medium containing puromycin at a final concentration of 2 ⁇ g/ml was added for selection. After continuing to culture for 48 hours, the adherent cells were digested and genomic DNA was extracted.
  • PCR amplifies the target gene fragment of the target sequence, and the PCR product after gel recovery is annealed with NEBuffer2 (NEB). Then add T7 endonuclease 1 (NEB) to the PCR reaction system, digest at 37°C for 15 minutes, and then add 6 ⁇ Gel Loading Dye (NEB) to terminate the reaction.
  • the reaction products were separated by 6% TBE-PAGE and stained and imaged by 4S Red dye (Sangon Bioengineering (Shanghai) Co., Ltd.).
  • Results Figure 2 shows the gene editing results of mammalian cells mediated by the CRISPR/SpCas12f1 editing system (pCMV-SpCas12f1-gRNA_MS1 series plasmids). As shown in the figure, the CRISPR/SpCas12f1 editing system successfully achieved gene editing of multiple genes on the mammalian cell genome.
  • the tracrRNA and crRNA of the guide RNA corresponding to the SpCas12f1 nuclease were individually transformed and combined.
  • the specific transformation sites and corresponding base lengths are shown in Figure 3.
  • the specific engineering methods and The modified base length is as shown in the following sequence:
  • Coding nucleotides of gRNA_MS1 5'-ttatctctgtttcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaggctaacgcttctctaacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgc gtaagcgcgtggattgaac acacagaggacccctagtaa -3' (SEQ ID NO. 20);
  • the coding nucleotide of gRNA_MS9 truncates 4nt bases at the 5' end of tracrRNA and truncates 3nt bases at the 3' end of tracrRNA: 5'-tctgtttcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaggctaacgcttctctaacggcgaccttggcgaaatgccatcaataccacgcggcccgaa agggttcgcgcgaaactgagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 28);
  • the coding nucleotide of gRNA_MS10 truncates 7nt bases at the 5' end of tracrRNA and truncates 6nt bases at the 3' end of tracrRNA: 5'- gtttcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaggctaacgcttctctaacggcgaccttggcgaaatgccatcaataccacgcggcccgaaag ggttcgcgcgaaacgaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 29);
  • the coding nucleotide of gRNA_MS11 truncates 10nt bases at the 5' end of tracrRNA and truncates 9nt bases at the 3' end of tracrRNA: 5'-tcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaggctaacgcttctctaacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggt tcgcgcgcgagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 30);
  • the coding nucleotide of gRNA_MS12 truncates 7nt bases at the 5' end of tracrRNA, truncates 6nt bases at the 3' end of tracrRNA, and truncates 8nt bases at the 5' end of crRNA, and adds AAGG four bases between tracrRNA and crRNA Base for ligation: 5'-gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaggctaacgcttctctaacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgcgaaacAAGGgtcgcatcttgcgtaagcgcgtggattgaaac a cacagaggacccctagtaaa
  • the coding nucleotide of gRNA_MS13 truncates 7nt bases at the 5' end of tracrRNA, truncates 6nt bases at the 3' end of tracrRNA, and truncates 21nt bases at the 5' end of crRNA, and adds AAGG four bases between tracrRNA and crRNA Base for ligation: 5'-gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaggctaacgcttctctaacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgcgaaacAAGGtaagcgcgtggattgaaac acacagaggacccctagtaa -3'(SEQ ID NO.32);
  • Coding nucleotide of gRNA_MS15 cut off the upper half of stem loop 2, 16nt base: 5'-ttactctgtttcgcgccagggcagttaggtgccctaaaagagcgaagtaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtc gcatcttgcgtaagcgcgtggattgaac acacagaggacccctagtaa –3’ (SEQ ID NO.34);
  • the coding nucleotide of gRNA_MS16 cut off the entire stem loop 2, 32nt bases: 5'-ttactctgtttcgcgccagggcagttaggtgccctaaaagtctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcg cgtggattgaaac aca cagaggacccctagtaa –3’ (SEQ ID NO. 35);
  • the coding nucleotide of gRNA_MS19 add UUUUAUUUUUUU (SEQ ID NO.169) 11nt base at the 3' end of crRNA: 5'-ttactctgtttcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaggctaacgcttctctaacggcgaccttggcgaaatgccatcaataccacg cggccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaac acacagaggacccctagtaa tttttttt–3′ (SEQ ID NO. 38).
  • nucleotide sequence of the guide RNA is shown in SEQ ID NO.1 to SEQ ID NO.19.
  • human embryonic kidney cell HEK293T was used as the cell used in the experiment.
  • the underlined part is the targeting sequence, which is preferably the 20 bp fragment after the PAM sequence.
  • the plasmid construction steps are the same as the first point in Example 1, except that the guide gRNA_MS1 expression cassette corresponding to SpCas12f1 in human cells is replaced with the guide gRNA_MS2-gRNA_MS19 expression cassette, which is pCMV-SpCas12f1-gRNA_MS2, pCMV-SpCas12f1-gRNA_MS3, pCMV- SpCas12f1-gRNA_MS4, pCMV-SpCas12f1-gRNA_MS5, pCMV-SpCas12f1-gRNA_MS6, pCMV-SpCas12f1-gRNA_MS7, pCMV-SpCas12f1-gRNA_MS8, pCMV-SpCas12f1-gRNA_MS9, pCMV-SpCas12f1-gRNA_MS10, pCMV-SpCas 12f1-gRNA_MS
  • the gene editing results of engineered SpCas12f1 gRNA_MS1 to gRNA_MS19 in mammalian cells are shown in Figure 4.
  • the engineered gRNA_MS2, gRNA_MS4, gRNA_MS7, gRNA_MS10, gRNA_MS12, and gRNA_MS13 of SpCas12f1 can improve the gene editing efficiency of SpCas12f1 in mammalian cells, among which gRNA_MS13 is the most efficient; gRNA_MS19 has little effect on cell editing; Removing any one of the four stem-loops on the gRNA will have a great impact on the activity, so all four stem-loops cannot be removed.
  • the preferred SpCas12f1 engineered guide RNA gene sequence described in this example is gRNA_MS13: 5'-guuucgcgcgccagggcaguuaggugcccuaaaagagcgaaguggccgaaaggaaggcuaacgcuucucuaacgcuacggcgaccuuggcgaaaugccaucaauaccacgcggcccgaaaggguucgcgcgaaaggguucgcgcgaaacAAGGuaagcg cguggauugaaac acacagaggaccccuaguaa –3'
  • human embryonic kidney cell HEK293T was used as the cell used in the experiment.
  • the plasmid construction steps are the same as the first point in Example 1, except that the guide gRNA_MS1 expression cassette corresponding to SpCas12f1 in human cells is replaced with the guide gRNA_MS13 expression cassette.
  • point 2 of Example 1 different targeting sequences were inserted into the above plasmids to obtain pCMV-SpCas12f1-gRNA_MS13-G series plasmids containing targeting sequences.
  • FIG. 5 Comparison of the editing efficiency of gRNA_MS1 and gRNA_MS13 in HEK293T cells is shown in Figure 5. It can be seen from the figure that the engineered gRNA_MS13 can be used to achieve efficient editing in mammalian cells.
  • the editing efficiency of the same site can be increased by up to about 9 times compared with gRNA_MS1.
  • the editing efficiency of the Guide3 site can be increased from 2.58% to 21.95%. , greatly improving the activity and versatility of SpCas12f1 in mammalian cell gene editing.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Zoology (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • Organic Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Animal Behavior & Ethology (AREA)
  • Epidemiology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Veterinary Medicine (AREA)
  • Public Health (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Immunology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Cell Biology (AREA)
  • Urology & Nephrology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

Provided are an optimized CRISPR/SpCas12f1 system, an engineered guide RNA and use thereof. The CRISPR/SpCas12f1 genome editing system comprises: an expression construct comprising an SpCas12f1 nuclease; and an expression construct comprising an expressed DNA sequence for a guide RNA corresponding to the SpCas12f1 nuclease and a targeting sequence of a target sequence. Also provided is a method for optimizing a guide RNA corresponding to an SpCas12f1 nuclease, comprising: modifying tracrRNA and crRNA individually or in combination. By using the gene editing system of the present invention, a target gene in a cell can be accurately edited; by using the optimized guide RNA of the present invention, the editing efficiency and applicability of the editing system can be greatly improved.

Description

一种优化的CRISPR/SpCas12f1系统、工程化向导RNA及其应用An optimized CRISPR/SpCas12f1 system, engineered guide RNA and its applications 技术领域Technical field
本发明属于生物技术领域,涉及一种优化的CRISPR/SpCas12f1系统、工程化向导RNA及其应用,具体涉及工程化向导RNA、基于极小型CRISPR/SpCas12f1系统的基因编辑方法。The invention belongs to the field of biotechnology and relates to an optimized CRISPR/SpCas12f1 system, engineered guide RNA and its application. Specifically, it relates to an engineered guide RNA and a gene editing method based on an extremely small CRISPR/SpCas12f1 system.
背景技术Background technique
Clustered regularly interspaced short palindromic repeats(CRISPR)及其相关基因(Cas)在许多古细菌和细菌中作为适应性免疫系统发挥作用,已经发展成为多功能基因组编辑工具。CRISPR-Cas系统使用三个步骤来保护微生物免受外来入侵:(1)适应,Cas蛋白捕获外来核酸片段(间隔子)并插入CRISPR阵列;(2)处理CRISPR阵列的转录本以产生成熟的CRISPR RNA(crRNAs);(3)干扰,crRNAs引导Cas蛋白靶向并切割新入侵者的同源序列。利用基因组编辑技术可促进细胞工程、模型动物的产生,新植物品种的开发和基因筛查等,还有望在癌症、遗传疾病和传染病的基因治疗中发挥作用。Clustered regularly interspaced short palindromic repeats (CRISPR) and its associated genes (Cas) function as adaptive immune systems in many archaea and bacteria and have evolved into versatile genome editing tools. The CRISPR-Cas system uses three steps to protect microorganisms from foreign invasion: (1) adaptation, in which Cas proteins capture foreign nucleic acid fragments (spacers) and insert them into the CRISPR array; (2) processing of transcripts from the CRISPR array to produce mature CRISPR RNA (crRNAs); (3) Interference, crRNAs guide the Cas protein to target and cut the homologous sequence of the new invader. The use of genome editing technology can promote cell engineering, the generation of model animals, the development of new plant varieties and genetic screening, etc. It is also expected to play a role in gene therapy of cancer, genetic diseases and infectious diseases.
目前CRISPR/Cas9和CRISPR/Cas12a是最广泛使用的基因组编辑系统,但由于这两种系统的尺寸较大,限制了它们在基因治疗等领域中发挥作用。最近发现了一些小尺寸的V-F亚型蛋白—Cas12f,其约由400–700个氨基酸残基组成,并包含一个RuvC核酸酶结构域,这些小尺寸的蛋白可以帮助解决腺相关病毒载体包装的问题。SyntrophomonaspalmitaticaCas12f1(SpCas12f1,497个氨基酸)最初被报道仅显示大肠杆菌质粒干扰活性。随后的一项研究揭示了SpCas12f1在真核细胞也能靶向目标基因,引发基因的插入和缺失,但其效率只有0.1%-3.6%。因此,优化小分子量或小尺寸的基因编辑系统,使之成为高效和精确的基因编辑工具显得尤为迫切。Currently, CRISPR/Cas9 and CRISPR/Cas12a are the most widely used genome editing systems, but the large size of these two systems limits their use in fields such as gene therapy. Recently, some small-sized V-F subtype proteins - Cas12f, which are composed of approximately 400–700 amino acid residues and contain a RuvC nuclease domain, have been discovered. These small-sized proteins can help solve the problem of adeno-associated virus vector packaging. . SyntrophomonaspalmitaticaCas12f1 (SpCas12f1, 497 amino acids) was originally reported to only show E. coli plasmid interference activity. A subsequent study revealed that SpCas12f1 can also target target genes in eukaryotic cells and trigger gene insertion and deletion, but its efficiency is only 0.1%-3.6%. Therefore, it is particularly urgent to optimize small molecular weight or small size gene editing systems to make them efficient and precise gene editing tools.
发明内容Contents of the invention
针对上述问题,本发明的目的是要解决现有技术框架内小型核酸酶基因编辑的缺陷,提供一种优化的CRISPR/SpCas12f1系统及其高效基因编辑方法。利用本发明提供的优化的基因编辑系统和方法能够在体外(包括细胞)或体内实现高效和精准的目标基因或目标基因组的编辑,极大地提高小型CRISPR/SpCas12f1 系统的基因编辑效率。In response to the above problems, the purpose of the present invention is to solve the shortcomings of small nuclease gene editing within the framework of the existing technology and provide an optimized CRISPR/SpCas12f1 system and its efficient gene editing method. The optimized gene editing system and method provided by the present invention can achieve efficient and precise editing of target genes or target genomes in vitro (including cells) or in vivo, greatly improving the gene editing efficiency of the small CRISPR/SpCas12f1 system.
本发明的目的之一是提供一种向导RNA,其特征在于,其包含tracr RNA序列和crRNA序列;所述crRNA序列包含能够与靶序列杂交的基因靶向区段和tracr配对物序列;所述tracr RNA序列和tracr配对物序列构成向导RNA的骨架序列;所述与靶序列杂交的基因靶向区段可以根据进行编辑的靶基因进行确定;One of the purposes of the present invention is to provide a guide RNA, which is characterized in that it includes a tracr RNA sequence and a crRNA sequence; the crRNA sequence includes a gene targeting segment capable of hybridizing with a target sequence and a tracr partner sequence; the The tracr RNA sequence and the tracr partner sequence constitute the backbone sequence of the guide RNA; the gene targeting segment that hybridizes to the target sequence can be determined based on the target gene for editing;
其中,所述tracrRNA包含如SEQ ID NO.111所示的核苷酸序列或与其具有至少50%以上同一性的变体序列;Wherein, the tracrRNA includes the nucleotide sequence shown in SEQ ID NO. 111 or a variant sequence having at least 50% identity with it;
所述tracr配对物序列包含如SEQ ID NO.112所示的核苷酸序列或与其具有至少50%以上同一性的变体序列。The tracr partner sequence includes the nucleotide sequence shown in SEQ ID NO. 112 or a variant sequence having at least 50% identity with it.
本发明的另一目的是提供一种分离的多核苷酸,其编码如上所述的向导RNA。Another object of the present invention is to provide an isolated polynucleotide encoding the guide RNA as described above.
本发明的另一目的是提供一种构建体,所述构建体含有如上所述的分离的多核苷酸。Another object of the present invention is to provide a construct containing an isolated polynucleotide as described above.
本发明的另一目的是提供一种表达系统,所述表达系统含有如上所述的构建体或基因组中整合有外源的如上所述的多核苷酸。Another object of the present invention is to provide an expression system, which contains the construct as described above or an exogenous polynucleotide as described above integrated into the genome.
本发明的另一目的是提供一种基因编辑系统,其包括如上所述的向导RNA或其编码多核苷酸;所述编辑系统还包括核酸酶或其编码多核苷酸。Another object of the present invention is to provide a gene editing system, which includes the guide RNA or its encoding polynucleotide as described above; the editing system further includes a nuclease or its encoding polynucleotide.
本发明的另一目的是提供一种基因编辑方法,将靶基因与如上所述的基因编辑系统接触,以实现靶基因的编辑。Another object of the present invention is to provide a gene editing method that contacts the target gene with the gene editing system as described above to achieve editing of the target gene.
本发明的另一目的是提供如上所述的向导RNA、分离的多核苷酸、构建体、表达系统、基因编辑系统、药物组合物或所述的方法在体内、离体细胞或无细胞环境中对靶基因和/或其相关多肽进行基因编辑中的应用。Another object of the present invention is to provide the guide RNA, isolated polynucleotide, construct, expression system, gene editing system, pharmaceutical composition or method described above in vivo, ex vivo cell or cell-free environment. Applications in gene editing of target genes and/or their related polypeptides.
本发明的另一目的是提供一种遗传修饰的细胞,其通过如上所述的基因编辑系统或所述的方法进行基因编辑获得。Another object of the present invention is to provide a genetically modified cell, which is obtained by gene editing using the gene editing system or method described above.
本发明的积极进步效果包括:The positive progressive effects of the present invention include:
(1)本发明通过多轮gRNA改造,对SpCas12f1的gRNA进行了深入探究,从而确定了编辑效率最高的gRNA,将CRISPR/SpCas12f1系统转化为高效和精准的基因组编辑工具。与原始的gRNA_MS1相比,最优版本的gRNA对测试的全部靶点中进行基因插入和基因删除的效率均能提升约2-9倍不等。(1) The present invention conducted in-depth exploration of the gRNA of SpCas12f1 through multiple rounds of gRNA transformation, thereby determining the gRNA with the highest editing efficiency and transforming the CRISPR/SpCas12f1 system into an efficient and accurate genome editing tool. Compared with the original gRNA_MS1, the optimal version of gRNA can increase the efficiency of gene insertion and gene deletion in all tested targets by approximately 2-9 times.
(2)本发明中gRNA重塑显著降低了gRNA的大小,工程化后SpCas12f1系统体积更小,为用于AAV传递的基因编辑和基因治疗提供了更多可能。这种紧凑的CRISPR系统在基因组编辑方面具有广阔的应用前景。(2) gRNA remodeling in the present invention significantly reduces the size of gRNA, and the engineered SpCas12f1 system is smaller, providing more possibilities for gene editing and gene therapy for AAV delivery. This compact CRISPR system has broad application prospects in genome editing.
(3)本发明在哺乳动物细胞基因组编辑实验中测试了在9个内源性基因上共32个靶向序列的编辑效率,提高了小型CRISPR系统在细胞基因编辑的适用性。(3) The present invention tested the editing efficiency of a total of 32 targeted sequences on 9 endogenous genes in a mammalian cell genome editing experiment, thereby improving the applicability of the small CRISPR system in cell gene editing.
附图说明Description of the drawings
图1为实施例1中SpCas12f1核酸酶的初始版本gRNA_MS1的示意图。Figure 1 is a schematic diagram of the initial version of gRNA_MS1 of SpCas12f1 nuclease in Example 1.
图2为实施例1中SpCas12f1核酸酶与相应的初始版本gRNA_MS1介导的人细胞中的基因编辑结果图。结果显示,SpCas12f1核酸酶成功实现了对哺乳动物细胞基因组上多个基因的基因编辑。Figure 2 is a diagram showing the gene editing results in human cells mediated by SpCas12f1 nuclease and the corresponding initial version of gRNA_MS1 in Example 1. The results showed that SpCas12f1 nuclease successfully achieved gene editing of multiple genes on the mammalian cell genome.
图3为实施例2中SpCas12f1核酸酶gRNA工程化的具体改造位点和相应碱基长度示意图。Figure 3 is a schematic diagram of the specific modification sites and corresponding base lengths of SpCas12f1 nuclease gRNA engineering in Example 2.
图4为实施例2中SpCas12f1核酸酶与相应的不同版本gRNA介导的哺乳动物细胞中的基因编辑结果图。结果显示,部分工程化的gRNA的编辑效率比初始版本gRNA_MS1高,其中效率最高的是gRNA_MS13。Figure 4 is a diagram showing the gene editing results in mammalian cells mediated by SpCas12f1 nuclease and corresponding different versions of gRNA in Example 2. The results showed that the editing efficiency of the partially engineered gRNA was higher than that of the initial version gRNA_MS1, with the highest efficiency being gRNA_MS13.
图5为实施例3中SpCas12f1核酸酶与相应的初始版本(MS1)及最优版本(MS13)的gRNA介导的哺乳动物细胞中的基因编辑比较结果图。结果显示,gRNA_MS13极大的提高了核酸酶在哺乳动物细胞中的基因编辑效率。Figure 5 is a diagram showing the comparison results of gene editing in mammalian cells mediated by SpCas12f1 nuclease and the corresponding initial version (MS1) and optimal version (MS13) of gRNA in Example 3. The results showed that gRNA_MS13 greatly improved the gene editing efficiency of nuclease in mammalian cells.
具体实施方式Detailed ways
为更好的说明本发明的目的、技术方案和积极进步的效果,下面将结合附图和具体实施例对本发明作进一步阐述。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。此外应理解,在阅读了本发明讲授的内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。例如,通过融合失活的SpCas12f1和碱基脱氨酶开发基于SpCas12f1的单碱基编辑系统;通过融合失活的SpCas12f1和逆转录酶开发基于SpCas12f1的Prime编辑系统;通过融合失活的SpCas12f1和转录激活因子,开发基于SpCas12f1的转录激活系统;通过融合失活的SpCas12f1和核酸表观修饰酶,开发基于SpCas12f1的表观修饰系统;利用失活的SpCas12f1,开发基于 SpCas12f1的转录抑制系统。In order to better explain the purpose, technical solutions and positive progressive effects of the present invention, the present invention will be further elaborated below with reference to the accompanying drawings and specific embodiments. It should be understood that these examples are only used to illustrate the invention and are not intended to limit the scope of the invention. In addition, it should be understood that after reading the teachings of the present invention, those skilled in the art can make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined by the appended claims of this application. For example, a single base editing system based on SpCas12f1 was developed by fusing inactivated SpCas12f1 and base deaminase; a Prime editing system based on SpCas12f1 was developed by fusing inactivated SpCas12f1 and reverse transcriptase; a SpCas12f1-based Prime editing system was developed by fusing inactivated SpCas12f1 and transcription activator to develop a transcription activation system based on SpCas12f1; by fusing inactivated SpCas12f1 and a nucleic acid epigenetic modification enzyme, an epigenetic modification system based on SpCas12f1 was developed; using inactivated SpCas12f1, a transcription repression system based on SpCas12f1 was developed.
本发明中主要使用的CRISPR系统是V-F型CRISPR系统,其中效应蛋白主要为Syntrophomonaspalmitatica Cas12f(SpCas12f1)核酸酶。SpCas12f1核酸酶在对应的向导RNA引导下能准确定位到目标基因,对基因组DNA进行切割,实现基因组DNA双链断裂。利用宿主细胞自身或外源的修复机制,该系统能够高效和精确地实现活细胞内基因编辑。The CRISPR system mainly used in the present invention is the V-F type CRISPR system, in which the effector protein is mainly Syntrophomonaspalmitatica Cas12f (SpCas12f1) nuclease. SpCas12f1 nuclease can accurately locate the target gene under the guidance of the corresponding guide RNA, cut the genomic DNA, and achieve double-stranded breaks in the genomic DNA. Utilizing the host cell's own or exogenous repair mechanisms, this system can efficiently and accurately achieve gene editing in living cells.
本发明提供了一种向导RNA(gRNA),其包含trans-activating CRISPR RNA(tracrRNA)序列和crRNA序列;所述crRNA序列包含能够与靶序列杂交的基因靶向区段和tracr配对物序列(基因靶向区段和tracr配对物序列连接得到RNA序列(crRNA));The invention provides a guide RNA (gRNA), which includes a trans-activating CRISPR RNA (tracrRNA) sequence and a crRNA sequence; the crRNA sequence includes a gene targeting segment capable of hybridizing with a target sequence and a tracr partner sequence (gene The targeting segment and the tracr partner sequence are connected to obtain the RNA sequence (crRNA));
其中,所述tracrRNA包含如SEQ ID NO.111所示的核苷酸序列或与其具有至少50%以上同一性的变体序列;Wherein, the tracrRNA includes the nucleotide sequence shown in SEQ ID NO. 111 or a variant sequence having at least 50% identity with it;
所述tracr配对物序列包含如SEQ ID NO.112所示的核苷酸序列或与其具有至少50%以上同一性的变体序列。The tracr partner sequence includes the nucleotide sequence shown in SEQ ID NO. 112 or a variant sequence having at least 50% identity with it.
本发明所述向导RNA包含在所述tracrRNA中互补配对的碱基对的一个或多个中,配对碱基互换位置后获得的tracrRNA。例如,tracrRNA的第13位碱基A与第50位碱基U互补配对,在将13位设置为U,第50位设置为A后获得的tracrRNA也能实现本发明的技术效果。The guide RNA of the present invention includes one or more of the complementary base pairs in the tracrRNA, and the tracrRNA is obtained by exchanging positions of the paired bases. For example, the base A at position 13 of tracrRNA is complementary to the base U at position 50. The tracrRNA obtained after setting position 13 to U and position 50 to A can also achieve the technical effects of the present invention.
本发明所述向导RNA包含在tracrRNA与tracr配对物序列互补配对的碱基对的一个或多个中,配对碱基互换位置后获得的tracrRNA和tracr配对物序列。例如,tracrRNA第70位碱基A与tracr配对物的第10位碱基U互补配对,在将tracrRNA第70位碱基设置为U,tracr配对物第10位设置为A后获得的tracrRNA和tracr配对物序列也能实现本发明的技术效果。The guide RNA of the present invention includes one or more base pairs of complementary pairs between tracrRNA and tracr partner sequence, and the tracrRNA and tracr partner sequence are obtained by exchanging positions of the paired bases. For example, base A at position 70 of tracrRNA is complementary to base U at position 10 of the tracr partner. The tracrRNA and tracr obtained after setting base 70 of tracrRNA to U and position 10 of the tracr partner to A The counterpart sequence can also achieve the technical effects of the present invention.
本发明所述向导RNA中,所述基因靶向区段为与靶基因中的靶序列互补的核苷酸序列,位于所述crRNA序列的3’端;所述基因靶向区段识别靶向序列上的PAM序列;优选PAM序列为5’-NTTC和5’-GTTT至少任一,其中N为A、T、C、G;进一步优选地PAM序列为5’-GTTC,5’-TTTC和5’-ATTC至少任一。所述基因靶向区段靶向PAM序列之后长度为12~40bp的核酸片段;例如可以是13-20、18-25、22-32、26-37、30-38、32-40个核苷酸的长度,优选的长度为20bp。 在一实施方式中,所述的基因靶向区段较佳地为PAM序列之后长度为20bp的核酸片段所对应的RNA序列。在一实施方式中,所述基因靶向区段选自SEQ ID NO.168。本发明中,所述向导RNA的靶向区段与靶基因的靶序列之间的互补性百分比可以为至少50%(例如,至少55%,至少60%,至少65%,至少70%,至少75%,至少80%,至少85%,至少90%,至少95%,至少97%,至少98%,至少99%或100%)。In the guide RNA of the present invention, the gene targeting segment is a nucleotide sequence complementary to the target sequence in the target gene and is located at the 3' end of the crRNA sequence; the gene targeting segment identifies the target The PAM sequence on the sequence; the preferred PAM sequence is at least one of 5'-NTTC and 5'-GTTT, where N is A, T, C, G; further preferably the PAM sequence is 5'-GTTC, 5'-TTTC and 5'-ATTC at least either. The gene targeting segment targets a nucleic acid fragment with a length of 12 to 40 bp after the PAM sequence; for example, it can be 13-20, 18-25, 22-32, 26-37, 30-38, 32-40 nucleotides The length of the acid is preferably 20 bp. In one embodiment, the gene targeting segment is preferably an RNA sequence corresponding to a 20 bp length nucleic acid fragment following the PAM sequence. In one embodiment, the gene targeting segment is selected from SEQ ID NO. 168. In the present invention, the complementarity percentage between the targeting segment of the guide RNA and the target sequence of the target gene can be at least 50% (for example, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98%, at least 99% or 100%).
在一些实施方式中,所述gRNA还包含转录终止子。In some embodiments, the gRNA further comprises a transcription terminator.
本发明所述向导RNA中,所述向导RNA可以为两条链,一条链包含tracrRNA序列,另一条链包含crRNA序列,其中所述tracr配对物序列与所述tracrRNA序列杂交,并形成茎-环结构。本发明所述向导RNA中,所述crRNA序列与所述tracrRNA序列能够连接在一起,形成单条的向导RNA骨架序列,即所述向导RNA为一条链,其从5’端至3’端依次包含所述tracr RNA序列和crRNA序列。当所述向导RNA为一条链时,所述tracr RNA序列的3’端与crRNA序列的5’端直接连接或通过连接链连接;优选为连接链连接,所述连接链选自寡核苷酸;优选地,所述连接链的寡核苷酸的个数为3~18nt;进一步优选地,所述连接链为AAGG。In the guide RNA of the present invention, the guide RNA can be two strands, one strand includes a tracrRNA sequence, and the other strand includes a crRNA sequence, wherein the tracr partner sequence hybridizes with the tracrRNA sequence and forms a stem-loop structure. In the guide RNA of the present invention, the crRNA sequence and the tracrRNA sequence can be connected together to form a single guide RNA skeleton sequence, that is, the guide RNA is a strand, which sequentially contains from the 5' end to the 3' end. The tracr RNA sequence and crRNA sequence. When the guide RNA is one strand, the 3' end of the tracr RNA sequence is directly connected to the 5' end of the crRNA sequence or connected through a connecting strand; preferably, the connecting strand is connected, and the connecting strand is selected from oligonucleotides ; Preferably, the number of oligonucleotides in the connecting chain is 3 to 18 nt; further preferably, the connecting chain is AAGG.
本发明所述向导RNA中,crRNA与tracrRNA序列作为两条单独的RNA序列,在同时存在情况下可以介导核酸酶如SpCas12f1核酸内切酶活性。或向导RNA骨架序列和与靶序列杂交的DNA-靶向区段连接后得到的针对靶序列的完整向导RNA表达构建体,同样可以介导核酸酶如SpCas12f1核酸内切酶活性。In the guide RNA of the present invention, the crRNA and tracrRNA sequences are two separate RNA sequences and can mediate the activity of nuclease such as SpCas12f1 endonuclease when they exist simultaneously. Or the complete guide RNA expression construct for the target sequence obtained by connecting the guide RNA backbone sequence and the DNA-targeting segment that hybridizes to the target sequence can also mediate the activity of nucleases such as SpCas12f1 endonuclease.
本发明所述向导RNA中,所述基因靶向区段包含与靶基因中的序列互补的核苷酸序列,该基因靶向序列通过杂交(即碱基配对)以序列特异性方式与靶基因相互作用。可以通过例如基因工程的方式修饰gRNA的基因靶向序列,以使得gRNA与靶基因内的任何所需序列杂交。gRNA通过上述基因靶向序列将结合的多肽引导至靶基因内的特定核苷酸序列。In the guide RNA of the present invention, the gene targeting segment includes a nucleotide sequence complementary to a sequence in the target gene, and the gene targeting sequence interacts with the target gene in a sequence-specific manner through hybridization (i.e., base pairing). interaction. The gene targeting sequence of the gRNA can be modified, for example, by genetic engineering, so that the gRNA hybridizes to any desired sequence within the target gene. gRNA guides the bound polypeptide to a specific nucleotide sequence within the target gene through the above-mentioned gene targeting sequence.
本发明所述向导RNA中还包含茎-环结构,所述茎-环结构形成与核酸酶如SpCas12f1相互作用的蛋白结合结构。在一些实施方式中,所述gRNA的蛋白结合结构包含4个茎环结构,包括茎环1、2、3、和4。The guide RNA of the present invention also contains a stem-loop structure, which forms a protein-binding structure that interacts with nucleases such as SpCas12f1. In some embodiments, the protein binding structure of the gRNA includes 4 stem-loop structures, including stem- loop 1, 2, 3, and 4.
在一些实施方式中,靶基因是DNA序列。在一些实施方式中,靶基因是RNA 序列。In some embodiments, the target gene is a DNA sequence. In some embodiments, the target gene is an RNA sequence.
本发明还提供了对所述向导RNA进行改造的方法,包括但不限于对trans-activating CRISPR RNA(tracrRNA)和CRISPR RNA(crRNA)的单独改造以及组合改造。所述的改造方法包括但不限于对tracrRNA或crRNA的截断或延长、tracrRNA和crRNA之间的连接。在本发明某一较佳实施例中,tracrRNA和crRNA之间用AAGG连接。The invention also provides methods for modifying the guide RNA, including but not limited to individual modifications and combined modifications of trans-activating CRISPR RNA (tracrRNA) and CRISPR RNA (crRNA). The transformation methods include but are not limited to truncation or extension of tracrRNA or crRNA, and the connection between tracrRNA and crRNA. In a preferred embodiment of the present invention, tracrRNA and crRNA are connected with AAGG.
本发明所述向导RNA中,所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端和/或3’端增加、减少或替换部分核苷酸后获得的序列。优选地,所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端和/或3’端减少核苷酸后获得的序列,即,可以只在5’端减少或截断一定个数的核苷酸,或只在3’端减少或截断一定数量的核苷酸,或同时在5’端和3’端减少或截断一定个数的核苷酸。进一步优选地,所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端和/或3’端减少1~30nt核苷酸后获得的序列。更进一步优选地,所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端减少4~16nt核苷酸,和/或,在所述SEQ ID NO.111所示的核苷酸序列的3’端减少3~28nt核苷酸。In the guide RNA of the present invention, the variant sequence of tracrRNA refers to adding, reducing or replacing part of the nucleosides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 111 Sequence obtained after acid. Preferably, the variant sequence of tracrRNA refers to a sequence obtained by reducing nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 111, that is, it can only Reduce or truncate a certain number of nucleotides at the 5' end, or reduce or truncate a certain number of nucleotides at the 3' end only, or reduce or truncate a certain number of nucleosides at both the 5' and 3' ends. acid. Further preferably, the variant sequence of tracrRNA refers to a sequence obtained by reducing 1 to 30 nt nucleotides from the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 111. More preferably, the variant sequence of tracrRNA refers to a reduction of 4 to 16 nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111, and/or, in the SEQ ID NO. The 3' end of the nucleotide sequence shown in NO. 111 is reduced by 3 to 28 nt nucleotides.
所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端和/或3’端增加、减少或替换部分核苷酸后获得的序列;优选地,所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端和/或3’端减少核苷酸后获得的序列。即,可以只在5’端减少或截断一定个数的核苷酸,或只在3’端减少或截断一定数量的核苷酸,或同时在5’端和3’端减少或截断一定个数的核苷酸。进一步优选地,所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端减少3~28nt核苷酸后获得的序列。更进一步优选地,所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端减少8~21nt核苷酸。The variant sequence of the tracr partner sequence refers to a sequence obtained by adding, reducing or replacing part of the nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 112 ; Preferably, the variant sequence of the tracr partner sequence refers to a sequence obtained by reducing nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 112. That is, you can reduce or truncate a certain number of nucleotides only at the 5' end, or reduce or truncate a certain number of nucleotides only at the 3' end, or reduce or truncate a certain number of nucleotides at both the 5' and 3' ends. number of nucleotides. Further preferably, the variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 3 to 28 nt nucleotides from the 5' end of the nucleotide sequence shown in SEQ ID NO. 112. More preferably, the variant sequence of the tracr partner sequence refers to a reduction of 8 to 21 nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 112.
在一些优选实施方式中,所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端减少7nt核苷酸获得的序列;优选地,所述tracrRNA的变体序列如SEQ ID NO.114所示;In some preferred embodiments, the variant sequence of tracrRNA refers to a sequence obtained by reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111; preferably, the tracrRNA The variant sequence is shown in SEQ ID NO.114;
在另一些优选实施方式中,所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的3’端减少6nt核苷酸获得的序列;优选地,所述tracrRNA的变体序列如SEQ ID NO.120所示;In other preferred embodiments, the variant sequence of tracrRNA refers to a sequence obtained by reducing 6nt nucleotides at the 3' end of the nucleotide sequence shown in SEQ ID NO. 111; preferably, the The variant sequence of tracrRNA is shown in SEQ ID NO.120;
在另一些优选实施方式中,所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端减少8nt核苷酸获得的序列;优选地,所述tracr配对物序列的变体序列如SEQ ID NO.130所示;In other preferred embodiments, the variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 8 nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 112; preferably , the variant sequence of the tracr partner sequence is shown in SEQ ID NO.130;
在另一些优选实施方式中,所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端减少7nt核苷酸,在3’端减少6nt核苷酸获得的序列;优选地,所述tracrRNA的变体序列如SEQ ID NO.138所示;In other preferred embodiments, the variant sequence of tracrRNA refers to reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111 and reducing 6nt nucleotides at the 3' end. The sequence obtained by acid; Preferably, the variant sequence of tracrRNA is as shown in SEQ ID NO. 138;
在另一些优选实施方式中,所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端减少7nt核苷酸,在3’端减少6nt核苷酸获得的序列;优选地,所述tracrRNA的变体序列如SEQ ID NO.144所示;In other preferred embodiments, the variant sequence of tracrRNA refers to reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111 and reducing 6nt nucleotides at the 3' end. The sequence obtained by acid; Preferably, the variant sequence of tracrRNA is as shown in SEQ ID NO. 144;
在另一些优选实施方式中,所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的3’端减少21nt核苷酸获得的序列;优选地,所述tracr配对物序列的变体序列如SEQ ID NO.148所示。In other preferred embodiments, the variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 21nt nucleotides at the 3' end of the nucleotide sequence shown in SEQ ID NO. 112; preferably , the variant sequence of the tracr partner sequence is shown in SEQ ID NO. 148.
在另一些优选实施方式中,所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.116所示。在另一些优选实施方式中,所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.122所示。在另一些优选实施方式中,所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.131所示。在另一些优选实施方式中,所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.140所示。在另一些优选实施方式中,所述tracrRNA的变体序列如SEQ ID NO.144所示,所述tracr配对物序列的变体序列如SEQ ID NO.145所示;优选地,所述tracrRNA和tracr配对物序列通过连接链连接;进一步优选地,所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.146所示。在另一些优选实施方式中,所述tracrRNA的变体序列如SEQ ID NO.147所示,所述tracr配对物序列的变体序列如SEQ ID NO.148所示;优选地,所述tracrRNA和tracr配对物序列通过连接链连接;进一步优选地,所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.149所示,优选的向导RNA在哺乳动物细胞基因组编辑中,对同一靶序列显示有更高的基因删除和/或基因切割效率。In other preferred embodiments, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 116. In other preferred embodiments, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 122. In other preferred embodiments, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 131. In other preferred embodiments, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 140. In other preferred embodiments, the variant sequence of the tracrRNA is shown in SEQ ID NO.144, and the variant sequence of the tracr partner sequence is shown in SEQ ID NO.145; preferably, the tracrRNA and The tracr partner sequences are connected through connecting strands; further preferably, the nucleotide sequence of the backbone sequence of the guide RNA is as shown in SEQ ID NO. 146. In other preferred embodiments, the variant sequence of the tracrRNA is shown in SEQ ID NO. 147, and the variant sequence of the tracr partner sequence is shown in SEQ ID NO. 148; preferably, the tracrRNA and The tracr partner sequences are connected through connecting strands; further preferably, the nucleotide sequence of the backbone sequence of the guide RNA is as shown in SEQ ID NO. 149. The preferred guide RNA targets the same target sequence in mammalian cell genome editing. Show higher gene deletion and/or gene cleavage efficiency.
本发明还提供了修饰型gRNA,其可以通过修饰用于实现与靶基因内的任何所需序列杂交;或者,通过对gRNA修饰来改变gRNA本身的特性,如通过修饰增强gRNA的稳定性,包括但不限于通过增加其对细胞中存在的核糖核酸酶(RNase)降解的抗性,从而延长其在细胞中的半衰期;或者,其可以通过修饰用于增强包含gRNA和核酸内切酶(例如SpCas12f1核酸酶)的CRISPR-SpCas12f1基因组编辑复合体的形成或其稳定性;或者,其可以通过修饰用于增强基因组编辑复合体的特异性;或者,其可以通过修饰用于增强基因组编辑复合体与基因组中靶序列之间相互作用的起始位点,稳定性或动力学;或者,其可以通过修饰用于降低引入细胞中的RNA引发先天免疫反应的可能性或程度等。本发明中,可以通过对gRNA修饰来改变CRISPR-SpCas12f1系统(如下文中所述)的多种特性,如增强CRISPR-SpCas12f1基因组编辑复合物的形成、中靶活性、特异性、稳定性或动力学特性。可以采用本领域已知的修饰方式对RNA进行修饰,包括但不限于在嘧啶的核糖、碱基残基或RNA的3'端的反向碱基上的2'-氟、2'-氨基修饰等。本发明中,可以采用任意一种修饰或多种修饰组合对gRNA进行修饰。在一些实施方式中,通过修饰引入细胞中的sgRNA,以编辑任何一个或多个基因组的基因座。The present invention also provides modified gRNA, which can be used to achieve hybridization with any desired sequence within the target gene through modification; or, by modifying the gRNA to change the characteristics of the gRNA itself, such as enhancing the stability of the gRNA through modification, including But it is not limited to increasing its resistance to ribonuclease (RNase) degradation present in the cell, thereby extending its half-life in the cell; alternatively, it can be modified to enhance the content of gRNA and endonuclease (such as SpCas12f1 The formation of the CRISPR-SpCas12f1 genome editing complex or its stability; alternatively, it can be used to enhance the specificity of the genome editing complex through modification; alternatively, it can be used to enhance the interaction between the genome editing complex and the genome The initiation site, stability or kinetics of the interaction between the target sequences; alternatively, it can be modified to reduce the likelihood or extent of the RNA introduced into the cell to trigger an innate immune response, etc. In the present invention, various characteristics of the CRISPR-SpCas12f1 system (as described below) can be changed by modifying the gRNA, such as enhancing the formation, on-target activity, specificity, stability or kinetics of the CRISPR-SpCas12f1 genome editing complex. characteristic. RNA can be modified using modification methods known in the art, including but not limited to 2'-fluoro and 2'-amino modifications on the ribose and base residues of the pyrimidine or the reverse base at the 3' end of the RNA. . In the present invention, any modification or combination of modifications can be used to modify gRNA. In some embodiments, the sgRNA introduced into the cell is modified to edit any one or more genomic loci.
本发明还提供了一种分离的多核苷酸,其编码如上任一项所述的向导RNA。The invention also provides an isolated polynucleotide encoding the guide RNA as described in any one of the above.
本发明还提供了一种构建体,所述构建体含有如上所述的分离的多核苷酸。所述构建体通常可以通过将所述分离的多核苷酸插入合适的表达载体中构建获得,本领域技术人员可选择合适的表达载体。所述构建体例如可以是重组表达载体,可以使用任何合适的表达载体,只要它与宿主细胞相容即可,包括但不限于,病毒载体(例如基于痘苗病毒的病毒载体;脊髓灰质炎病毒;腺病毒;腺相关病毒;SV40;单纯疱疹病毒;人免疫缺陷病毒;反转录病毒载体(例如鼠白血病病毒,脾脏坏死病毒,以及衍生自反转录病毒的载体,例如劳斯肉瘤病毒,Harvey肉瘤病毒,禽类白血病病毒,慢病毒,人免疫缺陷病毒,骨髓增生肉瘤病毒和乳腺肿瘤病毒)等。The invention also provides a construct comprising an isolated polynucleotide as described above. The construct can usually be constructed by inserting the isolated polynucleotide into a suitable expression vector, and those skilled in the art can select a suitable expression vector. The construct may, for example, be a recombinant expression vector, and any suitable expression vector may be used as long as it is compatible with the host cell, including, but not limited to, viral vectors (e.g., vaccinia virus-based viral vectors; poliovirus; Adenovirus; adeno-associated virus; SV40; herpes simplex virus; human immunodeficiency virus; retroviral vectors (e.g., murine leukemia virus, spleen necrosis virus, and vectors derived from retroviruses, such as Rous sarcoma virus, Harvey Sarcoma virus, avian leukemia virus, lentivirus, human immunodeficiency virus, myeloproliferative sarcoma virus and mammary tumor virus), etc.
在某些实施方式中,在相同细胞中同时使用多种gRNA以同时调节相同靶基因或不同靶基因上的不同位置的转录。当多个gRNA同时使用时,它们可以存在于相同的表达载体上或不同的载体上,也可以同时表达;当存在于相同的载 体上时,它们可以在相同的控制元件下进行表达。In certain embodiments, multiple gRNAs are used simultaneously in the same cell to simultaneously regulate transcription of different locations on the same target gene or different target genes. When multiple gRNAs are used at the same time, they can exist on the same expression vector or on different vectors, or they can be expressed simultaneously; when they exist on the same vector, they can be expressed under the same control element.
在某些实施方式中,编码gRNA的核苷酸序列可操作地连接至控制元件,例如转录控制元件,例如启动子。在某些实施方式中,编码gRNA的核苷酸序列可操作地连接至诱导型启动子。在某些实施方式中,编码gRNA的核苷酸序列可操作地连接至组成型启动子。转录控制元件可以在真核细胞,例如哺乳动物细胞(HEK293T细胞);或原核细胞(例如细菌或古细菌细胞)中起作用。在某些实施方式中,编码gRNA的核苷酸序列可操作地连接至多个控制元件,其允许在原核和真核细胞两者中表达编码gRNA的核苷酸序列。In certain embodiments, a nucleotide sequence encoding a gRNA is operably linked to a control element, such as a transcription control element, such as a promoter. In certain embodiments, a nucleotide sequence encoding a gRNA is operably linked to an inducible promoter. In certain embodiments, a nucleotide sequence encoding a gRNA is operably linked to a constitutive promoter. Transcriptional control elements may function in eukaryotic cells, such as mammalian cells (HEK293T cells); or in prokaryotic cells, such as bacterial or archaeal cells. In certain embodiments, a nucleotide sequence encoding a gRNA is operably linked to a plurality of control elements that permit expression of the nucleotide sequence encoding the gRNA in both prokaryotic and eukaryotic cells.
本发明中,所述gRNA可以通过人工合成方式进行合成,例如通过化学方法合成,从而使得能够容易地对其进行多种修饰。所述修饰可以采用本领域公知的任何修饰方式,例如,使用polyA尾巴,添加5'帽类似物,5'或3'非翻译区(UTR),5'或3'端包括硫代磷酸化2’-O-甲基核苷酸或用磷酸酶处理以去除5'末端磷酸酯等。In the present invention, the gRNA can be synthesized by artificial synthesis, for example, by chemical methods, so that it can be easily modified in various ways. The modification can adopt any modification method known in the art, for example, using a polyA tail, adding a 5' cap analog, a 5' or 3' untranslated region (UTR), and the 5' or 3' end including phosphorothioate 2 '-O-methyl nucleotide or treated with phosphatase to remove the 5' terminal phosphate, etc.
在一些实施方案中,所述编码gRNA的核苷酸序列包含一种或多种修饰,其可用于例如增强活性,稳定性或特异性,改变递送,减少宿主细胞中的先天免疫应答或用于其他增强。In some embodiments, the nucleotide sequence encoding a gRNA contains one or more modifications that may be used, for example, to enhance activity, stability or specificity, alter delivery, reduce the innate immune response in the host cell, or for Other enhancements.
在一些实施方式中,将增强编码gRNA的核苷酸序列的活性,细胞分布或细胞摄取的一个或多个靶向部分或缀合物化学连接至gRNA。所述靶向部分或缀合物可包括与功能基团共价结合的缀合物基团;缀合物基团包括报告分子,多胺,聚乙二醇。在一些实施方式中,将增强药效学性质的基团连接至gRNA,增强药效学性质的基团包括改善摄取,增强对降解的抗性和/或增强与靶核酸的序列特异性杂交的基团。In some embodiments, one or more targeting moieties or conjugates that enhance the activity, cellular distribution, or cellular uptake of the nucleotide sequence encoding the gRNA are chemically linked to the gRNA. The targeting moiety or conjugate may include a conjugate group covalently bonded to a functional group; the conjugate group includes reporter molecules, polyamines, polyethylene glycol. In some embodiments, a group that enhances pharmacodynamic properties is attached to the gRNA, including groups that improve uptake, enhance resistance to degradation, and/or enhance sequence-specific hybridization to the target nucleic acid. group.
本发明中,包含编码gRNA的多核苷酸的核酸可以是核酸模拟物。例如具有优良杂交性质的多核苷酸模拟物肽核酸等。In the present invention, the nucleic acid containing the polynucleotide encoding gRNA may be a nucleic acid mimetic. For example, polynucleotide mimetic peptide nucleic acids with excellent hybridization properties.
本发明中,所述gRNA,或编码gRNA的多核苷酸适用任何生物或体外环境,包括但不限于是细菌、古细菌、真菌、原生生物、植物或动物。相应地,适用的靶细胞包括但不限于细菌细胞、古细菌细胞、真菌细胞、原生生物细胞、植物细胞或动物细胞。适用的靶细胞可以是任何类型的细胞,包括干细胞,体细胞等。In the present invention, the gRNA, or polynucleotide encoding gRNA, is applicable to any biological or in vitro environment, including but not limited to bacteria, archaea, fungi, protists, plants or animals. Accordingly, suitable target cells include, but are not limited to, bacterial cells, archaeal cells, fungal cells, protist cells, plant cells or animal cells. Applicable target cells can be any type of cells, including stem cells, somatic cells, etc.
本发明还提供了一种表达系统,所述表达系统含有如上所述的构建体或基因组中整合有外源的如上所述的多核苷酸。所述表达系统的宿主细胞选自真核细胞或原核细胞;优选地,所述宿主细胞选自小鼠细胞、人细胞。The present invention also provides an expression system, which contains the construct as described above or an exogenous polynucleotide as described above integrated into the genome. The host cell of the expression system is selected from eukaryotic cells or prokaryotic cells; preferably, the host cell is selected from mouse cells and human cells.
本发明还提供了一种基因编辑系统,其包括如上任一项所述的向导RNA或其编码多核苷酸;进一步地,还可包括核酸酶或其编码多核苷酸。The present invention also provides a gene editing system, which includes the guide RNA or its encoding polynucleotide as described in any one of the above; further, it may also include a nuclease or its encoding polynucleotide.
本发明所述编辑系统中,编码所述核酸酶的多核苷酸包括:只编码核酸酶的编码序列;核酸酶的编码序列和各种附加编码序列;核酸酶的编码序列(和任选的附加编码序列)以及非编码序列。编码所述向导RNA的多核苷酸包括:只编码向导RNA的编码序列;向导RNA的编码序列和各种附加编码序列;向导RNA的编码序列(和任选的附加编码序列)以及非编码序列。在一些实施方式中,所述碱基编辑系统包含一个或多个载体;所述一个或多个载体包含(i)第一调控元件,所述第一调控元件可操作地连接至所述核酸酶的编码多核苷酸;以及(ii)第二调控元件,所述第二调控元件可操作地连接至所述向导RNA核苷酸序列的编码多核苷酸;所述(i)和(ii)位于相同或不同载体上。在一些实施方式中,所述碱基编辑系统包含(i)核酸酶或其变体,以及(ii)包含所述向导RNA的编码序列的载体。在另一实施方式中,所述系统包含gRNA和核酸酶复合物。In the editing system of the present invention, the polynucleotide encoding the nuclease includes: a coding sequence encoding only a nuclease; a nuclease coding sequence and various additional coding sequences; a nuclease coding sequence (and optional additional coding sequences) and non-coding sequences. The polynucleotide encoding the guide RNA includes: a coding sequence encoding only the guide RNA; a coding sequence of the guide RNA and various additional coding sequences; a coding sequence of the guide RNA (and optional additional coding sequences) and non-coding sequences. In some embodiments, the base editing system includes one or more vectors; the one or more vectors include (i) a first regulatory element operably linked to the nuclease The coding polynucleotide; and (ii) a second regulatory element operably linked to the coding polynucleotide of the guide RNA nucleotide sequence; the (i) and (ii) are located on the same or different carriers. In some embodiments, the base editing system comprises (i) a nuclease or a variant thereof, and (ii) a vector comprising the coding sequence of the guide RNA. In another embodiment, the system includes a gRNA and nuclease complex.
所述第一调控元件可以调控所述核酸酶或其变体的编码多核苷酸的转录。所述核酸酶或其变体的编码多核苷酸可以是一个或多个,所述第一调控元件可以是一个或多个。所述第二调控元件可以调控所述向导RNA的编码多核苷酸的转录。所述向导RNA的编码多核苷酸可以是一个或多个,所述第二调控元件可以是一个或多个。The first regulatory element may regulate the transcription of a polynucleotide encoding the nuclease or a variant thereof. The polynucleotide encoding the nuclease or its variant may be one or more, and the first regulatory element may be one or more. The second regulatory element can regulate the transcription of the polynucleotide encoding the guide RNA. The guide RNA encoding polynucleotide may be one or more, and the second regulatory element may be one or more.
本发明所述系统可以包含一个gRNA或同时包含多个gRNA。在一实施方式中,所述系统同时包含多个gRNA以同时修饰相同靶标DNA或不同靶标DNA上的不同位置。在一实施方案中,两个或更多个向导RNA靶向相同的基因或转录本或基因座。在一实施方案中,两个或更多个向导RNA靶向不同的不相关基因座。在一些实施方案中,两个或更多个向导RNA靶向不同但相关的基因座。The system of the present invention may contain one gRNA or multiple gRNAs simultaneously. In one embodiment, the system includes multiple gRNAs simultaneously to simultaneously modify different positions on the same target DNA or different target DNAs. In one embodiment, two or more guide RNAs target the same gene or transcript or locus. In one embodiment, two or more guide RNAs target different unrelated loci. In some embodiments, two or more guide RNAs target different but related loci.
本发明所述基因编辑系统中,所述核酸酶是CRISPR核酸酶;优选地,所述核酸酶选自Cas9、Cas12、Cas13蛋白家族或其变体;进一步优选地,所述Cas核酸酶选自nSpCas9及其突变体、SaCas9及其突变体、Cas12a及其突变体或 Cas12f及其突变体;更进一步优选为SpCas12f1核酸酶或其变体。在一些实施方式中,SpCas12f1核酸酶直接作为蛋白提供;例如可以使用原生质球转化用外源蛋白和/或核酸转化真菌的方式。可以通过任何合适的方法将SpCas12f1核酸酶引入细胞,如注射方式等。本发明所述基因编辑系统识别靶向序列上的PAM序列;优选地PAM序列为5’-NTTC和5’-GTTT至少任一,其中N为A、T、C、G;进一步优选地PAM序列为5’-GTTC,5’-TTTC和5’-ATTC至少任一。所述基因编辑系统靶向PAM序列之后长度为12~40bp的核酸片段,优选的长度为20bp。所述基因编辑系统靶向细胞基因组中的至少一个靶序列。In the gene editing system of the present invention, the nuclease is a CRISPR nuclease; preferably, the nuclease is selected from Cas9, Cas12, Cas13 protein family or variants thereof; further preferably, the Cas nuclease is selected from nSpCas9 and its mutants, SaCas9 and its mutants, Cas12a and its mutants, or Cas12f and its mutants; more preferably, it is SpCas12f1 nuclease or its mutants. In some embodiments, SpCas12f1 nuclease is provided directly as a protein; for example, spheroplast transformation can be used to transform fungi with exogenous proteins and/or nucleic acids. SpCas12f1 nuclease can be introduced into cells by any suitable method, such as injection. The gene editing system of the present invention recognizes the PAM sequence on the target sequence; preferably the PAM sequence is at least one of 5'-NTTC and 5'-GTTT, where N is A, T, C, G; further preferably the PAM sequence Be at least any one of 5'-GTTC, 5'-TTTC and 5'-ATTC. The gene editing system targets nucleic acid fragments with a length of 12 to 40 bp after the PAM sequence, and the preferred length is 20 bp. The gene editing system targets at least one target sequence in the genome of the cell.
在某些实施方式中,编码SpCas12f1核酸酶的核酸为DNA。在某些实施方式中,编码SpCas12f1核酸酶的核酸为RNA。在某些实施方式中,编码SpCas12f1核酸酶的核酸是表达载体,例如重组表达载体。可以使用任何合适的表达载体,只要它与宿主细胞相容即可,包括但不限于,病毒载体(例如基于痘苗病毒的病毒载体;脊髓灰质炎病毒;腺病毒;腺相关病毒;SV40;单纯疱疹病毒;人免疫缺陷病毒;反转录病毒载体(例如鼠白血病病毒,脾脏坏死病毒,以及衍生自反转录病毒的载体,例如劳斯肉瘤病毒,Harvey肉瘤病毒,禽类白血病病毒,慢病毒,人免疫缺陷病毒,骨髓增生肉瘤病毒和乳腺肿瘤病毒)等。In certain embodiments, the nucleic acid encoding SpCas12f1 nuclease is DNA. In certain embodiments, the nucleic acid encoding SpCas12f1 nuclease is RNA. In certain embodiments, the nucleic acid encoding SpCas12f1 nuclease is an expression vector, such as a recombinant expression vector. Any suitable expression vector may be used so long as it is compatible with the host cell, including, but not limited to, viral vectors (e.g., vaccinia virus-based viral vectors; poliovirus; adenovirus; adeno-associated virus; SV40; herpes simplex Viruses; human immunodeficiency virus; retroviral vectors (e.g., murine leukemia virus, spleen necrosis virus, and vectors derived from retroviruses, such as Rous sarcoma virus, Harvey sarcoma virus, avian leukemia virus, lentivirus, human immunodeficiency virus, myeloproliferative sarcoma virus and breast tumor virus), etc.
在一实施方式中,编码所述SpCas12f1核酸酶的核酸如SEQ ID NO.40所示。在一实施方式中,本发明提供了SpCas12f1核酸酶密码子优化的多核苷酸序列,与SEQ ID NO:42具有至少90%、92%、93%、94%、95%、96%、97%、98%、99%、99.2%、99.5%、99.8%、99.9%或100%序列同源性。在一优选实施方式中,所述SpCas12f1核酸酶的针对人类密码子优化后更佳的编码序列如序列表中SEQ ID NO.42所示,其编码一个或多个功能性SpCas12f1结构域,或编码具有与原始天然核苷酸序列编码的多肽相同的功能的多肽。In one embodiment, the nucleic acid encoding the SpCas12f1 nuclease is shown in SEQ ID NO. 40. In one embodiment, the invention provides a SpCas12f1 nuclease codon-optimized polynucleotide sequence that is at least 90%, 92%, 93%, 94%, 95%, 96%, 97% identical to SEQ ID NO: 42 , 98%, 99%, 99.2%, 99.5%, 99.8%, 99.9% or 100% sequence homology. In a preferred embodiment, the better coding sequence of the SpCas12f1 nuclease optimized for human codons is shown in SEQ ID NO. 42 in the sequence listing, which encodes one or more functional SpCas12f1 domains, or encodes A polypeptide that has the same function as the polypeptide encoded by the original native nucleotide sequence.
在某些实施方式中,编码SpCas12f1核酸酶的核苷酸序列可操作地连接至控制元件,例如转录控制元件,例如启动子。在某些实施方式中,编码SpCas12f1核酸酶的核苷酸序列可操作地连接至诱导型启动子。在某些实施方式中,编码SpCas12f1核酸酶的核苷酸序列可操作地连接至组成型启动子。转录控制元件可以在真核细胞,例如哺乳动物细胞(HEK293T细胞);或原核细胞(例如细菌或古细菌细胞)中起作用。在某些实施方式中,编码SpCas12f1核酸酶的核苷酸序 列可操作地连接至多个控制元件,其允许在原核和真核细胞两者中表达编码SpCas12f1核酸酶的核苷酸序列。在一些实施方案中,所述编码SpCas12f1核酸酶的多核苷酸序列可操作地连接至用于在细胞或体外环境中表达的合适的核定位信号。In certain embodiments, a nucleotide sequence encoding a SpCas12f1 nuclease is operably linked to a control element, such as a transcription control element, such as a promoter. In certain embodiments, the nucleotide sequence encoding SpCas12f1 nuclease is operably linked to an inducible promoter. In certain embodiments, the nucleotide sequence encoding SpCas12f1 nuclease is operably linked to a constitutive promoter. Transcriptional control elements may function in eukaryotic cells, such as mammalian cells (HEK293T cells); or in prokaryotic cells, such as bacterial or archaeal cells. In certain embodiments, the nucleotide sequence encoding the SpCas12f1 nuclease is operably linked to a plurality of control elements that allow expression of the nucleotide sequence encoding the SpCas12f1 nuclease in both prokaryotic and eukaryotic cells. In some embodiments, the polynucleotide sequence encoding SpCas12fl nuclease is operably linked to a suitable nuclear localization signal for expression in a cellular or in vitro environment.
本发明中,所述编码SpCas12f1核酸酶的多核苷酸可以通过人工合成方式进行合成,例如通过化学方法合成,从而使得能够容易地对其进行多种修饰。所述修饰可以采用本领域公知的任何修饰方式。在一些实施方案中,所述编码SpCas12f1核酸酶的多核苷酸包含一种或多种修饰,从而使得能够容易地并入许多修饰,例如增强转录活性,改变酶活性,提高其翻译或稳定性(例如增加其对蛋白水解、降解的抗性)或特异性,改变溶解性,改变递送,减少宿主细胞中的先天免疫应答。所述修饰可以采用本领域公知的任何修饰方式。在一些实施方式中,通过修饰引入细胞中的编码SpCas12f1核酸酶的DNA或RNA,以编辑任何一个或多个基因组的基因座。在一些实施方式中,编码SpCas12f1核酸酶的核酸序列是修饰的核酸,例如密码子优化的。所述修饰可以是单一修饰,或组合修饰。In the present invention, the polynucleotide encoding SpCas12f1 nuclease can be synthesized artificially, for example, chemically, so that it can be easily modified in various ways. The modification may adopt any modification method known in the art. In some embodiments, the polynucleotide encoding SpCas12f1 nuclease contains one or more modifications, thereby enabling easy incorporation of a number of modifications, such as enhancing transcriptional activity, altering enzymatic activity, improving its translation or stability ( For example, increasing its resistance to proteolysis, degradation) or specificity, changing solubility, changing delivery, and reducing the innate immune response in host cells. The modification may adopt any modification method known in the art. In some embodiments, any one or more genomic loci are edited by modifying the DNA or RNA encoding the SpCas12f1 nuclease introduced into the cell. In some embodiments, the nucleic acid sequence encoding SpCas12f1 nuclease is a modified nucleic acid, eg, codon optimized. The modification may be a single modification or a combination of modifications.
本发明中,包含编码SpCas12f1核酸酶的多核苷酸的核酸可以是核酸模拟物。例如具有优良杂交性质的多核苷酸模拟物肽核酸等。In the present invention, the nucleic acid comprising the polynucleotide encoding SpCas12f1 nuclease may be a nucleic acid mimetic. For example, polynucleotide mimetic peptide nucleic acids with excellent hybridization properties.
本发明中,所述SpCas12f1核酸酶,或编码SpCas12f1核酸酶的多核苷酸适用于任何生物或体外环境,包括但不限于是细菌、古细菌、真菌、原生生物、植物或动物。相应地,适用的靶细胞包括但不限于真核细胞和原核细胞,例如是细菌细胞、古细菌细胞、真菌细胞、原生生物细胞、植物细胞或动物细胞;所述真核细胞包含哺乳动物细胞和植物细胞,所述原核细胞包括大肠杆菌和肺炎克雷伯氏菌。适用的靶细胞可以是任何类型的细胞,包括干细胞,体细胞等。本发明优选用于哺乳动物细胞HEK293T细胞。所述细胞可以是体内的或离体的。在某些实施方式中,所述SpCas12f1核酸酶或编码该SpCas12f1核酸酶的核酸配制在脂质体或脂质纳米颗粒中。In the present invention, the SpCas12f1 nuclease or the polynucleotide encoding SpCas12f1 nuclease is suitable for use in any biological or in vitro environment, including but not limited to bacteria, archaea, fungi, protists, plants or animals. Accordingly, suitable target cells include but are not limited to eukaryotic cells and prokaryotic cells, such as bacterial cells, archaeal cells, fungal cells, protist cells, plant cells or animal cells; the eukaryotic cells include mammalian cells and Plant cells, the prokaryotic cells include Escherichia coli and Klebsiella pneumoniae. Applicable target cells can be any type of cells, including stem cells, somatic cells, etc. The present invention is preferably used for mammalian cells HEK293T cells. The cells may be in vivo or ex vivo. In certain embodiments, the SpCas12f1 nuclease or nucleic acid encoding the SpCas12f1 nuclease is formulated in liposomes or lipid nanoparticles.
本发明所述系统中,SpCas12f1核酸酶和gRNA可以在宿主细胞中形成复合物,识别靶向基因(如靶向DNA)序列上的PAM序列;所述CRISPR/SpCas12f1基因编辑系统的靶向序列为PAM序列之后长度为20bp的核酸片段(如DNA片段)。在一实施方式中,所述复合物可以选择性地调节宿主细胞中靶DNA的转 录。CRISPR/SpCas12f1基因编辑系统能够切割靶向DNA的双链,造成DNA断裂。In the system of the present invention, SpCas12f1 nuclease and gRNA can form a complex in the host cell to identify the PAM sequence on the target gene (such as target DNA) sequence; the target sequence of the CRISPR/SpCas12f1 gene editing system is A nucleic acid fragment (such as a DNA fragment) of 20 bp in length following the PAM sequence. In one embodiment, the complex can selectively modulate the transcription of target DNA in a host cell. The CRISPR/SpCas12f1 gene editing system can cut the double strands of targeted DNA, causing DNA breaks.
在一实施方式中,所述系统包含重组表达载体。在一实施方式中,所述系统包含重组表达载体,所述重组表达载体包含(i)编码gRNA的核苷酸序列,其中所述gRNA包含:(a)包含与靶DNA中的序列互补的核苷酸序列的第一区段;和(b)与SpCas12f1核酸酶相互作用的第二区段;和(ii)编码SpCas12f1核酸酶的核苷酸序列,其中所述SpCas12f1核酸酶包含:(a)与所述gRNA相互作用的RNA-结合部分;和(b)调节靶DNA内转录的活性部分,其中靶DNA内调节的转录的位点由所述gRNA确定。In one embodiment, the system includes a recombinant expression vector. In one embodiment, the system comprises a recombinant expression vector comprising (i) a nucleotide sequence encoding a gRNA, wherein the gRNA comprises: (a) a core comprising a sequence complementary to a sequence in the target DNA and (b) a second segment that interacts with SpCas12f1 nuclease; and (ii) a nucleotide sequence encoding SpCas12f1 nuclease, wherein said SpCas12f1 nuclease comprises: (a) an RNA-binding moiety that interacts with the gRNA; and (b) an active moiety that modulates transcription within the target DNA, wherein the site of regulated transcription within the target DNA is determined by the gRNA.
本发明中,还可以通过修饰、突变、DNA改组等方式形成SpCas12f1核酸酶变体,使得SpCas12f1核酸酶变体具有改善的所期望的特征,例如功能、活性、动力学、半衰期等。所述修饰例如可以是氨基酸的缺失、插入或取代,再例如可以是用来自不同核酸酶的同源或异源切割结构域(例如CRISPR-相关核酸酶的HNH结构域)置换SpCas12f1核酸酶的“切割结构域”);通过本领域已知的DNA结合和/或DNA修饰蛋白的任何修饰方法,如甲基化作用、脱甲基作用、乙酰化作用等,例如可以改变SpCas12f1核酸酶的DNA靶向性。所述DNA改组是指在不同来源的SpCas12f1核酸酶的DNA序列之间交换序列片段,以产生编码具有RNA-指导的内切核酸酶活性的合成蛋白的嵌合DNA序列。所述修饰、突变、DNA改组等可以是单一使用或组合使用。In the present invention, SpCas12f1 nuclease variants can also be formed through modification, mutation, DNA shuffling, etc., so that the SpCas12f1 nuclease variants have improved desired characteristics, such as function, activity, kinetics, half-life, etc. The modification may be, for example, the deletion, insertion or substitution of amino acids, or may be, for example, the replacement of the "" of the SpCas12f1 nuclease with a homologous or heterologous cleavage domain from a different nuclease (for example, the HNH domain of a CRISPR-related nuclease). Cleavage domain"); for example, the DNA target of SpCas12f1 nuclease can be changed by any modification method of DNA binding and/or DNA modifying proteins known in the art, such as methylation, demethylation, acetylation, etc. tropism. The DNA shuffling refers to the exchange of sequence fragments between DNA sequences of SpCas12f1 nucleases from different sources to generate chimeric DNA sequences encoding synthetic proteins with RNA-guided endonuclease activity. The modification, mutation, DNA shuffling, etc. may be used singly or in combination.
具体地,本发明所述SpCas12f1核酸酶可以是:Specifically, the SpCas12f1 nuclease of the present invention can be:
(I)野生型SpCas12f1核酸酶或其片段,具有受RNA引导的核酸结合活性;所述SpCas12f1核酸酶来源于Syntrophomonaspalmitatica Cas12f1,氨基酸序列如SEQ ID NO.40所示;优选地,所述SpCas12f1核酸酶人源化密码子优化的核酸序列如SEQ ID NO.42所示;(1) Wild-type SpCas12f1 nuclease or a fragment thereof, which has RNA-guided nucleic acid binding activity; the SpCas12f1 nuclease is derived from Syntrophomonaspalmitatica Cas12f1, and the amino acid sequence is as shown in SEQ ID NO.40; Preferably, the SpCas12f1 nuclease The humanized codon-optimized nucleic acid sequence is shown in SEQ ID NO.42;
(II)与(I)的氨基酸序列具有至少50%序列同源性的变体,且具有受RNA引导的核酸结合活性;(II) A variant having at least 50% sequence homology with the amino acid sequence of (I) and having RNA-guided nucleic acid binding activity;
(III)根据(I)或(II),其进一步包括核定位信号片段;(III) according to (I) or (II), which further includes a nuclear localization signal fragment;
(IV)根据(I)或(II)或(III),其进一步包含:(IV) Subject to (I) or (II) or (III), which further includes:
(a)一种或多种修饰或突变,其产生具有相比修饰或突变前显著减小的核酸内切酶活性,或使核酸内切酶活性丧失;和/或(a) One or more modifications or mutations that produce an endonuclease activity that is significantly reduced compared to before the modification or mutation, or that results in a loss of endonuclease activity; and/or
(b)具有其他功能活性的多肽或结构域;(b) Polypeptides or domains with other functional activities;
(V)根据(I)或(II)或(III),所述SpCas12f1核酸酶具有核酸内切酶活性。(V) According to (I) or (II) or (III), the SpCas12f1 nuclease has endonuclease activity.
在一些实施方式中,所述SpCas12f1可以通过与其他酶成分或其他成分结合使用,以进一步开发SpCas12f1核酸酶的各种潜在应用。作为(IV)下的SpCas12f1核酸酶变体非限制性的例子,例如是通过融合失活的SpCas12f1和碱基脱氨酶开发基于SpCas12f1核酸酶的单碱基编辑系统;通过融合失活的SpCas12f1和逆转录酶开发基于SpCas12f1核酸酶的Prime编辑系统;通过融合失活的SpCas12f1和转录激活因子,开发基于SpCas12f1核酸酶的转录激活系统;通过融合失活的SpCas12f1和核酸表观修饰酶,开发基于SpCas12f1核酸酶的表观修饰系统;利用失活的SpCas12f1,开发基于SpCas12f1核酸酶的转录抑制系统。In some embodiments, the SpCas12f1 can be used in combination with other enzyme components or other components to further develop various potential applications of the SpCas12f1 nuclease. As non-limiting examples of SpCas12f1 nuclease variants under (IV), for example, a single base editing system based on SpCas12f1 nuclease is developed by fusing inactivated SpCas12f1 and a base deaminase; by fusing inactivated SpCas12f1 and Reverse transcriptase develops a Prime editing system based on SpCas12f1 nuclease; by fusing inactivated SpCas12f1 and a transcription activator, a transcription activation system based on SpCas12f1 nuclease is developed; by fusing inactivated SpCas12f1 and a nucleic acid epigenetic modification enzyme, a SpCas12f1-based transcription activation system is developed Nuclease epigenetic modification system; using inactivated SpCas12f1, develop a transcription inhibition system based on SpCas12f1 nuclease.
SpCas12f1核酸酶变体可以具有如下特定性质,包括但不限于:SpCas12f1 nuclease variants may have the following specific properties, including but not limited to:
具有增强的或降低的与靶位结合的能力,或保留了与靶位结合的能力;Has enhanced or reduced ability to bind to the target, or retains the ability to bind to the target;
具有增强的或降低的核糖核酸内切酶和/或核酸内切酶活性,或保留了核糖核酸内切酶和/或核酸内切酶活性;Has enhanced or reduced endoribonuclease and/or endonuclease activity, or retained endoribonuclease and/or endonuclease activity;
具有脱氨酶活性,其可作用于胞嘧啶、鸟嘌呤或腺嘌呤碱基,并随后通过脱氨基位点复制并在细胞内修复,分别产生鸟嘌呤、胸腺嘧啶和鸟嘌呤;It has deaminase activity, which can act on cytosine, guanine or adenine bases and then replicate and repair them within the cell through the deamination site to produce guanine, thymine and guanine respectively;
具有调节靶DNA的转录的活性,可以是增加也可以是减少靶DNA中特定位置处的靶DNA转录;Have the activity of regulating the transcription of target DNA, either by increasing or decreasing the transcription of target DNA at a specific position in the target DNA;
具有改变的DNA靶向性;Have altered DNA targeting;
增加或降低或维持的稳定性;increase or decrease or maintain stability;
可以切割靶DNA的互补链,但具有降低的切割靶DNA的非互补链的能力;Can cleave the complementary strand of the target DNA, but has a reduced ability to cleave the non-complementary strand of the target DNA;
可以切割靶DNA的非互补链,但具有降低的切割靶DNA的互补链的能力;Can cleave the non-complementary strand of the target DNA, but has a reduced ability to cleave the complementary strand of the target DNA;
具有降低的切割靶DNA的互补链和非互补链两者的能力;Has a reduced ability to cleave both the complementary and non-complementary strands of target DNA;
具有修饰与DNA相关的多肽(例如组蛋白)的酶活性,酶活性可以是甲基转移酶活性、脱甲基酶活性、乙酰转移酶活性、脱乙酰酶活性、激酶活性、磷酸酶活性、泛素连接酶活性、脱泛素活性、核糖基化活性等中的一种或几种(通过这些酶活性催化对蛋白的共价修饰;例如,SpCas12f1核酸酶变体通过甲基化作 用、乙酰化作用、泛素化、磷酸化作用等,修饰组蛋白,以引起组蛋白相关DNA的结构变化,从而控制DNA的结构和特性)。It has enzymatic activity that modifies DNA-related polypeptides (such as histones). The enzymatic activity can be methyltransferase activity, demethylase activity, acetyltransferase activity, deacetylase activity, kinase activity, phosphatase activity, ubiquitin activity, etc. One or more of peptide ligase activity, deubiquitinating activity, ribosylation activity, etc. (the covalent modification of proteins is catalyzed by these enzyme activities; for example, SpCas12f1 nuclease variants catalyze covalent modification of proteins through methylation, acetylation function, ubiquitination, phosphorylation, etc., to modify histones to cause structural changes in histone-related DNA, thereby controlling the structure and properties of DNA).
在一些实施方式中,SpCas12f1核酸酶变体无切割活性。在一些实施方式中,SpCas12f1核酸酶变体具有单链切割活性。在一些实施方式中,SpCas12f1核酸酶变体具有双链切割活性。In some embodiments, the SpCas12f1 nuclease variant has no cleavage activity. In some embodiments, SpCas12f1 nuclease variants have single-strand cleavage activity. In some embodiments, the SpCas12f1 nuclease variant has double-stranded cleavage activity.
具有增强的活性或能力是指,相对于野生型SpCas12f1核酸酶,具有提升至少1%、5%、10%、20%、30%、40%、50%的活性或能力。Having enhanced activity or ability means having an activity or ability that is increased by at least 1%, 5%, 10%, 20%, 30%, 40%, or 50% compared to wild-type SpCas12f1 nuclease.
具有降低的活性和能力是指相对于野生型SpCas12f1核酸酶,具有小于50%、小于40%、小于30%、小于20%、小于10%、小于5%或小于1%的活性或能力。Having reduced activity and ability refers to having an activity or ability of less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, or less than 1% relative to wild-type SpCas12f1 nuclease.
本发明所述的这些小的SpCas12f1及其变体可用作本发明下文所述的任何系统、组合物、试剂盒和方法中。These small SpCas12fl and variants thereof described herein may be used in any of the systems, compositions, kits and methods described below in the invention.
如果没有另外说明,术语“SpCas12f1”、“SpCas12f1核酸酶”包括野生型SpCas12f1核酸酶以及其所有变体,本领域技术人员可以通过常规手段可以确定SpCas12f1核酸酶变体的类型,而不受限于上文所例举的那些。If not otherwise stated, the terms "SpCas12f1" and "SpCas12f1 nuclease" include wild-type SpCas12f1 nuclease and all variants thereof. Those skilled in the art can determine the type of SpCas12f1 nuclease variant by routine means without limitation. Those cited above.
本发明所述系统中的各组成部分可通过载体方式来运送。例如,对于多核苷酸,可以采用的方法包括但不限于纳米颗粒、脂质体、核糖核蛋白、小分子RNA-缀合物、嵌合体和RNA-融合蛋白复合物等。Each component in the system of the present invention can be transported by means of carriers. For example, for polynucleotides, methods that can be used include, but are not limited to, nanoparticles, liposomes, ribonucleoproteins, small molecule RNA-conjugates, chimeras, and RNA-fusion protein complexes, etc.
本发明所述的系统还可以进一步包括一个或多个供体模板。在某些实施方式中,所述供体模板包含用于插入靶基因的供体序列。The system of the present invention may further include one or more donor templates. In certain embodiments, the donor template includes donor sequences for insertion of a target gene.
本发明所述系统中,还可以进一步包含二聚体FOK1核酸酶,完整或部分或完全缺陷的SpCas12f1核酸酶或gRNA连接至二聚体FOK1核酸酶,以在通过一种或多种gRNA分子指导到一个或多个特定DNA靶位点时指导内切核酸酶切割。The system of the present invention may further comprise a dimeric FOK1 nuclease, a complete or partially or completely defective SpCas12f1 nuclease or a gRNA connected to the dimeric FOK1 nuclease to guide the process through one or more gRNA molecules. Directs endonuclease cleavage when reaching one or more specific DNA target sites.
本发明所述系统可以在细胞中的多个位置处编辑或修饰DNA,以用于基因治疗,包括但不限于用于疾病的基因治疗,用于生物学研究,用于农作物抗性改进或提高产量等。The system of the present invention can edit or modify DNA at multiple locations in cells for gene therapy, including but not limited to gene therapy for diseases, biological research, and improvement or enhancement of crop resistance. Yield etc.
本发明还提供了一种组合物,其包含如上所述的SpCas12f1核酸酶或编码其的多核苷酸、gRNA或编码其的多核苷酸、重组表达载体、系统中的一种或几种,还可以包括可接受的载体、介质等。所述可接受的载体、介质例如无菌水或生理 盐水、稳定剂、赋形剂、抗氧化剂(抗坏血酸等)、缓冲剂(磷酸、枸橼酸、其它的有机酸等)、防腐剂、表面活性剂(PEG、Tween等)、螯合剂(EDTA等)、粘合剂等。而且,也可含有其它低分子量的多肽;血清白蛋白、明胶或免疫球蛋白等蛋白质;甘氨酸、谷酰胺、天冬酰胺、精氨酸和赖氨酸等氨基酸;多糖和单糖等糖类或碳水化物;甘露糖醇或山梨糖醇等糖醇。当制备用于注射的水溶液时,例如生理盐水、含有葡萄糖或其它的辅助药物的等渗溶液,如D-山梨糖醇、D-甘露糖、D-甘露糖醇、氯化钠,可并用适当的增溶剂例如醇(乙醇等)、多元醇(丙二醇,PEG等)、非离子表面活性剂(吐温80,HCO-50)等。在一些实施方式中,组合物包含gRNA和用于稳定核酸的缓冲液。The present invention also provides a composition comprising one or more of the SpCas12f1 nuclease or the polynucleotide encoding the same, gRNA or the polynucleotide encoding the same, a recombinant expression vector, and a system as described above, and also Acceptable carriers, media, etc. may be included. The acceptable carriers and media such as sterile water or physiological saline, stabilizers, excipients, antioxidants (ascorbic acid, etc.), buffers (phosphoric acid, citric acid, other organic acids, etc.), preservatives, surface Active agents (PEG, Tween, etc.), chelating agents (EDTA, etc.), adhesives, etc. Moreover, it may also contain other low molecular weight polypeptides; proteins such as serum albumin, gelatin or immunoglobulin; amino acids such as glycine, glutamine, asparagine, arginine and lysine; sugars such as polysaccharides and monosaccharides or Carbohydrates; sugar alcohols such as mannitol or sorbitol. When preparing aqueous solutions for injection, such as physiological saline, isotonic solutions containing glucose or other auxiliary drugs, such as D-sorbitol, D-mannose, D-mannitol, and sodium chloride, appropriate Solubilizers include alcohols (ethanol, etc.), polyols (propylene glycol, PEG, etc.), nonionic surfactants (Tween 80, HCO-50), etc. In some embodiments, the composition includes gRNA and a buffer for stabilizing the nucleic acid.
本发明还提供了一种试剂盒,其包括如上所述的系统或组合物。所述试剂盒还可以进一步包括一种或多种例如选自:稀释缓冲液;洗涤缓冲液;对照试剂等的额外试剂。在一些实施方式中,所述试剂盒包括(a)根据上文所述的SpCas12f1核酸酶或编码SpCas12f1核酸酶的核酸;和(b)gRNA或编码所述gRNA的核酸,其中所述gRNA能够将所述SpCas12f1核酸酶或其变体指导至靶多核苷酸序列。在某些实施方式中,所述试剂盒进一步含有包含异源多核苷酸序列的供体模板,其中所述异源多核苷酸序列能够被插入所述靶多核苷酸序列中。The present invention also provides a kit comprising the system or composition as described above. The kit may further include one or more additional reagents, for example selected from: dilution buffer; wash buffer; control reagents, etc. In some embodiments, the kit includes (a) a SpCas12f1 nuclease or a nucleic acid encoding a SpCas12f1 nuclease as described above; and (b) a gRNA or a nucleic acid encoding the gRNA, wherein the gRNA is capable of converting The SpCas12f1 nuclease or variant thereof is directed to a target polynucleotide sequence. In certain embodiments, the kit further contains a donor template comprising a heterologous polynucleotide sequence capable of being inserted into the target polynucleotide sequence.
本发明还提供了一种基因编辑方法,将靶基因与如上所述的基因编辑系统接触,以实现靶基因的编辑。本发明方法可以用于在细胞中或在体内、离体细胞或无细胞系统中靶向、编辑、修饰或操纵靶基因(如靶DNA),所述方法包括:将如上所述的SpCas12f1核酸酶或编码其的多核苷酸、gRNA或编码其的多核苷酸、重组表达载体、系统、组合物等引入试剂盒体内、离体细胞或无细胞系统中,对靶基因进行靶向、编辑、修饰或操纵。在一实施方式中,所述方法包括如下:The present invention also provides a gene editing method, which involves contacting the target gene with the gene editing system as described above to achieve editing of the target gene. The methods of the present invention can be used to target, edit, modify or manipulate target genes (such as target DNA) in cells or in vivo, ex vivo cells or cell-free systems, and the methods include: adding the SpCas12f1 nuclease as described above Or the polynucleotide encoding it, gRNA or the polynucleotide encoding it, recombinant expression vector, system, composition, etc. are introduced into the kit in vivo, in vitro cells or cell-free systems to target, edit and modify the target gene. or manipulation. In one embodiment, the method includes the following:
(a)将所述SpCas12f1核酸酶或编码SpCas12f1核酸酶的核酸引入体内、离体细胞或无细胞系统中;和(a) introducing the SpCas12f1 nuclease or nucleic acid encoding the SpCas12f1 nuclease into an in vivo, ex vivo cell or cell-free system; and
(b)引入所述gRNA(sgRNA)或适合于原位产生这种sgRNA的核酸(例如,DNA);和(b) introducing the gRNA (sgRNA) or a nucleic acid (e.g., DNA) suitable for in situ production of such sgRNA; and
(c)使细胞或靶基因与SpCas12f1核酸酶或编码SpCas12f1核酸酶的核酸、gRNA(sgRNA)或适合于原位产生这种sgRNA的核酸接触,以在所述靶基因中产生一个或多个切割、切口或编辑;其中所述SpCas12f1核酸酶通过其加工的 或未加工形式的gRNA被指导至所述靶基因。(c) contacting the cell or target gene with SpCas12f1 nuclease or a nucleic acid encoding SpCas12f1 nuclease, a gRNA (sgRNA), or a nucleic acid suitable for producing such sgRNA in situ to produce one or more cleavages in the target gene , nicking or editing; wherein the SpCas12f1 nuclease is directed to the target gene via its processed or unprocessed form of gRNA.
在一些实施方式中,本发明所述基因编辑方法包括下列步骤:In some embodiments, the gene editing method of the present invention includes the following steps:
i)将所述SpCas12f1核酸酶或其编码多核苷酸、以及所述向导RNA或其编码多核苷酸引入细胞中;i) introducing the SpCas12f1 nuclease or its encoding polynucleotide and the guide RNA or its encoding polynucleotide into a cell;
ii)由所述SpCas12f1核酸酶介导,在靶基因中产生一种或多种切口,或靶向、编辑、修饰或操纵所述靶基因。ii) Produce one or more nicks in a target gene mediated by the SpCas12f1 nuclease, or target, edit, modify or manipulate the target gene.
本发明所述基因编辑方法中各参数条件可以根据本领域常规知识进行调整,例如,所述的包括核酸酶及向导RNA的表达载体的浓度较佳地为1μg;所述的细胞转染后再进行编辑的时间较佳地为72小时。Each parameter condition in the gene editing method of the present invention can be adjusted according to common knowledge in the field. For example, the concentration of the expression vector including nuclease and guide RNA is preferably 1 μg; the cells are transfected and then The time for editing is preferably 72 hours.
本发明所述基因编辑方法中,所述SpCas12f1核酸酶通过加工或未加工形式的向导RNA引导至靶基因。所述SpCas12f1核酸酶和向导RNA形成复合物,识别所述靶基因上的PAM序列。在一些优选实施方式中,所述方法进一步包括将包含异源多核苷酸序列的供体模板引入细胞中的步骤。In the gene editing method of the present invention, the SpCas12f1 nuclease is guided to the target gene through a processed or unprocessed form of guide RNA. The SpCas12f1 nuclease and guide RNA form a complex to recognize the PAM sequence on the target gene. In some preferred embodiments, the method further includes the step of introducing a donor template comprising a heterologous polynucleotide sequence into the cell.
本发明还提供如上所述的向导RNA、分离的多核苷酸、构建体、表达系统、基因编辑系统、药物组合物或所述的方法在体内、离体细胞或无细胞环境中对靶基因和/或其相关多肽进行基因编辑中的应用。所述离体细胞选自细菌细胞、古细菌细胞、真菌细胞、原生生物细胞、病毒细胞、植物细胞和动物细胞中的至少一种。所述基因编辑选自由:基因切割、基因删除、基因插入、点突变、转录抑制、转录激活、碱基编辑和引导编辑构成的群组,包括但不限于:The present invention also provides the above-mentioned guide RNA, isolated polynucleotide, construct, expression system, gene editing system, pharmaceutical composition or said method for targeting genes and genes in vivo, ex vivo cells or cell-free environment. /or its related peptides for use in gene editing. The in vitro cells are selected from at least one of bacterial cells, archaeal cells, fungal cells, protist cells, viral cells, plant cells and animal cells. The gene editing is selected from the group consisting of: gene cutting, gene deletion, gene insertion, point mutation, transcription inhibition, transcription activation, base editing and guided editing, including but not limited to:
切割靶基因;Cut target genes;
操控靶基因的表达;Manipulate the expression of target genes;
遗传修饰靶基因;Genetically modify target genes;
遗传修饰靶基因相关多肽;Genetically modified target gene-related peptides;
用于在靶基因的任何所需位置处的有意和受控的损伤;For intentional and controlled damage at any desired location on a target gene;
用于在靶基因的任何所需位置处的有意和受控的修复;For intentional and controlled repair at any desired location on a target gene;
引入双链断裂以外的方式修饰靶基因(SpCas12f1核酸酶具有酶活性,其以除引入双链断裂以外的方式修饰靶基因;所述酶活性可以是SpCas12f1本身所具有的,或通过例如将具有酶活性的异源多肽融合到SpCas12f1核酸酶形成嵌合SpCas12f1核酸酶获得的,所述酶活性包括但不限于甲基转移酶活性、脱氨作用 活性、歧化酶活性、烷基化活性、脱甲基酶活性、DNA修复活性、转座酶活性、重组酶活性、DNA损伤活性、脱嘌呤活性、氧化活性、嘧啶二聚体形成活性等)。Modify the target gene in a manner other than introducing a double-strand break (SpCas12f1 nuclease has an enzymatic activity that modifies the target gene in a manner other than introducing a double-strand break; the enzymatic activity may be possessed by SpCas12f1 itself, or by, for example, adding an enzyme with The active heterologous polypeptide is fused to SpCas12f1 nuclease to form a chimeric SpCas12f1 nuclease. The enzyme activities include but are not limited to methyltransferase activity, deamination activity, dismutase activity, alkylation activity, and demethylation. Enzyme activity, DNA repair activity, transposase activity, recombinase activity, DNA damage activity, depurination activity, oxidation activity, pyrimidine dimer formation activity, etc.).
优选地,所述基因编辑为基因删除或者基因切割;所述基因编辑可用于实现包括但不限于致病位点的修正、基因功能研究、增强细胞功能、细胞治疗等中的一种或几种。Preferably, the gene editing is gene deletion or gene cutting; the gene editing can be used to achieve one or more of the following including but not limited to the correction of pathogenic sites, gene function research, enhancement of cell function, cell therapy, etc. .
本发明SpCas12f1核酸酶或编码其的多核苷酸、gRNA或编码其的多核苷酸、重组表达载体、系统、组合物和试剂盒可以应用于研究领域,诊断领域,工业领域(例如微生物工程),药物发现(例如高通量筛选),靶标确认,影像学领域以及治疗领域等。The SpCas12f1 nuclease of the present invention or the polynucleotide encoding the same, gRNA or the polynucleotide encoding the same, recombinant expression vector, system, composition and kit can be applied in the research field, diagnostic field, industrial field (such as microbial engineering), Drug discovery (such as high-throughput screening), target identification, imaging fields, and therapeutic areas, etc.
在一些实施方式中,所述靶基因为靶DNA。在一些实施方式中,靶DNA可以是未与DNA相关蛋白结合的体外裸DNA。在一些实施方式中,靶DNA是体外细胞中的染色体DNA。在一些实施方式中,所述靶基因为靶RNA。在一些实施方式中,将靶DNA与包含所述SpCas12f1核酸酶和gRNA的靶向复合物接触,gRNA通过包含与靶DNA互补的核苷酸序列,为靶向复合物提供靶特异性;SpCas12f1核酸酶提供位点特异性活性。在一些实施方式中,靶向复合物修饰靶DNA,从而导致例如DNA切割、DNA甲基化作用、DNA损伤、DNA修复等。在一些实施方式中,靶向复合物修饰与靶DNA相关多肽(例如,组蛋白、DNA-结合蛋白等),从而导致例如靶DNA相关多肽-组蛋白的甲基化、组蛋白乙酰化、组蛋白泛素化等。In some embodiments, the target gene is target DNA. In some embodiments, the target DNA can be in vitro naked DNA that is not bound to DNA-associated proteins. In some embodiments, the target DNA is chromosomal DNA in cells in vitro. In some embodiments, the target gene is target RNA. In some embodiments, the target DNA is contacted with a targeting complex comprising the SpCas12f1 nuclease and a gRNA, the gRNA providing target specificity to the targeting complex by comprising a nucleotide sequence complementary to the target DNA; SpCas12f1 nucleic acid Enzymes provide site-specific activity. In some embodiments, the targeting complex modifies the target DNA, resulting in, for example, DNA cleavage, DNA methylation, DNA damage, DNA repair, and the like. In some embodiments, the targeting complex modifies a polypeptide associated with the target DNA (e.g., histones, DNA-binding proteins, etc.), resulting in, for example, methylation of the target DNA-associated polypeptide-histone, histone acetylation, histone acetylation, etc. Protein ubiquitination, etc.
本发明所述方法中,可以通过公知的方法将SpCas12f1核酸酶或包含编码SpCas12f1核酸酶的多肽的核苷酸序列的核酸引入细胞。同样地,可以通过公知的方法将gRNA或包含编码gRNA的核苷酸序列的核酸引入细胞。公知的方法包括,DEAE-葡聚糖介导的转染、脂质体介导的转染、病毒或噬菌体感染、脂质转染法、转染、接合、原生质体融合、聚乙烯亚胺介导的转染、电穿孔、磷酸钙沉淀、基因枪、磷酸钙沉淀、显微注射、纳米颗粒介导的核酸递送等。例如通过电穿孔、氯化钙转染、显微注射和脂质转染法等递送质粒。对于病毒载体递送,使细胞与包含编码gRNA和/或SpCas12f1核酸酶和/或嵌合SpCas12f1核酸酶的核酸和/或供体多核苷酸的病毒颗粒接触。In the method of the present invention, SpCas12f1 nuclease or a nucleic acid containing a nucleotide sequence encoding a polypeptide of SpCas12f1 nuclease can be introduced into cells by known methods. Likewise, gRNA or a nucleic acid comprising a nucleotide sequence encoding gRNA can be introduced into a cell by well-known methods. Well-known methods include DEAE-dextran-mediated transfection, liposome-mediated transfection, virus or phage infection, lipofection, transfection, conjugation, protoplast fusion, polyethylenimine-mediated guided transfection, electroporation, calcium phosphate precipitation, gene gun, calcium phosphate precipitation, microinjection, nanoparticle-mediated nucleic acid delivery, etc. Plasmids are delivered, for example, by electroporation, calcium chloride transfection, microinjection, and lipofection. For viral vector delivery, cells are contacted with viral particles comprising nucleic acid encoding gRNA and/or SpCas12f1 nuclease and/or chimeric SpCas12f1 nuclease and/or donor polynucleotide.
在一些实施方式中,本发明所述方法中,核酸酶切割细胞中的靶DNA以产 生双链断裂,然后由细胞通常以下列方式进行修复:非同源末端连接(NHEJ)和同源性指导的修复。In some embodiments, in the methods of the invention, a nuclease cleaves target DNA in a cell to create a double-stranded break, which is then repaired by the cell, typically through non-homologous end joining (NHEJ) and homology-directed of repair.
本发明还提供了一种细胞,包括已用上述SpCas12f1核酸酶或编码其的多核苷酸、gRNA或编码其的多核苷酸、重组表达载体、系统、组合物对宿主细胞进行了遗传修饰。The present invention also provides a cell, including a host cell that has been genetically modified with the above-mentioned SpCas12f1 nuclease or polynucleotide encoding the same, gRNA or polynucleotide encoding the same, recombinant expression vector, system, and composition.
本发明中,gRNA和/或SpCas12f1核酸酶和/或重组表达载体和/或供体多核苷酸的有效剂量对于本领域技术人员而言是常规的。可以根据不同的施用途径以及治疗的病症的特性来确定。In the present invention, the effective dosage of gRNA and/or SpCas12f1 nuclease and/or recombinant expression vector and/or donor polynucleotide is routine for those skilled in the art. This may vary depending on the route of administration and the nature of the condition being treated.
本发明中,所述细菌或原核细菌可以是大肠杆菌、肺炎克雷伯氏菌、卵形拟杆菌、空肠弯曲菌、腐生葡萄球菌、粪肠球菌、多形拟杆菌、普通拟杆菌、单形拟杆菌、干酪乳杆菌、脆弱拟杆菌、鲁氏不动杆菌、具核梭杆菌、乔氏拟杆菌、拟南芥拟杆菌、鼠李糖乳杆菌、马赛拟杆菌、粪副拟杆菌、死亡梭杆菌和短双歧杆菌等。In the present invention, the bacteria or prokaryotic bacteria may be Escherichia coli, Klebsiella pneumoniae, Bacteroides ovatus, Campylobacter jejuni, Staphylococcus saprophyticus, Enterococcus faecalis, Bacteroides thetaiotaomicron, Bacteroides vulgaris, Bacteroides monomorpha, Bacteroides, Lactobacillus casei, Bacteroides fragilis, Acinetobacter reuteri, Fusobacterium nucleatum, Bacteroides johnsonii, Bacteroides arabidopsis, Lactobacillus rhamnosus, Bacteroides marseillei, Parabacteroides faecalis, Clostridium death Bacillus and Bifidobacterium breve, etc.
本发明中,所述真核细胞包括但不限于是哺乳动物细胞、真菌等真核生物细胞。所述真菌包括酵母、曲霉,例如可以是酿酒酵母、多形汉逊酵母、毕赤酵母、脆壁克鲁维氏酵母、乳酸克鲁维氏酵母,以及栗酒裂殖酵母、白假丝酵母、杜氏假丝酵母、光滑假丝酵母、季也蒙假丝酵母、乳酒假丝酵母、克鲁斯假丝酵母、葡萄牙假丝酵母、梅林假丝酵母、嗜油假丝酵母、近平滑假丝酵母、热带假丝酵母和产朊假丝酵母、烟曲霉、黄曲霉、黑曲霉、棒曲霉、灰绿曲霉群、构巢曲霉、米曲霉、土曲霉、焦曲霉和杂色曲霉等。In the present invention, the eukaryotic cells include but are not limited to mammalian cells, fungi and other eukaryotic cells. The fungi include yeast and Aspergillus, such as Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis, Kluyveromyces lactis, Schizosaccharomyces pombe and Candida albicans. , Candida donovani, Candida glabrata, Candida quaymondo, Candida lactis, Candida krusei, Candida portuguese, Candida mellin, Candida oleophila, Candida parapsilosis Fibromyces, Candida tropicalis and Candida virogens, Aspergillus fumigatus, Aspergillus flavus, Aspergillus niger, Aspergillus clavulatus, Aspergillus griseus, Aspergillus nidulans, Aspergillus oryzae, Aspergillus terreus, Aspergillus pyrophorus and Aspergillus versicolor, etc.
在本发明的实施例中,公开了一种基于极小型CRISPR/SpCas12f1核酸酶的新型基因组编辑方法。该发明显示,通过向导RNA的导向和定位功能,SpCas12f1能够精确切割基因组DNA,实现基因组DNA双链断裂。利用宿主细胞自身或外源的修复机制,该系统能够高效和精确地实现活细胞内基因编辑。In embodiments of the present invention, a novel genome editing method based on extremely small CRISPR/SpCas12f1 nuclease is disclosed. The invention shows that through the guidance and positioning functions of guide RNA, SpCas12f1 can accurately cut genomic DNA and achieve double-stranded breaks in genomic DNA. Utilizing the host cell's own or exogenous repair mechanisms, this system can efficiently and accurately achieve gene editing in living cells.
本发明还提供了如上所述的向导RNA的制备方法,所述方法包括对基础向导RNA的tracrRNA和crRNA进行单独改造及组合改造,所述改造选自对tracrRNA或crRNA进行截断或延长,或将tracrRNA和crRNA之间通过连接链进行连接,制备得到工程化的向导RNA。The present invention also provides a method for preparing guide RNA as described above, which method includes individually transforming tracrRNA and crRNA of the basic guide RNA and transforming them in combination, and the transformation is selected from truncation or extension of tracrRNA or crRNA, or TracrRNA and crRNA are connected through connecting strands to prepare engineered guide RNA.
本发明中所述的部分序列如下所示:Partial sequences described in the present invention are as follows:
其中,所述的含靶向序列的野生型向导RNA完整序列含有如下所示的序列:Wherein, the complete sequence of the wild-type guide RNA containing the targeting sequence contains the following sequence:
5’-ucgcgaaccccaaguuauaaaaaaggucuuugacaacaaaacaagucauaucgcuuuaaagccugacauaauuuacucuguuucgcgcgccagggcaguuaggugcccuaaaagagcgaaguggccgaaaggaaaggcuaacgcuucucuaacgcuacggcgaccuuggcgaaaugccaucaauaccacgcggcccgaaaggguucgcgcgaaacugaguaauaaaacauugcggaugcggcaauacagaaccgcugucgcaucuugcguaagcgcguggauugaaac acacagaggaccccuaguaa-3’(SEQ ID NO.39); 5'-ucgcgaaccccaaguuauaaaaaaggucuuugacaacaaaacaagucauaucgcuuuaaagccugacauaauuuacucuguuucgcgccagggcaguuaggugcccuaaaagagcgaaguggccgaaaggaaaggcuaacgcuucucuaacgcuacggcgaccuuggcgaaaugccaucaauaccacgcggcccga aaggguucgcgcgaaacugaguaauaaaacauugcggaugcggcaauacagaaccgcugucgcaucuugcguaagcgcguggauugaaac acacagaggaccccuaguaa -3' (SEQ ID NO. 39);
下划线部分为所述的靶向序列:5’- acacagaggaccccuaguaa-3’(SEQ ID NO.168);是优选PAM序列后的20bp片段。 The underlined part is the targeting sequence: 5'- acacagaggaccccuaguaa -3' (SEQ ID NO. 168); which is the 20 bp fragment after the preferred PAM sequence.
本发明中所述的SpCas12f1核酸酶为SyntrophomonaspalmitaticaCas12f(SpCas12f1),其氨基酸序列优选包含如下所示的序列:The SpCas12f1 nuclease described in the present invention is Syntrophomonas palmmitatica Cas12f (SpCas12f1), and its amino acid sequence preferably includes the sequence shown below:
Figure PCTCN2022113354-appb-000001
Figure PCTCN2022113354-appb-000001
编码所述SpCas12f1核酸酶大肠杆菌密码子优化的核苷酸序列包含如下序列:The codon-optimized nucleotide sequence encoding the SpCas12f1 nuclease E. coli includes the following sequence:
5’-ATGGGTGAAAGCGTTAAAGCGATCAAACTGAAAATCCTGGACATGTTCCTGGACCCGGAATGCACCAAACAGGATGATAACTGGCGTAAAGATCTGTCTACCATGAGCCGCTTCTGCGCTGAAGCGGGCAACATGTGCCTGCGTGATCTGTACAACTACTTCAGCATGCCGAAAGAAGATCGTATCAGCAGCAAAGATCTGTACAACGCGATGTACCACAAAACCAAACTGCTGCACCCGGAACTGCCGGGCAAAGTGGCAAACCAGATCGTGAACCACGCGAAAGATGTTTGGAAACGTAACGCGAAACTGATCTACCGTAACCAGATCTCCATGCCGACCTATAAAATTACCACCGCGCCGATCCGCCTGCAGAACAACATCTACAAACTGATCAAAAACAAAAACAAATACATCATCGACGTTCAGCTGTACAGCAAAGAATACAGCAAAGACAGCGGTAAAGGCACCCACCGCTACTTCCTGGTTGCAGTGCGTGACA GCTCTACCCGTATGATCTTCGATCGTATCATGAGCAAAGATCACATCGATTCCTCCAAAAGCTACACGCAGGGTCAGCTGCAGATCAAAAAAGACCACCAGGGTAAATGGTACTGTATCATCCCGTACACCTTCCCGACCCACGAAACCGTGCTGGACCCGGACAAAGTTATGGGTGTTGATCTGGGCGTTGCGAAAGCGGTGTACTGGGCGTTCAACTCTTCTTACAAACGCGGCTGCATCGATGGCGGTGAAATCGAACACTTTCGTAAAATGATCCGTGCGCGCCGTGTTTCCATTCAGAACCAGATCAAACACAGCGGTGATGCGCGTAAAGGTCACGGTCGTAAACGCGCTCTGAAACCGATCGAAACCCTGTCTGAAAAAGAGAAAAACTTCCGCGATACCATCAACCACCGCTATGCTAACCGTATCGTTGAAGCGGCGATCAAACAGGGCTGCGGCACCATCCAGATCGAAAACCTGGAAGGCATCGCTGACACCACCGGCAGCAAATTCCTGAAAAACTGGCCGTACTATGATCTGCAGACCAAAATCGTTAACAAAGCGAAAGAACACGGCATCACCGTTGTGGCGATCAACCCGCAGTACACCAGCCAGCGTTGCAGCATGTGTGGCTACATCGAAAAAACCAACCGCAGCTCCCAGGCAGTTTTCGAATGCAAACAGTGTGGTTACGGCAGCCGTACCATCTGCATCAACTGCCGTCACGTTCAGGTTTCCGGTGATGTTTGCGAAGAATGCGGCGGCATTGTTAAAAAAGAAAACGTGAACGCGGACTACAACGCGGCGAAAAACATTAGCACCCCGTACATCGATCAGATCATCATGGAAAAATGCCTGGAACTGGGTATCCCGTACCGTAGCATCACCTGCAAAGAATGCGGTCACATCCAGGCGTCCGGTAACACCTGCGAAGTTTGCGGCTCTACCAACATCCTGAAACCGAAAAAGATCCGTAAAGCTAAATAA-3’(SEQ ID NO.41);5’-ATGGGTGAAAGCGTTAAAGCGATCAAACTGAAAATCCTGGACATGTTCCTGGACCCGGAATGCACCAAACAGGATGATAACTGGCGTAAAGATCTGTCTACCATGAGCCGCTTCTGCGCTGAAGCGGGCAACATGTGCCTGCGTGATCTGTACAACTACTTCAGCATGCCGAAAGAAGATCGTATCAGCAGCAAAGATCTGTACAACGCGATGTACCACAAAACCAAACTGCTGCACCCGGAACTGCCGGGCAAA GTGGCAAACCAGATCGTGAACCACGCGAAAGATGTTTTGGAAACGTAACGCGAAACTGATCTACCGTAACCAGATCTCCATGCCGACCTATAAAATTACCACCGCGCCGATCCGCCTGCAGAACATCTACAAACTGATCAAAAACAAAAACAAATACATCGACGTTCAGCTGTACAGCAAAGAATACAGCAAAGACAGCGGTAAAGGCACCCACCGCTACTTCCTGGTTGCAGTGCGTGACA GCTCTACCCGTATGATCTTCGA TCGTATCATGAGCAAAGATCACATCGATTCCTCCAAAAGCTACACGCAGGGTCAGCTGCAGATCAAAAAAGACCACCAGGGTTAAATGGTACTGTATCATCCCGTACACCTTCCCGACCCACGAAACCGTGCTGGACCCGGACAAAGTTATGGGTGTTGATCTGGGCGTTGCGAAAGCGGTGTACTGGGCGTTCAACTCTTCTTACAAACGCGGCTGCATCGATGGCGGTGAAATCGAACACTTTCGTAAAATGATCCGTGCGCGC CGTGTTTCATTCAGAACCAGATCAAACACAGCGGTGATGCGCGTAAAGGTCACGGTCGTAAACGCGCTCTGAAACCGATCGAAACCCTGTCTGAAAAAGAGAAAAACTTCCGCGATACCATCAACCACCGCTATGCTAACCGTATCGTTGAAGCGGCGATCAAACAGGGCTGCGGCACCATCCAGATCGAAAACCTGGAAGGCATCGCTGACACCACCGGCAGCAAATTCCTGAAAAACTGGCCGTACTATGATCTGCAGACCAAA ATCGTTAACAAAGCGAAAGAACACGGCATCACCGTTGTGGCGATCAACCCGCAGTACACCAGCCAGCGTTGCAGCATGTGTGGCTACATCGAAAAAACCAACCGCAGCTCCCAGGCAGTTTTCGAATGCAAACAGTGTGGTTACGGCAGCCGTACCATCTGCATCAACTGCCGTCACGTTCAGGTTTCCGGTGATGTTTGCGAAGAATGCGGCGGCATTGTTAAAAAAGAAAACGTGAACGCGGACTACAACGCGGCGAAAAAA CATTAGCACCCCGTACATCGATCAGATCATCATGGAAAAATGCCTGGAACTGGGTATCCCGTACCGTAGCATCACCTGCAAAGAATGCGGTCACATCCAGGCGTCCGGTAACACCTGCGAAGTTTGCGGCTCTACCAACATCCTGAAACCGAAAAAGATCCGTAAAGCTAAATAA-3’(SEQ ID NO.41);
其人源密码子优化的核苷酸序列含如下序列:Its human codon-optimized nucleotide sequence contains the following sequence:
5’-ATGGGCGAGAGCGTGAAGGCCATCAAGCTGAAGATCCTGGATATGTTTCTGGACCCTGAGTGTACCAAGCAGGATGATAACTGGAGAAAGGACCTGAGCACCATGAGCAGATTTTGCGCCGAGGCCGGCAATATGTGCCTGAGGGATCTGTATAATTACTTTAGCATGCCTAAGGAGGATAGGATTAGCAGCAAGGACCTGTACAACGCCATGTACCACAAGACCAAGCTGCTGCACCCTGAGCTGCCCGGCAAGGTGGCCAACCAGATCGTGAACCACGCCAAGGATGTGTGGAAGAGAAACGCCAAGCTGATCTACAGAAACCAGATCAGCATGCCTACATACAAGATTACCACAGCCCCTATCAGGCTGCAGAACAACATCTACAAGCTGATCAAGAACAAAAATAAGTACATCATCGACGTGCAGCTGTATTCCAAGGAGTACTCCAAGGACTCTGGCAAGGGCACCCACAGGTACTTCCTGGTGGCCGTGAGAGACAGCAGCACAAGGATGATCTTCGACAGGATCATGTCTAAGGATCACATCGATTCCAGCAAGTCCTACACCCAGGGCCAGCTGCAGATCAAGAAGGACCACCA GGGCAAGTGGTACTGCATCATCCCATACACCTTCCCAACCCACGAGACAGTGCTGGACCCCGACAAGGTCATGGGCGTGGACCTGGGCGTGGCCAAGGCCGTGTACTGGGCCTTCAACAGCAGCTACAAGCGCGGCTGCATCGACGGCGGCGAGATCGAGCACTTCCGGAAGATGATCAGAGCCAGGAGAGTGAGCATCCAGAACCAGATCAAGCACTCCGGCGACGCAAGGAAGGGCCACGGCAGAAAGCGGGCCCTGAAGCCAATCGAAACCCTGTCTGAGAAGGAGAAGAACTTCCGGGACACCATCAACCACAGATACGCCAACAGGATCGTGGAGGCCGCCATTAAGCAGGGCTGCGGCACAATCCAGATCGAGAATCTGGAGGGCATCGCCGACACCACCGGCAGCAAGTTCCTGAAGAATTGGCCTTACTACGATCTGCAGACCAAGATCGTGAATAAGGCCAAGGAGCACGGCATCACCGTGGTGGCCATCAACCCTCAGTACACATCCCAGAGATGTTCCATGTGCGGCTACATCGAAAAGACAAACCGCAGCAGCCAGGCCGTGTTCGAGTGCAAGCAGTGCGGCTACGGCAGCAGGACCATCTGCATCAACTGTAGACACGTGCAGGTGAGCGGCGACGTGTGTGAGGAGTGCGGCGGCATCGTGAAGAAGGAAAACGTGAACGCCGATTACAATGCCGCCAAGAACATCAGCACCCCCTACATCGACCAGATTATCATGGAGAAGTGCCTGGAGCTGGGCATCCCCTACAGAAGCATCACCTGTAAGGAGTGCGGCCACATCCAGGCCAGCGGCAATACCTGCGAGGTGTGCGGCAGCACCAACATCCTGAAGCCTAAGAAGATCAGAAAGGCCAAG-3’(SEQ ID NO.42);5’-ATGGGCGAGAGCGTGAAGGCCATCAAGCTGAAGATCCTGGATATGTTTTGGACCCTGAGTGTACCAAGCAGGATGATAACTGGAGAAAGGACCTGAGCACCATGAGCAGATTTTGCGCCGAGGCCGGCAATATGTGCCTGAGGGATCTGTATAATTACTTTAGCATGCCTAAGGAGGATAGGATTAGCAGCAAGGACCTGTACAACGCCATGTACCACAAGACCAAGCTGCTGCACCCTGAGCTGCCCGGCAAGGTG GCCAACCAGATCGTGAACCACGCCAAGGATGTGTGGAAGAGAAACGCCAAGCTGATCTACAGAAACCAGATCAGCATGCCTACATACAAGATTACCACAGCCCCTATCAGGCTGCAGAACATCTACAAGCTGATCAAGAAAAATAAGTACATCATCGACGTGCAGCTGTATTCCAAGGAGTACTCCAAGGACTCTGGCAAGGGCACCCACAGGTACTTCCTGGTGGCCGTGAGAGACAGCAGCACAAGGATGATCTTCGACAGG ATCATGTCTAAGGATCACATCGATTCCAGCAAGTCCTACACCCAGGGCCAGCTGCAGATCAAGAAGGACCACCA GGGCAAGTGGTACTGCATCATCCCATACACCTTCCCAACCCACGAGACAGTGCTGGACCCCGACAAGGTCATGGGCGTGGACCTGGGCGTGGCCAAGGCCGTGTACTGGGCCTTCAACAGCAGCTACAAGCGCGGCTGCATCGACGGCGGCGAGATCGAGCACTTCCGGAAGATGATCAGAGCCAGGAGAGT GAGCATCCAGAACCAGATCAAGCACTCCGGCGACGCAAGGAAGGGCCACGGCAGAAAGCGGGCCCTGAAGCCAATCGAAACCCTGTCTGAGAAGGAGAAGAACTTCCGGGACACCATCAACCGATACGCCAACAGGATCGTGGAGGCCGCCATTAAGCAGGGCTGCGGCACAATCCAGATCGAGAATCTGGAGGGCATCGCCGACACCACCGGCAGCAAGTTCCTGAAGAATTGGCCTTACTACGATCTGCAGAC CAAGATCGTGAATAAGGCCAAGGAGCACGGCATCACCGTGGTGGCCATCAACCCTCAGTACACATCCCAGAGATGTTCCATGTGCGGCTACATCGAAAAGACAAACCGCAGCAGGCCGTGTTCGAGTGCAAGCAGTGCGGCTACGGCAGCAGGACCATCTGCATCAACTGTAGACACGTGCAGGTGAGCGGCGACGTGTGTGAGGAGTGCCGGCGGCATCGTGAAGAAGGAAAACGTGAACGCCGATTACAATGCCGCCAA GAACATCAGCACCCCCTACATCGACCAGATTATCATGGAGAAGTGCCTGGAGCTGGGCATCCCCTACAGAAGCATCACCTGTAAGGAGTGCGGCCACATCCAGGCCAGCGGCAATACCTGCGAGGTGTGCGGCAGCACCAACATCCTGAAGCCTAAGAAGATCAGAAAGGCCAAG-3’(SEQ ID NO.42);
本发明中所述的SpCas12f1核酸酶相应的不同版本的向导RNA序列如下所示,其中 浅灰色为tracrRNA序列;深灰色为tracr配对物序列;当无连接链时,tracrRNA序列+tracr配对物序列为骨架;当有连接链时,tracrRNA序列+连接链+tracr配对物序列为骨架。 The guide RNA sequences of different versions corresponding to the SpCas12f1 nuclease described in the present invention are as follows, where light gray is the tracrRNA sequence; dark gray is the tracr partner sequence; when there is no connecting chain, the tracrRNA sequence + tracr partner sequence is Backbone; when there is a connecting strand, the tracrRNA sequence + connecting strand + tracr partner sequence is the backbone.
Figure PCTCN2022113354-appb-000002
Figure PCTCN2022113354-appb-000003
其示意图如图1所示;其tracrRNA序列如SEQ ID NO.111所示,tracr配对物序列如SEQ ID NO.112所示;向导RNA骨架序列为SEQ ID NO.1除基因靶向区段的序列,如SEQ ID NO.113所示;与靶序列杂交的基因靶向区段如SEQ ID NO.168所示( acacagaggaccccuaguaa);
Figure PCTCN2022113354-appb-000002
Figure PCTCN2022113354-appb-000003
Its schematic diagram is shown in Figure 1; its tracrRNA sequence is shown in SEQ ID NO.111, and its tracr partner sequence is shown in SEQ ID NO.112; the guide RNA backbone sequence is SEQ ID NO.1 except for the gene targeting segment. The sequence is shown in SEQ ID NO.113; the gene targeting segment that hybridizes to the target sequence is shown in SEQ ID NO.168 ( acacagaggaccccuaguaa );
Figure PCTCN2022113354-appb-000004
Figure PCTCN2022113354-appb-000005
Figure PCTCN2022113354-appb-000006
其tracrRNA序列如SEQ ID NO.114所示,tracr配对物序列如SEQ ID NO.115所示;向导RNA骨架序列为SEQ ID NO.2除基因靶向区段的序列,如SEQ ID NO.116所示;
Figure PCTCN2022113354-appb-000004
Figure PCTCN2022113354-appb-000005
Figure PCTCN2022113354-appb-000006
The tracrRNA sequence is shown in SEQ ID NO.114, the tracr partner sequence is shown in SEQ ID NO.115; the guide RNA backbone sequence is the sequence of SEQ ID NO.2 except for the gene targeting segment, such as SEQ ID NO.116 shown;
Figure PCTCN2022113354-appb-000007
Figure PCTCN2022113354-appb-000008
其tracrRNA序列如SEQ ID NO.117所示,tracr配对物序列如SEQ ID NO.118所示;向导RNA骨架序列为SEQ ID NO.3除基因靶向区段的序列,如SEQ ID NO.119所示;
Figure PCTCN2022113354-appb-000007
Figure PCTCN2022113354-appb-000008
The tracrRNA sequence is shown in SEQ ID NO.117, the tracr partner sequence is shown in SEQ ID NO.118; the guide RNA backbone sequence is the sequence of SEQ ID NO.3 except the gene targeting segment, such as SEQ ID NO.119 shown;
Figure PCTCN2022113354-appb-000009
Figure PCTCN2022113354-appb-000010
其tracrRNA序列如SEQ ID NO.120所示,tracr配对物序列如SEQ ID NO.121所示;向导RNA骨架序列为SEQ ID NO.4除基因靶向区段的序列,如SEQ ID NO.122所示;
Figure PCTCN2022113354-appb-000009
Figure PCTCN2022113354-appb-000010
The tracrRNA sequence is shown in SEQ ID NO.120, the tracr partner sequence is shown in SEQ ID NO.121; the guide RNA backbone sequence is the sequence of SEQ ID NO.4 except the gene targeting segment, such as SEQ ID NO.122 shown;
Figure PCTCN2022113354-appb-000011
Figure PCTCN2022113354-appb-000012
其tracrRNA序列如SEQ ID NO.123所示,tracr配对物序列如SEQ ID NO.124所示;向导RNA骨架序列为SEQ ID NO.5除基因靶向区段的序列,如SEQ ID NO.125所示;
Figure PCTCN2022113354-appb-000011
Figure PCTCN2022113354-appb-000012
The tracrRNA sequence is shown in SEQ ID NO.123, the tracr partner sequence is shown in SEQ ID NO.124; the guide RNA backbone sequence is the sequence of SEQ ID NO.5 except the gene targeting segment, such as SEQ ID NO.125 shown;
Figure PCTCN2022113354-appb-000013
Figure PCTCN2022113354-appb-000014
其tracrRNA序列如SEQ ID NO.126所示,tracr配对物序列如SEQ ID NO.127所示;向导RNA骨架序列为SEQ ID NO.6除基因靶向区段的序列,如SEQ ID NO.128所示;
Figure PCTCN2022113354-appb-000013
Figure PCTCN2022113354-appb-000014
The tracrRNA sequence is shown in SEQ ID NO.126, the tracr partner sequence is shown in SEQ ID NO.127; the guide RNA backbone sequence is the sequence of SEQ ID NO.6 except for the gene targeting segment, such as SEQ ID NO.128 shown;
Figure PCTCN2022113354-appb-000015
Figure PCTCN2022113354-appb-000016
Figure PCTCN2022113354-appb-000017
其tracrRNA序列如SEQ ID NO.129所示,tracr配对物序列如SEQ ID NO.130所示;向导RNA骨架序列为SEQ ID NO.7除基因靶向区段的序列,如SEQ ID NO.131所示;
Figure PCTCN2022113354-appb-000015
Figure PCTCN2022113354-appb-000016
Figure PCTCN2022113354-appb-000017
The tracrRNA sequence is shown in SEQ ID NO.129, the tracr partner sequence is shown in SEQ ID NO.130; the guide RNA backbone sequence is the sequence of SEQ ID NO.7 except the gene targeting segment, such as SEQ ID NO.131 shown;
Figure PCTCN2022113354-appb-000018
Figure PCTCN2022113354-appb-000019
其tracrRNA序列如SEQ ID NO.132所示,tracr配对物序列如SEQ ID NO.133所示;向导RNA骨架序列为SEQ ID NO.8除基因靶向区段的序列,如SEQ ID NO.134所示;
Figure PCTCN2022113354-appb-000018
Figure PCTCN2022113354-appb-000019
The tracrRNA sequence is shown in SEQ ID NO.132, the tracr partner sequence is shown in SEQ ID NO.133; the guide RNA backbone sequence is the sequence of SEQ ID NO.8 except the gene targeting segment, such as SEQ ID NO.134 shown;
Figure PCTCN2022113354-appb-000020
Figure PCTCN2022113354-appb-000021
其tracrRNA序列如SEQ ID NO.135所示,tracr配对物序列如SEQ ID NO.136所示;向导RNA骨架序列为SEQ ID NO.9除基因靶向区段的序列,如SEQ ID NO.137所示;
Figure PCTCN2022113354-appb-000020
Figure PCTCN2022113354-appb-000021
The tracrRNA sequence is shown in SEQ ID NO.135, the tracr partner sequence is shown in SEQ ID NO.136; the guide RNA backbone sequence is the sequence of SEQ ID NO.9 except the gene targeting segment, such as SEQ ID NO.137 shown;
Figure PCTCN2022113354-appb-000022
Figure PCTCN2022113354-appb-000023
其tracrRNA序列如SEQ ID NO.138所示,tracr配对物序列如SEQ ID NO.139所示;向导RNA骨架序列为SEQ ID NO.10除基因靶向区段的序列,如SEQ ID NO.140所示;
Figure PCTCN2022113354-appb-000022
Figure PCTCN2022113354-appb-000023
The tracrRNA sequence is shown in SEQ ID NO.138, the tracr partner sequence is shown in SEQ ID NO.139; the guide RNA backbone sequence is the sequence of SEQ ID NO.10 except for the gene targeting segment, such as SEQ ID NO.140 shown;
Figure PCTCN2022113354-appb-000024
Figure PCTCN2022113354-appb-000025
其tracrRNA序列如SEQ ID NO.141所示,tracr配对物序列如SEQ ID NO.142所示;向导RNA骨架序列为SEQ ID NO.11除基因靶向区段的序列,如SEQ ID NO.143所示;
Figure PCTCN2022113354-appb-000024
Figure PCTCN2022113354-appb-000025
The tracrRNA sequence is shown in SEQ ID NO.141, the tracr partner sequence is shown in SEQ ID NO.142; the guide RNA backbone sequence is the sequence of SEQ ID NO.11 except the gene targeting segment, such as SEQ ID NO.143 shown;
Figure PCTCN2022113354-appb-000026
Figure PCTCN2022113354-appb-000027
其tracrRNA序列如SEQ ID NO.144所示,tracr配对物序列如SEQ ID NO.145所示;向导RNA骨架序列为SEQ ID NO.12除基因靶向区段的序列,如SEQ ID NO.146所示;
Figure PCTCN2022113354-appb-000026
Figure PCTCN2022113354-appb-000027
The tracrRNA sequence is shown in SEQ ID NO.144, the tracr partner sequence is shown in SEQ ID NO.145; the guide RNA backbone sequence is the sequence of SEQ ID NO.12 except for the gene targeting segment, such as SEQ ID NO.146 shown;
Figure PCTCN2022113354-appb-000028
其tracrRNA序列如SEQ ID NO.147所示,tracr配对物序列如SEQ ID NO.148所示;向导RNA骨架序列为SEQ ID NO.13除基因靶向区段的序列,如SEQ ID NO.149所示;
Figure PCTCN2022113354-appb-000028
The tracrRNA sequence is shown in SEQ ID NO.147, the tracr partner sequence is shown in SEQ ID NO.148; the guide RNA backbone sequence is the sequence of SEQ ID NO.13 except the gene targeting segment, such as SEQ ID NO.149 shown;
Figure PCTCN2022113354-appb-000029
Figure PCTCN2022113354-appb-000030
其tracrRNA序列如SEQ ID NO.150所示,tracr配对物序列如SEQ ID NO.151所示;向导RNA骨架序列为SEQ ID NO.14除基因靶向区段的序列,如SEQ ID NO.152所示;
Figure PCTCN2022113354-appb-000029
Figure PCTCN2022113354-appb-000030
The tracrRNA sequence is shown in SEQ ID NO.150, the tracr partner sequence is shown in SEQ ID NO.151; the guide RNA backbone sequence is the sequence of SEQ ID NO.14 except for the gene targeting segment, such as SEQ ID NO.152 shown;
Figure PCTCN2022113354-appb-000031
Figure PCTCN2022113354-appb-000032
其tracrRNA序列如SEQ ID NO.153所示,tracr配对物序列如SEQ ID NO.154所示;向导RNA骨架序列为SEQ ID NO.15除基因靶向区段的序列,如SEQ ID NO.155所示;
Figure PCTCN2022113354-appb-000031
Figure PCTCN2022113354-appb-000032
The tracrRNA sequence is shown in SEQ ID NO.153, the tracr partner sequence is shown in SEQ ID NO.154; the guide RNA backbone sequence is the sequence of SEQ ID NO.15 except the gene targeting segment, such as SEQ ID NO.155 shown;
Figure PCTCN2022113354-appb-000033
Figure PCTCN2022113354-appb-000034
其tracrRNA序列如SEQ ID NO.156所示,tracr配对物序列如SEQ ID NO.157所示;向导RNA骨架序列为SEQ ID NO.16除基因靶向区段的序列,如SEQ ID NO.158 所示;
Figure PCTCN2022113354-appb-000033
Figure PCTCN2022113354-appb-000034
The tracrRNA sequence is shown in SEQ ID NO.156, the tracr partner sequence is shown in SEQ ID NO.157; the guide RNA backbone sequence is the sequence of SEQ ID NO.16 except the gene targeting segment, such as SEQ ID NO.158 shown;
Figure PCTCN2022113354-appb-000035
Figure PCTCN2022113354-appb-000036
其tracrRNA序列如SEQ ID NO.159所示,tracr配对物序列如SEQ ID NO.160所示;向导RNA骨架序列为SEQ ID NO.17除基因靶向区段的序列,如SEQ ID NO.161所示;
Figure PCTCN2022113354-appb-000035
Figure PCTCN2022113354-appb-000036
The tracrRNA sequence is shown in SEQ ID NO.159, the tracr partner sequence is shown in SEQ ID NO.160; the guide RNA backbone sequence is the sequence of SEQ ID NO.17 except for the gene targeting segment, such as SEQ ID NO.161 shown;
Figure PCTCN2022113354-appb-000037
Figure PCTCN2022113354-appb-000038
其tracrRNA序列如SEQ ID NO.162所示,tracr配对物序列如SEQ ID NO.163所示;向导RNA骨架序列为SEQ ID NO.18除基因靶向区段的序列,如SEQ ID NO.164所示;
Figure PCTCN2022113354-appb-000037
Figure PCTCN2022113354-appb-000038
The tracrRNA sequence is shown in SEQ ID NO.162, the tracr partner sequence is shown in SEQ ID NO.163; the guide RNA backbone sequence is the sequence of SEQ ID NO.18 except for the gene targeting segment, such as SEQ ID NO.164 shown;
Figure PCTCN2022113354-appb-000039
Figure PCTCN2022113354-appb-000040
其tracrRNA序列如SEQ ID NO.165所示,tracr配对物序列如SEQ ID NO.166所示;向导RNA骨架序列为SEQ ID NO.19除基因靶向区段的序列,如SEQ ID NO.167所示。
Figure PCTCN2022113354-appb-000039
Figure PCTCN2022113354-appb-000040
The tracrRNA sequence is shown in SEQ ID NO.165, the tracr partner sequence is shown in SEQ ID NO.166; the guide RNA backbone sequence is the sequence of SEQ ID NO.19 except for the gene targeting segment, such as SEQ ID NO.167 shown.
下划线部分是所述的靶向序列,是优选PAM序列后的20bp片段,可由其他符合条件的靶向序列进行替换。The underlined part is the targeting sequence, which is preferably the 20 bp fragment after the PAM sequence, which can be replaced by other qualifying targeting sequences.
术语“SpCas12f1”、“SpCas12f1核酸酶”、“SpCas12f1多肽”、“SpCas12f1蛋白”、“SpCas12f1蛋白质”可互换使用。The terms "SpCas12f1", "SpCas12f1 nuclease", "SpCas12f1 polypeptide", "SpCas12f1 protein" and "SpCas12f1 protein" are used interchangeably.
术语“向导RNA”、“向导RNA”、“gRNA”、“单一gRNA”和“嵌合gRNA”可互换使用。The terms "guide RNA", "guide RNA", "gRNA", "single gRNA" and "chimeric gRNA" are used interchangeably.
术语“一”或“一个”实体是指一个或多个该实体;因此,术语“一”(或“一个”),“一个或多个”和“至少一个”在本文中可互换使用。The term "a" or "an" entity refers to one or more of that entity; thus, the terms "a" (or "an"), "one or more" and "at least one" are used interchangeably herein.
术语“同源性”或“同一性”或“相似性”是指两个肽之间或两个核酸分子之间的序列相似性。同源性可以通过比对不同多肽或核酸分子中的相对应位置来 确定,当被比较分子序列中的同一位置在不同序列中被相同的碱基或氨基酸占据时,那么该分子在该位置是同源的。序列之间的同源程度由序列共有的匹配或同源位置的数目的函数决定。“不相关的”或“非同源的”序列与本发明所公开的序列之一的同源性应小于20%。The term "homology" or "identity" or "similarity" refers to the sequence similarity between two peptides or between two nucleic acid molecules. Homology can be determined by comparing corresponding positions in different polypeptides or nucleic acid molecules. When the same position in the sequence of the molecule being compared is occupied by the same base or amino acid in different sequences, then the molecule is at that position. Homogenous. The degree of homology between sequences is determined as a function of the number of matches or homologous positions shared by the sequences. An "unrelated" or "non-homologous" sequence should have less than 20% homology to one of the sequences disclosed herein.
多核苷酸或多核苷酸区域(或多肽或多肽区域)与另一多核苷酸或多核苷酸区域(或多肽或多肽区域)具有一定百分比的序列同源性(例如,20%、30%、40%、50%、60%、70%、80%、90%、95%、98%或99%)是指比对时,被比对的两个序列中有该百分比的碱基(或氨基酸)是相同的。该比对和百分比同源性或序列同一性可以使用本领域已知的软件程序和方法来确定。A polynucleotide or polynucleotide region (or polypeptide or polypeptide region) has a certain percentage of sequence homology (e.g., 20%, 30%) with another polynucleotide or polynucleotide region (or polypeptide or polypeptide region). , 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98% or 99%) refers to the percentage of bases (or amino acids) are the same. The alignment and percent homology or sequence identity can be determined using software programs and methods known in the art.
在本发明中,术语“多核苷酸”和“寡核苷酸”可互换使用,并且它们是指任何长度的核苷酸的聚合形式,无论是脱氧核糖核苷酸还是核糖核苷酸或其类似物。多核苷酸可以具有任何三维结构并且可以执行已知或未知的任何功能。多核苷酸的实例包括但不限于如下这些:基因或基因片段(包括探针,引物,EST或SAGE标签),外显子,内含子,信使RNA(mRNA),转运RNA,核糖体RNA,核酶,cDNA,dsRNA,siRNA,miRNA,重组多核苷酸,分支多核苷酸,质粒,载体,任何序列的分离的DNA,任何序列的分离的RNA,核酸探针和引物。多核苷酸也包含经过修饰的核苷酸,例如甲基化的核苷酸和核苷酸类似物。如果多核苷酸上存在修饰,该修饰可以在组装多核苷酸之前或之后赋予。核苷酸序列可以被非核苷酸组分打断。多核苷酸可以在聚合后被进一步修饰,例如通过偶联被标记组分所标记。该术语同时指双链和单链多核苷酸分子。除非另有说明或要求,否则本发明公开的多核苷酸的任何实施方案包括其双链形式和已知或预测能构成双链形式的两条互补单链形式中的任一种。In the present invention, the terms "polynucleotide" and "oligonucleotide" are used interchangeably and they refer to a polymeric form of nucleotides of any length, whether deoxyribonucleotides or ribonucleotides or its analogues. Polynucleotides can have any three-dimensional structure and can perform any function, known or unknown. Examples of polynucleotides include, but are not limited to, the following: genes or gene fragments (including probes, primers, EST or SAGE tags), exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, Ribozymes, cDNA, dsRNA, siRNA, miRNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes and primers. Polynucleotides also include modified nucleotides, such as methylated nucleotides and nucleotide analogs. If a modification is present on the polynucleotide, the modification can be imparted before or after the polynucleotide is assembled. Nucleotide sequences can be interrupted by non-nucleotide components. The polynucleotide may be further modified after polymerization, for example by conjugation to be labeled with a labeling component. The term refers to both double-stranded and single-stranded polynucleotide molecules. Unless otherwise stated or required, any embodiment of a polynucleotide disclosed herein includes its double-stranded form and any of the two complementary single-stranded forms known or predicted to constitute the double-stranded form.
当将其应用于多核苷酸时,术语“编码”是指多核苷酸“编码”多肽,意即在其天然状态下或当通过本领域技术人员公知的方法操作时,其可以通过转录和/或翻译以产生目的多肽和/或其片段,或产生能够编码该目的多肽和/或其片段的mRNA。反义链是指与该多核苷酸互补的序列,并且可以从中推导出编码序列。When applied to a polynucleotide, the term "encoding" means that the polynucleotide "encodes" a polypeptide, meaning that in its native state or when manipulated by methods well known to those skilled in the art, it can be transcribed and/or Or translated to produce a polypeptide of interest and/or a fragment thereof, or to produce an mRNA capable of encoding the polypeptide of interest and/or a fragment thereof. The antisense strand refers to the sequence that is complementary to the polynucleotide and from which the coding sequence can be deduced.
术语“基因组DNA”表示生物的基因组的DNA,包括细菌、古细菌、真菌、原生生物、病毒、植物或动物的基因组的DNA。The term "genomic DNA" refers to the DNA of the genome of an organism, including the DNA of the genome of bacteria, archaea, fungi, protists, viruses, plants or animals.
术语“操纵”DNA包括结合、在一条链产生切口、或切割DNA的两条链, 或包括修饰或编辑DNA或与DNA结合的多肽。操纵DNA可以沉默、激活或调节由所述DNA编码的RNA或多肽的表达(使其不转录,或降低转录活性,或使其不翻译,或降低翻译水平),或防止或增强多肽与DNA的结合。切割可通过多种方法进行,例如磷酸二酯键的酶促或化学水解;可单链切割或双链切割;DNA切割可导致平头末端或交错末端的产生。The term "manipulating" DNA includes binding, nicking one strand, or cleaving both strands of DNA, or includes modifying or editing DNA or polypeptides that bind to DNA. Manipulating DNA can silence, activate or regulate the expression of RNA or polypeptide encoded by the DNA (so that it is not transcribed, or reduces the transcription activity, or makes it not translated, or reduces the level of translation), or prevents or enhances the interaction of the polypeptide with the DNA. combine. Cleavage can be performed by a variety of methods, such as enzymatic or chemical hydrolysis of phosphodiester bonds; cleavage can be single-stranded or double-stranded; DNA cleavage can result in blunt or staggered ends.
术语“可杂交的”或“互补的”或“基本上互补的”是指,核酸(例如RNA)包含核苷酸序列,该序列使得其能够在适当的体外和/或体内温度和溶液离子强度条件下以序列特异性的、反平行的方式与另一核酸非共价地结合,即,形成沃森-克里克碱基对和/或G/U碱基对、“退火”或“杂交”。The term "hybridizable" or "complementary" or "substantially complementary" means that a nucleic acid (e.g., RNA) contains a nucleotide sequence that enables it to react at appropriate in vitro and/or in vivo temperatures and solution ionic strengths. Non-covalently bind to another nucleic acid under conditions in a sequence-specific, antiparallel manner, i.e., form Watson-Crick base pairs and/or G/U base pairs, "anneal" or "hybridize" ".
在本领域中可以理解,多核苷酸的序列不需要与其可特异性杂交的靶核酸的序列100%互补。多核苷酸可在一个或多个区段上杂交。多核苷酸可包含与其所靶向的靶核酸序列内的靶区域的至少70%、至少80%、至少90%、至少95%、至少99%或100%的序列互补性。It is understood in the art that the sequence of a polynucleotide need not be 100% complementary to the sequence of a target nucleic acid to which it specifically hybridizes. Polynucleotides can hybridize on one or more segments. The polynucleotide may comprise at least 70%, at least 80%, at least 90%, at least 95%, at least 99%, or 100% sequence complementarity to a target region within the target nucleic acid sequence to which it is targeted.
术语“肽”、“多肽”和“蛋白”在本发明中可互换地使用,且表示任何长度的氨基酸的聚合形式,其可以包括编码和非编码的氨基酸,化学或生物化学修饰或衍生的氨基酸,和具有修饰的肽主链的多肽。The terms "peptide", "polypeptide" and "protein" are used interchangeably in the present invention and refer to polymeric forms of amino acids of any length, which may include both coded and non-coded amino acids, chemically or biochemically modified or derivatized Amino acids, and polypeptides with modified peptide backbones.
术语“编码”特定RNA的DNA序列是转录为RNA的DNA核酸序列。DNA多核苷酸可以编码被翻译成蛋白的RNA(mRNA),或者DNA多核苷酸可以编码不被翻译成蛋白质的RNA(例如,tRNA、rRNA或gRNA;也称为“非编码”RNA或“ncRNA”)。“蛋白编码序列”或编码特定蛋白或多肽的序列是在合适调节序列的控制下,在体内或体外转录成mRNA(在DNA的情况下)并翻译(在mRNA的情况下)成多肽的核酸序列。The term "encoding" a DNA sequence for a particular RNA is a DNA nucleic acid sequence that is transcribed into RNA. A DNA polynucleotide may encode an RNA (mRNA) that is translated into a protein, or a DNA polynucleotide may encode an RNA that is not translated into a protein (e.g., tRNA, rRNA, or gRNA; also known as "non-coding" RNA or "ncRNA" "). A "protein coding sequence" or a sequence encoding a specific protein or polypeptide is a nucleic acid sequence that is transcribed into mRNA (in the case of DNA) and translated (in the case of mRNA) into a polypeptide in vivo or in vitro under the control of appropriate regulatory sequences .
术语“载体”或“表达载体”是复制子,诸如质粒、噬菌体、病毒或粘粒,可以在其上附着另一种DNA区段,即,“插入片段”,以便实现附着的区段在细胞中的复制。The term "vector" or "expression vector" is a replicon, such as a plasmid, phage, virus or cosmid, to which another DNA segment, i.e., an "insert" can be attached so that the attached segment can be expressed in a cell copy in.
术语“表达盒”包含可操作地连接至启动子的DNA编码序列。“可操作地连接的”表示并列连接,各组分处于允许它们以其预期方式起作用的关系中。术语“重组表达载体”或“DNA构建体”在本发明中互换地用于表示包含载体和至少一个插入片段的DNA分子。重组表达载体通常是为了表达和/或扩增插入片段的 目的或为了构建其它重组核苷酸序列而产生。The term "expression cassette" encompasses a DNA coding sequence operably linked to a promoter. "Operably connected" means connected in parallel, with the components in a relationship that allows them to function in their intended manner. The terms "recombinant expression vector" or "DNA construct" are used interchangeably in the present invention to refer to a DNA molecule comprising a vector and at least one insert. Recombinant expression vectors are typically produced for the purpose of expressing and/or amplifying inserts or for the construction of other recombinant nucleotide sequences.
当外源DNA例如重组表达载体已经被引入细胞内时,细胞已被所述DNA“遗传上修饰”或“转化”或“转染”。外源DNA的存在导致永久或短暂的遗传改变。转化DNA整合或不整合到细胞的基因组中。When foreign DNA, such as a recombinant expression vector, has been introduced into a cell, the cell has been "genetically modified" or "transformed" or "transfected" by the DNA. The presence of foreign DNA results in permanent or transient genetic changes. The transforming DNA may or may not be integrated into the cell's genome.
术语“靶DNA”是包含“靶位点”或“靶序列”的DNA多核苷酸。术语“靶位点”、“靶序列”、“靶原间隔区DNA”或“原间隔区样序列”在本发明中互换地用于表示存在于靶DNA中的核酸序列,如果存在足够的用于结合的条件,则gRNA的DNA-靶向区段将与其结合。RNA分子包含与靶DNA内的靶序列结合、杂交或互补的序列,从而将结合的多肽靶向至靶DNA内的特定位置(靶序列)。“切割”是指DNA分子的共价主链的断裂。The term "target DNA" is a DNA polynucleotide comprising a "target site" or "target sequence". The terms "target site", "target sequence", "target protospacer DNA" or "protospacer-like sequence" are used interchangeably in the present invention to refer to the nucleic acid sequence present in the target DNA if sufficient conditions for binding, then the DNA-targeting segment of the gRNA will bind to it. RNA molecules contain sequences that bind, hybridize, or are complementary to target sequences within target DNA, thereby targeting the bound polypeptide to a specific location within the target DNA (target sequence). "Cleaving" refers to the break of the covalent backbone of the DNA molecule.
术语“核酸酶”和“内切核酸酶”可互换使用,用于指具有用于多核苷酸切割的内切核酸降解催化活性的酶。核酸酶的“切割结构域”或“活性结构域”或“核酸酶结构域”是指在核酸酶内具有用于DNA切割的催化活性的多肽序列或结构域。切割结构域可以包含在单个多肽链中,或者切割活性可以由两个或多个多肽的缔合产生。The terms "nuclease" and "endonuclease" are used interchangeably to refer to enzymes having endonucleic acid degrading catalytic activity for polynucleotide cleavage. The "cleavage domain" or "active domain" or "nuclease domain" of a nuclease refers to a polypeptide sequence or domain within a nuclease that has catalytic activity for DNA cleavage. The cleavage domain may be contained in a single polypeptide chain, or the cleavage activity may result from the association of two or more polypeptides.
术语“定位多肽”或“RNA-结合位点指导的多肽”是指结合RNA并靶向至特定DNA序列的多肽。The term "targeting polypeptide" or "RNA-binding site directed polypeptide" refers to a polypeptide that binds RNA and is targeted to a specific DNA sequence.
术语“引导序列”或DNA-靶向区段(或“DNA-靶向序列”)包含与在本发明中称为“原间隔区样”序列的靶DNA内的特定序列互补的核苷酸序列(靶DNA的互补链)。The term "leader sequence" or DNA-targeting segment (or "DNA-targeting sequence") encompasses a nucleotide sequence complementary to a specific sequence within the target DNA, referred to in the present invention as a "protospacer-like" sequence (Complementary strand of target DNA).
术语“重组”是指在两个多核苷酸之间交换遗传信息的过程。本发明中使用的“同源性指导的修复(HDR)”表示例如在细胞中双链断裂修复期间发生的特殊化形式的DNA修复。这个过程需要核苷酸序列同源性,使用“供体”分子为“靶”分子(即,经历双链断裂的分子)的修复提供模板,并导致遗传信息从供体转移至靶。如果供体多核苷酸不同于靶分子,并且供体多核苷酸的部分或全部序列被并入到靶DNA中,则同源性指导的修复可能导致靶分子序列的改变(例如插入、缺失、突变)。The term "recombination" refers to the process of exchanging genetic information between two polynucleotides. "Homology-directed repair (HDR)" as used herein refers to a specialized form of DNA repair that occurs, for example, during the repair of double-strand breaks in cells. This process requires nucleotide sequence homology, uses a "donor" molecule to provide a template for repair of a "target" molecule (i.e., a molecule that has undergone a double-strand break), and results in the transfer of genetic information from the donor to the target. If the donor polynucleotide is different from the target molecule and part or all of the donor polynucleotide sequence is incorporated into the target DNA, homology-directed repair may result in changes to the target molecule sequence (e.g., insertions, deletions, mutation).
术语“非同源末端连接(NHEJ)”是指在不需要同源模板的情况下通过将断裂末端彼此直接连接来修复DNA中的双链断裂。NHEJ经常导致双链断裂位 点附近的核苷酸序列缺失。The term "nonhomologous end joining (NHEJ)" refers to the repair of double-stranded breaks in DNA by directly joining the broken ends to each other without the need for homologous templates. NHEJ often results in deletions of nucleotide sequences near the double-strand break site.
术语“治疗”包括防止疾病或症状的发生;抑制疾病或症状或减轻疾病。The term "treatment" includes preventing the occurrence of a disease or symptom; inhibiting a disease or symptom or alleviating a disease.
术语“个体”、“受试者”、“宿主”和“患者”在本发明中可互换地使用,并且表示期望对其进行诊断、治疗或疗法的任何哺乳动物受试者,特别是人。The terms "individual", "subject", "host" and "patient" are used interchangeably in the present invention and refer to any mammalian subject, particularly a human, for whom diagnosis, treatment or therapy is desired. .
实施例中未注明具体技术或条件者,按照本领域内的文献所描述的常规技术或条件,或者按照产品制造商的说明书进行。If specific techniques or conditions are not specified in the examples, the conventional techniques or conditions described in literature in the field or the instructions of the product manufacturer shall be followed.
实施例中所用引物均由上海生工生物工程股份有限公司和苏州金唯智生物科技有限公司合成,所用试剂或仪器未注明生产厂商者,视为可通过正规渠道商购得的常规产品。The primers used in the examples were all synthesized by Shanghai Sangon Bioengineering Co., Ltd. and Suzhou Jinweizhi Biotechnology Co., Ltd. If the manufacturer of the reagents or instruments used is not indicated, they are regarded as conventional products that can be purchased through regular channels.
实施例1 SpCas12f1与gRNA_MS1在哺乳动物细胞中的基因编辑Example 1 Gene editing of SpCas12f1 and gRNA_MS1 in mammalian cells
本实施例中,以人胚胎肾细胞HEK293T为实验所用细胞。In this example, human embryonic kidney cell HEK293T was used as the cell used in the experiment.
1、pCMV-SpCas12f-gRNA_MS1质粒(不含基因靶向区段)构建1. Construction of pCMV-SpCas12f-gRNA_MS1 plasmid (without gene targeting segment)
生工生物工程(上海)股份有限公司合成如下序列:Sangon Bioengineering (Shanghai) Co., Ltd. synthesized the following sequence:
人源瞬时表达质粒骨架:5’-ctgtgccttctagttgccagccatctgttgtttgcccctcccccgtgccttccttgaccctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctggggggtggggtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgctggggatgcggtgggctctatggcttctgaggcggaaagaaccagctggggctcgataccgtcgacctctagctagagcttggcgtaatcatggtcatagctgtttcctgcaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtcccgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaa aggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgtccttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagcattgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgctgccggcacctgtcctacgagttgcatgataaagaagacagtcataagtgcggcgacgatagtcatgccccgcgcccaccggaaggagctgactgggttgaaggctctcaagggcatcggtcgacggatccttatcgattttaccacatttgtagaggttttacttgctttaaaaaacctcccacacctccccctgaacctgaaacataaaatgaatgcaattgttgttgttaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatgtctgctcgaagcggccggccgccccgactctagaaagtggctaagatctacagctgccttgtaagtcattggtcttaaaggtaccgatgcatggggtcgtgcgctcctttcggtcgggcgctgcgggtcgtggggcgggcg-3’(SEQ ID NO.43);Human transient expression plasmid backbone: 5’-ctgtgccttctagttgccagccatctgttgtttgcccctcccccgtgccttccttgaccctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctggggggtggggtggggcaggacagcaagggg gaggattgggaagacaatagcaggcatgctggggatgcggtgggctctatggcttctgaggcggaaagaaccagctggggctcgataccgtcgacctctagctagagcttggcgtaatcatggtcatagctgtttcctgcaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaa atatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggtta catcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggat ggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagca atggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggacccacttctgcgctcggcccttccggctggctggttattgctgataaatctggagccggtgagcgtgggtcccgcggtatcattgcagcactggggccagatggtaagccctccc gtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaa aggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagt tttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagat accaaatactgtccttctagtgtagccgtagttaggcacccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagct tggagcgaacgacctacaccgaactgagatacctacagcgtgagcattgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgt cgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggcctttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacga ccgagcgcagcgagtcagtgagcgaggaagcggaagagcgctgccggcacctgtcctacgagttgcatgataaagaagacagtcataagtgcggcgacgatagtcatgccccgcgcccaccggaaggagctgactgggttgaaggctctcaagggcatcggtcgacggatccttatcgattttaccacatttgtagaggttt tacttgctttaaaaaacctcccacacctcccctgaacctgaaacataaaatgaatgcaattgttgttgttaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatctttatcatgtctgctc gaagcggccggccgccccgactctagaaagtggctaagatctacagctgccttgtaagtcattggtcttaaaggtaccgatgcatggggtcgtgcgctcctttcggtcgggcgctgcgggtcgtggggcgggcg-3’(SEQ ID NO.43);
嘌呤霉素抗性基因表达盒:5’-ggggttggggttgcgccttttccaaggcagccctgggtttgcgcagggacgcggctgctctgggcgtggttccgggaaacgcagcggcgccgaccctgggactcgcacattcttcacgtccgttcgcagcgtcacccggatcttcgccgctacccttgtgggccccccggcgacgcttcctgctccgcccctaagtcgggaaggttccttgcggttcgcggcgtgccggacgtgacaaacggaagccgcacgtctcactagtaccctcgcagacggacagcgccagggagcaatggcagcgcgccgaccgcgatgggctgtggccaatagcggctgctcagcagggcgcgccgagagcagcggccgggaaggggcggtgcgggaggcggggtgtggggcggtagtgtgggccctgttcctgcccgcgcggtgttccgcattctgcaagcctccggagcgcacgtcggcagtcggctccctcgttgaccgaatcaccgacctctctccccagggggatccaccggagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggaaacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctga-3’(SEQ ID NO.44);Puromycin resistance gene expression cassette: 5’-ggggttggggttgcgccttttccaaggcagccctgggtttgcgcagggacgcggctgctctgggcgtggttccgggaaacgcagcggcgccgaccctgggactcgcacattcttcacgtccgttcgcagcgtcacccggatcttcgccgctacccttg tgggccccccggcgacgcttcctgctccgcccctaagtcgggaaggttccttgcggttcgcggcgtgccggacgtgacaaacggaagccgcacgtctcactagtaccctcgcagacggacagcgccagggagcaatggcagcgcgccgaccgcgatgggctgtggccaatagcggctgctcagcagggc gcgccgagagcagcggccggggaaggggcggtgcgggaggcggggtgtggggcggtagtgtgggccctgttcctgcccgcgcggtgttccgcattctgcaagcctccggagcgcacgtcggcagtcggctccctcgttgaccgaatcaccgacctctctccccagggggatccaccggagcttaccatgaccgag tacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtg ggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccac cgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggaaacctccgcgccccgcaacctcccctctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgc gcacctggtgcatgacccgcaagcccggtgcctga-3’(SEQ ID NO.44);
人源密码子优化的SpCas12f1编码基因表达盒:5’-gacattgattattgactagttattaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctggtttagtgaaccgtcagatccgctagagatccgcggccgctaatacgactcactatagggagagccgccaccatgcccaagaagaagaggaaagtcATGGGCGAGAGCGTGAAGGCCATCAAGCTGAAGATCCTGGATATGTTTCTGGACCCTGAGTGTACCAAGCAGGATGATAACTGGAGAAAGGACCTGAGCACCATGAGCAGATTTTGCGCCGAGGCCGGCAATATGTGCCTGAGGGATCTGTATAATTACTTTAGCATGCCTAAGGAGGATAGGATTAGCAGCAAGGACCTGTACAACGCCATGTACCACAAGACCAAGCTGCTGCACCCTGAGCTGCCCGGCAAGGTGGCCAACCAGATCGTGAACCACGCCAAGGATGTGTGGAAGAGAAACGCCAAGCTGATCTACAGAAACCAGATCAGCATGCCTACATACAAGATTACCACAGCCCCTATCAGGCTGCAGAACAACATCTACAAGCTGATCAAGAACAAAAATAAGTACATCATCGACGTGCAGCTGTATTCCAAGGAGTACTCCAAGGACTCTGGCAAGGGCACCCACAGGTACTTCCTGGTGGCCGTGAGAGACAGCAGCACAAGGATGATCTTCGACAGGATCATGTCTAAGGATCACATCGATTCCAGCAAGTCCTACACCCAGGGCCAGCTGCAGATCAAGAAGGACCACCAGGGCAAGTGGTACTGCATCATCCCATACACCTTCCCAACCCACGAGACAGTGCTGGACCCCGACAAGGTCATGGGCGTGGACCTGGGCGTGGCCAAGGCCGTGTACTGGGCCTTCAACAGCAGCTACAAGCGCGGCTGCATCGACGGCGGCGAGATCGAGCACTTCCGGAAGATGATCAGAGCCAGGAGAGTGAGCATCCAGAACCAGATCAAGCACTCCGGCGACGCAAGGAAGGGCCACGGCAGAAAGCGGGCCCTGAAGCCAATCGAAACCCTGTCTGAGAAGGAGAAGAACTTCCGGGACACCATCAACCACAGATACGCCAACAGGATCGTGGAGGCCGCCATTAAGCAGGGCTGCGGCACAATCCAGATCGAGAATCTGGAGGGCATCGCCGACACCACCGGCAGCAAGTTCCTGAAGAATTGGCCTTACTACGATCTGCAGACCAAGATCGTGAATAAGGCCAAGGAGCACGGCATCACCGTGGTGGCCATCAACCCTCAGTACACATCCCAGAGATGTTCCATGTGCGGCTACATCGAAAAGACAAACCGCAGCAGCCAGGCCGTGTTCGAGTGCAAGCAGTGCGGCTACGGCAGC AGGACCATCTGCATCAACTGTAGACACGTGCAGGTGAGCGGCGACGTGTGTGAGGAGTGCGGCGGCATCGTGAAGAAGGAAAACGTGAACGCCGATTACAATGCCGCCAAGAACATCAGCACCCCCTACATCGACCAGATTATCATGGAGAAGTGCCTGGAGCTGGGCATCCCCTACAGAAGCATCACCTGTAAGGAGTGCGGCCACATCCAGGCCAGCGGCAATACCTGCGAGGTGTGCGGCAGCACCAACATCCTGAAGCCTAAGAAGATCAGAAAGGCCAAGtctggtggttctcccaagaagaagaggaaagtctaaccggtcatcatcaccatcaccat-3’(SEQ ID NO.45);Human codon-optimized SpCas12f1 encoding gene expression cassette: 5’-gacattgattattgactagttattaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccatt gacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagta catcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgtttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctggtttagtgaacc gtcagatccgctagagatccgcggccgctaatacgactcactataggggagagccgccaccatgcccaagaagaagaggaaagtcATGGGCGAGAGCGTGAAGGCCATCAAGCTGAAGATCCTGGATATGTTTCTGGACCCTGAGTGTACCAAGCAGGATGATAACTGGAGAAAGGACCTGAGCACCATGAGCAGATTTTGCGCCGAGGCCGGCAATATGTGCCTGAGGGATCTGTATAATTACTTTAG CATGCCTAAGGAGGATAGGATTAGCAGCAAGGACCTGTACAACGCCATGTACCACAAGACCAAGCTGCTGCACCCTGAGCTGCCCGGCAAGGTGGCCAACCAGATCGTC CAAGGAGTACTCCAAGGACTCTGGCAAGGGCACCCACAGGTACTTCCTGGTGGCCGTGAGAGACAGCAGCACAAGGATGATCTTCGACAGGATCATGTCTAAGGATCACATCGATTCCAGCAAGTCCTACACCCAGGGCCAGCTGCAGATCAAGAAGGACCACCAGGGCAAGTGGTACTGCATCATCCCATACACCTTCCCAACCCACGAGACAGTGCTGGACCCCGACAAGGTCATGGGCGTGGACCTGGGCGTGGCCAAGGCCGTGTACT GGGCCTTCAACAGCAGCTACAAGCGCGGCTGCATCGACGGCGGCGAGATCGAGCACTTCCGGAAGATGATCAGAGCCAGGAGAGTGAGCATCCAGAACCAGATCAAGCACTCCGGCGACGCAAGGAAGGGCCACGGCAGAAAGCGGGCCCTGAAGCCAATCGAAACCCTGTCTGAGAAGGAGAAGAACTTCCGGGACACCATCAACCACAGATACGCCAACAGGATCGTGGAGGCCGCCATTAAGCAGGGCTGCGGCA CAATCCAGATCGAATCTGGAGGGCATCGCCGACACCACCGGCAGCAAGTTCCTGAAGAATTGGCCTTACTACGATCTGCAGACCAAGATCGTGAATAAGGCCAAGGAGCACGGCATCACCGTGGTGGCCATCAACCCTCAGTACACATCCCAGAGATGTTCCATGTGCGGCTACATCGAAAAGACAAACCGCAGCAGCCAGGCCGTGTTCGAGTGCAAGCAGTGCGGCTACGGCAGC AGGACCATCTGCATCAACTGTAG ACACGTGCAGGTGAGCGGCGACGTGTGTGAGGAGTGCGGCGGCATCGTGAAGAAGGAAAACGTGAACGCCGATTACAATGCCGCCAAGAACATCAGCACCCCCTACATCGACCAGATTATCATGGAGAAGTGCCTGGAGCTGGGCATCCCCTACAGAAGCATCACCTGTAAGGAGTGCGGCCACATCCAGGCCAGCGGCAATACCTGCGAGGTGTGCGGCAGCACCAACATCCTGAAGCCTAAGAAGATCAGAAAGGCCAA Gtctggtggttctcccaagaagaagaggaaagtctaaccggtcatcatcaccatcaccat-3’(SEQ ID NO.45);
人细胞中SpCas12f1相应的导向gRNA_MS1表达盒:5’-gagggcctatttcccatgattccttcatatttgcatatacgatacaaggctgttagagagataattggaattaatttgactgtaaacacaaagatattagtacaaaatacgtgacgtagaaagtaataatttcttgggtagtttgcagttttaaaattatgttttaaaatggactatcatatgcttaccgtaacttgaaagtatttcgatttcttggctttatatatcttgtggaaaggacgaaacaccGttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccaCGCGgcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaaccgagaccattggtctcatttttttgaattctcgacctcgagacaaatggcagtattcatccacaattttaaaagaaaaggggggattggggggtacagtgcaggggaaagaatagtagacataatagcaacagacatacaaactaaagaattacaaaaacaaattacaaaaattcaaaattttcgggtttattacagggacagcagagatccactttggccgcggctcgag-3’(SEQ ID NO.46)。The corresponding guide gRNA_MS1 expression cassette for SpCas12f1 in human cells: 5’-gagggcctatttcccatgattccttcatatttgcatatacgatacaaggctgttagagagataattggaattaatttgactgtaaacacaaagatattagtacaaaatacgtgacgtagaaagtaataatttcttgggtagtttgcagttttaaaattatgttttaaaaatggact atcatatgcttaccgtaacttgaaagtatttcgatttcttggctttatatatcttgtggaaaggacgaaacaccGttatactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggctacggcgaccttggcgaaatgccatcaataccaCGCGgc ccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaaccgagaccattggtctcatttttttgaattctcgacctcgagacaaatggcagtattcatccacaattttaaaagaaaaggggggattggggggtacagtgcaggggaaagaatagtagacataatagcaacagacatacaaacta aagaattacaaaaacaaattacaaaaattcaaaattttcgggtttattacagggacagcagagatccactttggccgcggctcgag-3’ (SEQ ID NO. 46).
由Gibson assembly技术将这四片段组装为pCMV-SpCas12f1质粒。将pCMV-SpCas12f1质粒转化至转化大肠杆菌DH5α细胞,挑取单克隆进行测序,得到pCMV-SpCas12f1质粒。These four fragments were assembled into pCMV-SpCas12f1 plasmid using Gibson assembly technology. The pCMV-SpCas12f1 plasmid was transformed into Escherichia coli DH5α cells, and single clones were picked for sequencing to obtain the pCMV-SpCas12f1 plasmid.
2、含靶向序列的pCMV-SpCas12f1-gRNA_MS1-G系列质粒构建2. Construction of pCMV-SpCas12f1-gRNA_MS1-G series plasmids containing targeting sequences
2.1靶向序列DNA制备2.1 Target sequence DNA preparation
本实施例中挑选了HEK293T细胞基因组中的VEGFA、AAVS1、EMX1、FANCF、ATP1A1F、TP53、HEXA、PRNP和PDCD1这9个基因作为靶序列,并在这些靶序列上挑选了32个20bp的靶向序列DNA。通过生工生物工程(上海)股份有限公司合成为如下序列:In this example, 9 genes including VEGFA, AAVS1, EMX1, FANCF, ATP1A1F, TP53, HEXA, PRNP and PDCD1 in the HEK293T cell genome were selected as target sequences, and 32 20 bp targets were selected on these target sequences. Sequence DNA. It was synthesized into the following sequence by Sangon Bioengineering (Shanghai) Co., Ltd.:
Guide1-F:5’-aaacacacagaggacccctagtaa-3’(SEQ ID NO.47);Guide1-F: 5’-aaacacacagaggacccctagtaa-3’(SEQ ID NO.47);
Guide1-R:5’-aaaattactaggggtcctctgtgt-3’(SEQ ID NO.48);Guide1-R: 5’-aaaattactaggggtcctctgtgt-3’ (SEQ ID NO.48);
Guide2-F:5’-AAACaatgcctggcacagcatggt-3’(SEQ ID NO.49);Guide2-F: 5’-AAACaatgcctggcacagcatggt-3’(SEQ ID NO.49);
Guide2-R:5’-AAAAaccatgctgtgccaggcatt-3’(SEQ ID NO.50);Guide2-R: 5’-AAAAaccatgctgtgccaggcatt-3’ (SEQ ID NO.50);
Guide3-F:5’-AAACctcttggggaaatgattctt-3’(SEQ ID NO.51);Guide3-F: 5’-AAACctcttggggaaatgattctt-3’(SEQ ID NO.51);
Guide3-R:5’-AAAAaagaatcatttccccaagag-3’(SEQ ID NO.52);Guide3-R: 5’-AAAAaagaatcatttccccaagag-3’ (SEQ ID NO.52);
Guide4-F:5’-AAACtcttgaaggggacaggctac-3’(SEQ ID NO.53);Guide4-F: 5’-AAACtcttgaaggggacaggctac-3’(SEQ ID NO.53);
Guide4-R:5’-AAAAgtagcctgtccccttcaaga-3’(SEQ ID NO.54);Guide4-R: 5’-AAAAgtagcctgtccccttcaaga-3’(SEQ ID NO.54);
Guide5-F:5’-AAACtcttgggcttggcaggcatg-3’(SEQ ID NO.55);Guide5-F: 5’-AAACtcttgggcttggcaggcatg-3’(SEQ ID NO.55);
Guide5-R:5’-AAAAcatgcctgccaagcccaaga-3(SEQ ID NO.56);’Guide5-R:5’-AAAAcatgcctgccaagcccaaga-3(SEQ ID NO.56);’
Guide6-F:5’-AAACtctccccagcctgtcaaagc-3’(SEQ ID NO.57);Guide6-F: 5’-AAACtctccccagcctgtcaaagc-3’(SEQ ID NO.57);
Guide6-R:5’-AAAAgctttgacaggctggggaga-3’(SEQ ID NO.58);Guide6-R: 5’-AAAAgctttgacaggctggggaga-3’ (SEQ ID NO.58);
Guide7-F:5’-AAACcgtgccagctcccaaggtag-3’(SEQ ID NO.59);Guide7-F: 5’-AAACcgtgccagctcccaaggtag-3’(SEQ ID NO.59);
Guide7-R:5’-AAAActaccttgggagctggcacg-3’(SEQ ID NO.60);Guide7-R: 5’-AAAActaccttgggagctggcacg-3’ (SEQ ID NO.60);
Guide8-F:5’-AAACtgtcgcagttgcaaatgaag-3’(SEQ ID NO.61);Guide8-F: 5’-AAACtgtcgcagttgcaaatgaag-3’(SEQ ID NO.61);
Guide8-R:5’-AAAActtcatttgcaactgcgaca-3’(SEQ ID NO.62);Guide8-R: 5’-AAAActtcatttgcaactgcgaca-3’(SEQ ID NO.62);
Guide9-F:5’-aaacAGTGCGACGCCGCGAGCCCC-3’(SEQ ID NO.63);Guide9-F: 5’-aaacAGTGCCGACGCCGCGAGCCCC-3’ (SEQ ID NO.63);
Guide9-R:5’-aaaaGGGGCTCGCGGCGTCGCACT-3’(SEQ ID NO.64);Guide9-R: 5’-aaaaGGGGCTCGCGGCGTCGCACT-3’(SEQ ID NO.64);
Guide10-F:5’-aaacCCTCTTTAGCCAGAGCCGGG-3’(SEQ ID NO.65);Guide10-F: 5’-aaacCCTCTTTAGCCAGAGCCGGG-3’(SEQ ID NO.65);
Guide10-R:5’-aaaaCCCGGCTCTGGCTAAAGAGG-3’(SEQ ID NO.66);Guide10-R: 5’-aaaaCCCGGCTCTGGCTAAAGAGG-3’ (SEQ ID NO.66);
Guide11-F:5’-AAACCTGGCTCTGCTCTTCAGACT-3’(SEQ ID NO.67);Guide11-F: 5’-AAACCTGGCTCTGCTCTTCAGACT-3’(SEQ ID NO.67);
Guide11-R:5’-AAAAAGTCTGAAGAGCAGAGCCAG-3’(SEQ ID NO.68);Guide11-R: 5’-AAAAAGTCTGAAGAGCAGAGCCAG-3’(SEQ ID NO.68);
Guide12-F:5’-AAACACCAGGTCGTGGCCGCCTCT-3’(SEQ ID NO.69);Guide12-F: 5’-AAACACCAGGTCGTGGCCGCCTCT-3’(SEQ ID NO.69);
Guide12-R:5’-AAAAAGAGGCGGCCACGACCTGGT-3’(SEQ ID NO.70);Guide12-R: 5’-AAAAAGAGGCGGCCACGACCTGGT-3’(SEQ ID NO.70);
Guide13-F:5’-AAACCTCCGTGCGTCAGTTTTACC-3’(SEQ ID NO.71);Guide13-F: 5’-AAACCTCCGTGCGTCAGTTTTACC-3’(SEQ ID NO.71);
Guide13-R:5’-AAAAGGTAAAACTGACGCACGGAG-3’(SEQ ID NO.72);Guide13-R: 5’-AAAAGGTAAAACTGACGCACGGAG-3’(SEQ ID NO.72);
Guide14-F:5’-AAACTAACTTTGGCTCTTCACCTT-3’(SEQ ID NO.73);Guide14-F: 5’-AAAACTAACTTTGGCTCTTCACCTT-3’(SEQ ID NO.73);
Guide14-R:5’-AAAAAAGGTGAAGAGCCAAAGTTA-3’(SEQ ID NO.74);Guide14-R: 5’-AAAAAAGGTGAAGAGCCAAAGTTA-3’(SEQ ID NO.74);
Guide15-F:5’-AAACTGGGAGAGGGTAGCGCAGGG-3’(SEQ ID NO.75);Guide15-F: 5’-AAACTGGGAGAGGGTAGCGCAGGG-3’(SEQ ID NO.75);
Guide15-R:5’-AAAACCCTGCGCTACCCTCTCCCA-3’(SEQ ID NO.76);Guide15-R: 5’-AAAACCCTGCGCTACCCTCTCCCA-3’(SEQ ID NO.76);
Guide16-F:5’-AAACTCAGTGGCCACCCTGCGCTA-3’(SEQ ID NO.77);Guide16-F: 5’-AAACTCAGTGGCCACCCTGCGCTA-3’(SEQ ID NO.77);
Guide16-R:5’-AAAATAGCGCAGGGTGGCCACTGA-3’(SEQ ID NO.78);Guide16-R: 5’-AAAATAGCGCAGGGGTGGCCACTGA-3’(SEQ ID NO.78);
Guide17-F:5’-AAACTGGCAAGGAGAGAGATGGCT-3’(SEQ ID NO.79);Guide17-F: 5’-AAACTGGCAAGGAGAGAGATGGCT-3’(SEQ ID NO.79);
Guide17-R:5’-AAAAAGCCATCTCTCTCCTTGCCA-3’(SEQ ID NO.80);Guide17-R: 5’-AAAAAGCCATCTCTCTCCTTGCCA-3’(SEQ ID NO.80);
Guide18-F:5’-AAACTGGGTACTTTTATCTGTCCC-3’(SEQ ID NO.81);Guide18-F: 5’-AAACTGGGTACTTTTATCTGTCCC-3’(SEQ ID NO.81);
Guide18-R:5’-AAAAGGGACAGATAAAAGTACCCA-3’(SEQ ID NO.82);Guide18-R: 5’-AAAAGGGACAGATAAAAGTACCCA-3’(SEQ ID NO.82);
Guide19-F:5’-AAACTCCTGTGGATTCGGGTCACC-3’(SEQ ID NO.83);Guide19-F: 5’-AAAACTCCTGTGGATTCGGGTCACC-3’(SEQ ID NO.83);
Guide19-R:5’-AAAAGGTGACCCGAATCCACAGGA-3’(SEQ ID NO.84);Guide19-R: 5’-AAAAGGTGACCCGAATCCACAGGA-3’(SEQ ID NO.84);
Guide20-F:5’-AAACctgacccaataatgggtttt-3’(SEQ ID NO.85);Guide20-F: 5’-AAACctgacccaataatgggtttt-3’(SEQ ID NO.85);
Guide20-R:5’-AAAAaaaacccattattgggtcag-3’(SEQ ID NO.86);Guide20-R: 5’-AAAAaaaacccattattgggtcag-3’ (SEQ ID NO.86);
Guide21-F:5’-AAACctgtttcaagctgtggcatt-3’(SEQ ID NO.87);Guide21-F: 5’-AAACctgtttcaagctgtggcatt-3’(SEQ ID NO.87);
Guide21-R:5’-AAAAaatgccacagcttgaaacag-3’(SEQ ID NO.88);Guide21-R: 5’-AAAAaatgccacagcttgaaacag-3’ (SEQ ID NO.88);
Guide22-F:5’-AAACaaccttgtccttgttgtaac-3’(SEQ ID NO.89);Guide22-F: 5’-AAACaaccttgtccttgttgtaac-3’(SEQ ID NO.89);
Guide22-R:5’-AAAAgttacaacaaggacaaggtt-3’(SEQ ID NO.90);Guide22-R: 5’-AAAAgttacaacaaggacaaggtt-3’(SEQ ID NO.90);
Guide23-F:5’-AAACtaaggccatgtccctcttgt-3’(SEQ ID NO.91);Guide23-F: 5’-AAACtaaggccatgtccctcttgt-3’(SEQ ID NO.91);
Guide23-R:5’-AAAAacaagagggacatggcctta-3’(SEQ ID NO.92);Guide23-R: 5’-AAAAacaagagggacatggcctta-3’(SEQ ID NO.92);
Guide24-F:5’-AAACgccataatgactgctctgca-3’(SEQ ID NO.93);Guide24-F: 5’-AAACgccataatgactgctctgca-3’(SEQ ID NO.93);
Guide24-R:5’-AAAAtgcagagcagtcattatggc-3’(SEQ ID NO.94);Guide24-R: 5’-AAAAtgcagagcagtcattatggc-3’(SEQ ID NO.94);
Guide25-F:5’-AAACcatcctccaggcttcgggcg-3’(SEQ ID NO.95);Guide25-F: 5’-AAACcatcctccaggcttcgggcg-3’(SEQ ID NO.95);
Guide25-R:5’-AAAAcgcccgaagcctggaggatg-3’(SEQ ID NO.96);Guide25-R: 5’-AAAAcgcccgaagcctggaggatg-3’(SEQ ID NO.96);
Guide26-F:5’-AAACagtttgaaagagagttaatt-3’(SEQ ID NO.97);Guide26-F:5’-AAACagtttgaaagagagttaatt-3’(SEQ ID NO.97);
Guide26-R:5’-AAAAaattaactctctttcaaact-3’(SEQ ID NO.98);Guide26-R: 5’-AAAAaattaactctctttcaaact-3’ (SEQ ID NO.98);
Guide27-F:5’-AAACcccaggctcaaacctcccct-3’(SEQ ID NO.99);Guide27-F: 5’-AAACcccaggctcaaacctcccct-3’(SEQ ID NO.99);
Guide27-R:5’-AAAAaggggaggtttgagcctggg-3’(SEQ ID NO.100);Guide27-R: 5’-AAAAaggggaggtttgagcctggg-3’ (SEQ ID NO.100);
Guide28-F:5’-AAACcatggctcagtaccagcaag-3’(SEQ ID NO.101);Guide28-F: 5’-AAACcatggctcagtaccagcaag-3’(SEQ ID NO.101);
Guide28-R:5’-AAAActtgctggtactgagccatg-3’(SEQ ID NO.102);Guide28-R: 5’-AAAActtgctggtactgagccatg-3’ (SEQ ID NO.102);
Guide29-F:5’-aaacCCTCACTCCTGCTCGGTGAA-3’(SEQ ID NO.103);Guide29-F: 5’-aaacCCTCACTCCTGCTCGGTGAA-3’(SEQ ID NO.103);
Guide29-R:5’-aaaaTTCACCGAGCAGGAGTGAGG-3’(SEQ ID NO.104);Guide29-R: 5’-aaaaTTCACCGAGCAGGAGTGAGG-3’ (SEQ ID NO.104);
Guide30-F:5’-AAACtgtaattcagcatatggatt-3’(SEQ ID NO.105);Guide30-F:5’-AAACtgtaattcagcatatggatt-3’(SEQ ID NO.105);
Guide30-R:5’-AAAAaatccatatgctgaattaca-3’(SEQ ID NO.106);Guide30-R: 5’-AAAAaatccatatgctgaattaca-3’ (SEQ ID NO.106);
Guide31-F:5’-AAACtgaagttagttagctacaac-3’(SEQ ID NO.107);Guide31-F:5’-AAACtgaagttagttagctacaac-3’(SEQ ID NO.107);
Guide31-R:5’-AAAAgttgtagctaactaacttca-3’(SEQ ID NO.108);Guide31-R: 5’-AAAAgttgtagctaactaacttca-3’ (SEQ ID NO.108);
Guide32-F:5’-AAACctacacacaaaacaaatatt-3’(SEQ ID NO.109);Guide32-F: 5’-AAACctacacacaaaacaaatatt-3’(SEQ ID NO.109);
Guide32-R:5’-AAAAaatatttgttttgtgtgtag-3’(SEQ ID NO.110);Guide32-R: 5’-AAAAaatatttgttttgtgtgtag-3’ (SEQ ID NO.110);
靶向序列DNA的磷酸化和退火:按下表1分别配制退火体系。Phosphorylation and annealing of target sequence DNA: Prepare the annealing system according to Table 1.
表1Table 1
组分 Components 体积volume
10×T4 DNA ligase Buffer(NEB)10×T4 DNA ligase Buffer(NEB) 1μl1μl
Guide-FGuide-F 2μl2μl
Guide-RGuide-R 2μl2μl
T4 PNK(NEB)T4 PNK(NEB) 0.5μl0.5μl
无菌无酶水Sterile enzyme-free water 4.5μl4.5μl
磷酸化程序为:37℃反应30分钟。随后在10μl反应体系中加入终浓度50mM NaCl,缓慢退火,得到退火后的靶向序列DNA。The phosphorylation procedure is: react at 37°C for 30 minutes. Then add a final concentration of 50mM NaCl to the 10μl reaction system and anneal slowly to obtain the annealed target sequence DNA.
2.2靶向序列DNA插入2.2 Targeting sequence DNA insertion
接着通过Golden gate assembly将靶向序列DNA插入到pCMV-SpCas12f1-gRNA_MS1质粒中,分别构建成pCMV-SpCas12f1-gRNA_MS1-G系列质粒(包含pCMV-SpCas12f1-gRNA_MS1-G1至pCMV-SpCas12f1-gRNA_MS1-G32质粒共32个质粒)。Golden gate assembly体系如下表2所示。Then, the target sequence DNA was inserted into the pCMV-SpCas12f1-gRNA_MS1 plasmid through the Golden gate assembly to construct the pCMV-SpCas12f1-gRNA_MS1-G series plasmids (including pCMV-SpCas12f1-gRNA_MS1-G1 to pCMV-SpCas12f1-gRNA_MS1-G32 plasmids). A total of 32 plasmids). The Golden gate assembly system is shown in Table 2 below.
表2Table 2
组分Components 含量content
10×T4 DNA ligase Buffer(NEB)10×T4 DNA ligase Buffer(NEB) 1μl1μl
pCMV-SpCas12f1-gRNA_MS1pCMV-SpCas12f1-gRNA_MS1 50ng50ng
靶向序列DNATargeting sequence DNA 0.5μl0.5μl
T4 DNA ligase(NEB)T4 DNA ligase(NEB) 0.5μl0.5μl
BsaI(NEB)BsaI(NEB) 0.5μl0.5μl
无菌无酶水Sterile enzyme-free water 至10μlto 10μl
连接程序:37℃ 2min,16℃ 3min,重复这一步骤共25个循环,最后50℃ 10min,80℃ 10min。Connection procedure: 37℃ for 2min, 16℃ for 3min, repeat this step for a total of 25 cycles, finally 50℃ for 10min, 80℃ for 10min.
转化连接产物至大肠杆菌DH5α细胞,挑取单克隆进行测序,得到含靶向序 列的pCMV-SpCas12f1-gRNA_MS1-G系列质粒(瞬时表达质粒)。The ligation product was transformed into E. coli DH5α cells, and single clones were selected for sequencing to obtain pCMV-SpCas12f1-gRNA_MS1-G series plasmids (transient expression plasmids) containing the targeting sequence.
3、pCMV-SpCas12f1-gRNA_MS1-G系列质粒介导的人细胞基因编辑3. Human cell gene editing mediated by pCMV-SpCas12f1-gRNA_MS1-G series plasmids
在含有体积百分比为10%FBS的DMEM培养基中培养活化后的HEK293T细胞,待细胞生长密度至90%左右,传代至24孔板中,每孔细胞数量约为1.0×10 5个。16-18小时后,向每孔中用1.5μL的lipofectamine3000(Invitrogen)分别转染1000ng编辑不同基因的pCMV-SpCas12f1-gRNA_MS1-G质粒至细胞中。24小时后,加入含有终浓度为2μg/ml的嘌呤霉素的新鲜培养基进行筛选。继续培养48小时后,消化贴壁细胞,提取基因组DNA。 Culture the activated HEK293T cells in DMEM medium containing 10% FBS by volume. When the cell growth density reaches about 90%, they are passaged into a 24-well plate. The number of cells in each well is about 1.0×10 5 . 16-18 hours later, 1000ng of pCMV-SpCas12f1-gRNA_MS1-G plasmid editing different genes was transfected into the cells using 1.5 μL lipofectamine3000 (Invitrogen) in each well. After 24 hours, fresh medium containing puromycin at a final concentration of 2 μg/ml was added for selection. After continuing to culture for 48 hours, the adherent cells were digested and genomic DNA was extracted.
PCR扩增靶向序列的目的基因片段,胶回收后的PCR产物用NEBuffer2(NEB)进行退火。随后在PCR反应体系加入T7 endonuclease 1(NEB),37℃酶切15min,然后加入6×Gel Loading Dye(NEB)终止反应。反应产物通过6%的TBE-PAGE分离,并通过4S Red dye(生工生物工程(上海)股份有限公司)进行染色成像。PCR amplifies the target gene fragment of the target sequence, and the PCR product after gel recovery is annealed with NEBuffer2 (NEB). Then add T7 endonuclease 1 (NEB) to the PCR reaction system, digest at 37°C for 15 minutes, and then add 6×Gel Loading Dye (NEB) to terminate the reaction. The reaction products were separated by 6% TBE-PAGE and stained and imaged by 4S Red dye (Sangon Bioengineering (Shanghai) Co., Ltd.).
4、结果图2为CRISPR/SpCas12f1编辑系统(pCMV-SpCas12f1-gRNA_MS1系列质粒)介导的哺乳动物细胞的基因编辑结果图。如图所示,CRISPR/SpCas12f1编辑系统成功实现了对哺乳动物细胞基因组上多个基因的基因编辑。4. Results Figure 2 shows the gene editing results of mammalian cells mediated by the CRISPR/SpCas12f1 editing system (pCMV-SpCas12f1-gRNA_MS1 series plasmids). As shown in the figure, the CRISPR/SpCas12f1 editing system successfully achieved gene editing of multiple genes on the mammalian cell genome.
实施例2 SpCas12f1核酸酶相应的向导RNA的工程化Example 2 Engineering of guide RNA corresponding to SpCas12f1 nuclease
本实施例在gRNA_MS1的基础上,对SpCas12f1核酸酶相应的向导RNA的tracrRNA和crRNA进行单独改造和组合改造,具体改造位点和相应碱基长度示意图如图3所示,具体工程化的方法和改造的碱基长度如下序列所示:In this example, on the basis of gRNA_MS1, the tracrRNA and crRNA of the guide RNA corresponding to the SpCas12f1 nuclease were individually transformed and combined. The specific transformation sites and corresponding base lengths are shown in Figure 3. The specific engineering methods and The modified base length is as shown in the following sequence:
gRNA_MS1的编码核苷酸:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa-3’(SEQ ID NO.20); Coding nucleotides of gRNA_MS1: 5'-ttatctctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgc gtaagcgcgtggattgaaac acacagaggacccctagtaa -3' (SEQ ID NO. 20);
gRNA_MS2的编码核苷酸,在tracrRNA 5’端截断7nt碱基:5’-gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa-3’(SEQ ID NO.21); The coding nucleotide of gRNA_MS2, truncating the 7nt base at the 5' end of tracrRNA: 5'-gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaacc gctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa -3' (SEQ ID NO. 21);
gRNA_MS3的编码核苷酸,在tracrRNA 5’端截断16nt碱基:5’-gccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa-3’(SEQ ID NO.22); The coding nucleotide of gRNA_MS3, truncated 16nt base at the 5' end of tracrRNA: 5'-gccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatct tgcgtaagcgcgtggattgaaacacacagaggacccctagtaa -3' (SEQ ID NO. 22);
gRNA_MS4的编码核苷酸,在tracrRNA3’端截断6nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaacgaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.23); The coding nucleotide of gRNA_MS4, truncated 6nt bases at the 3' end of tracrRNA: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaacgaacc gctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 23);
gRNA_MS5的编码核苷酸,在tracrRNA3’端截断15nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.24); The coding nucleotide of gRNA_MS5, truncated 15nt base at the 3' end of tracrRNA: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgaaccgctgtcg catcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 24);
gRNA_MS6的编码核苷酸,在tracrRNA3’端截断28nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcgggaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.25); The coding nucleotide of gRNA_MS6, truncated 28nt bases at the 3' end of tracrRNA: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcgggaaccgctgtcgcatcttgcgtaagc gcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 25);
gRNA_MS7的编码核苷酸,在crRNA 5’端截断8nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.26); The coding nucleotide of gRNA_MS7, truncated 8nt base at the 5' end of crRNA: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtag tcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 26);
gRNA_MS8的编码核苷酸,在crRNA 5’端截断21nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtataagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.27); The coding nucleotide of gRNA_MS8, truncated 21nt bases at the 5' end of crRNA: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgag tataagcgcgtggattgaaacacacagaggacccctagtaa -3' (SEQ ID NO. 27);
gRNA_MS9的编码核苷酸,在tracrRNA 5’端截断4nt碱基,同时在tracrRNA 3’端截断3nt碱基:5’-tctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.28); The coding nucleotide of gRNA_MS9 truncates 4nt bases at the 5' end of tracrRNA and truncates 3nt bases at the 3' end of tracrRNA: 5'-tctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaa agggttcgcgcgaaactgagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 28);
gRNA_MS10的编码核苷酸,在tracrRNA 5’端截断7nt碱基,同时在tracrRNA 3’端截断6nt碱基:5’- gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaacgaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.29); The coding nucleotide of gRNA_MS10 truncates 7nt bases at the 5' end of tracrRNA and truncates 6nt bases at the 3' end of tracrRNA: 5'- gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaag ggttcgcgcgaaacgaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 29);
gRNA_MS11的编码核苷酸,在tracrRNA 5’端截断10nt碱基,同时在tracrRNA 3’端截断9nt碱基:5’-tcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.30); The coding nucleotide of gRNA_MS11 truncates 10nt bases at the 5' end of tracrRNA and truncates 9nt bases at the 3' end of tracrRNA: 5'-tcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggt tcgcgcgagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 30);
gRNA_MS12的编码核苷酸,在tracrRNA 5’端截断7nt碱基,在tracrRNA 3’端截断6nt碱基,同时在crRNA 5’端截断8nt碱基,并且在tracrRNA和crRNA之间添加AAGG四个碱基进行连接:5’-gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaacAAGGgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.31); The coding nucleotide of gRNA_MS12 truncates 7nt bases at the 5' end of tracrRNA, truncates 6nt bases at the 3' end of tracrRNA, and truncates 8nt bases at the 5' end of crRNA, and adds AAGG four bases between tracrRNA and crRNA Base for ligation: 5'-gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaacAAGGgtcgcatcttgcgtaagcgcgtggattgaaac a cacagaggacccctagtaa -3' (SEQ ID NO. 31);
gRNA_MS13的编码核苷酸,在tracrRNA 5’端截断7nt碱基,在tracrRNA 3’端截断6nt碱基,同时在crRNA 5’端截断21nt碱基,并且在tracrRNA和crRNA之间添加AAGG四个碱基进行连接:5’-gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaacAAGGtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.32); The coding nucleotide of gRNA_MS13 truncates 7nt bases at the 5' end of tracrRNA, truncates 6nt bases at the 3' end of tracrRNA, and truncates 21nt bases at the 5' end of crRNA, and adds AAGG four bases between tracrRNA and crRNA Base for ligation: 5'-gtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaacAAGGtaagcgcgtggattgaaac acacagaggacccctagtaa -3'(SEQ ID NO.32);
gRNA_MS14的编码核苷酸,截掉茎环1,16nt碱基:5’-ttactctgtttcgcgcgccataaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.33); Coding nucleotide of gRNA_MS14, cut off stem loop 1, 16nt base: 5'-ttactctgtttcgcgcgccataaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatct tgcgtaagcgcgtggattgaaacacacagaggacccctagtaa -3' (SEQ ID NO. 33);
gRNA_MS15的编码核苷酸,截掉茎环2的上半部分,16nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.34); Coding nucleotide of gRNA_MS15, cut off the upper half of stem loop 2, 16nt base: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtc gcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO.34);
gRNA_MS16的编码核苷酸,截掉整个茎环2,32nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagtctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac aca cagaggacccctagtaa–3’(SEQ ID NO.35); The coding nucleotide of gRNA_MS16, cut off the entire stem loop 2, 32nt bases: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagtctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcg cgtggattgaaac aca cagaggacccctagtaa –3’ (SEQ ID NO. 35);
gRNA_MS17的编码核苷酸,截掉茎环3,8nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.36); The coding nucleotide of gRNA_MS17, cut off the stem loop 3, 8nt base: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgt cgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO. 36);
gRNA_MS18的编码核苷酸,截掉茎环4,11nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa–3’(SEQ ID NO.37); Coding nucleotide of gRNA_MS18, cut off stem loop 4, 11nt base: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcg catcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa –3’ (SEQ ID NO.37);
gRNA_MS19的编码核苷酸,在crRNA 3’末端加上UUUUAUUUUUU(SEQ ID NO.169)11nt碱基:5’-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacgcggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaattttatttttt–3’(SEQ ID NO.38)。 The coding nucleotide of gRNA_MS19, add UUUUAUUUUUU (SEQ ID NO.169) 11nt base at the 3' end of crRNA: 5'-ttactctgtttcgcgcgccagggcagttaggtgccctaaaagagcgaagtggccgaaaggaaaggctaacgcttctctaacgctacggcgaccttggcgaaatgccatcaataccacg cggcccgaaagggttcgcgcgaaactgagtagaaccgctgtcgcatcttgcgtaagcgcgtggattgaaac acacagaggacccctagtaa ttttatttttt–3′ (SEQ ID NO. 38).
相应地,向导RNA的核苷酸序列如SEQ ID NO.1~SEQ ID NO.19所示。Correspondingly, the nucleotide sequence of the guide RNA is shown in SEQ ID NO.1 to SEQ ID NO.19.
本实施例中,以人胚胎肾细胞HEK293T为实验所用细胞。In this example, human embryonic kidney cell HEK293T was used as the cell used in the experiment.
下划线部分是所述的靶向序列,是优选PAM序列后的20bp片段。The underlined part is the targeting sequence, which is preferably the 20 bp fragment after the PAM sequence.
1、pCMV-SpCas12f1-gRNA_MS2-MS19-G1系列质粒构建1. Construction of pCMV-SpCas12f1-gRNA_MS2-MS19-G1 series plasmids
质粒构建步骤同实施例1的第1点,仅将其中的人细胞中SpCas12f1相应的导向gRNA_MS1表达盒替换为导向gRNA_MS2-gRNA_MS19表达盒,为pCMV-SpCas12f1-gRNA_MS2、pCMV-SpCas12f1-gRNA_MS3、pCMV-SpCas12f1-gRNA_MS4、pCMV-SpCas12f1-gRNA_MS5、pCMV-SpCas12f1-gRNA_MS6、pCMV-SpCas12f1-gRNA_MS7、pCMV-SpCas12f1-gRNA_MS8、pCMV-SpCas12f1-gRNA_MS9、pCMV-SpCas12f1-gRNA_MS10、pCMV-SpCas12f1-gRNA_MS11、pCMV-SpCas12f1-gRNA_MS12、pCMV-SpCas12f1-gRNA_MS13、pCMV-SpCas12f1-gRNA_MS14、pCMV-SpCas12f1-gRNA_MS15、pCMV-SpCas12f1-gRNA_MS16、pCMV-SpCas12f1-gRNA_MS17、 pCMV-SpCas12f1-gRNA_MS18、pCMV-SpCas12f1-gRNA_MS19。The plasmid construction steps are the same as the first point in Example 1, except that the guide gRNA_MS1 expression cassette corresponding to SpCas12f1 in human cells is replaced with the guide gRNA_MS2-gRNA_MS19 expression cassette, which is pCMV-SpCas12f1-gRNA_MS2, pCMV-SpCas12f1-gRNA_MS3, pCMV- SpCas12f1-gRNA_MS4, pCMV-SpCas12f1-gRNA_MS5, pCMV-SpCas12f1-gRNA_MS6, pCMV-SpCas12f1-gRNA_MS7, pCMV-SpCas12f1-gRNA_MS8, pCMV-SpCas12f1-gRNA_MS9, pCMV-SpCas12f1-gRNA_MS10, pCMV-SpCas 12f1-gRNA_MS11, pCMV-SpCas12f1- gRNA_MS12, pCMV-SpCas12f1-gRNA_MS13, pCMV-SpCas12f1-gRNA_MS14, pCMV-SpCas12f1-gRNA_MS15, pCMV-SpCas12f1-gRNA_MS16, pCMV-SpCas12f1-gRNA_MS17, pCMV-SpCas12f1-gRNA_MS18, pCMV-SpCas1 2f1-gRNA_MS19.
将上述质粒按照实施例1的第2点,插入相同的靶向序列Guide 1,得到含有靶向序列的质粒。Insert the same targeting sequence Guide 1 into the above plasmid according to point 2 of Example 1 to obtain a plasmid containing the targeting sequence.
2、pCMV-SpCas12f1-gRNA_MS1-MS19-G1系列质粒介导的人细胞基因编辑2. Human cell gene editing mediated by pCMV-SpCas12f1-gRNA_MS1-MS19-G1 series plasmids
操作步骤同实施例1的第3点,仅将其中的pCMV-SpCas12f1-gRNA_MS1-G系列质粒,替换为pCMV-SpCas12f1-gRNA_MS1-MS19-G1系列质粒。The operation steps are the same as point 3 in Example 1, except that the pCMV-SpCas12f1-gRNA_MS1-G series plasmids are replaced with pCMV-SpCas12f1-gRNA_MS1-MS19-G1 series plasmids.
3、结果3. Result
工程化的SpCas12f1的gRNA_MS1到gRNA_MS19在哺乳动物细胞中的基因编辑结果如图4所示。如图所示,工程化后的SpCas12f1的gRNA_MS2、gRNA_MS4、gRNA_MS7、gRNA_MS10、gRNA_MS12、gRNA_MS13能提高SpCas12f1在哺乳动物细胞中的基因编辑效率,其中效率最高的是gRNA_MS13;gRNA_MS19对细胞编辑影响不大;去除gRNA上四个茎环中任意一个茎环均对活性影响很大,故四个茎环均不能去除。The gene editing results of engineered SpCas12f1 gRNA_MS1 to gRNA_MS19 in mammalian cells are shown in Figure 4. As shown in the figure, the engineered gRNA_MS2, gRNA_MS4, gRNA_MS7, gRNA_MS10, gRNA_MS12, and gRNA_MS13 of SpCas12f1 can improve the gene editing efficiency of SpCas12f1 in mammalian cells, among which gRNA_MS13 is the most efficient; gRNA_MS19 has little effect on cell editing; Removing any one of the four stem-loops on the gRNA will have a great impact on the activity, so all four stem-loops cannot be removed.
实施例3工程化后的SpCas12f1核酸酶gRNA在哺乳动物细胞中的基因编辑Example 3 Gene editing of engineered SpCas12f1 nuclease gRNA in mammalian cells
本实施例中对所述的SpCas12f1核酸酶的进一步限定如实施例1中所述。Further limitations on the SpCas12f1 nuclease in this example are as described in Example 1.
本实施例中所述的优选的SpCas12f1工程化后的向导RNA基因序列为gRNA_MS13:5’-guuucgcgcgccagggcaguuaggugcccuaaaagagcgaaguggccgaaaggaaaggcuaacgcuucucuaacgcuacggcgaccuuggcgaaaugccaucaauaccacgcggcccgaaaggguucgcgcgaaacAAGGuaagcgcguggauugaaac acacagaggaccccuaguaa–3’ The preferred SpCas12f1 engineered guide RNA gene sequence described in this example is gRNA_MS13: 5'-guuucgcgcgccagggcaguuaggugcccuaaaagagcgaaguggccgaaaggaaaggcuaacgcuucucuaacgcuacggcgaccuuggcgaaaugccaucaauaccacgcggcccgaaaggguucgcgcgaaacAAGGuaagcg cguggauugaaac acacagaggaccccuaguaa –3'
本实施例中,以人胚胎肾细胞HEK293T为实验所用细胞。In this example, human embryonic kidney cell HEK293T was used as the cell used in the experiment.
1、pCMV-SpCas12f1-gRNA_MS13-G系列质粒构建1. Construction of pCMV-SpCas12f1-gRNA_MS13-G series plasmids
质粒构建步骤同实施例1的第1点,仅将其中的人细胞中SpCas12f1相应的导向gRNA_MS1表达盒替换为导向gRNA_MS13表达盒。将上述质粒按照实施例1的第2点,插入不同的靶向序列,得到含有靶向序列的pCMV-SpCas12f1-gRNA_MS13-G系列质粒。The plasmid construction steps are the same as the first point in Example 1, except that the guide gRNA_MS1 expression cassette corresponding to SpCas12f1 in human cells is replaced with the guide gRNA_MS13 expression cassette. According to point 2 of Example 1, different targeting sequences were inserted into the above plasmids to obtain pCMV-SpCas12f1-gRNA_MS13-G series plasmids containing targeting sequences.
2、pCMV-SpCas12f1-gRNA_MS13-G系列质粒介导的人细胞基因编辑2. Human cell gene editing mediated by pCMV-SpCas12f1-gRNA_MS13-G series plasmids
操作步骤同实施例1的第3点,仅将其中的pCMV-SpCas12f1-gRNA_MS1-G 系列质粒,替换为pCMV-SpCas12f1-gRNA_MS13-G系列质粒。The operation steps are the same as point 3 in Example 1, except that the pCMV-SpCas12f1-gRNA_MS1-G series plasmids are replaced with pCMV-SpCas12f1-gRNA_MS13-G series plasmids.
3、结果3. Result
gRNA_MS1和gRNA_MS13在HEK293T细胞中编辑效率的比较如图5所示。由图可知,利用工程化后的gRNA_MS13可以实现在哺乳动物细胞中的高效编辑,相同位点的编辑效率比gRNA_MS1最高可提高9倍左右,如Guide3位点可由2.58%的编辑效率提高到21.95%,大大提高了SpCas12f1在哺乳动物细胞基因编辑的活性和通用性。Comparison of the editing efficiency of gRNA_MS1 and gRNA_MS13 in HEK293T cells is shown in Figure 5. It can be seen from the figure that the engineered gRNA_MS13 can be used to achieve efficient editing in mammalian cells. The editing efficiency of the same site can be increased by up to about 9 times compared with gRNA_MS1. For example, the editing efficiency of the Guide3 site can be increased from 2.58% to 21.95%. , greatly improving the activity and versatility of SpCas12f1 in mammalian cell gene editing.
最后应当说明的是,虽然以上详细描述了本发明实施例的具体方式,但是本领域的技术人员应当理解,这些仅是举例说明,在不背离本发明的原理和实质的前提下,可以对本发明的实施方式进行修改或者等同替换。因此,本发明的保护范围由所附权利要求书限定。Finally, it should be noted that although the specific embodiments of the present invention have been described in detail above, those skilled in the art should understand that these are only examples, and the present invention can be modified without departing from the principle and essence of the present invention. The implementation is modified or equivalently replaced. Accordingly, the scope of the present invention is defined by the appended claims.

Claims (25)

  1. 一种向导RNA,其特征在于,其包含tracr RNA序列和crRNA序列;所述crRNA序列包含能够与靶序列杂交的基因靶向区段和tracr配对物序列;所述tracr RNA序列和tracr配对物序列构成向导RNA的骨架序列;A guide RNA, characterized in that it includes a tracr RNA sequence and a crRNA sequence; the crRNA sequence includes a gene targeting segment capable of hybridizing with a target sequence and a tracr partner sequence; the tracr RNA sequence and a tracr partner sequence The backbone sequence that makes up the guide RNA;
    其中,所述tracrRNA包含如SEQ ID NO.111所示的核苷酸序列或与其具有至少50%以上同一性的变体序列;Wherein, the tracrRNA includes the nucleotide sequence shown in SEQ ID NO. 111 or a variant sequence having at least 50% identity with it;
    所述tracr配对物序列包含如SEQ ID NO.112所示的核苷酸序列或与其具有至少50%以上同一性的变体序列。The tracr partner sequence includes the nucleotide sequence shown in SEQ ID NO. 112 or a variant sequence having at least 50% identity with it.
  2. 如权利要求1所述的向导RNA,其特征在于,包含以下至少任一项:The guide RNA of claim 1, comprising at least any one of the following:
    1)所述基因靶向区段位于所述crRNA序列的3’端;1) The gene targeting segment is located at the 3’ end of the crRNA sequence;
    2)所述向导RNA包含在所述tracrRNA中互补配对的碱基对的一个或多个互换位置后获得的tracrRNA;2) The guide RNA includes tracrRNA obtained by exchanging one or more positions of complementary paired base pairs in the tracrRNA;
    3)所述向导RNA包含在tracrRNA与tracr配对物序列互补配对的碱基对的一个或多个互换位置后获得的tracrRNA和tracr配对物序列;3) The guide RNA includes the tracrRNA and the tracr partner sequence obtained after one or more exchange positions of the base pairs of the tracrRNA and the tracr partner sequence.
    4)所述基因靶向区段识别靶向序列上的PAM序列;优选地PAM序列为5’-NTTC和5’-GTTT至少任一,其中N为A、T、C、G;进一步优选地PAM序列为5’-GTTC,5’-TTTC和5’-ATTC至少任一;4) The gene targeting segment recognizes the PAM sequence on the targeting sequence; preferably the PAM sequence is at least either 5'-NTTC and 5'-GTTT, where N is A, T, C, G; further preferably The PAM sequence is at least one of 5'-GTTC, 5'-TTTC and 5'-ATTC;
    5)所述基因靶向区段靶向PAM序列之后长度为12~40bp的核酸片段;优选的长度为20bp;5) The gene targeting segment is a nucleic acid fragment with a length of 12 to 40 bp after targeting the PAM sequence; the preferred length is 20 bp;
    6)所述向导RNA为两条链,一条链包含所述tracrRNA序列,另一条链包含所述crRNA序列;6) The guide RNA has two strands, one strand contains the tracrRNA sequence, and the other strand contains the crRNA sequence;
    7)所述向导RNA为一条链,从5’端至3’端依次包含所述tracr RNA序列和crRNA序列。7) The guide RNA is a chain, containing the tracr RNA sequence and crRNA sequence in sequence from the 5' end to the 3' end.
  3. 如权利要求2所述的向导RNA,其特征在于,当所述向导RNA为一条链时,所述tracr RNA序列的3’端与crRNA序列的5’端直接连接或通过连接链连接;优选地,通过连接链连接,所述连接链选自寡核苷酸;优选地,所述连接链的寡核苷酸的个数为3~18nt;进一步优选地,所述连接链为AAGG。The guide RNA of claim 2, wherein when the guide RNA is a strand, the 3' end of the tracr RNA sequence is directly connected to the 5' end of the crRNA sequence or connected through a connecting strand; preferably , connected through a connecting chain, and the connecting chain is selected from oligonucleotides; preferably, the number of oligonucleotides in the connecting chain is 3 to 18 nt; further preferably, the connecting chain is AAGG.
  4. 如权利要求1所述的向导RNA,其特征在于,包括以下至少任一项:The guide RNA of claim 1, comprising at least any one of the following:
    1)所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端和/或3’端增加、减少或替换部分核苷酸后获得的序列;优选地,所述 tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端和/或3’端减少核苷酸后获得的序列;1) The variant sequence of tracrRNA refers to a sequence obtained by adding, reducing or replacing some nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 111; Preferably, the variant sequence of tracrRNA refers to a sequence obtained by reducing nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 111;
    2)所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端和/或3’端增加、减少或替换部分核苷酸后获得的序列;优选地,所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端和/或3’端减少核苷酸后获得的序列。2) The variant sequence of the tracr partner sequence is obtained by adding, reducing or replacing some nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 112 Sequence; Preferably, the variant sequence of the tracr partner sequence refers to one obtained by reducing nucleotides at the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 112 sequence.
  5. 如权利要求4所述的向导RNA,其特征在于,包括以下至少任一项:The guide RNA of claim 4, comprising at least any one of the following:
    1)所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端和/或3’端减少1~30nt核苷酸后获得的序列;优选地,在5’端减少4~16nt核苷酸,和/或,在3’端减少3~28nt核苷酸;1) The variant sequence of tracrRNA refers to a sequence obtained by reducing 1 to 30 nt nucleotides from the 5' end and/or 3' end of the nucleotide sequence shown in SEQ ID NO. 111; preferably , reduce 4 to 16 nt nucleotides at the 5' end, and/or reduce 3 to 28 nt nucleotides at the 3' end;
    2)所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端减少3~28nt核苷酸后获得的序列;优选地,减少8~21nt核苷酸。2) The variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 3 to 28 nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 112; preferably, reducing 8 nt ~21nt nucleotide.
  6. 如权利要求5所述的向导RNA,其特征在于,包括以下至少任一项:The guide RNA of claim 5, comprising at least any one of the following:
    1)所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端减少7nt核苷酸获得的序列;优选地,所述tracrRNA的变体序列如SEQ ID NO.114所示;1) The variant sequence of tracrRNA refers to the sequence obtained by reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111; preferably, the variant sequence of tracrRNA is as follows Shown in SEQ ID NO.114;
    2)所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的3’端减少6nt核苷酸获得的序列;优选地,所述tracrRNA的变体序列如SEQ ID NO.120所示;2) The variant sequence of tracrRNA refers to a sequence obtained by reducing 6 nt nucleotides from the 3' end of the nucleotide sequence shown in SEQ ID NO. 111; preferably, the variant sequence of tracrRNA is as follows Shown in SEQ ID NO.120;
    3)所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的5’端减少8nt核苷酸获得的序列;优选地,所述tracr配对物序列的变体序列如SEQ ID NO.130所示;3) The variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 8nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 112; preferably, the tracr partner sequence The variant sequence of the sequence is shown in SEQ ID NO.130;
    4)所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端减少7nt核苷酸,在3’端减少6nt核苷酸获得的序列;优选地,所述tracrRNA的变体序列如SEQ ID NO.138所示;4) The variant sequence of tracrRNA refers to a sequence obtained by reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111 and reducing 6nt nucleotides at the 3' end; preferably Land, the variant sequence of tracrRNA is shown in SEQ ID NO. 138;
    5)所述tracrRNA的变体序列是指在所述SEQ ID NO.111所示的核苷酸序列的5’端减少7nt核苷酸,在3’端减少6nt核苷酸获得的序列;优选地,所述tracrRNA的变体序列如SEQ ID NO.144所示;5) The variant sequence of tracrRNA refers to a sequence obtained by reducing 7nt nucleotides at the 5' end of the nucleotide sequence shown in SEQ ID NO. 111 and reducing 6nt nucleotides at the 3' end; preferably Land, the variant sequence of tracrRNA is shown in SEQ ID NO. 144;
    6)所述tracr配对物序列的变体序列是指在所述SEQ ID NO.112所示的核苷酸序列的3’端减少21nt核苷酸获得的序列;优选地,所述tracr配对物序列的变体序列如SEQ ID NO.148所示。6) The variant sequence of the tracr partner sequence refers to a sequence obtained by reducing 21nt nucleotides at the 3' end of the nucleotide sequence shown in SEQ ID NO. 112; preferably, the tracr partner sequence The variant sequence of the sequence is shown in SEQ ID NO. 148.
  7. 如权利要求6所述的向导RNA,其特征在于,包括以下至少任一项:The guide RNA of claim 6, comprising at least any one of the following:
    1)所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.116所示;1) The nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 116;
    2)所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.122所示;2) The nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 122;
    3)所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.131所示;3) The nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 131;
    4)所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.140所示;4) The nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 140;
    5)所述tracrRNA的变体序列如SEQ ID NO.144所示,所述tracr配对物序列的变体序列如SEQ ID NO.145所示;优选地,所述tracrRNA和tracr配对物序列通过连接链连接;进一步优选地,所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.146所示;5) The variant sequence of the tracrRNA is shown in SEQ ID NO.144, and the variant sequence of the tracr partner sequence is shown in SEQ ID NO.145; preferably, the tracrRNA and the tracr partner sequence are connected by Chain connection; Further preferably, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 146;
    6)所述tracrRNA的变体序列如SEQ ID NO.147所示,所述tracr配对物序列的变体序列如SEQ ID NO.148所示;优选地,所述tracrRNA和tracr配对物序列通过连接链连接;进一步优选地,所述向导RNA的骨架序列的核苷酸序列如SEQ ID NO.149所示;6) The variant sequence of the tracrRNA is shown in SEQ ID NO.147, and the variant sequence of the tracr partner sequence is shown in SEQ ID NO.148; preferably, the tracrRNA and the tracr partner sequence are connected by Chain connection; further preferably, the nucleotide sequence of the backbone sequence of the guide RNA is shown in SEQ ID NO. 149;
    7)所述向导RNA如SEQ ID NO.1~19至少任一所示;优选地,如SEQ ID NO.2、4、7、10、12、13至少任一所示。7) The guide RNA is as shown in at least any one of SEQ ID NO. 1 to 19; preferably, it is as shown in at least any one of SEQ ID NO. 2, 4, 7, 10, 12, and 13.
  8. 一种分离的多核苷酸,其特征在于,其编码如权利要求1~7任一权利要求所述的向导RNA。An isolated polynucleotide, characterized in that it encodes the guide RNA according to any one of claims 1 to 7.
  9. 一种构建体,其特征在于,所述构建体含有如权利要求8所述的分离的多核苷酸。A construct, characterized in that the construct contains the isolated polynucleotide of claim 8.
  10. 一种表达系统,其特征在于,所述表达系统含有如权利要求9所述的构建体或基因组中整合有外源的如权利要求8所述的多核苷酸。An expression system, characterized in that the expression system contains the construct according to claim 9 or an exogenous polynucleotide according to claim 8 integrated into the genome.
  11. 如权利要求10所述的表达系统,其特征在于,所述表达系统的宿主细胞选自真核细胞或原核细胞;优选地,所述宿主细胞选自小鼠细胞、人细胞。The expression system of claim 10, wherein the host cell of the expression system is selected from eukaryotic cells or prokaryotic cells; preferably, the host cell is selected from mouse cells and human cells.
  12. 一种基因编辑系统,其特征在于,其包括如权利要求1~7任一权利要求所述的向导RNA或其编码多核苷酸。A gene editing system, characterized in that it includes the guide RNA or its encoding polynucleotide as described in any one of claims 1 to 7.
  13. 如权利要求12所述的碱基编辑系统,其特征在于,所述碱基编辑系统还包括核酸酶或其编码多核苷酸。The base editing system of claim 12, wherein the base editing system further includes a nuclease or a polynucleotide encoding it.
  14. 如权利要求13所述的碱基编辑系统,其特征在于,包括以下至少任一项:The base editing system of claim 13, comprising at least any one of the following:
    1)所述碱基编辑系统包含一个或多个载体;所述一个或多个载体包含(i)第一调控元件,所述第一调控元件可操作地连接至所述核酸酶的编码多核苷酸;以及(ii)第二调控元件,所述第二调控元件可操作地连接至所述向导RNA核苷酸序列的编码多核苷酸;1) The base editing system comprises one or more vectors; the one or more vectors comprise (i) a first regulatory element operably connected to the encoding polynucleoside of the nuclease an acid; and (ii) a second regulatory element operably linked to a polynucleotide encoding the guide RNA nucleotide sequence;
    所述(i)和(ii)位于相同或不同载体上;Said (i) and (ii) are located on the same or different carriers;
    2)所述碱基编辑系统包含(i)核酸酶,以及(ii)包含所述向导RNA的编码多核苷酸的载体。2) The base editing system includes (i) a nuclease, and (ii) a vector containing a polynucleotide encoding the guide RNA.
  15. 如权利要求13所述的基因编辑系统,所述核酸酶是CRISPR核酸酶;优选地,所述核酸酶选自Cas9、Cas12、Cas13蛋白家族及其突变体;进一步优选地,所述Cas核酸酶选自nSpCas9及其突变体、SaCas9及其突变体、Cas12a及其突变体或Cas12f及其突变体;更进一步优选为SpCas12f1核酸酶及其突变体。The gene editing system of claim 13, wherein the nuclease is a CRISPR nuclease; preferably, the nuclease is selected from Cas9, Cas12, Cas13 protein family and mutants thereof; further preferably, the Cas nuclease Selected from nSpCas9 and its mutants, SaCas9 and its mutants, Cas12a and its mutants, or Cas12f and its mutants; more preferably, it is SpCas12f1 nuclease and its mutants.
  16. 如权利要求15所述的基因编辑系统,其特征在于,所述SpCas12f1核酸酶及其突变体包括:The gene editing system of claim 15, wherein the SpCas12f1 nuclease and its mutants include:
    (I)野生型SpCas12f1核酸酶或其片段,具有受RNA引导的核酸结合活性;所述SpCas12f1核酸酶来源于Syntrophomonaspalmitatica Cas12f1,氨基酸序列如SEQ ID NO.40所示;优选地,所述SpCas12f1核酸酶人源化密码子优化的核酸序列如SEQ ID NO.42所示;(1) Wild-type SpCas12f1 nuclease or a fragment thereof, which has RNA-guided nucleic acid binding activity; the SpCas12f1 nuclease is derived from Syntrophomonaspalmitatica Cas12f1, and the amino acid sequence is as shown in SEQ ID NO.40; Preferably, the SpCas12f1 nuclease The humanized codon-optimized nucleic acid sequence is shown in SEQ ID NO.42;
    (II)与(I)的氨基酸序列具有至少50%序列同源性的变体,且具有受RNA引导的核酸结合活性;(II) A variant having at least 50% sequence homology with the amino acid sequence of (I) and having RNA-guided nucleic acid binding activity;
    (III)根据(I)或(II),其进一步包括核定位信号片段;(III) according to (I) or (II), which further includes a nuclear localization signal fragment;
    (IV)根据(I)或(II)或(III),其进一步包含:(IV) Subject to (I) or (II) or (III), which further includes:
    (a)一种或多种修饰或突变,其产生具有相比修饰或突变前显著减小的核酸内切酶活性,或使核酸内切酶活性丧失;和/或(a) One or more modifications or mutations that produce an endonuclease activity that is significantly reduced compared to before the modification or mutation, or that results in a loss of endonuclease activity; and/or
    (b)具有其他功能活性的多肽或结构域;(b) Polypeptides or domains with other functional activities;
    (V)根据(I)或(II)或(III),所述SpCas12f1核酸酶具有核酸内切酶活性。(V) According to (I) or (II) or (III), the SpCas12f1 nuclease has endonuclease activity.
  17. 如权利要求12-16任一项所述的基因编辑系统,其特征在于,包括以下至少任一项:The gene editing system according to any one of claims 12-16, characterized in that it includes at least any one of the following:
    1)所述基因编辑系统识别靶向序列上的PAM序列;优选地PAM序列为5’-NTTC和5’-GTTT至少任一,其中N为A、T、C、G;进一步优选地PAM序列为5’-GTTC,5’-TTTC和5’-ATTC至少任一;1) The gene editing system recognizes the PAM sequence on the target sequence; preferably the PAM sequence is at least one of 5'-NTTC and 5'-GTTT, where N is A, T, C, G; further preferably the PAM sequence Be at least any one of 5'-GTTC, 5'-TTTC and 5'-ATTC;
    2)所述基因编辑系统靶向PAM序列之后长度为12~40bp的核酸片段,优选的长度为20bp;2) The gene editing system targets nucleic acid fragments with a length of 12 to 40 bp after the PAM sequence, and the preferred length is 20 bp;
    3)所述基因编辑系统靶向细胞基因组中的至少一个靶序列。3) The gene editing system targets at least one target sequence in the cell genome.
  18. 一种药物组合物,其特征在于,其包含权利要求12-17任一所述的基因编辑系统,以及药学上可接受的载体。A pharmaceutical composition, characterized in that it contains the gene editing system according to any one of claims 12 to 17, and a pharmaceutically acceptable carrier.
  19. 一种基因编辑方法,其特征在于,将靶基因与权利要求12-17任一所述的基因编辑系统接触,以实现靶基因的编辑。A gene editing method, characterized by contacting a target gene with the gene editing system described in any one of claims 12-17 to achieve editing of the target gene.
  20. 如权利要求19所述的基因编辑方法,其特征在于,包括下列步骤:The gene editing method of claim 19, comprising the following steps:
    i)将所述SpCas12f1核酸酶或其编码多核苷酸、以及所述向导RNA或其编码多核苷酸引入细胞中;i) introducing the SpCas12f1 nuclease or its encoding polynucleotide and the guide RNA or its encoding polynucleotide into a cell;
    ii)由所述SpCas12f1核酸酶介导,在靶基因中产生一种或多种切口,或靶向、编辑、修饰或操纵所述靶基因。ii) Produce one or more nicks in a target gene mediated by the SpCas12f1 nuclease, or target, edit, modify or manipulate the target gene.
  21. 如权利要求19或20所述的基因编辑方法,其特征在于,包括以下至少任一项:The gene editing method according to claim 19 or 20, characterized in that it includes at least any one of the following:
    1)所述SpCas12f1核酸酶通过加工或未加工形式的向导RNA引导至靶基因;1) The SpCas12f1 nuclease is guided to the target gene through processed or unprocessed guide RNA;
    2)所述SpCas12f1核酸酶和向导RNA形成复合物,识别所述靶基因上的PAM序列;2) The SpCas12f1 nuclease and guide RNA form a complex to recognize the PAM sequence on the target gene;
    3)所述基因编辑系统的靶向序列为PAM序列之后长度为12~40bp的核酸片段,优选PAM序列之后长度为20bp的核酸片段;3) The target sequence of the gene editing system is a nucleic acid fragment with a length of 12 to 40 bp after the PAM sequence, preferably a nucleic acid fragment with a length of 20 bp after the PAM sequence;
    4)所述方法进一步包括将包含异源多核苷酸序列的供体模板引入细胞中的步骤。4) The method further includes the step of introducing a donor template comprising a heterologous polynucleotide sequence into the cell.
  22. 如权利要求1~7任一权利要求所述的向导RNA或如权利要求8所述的分离的多核苷酸或如权利要求9所述的构建体或如权利要求10或11所述的表达系统或如权利要求12-17任一项所述的基因编辑系统或如权利要求18所述的 药物组合物或如权利要求19-21任一所述的方法在体内、离体细胞或无细胞环境中对靶基因和/或其相关多肽进行基因编辑中的应用。The guide RNA according to any one of claims 1 to 7, the isolated polynucleotide according to claim 8, the construct according to claim 9, or the expression system according to claims 10 or 11 Or the gene editing system according to any one of claims 12-17 or the pharmaceutical composition according to claim 18 or the method according to any one of claims 19-21 in vivo, ex vivo cells or cell-free environment Application in gene editing of target genes and/or related polypeptides.
  23. 如权利要求22所述的应用,其特征在于,包括以下至少任一项:The application according to claim 22, characterized in that it includes at least any one of the following:
    1)所述基因编辑选自由:基因切割、基因删除、基因插入、点突变、转录抑制、转录激活、碱基编辑和引导编辑构成的群组;优选地,所述基因编辑为基因删除和/或基因切割;1) The gene editing is selected from the group consisting of: gene cutting, gene deletion, gene insertion, point mutation, transcription inhibition, transcription activation, base editing and guided editing; preferably, the gene editing is gene deletion and/or or gene cutting;
    2)所述离体细胞选自细菌细胞、古细菌细胞、真菌细胞、原生生物细胞、病毒细胞、植物细胞和动物细胞中的至少一种;2) The in vitro cells are selected from at least one of bacterial cells, archaeal cells, fungal cells, protist cells, viral cells, plant cells and animal cells;
    3)所述基因编辑用于实现致病位点的修正、基因功能研究、增强细胞功能、细胞治疗至少任一。3) The gene editing is used to achieve at least any of the correction of disease-causing sites, research on gene functions, enhancement of cell functions, and cell therapy.
  24. 一种遗传修饰的细胞,其通过如权利要求12-17任一所述的基因编辑系统或如权利要求18所述的药物组合物或如权利要求19-21任一所述的方法进行基因编辑获得。A genetically modified cell, which is gene edited by the gene editing system as described in any one of claims 12-17 or the pharmaceutical composition as claimed in claim 18 or the method as described in any one of claims 19-21 get.
  25. 一种如权利要求1~7任一项所述的向导RNA制备方法,其特征在于,所述方法包括对基础向导RNA的tracrRNA和crRNA进行单独改造及组合改造的步骤,所述改造选自对tracrRNA或crRNA进行截断或延长,或将tracrRNA和crRNA之间通过连接链进行连接。A guide RNA preparation method according to any one of claims 1 to 7, characterized in that the method includes the steps of individually transforming and combining the tracrRNA and crRNA of the basic guide RNA, and the transformation is selected from the group consisting of TracrRNA or crRNA is truncated or extended, or tracrRNA and crRNA are connected through a connecting strand.
PCT/CN2022/113354 2022-04-25 2022-08-18 Optimized crispr/spcas12f1 system, engineered guide rna and use thereof WO2023206871A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210444211.9A CN116987693A (en) 2022-04-25 2022-04-25 Optimized CRISPR/SpCas12f1 system, engineering guide RNA and application thereof
CN202210444211.9 2022-04-25

Publications (1)

Publication Number Publication Date
WO2023206871A1 true WO2023206871A1 (en) 2023-11-02

Family

ID=88517099

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/113354 WO2023206871A1 (en) 2022-04-25 2022-08-18 Optimized crispr/spcas12f1 system, engineered guide rna and use thereof

Country Status (2)

Country Link
CN (1) CN116987693A (en)
WO (1) WO2023206871A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117568311A (en) * 2023-11-16 2024-02-20 广西科学院 Engineered CRISPR enzyme and system for precise gene editing and application thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019126762A2 (en) * 2017-12-22 2019-06-27 The Broad Institute, Inc. Cas12a systems, methods, and compositions for targeted rna base editing
WO2020088450A1 (en) * 2018-10-29 2020-05-07 中国农业大学 Novel crispr/cas12f enzyme and system
WO2020098772A1 (en) * 2018-11-15 2020-05-22 中国农业大学 Crispr-cas12j enzyme and system
CN113166744A (en) * 2018-12-14 2021-07-23 先锋国际良种公司 Novel CRISPR-CAS system for genome editing
WO2021238128A1 (en) * 2020-05-28 2021-12-02 上海科技大学 Genome editing system and method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019126762A2 (en) * 2017-12-22 2019-06-27 The Broad Institute, Inc. Cas12a systems, methods, and compositions for targeted rna base editing
WO2020088450A1 (en) * 2018-10-29 2020-05-07 中国农业大学 Novel crispr/cas12f enzyme and system
WO2020098772A1 (en) * 2018-11-15 2020-05-22 中国农业大学 Crispr-cas12j enzyme and system
CN113166744A (en) * 2018-12-14 2021-07-23 先锋国际良种公司 Novel CRISPR-CAS system for genome editing
WO2021238128A1 (en) * 2020-05-28 2021-12-02 上海科技大学 Genome editing system and method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
GRETA BIGELYTE ET AL.: "Miniature Type V-F CRISPR-Cas Nucleases Enable Targeted DNA Modification in Cells", NATURE COMMUNICATIONS, vol. 12, no. 1, 26 October 2021 (2021-10-26), XP093104321, ISSN: 2041-1723, DOI: 10.1038/s41467-021-26469-4 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117568311A (en) * 2023-11-16 2024-02-20 广西科学院 Engineered CRISPR enzyme and system for precise gene editing and application thereof

Also Published As

Publication number Publication date
CN116987693A (en) 2023-11-03

Similar Documents

Publication Publication Date Title
US20230383290A1 (en) High-throughput precision genome editing
US11713471B2 (en) Class II, type V CRISPR systems
EP3464587B1 (en) Compositions and methods for enhancing homologous recombination
RU2707542C1 (en) METHOD OF PRODUCING A RECOMBINANT NUCLEASE CAS ESSENTIALLY FREE OF BACTERIAL ENDOTOXINS, THE PREPARATION OBTAINED BY THIS METHOD AND CONTAINING A KIT FOR USE IN A CRISPR/Cas SYSTEM
US20230416710A1 (en) Engineered and chimeric nucleases
US20230212612A1 (en) Genome editing system and method
WO2023098485A1 (en) C2c9 nuclease-based novel genome editing system and application thereof
WO2023206871A1 (en) Optimized crispr/spcas12f1 system, engineered guide rna and use thereof
TW202330918A (en) Circular rna and preparation method thereof
WO2022066335A1 (en) Systems and methods for transposing cargo nucleotide sequences
CA3225082A1 (en) Enzymes with ruvc domains
WO2024055664A1 (en) Optimized guide rna, crispr/acc2c9 gene editing system, and gene editing method
WO2023206872A1 (en) Engineering-optimized nuclease, guide rna, editing system, and use
WO2022159742A1 (en) Novel engineered and chimeric nucleases
EP4428233A1 (en) C2c9 nuclease-based novel genome editing system and application thereof
US20190218533A1 (en) Genome-Scale Engineering of Cells with Single Nucleotide Precision
WO2024125221A1 (en) Base editing system and base editing method
CN114891786B (en) Dog Rosa26 gene and application thereof
US20230348877A1 (en) Base editing enzymes
CN118048343A (en) Cas9 nuclease and preparation method and application thereof
CA3189020A1 (en) Nuclease-mediated nucleic acid modification
CN112997966A (en) Mouse model knocking-in miRNA-125a based on CRISPR/Cas9 technology and construction method
CN118048344A (en) Nuclease, preparation method and application thereof
WO2024059811A2 (en) Retron directed gene editing
EP4330386A2 (en) Enzymes with ruvc domains

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22939697

Country of ref document: EP

Kind code of ref document: A1