WO2019206233A1 - Rna-edited crispr/cas effector protein and system - Google Patents

Rna-edited crispr/cas effector protein and system Download PDF

Info

Publication number
WO2019206233A1
WO2019206233A1 PCT/CN2019/084340 CN2019084340W WO2019206233A1 WO 2019206233 A1 WO2019206233 A1 WO 2019206233A1 CN 2019084340 W CN2019084340 W CN 2019084340W WO 2019206233 A1 WO2019206233 A1 WO 2019206233A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
protein
nucleic acid
seq
composition
Prior art date
Application number
PCT/CN2019/084340
Other languages
French (fr)
Chinese (zh)
Inventor
赖锦盛
张湘博
周英思
朱金洁
吕梦璐
赵海铭
宋伟彬
Original Assignee
中国农业大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国农业大学 filed Critical 中国农业大学
Priority to CN201980028197.0A priority Critical patent/CN112020560B/en
Publication of WO2019206233A1 publication Critical patent/WO2019206233A1/en

Links

Images

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/43Enzymes; Proenzymes; Derivatives thereof
    • A61K38/46Hydrolases (3)
    • A61K38/47Hydrolases (3) acting on glycosyl compounds (3.2), e.g. cellulases, lactases
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts

Definitions

  • the invention relates to the field of nucleic acid editing, in particular to the field of regularly clustered short palindrome repetition (CRISPR) technology.
  • CRISPR regularly clustered short palindrome repetition
  • the invention relates to Cas effector proteins, fusion proteins comprising such proteins, and nucleic acid molecules encoding the same.
  • the invention also relates to complexes and compositions for nucleic acid editing (e.g., gene or genome editing) comprising a protein or fusion protein of the invention, or a nucleic acid molecule encoding the same.
  • the invention also relates to methods for nucleic acid editing (eg, gene or genome editing) using a protein or fusion protein comprising the invention.
  • CRISPR-cas-mediated immunity mainly includes three stages: (1) adaptive stage, cas1-cas2 protein complex inserts the target DNA fragment into the CRISPR repeat region; (2) expression and processing stage, CRISPR sequence transcription and cas The effector protein processes the mature crRNA; (3) the interference phase, the cas protein and the crRNA assemble into a complex to process the target DNA or RNA.
  • the CRISPR-cas system has evolved into a variety of defense mechanisms, including RNA-mediated DNA or RNA editing mechanisms.
  • the CRISPR-cas system is divided into class1 and class2.
  • Class1 is a complex composed of multiple proteins and is relatively complex.
  • the Class 2 system is relatively simple in structure and has only one effector protein, so it is widely used.
  • Type II and typeV edit DNA Type VI CRISPR-Cas system has cleavage activity on the target RNA.
  • the RNA-edited CRISPR-Cas system can regulate genes at the level of gene transcription, as well as detect live viruses, RNA interference, gene-selective splicing, and fluorescence in situ hybridization. Therefore, it is especially important to mine new RNA editing systems.
  • the inventors of the present application have unexpectedly discovered a novel RNA-directed ribonuclease endonuclease after extensive experimentation and repeated exploration. Based on this finding, the inventors have developed a new RNA editing CRISPR/Cas system and a gene editing method based on the system.
  • the invention provides a protein having the amino acid sequence set forth in any one of SEQ ID NOs: 1-7, or an ortholog, homolog, variant or functional fragment thereof; Wherein the ortholog, homologue, variant or functional fragment substantially retains the biological function of the sequence from which it is derived.
  • the biological functions include, but are not limited to, activity binding to a targeting RNA, endoribonuclease activity, binding to a specific site of a target sequence under the guidance of a targeting RNA, and cleavage.
  • the ortholog, homolog, variant has at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least compared to the sequence from which it is derived. 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity.
  • the ortholog, homolog, variant has at least 80%, at least 85%, at least 90%, compared to the sequence set forth in any one of SEQ ID NOs: 1-7, At least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity and substantially retains the origin of the sequence
  • the biological function of the sequence eg, activity binding to the targeting RNA, endoribonuclease activity, activity that binds to a specific site of the target sequence and cleaves under the guidance of a targeting RNA).
  • the protein is an effector protein in the CRISPR/Cas system.
  • the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the protein of the invention has the amino acid sequence set forth in SEQ ID NO:1.
  • the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the protein of the invention has the amino acid sequence set forth in SEQ ID NO:2.
  • the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the protein of the invention has the amino acid sequence set forth in SEQ ID NO:3.
  • the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the protein of the invention has the amino acid sequence set forth in SEQ ID NO:4.
  • the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the protein of the invention has the amino acid sequence set forth in SEQ ID NO:5.
  • the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the protein of the invention has the amino acid sequence set forth in SEQ ID NO: 6.
  • the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the protein of the invention has the amino acid sequence set forth in SEQ ID NO:7.
  • a protein of the invention can be derivatized, for example, linked to another molecule (e.g., another polypeptide or protein).
  • derivatization eg, labeling
  • the proteins of the invention are also intended to include such derivatized forms.
  • a protein of the invention can be functionally linked (by chemical coupling, gene fusion, non-covalent attachment or otherwise) to one or more other molecular groups, such as another protein or polypeptide, a detection reagent, a pharmaceutical reagent Wait.
  • the proteins of the invention may be linked to other functional units.
  • it can be ligated to a nuclear localization signal (NLS) sequence to increase the ability of the proteins of the invention to enter the nucleus.
  • NLS nuclear localization signal
  • it can be linked to a targeting moiety to render the protein of the invention targeted.
  • it can be linked to a detectable label to facilitate detection of the proteins of the invention.
  • it can be linked to an epitope tag to facilitate expression, detection, tracing, and/or purification of the proteins of the invention.
  • the invention provides a conjugate comprising a protein and a modified moiety as described above.
  • the modified moiety is selected from another protein or polypeptide, a detectable label, or any combination thereof.
  • a nuclease domain eg, Fok1
  • a nuclease domain having a domain selected from the group consisting of methylase activity, demethylase, transcriptional activation activity, transcriptional repression Activity, transcription release factor activity, histone modification activity, nuclease activity, single-strand RNA cleavage activity, double-stranded RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA cleavage activity and nucleic acid binding activity; and any combination thereof.
  • a conjugate of the invention comprises one or more NLS sequences, such as the NLS of the SV40 viral large T antigen.
  • the NLS sequence is set forth in SEQ ID NO:22.
  • the NLS sequence is located at, near, or near the end of the protein of the invention (eg, the N-terminus or the C-terminus).
  • the NLS sequence is located at, near, or near the C-terminus of the protein of the invention.
  • the conjugates of the invention comprise an epitope tag.
  • epitope tags are well known to those skilled in the art, examples of which include, but are not limited to, His, V5, FLAG, HA, Myc, VSV-G, Trx, etc., and those skilled in the art know how to achieve the desired purpose (eg, Purify, test or trace) Select the appropriate epitope tag.
  • a conjugate of the invention comprises a reporter gene sequence.
  • reporter genes are well known to those skilled in the art, and examples include, but are not limited to, GST, HRP, CAT, GFP, HcRed, DsRed, CFP, YFP, BFP, and the like.
  • the conjugates of the invention comprise a domain capable of binding to a DNA molecule or an intracellular molecule, such as a maltose binding protein (MBP), a DNA binding domain of Lex A (DBD), a DBD of GAL4, and the like.
  • MBP maltose binding protein
  • DBD DNA binding domain of Lex A
  • GAL4 GAL4
  • the conjugates of the invention comprise a detectable label, such as a fluorescent dye, such as FITC or DAPI.
  • a protein of the invention is optionally coupled, conjugated or fused to the modified moiety by a linker.
  • the modified moiety is directly linked to the N-terminus or C-terminus of the protein of the invention.
  • the modified moiety is linked to the N-terminus or C-terminus of the protein of the invention by a linker.
  • linkers are well known in the art, examples of which include, but are not limited to, one or more (eg, 1, 2, 3, 4 or 5) amino acids (eg, Glu or Ser) or amino acid derivatives.
  • a linker eg, Ahx, ⁇ -Ala, GABA, or Ava), or PEG, and the like.
  • the invention provides a fusion protein comprising a protein of the invention and an additional protein or polypeptide.
  • a nuclease domain eg, Fok1
  • a nuclease domain having a domain selected from the group consisting of methylase activity, demethylase, transcriptional activation activity, transcriptional repression Activity, transcription release factor activity, histone modification activity, nuclease activity, single-strand RNA cleavage activity, double-stranded RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA cleavage activity and nucleic acid binding activity; and any combination thereof.
  • a fusion protein of the invention comprises one or more NLS sequences, such as the NLS of the SV40 viral large T antigen.
  • the NLS sequence is at, near, or near the end of the protein of the invention (eg, the N-terminus or the C-terminus). In certain exemplary embodiments, the NLS sequence is located at, near, or near the C-terminus of the protein of the invention.
  • a fusion protein of the invention comprises an epitope tag.
  • a fusion protein of the invention comprises a reporter gene sequence.
  • a fusion protein of the invention comprises a domain capable of binding to a DNA molecule or an intracellular molecule.
  • a protein of the invention is optionally fused to the additional protein or polypeptide via a linker.
  • the additional protein or polypeptide is directly linked to the N-terminus or C-terminus of the protein of the invention.
  • the additional protein or polypeptide is linked to the N-terminus or C-terminus of the protein of the invention by a linker.
  • the fusion proteins of the invention have an amino acid sequence selected from the group consisting of SEQ ID NOs: 23-29.
  • the protein of the present invention, the conjugate of the present invention or the fusion protein of the present invention is not limited by the manner in which it is produced, for example, it can be produced by a genetic engineering method (recombination technique) or can be produced by a chemical synthesis method.
  • the invention provides an isolated nucleic acid molecule comprising a sequence selected from the group consisting of: or consisting of:
  • sequence of any one of (ii)-(v) substantially retains the biological function of the sequence from which it is derived, the biological function of the sequence being referred to as the same direction in the CRISPR-Cas system Repeat the activity of the sequence.
  • the isolated nucleic acid molecule is a direct repeat in a CRISPR-Cas system.
  • the isolated nucleic acid molecule comprises one or more stem loops or an optimized secondary structure.
  • the sequence of any of (ii)-(v) retains the secondary structure of the sequence from which it is derived.
  • the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
  • the isolated nucleic acid molecule is RNA.
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
  • the invention provides a composite comprising:
  • a protein component selected from the group consisting of a protein, conjugate or fusion protein of the invention, and any combination thereof;
  • nucleic acid component comprising an isolated nucleic acid molecule as described above and a targeting sequence capable of hybridizing to the target sequence
  • the targeting sequence is linked to the 3' or 5' end of the nucleic acid molecule.
  • the targeting sequence comprises the complement of the target sequence.
  • the nucleic acid component is a targeting RNA in a CRISPR/Cas system.
  • the nucleic acid molecule is RNA.
  • the complex does not comprise a trans-acting tracrRNA.
  • the invention provides an isolated nucleic acid molecule comprising:
  • nucleotide sequence set forth in any of (i)-(iii) is codon optimized for expression in a prokaryotic cell. In certain embodiments, the nucleotide sequence set forth in any of (i)-(iii) is codon optimized for expression in eukaryotic cells.
  • the invention provides a vector comprising the isolated nucleic acid molecule of the sixth aspect.
  • the vector of the present invention may be a cloning vector or an expression vector.
  • vectors of the invention are, for example, plasmids, cosmids, phage, cosmid, and the like.
  • the vector is capable of expressing a protein of the invention, a fusion protein, an isolated nucleic acid molecule of the fourth aspect, or a fifth aspect, in a subject (eg, a mammal, eg, a human) Said complex.
  • the invention also provides a host cell comprising an isolated nucleic acid molecule or vector as described above.
  • host cells include, but are not limited to, prokaryotic cells such as E. coli cells, and eukaryotic cells such as yeast cells, insect cells, plant cells, and animal cells (eg, mammalian cells, such as mouse cells, human cells, etc.).
  • the cells of the invention may also be cell lines, such as 293T cells.
  • the host cell is a prokaryotic cell.
  • compositions and carrier composition are Composition and carrier composition
  • the present invention also provides a composition comprising:
  • a first component selected from the group consisting of a protein, a conjugate, a fusion protein, a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
  • a second component which is a nucleotide sequence comprising a targeting RNA or a nucleotide sequence encoding the nucleotide sequence comprising the targeting RNA;
  • the targeting RNA comprises a homologous repeat sequence and a targeting sequence, the targeting sequence being capable of hybridizing to the target sequence;
  • the targeting RNA is capable of forming a complex with the protein, conjugate or fusion protein described in (i).
  • the isotropic repeat is an isolated nucleic acid molecule as defined in the fourth aspect.
  • the targeting sequence is ligated to the 3' or 5' end of the homologous repeat. In certain embodiments, the targeting sequence comprises the complement of the target sequence.
  • the composition does not comprise tracrRNA.
  • the composition is non-naturally occurring or modified. In certain embodiments, at least one component of the composition is non-naturally occurring or modified. In certain embodiments, the first component is non-naturally occurring or modified; and/or the second component is non-naturally occurring or modified.
  • the target sequence is an RNA sequence from a prokaryotic or eukaryotic cell. In certain embodiments, the target sequence is a non-naturally occurring RNA sequence.
  • the target sequence is present in a cell. In certain embodiments, the target sequence is present within the nucleus or within the cytoplasm (eg, an organelle). In certain embodiments, the cell is a prokaryotic cell. In certain embodiments, the cell is a eukaryotic cell.
  • the protein is linked to one or more NLS sequences.
  • the conjugate or fusion protein comprises one or more NLS sequences.
  • the NLS sequence is linked to the N-terminus or C-terminus of the protein.
  • the NLS sequence is fused to the N-terminus or C-terminus of the protein.
  • the present invention also provides a composition comprising one or more carriers, the one or more carriers comprising:
  • a first nucleic acid which is a nucleotide sequence encoding a protein or fusion protein of the invention; optionally the first nucleic acid is operably linked to a first regulatory element;
  • a second nucleic acid encoding a nucleotide sequence comprising a targeting RNA; optionally the second nucleic acid is operably linked to a second regulatory element;
  • the first nucleic acid and the second nucleic acid are present on the same or different carrier;
  • the targeting RNA comprises a homologous repeat sequence and a targeting sequence capable of hybridizing to the target sequence
  • the targeting RNA is capable of forming a complex with the effector protein or fusion protein described in (i).
  • the isotropic repeat is an isolated nucleic acid molecule as defined in the fourth aspect.
  • the targeting sequence is ligated to the 3' or 5' end of the homologous repeat. In certain embodiments, the targeting sequence comprises the complement of the target sequence.
  • the composition does not comprise tracrRNA.
  • the composition is non-naturally occurring or modified. In certain embodiments, at least one component of the composition is non-naturally occurring or modified.
  • the first regulatory element is a promoter, such as an inducible promoter.
  • the second regulatory element is a promoter, such as an inducible promoter.
  • the target sequence is an RNA sequence from a prokaryotic or eukaryotic cell. In certain embodiments, the target sequence is a non-naturally occurring RNA sequence.
  • the target sequence is present in a cell. In certain embodiments, the target sequence is present within the nucleus or within the cytoplasm (eg, an organelle). In certain embodiments, the cell is a prokaryotic cell. In certain embodiments, the cell is a eukaryotic cell.
  • the protein is linked to one or more NLS sequences.
  • the conjugate or fusion protein comprises one or more NLS sequences.
  • the NLS sequence is linked to the N-terminus or C-terminus of the protein.
  • the NLS sequence is fused to the N-terminus or C-terminus of the protein.
  • one type of vector is a plasmid, which refers to a circular double stranded DNA loop in which additional DNA fragments can be inserted, for example, by standard molecular cloning techniques.
  • a viral vector in which a virus-derived DNA or RNA sequence is present for packaging a virus (eg, retrovirus, replication-defective retrovirus, adenovirus, replication-defective adenovirus, and adeno-associated In the vector of the virus).
  • the viral vector also comprises a polynucleotide carried by a virus for transfection into a host cell.
  • vectors e.g., bacterial vectors having bacterial origins of replication and episomal mammalian vectors
  • Other vectors e.g., non-episomal mammalian vectors
  • certain vectors are capable of directing the expression of genes to which they are operably linked.
  • Such vectors are referred to herein as "expression vectors.”
  • Common expression vectors used in recombinant DNA techniques are typically in the form of plasmids.
  • the recombinant expression vector can comprise a nucleic acid molecule of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vector comprises one or more regulatory elements selected based on the host cell to be used for expression.
  • the regulatory element is operably linked to the nucleic acid sequence to be expressed.
  • compositions of the ninth and tenth aspects can be delivered by any method known in the art.
  • Such methods include, but are not limited to, electroporation, lipofection, nuclear transfection, microinjection, sonoporation, gene gun, calcium phosphate mediated transfection, cation transfection, lipofection, dendritic Transfection, heat shock transfection, nuclear transfection, magnetic transfection, lipofection, puncture transfection, optical transfection, reagent-enhanced nucleic acid uptake, and via liposomes, immunoliposomes, viral particles, artificial viruses Delivery of body, etc.
  • the present invention provides a delivery composition
  • a delivery composition comprising a delivery vehicle, and one or more selected from the group consisting of a protein, a conjugate, a fusion protein of the invention, as in the fourth aspect
  • the delivery vehicle is a particle.
  • the delivery vehicle is selected from the group consisting of a lipid particle, a sugar particle, a metal particle, a protein particle, a liposome, an exosome, a microvesicle, a gene gun, or a viral vector (eg, replication defective reverse transcription) Virus, lentivirus, adenovirus or adeno-associated virus).
  • a viral vector eg, replication defective reverse transcription
  • the invention provides a kit comprising one or more of the components described above.
  • the kit comprises one or more components selected from the group consisting of a protein, a conjugate, a fusion protein of the invention, an isolated nucleic acid molecule of the fourth aspect, the invention The complex, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth and tenth aspects.
  • the kit of the invention comprises the composition of the ninth aspect. In certain embodiments, the kit further comprises instructions for using the composition.
  • the kit of the invention comprises the composition of the tenth aspect. In certain embodiments, the kit further comprises instructions for using the composition.
  • kits of the invention can be provided in any suitable container.
  • the kit further comprises one or more buffers.
  • the buffer can be any buffer including, but not limited to, sodium carbonate buffer, sodium bicarbonate buffer, borate buffer, Tris buffer, MOPS buffer, HEPES buffer, and combinations thereof.
  • the buffer is basic.
  • the buffer has a pH of from about 7 to about 10.
  • the kit further comprises one or more oligonucleotides corresponding to a targeting sequence for insertion into a vector for operably linking the guide Sequence and adjustment elements.
  • the kit further comprises an RNA template.
  • the RNA template comprises an RNA sequence encoding a protein or a non-coding RNA sequence (eg, a microRNA).
  • the invention provides a method of modifying a target sequence, comprising: the complex of the fifth aspect, the composition of the ninth aspect, the composition of the tenth aspect Or the delivery composition as described herein is contacted with the target sequence or delivered to a cell comprising the target sequence; wherein the target sequence is associated with or is present in the gene of interest And the target sequence is RNA.
  • the target sequence is present in a cell.
  • the cell is a prokaryotic cell.
  • the cell is a eukaryotic cell.
  • the cell is a mammalian cell.
  • the cell is a human cell.
  • the cell is selected from a non-human primate, bovine, porcine or rodent cell.
  • the cell is a non-mammalian eukaryotic cell, such as a poultry or fish.
  • the cell is a plant cell, such as a cell of a cultivated plant (such as cassava, corn, sorghum, wheat, or rice), algae, tree, or vegetable.
  • the target sequence is present in a nucleic acid molecule (eg, a plasmid) in vitro. In certain embodiments, the target sequence is present in a plasmid.
  • a nucleic acid molecule eg, a plasmid
  • the modification refers to a cleavage of the target sequence, such as a double strand break or a single strand break.
  • the target sequence is a ssRNA.
  • the method further comprises contacting the RNA template with the target sequence or delivering to a cell comprising the target sequence.
  • the modification further comprises inserting an RNA template (eg, an exogenous nucleic acid) into the fragment.
  • the protein, conjugate, fusion protein, isolated nucleic acid molecule, complex, vector or composition is included in a delivery vehicle.
  • the delivery vehicle is selected from the group consisting of a lipid particle, a sugar particle, a metal particle, a protein particle, a liposome, an exosome, a viral vector (eg, a replication defective retrovirus, a lentivirus, an adenovirus) Or adeno-associated virus).
  • a viral vector eg, a replication defective retrovirus, a lentivirus, an adenovirus
  • the methods are for RNA interference or modulation of gene expression.
  • the methods modulate gene expression by modulating RNA processing or RNA activation (RNAa).
  • RNA processing can include RNA splicing (including alternative splicing), viral replication (eg, satellite viruses, phage or retroviruses, eg, HBV, HCV, HIV, etc.) or tRNA biosynthesis.
  • the RNAa promotes gene expression.
  • the method inhibits gene expression by interfering with or reducing RNAa.
  • the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated nucleic acid molecule of the fourth aspect, The complex of claim 5, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth aspect, the composition of the tenth aspect
  • a kit or delivery composition of the invention for use in one or more of the following, or in the preparation of a formulation for one or more selected from the group consisting of:
  • the invention provides a method of detecting a target sequence, comprising the complex of the fifth aspect, the composition of the ninth aspect, the composition of the tenth aspect or A delivery composition as described herein is contacted with the target sequence or delivered to a cell comprising the target sequence; wherein the target sequence is RNA.
  • a targeting sequence contained in the complex, composition or delivery composition is capable of hybridizing to the target sequence.
  • the target sequence is present in a nucleic acid molecule in vitro.
  • the target sequence is present in a cell.
  • the cell is a prokaryotic cell.
  • the cell is a living cell.
  • the protein component comprised by the complex, composition or delivery composition carries a detectable label.
  • the protein component comprised by the complex, composition or delivery composition is fused with a fluorescent protein (eg, GFP).
  • a fluorescent protein eg, GFP
  • the method is northern blotting.
  • the northern blotting method involves isolating RNA samples by size by electrophoresis.
  • the complexes or compositions of the invention can be used to specifically bind and detect target RNA sequences.
  • the method is fluorescence in situ hybridization.
  • the protein component of the complex, composition or delivery composition of the invention carries a detectable label (eg, a fluorescent label).
  • the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated nucleic acid molecule of the fourth aspect, The complex of claim 5, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth aspect, the composition of the tenth aspect A kit or delivery composition of the invention for use in detecting a target sequence, or in the preparation of a preparation for detecting a target sequence.
  • Cas13e in the complex when the CRISPR/Cas complex of the invention binds to a target RNA, Cas13e in the complex is activated, followed by cleavage of any nearby ssRNA sequence (ie, collateral cleavage) ). Once Cas13e is primed by the target RNA, other (non-complementary) RNA molecules can be cleaved. This confounding RNA cleavage can cause cytotoxicity or otherwise affect cell physiology or cellular status.
  • the invention provides a method of modulating the state of a target cell, comprising the composition of the fifth aspect, the composition of the ninth aspect, as described in the tenth aspect
  • the composition or delivery composition as described herein is introduced into the target cell.
  • a target sequence that hybridizes to a targeting sequence contained in the complex, composition, or delivery composition is present in the target cell.
  • the modulating the state of the target cell comprises:
  • the cell is a prokaryotic cell.
  • the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated nucleic acid molecule of the fourth aspect, The complex of claim 5, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth aspect, the composition of the tenth aspect
  • a kit or delivery composition of the invention for use in one or more of the following, or in the preparation of a formulation for one or more selected from the group consisting of:
  • the methods as described above may be therapeutic or prophylactic, and may target a particular target cell, cell (sub)pop, or cell/tissue type.
  • the particular cell, cell (sub)pop, or cell/tissue type expresses one or more target sequences, such as one or more specific target RNAs.
  • target cells include tumor cells that express a particular transcript, a given class of neurons, cells that cause autoimmunity, or cells that are infected with a particular pathogen (eg, a virus).
  • the present invention also provides a method of treating a pathological condition characterized by the presence of a defective cell, comprising administering to a subject in need thereof a complex as described in the fifth aspect
  • a method of treating a pathological condition characterized by the presence of a defective cell comprising administering to a subject in need thereof a complex as described in the fifth aspect
  • the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect
  • the nucleic acid molecule, the complex according to the fifth aspect, the isolated nucleic acid molecule according to the sixth aspect, the carrier according to the seventh aspect, the composition according to the ninth aspect, according to the tenth aspect Use of the composition, kit of the invention or delivery composition for the treatment of a pathological condition characterized by the presence of defective cells. It will be appreciated that in the above embodiments, the complex or composition of the invention preferably targets a specific target sequence of the defective cell.
  • the pathological condition characterized by the presence of a defective cell is a tumor.
  • the present invention also provides a method of treating a tumor comprising administering to a subject in need thereof a complex according to the fifth aspect, as described in the ninth aspect A composition, a composition as described in the tenth aspect, or a delivery composition as described herein.
  • the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect
  • the nucleic acid molecule, the complex according to the fifth aspect, the isolated nucleic acid molecule according to the sixth aspect, the carrier according to the seventh aspect, the composition according to the ninth aspect, according to the tenth aspect Use of the composition, kit of the invention or delivery composition for the treatment of a tumor.
  • the complex or composition of the invention preferably targets a tumor cell specific target sequence.
  • the pathological condition characterized by the presence of a defective cell is a disease caused by a pathogen infection or infection by a pathogen.
  • the present invention also provides a method of treating a pathogen infection or a disease caused by a pathogen infection, comprising administering to a subject in need thereof a compound as described in the fifth aspect
  • the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect
  • the complex or composition of the invention preferably targets a specific target sequence (e.g., a target sequence derived from the pathogen) of a cell infected by the pathogen.
  • the pathological condition characterized by the presence of a defective cell is an autoimmune disease.
  • the present invention also provides a method of treating an autoimmune disease, comprising administering to a subject in need thereof a complex according to the fifth aspect, as in the ninth aspect A composition, a composition as described in the tenth aspect, or a delivery composition as described herein.
  • the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect
  • the nucleic acid molecule, the complex according to the fifth aspect, the isolated nucleic acid molecule according to the sixth aspect, the carrier according to the seventh aspect, the composition according to the ninth aspect, according to the tenth aspect Use of the composition, kit of the invention or delivery composition for the treatment of an autoimmune disease.
  • the complex or composition of the invention preferably targets a specific target sequence of a cell (eg, a particular immune cell) that causes the autoimmune disease.
  • the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect
  • Such methods of treatment include gene editing, transcriptome editing, or gene therapy.
  • modifications introduced to cells by the methods of the invention can cause the cells and their progeny to be altered to improve the production of their biological products, such as antibodies, starch, ethanol, or other desired cellular output.
  • modifications introduced into the cell by the methods of the invention can cause the cell and its progeny to include changes that result in a change in the produced biological product.
  • the invention relates to a cell or a progeny thereof obtained by the method as described above, wherein the cell contains a modification that is not found in its wild type.
  • the modification results in a change in transcription or translation of at least one RNA product. In certain embodiments, the modification results in increased expression of at least one RNA product. In certain embodiments, the modification results in decreased expression of at least one RNA product.
  • the cell is a prokaryotic cell.
  • the cell is a eukaryotic cell. In certain embodiments, the cell is a mammalian cell, such as a human cell.
  • the invention also relates to a cell product of a cell or a progeny thereof as described above.
  • the invention also relates to an in vitro, ex vivo or in vivo cell or cell line or a progeny thereof, the cell or cell line or a progeny thereof comprising: the protein of the first aspect, such as the second The conjugate of the aspect, the fusion protein of the third aspect, the isolated nucleic acid molecule of the fourth aspect, the complex of the fifth aspect, the isolated nucleic acid of the sixth aspect A molecule, a carrier according to the seventh aspect, a composition according to the ninth aspect, a composition according to the tenth aspect, a kit of the invention or a delivery composition.
  • the cell is a prokaryotic cell.
  • the cell is a eukaryotic cell. In certain embodiments, the cell is a mammalian cell. In certain embodiments, the cell is a human cell. In certain embodiments, the cell is a non-human mammalian cell, such as a cell of a non-human primate, cow, sheep, pig, dog, monkey, rabbit, rodent (eg, rat or mouse). In certain embodiments, the cell is a non-mammalian eukaryotic cell, such as a poultry bird (eg, chicken), a fish or a crustacean (eg, scorpion, shrimp) cells.
  • a poultry bird eg, chicken
  • fish or a crustacean eg, scorpion, shrimp
  • the cell is a plant cell, such as a cell possessed by a monocot or a dicot or a cultivated plant or a cell of a food crop such as cassava, corn, sorghum, soybean, wheat, oat or rice, for example Algae, tree or production of plants, fruits or vegetables (for example, trees such as citrus, nut trees; nightshade, cotton, tobacco, tomatoes, grapes, coffee, cocoa, etc.).
  • a plant cell such as a cell possessed by a monocot or a dicot or a cultivated plant or a cell of a food crop such as cassava, corn, sorghum, soybean, wheat, oat or rice, for example Algae, tree or production of plants, fruits or vegetables (for example, trees such as citrus, nut trees; nightshade, cotton, tobacco, tomatoes, grapes, coffee, cocoa, etc.).
  • the cell is a stem cell or stem cell line.
  • Cas13e refers to a Cas effector protein first discovered and identified by the present inventors having an amino acid sequence selected from the group consisting of:
  • the Cas13e of the present invention is an endonuclease which binds to a specific site of a target RNA sequence and cleaves under the guidance of a guide RNA.
  • CRISPR complex regional short palindrome repeat
  • Cas CRISPR-CRISPR-related
  • CRISPR system CRISPR system
  • Such transcription products or other elements may comprise a sequence encoding a Cas effector protein and a targeting RNA comprising CRISPR RNA (crRNA), and a trans-acting crRNA (tracrRNA) sequence contained in the CRISPR-Cas9 system, or from a CRISPR locus Other sequences or transcripts.
  • crRNA CRISPR RNA
  • tracrRNA trans-acting crRNA
  • Cas effector protein As used herein, the terms “Cas effector protein”, “Cas effector enzyme” are used interchangeably and refer to any of the proteins presented in the CRISPR-Cas system that are greater than 900 amino acids in length. In some cases, such proteins refer to proteins identified from the Cas locus.
  • the terms “guide RNA”, “mature crRNA” are used interchangeably and have the meaning as commonly understood by one of ordinary skill in the art.
  • the targeting RNA may comprise a direct repeat sequence and a guide sequence, or consist essentially of or consist of a homologous repeat sequence and a guide sequence (also referred to as a spacer sequence in the context of an endogenous CRISPR system). (spacer)) composition.
  • the targeting sequence is any polynucleotide sequence that is sufficiently complementary to the target sequence to hybridize to the target sequence and direct the specific binding of the CRISPR/Cas complex to the target sequence.
  • the degree of complementarity between the targeting sequence and its corresponding target sequence is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, Or at least 99%. Determining the optimal alignment is within the abilities of one of ordinary skill in the art. For example, there are publicly available and commercially available alignment algorithms and programs such as, but not limited to, ClustalW, Smith-Waterman in Matlab, Bowtie, Geneious, Biopython, and SeqMan.
  • the targeting sequence is at least 5, at least 10, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21 in length, At least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 35, at least 40, at least 45 or at least 50 Nucleotides.
  • the guide sequence is no more than 50, 45, 40, 35, 30, 25, 24, 23, 22, 21, 20, 15 in length. , 10 or fewer nucleotides.
  • the isotropic repeats are at least 10, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22 in length. , at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 35, at least 40, at least 45, at least 50, At least 55, at least 56, at least 57, at least 58, at least 59, at least 60, at least 61, at least 62, at least 63, at least 64, at least 65 or at least 70 nucleotides .
  • the same direction repeat sequence is no more than 70, 65, 64, 63, 62, 61, 60, 59, 58, 57, 56 in length. , 55, 50, 45, 40, 35, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 15 , 10 or fewer nucleotides.
  • CRISPR/Cas complex refers to a ribonucleoprotein complex formed by the binding of a guide RNA or a mature crRNA to a Cas protein, which comprises hybridization to a target sequence and with Cas Protein-directed targeting sequences.
  • the ribonucleoprotein complex is capable of recognizing and cleaving a polynucleotide that hybridizes to the targeting RNA or mature crRNA.
  • a target sequence refers to a polynucleotide that is designed to be targeted by a targeting sequence, such as a sequence that is complementary to the targeting sequence, wherein the target Hybridization between the sequence and the targeting sequence will promote the formation of the CRISPR/Cas complex. Complete complementarity is not required as long as sufficient complementarity exists to cause hybridization and promote the formation of a CRISPR/Cas complex.
  • the target sequence is RNA. Therefore, “target sequence” and “target RNA” are used interchangeably in the present invention.
  • the target sequence can be any suitable form of RNA, such as mRNA, tRNA, rRNA, miRNA, siRNA or shRNA.
  • the expression "target sequence” may be any endogenous or exogenous RNA sequence to a cell (eg, a eukaryotic cell).
  • the target sequence is located in the nucleus or cytoplasm of the cell.
  • the target sequence can be located in an organelle of a eukaryotic cell, such as a mitochondria or chloroplast.
  • RNA template A sequence or template that can be used to integrate into an RNA sequence comprising the target sequence is referred to as an "RNA template.”
  • the RNA template is an exogenous nucleic acid.
  • the target RNA can be a sequence encoding a gene product (eg, a protein) (eg, mRNA or pre-mRNA) or a non-coding sequence (eg, ncRNA, lncRNA, tRNA or rRNA).
  • a gene product eg, a protein
  • mRNA or pre-mRNA eg, mRNA or pre-mRNA
  • a non-coding sequence eg, ncRNA, lncRNA, tRNA or rRNA
  • Non-limiting examples of target RNA include sequences associated with signaling biochemical pathways (eg, signaling biochemical pathway-associated RNA) or disease-associated RNA.
  • the "disease-associated RNA” refers to an RNA sequence produced by the RNA sequence in an abnormal level or abnormal form in a tissue or cell affected by the disease, compared to a tissue or a cell not having a disease control. presence.
  • the "disease-associated RNA” may be transcribed from a gene having an abnormally elevated expression level, or may be an RNA transcribed from a gene having abnormally decreased expression, wherein the altered expression level is related to the occurrence and/or development of the disease.
  • Disease-associated RNA also refers to RNA transcribed from a gene having a mutation or gene mutation that is directly responsible for or is in linkage disequilibrium with the gene causing the disease.
  • the translated product may be known or unknown and may be at a normal or abnormal level.
  • the target RNA can comprise interfering RNA (ie, RNA present in the RNA interference pathway, eg, shRNA, siRNA, etc.).
  • the target RNA is a microRNA (miRNA).
  • wild type has the meaning commonly understood by those skilled in the art to mean a typical form of a organism, a strain, a gene, or a feature that distinguishes it from a mutant or variant when it exists in nature. It can be isolated from sources in nature and not intentionally modified.
  • nucleic acid molecule or polypeptide As used herein, the terms “non-naturally occurring” or “engineered” are used interchangeably and refer to artificial participation. When these terms are used to describe a nucleic acid molecule or polypeptide, it is meant that the nucleic acid molecule or polypeptide is at least substantially freed from at least one other component of its association in nature or as found in nature.
  • an "ortholog" of a protein as referred to herein refers to a protein belonging to a different species that performs the same or similar function as a protein that is an ortholog thereof.
  • identity is used to mean the matching of sequences between two polypeptides or between two nucleic acids.
  • a position in the two sequences being compared is occupied by the same base or amino acid monomer subunit (for example, a position in each of the two DNA molecules is occupied by adenine, or two
  • Each position in each of the polypeptides is occupied by lysine, and then each molecule is identical at that position.
  • the "percent identity" between the two sequences is a function of the number of matching positions shared by the two sequences divided by the number of positions to be compared x 100. For example, if 6 of the 10 positions of the two sequences match, then the two sequences have 60% identity.
  • the DNA sequences CTGACT and CAGGTT share 50% identity (3 out of a total of 6 positions match).
  • the comparison is made when the two sequences are aligned to produce maximum identity.
  • Such alignment can be achieved by, for example, the method of Needleman et al. (1970) J. Mol. Biol. 48: 443-453, which can be conveniently performed by a computer program such as the Align program (DNAstar, Inc.). It is also possible to use the algorithm of E. Meyers and W. Miller (Comput. Appl Biosci., 4: 11-17 (1988)) integrated into the ALIGN program (version 2.0), using the PAM 120 weight residue table.
  • the gap length penalty of 12 and the gap penalty of 4 were used to determine the percent identity between the two amino acid sequences.
  • the Needleman and Wunsch (J MoI Biol. 48: 444-453 (1970)) algorithms in the GAP program integrated into the GCG software package can be used, using the Blossum 62 matrix or The PAM250 matrix and the gap weight of 16, 14, 12, 10, 8, 6 or 4 and the length weight of 1, 2, 3, 4, 5 or 6 to determine the percent identity between two amino acid sequences .
  • vector refers to a nucleic acid vehicle into which a polynucleotide can be inserted.
  • a vector is referred to as an expression vector when the vector enables expression of the protein encoded by the inserted polynucleotide.
  • the vector can be introduced into the host cell by transformation, transduction or transfection, and the genetic material element carried thereby can be expressed in the host cell.
  • Vectors are well known to those skilled in the art and include, but are not limited to, plasmids; phagemids; cosmids; artificial chromosomes, such as yeast artificial chromosomes (YAC), bacterial artificial chromosomes (BAC), or P1 derived artificial chromosomes (PAC).
  • Phage such as lambda phage or M13 phage and animal virus.
  • Animal viruses useful as vectors include, but are not limited to, retroviruses (including lentiviruses), adenoviruses, adeno-associated viruses, herpes viruses (such as herpes simplex virus), poxviruses, baculoviruses, papillomaviruses, nipples Multi-tumor vacuolar virus (such as SV40).
  • a vector may contain a variety of elements that control expression, including, but not limited to, promoter sequences, transcription initiation sequences, enhancer sequences, selection elements, and reporter genes. In addition, the vector may also contain an origin of replication.
  • the term "host cell” refers to a cell that can be used to introduce a vector, including, but not limited to, a prokaryotic cell such as Escherichia coli or Bacillus subtilis, such as a fungal cell such as a yeast cell or an Aspergillus.
  • a prokaryotic cell such as Escherichia coli or Bacillus subtilis
  • a fungal cell such as a yeast cell or an Aspergillus.
  • S2 Drosophila cells or insect cells such as Sf9
  • animal cells such as fibroblasts, CHO cells, COS cells, NSO cells, HeLa cells, BHK cells, HEK 293 cells or human cells.
  • a vector can be introduced into a host cell to thereby produce a transcript, protein, or peptide, including a protein, fusion protein, isolated nucleic acid molecule, etc. as described herein (eg, a CRISPR transcript, such as a nucleic acid transcript) , protein, or enzyme).
  • a CRISPR transcript such as a nucleic acid transcript
  • regulatory element is intended to include promoters, enhancers, internal ribosome entry sites (IRES), and other expression control elements (eg, transcription termination signals, such as polyadenylation signals and Poly U sequence), a detailed description can be found in Goeddel, GENE EXPRE SSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego ), California (1990).
  • regulatory elements include those sequences that direct constitutive expression of a nucleotide sequence in a plurality of types of host cells, as well as those sequences that direct expression of the nucleotide sequence only in certain host cells (eg, Tissue-specific regulatory sequence).
  • Tissue-specific promoters can primarily direct expression in a desired tissue of interest, such as muscle, neurons, bone, skin, blood, specific organs (eg, liver, pancreas), or specific cell types (eg, Lymphocytes).
  • the regulatory elements may also direct expression in a time-dependent manner (eg, in a cell cycle dependent or developmental stage dependent manner), which may or may not be tissue or cell type specific.
  • the term "regulatory element” encompasses enhancer elements such as WPRE; CMV enhancer; R-U5' fragment in LTR of HTLV-I ((Mol. Cell. Biol., 8th ( 1) Vol., pp. 466-472, 1988); SV40 enhancer; and intron sequence between exons 2 and 3 of rabbit ⁇ -globin (Proc. Natl. Acad. Sci. USA., Vol. 78(3), pp. 1527-31, 1981).
  • promoter has the meaning well-known to those skilled in the art and refers to a non-coding nucleotide sequence located upstream of the gene that initiates expression of the downstream gene.
  • a constitutive promoter is a nucleotide sequence that, when operably linked to a polynucleotide encoding or defining a gene product, results in a gene product in the cell under most or all physiological conditions of the cell. The production.
  • An inducible promoter is a nucleotide sequence that, when operably linked to a polynucleotide encoding or defining a gene product, results in substantially only when an inducer corresponding to the promoter is present in the cell The gene product is produced intracellularly.
  • a tissue-specific promoter is a nucleotide sequence that, when operably linked to a polynucleotide encoding or defining a gene product, is substantially only caused when the cell is a cell of the tissue type corresponding to the promoter Gene products are produced in the cells.
  • operably linked is intended to mean that a nucleotide sequence of interest is linked to the one or more regulatory elements in a manner that allows expression of the nucleotide sequence (eg, In an in vitro transcription/translation system or in the host cell when the vector is introduced into a host cell).
  • complementarity refers to the ability of a nucleic acid to form one or more hydrogen bonds with another nucleic acid sequence by means of conventional Watson-Crick or other non-traditional types. Percent complement indicates the percentage of residues in a nucleic acid molecule that can form a hydrogen bond (eg, Watson-Crick base pairing) with a second nucleic acid sequence (eg, 5, 6, 7, 8 out of 10) 9, 10, that is 50%, 60%, 70%, 80%, 90%, and 100% complementary). "Completely complementary” means that all contiguous residues of one nucleic acid sequence form a hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence.
  • substantially complementary means having 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, At least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98 in the region of 30, 35, 40, 45, 50 or more nucleotides %, 99%, or 100% complementarity, or two nucleic acids that hybridize under stringent conditions.
  • stringent conditions for hybridization refers to conditions under which a nucleic acid that is complementary to a target sequence primarily hybridizes to the target sequence and does not substantially hybridize to a non-target sequence. Stringent conditions are usually sequence dependent and vary depending on many factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in “Technology Techniques In Biochemi stry And Molecular Biology-Hybridization With Nucleic Acid Probes" by Tijssen (1993). ), Part I, Chapter 2, “Overview of principles of hybridization and the strategy of nucleic acid probe assay", Elsevier, New York.
  • hybridization refers to a reaction in which one or more polynucleotides react to form a complex that hydrogen bonds through the bases between these nucleotide residues. And stabilized. Hydrogen bonding can occur by means of Watson-Crick base pairing, Hoogstein binding or in any other sequence specific manner.
  • the complex may comprise two chains forming one duplex, three or more chains forming a multi-strand complex, a single self-hybridizing strand, or any combination of these.
  • the hybridization reaction can constitute a step in a broader process, such as the initiation of PCR, or the cleavage of a polynucleotide via an enzyme. A sequence that is capable of hybridizing to a given sequence is referred to as the "complement" of the given sequence.
  • the term "expression” refers to a process by which a DNA template is transcribed into a polynucleotide (eg, transcribed into mRNA or other RNA transcript) and/or transcribed mRNA, which is subsequently translated into a peptide, The process of a polypeptide or protein.
  • the transcripts and encoded polypeptides may be collectively referred to as "gene products.” If the polynucleotide is derived from genomic DNA, expression can include splicing of mRNA in eukaryotic cells.
  • linker refers to a linear polypeptide formed by the joining of multiple amino acid residues by peptide bonds.
  • the linker of the invention may be a synthetic amino acid sequence, or a naturally occurring polypeptide sequence, such as a polypeptide having the function of a hinge region.
  • linker polypeptides are well known in the art (see, for example, Holliger, P. et al. (1993) Proc. Natl. Acad. Sci. USA 90:6444-6448; Poljak, RJ et al. (1994) Structure 2: 1121- 1123).
  • treating refers to treating or curing a condition, delaying the onset of symptoms of the condition, and/or delaying the progression of the condition.
  • the term "subject” includes, but is not limited to, various animals, such as mammals, such as bovine, equine, ovine, porcine, canine, feline, A rabbit, a rodent (eg, a mouse or rat), a non-human primate (eg, a macaque or a cynomolgus monkey) or a human.
  • mammals such as bovine, equine, ovine, porcine
  • canine feline
  • a rabbit a rodent (eg, a mouse or rat), a non-human primate (eg, a macaque or a cynomolgus monkey) or a human.
  • rodent eg, a mouse or rat
  • non-human primate eg, a macaque or a cynomolgus monkey
  • the Cas effector protein and system of the invention can efficiently perform RNA editing, detecting living virus, RNA interference, gene selective scission, fluorescence in situ hybridization, etc., and realizing regulation of genes at the level of gene transcription. This will provide important resources for new applications in genomic engineering and biotechnology.
  • Figure 1 shows the two HEPN domains of the cas13e protein.
  • Figure 2 shows the secondary structure of the repeat sequence of cas13e.
  • A, B, C, D, E, F, and G are the repeat secondary structures of cas13e1.1, cas13e1.2, cas13e1.3, cas13e1.4, cas13e1.5, cas13e2.1, and cas13e2.2, respectively.
  • Figure 3 shows the activity of cas13e1.3 protein on the processing of pre-crRNA in bacteria.
  • Figure 4 shows the in vitro processing activity of cas13e1.3 protein on pre-crRNA.
  • SEQ ID NO: description 1 Amino acid sequence of Cas13e1.1 2 Amino acid sequence of Cas13e1.2 3 Amino acid sequence of Cas13e1.3 4 Amino acid sequence of Cas13e1.4 5 Amino acid sequence of Cas13e1.5 6 Amino acid sequence of Cas13e2.1 7 Amino acid sequence of Cas13e2.2 8 Encoding nucleic acid sequence of Cas13e1.1 9 Encoding nucleic acid sequence of Cas13e1.2 10 Encoding nucleic acid sequence of Cas13e1.3 11 Encoding nucleic acid sequence of Cas13e1.4
  • Cas13e a novel family of Cas effector proteins, namely Cas13e, which can be divided into two subfamilies, namely cas13e.1 and cas13e.2.
  • the family protein sequences are designated Cas13e1.1 (SEQ ID NO: 1), Cas13e1.2 (SEQ ID NO: 2), Cas13e1.3 (SEQ ID NO: 3), Cas13e1.4 (SEQ ID NO: 4), Cas13e1.5 (SEQ ID NO: 5), Cas13e2.1 (SEQ ID NO: 6), Cas13e2.2 (SEQ ID NO: 7), the DNA encoding these seven proteins are as shown in SEQ ID NOs: 8-14, respectively. Show.
  • the prototype homologous repeat sequences corresponding to the seven proteins are shown in SEQ ID NOs: 15-21.
  • the two HEPN domains of the above seven cas13e proteins were obtained by performing multiple sequence alignment of the proteins to analyze the functional domains containing "RxxxxH” (Fig. 1). Further, the secondary structure of the prototype repeat sequence of the above seven cas13e proteins was obtained by analysis by Vienna RNA software (Fig. 2).
  • a double-stranded DNA molecule of SEQ ID NO: 10 is artificially synthesized, and a double-stranded DNA molecule encoding SEQ ID NO: 32 (CRISPR array: repeat + spacer 1 + repeat + spacer + repeat) is artificially synthesized.
  • the double-stranded DNA molecule synthesized in the step 1 was ligated to the prokaryotic expression vector pACYC-Duet-1 to obtain a recombinant plasmid pACYC-Duet-1-CRISPR/cas13e1.3.
  • the recombinant plasmid pACYC-Duet-1-CRISPR/cas13e1.3 was subjected to one-generation sequencing to confirm the DNA sequence described in the step 1.
  • the recombinant plasmid pACYC-Duet-1-CRISPR/cas13e1.3 was introduced into Escherichia coli EC100 to obtain a recombinant strain, and the recombinant strain was named EC100-CRISPR/cas13e1.3.
  • the monoclonal clone of EC100-CRISPR/cas13e1.3 was taken, inoculated into 100 mL of LB liquid medium (containing 50 ⁇ g/mL ampicillin), and cultured at 37 ° C, shaking at 200 rpm for 12 hours to obtain a culture bacterial solution.
  • Extraction of bacterial RNA Transfer 1.5 mL of the bacterial culture to a pre-chilled microcentrifuge tube, centrifuge at 6,000 x g for 5 minutes at 4 °C. After centrifugation, the supernatant was discarded, and the cell pellet was resuspended in 200 ⁇ L MaxBacterial Enhancement Reagent preheated to 95 ° C, and mixed by pipetting. Incubate at 95 ° C for 4 minutes. Add 1 mL to the lysate Mix well by pipetting and incubate for 5 minutes at room temperature. Add 0.2 mL of cold chloroform, mix by shaking the tube for 15 seconds, and incubate for 2-3 minutes at room temperature.
  • RNA pellet was dissolved in 50 ⁇ L of RNase-free water and incubated at 60 ° C for 10 minutes.
  • RNAI Digestion of DNA: 20 ug of RNA was dissolved in 39.5 ⁇ L dH 2 O at 65 ° C for 5 min. On ice for 5 min, 0.5 ⁇ L of RNAI, 5 ⁇ L of buffer, 5 ⁇ L of DNase I, and 37 ° C for 45 min (50 ⁇ L system) were added. Add 50 ⁇ L dH 2 O and adjust the volume to 100 ⁇ L.
  • the precipitate was washed by adding 350 ⁇ L of 75% ethanol, centrifuged at 16000 g for 10 min at 4 ° C, and the supernatant was discarded. Dry, add 20 ⁇ L of RNase-free water, 65 ° C, and dissolve the precipitate for 5 min. NanoDrop measures the concentration and runs the glue.
  • the precipitate was washed by adding 350 ⁇ L of 75% ethanol, centrifuged at 16000 g for 10 min at 4 ° C, and the supernatant was discarded. Dry, add 21 ⁇ L of RNase-free water, 65 ° C, dissolve the precipitate for 5 min, and measure the concentration by NanoDrop.
  • RNA monophosphorylation 20 ⁇ L of RNA, 1 min at 90 ° C, and cooled on ice for 5 min. 2 ⁇ L of RNA 5'Polphosphatase 10 ⁇ Reaction buffer, 0.5 ⁇ L of Inhibitor, 1 ⁇ L of RNA 5'Polphosphatase (20 Units), RNase-free water to 20 ⁇ L, and 37 ° C for 60 min were added. Add 80 ⁇ L dH 2 O and adjust the volume to 100 ⁇ L.
  • cDNA library 16.5 ⁇ L RNase-free water. 5 ⁇ L Poly(A) Polymerase 10 ⁇ Reaction buffer. 5 ⁇ L of 10 mM ATP. 1.5 ⁇ L RiboGuard RNase Inhibitor. 20 ⁇ L RNA Substrate. 2 ⁇ L Poly(A) Polymerase (4 Units). 50 ⁇ L total volume. 37 ° C for 20 min. Add 50 ⁇ L dH 2 O and adjust the volume to 100 ⁇ L.
  • the cDNA library was added to the sequencing linker and sent to Beijing Berry Hekang for sequencing.
  • RNA sequence of 25 nt to 50 nt was retained and aligned to the reference sequence of the CRISPR array using bowtie.
  • the results are shown in Fig. 3.
  • the peak shape is the structure of the second-generation sequencing sequence alignment of the CRISPR block
  • the vertical line shows the enzyme cleavage site
  • the gray rectangle is the Repeat structure diagram
  • the dark gray diamond shape is the spacer sequence structure diagram.
  • Primers for designing a precrRNA transcript template The structure of the transcription template is: T7 promoter + CRISPR array of Cas13e1.3 (SEQ ID NO: 32). The primers were designed using Primer 5.0 software to ensure that the upstream and downstream primers have overlapping sequences of at least 18 bp. A double stranded DNA template was obtained using a primer annealing procedure. The template was purified using a MinElute PCR Purifcation Kit, and the concentration was measured with Nanodrop, and stored at -20 ° C until use.
  • RNA transcription was performed using NEB's HiScribe T7 high-efficiency RNA synthesis kit, and the PCR reaction procedure was set to: 37 ° C / 3 h or 31 ° C / forever, DNAseI was added, 37 ° C / 45 min.
  • Cas13e1.3 is an EDTA-sensitive protein
  • an EDTA group was also designed, which added EDTA to the reaction system.
  • the SYBR-Gold nucleic acid gel dye was added to 1 ⁇ TBE running buffer, placed in a gel, and stained for 10 minutes at room temperature.
  • Cas13e1.3 is capable of cleaving pre-crRNA in vitro and is not an EDTA-sensitive protein.

Abstract

The invention relates to the field of nucleic acid editing, in particular to the field of clustered regularly interspaced short palindromic repeats (CRISPR) technology. In particular, the invention relates to a Cas effector protein, a fusion protein comprising the Cas effector protein, and nucleic acid molecules encoding the same. The invention also relates to a compound and a composition for nucleic acid editing (e.g., gene or genome editing) comprising the protein or the fusion protein of the invention, or nucleic acid molecules encoding the same. The invention also relates to a method for nucleic acid editing (e.g., gene or genome editing) using the protein or the fusion protein of the invention.

Description

一种RNA编辑的CRISPR/Cas效应蛋白及系统RNA-edited CRISPR/Cas effector protein and system 技术领域Technical field
本发明涉及核酸编辑领域,特别是规律成簇的间隔短回文重复(CRISPR)技术领域。具体而言,本发明涉及Cas效应蛋白,包含此类蛋白的融合蛋白,以及编码它们的核酸分子。本发明还涉及用于核酸编辑(例如,基因或基因组编辑)的复合物和组合物,其包含本发明的蛋白或融合蛋白,或编码它们的核酸分子。本发明还涉及用于核酸编辑(例如,基因或基因组编辑)的方法,其使用包含本发明的蛋白或融合蛋白。The invention relates to the field of nucleic acid editing, in particular to the field of regularly clustered short palindrome repetition (CRISPR) technology. In particular, the invention relates to Cas effector proteins, fusion proteins comprising such proteins, and nucleic acid molecules encoding the same. The invention also relates to complexes and compositions for nucleic acid editing (e.g., gene or genome editing) comprising a protein or fusion protein of the invention, or a nucleic acid molecule encoding the same. The invention also relates to methods for nucleic acid editing (eg, gene or genome editing) using a protein or fusion protein comprising the invention.
背景技术Background technique
CRISPR-cas系统广泛存在于细菌和古细菌中,对病毒有着适应性免疫。CRISPR-cas介导的免疫主要包括三个阶段:(1)适应性阶段,cas1-cas2蛋白复合体把目标DNA片段插入到CRISPR重复区域;(2)表达与加工阶段,CRISPR序列转录并且被cas效应蛋白加工成熟的crRNA;(3)干扰阶段,cas蛋白和crRNA组装成复合体加工目标DNA或者RNA。The CRISPR-cas system is widely found in bacteria and archaea and is adaptive to the virus. CRISPR-cas-mediated immunity mainly includes three stages: (1) adaptive stage, cas1-cas2 protein complex inserts the target DNA fragment into the CRISPR repeat region; (2) expression and processing stage, CRISPR sequence transcription and cas The effector protein processes the mature crRNA; (3) the interference phase, the cas protein and the crRNA assemble into a complex to process the target DNA or RNA.
目前,CRISPR-cas系统进化形成了多种防御机制,包括RNA介导的DNA或者RNA编辑机制。CRISPR-cas系统分为class1和class2。Class1由多个蛋白构成复合体发挥功能,相对复杂;Class2系统结构相对简单,效应蛋白中只有一个,因此被广泛应用。在class2中,TypeII和typeV编辑DNA;Type VI CRISPR-Cas系统对目标RNA有切割活性。与DNA编辑的CRISPR-Cas系统相比,RNA编辑的CRISPR-Cas系统可以实现在基因转录水平对基因进行调控,还可以检测活体病毒,RNA干扰,基因选择性剪切,荧光原位杂交等。因此,挖掘新的RNA编辑系统变得尤为重要。Currently, the CRISPR-cas system has evolved into a variety of defense mechanisms, including RNA-mediated DNA or RNA editing mechanisms. The CRISPR-cas system is divided into class1 and class2. Class1 is a complex composed of multiple proteins and is relatively complex. The Class 2 system is relatively simple in structure and has only one effector protein, so it is widely used. In class 2, Type II and typeV edit DNA; Type VI CRISPR-Cas system has cleavage activity on the target RNA. Compared with the DNA-edited CRISPR-Cas system, the RNA-edited CRISPR-Cas system can regulate genes at the level of gene transcription, as well as detect live viruses, RNA interference, gene-selective splicing, and fluorescence in situ hybridization. Therefore, it is especially important to mine new RNA editing systems.
发明内容Summary of the invention
本申请的发明人经过大量实验和反复摸索,出人意料地发现了一种新型RNA指导的核糖核酸酶内切酶。基于这一发现,本发明人开发了新的RNA编辑的CRISPR/Cas系统以及基于该系统的基因编辑方法。The inventors of the present application have unexpectedly discovered a novel RNA-directed ribonuclease endonuclease after extensive experimentation and repeated exploration. Based on this finding, the inventors have developed a new RNA editing CRISPR/Cas system and a gene editing method based on the system.
Cas效应蛋白Cas effector protein
因此,在第一方面,本发明提供了一种蛋白,其具有SEQ ID NOs:1-7任一项所示 的氨基酸序列或其直系同源物、同源物、变体或功能性片段;其中,所述直系同源物、同源物、变体或功能性片段基本保留了其所源自的序列的生物学功能。Accordingly, in a first aspect, the invention provides a protein having the amino acid sequence set forth in any one of SEQ ID NOs: 1-7, or an ortholog, homolog, variant or functional fragment thereof; Wherein the ortholog, homologue, variant or functional fragment substantially retains the biological function of the sequence from which it is derived.
在本发明中,所述生物学功能包括但不限于,与导向RNA结合的活性、核糖核酸内切酶活性、在导向RNA引导下与靶序列特定位点结合并切割的活性。In the present invention, the biological functions include, but are not limited to, activity binding to a targeting RNA, endoribonuclease activity, binding to a specific site of a target sequence under the guidance of a targeting RNA, and cleavage.
在某些实施方案中,所述直系同源物、同源物、变体与其所源自的序列相比具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性。In certain embodiments, the ortholog, homolog, variant has at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least compared to the sequence from which it is derived. 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity.
在某些实施方案中,所述直系同源物、同源物、变体与SEQ ID NOs:1-7任一项所示的序列相比具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性,并且基本保留了其所源自的序列的生物学功能(例如,与导向RNA结合的活性、核糖核酸内切酶活性、在导向RNA引导下与靶序列特定位点结合并切割的活性)。In certain embodiments, the ortholog, homolog, variant has at least 80%, at least 85%, at least 90%, compared to the sequence set forth in any one of SEQ ID NOs: 1-7, At least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity and substantially retains the origin of the sequence The biological function of the sequence (eg, activity binding to the targeting RNA, endoribonuclease activity, activity that binds to a specific site of the target sequence and cleaves under the guidance of a targeting RNA).
在某些实施方案中,所述蛋白是CRISPR/Cas系统中的效应蛋白。In certain embodiments, the protein is an effector protein in the CRISPR/Cas system.
在某些实施方案中,本发明的蛋白包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(i)SEQ ID NOs:1-7任一项所示的序列;(i) the sequence of any one of SEQ ID NOs: 1-7;
(ii)与SEQ ID NOs:1-7任一项所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in any one of SEQ ID NOs: 1-7 (eg, 1, 2, 3, 4, 5, 6 a sequence of 7, 7 or 9 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NOs:1-7任一项所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95% of the sequence set forth in any one of SEQ ID NOs: 1-7 a sequence of at least 96%, at least 97%, at least 98%, or at least 99% sequence identity.
在某些实施方案中,本发明的蛋白包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(i)SEQ ID NO:1所示的序列;(i) the sequence shown as SEQ ID NO: 1;
(ii)与SEQ ID NO:1所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 1 ( eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NO:1所示的序列具有至少80%、至少85%、至少90%、至少 91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, of the sequence set forth in SEQ ID NO: At least 97%, at least 98%, or at least 99% of sequences of sequence identity.
在某些实施方案中,本发明的蛋白具有SEQ ID NO:1所示的氨基酸序列。In certain embodiments, the protein of the invention has the amino acid sequence set forth in SEQ ID NO:1.
在某些实施方案中,本发明的蛋白包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(i)SEQ ID NO:2所示的序列;(i) the sequence shown as SEQ ID NO: 2;
(ii)与SEQ ID NO:2所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 2 (eg, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NO:2所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, of the sequence set forth in SEQ ID NO: At least 97%, at least 98%, or at least 99% of sequences of sequence identity.
在某些实施方案中,本发明的蛋白具有SEQ ID NO:2所示的氨基酸序列。In certain embodiments, the protein of the invention has the amino acid sequence set forth in SEQ ID NO:2.
在某些实施方案中,本发明的蛋白包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(i)SEQ ID NO:3所示的序列;(i) the sequence shown as SEQ ID NO:3;
(ii)与SEQ ID NO:3所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) having one or more amino acid substitutions, deletions or additions compared to the sequence set forth in SEQ ID NO: 3 ( eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NO:3所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, and the sequence set forth in SEQ ID NO: At least 97%, at least 98%, or at least 99% of sequences of sequence identity.
在某些实施方案中,本发明的蛋白具有SEQ ID NO:3所示的氨基酸序列。In certain embodiments, the protein of the invention has the amino acid sequence set forth in SEQ ID NO:3.
在某些实施方案中,本发明的蛋白包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(i)SEQ ID NO:4所示的序列;(i) the sequence shown as SEQ ID NO:4;
(ii)与SEQ ID NO:4所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 4 (eg, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NO:4所示的序列具有至少80%、至少85%、至少90%、至少 91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, of the sequence set forth in SEQ ID NO: At least 97%, at least 98%, or at least 99% of sequences of sequence identity.
在某些实施方案中,本发明的蛋白具有SEQ ID NO:4所示的氨基酸序列。In certain embodiments, the protein of the invention has the amino acid sequence set forth in SEQ ID NO:4.
在某些实施方案中,本发明的蛋白包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(i)SEQ ID NO:5所示的序列;(i) the sequence shown as SEQ ID NO:5;
(ii)与SEQ ID NO:5所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 5 (eg, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NO:5所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, and the sequence set forth in SEQ ID NO: At least 97%, at least 98%, or at least 99% of sequences of sequence identity.
在某些实施方案中,本发明的蛋白具有SEQ ID NO:5所示的氨基酸序列。In certain embodiments, the protein of the invention has the amino acid sequence set forth in SEQ ID NO:5.
在某些实施方案中,本发明的蛋白包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(i)SEQ ID NO:6所示的序列;(i) the sequence shown as SEQ ID NO:6;
(ii)与SEQ ID NO:6所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 6 ( eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NO:6所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, of the sequence set forth in SEQ ID NO: At least 97%, at least 98%, or at least 99% of sequences of sequence identity.
在某些实施方案中,本发明的蛋白具有SEQ ID NO:6所示的氨基酸序列。In certain embodiments, the protein of the invention has the amino acid sequence set forth in SEQ ID NO: 6.
在某些实施方案中,本发明的蛋白包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the protein of the invention comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(i)SEQ ID NO:7所示的序列;(i) the sequence shown as SEQ ID NO:7;
(ii)与SEQ ID NO:7所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 7 ( eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NO:7所示的序列具有至少80%、至少85%、至少90%、至少 91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, and the sequence set forth in SEQ ID NO: At least 97%, at least 98%, or at least 99% of sequences of sequence identity.
在某些实施方案中,本发明的蛋白具有SEQ ID NO:7所示的氨基酸序列。In certain embodiments, the protein of the invention has the amino acid sequence set forth in SEQ ID NO:7.
衍生的蛋白Derived protein
本发明的蛋白可进行衍生化,例如被连接至另一个分子(例如另一个多肽或蛋白)。通常,蛋白的衍生化(例如,标记)不会影响该蛋白的期望活性(例如,与导向RNA结合的活性、核酸内切酶活性、在导向RNA引导下与靶序列特定位点结合并切割的活性)。因此,本发明的蛋白还意欲包括此类衍生化的形式。例如,可以将本发明的蛋白功能性连接(通过化学偶合、基因融合、非共价连接或其它方式)于一个或多个其它分子基团,例如另一个蛋白或多肽,检测试剂,药用试剂等。A protein of the invention can be derivatized, for example, linked to another molecule (e.g., another polypeptide or protein). In general, derivatization (eg, labeling) of a protein does not affect the desired activity of the protein (eg, activity associated with targeting RNA, endonuclease activity, binding to a specific site of the target sequence under the guidance of a targeting RNA, and cleavage active). Thus, the proteins of the invention are also intended to include such derivatized forms. For example, a protein of the invention can be functionally linked (by chemical coupling, gene fusion, non-covalent attachment or otherwise) to one or more other molecular groups, such as another protein or polypeptide, a detection reagent, a pharmaceutical reagent Wait.
特别地,可以将本发明的蛋白连接其他功能性单元。例如,可以将其与核定位信号(NLS)序列连接,以提高本发明的蛋白进入细胞核的能力。例如,可以将其与靶向部分连接,以使得本发明的蛋白具有靶向性。例如,可以将其与可检测的标记连接,以便于对本发明的蛋白进行检测。例如,可以将其与表位标签连接,以便于本发明的蛋白的表达、检测、示踪和/或纯化。In particular, the proteins of the invention may be linked to other functional units. For example, it can be ligated to a nuclear localization signal (NLS) sequence to increase the ability of the proteins of the invention to enter the nucleus. For example, it can be linked to a targeting moiety to render the protein of the invention targeted. For example, it can be linked to a detectable label to facilitate detection of the proteins of the invention. For example, it can be linked to an epitope tag to facilitate expression, detection, tracing, and/or purification of the proteins of the invention.
缀合物Conjugate
因此,在第二方面,本发明提供了一种缀合物,其包含如上所述的蛋白和修饰部分。Thus, in a second aspect, the invention provides a conjugate comprising a protein and a modified moiety as described above.
在某些实施方案中,所述修饰部分选自另外的蛋白或多肽、可检测的标记或其任意组合。In certain embodiments, the modified moiety is selected from another protein or polypeptide, a detectable label, or any combination thereof.
在某些实施方案中,所述另外的蛋白或多肽选自表位标签、报告基因序列、核定位信号(NLS)序列、靶向部分、转录激活结构域(例如,VP64)、转录抑制结构域(例如,KRAB结构域或SID结构域)、核酸酶结构域(例如,Fok1),具有选自下列的活性的结构域:甲基化酶活性,去甲基化酶,转录激活活性,转录抑制活性,转录释放因子活性,组蛋白修饰活性,核酸酶活性,单链RNA切割活性,双链RNA切割活性,单链DNA切割活性,双链DNA切割活性和核酸结合活性;以及其任意组合。And X. (eg, a KRAB domain or a SID domain), a nuclease domain (eg, Fok1), having a domain selected from the group consisting of methylase activity, demethylase, transcriptional activation activity, transcriptional repression Activity, transcription release factor activity, histone modification activity, nuclease activity, single-strand RNA cleavage activity, double-stranded RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA cleavage activity and nucleic acid binding activity; and any combination thereof.
在某些实施方案中,本发明的缀合物包含一个或多个NLS序列,例如SV40病毒大T抗原的NLS。在某些示例性实施方案中,所述NLS序列如SEQ ID NO:22所示。在某些实施方案中,所述NLS序列位于、靠近或接近本发明的蛋白的末端(例如,N端或C 端)。在某些示例性实施方案中,所述NLS序列位于、靠近或接近本发明的蛋白的C端。In certain embodiments, a conjugate of the invention comprises one or more NLS sequences, such as the NLS of the SV40 viral large T antigen. In certain exemplary embodiments, the NLS sequence is set forth in SEQ ID NO:22. In certain embodiments, the NLS sequence is located at, near, or near the end of the protein of the invention (eg, the N-terminus or the C-terminus). In certain exemplary embodiments, the NLS sequence is located at, near, or near the C-terminus of the protein of the invention.
在某些实施方案中,本发明的缀合物包含表位标签(epitope tag)。这类表位标签是本领域技术人员熟知的,其实例包括但不限于His、V5、FLAG、HA、Myc、VSV-G、Trx等,并且本领域技术人员已知如何根据期望目的(例如,纯化、检测或示踪)选择合适的表位标签。In certain embodiments, the conjugates of the invention comprise an epitope tag. Such epitope tags are well known to those skilled in the art, examples of which include, but are not limited to, His, V5, FLAG, HA, Myc, VSV-G, Trx, etc., and those skilled in the art know how to achieve the desired purpose (eg, Purify, test or trace) Select the appropriate epitope tag.
在某些实施方案中,本发明的缀合物包含报告基因序列。这类报告基因是本领域技术人员熟知的,其实例包括但不限于GST、HRP、CAT、GFP、HcRed、DsRed、CFP、YFP、BFP等。In certain embodiments, a conjugate of the invention comprises a reporter gene sequence. Such reporter genes are well known to those skilled in the art, and examples include, but are not limited to, GST, HRP, CAT, GFP, HcRed, DsRed, CFP, YFP, BFP, and the like.
在某些实施方案中,本发明的缀合物包含能够与DNA分子或细胞内分子结合的结构域,例如麦芽糖结合蛋白(MBP)、Lex A的DNA结合结构域(DBD)、GAL4的DBD等。In certain embodiments, the conjugates of the invention comprise a domain capable of binding to a DNA molecule or an intracellular molecule, such as a maltose binding protein (MBP), a DNA binding domain of Lex A (DBD), a DBD of GAL4, and the like. .
在某些实施方案中,本发明的缀合物包含可检测的标记,例如荧光染料,例如FITC或DAPI。In certain embodiments, the conjugates of the invention comprise a detectable label, such as a fluorescent dye, such as FITC or DAPI.
在某些实施方案中,本发明的蛋白任选地通过接头与所述修饰部分偶联、缀合或融合。In certain embodiments, a protein of the invention is optionally coupled, conjugated or fused to the modified moiety by a linker.
在某些实施方案中,所述修饰部分直接连接至本发明的蛋白的N端或C端。In certain embodiments, the modified moiety is directly linked to the N-terminus or C-terminus of the protein of the invention.
在某些实施方案中,所述修饰部分通过接头连接至本发明的蛋白的N端或C端。这类接头是本领域熟知的,其实例包括但不限于包含一个或多个(例如,1个,2个,3个,4个或5个)氨基酸(如,Glu或Ser)或氨基酸衍生物(如,Ahx、β-Ala、GABA或Ava)的接头,或PEG等。In certain embodiments, the modified moiety is linked to the N-terminus or C-terminus of the protein of the invention by a linker. Such linkers are well known in the art, examples of which include, but are not limited to, one or more (eg, 1, 2, 3, 4 or 5) amino acids (eg, Glu or Ser) or amino acid derivatives. A linker (eg, Ahx, β-Ala, GABA, or Ava), or PEG, and the like.
融合蛋白Fusion protein
在第三方面,本发明提供了一种融合蛋白,其包含本发明的蛋白以及另外的蛋白或多肽。In a third aspect, the invention provides a fusion protein comprising a protein of the invention and an additional protein or polypeptide.
在某些实施方案中,所述另外的蛋白或多肽选自表位标签、报告基因序列、核定位信号(NLS)序列、靶向部分、转录激活结构域(例如,VP64)、转录抑制结构域(例如,KRAB结构域或SID结构域)、核酸酶结构域(例如,Fok1),具有选自下列的活性的结构域:甲基化酶活性,去甲基化酶,转录激活活性,转录抑制活性,转录释放因子活性,组蛋白修饰活性,核酸酶活性,单链RNA切割活性,双链RNA切割活性,单链DNA切割活性,双链DNA切割活性和核酸结合活性;以及其任意组合。And X. (eg, a KRAB domain or a SID domain), a nuclease domain (eg, Fok1), having a domain selected from the group consisting of methylase activity, demethylase, transcriptional activation activity, transcriptional repression Activity, transcription release factor activity, histone modification activity, nuclease activity, single-strand RNA cleavage activity, double-stranded RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA cleavage activity and nucleic acid binding activity; and any combination thereof.
在某些实施方案中,本发明的融合蛋白包含一个或多个NLS序列,例如SV40病毒大T抗原的NLS。在某些实施方案中,所述NLS序列位于、靠近或接近本发明的蛋白的末端(例如,N端或C端)。在某些示例性实施方案中,所述NLS序列位于、靠近或接近本发明的蛋白的C端。In certain embodiments, a fusion protein of the invention comprises one or more NLS sequences, such as the NLS of the SV40 viral large T antigen. In certain embodiments, the NLS sequence is at, near, or near the end of the protein of the invention (eg, the N-terminus or the C-terminus). In certain exemplary embodiments, the NLS sequence is located at, near, or near the C-terminus of the protein of the invention.
在某些实施方案中,本发明的融合蛋白包含表位标签。In certain embodiments, a fusion protein of the invention comprises an epitope tag.
在某些实施方案中,本发明的融合蛋白包含报告基因序列。In certain embodiments, a fusion protein of the invention comprises a reporter gene sequence.
在某些实施方案中,本发明的融合蛋白包含能够与DNA分子或细胞内分子结合的结构域。In certain embodiments, a fusion protein of the invention comprises a domain capable of binding to a DNA molecule or an intracellular molecule.
在某些实施方案中,本发明的蛋白任选地通过接头与所述另外的蛋白或多肽融合。In certain embodiments, a protein of the invention is optionally fused to the additional protein or polypeptide via a linker.
在某些实施方案中,所述另外的蛋白或多肽直接连接至本发明的蛋白的N端或C端。In certain embodiments, the additional protein or polypeptide is directly linked to the N-terminus or C-terminus of the protein of the invention.
在某些实施方案中,所述另外的蛋白或多肽通过接头连接至本发明的蛋白的N端或C端。In certain embodiments, the additional protein or polypeptide is linked to the N-terminus or C-terminus of the protein of the invention by a linker.
在某些示例性实施方案中,本发明的融合蛋白具有选自下列的氨基酸序列:SEQ ID NOs:23-29。In certain exemplary embodiments, the fusion proteins of the invention have an amino acid sequence selected from the group consisting of SEQ ID NOs: 23-29.
本发明的蛋白、本发明的缀合物或本发明的融合蛋白不受其产生方式的限定,例如,其可以通过基因工程方法(重组技术)产生,也可以通过化学合成方法产生。The protein of the present invention, the conjugate of the present invention or the fusion protein of the present invention is not limited by the manner in which it is produced, for example, it can be produced by a genetic engineering method (recombination technique) or can be produced by a chemical synthesis method.
同向重复序列Co-directional repeat
在第四方面,本发明提供了一种分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:In a fourth aspect, the invention provides an isolated nucleic acid molecule comprising a sequence selected from the group consisting of: or consisting of:
(i)SEQ ID NOs:15-21任一项所示的序列;(i) the sequence set forth in any one of SEQ ID NOs: 15-21.
(ii)与SEQ ID NOs:15-21任一项所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions (for example, 1, 2, 3, 4, 5, compared to the sequence shown in any one of SEQ ID NOs: 15-21) a sequence of 6, 7, 8, or 10 base substitutions, deletions, or additions;
(iii)与SEQ ID NOs:15-21任一项所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90% of the sequence set forth in any one of SEQ ID NOs: 15-21. a sequence of at least 95% sequence identity;
(iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
(v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
并且,(ii)-(v)中任一项所述的序列基本保留了其所源自的序列的生物学功能,所述序列的生物学功能是指,作为CRISPR-Cas系统中的同向重复序列的活性。Furthermore, the sequence of any one of (ii)-(v) substantially retains the biological function of the sequence from which it is derived, the biological function of the sequence being referred to as the same direction in the CRISPR-Cas system Repeat the activity of the sequence.
在某些实施方案中,所述分离的核酸分子是CRISPR-Cas系统中的同向重复序列。In certain embodiments, the isolated nucleic acid molecule is a direct repeat in a CRISPR-Cas system.
在某些实施方案中,所述分离的核酸分子包含一个或多个茎环或优化的二级结构。在某些实施方案中,(ii)-(v)中任一项所述的序列保留了其所源自的序列的二级结构。In certain embodiments, the isolated nucleic acid molecule comprises one or more stem loops or an optimized secondary structure. In certain embodiments, the sequence of any of (ii)-(v) retains the secondary structure of the sequence from which it is derived.
在某些实施方案中,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
(a)SEQ ID NOs:15-21任一项所示的核苷酸序列;(a) the nucleotide sequence shown in any one of SEQ ID NOs: 15-21.
(b)在严格条件下与(a)中所述的序列杂交的序列;或(b) a sequence that hybridizes under stringent conditions to the sequence described in (a); or
(c)(a)中所述的序列的互补序列。(c) the complement of the sequence described in (a).
在某些实施方案中,所述分离的核酸分子是RNA。In certain embodiments, the isolated nucleic acid molecule is RNA.
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(i)SEQ ID NO:15所示的序列;(i) the sequence of SEQ ID NO: 15;
(ii)与SEQ ID NO:15所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) a substitution, deletion or addition of one or more bases compared to the sequence shown in SEQ ID NO: 15 ( eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
(iii)与SEQ ID NO:15所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
(iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
(v)(i)-(iii)任一项中所述的序列的互补序列。(v) the complement of the sequence set forth in any of (i)-(iii).
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(a)SEQ ID NO:15所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 15;
(b)在严格条件下与(a)中所述的序列杂交的序列;或(b) a sequence that hybridizes under stringent conditions to the sequence described in (a); or
(c)SEQ ID NO:15所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 15.
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(i)SEQ ID NO:16所示的序列;(i) the sequence shown as SEQ ID NO:16;
(ii)与SEQ ID NO:16所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions compared to the sequence shown in SEQ ID NO: 16 (for example, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
(iii)与SEQ ID NO:16所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
(iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
(v)(i)-(iii)任一项中所述的序列的互补序列。(v) the complement of the sequence set forth in any of (i)-(iii).
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(a)SEQ ID NO:16所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 16;
(b)在严格条件下与(a)中所述的序列杂交的序列;或(b) a sequence that hybridizes under stringent conditions to the sequence described in (a); or
(c)SEQ ID NO:16所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 16.
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(i)SEQ ID NO:17所示的序列;(i) the sequence shown as SEQ ID NO:17;
(ii)与SEQ ID NO:17所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions compared to the sequence shown in SEQ ID NO: 17 (for example, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
(iii)与SEQ ID NO:17所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
(iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
(v)(i)-(iii)任一项中所述的序列的互补序列。(v) the complement of the sequence set forth in any of (i)-(iii).
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(a)SEQ ID NO:17所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 17;
(b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
(c)SEQ ID NO:17所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 17.
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(i)SEQ ID NO:18所示的序列;(i) the sequence shown as SEQ ID NO:18;
(ii)与SEQ ID NO:18所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) a substitution, deletion or addition of one or more bases compared to the sequence shown in SEQ ID NO: 18 ( eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
(iii)与SEQ ID NO:18所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: 18. Sequence of sequence identity;
(iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
(v)(i)-(iii)任一项中所述的序列的互补序列。(v) the complement of the sequence set forth in any of (i)-(iii).
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(a)SEQ ID NO:18所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 18;
(b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
(c)SEQ ID NO:18所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 18.
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(i)SEQ ID NO:19所示的序列;(i) the sequence shown as SEQ ID NO:19;
(ii)与SEQ ID NO:19所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions compared to the sequence shown in SEQ ID NO: 19 (for example, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
(iii)与SEQ ID NO:19所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
(iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
(v)(i)-(iii)任一项中所述的序列的互补序列。(v) the complement of the sequence set forth in any of (i)-(iii).
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(a)SEQ ID NO:19所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 19;
(b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
(c)SEQ ID NO:19所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 19.
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(i)SEQ ID NO:20所示的序列;(i) the sequence shown as SEQ ID NO:20;
(ii)与SEQ ID NO:20所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions compared to the sequence shown in SEQ ID NO: 20 (for example, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
(iii)与SEQ ID NO:20所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
(iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
(v)(i)-(iii)任一项中所述的序列的互补序列。(v) the complement of the sequence set forth in any of (i)-(iii).
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(a)SEQ ID NO:20所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO:20;
(b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
(c)SEQ ID NO:20所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 20.
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(i)SEQ ID NO:21所示的序列;(i) the sequence shown as SEQ ID NO:21.
(ii)与SEQ ID NO:21所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) a substitution, deletion or addition of one or more bases compared to the sequence shown in SEQ ID NO: 21 (for example, 1, 2, 3, 4, 5, 6, 7 a sequence of 8, 8 or 10 base substitutions, deletions or additions;
(iii)与SEQ ID NO:21所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO:21. Sequence of sequence identity;
(iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
(v)(i)-(iii)任一项中所述的序列的互补序列。(v) the complement of the sequence set forth in any of (i)-(iii).
在某些实施方案中,所述分离的核酸分子包含选自下列的序列,或由选自下列的序列组成:In certain embodiments, the isolated nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of:
(a)SEQ ID NO:21所示的核苷酸序列;(a) the nucleotide sequence shown as SEQ ID NO: 21;
(b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
(c)SEQ ID NO:21所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO:21.
CRISPR/Cas复合物CRISPR/Cas Complex
在第五方面,本发明提供了一种复合物,其包含:In a fifth aspect, the invention provides a composite comprising:
(i)蛋白组分,其选自:本发明的蛋白、缀合物或融合蛋白,及其任意组合;和(i) a protein component selected from the group consisting of a protein, conjugate or fusion protein of the invention, and any combination thereof;
(ii)核酸组分,其包含如上文所述的分离的核酸分子和能够与靶序列杂交的导向序列,(ii) a nucleic acid component comprising an isolated nucleic acid molecule as described above and a targeting sequence capable of hybridizing to the target sequence,
其中,所述蛋白组分与核酸组分相互结合形成复合物。Wherein the protein component and the nucleic acid component are combined with each other to form a complex.
在某些实施方案中,所述导向序列连接于所述核酸分子的3’端或5’端。In certain embodiments, the targeting sequence is linked to the 3' or 5' end of the nucleic acid molecule.
在某些实施方案中,所述导向序列包含所述靶序列的互补序列。In certain embodiments, the targeting sequence comprises the complement of the target sequence.
在某些实施方案中,所述核酸组分是CRISPR/Cas系统中的导向RNA。In certain embodiments, the nucleic acid component is a targeting RNA in a CRISPR/Cas system.
在某些实施方案中,所述核酸分子是RNA。In certain embodiments, the nucleic acid molecule is RNA.
在某些实施方案中,所述复合物不包含反式作用tracrRNA。In certain embodiments, the complex does not comprise a trans-acting tracrRNA.
编码核酸、载体及宿主细胞Encoding nucleic acids, vectors and host cells
在第六方面,本发明提供了一种分离的核酸分子,其包含:In a sixth aspect, the invention provides an isolated nucleic acid molecule comprising:
(i)编码本发明的蛋白或融合蛋白的核苷酸序列;(i) a nucleotide sequence encoding a protein or fusion protein of the invention;
(ii)编码如第四方面所述的分离的核酸分子;或(ii) an isolated nucleic acid molecule encoding a fourth aspect; or
(iii)包含(i)和(ii)的核苷酸序列。(iii) a nucleotide sequence comprising (i) and (ii).
在某些实施方案中,(i)-(iii)任一项中所述的核苷酸序列经密码子优化用于在原核细胞中进行表达。在某些实施方案中,(i)-(iii)任一项中所述的核苷酸序列经密码子优化用于在真核细胞中进行表达。In certain embodiments, the nucleotide sequence set forth in any of (i)-(iii) is codon optimized for expression in a prokaryotic cell. In certain embodiments, the nucleotide sequence set forth in any of (i)-(iii) is codon optimized for expression in eukaryotic cells.
在第七方面,本发明还提供了一种载体,其包含如第六方面所述的分离的核酸分子。本发明的载体可以是克隆载体,也可以是表达载体。在某些实施方案中,本发明的载体是例如质粒,粘粒,噬菌体,柯斯质粒等等。在某些选实施方案中,所述载体能够在受试者(例如哺乳动物,例如人)体内表达本发明的蛋白、融合蛋白、如第四方面所述的分离的核酸分子或如第五方面所述的复合物。In a seventh aspect, the invention provides a vector comprising the isolated nucleic acid molecule of the sixth aspect. The vector of the present invention may be a cloning vector or an expression vector. In certain embodiments, vectors of the invention are, for example, plasmids, cosmids, phage, cosmid, and the like. In certain selected embodiments, the vector is capable of expressing a protein of the invention, a fusion protein, an isolated nucleic acid molecule of the fourth aspect, or a fifth aspect, in a subject (eg, a mammal, eg, a human) Said complex.
在第八方面,本发明还提供了包含如上所述的分离的核酸分子或载体的宿主细胞。 此类宿主细胞包括但不限于,原核细胞例如大肠杆菌细胞,以及真核细胞例如酵母细胞,昆虫细胞,植物细胞和动物细胞(如哺乳动物细胞,例如小鼠细胞、人细胞等)。本发明的细胞还可以是细胞系,例如293T细胞。在某些实施方案中,所述宿主细胞是原核细胞。In an eighth aspect, the invention also provides a host cell comprising an isolated nucleic acid molecule or vector as described above. Such host cells include, but are not limited to, prokaryotic cells such as E. coli cells, and eukaryotic cells such as yeast cells, insect cells, plant cells, and animal cells (eg, mammalian cells, such as mouse cells, human cells, etc.). The cells of the invention may also be cell lines, such as 293T cells. In certain embodiments, the host cell is a prokaryotic cell.
组合物及载体组合物Composition and carrier composition
在第九方面,本发明还提供了一种组合物,其包含:In a ninth aspect, the present invention also provides a composition comprising:
(i)第一组分,其选自:本发明的蛋白、缀合物、融合蛋白、编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;和(i) a first component selected from the group consisting of a protein, a conjugate, a fusion protein, a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
(ii)第二组分,其为包含导向RNA的核苷酸序列,或者编码所述包含导向RNA的核苷酸序列的核苷酸序列;(ii) a second component which is a nucleotide sequence comprising a targeting RNA or a nucleotide sequence encoding the nucleotide sequence comprising the targeting RNA;
其中,所述导向RNA包含同向重复序列和导向序列,所述导向序列能够与靶序列杂交;Wherein the targeting RNA comprises a homologous repeat sequence and a targeting sequence, the targeting sequence being capable of hybridizing to the target sequence;
所述导向RNA能够与(i)中所述的蛋白、缀合物或融合蛋白形成复合物。The targeting RNA is capable of forming a complex with the protein, conjugate or fusion protein described in (i).
在某些实施方案中,所述同向重复序列是如第四方面所定义的分离的核酸分子。In certain embodiments, the isotropic repeat is an isolated nucleic acid molecule as defined in the fourth aspect.
在某些实施方案中,所述导向序列连接至所述同向重复序列的3’端或5’端。在某些实施方案中,所述导向序列包含所述靶序列的互补序列。In certain embodiments, the targeting sequence is ligated to the 3' or 5' end of the homologous repeat. In certain embodiments, the targeting sequence comprises the complement of the target sequence.
在某些实施方案中,所述组合物不包含tracrRNA。In certain embodiments, the composition does not comprise tracrRNA.
在某些实施方案中,所述组合物是非天然存在的或经修饰的。在某些实施方案中,所述组合物中的至少一个组分是非天然存在的或经修饰的。在某些实施方案中,所述第一组分是非天然存在的或经修饰的;和/或,所述第二组分是非天然存在的或经修饰的。In certain embodiments, the composition is non-naturally occurring or modified. In certain embodiments, at least one component of the composition is non-naturally occurring or modified. In certain embodiments, the first component is non-naturally occurring or modified; and/or the second component is non-naturally occurring or modified.
在某些实施方案中,所述靶序列是来自原核细胞或真核细胞的RNA序列。在某些实施方案中,所述靶序列是非天然存在的RNA序列。In certain embodiments, the target sequence is an RNA sequence from a prokaryotic or eukaryotic cell. In certain embodiments, the target sequence is a non-naturally occurring RNA sequence.
在某些实施方案中,所述靶序列存在于细胞内。在某些实施方案中,所述靶序列存在于细胞核内或细胞质(例如,细胞器)内。在某些实施方案中,所述细胞是原核细胞。在某些实施方案中,所述细胞是真核细胞。In certain embodiments, the target sequence is present in a cell. In certain embodiments, the target sequence is present within the nucleus or within the cytoplasm (eg, an organelle). In certain embodiments, the cell is a prokaryotic cell. In certain embodiments, the cell is a eukaryotic cell.
在某些实施方案中,所述蛋白连接有一个或多个NLS序列。在某些实施方案中,所述缀合物或融合蛋白包含一个或多个NLS序列。在某些实施方案中,所述NLS序列连接至所述蛋白的N端或C端。在某些实施方案中,所述NLS序列融合至所述蛋白的N端或C端。In certain embodiments, the protein is linked to one or more NLS sequences. In certain embodiments, the conjugate or fusion protein comprises one or more NLS sequences. In certain embodiments, the NLS sequence is linked to the N-terminus or C-terminus of the protein. In certain embodiments, the NLS sequence is fused to the N-terminus or C-terminus of the protein.
在第十方面,本发明还提供了一种组合物,其包含一种或多种载体,所述一种或多种载体包含:In a tenth aspect, the present invention also provides a composition comprising one or more carriers, the one or more carriers comprising:
(i)第一核酸,其为编码本发明的蛋白或融合蛋白的核苷酸序列;任选地所述第一核酸可操作地连接至第一调节元件;以及(i) a first nucleic acid which is a nucleotide sequence encoding a protein or fusion protein of the invention; optionally the first nucleic acid is operably linked to a first regulatory element;
(ii)第二核酸,其编码包含导向RNA的核苷酸序列;任选地所述第二核酸可操作地连接至第二调节元件;(ii) a second nucleic acid encoding a nucleotide sequence comprising a targeting RNA; optionally the second nucleic acid is operably linked to a second regulatory element;
其中:among them:
所述第一核酸与第二核酸存在于相同或不同的载体上;The first nucleic acid and the second nucleic acid are present on the same or different carrier;
所述导向RNA包含同向重复序列和导向序列,所述导向序列能够与靶序列杂交;The targeting RNA comprises a homologous repeat sequence and a targeting sequence capable of hybridizing to the target sequence;
所述导向RNA能够与(i)中所述的效应蛋白或融合蛋白形成复合物。The targeting RNA is capable of forming a complex with the effector protein or fusion protein described in (i).
在某些实施方案中,所述同向重复序列是如第四方面所定义的分离的核酸分子。In certain embodiments, the isotropic repeat is an isolated nucleic acid molecule as defined in the fourth aspect.
在某些实施方案中,所述导向序列连接至所述同向重复序列的3’端或5’端。在某些实施方案中,所述导向序列包含所述靶序列的互补序列。In certain embodiments, the targeting sequence is ligated to the 3' or 5' end of the homologous repeat. In certain embodiments, the targeting sequence comprises the complement of the target sequence.
在某些实施方案中,所述组合物不包含tracrRNA。In certain embodiments, the composition does not comprise tracrRNA.
在某些实施方案中,所述组合物是非天然存在的或经修饰的。在某些实施方案中,所述组合物中的至少一个组分是非天然存在的或经修饰的。In certain embodiments, the composition is non-naturally occurring or modified. In certain embodiments, at least one component of the composition is non-naturally occurring or modified.
在某些实施方案中,所述第一调节元件是启动子,例如诱导型启动子。In certain embodiments, the first regulatory element is a promoter, such as an inducible promoter.
在某些实施方案中,所述第二调节元件是启动子,例如诱导型启动子。In certain embodiments, the second regulatory element is a promoter, such as an inducible promoter.
在某些实施方案中,所述靶序列是来自原核细胞或真核细胞的RNA序列。在某些实施方案中,所述靶序列是非天然存在的RNA序列。In certain embodiments, the target sequence is an RNA sequence from a prokaryotic or eukaryotic cell. In certain embodiments, the target sequence is a non-naturally occurring RNA sequence.
在某些实施方案中,所述靶序列存在于细胞内。在某些实施方案中,所述靶序列存在于细胞核内或细胞质(例如,细胞器)内。在某些实施方案中,所述细胞是原核细胞。在某些实施方案中,所述细胞是真核细胞。In certain embodiments, the target sequence is present in a cell. In certain embodiments, the target sequence is present within the nucleus or within the cytoplasm (eg, an organelle). In certain embodiments, the cell is a prokaryotic cell. In certain embodiments, the cell is a eukaryotic cell.
在某些实施方案中,所述蛋白连接有一个或多个NLS序列。在某些实施方案中,所述缀合物或融合蛋白包含一个或多个NLS序列。在某些实施方案中,所述NLS序列连接至所述蛋白的N端或C端。在某些实施方案中,所述NLS序列融合至所述蛋白的N端或C端。In certain embodiments, the protein is linked to one or more NLS sequences. In certain embodiments, the conjugate or fusion protein comprises one or more NLS sequences. In certain embodiments, the NLS sequence is linked to the N-terminus or C-terminus of the protein. In certain embodiments, the NLS sequence is fused to the N-terminus or C-terminus of the protein.
在某些实施方案中,一种类型的载体是质粒,其是指其中可以例如通过标准分子克隆技术插入另外的DNA片段的环状双链DNA环。另一种类型的载体是病毒载体,其中 病毒衍生的DNA或RNA序列存在于用于包装病毒(例如,逆转录病毒、复制缺陷型逆转录病毒、腺病毒、复制缺陷型腺病毒、以及腺相关病毒)的载体中。病毒载体还包含由用于转染到一种宿主细胞中的病毒携带的多核苷酸。某些载体(例如,具有细菌复制起点的细菌载体和附加型哺乳动物载体)能够在它们被导入的宿主细胞中自主复制。其他载体(例如,非附加型哺乳动物载体)在引入宿主细胞后整合到该宿主细胞的基因组中,并且由此与该宿主基因组一起复制。而且,某些载体能够指导它们可操作连接的基因的表达。这样的载体在此被称为“表达载体”。在重组DNA技术中使用的普通表达栽体通常是质粒形式。In certain embodiments, one type of vector is a plasmid, which refers to a circular double stranded DNA loop in which additional DNA fragments can be inserted, for example, by standard molecular cloning techniques. Another type of vector is a viral vector in which a virus-derived DNA or RNA sequence is present for packaging a virus (eg, retrovirus, replication-defective retrovirus, adenovirus, replication-defective adenovirus, and adeno-associated In the vector of the virus). The viral vector also comprises a polynucleotide carried by a virus for transfection into a host cell. Certain vectors (e.g., bacterial vectors having bacterial origins of replication and episomal mammalian vectors) are capable of autonomous replication in the host cell into which they are introduced. Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of the host cell upon introduction into the host cell and thereby replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operably linked. Such vectors are referred to herein as "expression vectors." Common expression vectors used in recombinant DNA techniques are typically in the form of plasmids.
重组表达载体可包含处于适合于在宿主细胞中的核酸表达的形式的本发明的核酸分子,这意味着这些重组表达载体包含基于待用于表达的宿主细胞而选择的一种或多种调节元件,所述调节元件可操作地连接至待表达的核酸序列。The recombinant expression vector can comprise a nucleic acid molecule of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vector comprises one or more regulatory elements selected based on the host cell to be used for expression. The regulatory element is operably linked to the nucleic acid sequence to be expressed.
递送及递送组合物Delivery and delivery composition
本发明的蛋白、缀合物、融合蛋白、如第四方面所述的分离的核酸分子、本发明的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面及第十方面所述的组合物,可以通过本领域已知的任何方法进行递送。此类方法包括但不限于,电穿孔、脂转染、核转染、显微注射、声孔效应、基因枪、磷酸钙介导的转染、阳离子转染、脂质体转染、树枝状转染、热激转染、核转染、磁转染、脂转染、穿刺转染、光学转染、试剂增强性核酸摄取、以及经由脂质体、免疫脂质体、病毒颗粒、人工病毒体等的递送。The protein, the conjugate, the fusion protein of the invention, the isolated nucleic acid molecule of the fourth aspect, the complex of the invention, the isolated nucleic acid molecule of the sixth aspect, the vector of the seventh aspect The compositions of the ninth and tenth aspects can be delivered by any method known in the art. Such methods include, but are not limited to, electroporation, lipofection, nuclear transfection, microinjection, sonoporation, gene gun, calcium phosphate mediated transfection, cation transfection, lipofection, dendritic Transfection, heat shock transfection, nuclear transfection, magnetic transfection, lipofection, puncture transfection, optical transfection, reagent-enhanced nucleic acid uptake, and via liposomes, immunoliposomes, viral particles, artificial viruses Delivery of body, etc.
因此,在另一个方面,本发明提供了一种递送组合物,其包含递送载体,以及选自下列的一种或多种:本发明的蛋白、缀合物、融合蛋白、如第四方面所述的分离的核酸分子、本发明的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面及第十方面所述的组合物。Accordingly, in another aspect, the present invention provides a delivery composition comprising a delivery vehicle, and one or more selected from the group consisting of a protein, a conjugate, a fusion protein of the invention, as in the fourth aspect The isolated nucleic acid molecule, the complex of the invention, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth and tenth aspects.
在某些实施方案中,所述递送载体是粒子。In certain embodiments, the delivery vehicle is a particle.
在某些实施方案中,所述递送载体选自脂质颗粒、糖颗粒、金属颗粒、蛋白颗粒、脂质体、外泌体、微泡、基因枪或病毒载体(例如,复制缺陷型逆转录病毒、慢病毒、腺病毒或腺相关病毒)。In certain embodiments, the delivery vehicle is selected from the group consisting of a lipid particle, a sugar particle, a metal particle, a protein particle, a liposome, an exosome, a microvesicle, a gene gun, or a viral vector (eg, replication defective reverse transcription) Virus, lentivirus, adenovirus or adeno-associated virus).
试剂盒Kit
在另一个方面,本发明提供了一种试剂盒,其包含如上所述的组分中的一种或多种。在某些实施方案中,所述试剂盒包含一种或多种选自下列的组分:本发明的蛋白、缀合物、融合蛋白、如第四方面所述的分离的核酸分子、本发明的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面及第十方面所述的组合物。In another aspect, the invention provides a kit comprising one or more of the components described above. In certain embodiments, the kit comprises one or more components selected from the group consisting of a protein, a conjugate, a fusion protein of the invention, an isolated nucleic acid molecule of the fourth aspect, the invention The complex, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth and tenth aspects.
在某些实施方案中,本发明的试剂盒包含如第九方面所述的组合物。在某些实施方案中,所述试剂盒还包含使用所述组合物的说明书。In certain embodiments, the kit of the invention comprises the composition of the ninth aspect. In certain embodiments, the kit further comprises instructions for using the composition.
在某些实施方案中,本发明的试剂盒包含如第十方面所述的组合物。在某些实施方案中,所述试剂盒还包含使用所述组合物的说明书。In certain embodiments, the kit of the invention comprises the composition of the tenth aspect. In certain embodiments, the kit further comprises instructions for using the composition.
在某些实施方案中,本发明的试剂盒中包含的组分可以被提供于任何适合的容器中。In certain embodiments, the components contained in the kits of the invention can be provided in any suitable container.
在某些实施方案中,所述试剂盒还包含一种或多种缓冲液。缓冲液可以是任何缓冲液,包括但不限于碳酸钠缓冲液、碳酸氢钠缓冲液、硼酸盐缓冲液、Tris缓冲液、MOPS缓冲液、HEPES缓冲液及其组合。在某些实施方案中,该缓冲液是碱性的。在某些实施方案中,该缓冲液具有从约7至约10的pH。In certain embodiments, the kit further comprises one or more buffers. The buffer can be any buffer including, but not limited to, sodium carbonate buffer, sodium bicarbonate buffer, borate buffer, Tris buffer, MOPS buffer, HEPES buffer, and combinations thereof. In certain embodiments, the buffer is basic. In certain embodiments, the buffer has a pH of from about 7 to about 10.
在某些实施方案中,该试剂盒还包括一个或多个寡核苷酸,该一个或多个寡核苷酸对应于一个用于插入进载体中的导向序列,以便可操作地连接该导向序列和调节元件。In certain embodiments, the kit further comprises one or more oligonucleotides corresponding to a targeting sequence for insertion into a vector for operably linking the guide Sequence and adjustment elements.
在某些实施方案中,该试剂盒还包括RNA模板。在某些实施方案中,所述RNA模板包含编码蛋白的RNA序列或者非编码RNA序列(例如,microRNA)。In certain embodiments, the kit further comprises an RNA template. In certain embodiments, the RNA template comprises an RNA sequence encoding a protein or a non-coding RNA sequence (eg, a microRNA).
方法及用途Method and use
在另一个方面,本发明提供了一种修饰靶序列的方法,其包括:将如第五方面所述的复合物、如第九方面所述的组合物、如第十方面所述的组合物或如本文所述的递送组合物与所述靶序列接触,或者递送至包含所述靶序列的细胞中;其中,所述靶序列与感兴趣的基因相关或者存在于所述感兴趣的基因中,并且所述靶序列为RNA。In another aspect, the invention provides a method of modifying a target sequence, comprising: the complex of the fifth aspect, the composition of the ninth aspect, the composition of the tenth aspect Or the delivery composition as described herein is contacted with the target sequence or delivered to a cell comprising the target sequence; wherein the target sequence is associated with or is present in the gene of interest And the target sequence is RNA.
在某些实施方案中,所述靶序列存在于细胞内。在某些实施方案中,所述细胞是原核细胞。在某些实施方案中,所述细胞是真核细胞。在某些实施方案中,所述细胞是哺乳动物细胞。在某些实施方案中,所述细胞是人类细胞。在某些实施方案中,所述细胞选自非人灵长类动物、牛、猪或啮齿类动物细胞。在某些实施方案中,所述细胞是非哺 乳动物真核细胞,例如家禽或鱼等。在某些实施方案中,所述细胞是植物细胞,例如栽培植物(如木薯、玉米、高粱、小麦或水稻)、藻类、树或蔬菜具有的细胞。In certain embodiments, the target sequence is present in a cell. In certain embodiments, the cell is a prokaryotic cell. In certain embodiments, the cell is a eukaryotic cell. In certain embodiments, the cell is a mammalian cell. In certain embodiments, the cell is a human cell. In certain embodiments, the cell is selected from a non-human primate, bovine, porcine or rodent cell. In certain embodiments, the cell is a non-mammalian eukaryotic cell, such as a poultry or fish. In certain embodiments, the cell is a plant cell, such as a cell of a cultivated plant (such as cassava, corn, sorghum, wheat, or rice), algae, tree, or vegetable.
在某些实施方案中,所述靶序列存在于体外的核酸分子(例如,质粒)中。在某些实施方案中,所述靶序列存在于质粒中。In certain embodiments, the target sequence is present in a nucleic acid molecule (eg, a plasmid) in vitro. In certain embodiments, the target sequence is present in a plasmid.
在某些实施方案中,所述修饰是指所述靶序列的断裂,如双链断裂或单链断裂。In certain embodiments, the modification refers to a cleavage of the target sequence, such as a double strand break or a single strand break.
在某些实施方案中,所述靶序列是ssRNA。In certain embodiments, the target sequence is a ssRNA.
在某些实施方案中,所述方法还包括:将RNA模板与所述靶序列接触,或者递送至包含所述靶序列的细胞中。在某些实施方案中,所述修饰还包括将RNA模板(例如外源核酸)插入所述断裂中。In certain embodiments, the method further comprises contacting the RNA template with the target sequence or delivering to a cell comprising the target sequence. In certain embodiments, the modification further comprises inserting an RNA template (eg, an exogenous nucleic acid) into the fragment.
在某些实施方案中,所述的蛋白、缀合物、融合蛋白、分离的核酸分子、复合物、载体或组合物包含于递送载体中。In certain embodiments, the protein, conjugate, fusion protein, isolated nucleic acid molecule, complex, vector or composition is included in a delivery vehicle.
在某些实施方案中,所述递送载体选自脂质颗粒、糖颗粒、金属颗粒、蛋白颗粒、脂质体、外泌体、病毒载体(如复制缺陷型逆转录病毒、慢病毒、腺病毒或腺相关病毒)。In certain embodiments, the delivery vehicle is selected from the group consisting of a lipid particle, a sugar particle, a metal particle, a protein particle, a liposome, an exosome, a viral vector (eg, a replication defective retrovirus, a lentivirus, an adenovirus) Or adeno-associated virus).
在某些实施方案中,所述方法用于RNA干扰或调节基因表达。In certain embodiments, the methods are for RNA interference or modulation of gene expression.
在某些实施方案中,所述方法通过调控RNA加工或RNA激活(RNAa)来调节基因表达。在此类实施方案中,所述RNA加工可以包括RNA剪接(包括选择性剪接)、病毒复制(例如,卫星病毒、噬菌体或逆转录病毒,例如HBV、HCV、HIV等)或tRNA生物合成。在某些情况下,所述RNAa促进基因表达。因此,在此类实施方案中,所述方法通过干扰或减少RNAa来抑制基因表达。In certain embodiments, the methods modulate gene expression by modulating RNA processing or RNA activation (RNAa). In such embodiments, the RNA processing can include RNA splicing (including alternative splicing), viral replication (eg, satellite viruses, phage or retroviruses, eg, HBV, HCV, HIV, etc.) or tRNA biosynthesis. In some cases, the RNAa promotes gene expression. Thus, in such embodiments, the method inhibits gene expression by interfering with or reducing RNAa.
在另一个方面,本发明涉及如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物,在选自下列的一项或多项中的用途,或者在制备制剂中的用途,所述制剂用于选自下列的一项或多项:In another aspect, the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated nucleic acid molecule of the fourth aspect, The complex of claim 5, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth aspect, the composition of the tenth aspect A kit or delivery composition of the invention, for use in one or more of the following, or in the preparation of a formulation for one or more selected from the group consisting of:
(1)修饰靶序列;(1) modifying the target sequence;
(2)RNA干扰;或(2) RNA interference; or
(3)调控基因表达。(3) Regulate gene expression.
在另一个方面,本发明提供了一种检测靶序列的方法,其包括将如第五方面所述的 复合物、如第九方面所述的组合物、如第十方面所述的组合物或如本文所述的递送组合物与所述靶序列接触,或者递送至包含所述靶序列的细胞中;其中,所述靶序列是RNA。In another aspect, the invention provides a method of detecting a target sequence, comprising the complex of the fifth aspect, the composition of the ninth aspect, the composition of the tenth aspect or A delivery composition as described herein is contacted with the target sequence or delivered to a cell comprising the target sequence; wherein the target sequence is RNA.
在某些实施方案中,所述复合物、组合物或递送组合物中所包含的导向序列能够与所述靶序列杂交。In certain embodiments, a targeting sequence contained in the complex, composition or delivery composition is capable of hybridizing to the target sequence.
在某些实施方案中,所述靶序列存在于体外的核酸分子中。In certain embodiments, the target sequence is present in a nucleic acid molecule in vitro.
在某些实施方案中,所述靶序列存在于细胞内。在某些实施方案中,所述细胞是原核细胞。In certain embodiments, the target sequence is present in a cell. In certain embodiments, the cell is a prokaryotic cell.
在某些实施方案中,所述细胞是活细胞。In certain embodiments, the cell is a living cell.
在某些实施方案中,所述复合物、组合物或递送组合物所包含的蛋白组分带有可检测的标记。In certain embodiments, the protein component comprised by the complex, composition or delivery composition carries a detectable label.
在某些实施方案中,所述复合物、组合物或递送组合物所包含的蛋白组分融合有荧光蛋白(例如GFP)。In certain embodiments, the protein component comprised by the complex, composition or delivery composition is fused with a fluorescent protein (eg, GFP).
在某些实施方案中,所述方法是northern blot印迹法。在此类实施方案中,所述northern blot印迹法涉及通过电泳按大小分离RNA样品。因此,本发明的复合物或组合物可以被用于特异性结合和检测靶RNA序列。In certain embodiments, the method is northern blotting. In such embodiments, the northern blotting method involves isolating RNA samples by size by electrophoresis. Thus, the complexes or compositions of the invention can be used to specifically bind and detect target RNA sequences.
在某些实施方案中,所述方法是荧光原位杂交。在此类实施方案中,本发明的复合物、组合物或递送组合物所包含的蛋白组分带有可检测的标记(例如荧光标记)。In certain embodiments, the method is fluorescence in situ hybridization. In such embodiments, the protein component of the complex, composition or delivery composition of the invention carries a detectable label (eg, a fluorescent label).
在另一个方面,本发明涉及如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物,用于检测靶序列的用途,或者在制备制剂中的用途,所述制剂用于检测靶序列。In another aspect, the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated nucleic acid molecule of the fourth aspect, The complex of claim 5, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth aspect, the composition of the tenth aspect A kit or delivery composition of the invention for use in detecting a target sequence, or in the preparation of a preparation for detecting a target sequence.
在某些实施方案中,当本发明的CRISPR/Cas复合物与靶RNA结合后,所述复合物中的Cas13e即被活化,随后可裂解任何附近的ssRNA序列(即,附带切割(collateral cleavage))。Cas13e一旦被所述靶RNA引发,就可以切割其他(非互补)RNA分子。这种混杂的RNA切割可能会引起细胞毒性,或以其他方式影响细胞生理学或细胞状态。In certain embodiments, when the CRISPR/Cas complex of the invention binds to a target RNA, Cas13e in the complex is activated, followed by cleavage of any nearby ssRNA sequence (ie, collateral cleavage) ). Once Cas13e is primed by the target RNA, other (non-complementary) RNA molecules can be cleaved. This confounding RNA cleavage can cause cytotoxicity or otherwise affect cell physiology or cellular status.
因此,在另一个方面,本发明提供了一种调节靶细胞的状态的方法,其包括将如第五方面所述的复合物、如第九方面所述的组合物、如第十方面所述的组合物或如本文所 述的递送组合物引入所述靶细胞中。In a further aspect, the invention provides a method of modulating the state of a target cell, comprising the composition of the fifth aspect, the composition of the ninth aspect, as described in the tenth aspect The composition or delivery composition as described herein is introduced into the target cell.
在某些实施方案中,所述靶细胞中存在与所述复合物、组合物或递送组合物中所包含的导向序列杂交的靶序列。In certain embodiments, a target sequence that hybridizes to a targeting sequence contained in the complex, composition, or delivery composition is present in the target cell.
在某些实施方案中,所述调节靶细胞的状态包括:In certain embodiments, the modulating the state of the target cell comprises:
(1)在体外或体内诱导细胞休眠;(1) inducing cell dormancy in vitro or in vivo;
(2)在体外或体内诱导细胞周期停滞;(2) Inducing cell cycle arrest in vitro or in vivo;
(3)在体外或体内抑制细胞生长和/或细胞增殖;(3) inhibiting cell growth and/or cell proliferation in vitro or in vivo;
(4)在体外或体内诱导细胞无反应性;(4) inducing cell anergy in vitro or in vivo;
(5)在体外或体内诱导细胞凋亡;(5) inducing apoptosis in vitro or in vivo;
(6)在体外或体内诱导细胞坏死;(6) inducing cell necrosis in vitro or in vivo;
(7)在体外或体内诱导细胞死亡;或(7) inducing cell death in vitro or in vivo; or
(8)在体外或体内诱导程序性细胞死亡。(8) Inducing programmed cell death in vitro or in vivo.
在某些实施方案中,所述细胞是原核细胞。In certain embodiments, the cell is a prokaryotic cell.
在另一个方面,本发明涉及如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物,在选自下列的一项或多项中的用途,或者在制备制剂中的用途,所述制剂用于选自下列的一项或多项:In another aspect, the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated nucleic acid molecule of the fourth aspect, The complex of claim 5, the isolated nucleic acid molecule of the sixth aspect, the carrier of the seventh aspect, the composition of the ninth aspect, the composition of the tenth aspect A kit or delivery composition of the invention, for use in one or more of the following, or in the preparation of a formulation for one or more selected from the group consisting of:
(1)在体外或体内诱导细胞休眠;(1) inducing cell dormancy in vitro or in vivo;
(2)在体外或体内诱导细胞周期停滞;(2) Inducing cell cycle arrest in vitro or in vivo;
(3)在体外或体内抑制细胞生长和/或细胞增殖;(3) inhibiting cell growth and/or cell proliferation in vitro or in vivo;
(4)在体外或体内诱导细胞无反应性;(4) inducing cell anergy in vitro or in vivo;
(5)在体外或体内诱导细胞凋亡;(5) inducing apoptosis in vitro or in vivo;
(6)在体外或体内诱导细胞坏死;(6) inducing cell necrosis in vitro or in vivo;
(7)在体外或体内诱导细胞死亡;或(7) inducing cell death in vitro or in vivo; or
(8)在体外或体内诱导程序性细胞死亡。(8) Inducing programmed cell death in vitro or in vivo.
在本发明中,如上所述的方法可以是治疗性或预防性的,并且可以靶向特定的靶细胞、细胞(亚)群体或细胞/组织类型。在某些情况下,所述特定的细胞、细胞(亚)群体或细胞/组织类型表达一种或多种靶序列,例如一种或多种特定的靶RNA。在某些实施方案 中,所述靶细胞的非限制性实例包括,表达特定转录物的肿瘤细胞、给定类别的神经元、导致自身免疫的细胞或被特定病原体(例如病毒)感染的细胞。In the present invention, the methods as described above may be therapeutic or prophylactic, and may target a particular target cell, cell (sub)pop, or cell/tissue type. In certain instances, the particular cell, cell (sub)pop, or cell/tissue type expresses one or more target sequences, such as one or more specific target RNAs. In certain embodiments, non-limiting examples of such target cells include tumor cells that express a particular transcript, a given class of neurons, cells that cause autoimmunity, or cells that are infected with a particular pathogen (eg, a virus).
因此,在某些实施方案中,本发明还提供了一种治疗以存在不良细胞为特征的病理状况的方法,其包括给有此需要的受试者施用将如第五方面所述的复合物、如第九方面所述的组合物、如第十方面所述的组合物或如本文所述的递送组合物。在某些实施方案中,本发明还涉及如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物用于治疗以存在不良细胞为特征的病理状况的用途。应理解的是,在上述实施方案中,本发明的复合物或组合物优选靶向所述不良细胞的特异性靶序列。Accordingly, in certain embodiments, the present invention also provides a method of treating a pathological condition characterized by the presence of a defective cell, comprising administering to a subject in need thereof a complex as described in the fifth aspect The composition of the ninth aspect, the composition of the tenth aspect, or the delivery composition as described herein. In certain embodiments, the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect The nucleic acid molecule, the complex according to the fifth aspect, the isolated nucleic acid molecule according to the sixth aspect, the carrier according to the seventh aspect, the composition according to the ninth aspect, according to the tenth aspect Use of the composition, kit of the invention or delivery composition for the treatment of a pathological condition characterized by the presence of defective cells. It will be appreciated that in the above embodiments, the complex or composition of the invention preferably targets a specific target sequence of the defective cell.
在某些实施方案中,所述以存在不良细胞为特征的病理状况是肿瘤。因此,在某些实施方案中,本发明还提供了一种治疗肿瘤的方法,其包括给有此需要的受试者施用将如第五方面所述的复合物、如第九方面所述的组合物、如第十方面所述的组合物或如本文所述的递送组合物。在某些实施方案中,本发明还涉及如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物用于治疗肿瘤的用途。在此类实施方案中,本发明的复合物或组合物优选靶向肿瘤细胞特异性的靶序列。In certain embodiments, the pathological condition characterized by the presence of a defective cell is a tumor. Accordingly, in certain embodiments, the present invention also provides a method of treating a tumor comprising administering to a subject in need thereof a complex according to the fifth aspect, as described in the ninth aspect A composition, a composition as described in the tenth aspect, or a delivery composition as described herein. In certain embodiments, the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect The nucleic acid molecule, the complex according to the fifth aspect, the isolated nucleic acid molecule according to the sixth aspect, the carrier according to the seventh aspect, the composition according to the ninth aspect, according to the tenth aspect Use of the composition, kit of the invention or delivery composition for the treatment of a tumor. In such embodiments, the complex or composition of the invention preferably targets a tumor cell specific target sequence.
在某些实施方案中,所述以存在不良细胞为特征的病理状况是病原体感染或由病原体感染所导致的疾病。因此,在某些实施方案中,本发明还提供了一种治疗病原体感染或由病原体感染所导致的疾病的方法,其包括给有此需要的受试者施用将如第五方面所述的复合物、如第九方面所述的组合物、如第十方面所述的组合物或如本文所述的递送组合物。在某些实施方案中,本发明还涉及如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物用于治疗病原体感染或由病原体感染所导致的疾病的用途。在此类实施方案中,本发明的复合物或组合物优选靶向被所述病原体所感染的细胞的特异性靶序列(例如,衍生自所述病原体 的靶序列)。In certain embodiments, the pathological condition characterized by the presence of a defective cell is a disease caused by a pathogen infection or infection by a pathogen. Accordingly, in certain embodiments, the present invention also provides a method of treating a pathogen infection or a disease caused by a pathogen infection, comprising administering to a subject in need thereof a compound as described in the fifth aspect The composition of the ninth aspect, the composition of the tenth aspect, or the delivery composition as described herein. In certain embodiments, the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect The nucleic acid molecule, the complex according to the fifth aspect, the isolated nucleic acid molecule according to the sixth aspect, the carrier according to the seventh aspect, the composition according to the ninth aspect, according to the tenth aspect Use of a composition, a kit of the invention or a delivery composition for treating a pathogen infection or a disease caused by a pathogen infection. In such embodiments, the complex or composition of the invention preferably targets a specific target sequence (e.g., a target sequence derived from the pathogen) of a cell infected by the pathogen.
在某些实施方案中,所述以存在不良细胞为特征的病理状况是自身免疫病。因此,在某些实施方案中,本发明还提供了一种治疗自身免疫病的方法,其包括给有此需要的受试者施用将如第五方面所述的复合物、如第九方面所述的组合物、如第十方面所述的组合物或如本文所述的递送组合物。在某些实施方案中,本发明还涉及如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物用于治疗自身免疫病的用途。在此类实施方案中,本发明的复合物或组合物优选靶向造成所述自身免疫病的细胞(例如,特定免疫细胞)的特异性靶序列。In certain embodiments, the pathological condition characterized by the presence of a defective cell is an autoimmune disease. Accordingly, in certain embodiments, the present invention also provides a method of treating an autoimmune disease, comprising administering to a subject in need thereof a complex according to the fifth aspect, as in the ninth aspect A composition, a composition as described in the tenth aspect, or a delivery composition as described herein. In certain embodiments, the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect The nucleic acid molecule, the complex according to the fifth aspect, the isolated nucleic acid molecule according to the sixth aspect, the carrier according to the seventh aspect, the composition according to the ninth aspect, according to the tenth aspect Use of the composition, kit of the invention or delivery composition for the treatment of an autoimmune disease. In such embodiments, the complex or composition of the invention preferably targets a specific target sequence of a cell (eg, a particular immune cell) that causes the autoimmune disease.
在某些实施方案中,本发明还涉及如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物,在治疗方法中的用途,或者在制备制剂中的用途,所述制剂用于治疗方法之中。所述治疗方法包括基因编辑、转录组编辑或基因治疗。In certain embodiments, the invention relates to the protein of the first aspect, the conjugate of the second aspect, the fusion protein of the third aspect, the isolated of the fourth aspect The nucleic acid molecule, the complex according to the fifth aspect, the isolated nucleic acid molecule according to the sixth aspect, the carrier according to the seventh aspect, the composition according to the ninth aspect, according to the tenth aspect The composition, the kit or delivery composition of the invention, the use in a method of treatment, or the use in the preparation of a formulation for use in a method of treatment. Such methods of treatment include gene editing, transcriptome editing, or gene therapy.
细胞及细胞子代Cell and cell progeny
在某些情况下,由本发明的方法引入到细胞的修饰可以使得细胞和其子代被改变以改进其生物产物(如抗体、淀粉、乙醇或其他期望的细胞输出物)的产生。在某些情况下,由本发明的方法引入到细胞的修饰可以使得细胞和其子代包括使所生产生物产物发生变化的改变。In some cases, modifications introduced to cells by the methods of the invention can cause the cells and their progeny to be altered to improve the production of their biological products, such as antibodies, starch, ethanol, or other desired cellular output. In some cases, modifications introduced into the cell by the methods of the invention can cause the cell and its progeny to include changes that result in a change in the produced biological product.
因此,在另一方面,本发明还涉及如上所述的方法获得的细胞或其子代,其中所述细胞含有在其野生型中不存在的修饰。Thus, in another aspect, the invention relates to a cell or a progeny thereof obtained by the method as described above, wherein the cell contains a modification that is not found in its wild type.
在某些实施方案中,所述修饰导致至少一种RNA产物的转录或翻译发生改变。在某些实施方案中,所述修饰导致至少一种RNA产物的表达增加。在某些实施方案中,所述修饰导致至少一种RNA产物的表达降低。In certain embodiments, the modification results in a change in transcription or translation of at least one RNA product. In certain embodiments, the modification results in increased expression of at least one RNA product. In certain embodiments, the modification results in decreased expression of at least one RNA product.
在某些实施方案中,所述细胞为原核细胞。In certain embodiments, the cell is a prokaryotic cell.
在某些实施方案中,所述细胞为真核细胞。在某些实施方案中,所述细胞是哺乳动 物细胞,例如人类细胞。In certain embodiments, the cell is a eukaryotic cell. In certain embodiments, the cell is a mammalian cell, such as a human cell.
本发明还涉及如上所述的细胞或其子代的细胞产物。The invention also relates to a cell product of a cell or a progeny thereof as described above.
本发明还涉及一种体外的、离体的或体内的细胞或细胞系或它们的子代,所述细胞或细胞系或它们的子代包含:如第一方面所述的蛋白、如第二方面所述的缀合物、如第三方面所述的融合蛋白、如第四方面所述的分离的核酸分子、如第五方面所述的复合物、如第六方面所述的分离的核酸分子、如第七方面所述的载体、如第九方面所述的组合物、如第十方面所述的组合物、本发明的试剂盒或递送组合物。The invention also relates to an in vitro, ex vivo or in vivo cell or cell line or a progeny thereof, the cell or cell line or a progeny thereof comprising: the protein of the first aspect, such as the second The conjugate of the aspect, the fusion protein of the third aspect, the isolated nucleic acid molecule of the fourth aspect, the complex of the fifth aspect, the isolated nucleic acid of the sixth aspect A molecule, a carrier according to the seventh aspect, a composition according to the ninth aspect, a composition according to the tenth aspect, a kit of the invention or a delivery composition.
在某些实施方案中,所述细胞是原核细胞。In certain embodiments, the cell is a prokaryotic cell.
在某些实施方案中,所述细胞是真核细胞。在某些实施方案中,所述细胞是哺乳动物细胞。在某些实施方案中,所述细胞是人类细胞。某些实施方案中,所述细胞是非人哺乳动物细胞,例如非人灵长类动物、牛、羊、猪、犬、猴、兔、啮齿类(如大鼠或小鼠)的细胞。在某些实施方案中,所述细胞是非哺乳动物真核细胞,例如家禽鸟类(如鸡)、鱼类或甲壳动物(如蛤蜊、虾)的细胞。在某些实施方案中,所述细胞是植物细胞,例如单子叶植物或双子叶植物具有的细胞或栽培植物或粮食作物如木薯、玉米、高粱、大豆、小麦、燕麦或水稻具有的细胞,例如藻类、树或生产植物、果实或蔬菜(例如,树类如柑橘树、坚果树;茄属植物、棉花、烟草、番茄、葡萄、咖啡、可可等)。In certain embodiments, the cell is a eukaryotic cell. In certain embodiments, the cell is a mammalian cell. In certain embodiments, the cell is a human cell. In certain embodiments, the cell is a non-human mammalian cell, such as a cell of a non-human primate, cow, sheep, pig, dog, monkey, rabbit, rodent (eg, rat or mouse). In certain embodiments, the cell is a non-mammalian eukaryotic cell, such as a poultry bird (eg, chicken), a fish or a crustacean (eg, scorpion, shrimp) cells. In certain embodiments, the cell is a plant cell, such as a cell possessed by a monocot or a dicot or a cultivated plant or a cell of a food crop such as cassava, corn, sorghum, soybean, wheat, oat or rice, for example Algae, tree or production of plants, fruits or vegetables (for example, trees such as citrus, nut trees; nightshade, cotton, tobacco, tomatoes, grapes, coffee, cocoa, etc.).
在某些实施方案中,所述细胞是干细胞或干细胞系。In certain embodiments, the cell is a stem cell or stem cell line.
术语定义Definition of Terms
在本发明中,除非另有说明,否则本文中使用的科学和技术名词具有本领域技术人员所通常理解的含义。并且,本文中所用的分子遗传学、核酸化学、化学、分子生物学、生物化学、细胞培养、微生物学、细胞生物学、基因组学和重组DNA等操作步骤均为相应领域内广泛使用的常规步骤。同时,为了更好地理解本发明,下面提供相关术语的定义和解释。In the present invention, the scientific and technical terms used herein have the meanings commonly understood by those skilled in the art, unless otherwise stated. Furthermore, the procedures of molecular genetics, nucleic acid chemistry, chemistry, molecular biology, biochemistry, cell culture, microbiology, cell biology, genomics, and recombinant DNA used herein are routine steps widely used in the corresponding fields. . Also, for a better understanding of the present invention, definitions and explanations of related terms are provided below.
在本发明中,表述“Cas13e”是指,本发明人首次发现并鉴定的一种Cas效应蛋白,其具有选自下列的氨基酸序列:In the present invention, the expression "Cas13e" refers to a Cas effector protein first discovered and identified by the present inventors having an amino acid sequence selected from the group consisting of:
(i)SEQ ID NOs:1-7任一项所示的序列;(i) the sequence of any one of SEQ ID NOs: 1-7;
(ii)与SEQ ID NOs:1-7任一项所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in any one of SEQ ID NOs: 1-7 (eg, 1, 2, 3, 4, 5, 6 a sequence of 7, 7 or 9 amino acid substitutions, deletions or additions; or
(iii)与SEQ ID NOs:1-7任一项所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列。(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95% of the sequence set forth in any one of SEQ ID NOs: 1-7 a sequence of at least 96%, at least 97%, at least 98%, or at least 99% sequence identity.
本发明的Cas13e是一种在导向RNA引导下与靶RNA序列特定位点结合并切割的核糖核酸内切酶。The Cas13e of the present invention is an endonuclease which binds to a specific site of a target RNA sequence and cleaves under the guidance of a guide RNA.
如本文中所使用的,术语“规律成簇的间隔短回文重复(CRISPR)-CRISPR-相关(Cas)(CRISPR-Cas)系统”或“CRISPR系统”可互换地使用并且具有本领域技术人员通常理解的含义,其通常包含与CRISPR相关(“Cas”)基因的表达有关的转录产物或其他元件,或者能够指导所述Cas基因活性的转录产物或其他元件。此类转录产物或其他元件可以包含编码Cas效应蛋白的序列和包含CRISPR RNA(crRNA)的导向RNA,以及在CRISPR-Cas9系统中所含有的反式作用crRNA(tracrRNA)序列,或来自CRISPR基因座的其他序列或转录产物。在本发明所述的基于Cas13e的CRISPR系统中,不需要tracrRNA序列。As used herein, the term "regular clustered interspersed short palindrome repeat (CRISPR)-CRISPR-related (Cas) (CRISPR-Cas) system" or "CRISPR system" is used interchangeably and has skill in the art. A person generally understands the meaning, which typically includes a transcription product or other element associated with the expression of a CRISPR-associated ("Cas") gene, or a transcription product or other element capable of directing the activity of the Cas gene. Such transcription products or other elements may comprise a sequence encoding a Cas effector protein and a targeting RNA comprising CRISPR RNA (crRNA), and a trans-acting crRNA (tracrRNA) sequence contained in the CRISPR-Cas9 system, or from a CRISPR locus Other sequences or transcripts. In the Cas13e-based CRISPR system of the present invention, tracrRNA sequences are not required.
如本文中所使用的,术语“Cas效应蛋白”、“Cas效应酶”可互换地使用并且是指,CRISPR-Cas系统中呈现的任一种大于长度900个氨基酸的蛋白质。在某些情况下,这类蛋白是指从Cas基因座中鉴定的蛋白。As used herein, the terms "Cas effector protein", "Cas effector enzyme" are used interchangeably and refer to any of the proteins presented in the CRISPR-Cas system that are greater than 900 amino acids in length. In some cases, such proteins refer to proteins identified from the Cas locus.
如本文中所使用的,术语“导向RNA(guide RNA)”、“成熟crRNA”可互换地使用并且具有本领域技术人员通常理解的含义。一般而言,导向RNA可以包含同向(direct)重复序列和导向序列(guide sequence),或者基本上由或由同向重复序列和导向序列(在内源性CRISPR系统背景下也称为间隔序列(spacer))组成。在某些情况下,导向序列是与靶序列具有足够互补性从而与所述靶序列杂交并引导CRISPR/Cas复合物与所述靶序列的特异性结合的任何多核苷酸序列。在某些实施方案中,当最佳比对时,导向序列与其相应靶序列之间的互补程度为至少50%、至少60%、至少70%、至少80%、至少90%、至少95%、或至少99%。确定最佳比对在本领域的普通技术人员的能力范围内。例如,存在公开和可商购的比对算法和程序,诸如但不限于ClustalW、matlab中的史密斯-沃特曼算法(Smith-Waterman)、Bowtie、Geneious、Biopython以及SeqMan。As used herein, the terms "guide RNA", "mature crRNA" are used interchangeably and have the meaning as commonly understood by one of ordinary skill in the art. In general, the targeting RNA may comprise a direct repeat sequence and a guide sequence, or consist essentially of or consist of a homologous repeat sequence and a guide sequence (also referred to as a spacer sequence in the context of an endogenous CRISPR system). (spacer)) composition. In some cases, the targeting sequence is any polynucleotide sequence that is sufficiently complementary to the target sequence to hybridize to the target sequence and direct the specific binding of the CRISPR/Cas complex to the target sequence. In certain embodiments, when optimally aligned, the degree of complementarity between the targeting sequence and its corresponding target sequence is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, Or at least 99%. Determining the optimal alignment is within the abilities of one of ordinary skill in the art. For example, there are publicly available and commercially available alignment algorithms and programs such as, but not limited to, ClustalW, Smith-Waterman in Matlab, Bowtie, Geneious, Biopython, and SeqMan.
在某些情况下,所述导向序列在长度上为至少5个、至少10个、至少15个、至少16个、至少17个、至少18个、至少19个、至少20个、至少21个、至少22个、至少23个、至少24个、至少25个、至少26个、至少27个、至少28个、至少29个、至少30个、至少35个、至少40个、至少45个或至少50个核苷酸。在某些情况下,所述导 向序列在长度上为不超过50个、45个、40个、35个、30个、25个、24个、23个、22个、21个、20个、15个、10个或更少个核苷酸。In some cases, the targeting sequence is at least 5, at least 10, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21 in length, At least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 35, at least 40, at least 45 or at least 50 Nucleotides. In some cases, the guide sequence is no more than 50, 45, 40, 35, 30, 25, 24, 23, 22, 21, 20, 15 in length. , 10 or fewer nucleotides.
在某些情况下,所述同向重复序列在长度上为至少10个、至少15个、至少16个、至少17个、至少18个、至少19个、至少20个、至少21个、至少22个、至少23个、至少24个、至少25个、至少26个、至少27个、至少28个、至少29个、至少30个、至少35个、至少40个、至少45个、至少50个、至少55个、至少56个、至少57个、至少58个、至少59个、至少60个、至少61个、至少62个、至少63个、至少64个、至少65个或至少70个核苷酸。在某些情况下,所述同向重复序列在长度上为不超过70个、65个、64个、63个、62个、61个、60个、59个、58个、57个、56个、55个、50个、45个、40个、35个、30个、29个、28个、27个、26个、25个、24个、23个、22个、21个、20个、15个、10个或更少个核苷酸。In some cases, the isotropic repeats are at least 10, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22 in length. , at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 35, at least 40, at least 45, at least 50, At least 55, at least 56, at least 57, at least 58, at least 59, at least 60, at least 61, at least 62, at least 63, at least 64, at least 65 or at least 70 nucleotides . In some cases, the same direction repeat sequence is no more than 70, 65, 64, 63, 62, 61, 60, 59, 58, 57, 56 in length. , 55, 50, 45, 40, 35, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 15 , 10 or fewer nucleotides.
如本文中所使用的,术语“CRISPR/Cas复合物”是指,导向RNA(guide RNA)或成熟crRNA与Cas蛋白结合所形成的核糖核蛋白复合体,其包含杂交到靶序列上并且与Cas蛋白结合的导向序列。该核糖核蛋白复合体能够识别并切割能与该导向RNA或成熟crRNA杂交的多核苷酸。As used herein, the term "CRISPR/Cas complex" refers to a ribonucleoprotein complex formed by the binding of a guide RNA or a mature crRNA to a Cas protein, which comprises hybridization to a target sequence and with Cas Protein-directed targeting sequences. The ribonucleoprotein complex is capable of recognizing and cleaving a polynucleotide that hybridizes to the targeting RNA or mature crRNA.
因此,在形成CRISPR/Cas复合物的情况下,“靶序列”是指被设计为具有靶向性的导向序列所靶向的多核苷酸,例如与该导向序列具有互补性的序列,其中靶序列与导向序列之间的杂交将促进CRISPR/Cas复合物的形成。完全互补性不是必需的,只要存在足够互补性以引起杂交并且促进一种CRISPR/Cas复合物的形成即可。在本发明中,所述靶序列是RNA。因此,在本发明中“靶序列”与“靶RNA”可互换使用。所述靶序列可以是任何合适形式的RNA,例如mRNA、tRNA、rRNA、miRNA、siRNA或shRNA。在本发明中,表述“靶序列”可以是对细胞(例如,真核细胞)而言任何内源或外源的RNA序列。在某些情况下,所述靶序列位于细胞的细胞核或细胞质中。在某些情况下,该靶序列可位于真核细胞的一个细胞器例如线粒体或叶绿体内。可被用于整合到包含该靶序列的RNA序列中的序列或模板被称为“RNA模板”。在某些实施方案中,所述RNA模板为外源核酸。Thus, in the context of the formation of a CRISPR/Cas complex, a "target sequence" refers to a polynucleotide that is designed to be targeted by a targeting sequence, such as a sequence that is complementary to the targeting sequence, wherein the target Hybridization between the sequence and the targeting sequence will promote the formation of the CRISPR/Cas complex. Complete complementarity is not required as long as sufficient complementarity exists to cause hybridization and promote the formation of a CRISPR/Cas complex. In the present invention, the target sequence is RNA. Therefore, "target sequence" and "target RNA" are used interchangeably in the present invention. The target sequence can be any suitable form of RNA, such as mRNA, tRNA, rRNA, miRNA, siRNA or shRNA. In the present invention, the expression "target sequence" may be any endogenous or exogenous RNA sequence to a cell (eg, a eukaryotic cell). In some cases, the target sequence is located in the nucleus or cytoplasm of the cell. In some cases, the target sequence can be located in an organelle of a eukaryotic cell, such as a mitochondria or chloroplast. A sequence or template that can be used to integrate into an RNA sequence comprising the target sequence is referred to as an "RNA template." In certain embodiments, the RNA template is an exogenous nucleic acid.
在某些情况下,靶RNA可以是编码基因产物(例如蛋白质)的序列(例如mRNA或pre-mRNA)或非编码序列(例如,ncRNA,lncRNA,tRNA或rRNA)。靶RNA的非限制性实例包括与信号传导生物化学途径相关的序列(例如信号传导生化途径相关RNA)或疾病相关RNA。所述“疾病相关RNA”是指这样的RNA序列,与未患疾病对照的 组织或细胞相比,该RNA序列所产生的翻译产物在来自受疾病影响的组织或细胞中以异常水平或异常形式存在。所述“疾病相关RNA”可以是由表达水平异常升高的基因转录而来,也可以是由表达异常降低的基因转录而来的RNA,其中改变的表达水平与疾病的发生和/或发展相关。疾病相关RNA也指从具有突变或基因变异的基因转录的RNA,其直接负责或与引起疾病病因的基因连锁不平衡。翻译后的产品可能是已知的或未知的,可能处于正常或异常水平。In some cases, the target RNA can be a sequence encoding a gene product (eg, a protein) (eg, mRNA or pre-mRNA) or a non-coding sequence (eg, ncRNA, lncRNA, tRNA or rRNA). Non-limiting examples of target RNA include sequences associated with signaling biochemical pathways (eg, signaling biochemical pathway-associated RNA) or disease-associated RNA. The "disease-associated RNA" refers to an RNA sequence produced by the RNA sequence in an abnormal level or abnormal form in a tissue or cell affected by the disease, compared to a tissue or a cell not having a disease control. presence. The "disease-associated RNA" may be transcribed from a gene having an abnormally elevated expression level, or may be an RNA transcribed from a gene having abnormally decreased expression, wherein the altered expression level is related to the occurrence and/or development of the disease. . Disease-associated RNA also refers to RNA transcribed from a gene having a mutation or gene mutation that is directly responsible for or is in linkage disequilibrium with the gene causing the disease. The translated product may be known or unknown and may be at a normal or abnormal level.
在某些情况下,所述靶RNA可以包含干扰RNA(即,存于RNA干扰通路的RNA,例如shRNA、siRNA等)。在某些实施方案中,所述靶RNA是microRNA(miRNA)。In some cases, the target RNA can comprise interfering RNA (ie, RNA present in the RNA interference pathway, eg, shRNA, siRNA, etc.). In certain embodiments, the target RNA is a microRNA (miRNA).
如本文中所使用的,术语“野生型”具有本领域技术人员通常理解的含义,其表示生物、菌株、基因的典型形式或者当它在自然界存在时区别于突变体或变体形式的特征,其可从自然中的来源分离并且没有被人为有意地修饰。As used herein, the term "wild type" has the meaning commonly understood by those skilled in the art to mean a typical form of a organism, a strain, a gene, or a feature that distinguishes it from a mutant or variant when it exists in nature. It can be isolated from sources in nature and not intentionally modified.
如本文中所使用的,术语“非天然存在的”或“工程化的”可互换地使用并且表示人工的参与。当这些术语用于描述核酸分子或多肽时,其表示该核酸分子或多肽至少基本上从它们在自然界中或如发现于自然界中的与其结合的至少另一种组分游离出来。As used herein, the terms "non-naturally occurring" or "engineered" are used interchangeably and refer to artificial participation. When these terms are used to describe a nucleic acid molecule or polypeptide, it is meant that the nucleic acid molecule or polypeptide is at least substantially freed from at least one other component of its association in nature or as found in nature.
如本文中所使用的,术语“直系同源物(orthologue,ortholog)”具有本领域技术人员通常理解的含义。作为进一步指导,如本文中所述的蛋白质的“直系同源物”是指属于不同物种的蛋白质,该蛋白质执行与作为其直系同源物的蛋白相同或相似的功能。As used herein, the term "orthologue, ortholog" has the meaning as commonly understood by one of ordinary skill in the art. As a further guide, an "ortholog" of a protein as referred to herein refers to a protein belonging to a different species that performs the same or similar function as a protein that is an ortholog thereof.
如本文中所使用的,术语“同一性”用于指两个多肽之间或两个核酸之间序列的匹配情况。当两个进行比较的序列中的某个位置都被相同的碱基或氨基酸单体亚单元占据时(例如,两个DNA分子的每一个中的某个位置都被腺嘌呤占据,或两个多肽的每一个中的某个位置都被赖氨酸占据),那么各分子在该位置上是同一的。两个序列之间的“百分数同一性”是由这两个序列共有的匹配位置数目除以进行比较的位置数目×100的函数。例如,如果两个序列的10个位置中有6个匹配,那么这两个序列具有60%的同一性。例如,DNA序列CTGACT和CAGGTT共有50%的同一性(总共6个位置中有3个位置匹配)。通常,在将两个序列比对以产生最大同一性时进行比较。这样的比对可通过使用,例如,可通过计算机程序例如Align程序(DNAstar,Inc.)方便地进行的Needleman等人(1970)J.Mol.Biol.48:443-453的方法来实现。还可使用已整合入ALIGN程序(版本2.0)的E.Meyers和W.Miller(Comput.Appl Biosci.,4:11-17(1988))的算法,使用PAM120权重残基表(weight residue table)、12 的缺口长度罚分和4的缺口罚分来测定两个氨基酸序列之间的百分数同一性。此外,可使用已整合入GCG软件包(可在www.gcg.com上获得)的GAP程序中的Needleman和Wunsch(J MoI Biol.48:444-453(1970))算法,使用Blossum 62矩阵或PAM250矩阵以及16、14、12、10、8、6或4的缺口权重(gap weight)和1、2、3、4、5或6的长度权重来测定两个氨基酸序列之间的百分数同一性。As used herein, the term "identity" is used to mean the matching of sequences between two polypeptides or between two nucleic acids. When a position in the two sequences being compared is occupied by the same base or amino acid monomer subunit (for example, a position in each of the two DNA molecules is occupied by adenine, or two Each position in each of the polypeptides is occupied by lysine, and then each molecule is identical at that position. The "percent identity" between the two sequences is a function of the number of matching positions shared by the two sequences divided by the number of positions to be compared x 100. For example, if 6 of the 10 positions of the two sequences match, then the two sequences have 60% identity. For example, the DNA sequences CTGACT and CAGGTT share 50% identity (3 out of a total of 6 positions match). Typically, the comparison is made when the two sequences are aligned to produce maximum identity. Such alignment can be achieved by, for example, the method of Needleman et al. (1970) J. Mol. Biol. 48: 443-453, which can be conveniently performed by a computer program such as the Align program (DNAstar, Inc.). It is also possible to use the algorithm of E. Meyers and W. Miller (Comput. Appl Biosci., 4: 11-17 (1988)) integrated into the ALIGN program (version 2.0), using the PAM 120 weight residue table. The gap length penalty of 12 and the gap penalty of 4 were used to determine the percent identity between the two amino acid sequences. In addition, the Needleman and Wunsch (J MoI Biol. 48: 444-453 (1970)) algorithms in the GAP program integrated into the GCG software package (available at www.gcg.com) can be used, using the Blossum 62 matrix or The PAM250 matrix and the gap weight of 16, 14, 12, 10, 8, 6 or 4 and the length weight of 1, 2, 3, 4, 5 or 6 to determine the percent identity between two amino acid sequences .
如本文中所使用的,术语“载体”是指,可将多聚核苷酸插入其中的一种核酸运载工具。当载体能使插入的多核苷酸编码的蛋白获得表达时,载体称为表达载体。载体可以通过转化,转导或者转染导入宿主细胞,使其携带的遗传物质元件在宿主细胞中获得表达。载体是本领域技术人员公知的,包括但不限于:质粒;噬菌粒;柯斯质粒;人工染色体,例如酵母人工染色体(YAC)、细菌人工染色体(BAC)或P1来源的人工染色体(PAC);噬菌体如λ噬菌体或M13噬菌体及动物病毒等。可用作载体的动物病毒包括但不限于,逆转录酶病毒(包括慢病毒)、腺病毒、腺相关病毒、疱疹病毒(如单纯疱疹病毒)、痘病毒、杆状病毒、乳头瘤病毒、乳头多瘤空泡病毒(如SV40)。一种载体可以含有多种控制表达的元件,包括但不限于,启动子序列、转录起始序列、增强子序列、选择元件及报告基因。另外,载体还可含有复制起始位点。The term "vector," as used herein, refers to a nucleic acid vehicle into which a polynucleotide can be inserted. A vector is referred to as an expression vector when the vector enables expression of the protein encoded by the inserted polynucleotide. The vector can be introduced into the host cell by transformation, transduction or transfection, and the genetic material element carried thereby can be expressed in the host cell. Vectors are well known to those skilled in the art and include, but are not limited to, plasmids; phagemids; cosmids; artificial chromosomes, such as yeast artificial chromosomes (YAC), bacterial artificial chromosomes (BAC), or P1 derived artificial chromosomes (PAC). Phage such as lambda phage or M13 phage and animal virus. Animal viruses useful as vectors include, but are not limited to, retroviruses (including lentiviruses), adenoviruses, adeno-associated viruses, herpes viruses (such as herpes simplex virus), poxviruses, baculoviruses, papillomaviruses, nipples Multi-tumor vacuolar virus (such as SV40). A vector may contain a variety of elements that control expression, including, but not limited to, promoter sequences, transcription initiation sequences, enhancer sequences, selection elements, and reporter genes. In addition, the vector may also contain an origin of replication.
如本文中所使用的,术语“宿主细胞”是指,可用于导入载体的细胞,其包括但不限于,如大肠杆菌或枯草菌等的原核细胞,如酵母细胞或曲霉菌等的真菌细胞,如S2果蝇细胞或Sf9等的昆虫细胞,或者如纤维原细胞,CHO细胞,COS细胞,NSO细胞,HeLa细胞,BHK细胞,HEK 293细胞或人细胞等的动物细胞。As used herein, the term "host cell" refers to a cell that can be used to introduce a vector, including, but not limited to, a prokaryotic cell such as Escherichia coli or Bacillus subtilis, such as a fungal cell such as a yeast cell or an Aspergillus. For example, S2 Drosophila cells or insect cells such as Sf9, or animal cells such as fibroblasts, CHO cells, COS cells, NSO cells, HeLa cells, BHK cells, HEK 293 cells or human cells.
本领域技术人员将理解,表达载体的设计可取决于诸如待转化的宿主细胞的选择、所希望的表达水平等因素。一种载体可以被引入到宿主细胞中而由此产生转录物、蛋白质、或肽,包括由如本文所述的蛋白、融合蛋白、分离的核酸分子等(例如,CRISPR转录物,如核酸转录物、蛋白质、或酶)。One of skill in the art will appreciate that the design of the expression vector can depend on factors such as the choice of host cell to be transformed, the level of expression desired, and the like. A vector can be introduced into a host cell to thereby produce a transcript, protein, or peptide, including a protein, fusion protein, isolated nucleic acid molecule, etc. as described herein (eg, a CRISPR transcript, such as a nucleic acid transcript) , protein, or enzyme).
如本文中所使用的,术语“调节元件”旨在包括启动子、增强子、内部核糖体进入位点(IRES)、和其他表达控制元件(例如转录终止信号,如多聚腺苷酸化信号和多聚U序列),其详细描述可参考戈德尔(Goeddel),《基因表达技术:酶学方法》(GENE EXPRE SSION TECHNOLOGY:METHODS IN ENZYMOLOGY)185,学术出版社(Academic Press),圣地亚哥(San Diego),加利福尼亚州(1990)。在某些情况下,调节元件包括指导一个核苷酸序列在许多类型的宿主细胞中的组成型表达的那些序列以及指导该核苷酸序 列只在某些宿主细胞中表达的那些序列(例如,组织特异型调节序列)。组织特异型启动子可主要指导在感兴趣的期望组织中的表达,所述组织例如肌肉、神经元、骨、皮肤、血液、特定的器官(例如肝脏、胰腺)、或特殊的细胞类型(例如淋巴细胞)。在某些情况下,调节元件还可以时序依赖性方式(如以细胞周期依赖性或发育阶段依赖性方式)指导表达,该方式可以是或者可以不是组织或细胞类型特异性的。在某些情况下,术语“调节元件”涵盖的是增强子元件,如WPRE;CMV增强子;在HTLV-I的LTR中的R-U5’片段((Mol.Cell.Biol.,第8(1)卷,第466-472页,1988);SV40增强子;以及在兔β-珠蛋白的外显子2与3之间的内含子序列(Proc.Natl.Acad.Sci.USA.,第78(3)卷,第1527-31页,1981)。As used herein, the term "regulatory element" is intended to include promoters, enhancers, internal ribosome entry sites (IRES), and other expression control elements (eg, transcription termination signals, such as polyadenylation signals and Poly U sequence), a detailed description can be found in Goeddel, GENE EXPRE SSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego ), California (1990). In certain instances, regulatory elements include those sequences that direct constitutive expression of a nucleotide sequence in a plurality of types of host cells, as well as those sequences that direct expression of the nucleotide sequence only in certain host cells (eg, Tissue-specific regulatory sequence). Tissue-specific promoters can primarily direct expression in a desired tissue of interest, such as muscle, neurons, bone, skin, blood, specific organs (eg, liver, pancreas), or specific cell types (eg, Lymphocytes). In some cases, the regulatory elements may also direct expression in a time-dependent manner (eg, in a cell cycle dependent or developmental stage dependent manner), which may or may not be tissue or cell type specific. In some cases, the term "regulatory element" encompasses enhancer elements such as WPRE; CMV enhancer; R-U5' fragment in LTR of HTLV-I ((Mol. Cell. Biol., 8th ( 1) Vol., pp. 466-472, 1988); SV40 enhancer; and intron sequence between exons 2 and 3 of rabbit β-globin (Proc. Natl. Acad. Sci. USA., Vol. 78(3), pp. 1527-31, 1981).
如本文中所使用的,术语“启动子”具有本领域技术人员公知的含义,其是指一段位于基因的上游能启动下游基因表达的非编码核苷酸序列。组成型(constitutive)启动子是这样的核苷酸序列:当其与编码或者限定基因产物的多核苷酸可操作地相连时,在细胞的大多数或者所有生理条件下,其导致细胞中基因产物的产生。诱导型启动子是这样的核苷酸序列,当可操作地与编码或者限定基因产物的多核苷酸相连时,基本上只有当对应于所述启动子的诱导物在细胞中存在时,其导致所述基因产物在细胞内产生。组织特异性启动子是这样的核苷酸序列:当可操作地与编码或者限定基因产物的多核苷酸相连时,基本上只有当细胞是该启动子对应的组织类型的细胞时,其才导致在细胞中产生基因产物。As used herein, the term "promoter" has the meaning well-known to those skilled in the art and refers to a non-coding nucleotide sequence located upstream of the gene that initiates expression of the downstream gene. A constitutive promoter is a nucleotide sequence that, when operably linked to a polynucleotide encoding or defining a gene product, results in a gene product in the cell under most or all physiological conditions of the cell. The production. An inducible promoter is a nucleotide sequence that, when operably linked to a polynucleotide encoding or defining a gene product, results in substantially only when an inducer corresponding to the promoter is present in the cell The gene product is produced intracellularly. A tissue-specific promoter is a nucleotide sequence that, when operably linked to a polynucleotide encoding or defining a gene product, is substantially only caused when the cell is a cell of the tissue type corresponding to the promoter Gene products are produced in the cells.
如本文中所使用的,术语“可操作地连接”旨在表示感兴趣的核苷酸序列以一种允许该核苷酸序列的表达的方式被连接至该一种或多种调节元件(例如,处于一种体外转录/翻译系统中或当该载体被引入到宿主细胞中时,处于该宿主细胞中)。As used herein, the term "operably linked" is intended to mean that a nucleotide sequence of interest is linked to the one or more regulatory elements in a manner that allows expression of the nucleotide sequence (eg, In an in vitro transcription/translation system or in the host cell when the vector is introduced into a host cell).
如本文中所使用的,术语“互补性”是指核酸与另一个核酸序列借助于传统的沃森-克里克或其他非传统类型形成一个或多个氢键的能力。互补百分比表示一个核酸分子中可与一个第二核酸序列形成氢键(例如,沃森-克里克碱基配对)的残基的百分比(例如,10个之中有5、6、7、8、9、10个即为50%、60%、70%、80%、90%、和100%互补)。“完全互补”表示一个核酸序列的所有连续残基与一个第二核酸序列中的相同数目的连续残基形成氢键。如本文使用的“基本上互补”是指在一个具有8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、30、35、40、45、50个或更多个核苷酸的区域上至少为60%、65%、70%、75%、80%、85%、90%、95%、97%、98%、99%、或100%的互补程度,或者是指在严格条件下杂交的两个核酸。As used herein, the term "complementarity" refers to the ability of a nucleic acid to form one or more hydrogen bonds with another nucleic acid sequence by means of conventional Watson-Crick or other non-traditional types. Percent complement indicates the percentage of residues in a nucleic acid molecule that can form a hydrogen bond (eg, Watson-Crick base pairing) with a second nucleic acid sequence (eg, 5, 6, 7, 8 out of 10) 9, 10, that is 50%, 60%, 70%, 80%, 90%, and 100% complementary). "Completely complementary" means that all contiguous residues of one nucleic acid sequence form a hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. As used herein, "substantially complementary" means having 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, At least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98 in the region of 30, 35, 40, 45, 50 or more nucleotides %, 99%, or 100% complementarity, or two nucleic acids that hybridize under stringent conditions.
如本文中所使用的,对于杂交的“严格条件”是指与靶序列具有互补性的一个核酸主要地与该靶序列杂交并且基本上不杂交到非靶序列上的条件。严格条件通常是序列依赖性的,并且取决于许多因素而变化。一般而言,该序列越长,则该序列特异性地杂交到其靶序列上的温度就越高。严格条件的非限制性实例描述于蒂森(Tijssen)(1993)的《生物化学和分子生物学中的实验室技术-核酸探针杂交》(Laboratory Techniques In Biochemi stryAnd Molecular Biology-Hybridization With Nucleic Acid Probes),第I部分,第二章,“杂交原理概述和核酸探针分析策略”(“Overview of principles of hybridization and the strategy of nucleic acid probe assay”),爱思唯尔(Elsevier),纽约。As used herein, "stringent conditions" for hybridization refers to conditions under which a nucleic acid that is complementary to a target sequence primarily hybridizes to the target sequence and does not substantially hybridize to a non-target sequence. Stringent conditions are usually sequence dependent and vary depending on many factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in "Technology Techniques In Biochemi stry And Molecular Biology-Hybridization With Nucleic Acid Probes" by Tijssen (1993). ), Part I, Chapter 2, "Overview of principles of hybridization and the strategy of nucleic acid probe assay", Elsevier, New York.
如本文中所使用的,术语“杂交”是指其中一个或多个多核苷酸反应形成一种复合物的反应,该复合物经由这些核苷酸残基之间的碱基的氢键键合而稳定化。氢键键合可以借助于沃森-克里克碱基配对、Hoogstein结合或以任何其他序列特异性方式而发生。该复合物可包含形成一个双链体的两条链、形成多链复合物的三条或多条链、单个自我杂交链、或这些的任何组合。杂交反应可以构成一个更广泛的过程(如PCR的开始、或经由一种酶的多核苷酸的切割)中的一个步骤。能够与一个给定序列杂交的序列被称为该给定序列的“互补物”。As used herein, the term "hybridization" refers to a reaction in which one or more polynucleotides react to form a complex that hydrogen bonds through the bases between these nucleotide residues. And stabilized. Hydrogen bonding can occur by means of Watson-Crick base pairing, Hoogstein binding or in any other sequence specific manner. The complex may comprise two chains forming one duplex, three or more chains forming a multi-strand complex, a single self-hybridizing strand, or any combination of these. The hybridization reaction can constitute a step in a broader process, such as the initiation of PCR, or the cleavage of a polynucleotide via an enzyme. A sequence that is capable of hybridizing to a given sequence is referred to as the "complement" of the given sequence.
如本文中所使用的,术语“表达”是指,藉此从DNA模板转录成多核苷酸(如转录成mRNA或其他RNA转录物)的过程和/或转录的mRNA随后藉此翻译成肽、多肽或蛋白质的过程。转录物和编码的多肽可以总称为“基因产物”。如果多核苷酸来源于基因组DNA,表达可以包括真核细胞中mRNA的剪接。As used herein, the term "expression" refers to a process by which a DNA template is transcribed into a polynucleotide (eg, transcribed into mRNA or other RNA transcript) and/or transcribed mRNA, which is subsequently translated into a peptide, The process of a polypeptide or protein. The transcripts and encoded polypeptides may be collectively referred to as "gene products." If the polynucleotide is derived from genomic DNA, expression can include splicing of mRNA in eukaryotic cells.
如本文中所使用的,术语“接头”是指,由多个氨基酸残基通过肽键连接形成的线性多肽。本发明的接头可以为人工合成的氨基酸序列,或天然存在的多肽序列,例如具有铰链区功能的多肽。此类接头多肽是本领域众所周知的(参见例如,Holliger,P.等人(1993)Proc.Natl.Acad.Sci.USA 90:6444-6448;Poljak,R.J.等人(1994)Structure 2:1121-1123)。As used herein, the term "linker" refers to a linear polypeptide formed by the joining of multiple amino acid residues by peptide bonds. The linker of the invention may be a synthetic amino acid sequence, or a naturally occurring polypeptide sequence, such as a polypeptide having the function of a hinge region. Such linker polypeptides are well known in the art (see, for example, Holliger, P. et al. (1993) Proc. Natl. Acad. Sci. USA 90:6444-6448; Poljak, RJ et al. (1994) Structure 2: 1121- 1123).
如本文中所使用的,术语“治疗”是指,治疗或治愈病症,延缓病症的症状的发作,和/或延缓病症的发展。As used herein, the term "treating" refers to treating or curing a condition, delaying the onset of symptoms of the condition, and/or delaying the progression of the condition.
如本文中所使用的,术语“受试者”包括但不限于各种动物,例如哺乳动物,例如牛科动物、马科动物、羊科动物、猪科动物、犬科动物、猫科动物、兔科动物、啮齿类动物(例如,小鼠或大鼠)、非人灵长类动物(例如,猕猴或食蟹猴)或人。As used herein, the term "subject" includes, but is not limited to, various animals, such as mammals, such as bovine, equine, ovine, porcine, canine, feline, A rabbit, a rodent (eg, a mouse or rat), a non-human primate (eg, a macaque or a cynomolgus monkey) or a human.
发明的有益效果Advantageous effects of the invention
本发明的Cas效应蛋白及系统能够高效的进行RNA编辑、检测活体病毒、RNA干扰、基因选择性剪切、荧光原位杂交等,并在基因转录水平实现对基因的调控。这将为基因组工程和生物技术的新应用提供重要资源。The Cas effector protein and system of the invention can efficiently perform RNA editing, detecting living virus, RNA interference, gene selective scission, fluorescence in situ hybridization, etc., and realizing regulation of genes at the level of gene transcription. This will provide important resources for new applications in genomic engineering and biotechnology.
下面将结合附图和实施例对本发明的实施方案进行详细描述,但是本领域技术人员将理解,下列附图和实施例仅用于说明本发明,而不是对本发明的范围的限定。根据附图和优选实施方案的下列详细描述,本发明的各种目的和有利方面对于本领域技术人员来说将变得显然。The embodiments of the present invention will be described in detail below with reference to the accompanying drawings and embodiments. The various objects and advantageous aspects of the invention will be apparent to those skilled in the <
附图说明DRAWINGS
图1显示了cas13e蛋白的两个HEPN功能域。Figure 1 shows the two HEPN domains of the cas13e protein.
图2显示了cas13e的repeat序列的二级结构。A,B,C,D,E,F,G分别是cas13e1.1,cas13e1.2,cas13e1.3,cas13e1.4,cas13e1.5,cas13e2.1,cas13e2.2的repeat二级结构。Figure 2 shows the secondary structure of the repeat sequence of cas13e. A, B, C, D, E, F, and G are the repeat secondary structures of cas13e1.1, cas13e1.2, cas13e1.3, cas13e1.4, cas13e1.5, cas13e2.1, and cas13e2.2, respectively.
图3显示了cas13e1.3蛋白对pre-crRNA在细菌体内加工活性。Figure 3 shows the activity of cas13e1.3 protein on the processing of pre-crRNA in bacteria.
图4显示了cas13e1.3蛋白对pre-crRNA在体外的加工活性。Figure 4 shows the in vitro processing activity of cas13e1.3 protein on pre-crRNA.
序列信息Sequence information
本发明涉及的部分序列的信息提供于下面的表1中。Information on the partial sequences involved in the present invention is provided in Table 1 below.
表1:序列的描述Table 1: Description of the sequence
SEQ ID NO:SEQ ID NO: 描述 description
11 Cas13e1.1的氨基酸序列Amino acid sequence of Cas13e1.1
22 Cas13e1.2的氨基酸序列Amino acid sequence of Cas13e1.2
33 Cas13e1.3的氨基酸序列Amino acid sequence of Cas13e1.3
44 Cas13e1.4的氨基酸序列Amino acid sequence of Cas13e1.4
55 Cas13e1.5的氨基酸序列Amino acid sequence of Cas13e1.5
66 Cas13e2.1的氨基酸序列Amino acid sequence of Cas13e2.1
77 Cas13e2.2的氨基酸序列Amino acid sequence of Cas13e2.2
88 Cas13e1.1的编码核酸序列Encoding nucleic acid sequence of Cas13e1.1
99 Cas13e1.2的编码核酸序列Encoding nucleic acid sequence of Cas13e1.2
1010 Cas13e1.3的编码核酸序列Encoding nucleic acid sequence of Cas13e1.3
1111 Cas13e1.4的编码核酸序列Encoding nucleic acid sequence of Cas13e1.4
1212 Cas13e1.5的编码核酸序列Encoding nucleic acid sequence of Cas13e1.5
1313 Cas13e2.1的编码核酸序列Encoding nucleic acid sequence of Cas13e2.1
1414 Cas13e2.2的编码核酸序列Encoding nucleic acid sequence of Cas13e2.2
1515 Cas13e1.1/原型同向重复序列Cas13e1.1/prototype repeating sequence
1616 Cas13e1.2/原型同向重复序列Cas13e1.2/prototype repeating sequence
1717 Cas13e1.3/原型同向重复序列Cas13e1.3/prototype repeating sequence
1818 Cas13e1.4/原型同向重复序列Cas13e1.4/prototype repeating sequence
1919 Cas13e1.5/原型同向重复序列Cas13e1.5/prototype repeating sequence
2020 Cas13e2.1/原型同向重复序列Cas13e2.1/prototype repeating sequence
21twenty one Cas13e2.2/原型同向重复序列Cas13e2.2/prototype repeating sequence
22twenty two NLS的氨基酸序列NLS amino acid sequence
23twenty three Cas13e1.1-NLS融合蛋白的氨基酸序列Amino acid sequence of Cas13e1.1-NLS fusion protein
24twenty four Cas13e1.2-NLS融合蛋白的氨基酸序列Amino acid sequence of Cas13e1.2-NLS fusion protein
2525 Cas13e1.3-NLS融合蛋白的氨基酸序列Amino acid sequence of Cas13e1.3-NLS fusion protein
2626 Cas13e1.4-NLS融合蛋白的氨基酸序列Amino acid sequence of Cas13e1.4-NLS fusion protein
2727 Cas13e1.5-NLS融合蛋白的氨基酸序列Amino acid sequence of Cas13e1.5-NLS fusion protein
2828 Cas13e2.1-NLS融合蛋白的氨基酸序列Amino acid sequence of Cas13e2.1-NLS fusion protein
2929 Cas13e2.2-NLS融合蛋白的氨基酸序列Amino acid sequence of Cas13e2.2-NLS fusion protein
3030 Cas13e1.1/CRISPR阵列Cas13e1.1/CRISPR Array
3131 Cas13e1.2/CRISPR阵列Cas13e1.2/CRISPR Array
3232 Cas13e1.3/CRISPR阵列Cas13e1.3/CRISPR Array
3333 Cas13e1.4/CRISPR阵列Cas13e1.4/CRISPR array
3434 Cas13e1.5/CRISPR阵列Cas13e1.5/CRISPR Array
3535 Cas13e2.1/CRISPR阵列Cas13e2.1/CRISPR array
3636 Cas13e2.2/CRISPR阵列Cas13e2.2/CRISPR Array
具体实施方式detailed description
现参照下列意在举例说明本发明(而非限定本发明)的实施例来描述本发明。The invention is described with reference to the following examples which are intended to illustrate, but not limit the invention.
除非特别指明,否则基本上按照本领域内熟知的以及在各种参考文献中描述的常规方法进行实施例中描述的实验和方法。例如,本发明中所使用的免疫学、生物化学、化学、分子生物学、微生物学、细胞生物学、基因组学和重组DNA等常规技术,可参见参见萨姆布鲁克(Sambrook)、弗里奇(Fritsch)和马尼亚蒂斯(Maniatis),《分子克隆:实验室手册》(MOLECULAR CLONING:A LABORATORY MANUAL),第2次编辑(1989);《当代分子生物学实验手册》(CURRENT  PROTOCOLS IN MOLECULAR BIOLOGY)(F.M.奥苏贝尔(F.M.Ausubel)等人编辑,(1987));《酶学方法》(METHODS IN ENZYMOLOGY)系列(学术出版公司):《PCR 2:实用方法》(PCR 2:A PRACTICAL APPROACH)(M.J.麦克弗森(M.J.MacPherson)、B.D.黑姆斯(B.D.Hames)和G.R.泰勒(G.R.Taylor)编辑(1995))、哈洛(Harlow)和拉内(Lane)编辑(1988)《抗体:实验室手册》(ANTIBODIES,ALABORATORY MANUAL),以及《动物细胞培养》(ANIMAL CELLCULTURE)(R.I.弗雷谢尼(R.I.Freshney)编辑(1987))。The experiments and methods described in the Examples are carried out essentially according to conventional methods well known in the art and described in various references, unless otherwise indicated. For example, conventional techniques such as immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics, and recombinant DNA used in the present invention can be found in Sambrook, Fry ( Fritsch) and Maniatis, MOLECULAR CLONING: A LABORATORY MANUAL, 2nd edition (1989); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY) (FMAubel et al., (1987)); METHOS IN ENZYMOLOGY series (academic publishing company): "PCR 2: Practical Methods" (PCR 2: A PRACTICAL) APPROACH) (MJ MacPherson, BD Hames, and GR Taylor (1995)), Harlow, and Lane Editor (1988) : ANTIBODIES, ALABORATORY MANUAL, and ANIMAL CELLCULTURE (RI Freshney, eds. (1987)).
另外,实施例中未注明具体条件者,按照常规条件或制造商建议的条件进行。所用试剂或仪器未注明生产厂商者,均为可以通过市购获得的常规产品。本领域技术人员知晓,实施例以举例方式描述本发明,且不意欲限制本发明所要求保护的范围。本文中提及的全部公开案和其他参考资料以其全文通过引用合并入本文。In addition, those which do not specify the specific conditions in the examples are carried out according to the conventional conditions or the conditions recommended by the manufacturer. The reagents or instruments used are not indicated by the manufacturer, and are conventional products that can be obtained commercially. The invention is described by way of example, and is not intended to limit the scope of the invention. All publications and other references mentioned herein are incorporated by reference in their entirety.
除非特别指明,以下实施例中涉及的序列合成均由南京金斯瑞生物科技有限公司完成,涉及的测序均由上海英骏生物技术有限公司完成。Unless otherwise specified, the sequence synthesis involved in the following examples was completed by Nanjing Kingsray Biotechnology Co., Ltd., and the sequencing involved was completed by Shanghai Yingjun Biotechnology Co., Ltd.
实施例1. Cas13e蛋白和Cas13e导向RNA的鉴定Example 1. Identification of Cas13e protein and Cas13e targeting RNA
首先下载NCBI细菌基因组数据以及JGI宏基因组数据。用pilercr1.06软件搜索包含CRISPR序列的Contig,然后对该contig用MetaGeneMark软件预测蛋白,筛选>=800个氨基酸的蛋白。使用Hmmsearch方法对蛋白功能注释,排除已知功能的cas蛋白以及在nr数据库有明显功能域注释的蛋白(E-value<=e-10)。mcl软件对保留下的蛋白进行家族分类。First download NCBI bacterial genomic data and JGI metagenomic data. Contig containing the CRISPR sequence was searched using the primercr 1.06 software, and then the protein was predicted using the MetaGeneMark software for the contig, and a protein of >=800 amino acids was screened. The function of the protein was annotated using the Hmmsearch method, excluding the cas protein of known function and the protein with significant functional domain annotation in the nr database (E-value<=e-10). The mcl software classifies the proteins retained.
在此基础上,本发明人获得并鉴定了一种全新的Cas效应蛋白家族,即Cas13e,其可分为两个亚家族,即cas13e.1和cas13e.2。家族蛋白序列分别命名为Cas13e1.1(SEQ ID NO:1)、Cas13e1.2(SEQ ID NO:2)、Cas13e1.3(SEQ ID NO:3)、Cas13e1.4(SEQ ID NO:4)、Cas13e1.5(SEQ ID NO:5)、Cas13e2.1(SEQ ID NO:6)、Cas13e2.2(SEQ ID NO:7),编码这7个蛋白的DNA分别如SEQ ID NOs:8-14所示。这7种蛋白所对应的原型同向重复序列(原型repeat序列,pre-crRNA中所含有的repeat序列)分别如SEQ ID NOs:15-21所示。通过将蛋白进行多重序列比对,分析含有“RxxxxH”的功能域,从而获得上述7种cas13e蛋白的两个HEPN结构域(图1)。进一步,通过ViennaRNA软件分析获得上述7种cas13e蛋白的原型repeat序列的二级结构(图2)。On this basis, the inventors obtained and identified a novel family of Cas effector proteins, namely Cas13e, which can be divided into two subfamilies, namely cas13e.1 and cas13e.2. The family protein sequences are designated Cas13e1.1 (SEQ ID NO: 1), Cas13e1.2 (SEQ ID NO: 2), Cas13e1.3 (SEQ ID NO: 3), Cas13e1.4 (SEQ ID NO: 4), Cas13e1.5 (SEQ ID NO: 5), Cas13e2.1 (SEQ ID NO: 6), Cas13e2.2 (SEQ ID NO: 7), the DNA encoding these seven proteins are as shown in SEQ ID NOs: 8-14, respectively. Show. The prototype homologous repeat sequences corresponding to the seven proteins (prototype repeat sequence, the repeat sequence contained in the pre-crRNA) are shown in SEQ ID NOs: 15-21. The two HEPN domains of the above seven cas13e proteins were obtained by performing multiple sequence alignment of the proteins to analyze the functional domains containing "RxxxxH" (Fig. 1). Further, the secondary structure of the prototype repeat sequence of the above seven cas13e proteins was obtained by analysis by Vienna RNA software (Fig. 2).
实施例2. Cas13e蛋白对pre-crRNA的加工Example 2. Processing of pre-crRNA by Cas13e protein
2.1 Cas13e1.3蛋白对pre-crRNA在大肠杆菌内的加工活性:2.1 Cas13e1.3 protein processing activity of pre-crRNA in E. coli:
1、人工合成SEQ ID NO:10所示的双链DNA分子,同时人工合成编码SEQ ID NO:32(CRISPR阵列:repeat+spacer1+repeat+spacer2+repeat)的双链DNA分子。1. A double-stranded DNA molecule of SEQ ID NO: 10 is artificially synthesized, and a double-stranded DNA molecule encoding SEQ ID NO: 32 (CRISPR array: repeat + spacer 1 + repeat + spacer + repeat) is artificially synthesized.
2、将步骤1中合成的双链DNA分子与原核表达载体pACYC-Duet-1连接,得到重组质粒pACYC-Duet-1-CRISPR/cas13e1.3。对重组质粒pACYC-Duet-1-CRISPR/cas13e1.3进行一代测序以确认包含步骤1所述的DNA序列。2. The double-stranded DNA molecule synthesized in the step 1 was ligated to the prokaryotic expression vector pACYC-Duet-1 to obtain a recombinant plasmid pACYC-Duet-1-CRISPR/cas13e1.3. The recombinant plasmid pACYC-Duet-1-CRISPR/cas13e1.3 was subjected to one-generation sequencing to confirm the DNA sequence described in the step 1.
3、将重组质粒pACYC-Duet-1-CRISPR/cas13e1.3导入大肠杆菌EC100,得到重组菌,将该重组菌命名为EC100-CRISPR/cas13e1.3。3. The recombinant plasmid pACYC-Duet-1-CRISPR/cas13e1.3 was introduced into Escherichia coli EC100 to obtain a recombinant strain, and the recombinant strain was named EC100-CRISPR/cas13e1.3.
4、取EC100-CRISPR/cas13e1.3的单克隆,接种至100mL LB液体培养基(含50μg/mL氨苄霉素),37℃、200rpm振荡培养12h,得到培养菌液。4. The monoclonal clone of EC100-CRISPR/cas13e1.3 was taken, inoculated into 100 mL of LB liquid medium (containing 50 μg/mL ampicillin), and cultured at 37 ° C, shaking at 200 rpm for 12 hours to obtain a culture bacterial solution.
5、提取细菌RNA:转移1.5mL细菌培养物到预冷微量离心管中,在4℃,6000×g,离心5分钟。离心后,弃上清液,将细胞沉淀重新悬浮于预热至95℃的200μL MaxBacterial Enhancement Reagent中,吹吸混匀混合。95℃孵育4分钟。向溶解产物中加入1mL
Figure PCTCN2019084340-appb-000001
并吹吸混匀,室温下孵育5分钟。加入0.2mL冷氯仿,用手摇动管混合15秒,室温下孵育2-3分钟。4℃,12,000×g离心15分钟。取600μL上清于新管中,加入0.5mL冷异丙醇沉淀RNA,颠倒混匀,室温下孵育10分钟。4℃下以15,000×g离心10分钟,弃上清,加入1mL 75%乙醇,涡旋混匀。4℃,7500×g离心5分钟,弃上清,空气风干。将RNA沉淀溶解到50μL RNase-free water中,在60℃下孵育10分钟。
5. Extraction of bacterial RNA: Transfer 1.5 mL of the bacterial culture to a pre-chilled microcentrifuge tube, centrifuge at 6,000 x g for 5 minutes at 4 °C. After centrifugation, the supernatant was discarded, and the cell pellet was resuspended in 200 μL MaxBacterial Enhancement Reagent preheated to 95 ° C, and mixed by pipetting. Incubate at 95 ° C for 4 minutes. Add 1 mL to the lysate
Figure PCTCN2019084340-appb-000001
Mix well by pipetting and incubate for 5 minutes at room temperature. Add 0.2 mL of cold chloroform, mix by shaking the tube for 15 seconds, and incubate for 2-3 minutes at room temperature. Centrifuge at 12,000 x g for 15 minutes at 4 °C. 600 μL of the supernatant was taken in a new tube, and 0.5 mL of cold isopropanol was added to precipitate RNA, which was mixed by inversion and incubated at room temperature for 10 minutes. Centrifuge at 15,000 x g for 10 minutes at 4 ° C, discard the supernatant, add 1 mL of 75% ethanol, and vortex to mix. Centrifuge at 7500 x g for 5 minutes at 4 ° C, discard the supernatant, and air dry. The RNA pellet was dissolved in 50 μL of RNase-free water and incubated at 60 ° C for 10 minutes.
6、DNA的消化:20ugRNA溶解到39.5μL dH 2O,65℃,5min。冰上5min,加入0.5μL RNAI,5μL buffer,5μL DNaseI,37℃ 45min(50μL体系)。加50μL dH 2O,调整体积到100μL。2mL Phase-Lock tube 16000g离心30s后,加100μL酚:氯仿:异戊醇(25:24:1)、100μL消化的RNA,摇15s,15℃,16000g离心12min。取上清于一个新的1.5mL离心管中,加入与上清等体积的异丙醇1/10NaoAC,反应1h或-20℃过夜。4℃,16000g离心30min,弃上清。加350μL 75%乙醇洗涤沉淀,4℃,16000g离心10min,弃上清。晾干,加入20μL RNase-free water,65℃,5min溶解沉淀。NanoDrop测浓度,跑胶。 6. Digestion of DNA: 20 ug of RNA was dissolved in 39.5 μL dH 2 O at 65 ° C for 5 min. On ice for 5 min, 0.5 μL of RNAI, 5 μL of buffer, 5 μL of DNase I, and 37 ° C for 45 min (50 μL system) were added. Add 50 μL dH 2 O and adjust the volume to 100 μL. After centrifugation of 2 mL Phase-Lock tube at 16000 g for 30 s, 100 μL of phenol:chloroform:isoamyl alcohol (25:24:1), 100 μL of digested RNA was added, shaken for 15 s, centrifuged at 16000 g for 12 min at 15 ° C. The supernatant was taken in a new 1.5 mL centrifuge tube, and an equal volume of isopropanol 1/10 NaoAC was added to the supernatant for 1 h or -20 ° C overnight. Centrifuge at 16000 g for 30 min at 4 ° C, discard the supernatant. The precipitate was washed by adding 350 μL of 75% ethanol, centrifuged at 16000 g for 10 min at 4 ° C, and the supernatant was discarded. Dry, add 20 μL of RNase-free water, 65 ° C, and dissolve the precipitate for 5 min. NanoDrop measures the concentration and runs the glue.
7、3’脱磷酸化及5’磷酸化:将消化的RNA~20μg,各加水至42.5μL,90℃ 2min。 冰上冷却5min。加5μL 10×T4 PNK buffer,0.5μL RNaI,2μL T4PNK(50μL),37℃ 6h。加1μL T4PNK,1.25μL(100mM)ATP,37℃ 1h。加47.75μL dH 2O,调整体积到100μL。2mL Phase-Lock tube 16000g离心30s后,加100μL酚:氯仿:异戊醇(25:24:1)、100μL消化的RNA,摇15s,15℃,16000g离心12min。取上清于一个新的1.5mL离心管中,加与上清等体积的异丙醇,总体积1/10NaoAC,反应1h或-20℃过夜。4℃,16000g离心30min,弃上清。加350μL 75%乙醇洗涤沉淀,4℃,16000g离心10min,弃上清。晾干,加入21μL RNase-free water,65℃,5min溶解沉淀,NanoDrop测浓度。 7, 3' dephosphorylation and 5' phosphorylation: ~20 μg of digested RNA, each added to 42.5 μL, 90 ° C for 2 min. Cool on ice for 5 min. 5 μL of 10×T4 PNK buffer, 0.5 μL of RNaI, 2 μL of T4PNK (50 μL), and 37 ° C for 6 h were added. Add 1 μL of T4PNK, 1.25 μL (100 mM) ATP, 1 hour at 37 °C. Add 47.75 μL dH 2 O and adjust the volume to 100 μL. After centrifugation of 2 mL Phase-Lock tube at 16000 g for 30 s, 100 μL of phenol:chloroform:isoamyl alcohol (25:24:1), 100 μL of digested RNA was added, shaken for 15 s, centrifuged at 16000 g for 12 min at 15 ° C. The supernatant was taken in a new 1.5 mL centrifuge tube, and an equal volume of isopropanol was added to the supernatant in a total volume of 1/10 NaoAC for 1 h or -20 ° C overnight. Centrifuge at 16000 g for 30 min at 4 ° C, discard the supernatant. The precipitate was washed by adding 350 μL of 75% ethanol, centrifuged at 16000 g for 10 min at 4 ° C, and the supernatant was discarded. Dry, add 21 μL of RNase-free water, 65 ° C, dissolve the precipitate for 5 min, and measure the concentration by NanoDrop.
8、RNA单磷酸化:20μL RNA,90℃ 1min,冰上冷却5min。加入2μL RNA 5’Polphosphatase 10×Reaction buffer,0.5μL Inhibitor,1μL RNA 5’Polphosphatase(20Units),加RNase-free water至20μL,37℃ 60min。加80μL dH 2O,调整体积到100μL。2mL Phase-Lock tube16000g离心30s后,加100μL酚:氯仿:异戊醇(25:24:1)、100μL消化的RNA,摇15s,15℃,16000g离心12min。取上清于一个新的1.5mL离心管中,加与上清等体积的异丙醇,总体积1/10NaoAC,反应1h或-20℃过夜。4℃,16000g离心30min,弃上清,加350μL 75%乙醇洗涤沉淀,4℃,16000g离心10min,弃上清。晾干,加入21μL RNase-free water,65℃,5min溶解沉淀,NanoDrop测浓度。 8. RNA monophosphorylation: 20 μL of RNA, 1 min at 90 ° C, and cooled on ice for 5 min. 2 μL of RNA 5'Polphosphatase 10×Reaction buffer, 0.5 μL of Inhibitor, 1 μL of RNA 5'Polphosphatase (20 Units), RNase-free water to 20 μL, and 37 ° C for 60 min were added. Add 80 μL dH 2 O and adjust the volume to 100 μL. After centrifugation of 2 mL Phase-Lock tube 16000 g for 30 s, 100 μL of phenol:chloroform:isoamyl alcohol (25:24:1), 100 μL of digested RNA was added, shaken for 15 s, centrifuged at 16000 g for 12 min at 15 ° C. The supernatant was taken in a new 1.5 mL centrifuge tube, and an equal volume of isopropanol was added to the supernatant in a total volume of 1/10 NaoAC for 1 h or -20 ° C overnight. After centrifugation at 16000 g for 30 min at 4 ° C, the supernatant was discarded, and the precipitate was washed by adding 350 μL of 75% ethanol, centrifuged at 16000 g for 10 min at 4 ° C, and the supernatant was discarded. Dry, add 21 μL of RNase-free water, 65 ° C, dissolve the precipitate for 5 min, and measure the concentration by NanoDrop.
9、cDNA文库的准备:16.5μL RNase-free water。5μL Poly(A)Polymerase 10×Reaction buffer。5μL 10mM ATP。1.5μL RiboGuard RNase Inhibitor。20μL RNA Substrate。2μL Poly(A)Polymerase(4Units)。50μL总体积。37℃ 20min。加50μL dH 2O,调整体积到100μL。2mL Phase-Lock tube16000g离心30s后,加100μL酚:氯仿:异戊醇(25:24:1)、100μL消化的RNA,摇15s,15℃,16000g离心12min。取上清于一个新的1.5mL离心管中,加与上清等体积的异丙醇,总体积1/10NaoAC,反应1h或-20℃过夜。4℃,16000g离心30min,弃上清,晾干,加入11μL RNase-free water,65℃,5min溶解沉淀,NanoDrop测浓度。 9. Preparation of cDNA library: 16.5 μL RNase-free water. 5 μL Poly(A) Polymerase 10×Reaction buffer. 5 μL of 10 mM ATP. 1.5 μL RiboGuard RNase Inhibitor. 20 μL RNA Substrate. 2 μL Poly(A) Polymerase (4 Units). 50 μL total volume. 37 ° C for 20 min. Add 50 μL dH 2 O and adjust the volume to 100 μL. After centrifugation of 2 mL Phase-Lock tube 16000 g for 30 s, 100 μL of phenol:chloroform:isoamyl alcohol (25:24:1), 100 μL of digested RNA was added, shaken for 15 s, centrifuged at 16000 g for 12 min at 15 ° C. The supernatant was taken in a new 1.5 mL centrifuge tube, and an equal volume of isopropanol was added to the supernatant in a total volume of 1/10 NaoAC for 1 h or -20 ° C overnight. After centrifugation at 16000 g for 30 min at 4 ° C, the supernatant was discarded, air-dried, 11 μL of RNase-free water was added, and the precipitate was dissolved at 65 ° C for 5 min, and the concentration was measured by NanoDrop.
10、将cDNA文库加上测序接头后送至北京贝瑞合康进行测序。10. The cDNA library was added to the sequencing linker and sent to Beijing Berry Hekang for sequencing.
11、对原始数据进行质量过滤,去除碱基平均质量值低于30的序列。对序列去除接头后,保留25nt到50nt的RNA序列,用bowtie将其比对到CRISPR阵列的参考序列上。结果如图3所示,峰形图为二代测序序列比对CRISPR座的结构,垂直线所示为酶切位点,灰色长方形为Repeat结构示意图,深灰色菱形为间隔序列结构示意图。通过 Cas13e1.3比对结果得到的酶切位点信息,可以得知Cas13e1.3的pre-crRNA能够在大肠杆菌体内被Cas13e1.3成功加工成成熟crRNA,其由27nt的Repeat序列和32nt的spacer序列构成;使用ViennaRNA和VARNA对成熟的crRNA进行结构预测和可视化分析,结果显示,crRNA的Repeat序列(成熟Repeat序列)的3’端可以形成一个18个碱基大小的颈环。11. Perform mass filtering on the raw data to remove sequences with a base average mass value below 30. After removing the linker from the sequence, the RNA sequence of 25 nt to 50 nt was retained and aligned to the reference sequence of the CRISPR array using bowtie. The results are shown in Fig. 3. The peak shape is the structure of the second-generation sequencing sequence alignment of the CRISPR block, the vertical line shows the enzyme cleavage site, the gray rectangle is the Repeat structure diagram, and the dark gray diamond shape is the spacer sequence structure diagram. Through the cleavage site information obtained by Cas13e1.3 alignment, it can be known that the pre-crRNA of Cas13e1.3 can be successfully processed into mature crRNA by Cas13e1.3 in E. coli, which consists of 27 nt Repeat sequence and 32 nt spacer. Sequence construction; structural prediction and visual analysis of mature crRNA using ViennaRNA and VARNA showed that the 3' end of the Repeat sequence of the crRNA (mature Repeat sequence) can form a 18-base-sized neck ring.
2.2 Cas13e1.3蛋白对pre-crRNA的体外加工活性:2.2 In vitro processing activity of Cas13e1.3 protein on pre-crRNA:
1、pre-crRNA的转录:1. Transcription of pre-crRNA:
设计precrRNA转录模板的引物,转录模板的结构为:T7启动子+Cas13e1.3的CRISPR阵列(SEQ ID NO:32),引物的设计使用Primer5.0软件,保证上下游引物有至少18bp的重叠序列;使用引物退火程序获得双链DNA模板。使用MinElute PCR Purifcation Kit进行模板的纯化,用Nanodrop测定浓度,冻存-20℃备用。使用NEB的HiScribe T7高效RNA合成试剂盒进行RNA转录,设置PCR反应程序为:37℃/3h或31℃/forever,加入DNAseI,37℃/45min。Primers for designing a precrRNA transcript template. The structure of the transcription template is: T7 promoter + CRISPR array of Cas13e1.3 (SEQ ID NO: 32). The primers were designed using Primer 5.0 software to ensure that the upstream and downstream primers have overlapping sequences of at least 18 bp. A double stranded DNA template was obtained using a primer annealing procedure. The template was purified using a MinElute PCR Purifcation Kit, and the concentration was measured with Nanodrop, and stored at -20 ° C until use. RNA transcription was performed using NEB's HiScribe T7 high-efficiency RNA synthesis kit, and the PCR reaction procedure was set to: 37 ° C / 3 h or 31 ° C / forever, DNAseI was added, 37 ° C / 45 min.
2、precrRNA的纯化:2. Purification of precrRNA:
使用混合液酚:氯仿:异戊醇(25:24:1)抽提去除体系内的DNAseI;跑胶并从聚丙烯酰胺凝胶中纯化pre-crRNA,使用ZYMO RESEARCH的ZR Small-RNATM PAGE Recovery Kit试剂盒纯化回收pre-crRNA,用Nanodrop测定浓度,-80℃冻存备用。Use the mixture phenol: chloroform: isoamyl alcohol (25:24:1) to remove DNAseI from the system; run the gel and purify the pre-crRNA from the polyacrylamide gel, using ZR Small-RNATM PAGE Recovery from ZYMO RESEARCH The kit was purified and recovered for pre-crRNA, and the concentration was measured with Nanodrop, and stored at -80 ° C for use.
3、建立体外酶切体系3. Establish an in vitro enzyme digestion system
(1)配置如下反应体系,轻轻吹打混匀后短暂离心。置于37℃,1h;(1) Configure the following reaction system, gently pipette and mix briefly and then centrifuge briefly. Placed at 37 ° C, 1 h;
Figure PCTCN2019084340-appb-000002
Figure PCTCN2019084340-appb-000002
RNA Cleavage Buffer(可配置成10*母液):40mM Tris-HCl(pH=8.0),50mM KCl,0.5mM TECP,10mM MgCl 2RNA Cleavage Buffer (10 * may be configured liquor): 40mM Tris-HCl (pH = 8.0), 50mM KCl, 0.5mM TECP, 10mM MgCl 2.
此外,为了验证Cas13e1.3是否为EDTA敏感型蛋白,还设计了EDTA组,该组向反应体系中加入EDTA。In addition, in order to verify whether Cas13e1.3 is an EDTA-sensitive protein, an EDTA group was also designed, which added EDTA to the reaction system.
(2)向以上反应体系中加入10μl 2×RNA loading dye,置于98℃,3min。反应结束后立即置于冰上2min;(2) To the above reaction system, 10 μl of 2× RNA loading dye was added and placed at 98 ° C for 3 min. Immediately after the end of the reaction, it was placed on ice for 2 min;
(3)10%TBE-Urea聚丙烯酰胺凝胶上样孔中上样10μl,150V,40min;(3) 10% TBE-Urea polyacrylamide gel loading well 10μl, 150V, 40min;
(4)在1×TBE电泳缓冲液中加入SYBR-Gold核酸凝胶染料,置入凝胶,室温染10min后扫胶。(4) The SYBR-Gold nucleic acid gel dye was added to 1×TBE running buffer, placed in a gel, and stained for 10 minutes at room temperature.
电泳结果如图4所示,Cas13e1.3能够在体外切割pre-crRNA,并且不属于EDTA敏感型蛋白。The electrophoresis results are shown in Figure 4. Cas13e1.3 is capable of cleaving pre-crRNA in vitro and is not an EDTA-sensitive protein.
尽管本发明的具体实施方式已经得到详细的描述,但本领域技术人员将理解:根据已经公布的所有教导,可以对细节进行各种修改和变动,并且这些改变均在本发明的保护范围之内。本发明的全部分为由所附权利要求及其任何等同物给出。While the invention has been described in detail, it will be understood by the . The invention is generally divided into the appended claims and any equivalents thereof.

Claims (69)

  1. 一种蛋白,其具有SEQ ID NOs:1-7任一项所示的氨基酸序列或其直系同源物(ortholog)、同源物、变体或功能性片段;其中,所述直系同源物、同源物、变体或功能性片段基本保留了其所源自的序列的生物学功能;A protein having the amino acid sequence set forth in any one of SEQ ID NOs: 1-7, or an ortholog, homolog, variant or functional fragment thereof; wherein the ortholog a homologue, variant or functional fragment substantially retains the biological function of the sequence from which it is derived;
    例如,所述直系同源物、同源物、变体与其所源自的序列相比具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性;For example, the ortholog, homolog, variant has at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94 compared to the sequence from which it is derived. %, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述直系同源物、同源物、变体与SEQ ID NOs:1-7任一项所示的序列相比具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性,并且基本保留了其所源自的序列的生物学功能;For example, the ortholog, homolog, variant has at least 80%, at least 85%, at least 90%, at least 91%, at least compared to the sequence set forth in any one of SEQ ID NOs: 1-7. 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity and substantially retains the biological function of the sequence from which it is derived ;
    例如,所述蛋白是CRISPR/Cas系统中的效应蛋白。For example, the protein is an effector protein in the CRISPR/Cas system.
  2. 权利要求1所述的蛋白,其包含选自下列的序列,或由选自下列的序列组成:The protein of claim 1 comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NOs:1-7任一项所示的序列;(i) the sequence of any one of SEQ ID NOs: 1-7;
    (ii)与SEQ ID NOs:1-7任一项所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in any one of SEQ ID NOs: 1-7 (eg, 1, 2, 3, 4, 5, 6 a sequence of 7, 7 or 9 amino acid substitutions, deletions or additions; or
    (iii)与SEQ ID NOs:1-7任一项所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列;(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95% of the sequence set forth in any one of SEQ ID NOs: 1-7 a sequence of at least 96%, at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述蛋白具有SEQ ID NOs:1-7任一项所示的氨基酸序列。For example, the protein has the amino acid sequence set forth in any one of SEQ ID NOs: 1-7.
  3. 权利要求1或2所述的蛋白,其包含选自下列的序列,或由选自下列的序列组成:The protein according to claim 1 or 2, which comprises a sequence selected from the following or consists of a sequence selected from the group consisting of:
    (i)SEQ ID NO:1所示的序列;(i) the sequence shown as SEQ ID NO: 1;
    (ii)与SEQ ID NO:1所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 1 (eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
    (iii)与SEQ ID NO:1所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列;(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, of the sequence set forth in SEQ ID NO: a sequence of at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述蛋白具有SEQ ID NO:1所示的氨基酸序列。For example, the protein has the amino acid sequence set forth in SEQ ID NO: 1.
  4. 权利要求1或2所述的蛋白,其包含选自下列的序列,或由选自下列的序列组成:The protein according to claim 1 or 2, which comprises a sequence selected from the following or consists of a sequence selected from the group consisting of:
    (i)SEQ ID NO:2所示的序列;(i) the sequence shown as SEQ ID NO: 2;
    (ii)与SEQ ID NO:2所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 2 (eg, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
    (iii)与SEQ ID NO:2所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列;(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, of the sequence set forth in SEQ ID NO: a sequence of at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述蛋白具有SEQ ID NO:2所示的氨基酸序列。For example, the protein has the amino acid sequence set forth in SEQ ID NO: 2.
  5. 权利要求1或2所述的蛋白,其包含选自下列的序列,或由选自下列的序列组成:The protein according to claim 1 or 2, which comprises a sequence selected from the following or consists of a sequence selected from the group consisting of:
    (i)SEQ ID NO:3所示的序列;(i) the sequence shown as SEQ ID NO:3;
    (ii)与SEQ ID NO:3所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) having one or more amino acid substitutions, deletions or additions compared to the sequence set forth in SEQ ID NO: 3 (eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
    (iii)与SEQ ID NO:3所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列;(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, and the sequence set forth in SEQ ID NO: a sequence of at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述蛋白具有SEQ ID NO:3所示的氨基酸序列。For example, the protein has the amino acid sequence set forth in SEQ ID NO:3.
  6. 权利要求1或2所述的蛋白,其包含选自下列的序列,或由选自下列的序列组成:The protein according to claim 1 or 2, which comprises a sequence selected from the following or consists of a sequence selected from the group consisting of:
    (i)SEQ ID NO:4所示的序列;(i) the sequence shown as SEQ ID NO:4;
    (ii)与SEQ ID NO:4所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 4 (eg, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
    (iii)与SEQ ID NO:4所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列;(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, of the sequence set forth in SEQ ID NO: a sequence of at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述蛋白具有SEQ ID NO:4所示的氨基酸序列。For example, the protein has the amino acid sequence set forth in SEQ ID NO:4.
  7. 权利要求1或2所述的蛋白,其包含选自下列的序列,或由选自下列的序列组成:The protein according to claim 1 or 2, which comprises a sequence selected from the following or consists of a sequence selected from the group consisting of:
    (i)SEQ ID NO:5所示的序列;(i) the sequence shown as SEQ ID NO:5;
    (ii)与SEQ ID NO:5所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 5 (eg, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
    (iii)与SEQ ID NO:5所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列;(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, and the sequence set forth in SEQ ID NO: a sequence of at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述蛋白具有SEQ ID NO:5所示的氨基酸序列。For example, the protein has the amino acid sequence set forth in SEQ ID NO:5.
  8. 权利要求1或2所述的蛋白,其包含选自下列的序列,或由选自下列的序列组成:The protein according to claim 1 or 2, which comprises a sequence selected from the following or consists of a sequence selected from the group consisting of:
    (i)SEQ ID NO:6所示的序列;(i) the sequence shown as SEQ ID NO:6;
    (ii)与SEQ ID NO:6所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 6 (eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
    (iii)与SEQ ID NO:6所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列;(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, of the sequence set forth in SEQ ID NO: a sequence of at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述蛋白具有SEQ ID NO:6所示的氨基酸序列。For example, the protein has the amino acid sequence set forth in SEQ ID NO: 6.
  9. 权利要求1或2所述的蛋白,其包含选自下列的序列,或由选自下列的序列组成:The protein according to claim 1 or 2, which comprises a sequence selected from the following or consists of a sequence selected from the group consisting of:
    (i)SEQ ID NO:7所示的序列;(i) the sequence shown as SEQ ID NO:7;
    (ii)与SEQ ID NO:7所示的序列相比具有一个或多个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加)的序列;或(ii) a substitution, deletion or addition of one or more amino acids compared to the sequence set forth in SEQ ID NO: 7 (eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 9 or 10 amino acid substitutions, deletions or additions; or
    (iii)与SEQ ID NO:7所示的序列具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性的序列;(iii) having at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, and the sequence set forth in SEQ ID NO: a sequence of at least 97%, at least 98%, or at least 99% sequence identity;
    例如,所述蛋白具有SEQ ID NO:7所示的氨基酸序列。For example, the protein has the amino acid sequence set forth in SEQ ID NO: 7.
  10. 一种缀合物,其包含权利要求1-9任一项所述的蛋白以及修饰部分;A conjugate comprising the protein of any one of claims 1-9 and a modified moiety;
    例如,所述修饰部分选自另外的蛋白或多肽、可检测的标记,及其任意组合;For example, the modified moiety is selected from additional proteins or polypeptides, detectable labels, and any combination thereof;
    例如,所述修饰部分任选地通过接头连接至所述蛋白的N端或C端;For example, the modified moiety is optionally linked to the N-terminus or C-terminus of the protein via a linker;
    例如,所述修饰部分融合至所述蛋白的N端或C端;For example, the modified moiety is fused to the N-terminus or C-terminus of the protein;
    例如,所述另外的蛋白或多肽选自表位标签、报告基因序列、核定位信号(NLS)序列、靶向部分、转录激活结构域(例如,VP64)、转录抑制结构域(例如,KRAB结构域或SID结构域)、核酸酶结构域(例如,Fok1),具有选自下列的活性的结构域:甲基化酶活性,去甲基化酶,转录激活活性,转录抑制活性,转录释放因子活性,组蛋白修饰活性,核酸酶活性,单链RNA切割活性,双链RNA切割活性,单链DNA切割活性,双链DNA切割活性和核酸结合活性;以及其任意组合;For example, the additional protein or polypeptide is selected from the group consisting of an epitope tag, a reporter gene sequence, a nuclear localization signal (NLS) sequence, a targeting moiety, a transcriptional activation domain (eg, VP64), a transcriptional repression domain (eg, a KRAB structure) a domain or SID domain), a nuclease domain (eg, Fok1), having a domain selected from the group consisting of methylase activity, demethylase, transcriptional activation activity, transcriptional repression activity, transcriptional release factor Activity, histone modification activity, nuclease activity, single-strand RNA cleavage activity, double-stranded RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA cleavage activity and nucleic acid binding activity; and any combination thereof;
    例如,所述缀合物包含表位标签;For example, the conjugate comprises an epitope tag;
    例如,所述缀合物包含NLS序列;For example, the conjugate comprises an NLS sequence;
    例如,所述NLS序列如SEQ ID NO:22所示;For example, the NLS sequence is set forth in SEQ ID NO:22;
    例如,所述NLS序列位于、靠近或接近所述蛋白的末端(例如,N端或C端)。For example, the NLS sequence is located at, near, or near the end of the protein (eg, the N-terminus or the C-terminus).
  11. 一种融合蛋白,其包含权利要求1-9任一项所述的蛋白以及另外的蛋白或多肽;A fusion protein comprising the protein of any one of claims 1-9 and an additional protein or polypeptide;
    例如,所述另外的蛋白或多肽任选地通过接头连接至所述蛋白的N端或C端;For example, the additional protein or polypeptide is optionally linked to the N-terminus or C-terminus of the protein via a linker;
    例如,所述另外的蛋白或多肽选自表位标签、报告基因序列、核定位信号(NLS)序列、靶向部分、转录激活结构域(例如,VP64)、转录抑制结构域(例如,KRAB结 构域或SID结构域)、核酸酶结构域(例如,Fok1),具有选自下列的活性的结构域:甲基化酶活性,去甲基化酶,转录激活活性,转录抑制活性,转录释放因子活性,组蛋白修饰活性,核酸酶活性,单链RNA切割活性,双链RNA切割活性,单链DNA切割活性,双链DNA切割活性和核酸结合活性;以及其任意组合;For example, the additional protein or polypeptide is selected from the group consisting of an epitope tag, a reporter gene sequence, a nuclear localization signal (NLS) sequence, a targeting moiety, a transcriptional activation domain (eg, VP64), a transcriptional repression domain (eg, a KRAB structure) a domain or SID domain), a nuclease domain (eg, Fok1), having a domain selected from the group consisting of methylase activity, demethylase, transcriptional activation activity, transcriptional repression activity, transcriptional release factor Activity, histone modification activity, nuclease activity, single-strand RNA cleavage activity, double-stranded RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA cleavage activity and nucleic acid binding activity; and any combination thereof;
    例如,所述融合蛋白包含表位标签;For example, the fusion protein comprises an epitope tag;
    例如,所述融合蛋白包含NLS序列;For example, the fusion protein comprises an NLS sequence;
    例如,所述NLS序列如SEQ ID NO:22所示;For example, the NLS sequence is set forth in SEQ ID NO:22;
    例如,所述NLS序列位于、靠近或接近所述蛋白的末端(例如,N端或C端);For example, the NLS sequence is located at, near, or near the end of the protein (eg, the N-terminus or the C-terminus);
    例如,所述融合蛋白具有选自下列的氨基酸序列:SEQ ID NOs:23-29。For example, the fusion protein has an amino acid sequence selected from the group consisting of SEQ ID NOs: 23-29.
  12. 一种分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:An isolated nucleic acid molecule comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NOs:15-21任一项所示的序列;(i) the sequence set forth in any one of SEQ ID NOs: 15-21.
    (ii)与SEQ ID NOs:15-21任一项所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions (for example, 1, 2, 3, 4, 5, compared to the sequence shown in any one of SEQ ID NOs: 15-21) a sequence of 6, 7, 8, or 10 base substitutions, deletions, or additions;
    (iii)与SEQ ID NOs:15-21任一项所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90% of the sequence set forth in any one of SEQ ID NOs: 15-21. a sequence of at least 95% sequence identity;
    (iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
    (v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
    并且,(ii)-(v)中任一项所述的序列基本保留了其所源自的序列的生物学功能;Furthermore, the sequence of any one of (ii)-(v) substantially retains the biological function of the sequence from which it is derived;
    例如,所述核酸分子包含一个或多个茎环或优化的二级结构;For example, the nucleic acid molecule comprises one or more stem loops or an optimized secondary structure;
    例如,(ii)-(v)中任一项所述的序列保留了其所源自的序列的二级结构;For example, the sequence of any of (ii)-(v) retains the secondary structure of the sequence from which it is derived;
    例如,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:For example, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
    (a)SEQ ID NOs:15-21任一项所示的核苷酸序列;(a) the nucleotide sequence shown in any one of SEQ ID NOs: 15-21.
    (b)在严格条件下与(a)中所述的序列杂交的序列;或(b) a sequence that hybridizes under stringent conditions to the sequence described in (a); or
    (c)(a)中所述的序列的互补序列;(c) the complement of the sequence described in (a);
    例如,所述分离的核酸分子是RNA;For example, the isolated nucleic acid molecule is RNA;
    例如,所述分离的核酸分子是CRISPR/Cas系统中的同向重复序列。For example, the isolated nucleic acid molecule is a homologous repeat in the CRISPR/Cas system.
  13. 权利要求12所述的分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:The isolated nucleic acid molecule of claim 12 comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NO:15所示的序列;(i) the sequence of SEQ ID NO: 15;
    (ii)与SEQ ID NO:15所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) a substitution, deletion or addition of one or more bases compared to the sequence shown in SEQ ID NO: 15 (eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
    (iii)与SEQ ID NO:15所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
    (iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
    (v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
    例如,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:For example, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
    (a)SEQ ID NO:15所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 15;
    (b)在严格条件下与(a)中所述的序列杂交的序列;或(b) a sequence that hybridizes under stringent conditions to the sequence described in (a); or
    (c)SEQ ID NO:15所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 15.
  14. 权利要求12所述的分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:The isolated nucleic acid molecule of claim 12 comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NO:16所示的序列;(i) the sequence shown as SEQ ID NO:16;
    (ii)与SEQ ID NO:16所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions compared to the sequence shown in SEQ ID NO: 16 (for example, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
    (iii)与SEQ ID NO:16所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
    (iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
    (v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
    例如,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:For example, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
    (a)SEQ ID NO:16所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 16;
    (b)在严格条件下与(a)中所述的序列杂交的序列;或(b) a sequence that hybridizes under stringent conditions to the sequence described in (a); or
    (c)SEQ ID NO:16所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 16.
  15. 权利要求12所述的分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:The isolated nucleic acid molecule of claim 12 comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NO:17所示的序列;(i) the sequence shown as SEQ ID NO:17;
    (ii)与SEQ ID NO:17所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions compared to the sequence shown in SEQ ID NO: 17 (for example, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
    (iii)与SEQ ID NO:17所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
    (iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
    (v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
    例如,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:For example, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
    (a)SEQ ID NO:17所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 17;
    (b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
    (c)SEQ ID NO:17所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 17.
  16. 权利要求12所述的分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:The isolated nucleic acid molecule of claim 12 comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NO:18所示的序列;(i) the sequence shown as SEQ ID NO:18;
    (ii)与SEQ ID NO:18所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) a substitution, deletion or addition of one or more bases compared to the sequence shown in SEQ ID NO: 18 (eg 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
    (iii)与SEQ ID NO:18所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: 18. Sequence of sequence identity;
    (iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
    (v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
    例如,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:For example, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
    (a)SEQ ID NO:18所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 18;
    (b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
    (c)SEQ ID NO:18所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 18.
  17. 权利要求12所述的分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:The isolated nucleic acid molecule of claim 12 comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NO:19所示的序列;(i) the sequence shown as SEQ ID NO:19;
    (ii)与SEQ ID NO:19所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions compared to the sequence shown in SEQ ID NO: 19 (for example, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
    (iii)与SEQ ID NO:19所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
    (iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
    (v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
    例如,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:For example, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
    (a)SEQ ID NO:19所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO: 19;
    (b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
    (c)SEQ ID NO:19所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 19.
  18. 权利要求12所述的分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:The isolated nucleic acid molecule of claim 12 comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NO:20所示的序列;(i) the sequence shown as SEQ ID NO:20;
    (ii)与SEQ ID NO:20所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) having one or more base substitutions, deletions or additions compared to the sequence shown in SEQ ID NO: 20 (for example, 1, 2, 3, 4, 5, 6, 7) a sequence of 8, 8 or 10 base substitutions, deletions or additions;
    (iii)与SEQ ID NO:20所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO: Sequence of sequence identity;
    (iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
    (v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
    例如,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:For example, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
    (a)SEQ ID NO:20所示的核苷酸序列;(a) the nucleotide sequence shown in SEQ ID NO:20;
    (b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
    (c)SEQ ID NO:20所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO: 20.
  19. 权利要求12所述的分离的核酸分子,其包含选自下列的序列,或由选自下列的序列组成:The isolated nucleic acid molecule of claim 12 comprising a sequence selected from the group consisting of: or consisting of a sequence selected from the group consisting of:
    (i)SEQ ID NO:21所示的序列;(i) the sequence shown as SEQ ID NO:21.
    (ii)与SEQ ID NO:21所示的序列相比具有一个或多个碱基的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个碱基的置换、缺失或添加)的序列;(ii) a substitution, deletion or addition of one or more bases compared to the sequence shown in SEQ ID NO: 21 (for example, 1, 2, 3, 4, 5, 6, 7 a sequence of 8, 8 or 10 base substitutions, deletions or additions;
    (iii)与SEQ ID NO:21所示的序列具有至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%的序列同一性的序列;(iii) having at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% of the sequence set forth in SEQ ID NO:21. Sequence of sequence identity;
    (iv)在严格条件下与(i)-(iii)任一项中所述的序列杂交的序列;或(iv) a sequence that hybridizes under stringent conditions to the sequence set forth in any one of (i)-(iii); or
    (v)(i)-(iii)任一项中所述的序列的互补序列;(v) the complement of the sequence set forth in any of (i)-(iii);
    例如,所述核酸分子包含选自下列的序列,或由选自下列的序列组成:For example, the nucleic acid molecule comprises a sequence selected from the group consisting of: or consists of a sequence selected from the group consisting of:
    (a)SEQ ID NO:21所示的核苷酸序列;(a) the nucleotide sequence shown as SEQ ID NO: 21;
    (b)在严格条件下与(a)中所述的序列杂交的序列;(b) a sequence that hybridizes under stringent conditions to the sequence described in (a);
    (c)SEQ ID NO:21所示的核苷酸序列的互补序列。(c) the complement of the nucleotide sequence shown by SEQ ID NO:21.
  20. 一种复合物,其包含:A composite comprising:
    (i)蛋白组分,其选自:权利要求1-9任一项所述的蛋白、权利要求10所述的缀合物、权利要求11所述的融合蛋白,及其任意组合;和(i) a protein component selected from the group consisting of: the protein of any one of claims 1-9, the conjugate of claim 10, the fusion protein of claim 11, and any combination thereof;
    (ii)核酸组分,其包含权利要求12-19任一项所述的分离的核酸分子和能够与靶序列杂交的导向序列,(ii) a nucleic acid component comprising the isolated nucleic acid molecule of any one of claims 12-19 and a targeting sequence capable of hybridizing to the target sequence,
    其中,所述蛋白组分与核酸组分相互结合形成复合物;Wherein the protein component and the nucleic acid component are combined with each other to form a complex;
    例如,所述导向序列连接于所述核酸分子的3’端或5’端;For example, the targeting sequence is ligated to the 3' or 5' end of the nucleic acid molecule;
    例如,所述导向序列包含所述靶序列的互补序列;For example, the targeting sequence comprises the complement of the target sequence;
    例如,所述核酸组分是CRISPR/Cas系统中的导向RNA;For example, the nucleic acid component is a targeting RNA in a CRISPR/Cas system;
    例如,所述核酸分子是RNA;For example, the nucleic acid molecule is RNA;
    例如,所述复合物不包含tracrRNA。For example, the complex does not comprise tracrRNA.
  21. 权利要求20所述的复合物,其包含:The composite of claim 20 comprising:
    (i)蛋白组分,其选自:权利要求3所述的蛋白、包含所述蛋白的缀合物或融合蛋白;以及(i) a protein component selected from the group consisting of the protein of claim 3, a conjugate comprising the protein, or a fusion protein;
    (ii)核酸组分,其包含权利要求13所述的分离的核酸分子以及所述导向序列。(ii) a nucleic acid component comprising the isolated nucleic acid molecule of claim 13 and the targeting sequence.
  22. 权利要求20所述的复合物,其包含:The composite of claim 20 comprising:
    (i)蛋白组分,其选自:权利要求4所述的蛋白、包含所述蛋白的缀合物或融合蛋白;以及(i) a protein component selected from the group consisting of the protein of claim 4, a conjugate comprising the protein, or a fusion protein;
    (ii)核酸组分,其包含权利要求14所述的分离的核酸分子以及所述导向序列。(ii) a nucleic acid component comprising the isolated nucleic acid molecule of claim 14 and the targeting sequence.
  23. 权利要求20所述的复合物,其包含:The composite of claim 20 comprising:
    (i)蛋白组分,其选自:权利要求5所述的蛋白、包含所述蛋白的缀合物或融合蛋白;以及(i) a protein component selected from the group consisting of the protein of claim 5, a conjugate comprising the protein, or a fusion protein;
    (ii)核酸组分,其包含权利要求15所述的分离的核酸分子以及所述导向序列。(ii) a nucleic acid component comprising the isolated nucleic acid molecule of claim 15 and the targeting sequence.
  24. 权利要求20所述的复合物,其包含:The composite of claim 20 comprising:
    (i)蛋白组分,其选自:权利要求6所述的蛋白、包含所述蛋白的缀合物或融合蛋白;以及(i) a protein component selected from the group consisting of the protein of claim 6, a conjugate comprising the protein, or a fusion protein;
    (ii)核酸组分,其包含权利要求16所述的分离的核酸分子以及所述导向序列。(ii) a nucleic acid component comprising the isolated nucleic acid molecule of claim 16 and the targeting sequence.
  25. 权利要求20所述的复合物,其包含:The composite of claim 20 comprising:
    (i)蛋白组分,其选自:权利要求7所述的蛋白、包含所述蛋白的缀合物或融合蛋白;以及(i) a protein component selected from the group consisting of the protein of claim 7, a conjugate comprising the protein, or a fusion protein;
    (ii)核酸组分,其包含权利要求17所述的分离的核酸分子以及所述导向序列。(ii) a nucleic acid component comprising the isolated nucleic acid molecule of claim 17 and the targeting sequence.
  26. 权利要求20所述的复合物,其包含:The composite of claim 20 comprising:
    (i)蛋白组分,其选自:权利要求8所述的蛋白、包含所述蛋白的缀合物或融合蛋白;以及(i) a protein component selected from the group consisting of the protein of claim 8, a conjugate comprising the protein, or a fusion protein;
    (ii)核酸组分,其包含权利要求18所述的分离的核酸分子以及所述导向序列。(ii) a nucleic acid component comprising the isolated nucleic acid molecule of claim 18 and the targeting sequence.
  27. 权利要求20所述的复合物,其包含:The composite of claim 20 comprising:
    (i)蛋白组分,其选自:权利要求9所述的蛋白、包含所述蛋白的缀合物或融合蛋白;以及(i) a protein component selected from the group consisting of the protein of claim 9, a conjugate comprising the protein, or a fusion protein;
    (ii)核酸组分,其包含权利要求19所述的分离的核酸分子以及所述导向序列。(ii) a nucleic acid component comprising the isolated nucleic acid molecule of claim 19 and the targeting sequence.
  28. 一种分离的核酸分子,其包含:An isolated nucleic acid molecule comprising:
    (i)编码权利要求1-9任一项所述的蛋白,或权利要求11所述的融合蛋白的核苷酸序列;(i) a nucleotide sequence encoding the protein of any one of claims 1-9, or the fusion protein of claim 11;
    (ii)编码权利要求12-19任一项所述的分离的核酸分子的核苷酸序列;和/或,(ii) a nucleotide sequence encoding the isolated nucleic acid molecule of any one of claims 12-19; and/or,
    (iii)包含(i)和(ii)的核苷酸序列;(iii) a nucleotide sequence comprising (i) and (ii);
    例如,(i)-(iii)任一项中所述的核苷酸序列经密码子优化用于在原核细胞或真核细胞中进行表达。For example, the nucleotide sequence set forth in any of (i)-(iii) is codon optimized for expression in prokaryotic or eukaryotic cells.
  29. 一种载体,其包含权利要求28所述的分离的核酸分子。A vector comprising the isolated nucleic acid molecule of claim 28.
  30. 一种宿主细胞,其包含权利要求28所述的分离的核酸分子或权利要求29所述的载体。A host cell comprising the isolated nucleic acid molecule of claim 28 or the vector of claim 29.
  31. 一种组合物,其包含:A composition comprising:
    (i)第一组分,其选自:权利要求1-9任一项所述的蛋白、权利要求10所述的缀合物、权利要求11所述的融合蛋白、编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;和(i) a first component selected from the group consisting of the protein of any one of claims 1-9, the conjugate of claim 10, the fusion protein of claim 11, encoding the protein or fusion a nucleotide sequence of a protein, and any combination thereof; and
    (ii)第二组分,其为包含导向RNA的核苷酸序列,或者编码所述包含导向RNA的核苷酸序列的核苷酸序列;(ii) a second component which is a nucleotide sequence comprising a targeting RNA or a nucleotide sequence encoding the nucleotide sequence comprising the targeting RNA;
    其中,所述导向RNA包含同向重复序列和导向序列,所述导向序列能够与靶序列杂交;Wherein the targeting RNA comprises a homologous repeat sequence and a targeting sequence, the targeting sequence being capable of hybridizing to the target sequence;
    所述导向RNA能够与(i)中所述的蛋白、缀合物或融合蛋白形成复合物;The targeting RNA is capable of forming a complex with the protein, conjugate or fusion protein described in (i);
    例如,所述同向重复序列是权利要求12-19任一项中所定义的分离的核酸分子;For example, the isotropic repeat is an isolated nucleic acid molecule as defined in any one of claims 12-19;
    例如,所述导向序列连接至所述同向重复序列的3’端或5’端;For example, the targeting sequence is ligated to the 3' or 5' end of the homologous repeat;
    例如,所述导向序列包含所述靶序列的互补序列;For example, the targeting sequence comprises the complement of the target sequence;
    例如,所述组合物不包含tracrRNA;For example, the composition does not comprise tracrRNA;
    例如,所述组合物是非天然存在的或经修饰的;For example, the composition is non-naturally occurring or modified;
    例如,所述组合物中的至少一个组分是非天然存在的或经修饰的;For example, at least one component of the composition is non-naturally occurring or modified;
    例如,所述第一组分是非天然存在的或经修饰的;和/或,所述第二组分是非天然存在的或经修饰的。For example, the first component is non-naturally occurring or modified; and/or the second component is non-naturally occurring or modified.
  32. 权利要求31所述的组合物,其中:The composition of claim 31 wherein:
    所述第一组分选自:权利要求3所述的蛋白、或包含所述蛋白的缀合物或融合蛋白、或编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;The first component is selected from the group consisting of: the protein of claim 3, or a conjugate or fusion protein comprising the protein, or a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
    所述同向重复序列是权利要求13中所定义的分离的核酸分子。The isotropic repeat sequence is an isolated nucleic acid molecule as defined in claim 13.
  33. 权利要求31所述的组合物,其中:The composition of claim 31 wherein:
    所述第一组分选自:权利要求4所述的蛋白、或包含所述蛋白的缀合物或融合蛋白、或编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;The first component is selected from the group consisting of: the protein of claim 4, or a conjugate or fusion protein comprising the protein, or a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
    所述同向重复序列是权利要求14中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 14.
  34. 权利要求31所述的组合物,其中:The composition of claim 31 wherein:
    所述第一组分选自:权利要求5所述的蛋白、或包含所述蛋白的缀合物或融合蛋白、或编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;The first component is selected from the group consisting of: the protein of claim 5, or a conjugate or fusion protein comprising the protein, or a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
    所述同向重复序列是权利要求15中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 15.
  35. 权利要求31所述的组合物,其中:The composition of claim 31 wherein:
    所述第一组分选自:权利要求6所述的蛋白、或包含所述蛋白的缀合物或融合蛋白、或编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;The first component is selected from the group consisting of: the protein of claim 6, or a conjugate or fusion protein comprising the protein, or a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
    所述同向重复序列是权利要求16中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 16.
  36. 权利要求31所述的组合物,其中:The composition of claim 31 wherein:
    所述第一组分选自:权利要求7所述的蛋白、或包含所述蛋白的缀合物或融合蛋白、或编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;The first component is selected from the group consisting of: the protein of claim 7, or a conjugate or fusion protein comprising the protein, or a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
    所述同向重复序列是权利要求17中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 17.
  37. 权利要求31所述的组合物,其中:The composition of claim 31 wherein:
    所述第一组分选自:权利要求8所述的蛋白、或包含所述蛋白的缀合物或融合蛋 白、或编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;The first component is selected from the group consisting of: the protein of claim 8, or a conjugate or fusion protein comprising the protein, or a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
    所述同向重复序列是权利要求18中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 18.
  38. 权利要求31所述的组合物,其中:The composition of claim 31 wherein:
    所述第一组分选自:权利要求9所述的蛋白、或包含所述蛋白的缀合物或融合蛋白、或编码所述蛋白或融合蛋白的核苷酸序列,以及其任意组合;The first component is selected from the group consisting of: the protein of claim 9, or a conjugate or fusion protein comprising the protein, or a nucleotide sequence encoding the protein or fusion protein, and any combination thereof;
    所述同向重复序列是权利要求19中所定义的分离的核酸分子。The isotropic repeat sequence is an isolated nucleic acid molecule as defined in claim 19.
  39. 一种组合物,其包含一种或多种载体,所述一种或多种载体包含:A composition comprising one or more carriers, the one or more carriers comprising:
    (i)第一核酸,其为编码权利要求1-9任一项所述的蛋白或权利要求11所述的融合蛋白的核苷酸序列;任选地所述第一核酸可操作地连接至第一调节元件;以及(i) a first nucleic acid, which is a nucleotide sequence encoding the protein of any one of claims 1-9 or the fusion protein of claim 11; optionally the first nucleic acid is operably linked to First adjustment element;
    (ii)第二核酸,其编码包含导向RNA的核苷酸序列;任选地所述第二核酸可操作地连接至第二调节元件;(ii) a second nucleic acid encoding a nucleotide sequence comprising a targeting RNA; optionally the second nucleic acid is operably linked to a second regulatory element;
    其中:among them:
    所述第一核酸与第二核酸存在于相同或不同的载体上;The first nucleic acid and the second nucleic acid are present on the same or different carrier;
    所述导向RNA包含同向重复序列和导向序列,所述导向序列能够与靶序列杂交;The targeting RNA comprises a homologous repeat sequence and a targeting sequence capable of hybridizing to the target sequence;
    所述导向RNA能够与(i)中所述的效应蛋白或融合蛋白形成复合物;The targeting RNA is capable of forming a complex with the effector protein or fusion protein described in (i);
    例如,所述同向重复序列是权利要求12-19任一项中所定义的分离的核酸分子;For example, the isotropic repeat is an isolated nucleic acid molecule as defined in any one of claims 12-19;
    例如,所述导向序列连接至所述同向重复序列的3’端或5’端;For example, the targeting sequence is ligated to the 3' or 5' end of the homologous repeat;
    例如,所述导向序列包含所述靶序列的互补序列;For example, the targeting sequence comprises the complement of the target sequence;
    例如,所述组合物不包含tracrRNA;For example, the composition does not comprise tracrRNA;
    例如,所述组合物是非天然存在的或经修饰的;For example, the composition is non-naturally occurring or modified;
    例如,所述组合物中的至少一个组分是非天然存在的或经修饰的;For example, at least one component of the composition is non-naturally occurring or modified;
    例如,所述第一调节元件是启动子,例如诱导型启动子;For example, the first regulatory element is a promoter, such as an inducible promoter;
    例如,所述第二调节元件是启动子,例如诱导型启动子。For example, the second regulatory element is a promoter, such as an inducible promoter.
  40. 权利要求39所述的组合物,其中:The composition of claim 39 wherein:
    所述第一核酸为编码权利要求3所述的蛋白或包含所述蛋白的融合蛋白的核苷酸序列;The first nucleic acid is a nucleotide sequence encoding the protein of claim 3 or a fusion protein comprising the protein;
    所述同向重复序列是权利要求13中所定义的分离的核酸分子。The isotropic repeat sequence is an isolated nucleic acid molecule as defined in claim 13.
  41. 权利要求39所述的组合物,其中:The composition of claim 39 wherein:
    所述第一核酸为编码权利要求4所述的蛋白或包含所述蛋白的融合蛋白的核苷酸序列;The first nucleic acid is a nucleotide sequence encoding the protein of claim 4 or a fusion protein comprising the protein;
    所述同向重复序列是权利要求14中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 14.
  42. 权利要求39所述的组合物,其中:The composition of claim 39 wherein:
    所述第一核酸为编码权利要求5所述的蛋白或包含所述蛋白的融合蛋白的核苷酸序列;The first nucleic acid is a nucleotide sequence encoding the protein of claim 5 or a fusion protein comprising the protein;
    所述同向重复序列是权利要求15中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 15.
  43. 权利要求39所述的组合物,其中:The composition of claim 39 wherein:
    所述第一核酸为编码权利要求6所述的蛋白或包含所述蛋白的融合蛋白的核苷酸序列;The first nucleic acid is a nucleotide sequence encoding the protein of claim 6 or a fusion protein comprising the protein;
    所述同向重复序列是权利要求16中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 16.
  44. 权利要求39所述的组合物,其中:The composition of claim 39 wherein:
    所述第一核酸为编码权利要求7所述的蛋白或包含所述蛋白的融合蛋白的核苷酸序列;The first nucleic acid is a nucleotide sequence encoding the protein of claim 7 or a fusion protein comprising the protein;
    所述同向重复序列是权利要求17中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 17.
  45. 权利要求39所述的组合物,其中:The composition of claim 39 wherein:
    所述第一核酸为编码权利要求8所述的蛋白或包含所述蛋白的融合蛋白的核苷酸序列;The first nucleic acid is a nucleotide sequence encoding the protein of claim 8 or a fusion protein comprising the protein;
    所述同向重复序列是权利要求18中所定义的分离的核酸分子。The isotropic repeat is an isolated nucleic acid molecule as defined in claim 18.
  46. 权利要求39所述的组合物,其中:The composition of claim 39 wherein:
    所述第一核酸为编码权利要求9所述的蛋白或包含所述蛋白的融合蛋白的核苷酸序列;The first nucleic acid is a nucleotide sequence encoding the protein of claim 9 or a fusion protein comprising the protein;
    所述同向重复序列是权利要求19中所定义的分离的核酸分子。The isotropic repeat sequence is an isolated nucleic acid molecule as defined in claim 19.
  47. 权利要求31-46任一项所述的组合物,其中,所述靶序列是来自原核细胞或真核细胞的RNA序列;或者,所述靶序列是非天然存在的RNA序列。The composition of any one of claims 31 to 46, wherein the target sequence is an RNA sequence derived from a prokaryotic cell or a eukaryotic cell; or the target sequence is a non-naturally occurring RNA sequence.
  48. 权利要求31-47任一项所述的组合物,其中,所述靶序列存在于细胞内;The composition of any one of claims 31 to 47, wherein the target sequence is present in a cell;
    例如,所述靶序列存在于细胞核内或细胞质(例如,细胞器)内;For example, the target sequence is present in the nucleus or in the cytoplasm (eg, organelle);
    例如,所述细胞是原核细胞。For example, the cell is a prokaryotic cell.
  49. 权利要求31-48任一项所述的组合物,其中,所述蛋白连接有一个或多个NLS序列,或者,所述缀合物或融合蛋白包含一个或多个NLS序列;The composition of any one of claims 31-48, wherein the protein is linked to one or more NLS sequences, or the conjugate or fusion protein comprises one or more NLS sequences;
    例如,所述NLS序列连接至所述蛋白的N端或C端;For example, the NLS sequence is linked to the N-terminus or C-terminus of the protein;
    例如,所述NLS序列融合至所述蛋白的N端或C端。For example, the NLS sequence is fused to the N-terminus or C-terminus of the protein.
  50. 一种试剂盒,其包括一种或多种选自下列的组分:权利要求1-9任一项所述的蛋白、权利要求10所述的缀合物、权利要求11所述的融合蛋白、权利要求12-19任一项所述的分离的核酸分子、权利要求20-27任一项所述的复合物、权利要求28所述的分离的核酸分子、权利要求29所述的载体、权利要求31-49任一项所述的组合物;A kit comprising one or more components selected from the group consisting of the protein of any one of claims 1-9, the conjugate of claim 10, and the fusion protein of claim 11. The isolated nucleic acid molecule according to any one of claims 12 to 19, the complex according to any one of claims 20 to 27, the isolated nucleic acid molecule according to claim 28, the vector according to claim 29, The composition of any of claims 31-49;
    例如,所述试剂盒包含权利要求31-38任一项所述的组合物,以及使用所述组合物的说明书;For example, the kit comprises the composition of any of claims 31-38, and instructions for using the composition;
    例如,所述试剂盒包含权利要求39-46任一项所述的组合物,以及使用所述组合物的说明书。For example, the kit comprises the composition of any of claims 39-46, and instructions for using the composition.
  51. 一种递送组合物,其包含递送载体,以及选自下列的一种或多种:权利要求1-9任一项所述的蛋白、权利要求10所述的缀合物、权利要求11所述的融合蛋白、权利要求12-19任一项所述的分离的核酸分子、权利要求20-27任一项所述的复合物、权利要求28所述的分离的核酸分子、权利要求29所述的载体、权利要求31-49任一项所述的组合物;A delivery composition comprising a delivery vehicle, and one or more selected from the group consisting of the protein of any of claims 1-9, the conjugate of claim 10, and the method of claim 11. A fusion protein, the isolated nucleic acid molecule of any one of claims 12 to 19, the complex of any one of claims 20 to 27, the isolated nucleic acid molecule of claim 28, or the method of claim 29. A carrier according to any one of claims 31 to 49;
    例如,所述递送载体是粒子;For example, the delivery vehicle is a particle;
    例如,所述递送载体选自脂质颗粒、糖颗粒、金属颗粒、蛋白颗粒、脂质体、外泌体、微泡、基因枪或病毒载体(例如,复制缺陷型逆转录病毒、慢病毒、腺病毒或腺相关病毒)。For example, the delivery vehicle is selected from the group consisting of a lipid particle, a sugar particle, a metal particle, a protein particle, a liposome, an exosome, a microvesicle, a gene gun, or a viral vector (eg, a replication-defective retrovirus, a lentivirus, Adenovirus or adeno-associated virus).
  52. 一种修饰靶序列的方法,其包括:将权利要求20-27任一项所述的复合物、权利要求31-49任一项所述的组合物或权利要求51所述的递送组合物与所述靶序列接触,或者递送至包含所述靶序列的细胞中;其中,所述靶序列与感兴趣的基因相关或者存在于所述感兴趣的基因中,并且所述靶序列为RNA;A method of modifying a target sequence, comprising: combining the complex of any one of claims 20-27, the composition of any one of claims 31-49, or the delivery composition of claim 51 The target sequence is contacted or delivered to a cell comprising the target sequence; wherein the target sequence is associated with or is present in the gene of interest, and the target sequence is RNA;
    例如,所述靶序列存在于细胞内;For example, the target sequence is present in a cell;
    例如,所述细胞是原核细胞;For example, the cell is a prokaryotic cell;
    例如,所述靶序列存在于体外的核酸分子(例如,质粒)中;For example, the target sequence is present in a nucleic acid molecule (eg, a plasmid) in vitro;
    例如,所述修饰是指所述靶序列的断裂,如双链断裂或单链断裂;For example, the modification refers to a cleavage of the target sequence, such as a double-strand break or a single-strand break;
    例如,所述修饰还包括将外源核酸插入所述断裂中;For example, the modification further comprises inserting an exogenous nucleic acid into the fragment;
    例如,所述靶序列是ssRNA。For example, the target sequence is an ssRNA.
  53. 权利要求52所述的方法,其包括将权利要求21所述的复合物、权利要求32所述的组合物或权利要求40所述的组合物与所述靶序列接触,或者递送至包含所述靶序列的细胞中。The method of claim 52, comprising contacting the complex of claim 21, the composition of claim 32, or the composition of claim 40 with the target sequence, or delivering to the In the cells of the target sequence.
  54. 权利要求52所述的方法,其包括将权利要求22所述的复合物、权利要求33所述的组合物或权利要求41所述的组合物与所述靶序列接触,或者递送至包含所述靶序列的细胞中。The method of claim 52, comprising contacting the complex of claim 22, the composition of claim 33, or the composition of claim 41 with the target sequence, or delivering to the In the cells of the target sequence.
  55. 权利要求52所述的方法,其包括将权利要求23所述的复合物、权利要求34所述的组合物或权利要求42所述的组合物与所述靶序列接触,或者递送至包含所述靶序列的细胞中。The method of claim 52, comprising contacting the complex of claim 23, the composition of claim 34, or the composition of claim 42 with the target sequence, or delivering to the In the cells of the target sequence.
  56. 权利要求52所述的方法,其包括将权利要求24所述的复合物、权利要求35所述的组合物或权利要求43所述的组合物与所述靶序列接触,或者递送至包含所述靶序列的细胞中。The method of claim 52, comprising contacting the complex of claim 24, the composition of claim 35, or the composition of claim 43 with the target sequence, or delivering to the inclusion of the In the cells of the target sequence.
  57. 权利要求52所述的方法,其包括将权利要求25所述的复合物、权利要求36所述的组合物或权利要求44所述的组合物与所述靶序列接触,或者递送至包含所述靶序列 的细胞中。The method of claim 52, comprising contacting the complex of claim 25, the composition of claim 36, or the composition of claim 44 with the target sequence, or delivering to the In the cells of the target sequence.
  58. 权利要求52所述的方法,其包括将权利要求26所述的复合物、权利要求37所述的组合物或权利要求45所述的组合物与所述靶序列接触,或者递送至包含所述靶序列的细胞中。The method of claim 52, comprising contacting the complex of claim 26, the composition of claim 37, or the composition of claim 45 with the target sequence, or delivering to the In the cells of the target sequence.
  59. 权利要求52所述的方法,其包括将权利要求27所述的复合物、权利要求38所述的组合物或权利要求46所述的组合物与所述靶基序列接触,或者递送至包含所述靶序列的细胞中。The method of claim 52, comprising contacting the complex of claim 27, the composition of claim 38, or the composition of claim 46 with the target sequence, or delivering to the inclusion In the cells of the target sequence.
  60. 权利要求52-59任一项所述的方法,其中所述的蛋白、缀合物、融合蛋白、分离的核酸分子、复合物、载体或组合物包含于递送载体中;The method of any one of claims 52-59, wherein the protein, conjugate, fusion protein, isolated nucleic acid molecule, complex, vector or composition is included in a delivery vehicle;
    例如,所述递送载体选自脂质颗粒、糖颗粒、金属颗粒、蛋白颗粒、脂质体、外泌体、病毒载体(如复制缺陷型逆转录病毒、慢病毒、腺病毒或腺相关病毒)。For example, the delivery vehicle is selected from the group consisting of lipid particles, sugar particles, metal particles, protein particles, liposomes, exosomes, viral vectors (eg, replication defective retroviruses, lentiviruses, adenoviruses, or adeno-associated viruses) .
  61. 权利要求52-60任一项所述的方法,其用于RNA干扰或调节基因表达。The method of any of claims 52-60 for use in RNA interference or regulation of gene expression.
  62. 权利要求1-9任一项所述的蛋白、权利要求10所述的缀合物、权利要求11所述的融合蛋白、权利要求12-19任一项所述的分离的核酸分子、权利要求20-27任一项所述的复合物、权利要求28所述的分离的核酸分子、权利要求29所述的载体、权利要求31-49任一项所述的组合物或权利要求50所述的试剂盒,在选自下列的一项或多项中的用途,或者在制备制剂中的用途,所述制剂用于选自下列的一项或多项:The protein of any one of claims 1-9, the conjugate of claim 10, the fusion protein of claim 11, the isolated nucleic acid molecule of any of claims 12-19, the claims The complex of any one of claims 20 to 27, the isolated nucleic acid molecule of claim 28, the vector of claim 29, the composition of any one of claims 31-49, or the method of claim 50 A kit, for use in one or more of the following, or in the preparation of a formulation for one or more selected from the group consisting of:
    (1)修饰靶序列;(1) modifying the target sequence;
    (2)RNA干扰;或(2) RNA interference; or
    (3)调控基因表达。(3) Regulate gene expression.
  63. 一种由权利要求52-61任一项所述的方法获得的细胞或其子代,其中所述细胞包含在其野生型中不存在的修饰;A cell or a progeny thereof obtained by the method of any one of claims 52-61, wherein the cell comprises a modification that is not found in its wild type;
    例如,所述修饰导致至少一种RNA产物的转录或翻译发生改变;For example, the modification results in a change in transcription or translation of at least one RNA product;
    例如,所述修饰导致至少一种RNA产物的表达增加;For example, the modification results in increased expression of at least one RNA product;
    例如,所述修饰导致至少一种RNA产物的表达降低;For example, the modification results in decreased expression of at least one RNA product;
    例如,所述细胞为原核细胞。For example, the cell is a prokaryotic cell.
  64. 权利要求63所述的细胞或其子代的产物。The product of the cell of claim 63 or a progeny thereof.
  65. 一种体外的、离体的或体内的细胞或细胞系或它们的子代,所述细胞或细胞系或它们的子代包含:权利要求1-9任一项所述的蛋白、权利要求10所述的缀合物、权利要求11所述的融合蛋白、权利要求12-19任一项所述的分离的核酸分子、权利要求20-27任一项所述的复合物、权利要求28所述的分离的核酸分子、权利要求29所述的载体、权利要求31-49任一项所述的组合物;An in vitro, ex vivo or in vivo cell or cell line or a progeny thereof, the cell or cell line or a progeny thereof comprising: the protein of any one of claims 1-9, claim 10 The conjugate, the fusion protein of claim 11, the isolated nucleic acid molecule of any one of claims 12 to 19, the complex of any one of claims 20-27, and the method of claim 28. An isolated nucleic acid molecule, the vector of claim 29, the composition of any one of claims 31-49;
    例如,所述细胞或细胞系或它们的子代包含:权利要求21所述的复合物、权利要求32所述的组合物或权利要求40所述的组合物;For example, the cell or cell line or a progeny thereof comprises: the complex of claim 21, the composition of claim 32 or the composition of claim 40;
    例如,所述细胞或细胞系或它们的子代包含:权利要求22所述的复合物、权利要求33所述的组合物或权利要求41所述的组合物;For example, the cell or cell line or a progeny thereof comprises: the complex of claim 22, the composition of claim 33 or the composition of claim 41;
    例如,所述细胞或细胞系或它们的子代包含:权利要求23所述的复合物、权利要求34所述的组合物或权利要求42所述的组合物;For example, the cell or cell line or a progeny thereof comprises: the complex of claim 23, the composition of claim 34 or the composition of claim 42;
    例如,所述细胞或细胞系或它们的子代包含:权利要求24所述的复合物、权利要求35所述的组合物或权利要求43所述的组合物;For example, the cell or cell line or a progeny thereof comprises: the complex of claim 24, the composition of claim 35 or the composition of claim 43;
    例如,所述细胞或细胞系或它们的子代包含:权利要求25所述的复合物、权利要求36所述的组合物或权利要求44所述的组合物;For example, the cell or cell line or a progeny thereof comprises: the complex of claim 25, the composition of claim 36 or the composition of claim 44;
    例如,所述细胞或细胞系或它们的子代包含:权利要求26所述的复合物、权利要求37所述的组合物或权利要求45所述的组合物;For example, the cell or cell line or a progeny thereof comprises: the complex of claim 26, the composition of claim 37 or the composition of claim 45;
    例如,所述细胞或细胞系或它们的子代包含:权利要求27所述的复合物、权利要求38所述的组合物或权利要求46所述的组合物;For example, the cell or cell line or a progeny thereof comprises: the complex of claim 27, the composition of claim 38 or the composition of claim 46;
    例如,所述细胞是真核细胞;For example, the cell is a eukaryotic cell;
    例如,所述细胞是动物细胞(例如,哺乳动物细胞,例如人类细胞)或植物细胞;For example, the cell is an animal cell (eg, a mammalian cell, such as a human cell) or a plant cell;
    例如,所述细胞是干细胞或干细胞系。For example, the cell is a stem cell or stem cell line.
  66. 一种检测靶序列的方法,其包括将权利要求20-27任一项所述的复合物、权利要求31-49任一项所述的组合物或权利要求51所述的递送组合物与所述靶序列接触,或者 递送至包含所述靶序列的细胞中;其中,所述靶序列是RNA;A method of detecting a target sequence, comprising the composition of any one of claims 20-27, the composition of any one of claims 31-49, or the delivery composition of claim 51 The target sequence is contacted or delivered to a cell comprising the target sequence; wherein the target sequence is RNA;
    例如,所述复合物、组合物或递送组合物中所包含的导向序列能够与所述靶序列杂交;For example, a targeting sequence contained in the complex, composition or delivery composition is capable of hybridizing to the target sequence;
    例如,所述靶序列存在于体外的核酸分子中;For example, the target sequence is present in a nucleic acid molecule in vitro;
    例如,所述靶序列存在于细胞内;For example, the target sequence is present in a cell;
    例如,所述细胞是活细胞;For example, the cell is a living cell;
    例如,所述复合物、组合物或递送组合物所包含的蛋白组分带有可检测的标记;For example, the protein component comprised by the complex, composition or delivery composition carries a detectable label;
    例如,所述复合物、组合物或递送组合物所包含的蛋白组分融合有荧光蛋白(例如GFP);For example, the protein component comprised by the complex, composition or delivery composition is fused with a fluorescent protein (eg GFP);
    例如,所述方法是northern blot印记杂交;For example, the method is northern blot imprinting;
    例如,所述方法是荧光原位杂交。For example, the method is fluorescence in situ hybridization.
  67. 权利要求1-9任一项所述的蛋白、权利要求10所述的缀合物、权利要求11所述的融合蛋白、权利要求12-19任一项所述的分离的核酸分子、权利要求20-27任一项所述的复合物、权利要求28所述的分离的核酸分子、权利要求29所述的载体、权利要求31-49任一项所述的组合物、权利要求50所述的试剂盒或权利要求51所述的递送组合物,用于检测靶序列的用途,或者在制备制剂中的用途,所述制剂用于检测靶序列。The protein of any one of claims 1-9, the conjugate of claim 10, the fusion protein of claim 11, the isolated nucleic acid molecule of any of claims 12-19, the claims The complex of any one of claims 20 to 27, the isolated nucleic acid molecule of claim 28, the vector of claim 29, the composition of any one of claims 31-49, and the method of claim 50. A kit or a delivery composition according to claim 51 for use in detecting a target sequence or in the preparation of a preparation for detecting a target sequence.
  68. 一种调节靶细胞的状态的方法,其包括将权利要求20-27任一项所述的复合物、权利要求31-49任一项所述的组合物或权利要求51所述的递送组合物引入所述靶细胞中;A method of modulating the state of a target cell, comprising the composition of any one of claims 20-27, the composition of any one of claims 31-49, or the delivery composition of claim 51 Introduced into the target cell;
    例如,所述靶细胞中存在与所述复合物、组合物或递送组合物中所包含的导向序列杂交的靶序列;For example, a target sequence that hybridizes to a targeting sequence contained in the complex, composition or delivery composition is present in the target cell;
    例如,所述调节靶细胞的状态包括:For example, the state of regulating the target cell comprises:
    (1)在体外或体内诱导细胞休眠;(1) inducing cell dormancy in vitro or in vivo;
    (2)在体外或体内诱导细胞周期停滞;(2) Inducing cell cycle arrest in vitro or in vivo;
    (3)在体外或体内抑制细胞生长和/或细胞增殖;(3) inhibiting cell growth and/or cell proliferation in vitro or in vivo;
    (4)在体外或体内诱导细胞无反应性;(4) inducing cell anergy in vitro or in vivo;
    (5)在体外或体内诱导细胞凋亡;(5) inducing apoptosis in vitro or in vivo;
    (6)在体外或体内诱导细胞坏死;(6) inducing cell necrosis in vitro or in vivo;
    (7)在体外或体内诱导细胞死亡;或(7) inducing cell death in vitro or in vivo; or
    (8)在体外或体内诱导程序性细胞死亡。(8) Inducing programmed cell death in vitro or in vivo.
  69. 权利要求1-9任一项所述的蛋白、权利要求10所述的缀合物、权利要求11所述的融合蛋白、权利要求12-19任一项所述的分离的核酸分子、权利要求20-27任一项所述的复合物、权利要求28所述的分离的核酸分子、权利要求29所述的载体、权利要求31-49任一项所述的组合物、权利要求50所述的试剂盒或权利要求51所述的递送组合物,在选自下列的一项或多项中的用途,或者在制备制剂中的用途,所述制剂用于选自下列的一项或多项:The protein of any one of claims 1-9, the conjugate of claim 10, the fusion protein of claim 11, the isolated nucleic acid molecule of any of claims 12-19, the claims The complex of any one of claims 20 to 27, the isolated nucleic acid molecule of claim 28, the vector of claim 29, the composition of any one of claims 31-49, and the method of claim 50. Kit or the delivery composition of claim 51, for use in one or more of the following, or in the preparation of a formulation for one or more selected from the group consisting of :
    (1)在体外或体内诱导细胞休眠;(1) inducing cell dormancy in vitro or in vivo;
    (2)在体外或体内诱导细胞周期停滞;(2) Inducing cell cycle arrest in vitro or in vivo;
    (3)在体外或体内抑制细胞生长和/或细胞增殖;(3) inhibiting cell growth and/or cell proliferation in vitro or in vivo;
    (4)在体外或体内诱导细胞无反应性;(4) inducing cell anergy in vitro or in vivo;
    (5)在体外或体内诱导细胞凋亡;(5) inducing apoptosis in vitro or in vivo;
    (6)在体外或体内诱导细胞坏死;(6) inducing cell necrosis in vitro or in vivo;
    (7)在体外或体内诱导细胞死亡;或(7) inducing cell death in vitro or in vivo; or
    (8)在体外或体内诱导程序性细胞死亡。(8) Inducing programmed cell death in vitro or in vivo.
PCT/CN2019/084340 2018-04-25 2019-04-25 Rna-edited crispr/cas effector protein and system WO2019206233A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201980028197.0A CN112020560B (en) 2018-04-25 2019-04-25 RNA-edited CRISPR/Cas effect protein and system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810377468 2018-04-25
CN201810377468.0 2018-04-25

Publications (1)

Publication Number Publication Date
WO2019206233A1 true WO2019206233A1 (en) 2019-10-31

Family

ID=68294799

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/084340 WO2019206233A1 (en) 2018-04-25 2019-04-25 Rna-edited crispr/cas effector protein and system

Country Status (2)

Country Link
CN (1) CN112020560B (en)
WO (1) WO2019206233A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020098772A1 (en) * 2018-11-15 2020-05-22 中国农业大学 Crispr-cas12j enzyme and system
WO2022098681A3 (en) * 2020-11-03 2022-06-16 Caspr Biotech Corporation Novel class 2 crispr-cas rna-guided endonucleases
WO2023030340A1 (en) * 2021-08-30 2023-03-09 Huigene Therapeutics Co., Ltd. Novel design of guide rna and uses thereof
EP4110933A4 (en) * 2020-02-28 2024-02-21 Huidagene Therapeutics Co Ltd Type vi-e and type vi-f crispr-cas system and uses thereof

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101706085B1 (en) * 2012-10-23 2017-02-14 주식회사 툴젠 Composition for cleaving a target DNA comprising a guide RNA specific for the target DNA and Cas protein-encoding nucleic acid or Cas protein, and use thereof
JP2016507244A (en) * 2013-02-27 2016-03-10 ヘルムホルツ・ツェントルム・ミュンヒェン・ドイチェス・フォルシュンクスツェントルム・フューア・ゲズントハイト・ウント・ウムベルト(ゲーエムベーハー)Helmholtz Zentrum MuenchenDeutsches Forschungszentrum fuer Gesundheit und Umwelt (GmbH) Gene editing in oocytes by Cas9 nuclease
US10240145B2 (en) * 2015-11-25 2019-03-26 The Board Of Trustees Of The Leland Stanford Junior University CRISPR/Cas-mediated genome editing to treat EGFR-mutant lung cancer
CA3039409A1 (en) * 2016-10-07 2018-04-12 Integrated Dna Technologies, Inc. S. pyogenes cas9 mutant genes and polypeptides encoded by same

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
WINSTON X. YAN ET AL.: "Casl3d is a Compact RNA-targeting Type VI CRISPR Effector Positively Modulated by a WYL Domain-Containing Accessory Protein", MOL CELL, vol. 70, no. 2, 19 April 2018 (2018-04-19), pages 327 - 339, XP055529724, DOI: 10.1016/j.molcel.2018.02.028 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020098772A1 (en) * 2018-11-15 2020-05-22 中国农业大学 Crispr-cas12j enzyme and system
EP4110933A4 (en) * 2020-02-28 2024-02-21 Huidagene Therapeutics Co Ltd Type vi-e and type vi-f crispr-cas system and uses thereof
WO2022098681A3 (en) * 2020-11-03 2022-06-16 Caspr Biotech Corporation Novel class 2 crispr-cas rna-guided endonucleases
WO2023030340A1 (en) * 2021-08-30 2023-03-09 Huigene Therapeutics Co., Ltd. Novel design of guide rna and uses thereof

Also Published As

Publication number Publication date
CN112020560B (en) 2024-02-23
CN112020560A (en) 2020-12-01

Similar Documents

Publication Publication Date Title
JP7216877B2 (en) Novel CRISPR/Casl2f enzymes and systems
JP7460178B2 (en) CRISPR-Cas12j enzyme and system
WO2019206233A1 (en) Rna-edited crispr/cas effector protein and system
JP5735927B2 (en) Re-engineering the primary structure of mRNA to enhance protein production
WO2019214604A1 (en) Crispr/cas effector protein and system
CA2989834A1 (en) Crispr enzymes and systems
CA3012607A1 (en) Crispr enzymes and systems
WO2019201331A1 (en) Crispr/cas effector protein and system
CN113881652B (en) Novel Cas enzymes and systems and applications
CN113015798B (en) CRISPR-Cas12a enzymes and systems
CN114438055B (en) Novel CRISPR enzymes and systems and uses
WO2020087631A1 (en) System and method for genome editing based on c2c1 nucleases
CN113930410A (en) Novel CRISPR-Cas12L enzymes and systems
CN113930411A (en) Novel CRISPR-Cas12M enzymes and systems
WO2023173682A1 (en) Optimized cas protein and use thereof
WO2023143150A1 (en) Novel cas enzyme and system and use
JP2015180203A (en) REENGINEERING mRNA PRIMARY STRUCTURE FOR ENHANCED PROTEIN PRODUCTION
CN113930412A (en) Novel CRISPR-Cas12N enzymes and systems
CN113930413A (en) Novel CRISPR-Cas12j.23 enzymes and systems
CN117050971A (en) Cas muteins and uses thereof
CN116286739A (en) Mutant Cas proteins and uses thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19792442

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19792442

Country of ref document: EP

Kind code of ref document: A1