EP4540376A2 - Manipulierte acr-proteine zur modulation der crispr-aktivität - Google Patents

Manipulierte acr-proteine zur modulation der crispr-aktivität

Info

Publication number
EP4540376A2
EP4540376A2 EP23824718.3A EP23824718A EP4540376A2 EP 4540376 A2 EP4540376 A2 EP 4540376A2 EP 23824718 A EP23824718 A EP 23824718A EP 4540376 A2 EP4540376 A2 EP 4540376A2
Authority
EP
European Patent Office
Prior art keywords
acr
amino acid
target
seq
crispr nuclease
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP23824718.3A
Other languages
English (en)
French (fr)
Inventor
David Rabuka
Allison SHARRAR
Luisa Mayumi Arake DE TACCA
Michael SHELLE
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Acrigen Biosciences Inc
Original Assignee
Acrigen Biosciences Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Acrigen Biosciences Inc filed Critical Acrigen Biosciences Inc
Publication of EP4540376A2 publication Critical patent/EP4540376A2/de
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14141Use of virus, viral particle or viral elements as a vector
    • C12N2750/14143Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases [RNase]; Deoxyribonucleases [DNase]

Definitions

  • CRISPR-Cas systems are found in diverse bacterial and archaeal species, serving as an immune defense mechanism against phage infection.
  • the simplicity, programmability, and versatility of CRISPR-Cas systems have facilitated the genetic modification of many organisms and offer immense therapeutic potential for the treatment of human disease.
  • CRISPR-Cas mediated genome editing can be associated with off-targeting, e.g., introduction of unintended mutations, insertions, or deletions, and DNA restructuring at unintended “off-target” sites.
  • off-target editing caused by CRISPR-Cas systems has been reported in various cell and animal models, including in human cells, and such events might accumulate in vivo with prolonged nuclease activity.
  • human genetic variations and uncertain Cas protein expression lifetimes in vivo add to the unpredictability of off-target events, which should be addressed for safe clinical translation of CRISPR-Cas systems.
  • Unintended editing events can potentially lead to genomic instability, disrupt the functionality of genes, and cause serious adverse events including cell death and cancer.
  • compositions and methods that modify Cas activity e.g., decrease off-target events (e.g., relative to on-target events), thus leading to enhanced nucleic acid targeting specificity.
  • the present disclosure provides such compositions and methods.
  • Anti-CRISPR (Acr) proteins were discovered in phages for their ability to bind to and inhibit certain Cas proteins, thwarting the CRISPR system’s attempt to cleave invading phage DNA. Thus, Acr proteins are apparently used by phage to evade CRISPR-Cas systems.
  • the inventors have generated engineered Acr proteins (ErAcrs) that are not naturally occurring (see, e.g., SEQ ID NOs.: 126-132 and 264-267) and can be used to achieve a balance in which Cas proteins retain sufficient activity to perform a desired on-target nucleic acid functions (e.g., loading, complex assembling, binding and cleavage), but are inhibited to a degree that decreases off-target activity.
  • ErAcrs engineered Acr proteins
  • compositions and methods disclosed herein include an anti-CRISPR (Acr) polypeptide (an ErAcr), or a nucleic acid encoding the Acr polypeptide, where the Acr polypeptide includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the sequence set forth in any one of SEQ ID Nos.: 126-132.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • compositions and methods disclosed herein include an anti-CRISPR (Acr) polypeptide (an ErAcr), or a nucleic acid encoding the Acr polypeptide, where the Acr polypeptide includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the sequence set forth in any one of SEQ ID Nos.: 264- 267.
  • a subject Acr polypeptide has 90% or more sequence identity with the sequence set forth in any one of SEQ ID Nos.: 126-132.
  • a subject Acr polypeptide comprises the sequence set forth in any one of SEQ ID Nos.: 126-132. In some cases, a subject Acr polypeptide has 90% or more sequence identity with the sequence set forth in any one of SEQ ID Nos.: 264-267. In some cases, a subject Acr polypeptide comprises the sequence set forth in any one of SEQ ID Nos.: 264-267. [0007] In general, an Acr polypeptide is capable of decreasing activity of a CRISPR nuclease.
  • a subject Acr polypeptide decreases OFF target CRISPR nuclease activity by 10% or more (e.g., 20% or more, 30% or more 50% or more, etc.) as compared with OFF target (off-target) CRISPR nuclease activity in the absence of the Acr polypeptide.
  • a subject Acr polypeptide decreases ON target (on-target) CRISPR nuclease activity by no more than 40% (e.g., no more than 30%, no more than 20%, etc.) as compared with ON target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • a subject Acr polypeptide increases the ratio of ON target to OFF target CRISPR nuclease activity as compared with the ratio of ON target to OFF target CRISPR nuclease activity in the absence of the Acr polypeptide (e.g., by at least 1.25x, 1.5x, 2x, 3x, etc.).
  • the CRISPR nuclease is a Cas12a protein. In some embodiments, the CRISPR nuclease is a NUX protein.
  • a subject Acr polypeptide decreases (inhibits) activity of a Cas12a nuclease (see, e.g., SEQ ID NOs.: 175 and 245-262).
  • a subject Acr polypeptide decreases activity of a CRISPR nuclease having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the Cas12a nuclease amino acid sequence set forth in any one of SEQ ID NOs.: 175 and 245-262.
  • a subject Acr polypeptide decreases activity of a CRISPR nuclease having 90% or more sequence identity (e.g., 95% or more, or 100% sequence identity) with the Cas12a nuclease amino acid sequence set forth in any one of SEQ ID NOs.: 175 and 245-262. In some cases, a subject Acr polypeptide decreases activity of a CRISPR nuclease having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the Cas12a nuclease amino acid sequence set forth in SEQ ID NO: 175.
  • a subject Acr polypeptide decreases activity of a CRISPR nuclease having 90% or more sequence identity (e.g., 95% or more, or 100% sequence identity) with the Cas12a nuclease amino acid sequence set forth in SEQ ID NO: 175.
  • a subject Acr polypeptide decreases (inhibits) activity of a NUX protein (see, e.g., SEQ ID NOs.: 1-86 and 176-244).
  • a subject Acr polypeptide decreases activity of a CRISPR nuclease having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the NUX protein amino acid sequence set forth in any one of SEQ ID NOs.: 1-86 and 176-244. In some cases, a subject Acr polypeptide decreases activity of a CRISPR nuclease having 90% or more sequence identity (e.g., 95% or more, or 100% sequence identity) with the NUX protein amino acid sequence set forth in any one of SEQ ID NOs.: 1-86 and 176-244.
  • a subject Acr polypeptide decreases activity of a CRISPR nuclease having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the NUX protein amino acid sequence set forth in any one of SEQ ID NOs.: 176-178. In some cases, a subject Acr polypeptide decreases activity of a CRISPR nuclease having 90% or more sequence identity (e.g., 95% or more, or 100% sequence identity) with the NUX protein amino acid sequence set forth in any one of SEQ ID NOs.: 176-178.
  • a subject Acr polypeptide decreases activity of a CRISPR nuclease having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO: 263. In some cases, a subject Acr polypeptide decreases activity of a CRISPR nuclease having 90% or more sequence identity (e.g., 95% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO: 263.
  • a subject Acr polypeptide can be fused to a fusion partner. In some such cases, a subject Acr polypeptide is fused to a nuclear localization signal (NLS).
  • NLS nuclear localization signal
  • the Acr polypeptide can decrease (inhibit) activity of the CRISPR nuclease.
  • the Acr polypeptide of such systems can be any of those discussed above (e.g., having 75% or more sequence identity with SEQ ID NOs.: 126-132, e.g., having 75% or more sequence identity with SEQ ID NOs.: 264-267), and the CRISPR nuclease can also be any of those discussed above (e.g., a Cas12a nuclease such as SEQ ID NOs.: 175 and 245-262, a NUX protein such as SEQ ID NOs: 1-86 and 176-244, and the like – see, e.g., the CRISPR nuclease of SEQ ID NO: 263).
  • a subject system will also include a guide RNA for targeting the CRISPR nuclease to a target sequence of a target nucleic acid.
  • the Acr polypeptide of a subject system can include an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the Acr amino acid sequence set forth in any one of SEQ ID NOs.: 165-169.
  • the Acr polypeptide of a subject system can include an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the Acr amino acid sequence set forth in any one of SEQ ID NOs.: 165-169.
  • sequence identity e.g., 85% or more, 90% or more, 95% or more, or 100% sequence identity
  • the CRISPR nuclease and the Acr polypeptide of the system do not naturally occur together.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the Acr amino acid sequence set forth in any one of SEQ ID NOs.: 165-169
  • the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the NUX protein amino acid sequence set forth in any one of SEQ ID NOs.: 1-86 and 176-244 or with the Cas12a amino acid sequence set forth in any one of SEQ ID NOs: 175 and 245-262.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the Acr amino acid sequence set forth in any one of SEQ ID NOs.: 165-169, and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, or 100% sequence identity) with the amino acid sequence set forth in 263.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, or 100% sequence identity
  • the Acr polypeptide and the CRISPR nuclease can each, independently, be provided as a nucleic acid encoding (DNA, RNA) the protein or can be provided in protein form. When both are provided as nucleic acids, the Acr polypeptide and the CRISPR nuclease can be encoded on separate nucleic acids (e.g., provided as two separate vectors), or can be encoded on the same nucleic acid (e.g., provided on the same vector).
  • a translation control element e.g., an IRES sequence, a 2A peptide encoding sequence, a non-canonical start codon, or any combination thereof
  • a translation control element can be operably linked to the Acr polypeptide coding sequence and/or to the CRISPR nuclease coding sequence.
  • a first promoter is operably linked to the Acr polypeptide coding sequence and a second promoter is operably linked to the CRISPR nuclease coding sequence.
  • the first promoter is stronger than the second promoter and in other embodiments, the second promoter is stronger than the first promoter.
  • Suitable promoters include, but are not limited to: CMV, miniCMV, EF1A, CAG, CBh, EFS, SV40, and hPGK.
  • a nucleic acid (encoding the Acr polypeptide, the CRISPR nuclease, or both) is a viral vector, e.g., an AAV vector.
  • the Acr polypeptide and/or the CRISPR nuclease are provided as a nucleic acid or in protein form, either can In some cases, by included in a lipid nanoparticle (LNP) formulation.
  • LNP lipid nanoparticle
  • contacting includes contacting the target nucleic acid with an Acr polypeptide and a CRISPR nuclease (see above), where the contacting results in a modification to the nucleotide sequence (e.g., substitution, insertion, deletion) of the target nucleic acid.
  • the contacting includes delivery of the CRISPR nuclease and the Acr polypeptide at a 1:1 ratio.
  • the contacting includes delivery of the CRISPR nuclease and the Acr polypeptide at a ratio (CRISPR:Acr) in a range of from 1:1.25 to 1:10.
  • the contacting includes delivery of the Acr polypeptide and the CRISPR nuclease at a ratio (Acr:CRISPR) in a range of from 1:1.25 to 1:10.
  • the contacting can occur in vitro, ex vivo, and in vivo, and the contacting can occur in a cell, e.g., a eukaryotic cell, an animal cell, a mammalian cell, a human cell, etc.
  • the target nucleic acid encodes a gene product (e.g., a non-coding RNA, an mRNA/protein, and the like).
  • the on-target/off-target CRISPR activity referred to in a subject composition, system, or method can include editing of the target nucleic acid (e.g., via DNA cleavage in the presence or absence of a donor polynucleotide), e.g., genome editing.
  • editing efficiency of ON target CRISPR nuclease activity is greater than editing efficiency of OFF target CRISPR nuclease activity (e.g., at least 2 times, at least 4 times, at least 5 times, or at least 10 times greater).
  • the OFF target CRISPR nuclease activity is at an off-target site that includes no more than 5 mismatches as compared to the on-target site.
  • the OFF target activity of the CRISPR nuclease in the presence of the Acr polypeptide is decreased when compared with the OFF target activity of the CRISPR nuclease in the absence of the Acr polypeptide (e.g., by 10% or more, 20% or more, 30% or more, etc.).
  • the ON target activity of the CRISPR nuclease in the presence of the Acr polypeptide is decreased as compared with the ON target activity of the CRISPR nuclease in the absence of the Acr polypeptide (e.g., by no more than 10%, by no more than 20%, by no more than 30%, by no more than 40%, etc.).
  • a subject method includes measuring editing efficiency at the ON target site. In some cases, a subject method includes measuring editing efficiency at one or more off-target sites.
  • FIG.1 depicts editing efficiency of AsCas12a (SEQ ID NO: 175) in the presence of Acrs - tested in HEK293T cells. Acx-105 represents a negative control; this Acr inhibits Ca9 but does not inhibit AsCas12a.
  • FIG.2 depicts editing efficiency of NUX (SEQ ID NO: 176) in the presence of Acrs - tested in HEK293T cells.
  • FIG.3 depicts results from dosing experiments using the CRISPR nuclease AsCas12a with increasing amounts (left to right) of the Acr Acx-175, using a guide that matched the DNMT1 (for on-targeting editing) and one guide that was mismatched at position 9 (to measure off-target editing).
  • Acx-105 (which does not inhibit AsCas12a) was added to keep the relative amount of Acr protein consistent across the samples.
  • FIG.4 depicts results from dosing experiments using the CRISPR nuclease NUX (SEQ ID NO: 177) with the Acr Acx-175 (SEQ ID NO: 165) to observe editing activity targeting DNMT1.
  • FIG.5 depicts results from using seven engineered Acrs (ErAcrs) (Table 4, Acx- 305, -306, -307, -308, -310, -311) that were transferred into a CMV expressing vector and tested in HEK293T cells with NUX (SEQ ID NO: 178) and DNMT1 for on-target editing and with a guide RNA with a single base change at position 8 (MM8) for off- target editing.
  • ErAcrs engineered Acrs
  • FIG.6A-6C depict results from using four of the engineered Acrs (Acx-306, Acx- 308, Acx-310 and Acx-311). Three different ratios of Nux:Acx (2:1 (FIG.6A), 1:1 (FIG. 6B) and 1:2 (FIG.6C)) were transfected into HEK293T cells.
  • the target was DNMT1.
  • On-target editing was measured with a DNMT1 guide RNA and off-target editing was measured with a guide RNA with a single base change at position 9.
  • FIG.7 depicts results in HEK293T cells comparing Acrs, Acx-175, Acx-306, Acx- 310, and negative control Acx-105 with the nuclease NUX (SEQ ID NO: 178) using guide RNAs with either a perfect match to the target, or a mismatch of one base at different positions in the guide RNA.
  • FIG.8A-8B depict results from using the Acr Acx-315 (SEQ ID NO: 132) with NUX (SEQ ID NO: 178) in HEK293T cells over a range of Nux:Acx ratios for on-target editing for DNMT1and off-target editing (with MM8 guide RNA).
  • FIG.9A-9D depict schematics of non-limiting examples of arrangements of components in a subject nucleic acid.
  • P1 and P2 are promoters
  • Cas is the Cas- encoding sequence
  • Acr is the Acr-encoding sequence
  • 2A is a 2A peptide encoding sequence
  • X is a ‘spacer’ sequence.
  • FIG.10A-10F depict schematics of non-limiting examples of arrangements of components in a subject nucleic acid.
  • FIG.11A-11B depict schematics of two non-limiting examples of arrangements of components in a subject nucleic acid.
  • “P1” and “P2” are promoters, “Cas” is the Cas- encoding sequence, “Acr” is the Acr-encoding sequence, “IRES” is an IRES sequence, and “X” is a ‘spacer’ sequence.
  • FIG.11A-11B depict schematics of two non-limiting examples of arrangements of components in a subject nucleic acid.
  • “P1” and “P2” are promoters
  • “Cas” is the Cas- encoding sequence
  • “Acr” is the Acr-encoding sequence
  • Asterisks denotes a non-AUG start codon.
  • FIG.12 depicts results from on-target vs off-target editing experiments using ErAcr3, ErAcr4, ErAcr13, and ErAcr20.
  • FIG.13 depicts a sequence alignment of ErAcr3, ErAcr4, ErAcr13, and ErAcr20 (SEQ ID Nos.: 264-27, respectively).
  • this term includes, but is not limited to, single-, double-, or multi-stranded DNA or RNA, genomic DNA, cDNA, DNA-RNA hybrids, or a polymer comprising purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural, or derivatized nucleotide bases.
  • Standard Watson-Crick base-pairing includes: adenine (A) pairing with thymidine (T), adenine (A) pairing with uracil (U), and guanine (G) pairing with cytosine (C) [DNA, RNA].
  • RNA molecules e.g., dsRNA
  • DNA molecule with an RNA molecule e.g., when a DNA target nucleic acid base pairs with a guide RNA, etc.
  • guanine (G) can also base pair with uracil (U).
  • G/U base-pairing is at least partially responsible for the degeneracy (i.e., redundancy) of the genetic code in the context of tRNA anti-codon base-pairing with codons in mRNA.
  • a guanine (e.g., of dsRNA duplex of a guide RNA molecule; of a guide RNA base pairing with a target nucleic acid, etc.) is considered complementary to both a uracil (U) and to an adenine (A).
  • G guanine
  • U uracil
  • A adenine
  • Hybridization requires that the two nucleic acids contain complementary sequences, although mismatches between bases are possible.
  • the conditions appropriate for hybridization between two nucleic acids depend on the length of the nucleic acids and the degree of complementarity, variables well known in the art. The greater the degree of complementarity between two nucleotide sequences, the greater the value of the melting temperature (Tm) for hybrids of nucleic acids having those sequences.
  • Tm melting temperature
  • the length for a hybridizable nucleic acid is 8 nucleotides or more (e.g., 10 nucleotides or more, 12 nucleotides or more, 15 nucleotides or more, 17 nucleotides or more, 18 nucleotides or more, 20 nucleotides or more, 22 nucleotides or more, 25 nucleotides or more, or 30 nucleotides or more).
  • Temperature, wash solution salt concentration, and other conditions may be adjusted as necessary according to factors such as length of the region of complementation and the degree of complementation.
  • ''peptide ''polypeptide
  • protein refers to a polymeric form of amino acids of any length, which can include coded and non-coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified peptide backbones.
  • "Binding” as used herein refers to a non-covalent interaction between macromolecules (e.g., between a protein and a nucleic acid such as DNA or RNA).
  • the macromolecules While in a state of non-covalent interaction, the macromolecules are said to be “associated” or “interacting” or “binding” (e.g., when a molecule X is said to interact with a molecule Y, it is meant the molecule X binds to molecule Y in a non- covalent manner). Not all components of a binding interaction need be sequence- specific (e.g., contacts with phosphate residues in a DNA backbone), but some portions of a binding interaction may be sequence-specific.
  • Binding interactions can generally be characterized by a dissociation constant (KD) of less than 10 6 M, less than 10 M, less than 10 -8 M, less than 10 -9 M, less than 10 -10 M, less than 10 -11 M, less than 10 -12 M, less than 10 -13 M, less than 10 -14 M, or less than 10 -15 M.
  • KD dissociation constant
  • Affinity refers to the strength of binding, increased binding affinity being correlated with a lower KD.
  • a “promoter” or a “promoter sequence” is a DNA regulatory region capable of binding RNA polymerase and initiating transcription of a downstream (3' direction) coding or non-coding sequence.
  • the promoter sequence is bounded at its 3' terminus by the transcription initiation site and extends upstream (5' direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background.
  • Eukaryotic promoters will often, but not always, contain "TATA” boxes and "CAT” boxes.
  • Various promoters, including constitutive, tissue-specific, and inducible promoters may be used to drive expression by the various vectors of the present disclosure.
  • the level of expression of a given promoter can be described as weak, medium, or strong – and thus, promoters can be categorized as weak, medium, or strong promoters.
  • operably linked refers to a juxtaposition wherein the components so described are in a relationship permitting them to function in their intended manner.
  • a promoter is operably linked to a nucleotide sequence (the nucleotide sequence can also be said to be operably linked to the promoter) if the promoter affects transcription of said nucleotide sequence.
  • a translational control element is operably linked to a protein-coding sequence (the protein-coding sequence can also be said to be operably linked to the translational control element) if the translational control element affects translation of protein from the protein-coding sequence.
  • a “coordinated delivery system” as used herein refers to the coordinated delivery of a an Acr protein and a Cas nuclease.
  • a coordinated delivery system includes 1 or more nucleic acids (e.g., vectors) for expression of an Acr protein and a CRISPR nuclease, e.g., in a host cell.
  • a coordinated delivery system provides a single nucleic acid (e.g., vector) for expression of an Acr protein and a CRISPR nuclease.
  • a coordinated delivery system provides more than one nucleic acid (e.g., vector) for expression of an Acr protein and a Cas nuclease and the expression and/or function is coordinated such as with the provision of a split-Cas from 2 vectors.
  • the expression and/or function of an Acr protein and a Cas nuclease is linked (i.e., coordinated) by virtue of translational control element selected to regulate the translation of the Acr protein and/or CRISPR nuclease.
  • the effect may be prophylactic in terms of completely or partially preventing a disease or symptom thereof and/or may be therapeutic in terms of a partial or complete cure for a disease and/or adverse effect attributable to the disease.
  • Treatment covers any treatment of a disease in a mammal, e.g., in a human, and includes: (a) preventing the disease from occurring in a subject which may be predisposed to the disease but has not yet been diagnosed as having it; (b) inhibiting the disease, i.e., arresting its development; and (c) relieving the disease, i.e., causing regression of the disease.
  • subject and "host,”,” used interchangeably herein, refer to an individual organism that expresses or is intended to express the coordinated delivery system and/or the Cas nuclease and/or Acr proteins decribed herein.
  • Hosts include, but are not limited to fungi (such as yeasts), plants, algae, insects, animals such as birds and mammals, (e.g., a mammal, including, but not limited to, murines, simians, humans, mammalian farm animals, mammalian sport animals, and mammalian pets).
  • Hosts include, but are not limited to, microbes such as bacteria and fungi (such as yeasts), plants, algae, insects, animals such as birds and mammals, (e.g., a mammal, including, but not limited to, murines, simians, humans, mammalian farm animals, mammalian sport animals, and mammalian pets).
  • a mammal including, but not limited to, murines, simians, humans, mammalian farm animals, mammalian sport animals, and mammalian pets.
  • on-target or “ON target”
  • off-target or “OFF target” are used herein to refer to the locations of CRISPR complex activity (e.g., target DNA cleavage, DNA editing) within a target DNA. Both location types (on- and off- target) are based on the guide sequence of the guide RNA.
  • CRISPR complex mediated events that take place at a location based on a 100% match with the guide sequence are considered “on-target” while those that take place at (undesired) locations that are not based on a 100% match with the guide sequence are “off-target”.
  • sequence of the target DNA is known (e.g., a large portion of the genome of a target cell has been sequenced)
  • likely off-target sites can be predicted for a given guide sequence.
  • off- target events are more likely to take place at sequences that are closer to a 100% match between the guide RNA and target than sequences that are at lower percentages of identity between the guide RNA and target.
  • an off-target event may occur where there is 1 mismatch between the guide RNA and target, 2 mismatches between the guide RNA and target or 3 mismatches between the guide RNA and target.
  • a sequence analysis of the target DNA can provide a list of expected possible off-target sites within a target DNA.
  • the number of predicted off-target sites will depend on the target DNA sequence, but In some cases, the number of predicted off-target sites will be in a range of from 10-200 predicted sites (e.g., from 10-150, 10-100, 10-50, 15-200, 15-150, 15-100, 15-80, 20-200, 20-150, 20-100, or 20-80 predicted sites).
  • 10-200 predicted sites e.g., from 10-150, 10-100, 10-50, 15-200, 15-150, 15-100, 15-80, 20-200, 20-150, 20-100, or 20-80 predicted sites.
  • the present disclosure provides Acrs, including non-naturally occurring engineered Acr proteins (ErAcrs), that can be used to achieve a balance in which Cas proteins retain sufficient activity to perform a desired on-target nucleic acid functions (e.g., loading, complex assembling, binding and cleavage), but are inhibited to a degree that decreases off-target activity as compared to the activity of the Cas protein without the presence of the Acr (see, e.g., SEQ ID NOs.: 126-132).
  • ErAcrs non-naturally occurring engineered Acr proteins
  • compositions and methods disclosed herein include an anti-CRISPR (Acr) polypeptide or a nucleic acid encoding the Acr polypeptide, where the Acr polypeptide includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the sequence set forth in any one of SEQ ID Nos.: 126-132.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • CRISPR complex and “effector complex” as used herein refer to the protein-RNA complex that is guided to a specific sequence within a target nucleic acid (e.g., target genomic DNA) by the RNA component – often referred to as a “guide RNA”.
  • the functions of the effector complex are carried out by a single protein (which can be referred to as an “effector protein”, also referred to herein as a “CRISPR nuclease” or simply a “Cas protein”) – where the natural protein is an endonuclease (e.g., see Zetsche et al, Cell.2015 Oct 22;163(3):759-71; Makarova et al, Nat Rev Microbiol.2015 Nov;13(11):722-36; Shmakov et al., Mol Cell.2015 Nov 5;60(3):385-97; Shmakov et al., Nat Rev Microbiol.2017 Mar;15(3):169-182; Koonin et al., Curr Opin Microbiol.2017 Jun;37:67-78; and Makarova et al., Nat Rev Microbiol.
  • CRISPR/Cas protein class 2 CRISPR/Cas protein or “CRISPR/Cas effector protein” or “Cas effector protein” or more simply “CRISPR nuclease” or “Cas protein” is used herein to encompass the effector protein from class 2 CRISPR systems.
  • CRISPR nucleases include Cas12a proteins and NUX proteins (described elsewhere herein).
  • Acr proteins are proteins that inhibit (decrease activity of) Cas proteins – thereby acting as negative regulators of the CRISPR complex.
  • a subject Acr protein is an inhibitor of a Cas protein of a class 2 CRISPR complex (e.g., a class 2 effector protein such as a Cas12 protein, e.g., a Cas12a – also known as Cpf1) – thereby directly regulating the effector protein of a CRISPR complex.
  • a Cas protein of a class 2 CRISPR complex e.g., a class 2 effector protein such as a Cas12 protein, e.g., a Cas12a – also known as Cpf1
  • the effector CRISPR-Cas nucleases that complex with gRNA are highly diverse and spread across 6 distinct types (Types I–VI). So far, anti-CRISPR proteins (Acrs) have been discovered that inhibit CRISPR Type I, II, and V systems.
  • pyogenes SpCas9
  • CRISPR- Cas enzyme for biotechnological applications and has also been deployed in DNA- binding applications.
  • Acr proteins that function against the Type II-A system have been discovered by a bioinformatics approach that surveys bacterial genomes for self- targeting.
  • specific inhibitory mechanisms have been determined for some of them (e.g., AcrIE1, AcrIF1-3, AcrIF10, AcrIIA2, AcrIIA4, AcrIIC1-3, and AcrVA5).
  • the known mechanisms are highly diverse, presenting a pool of off-switch modalities to draw from.
  • Acr proteins can act at 3 different steps of CRISPR-Cas-mediated immunity including 1) inhibiting guide RNA loading, 2) blocking DNA binding, and 3) preventing DNA cleavage.
  • the most common mechanism observed to date is that the anti-CRISPR protein occupies the DNA-binding site on the Cas protein, thus mimicking DNA and inhibiting the DNA-binding and cleavage activity of the protein.
  • the mechanisms by which Acrs block DNA binding can be different. For example, even though AcrIF1, AcrIF2, and AcrIF10 bind to different subunits of the cascade effector complex of the type I-F CRISPR-Cas system, they all prevent DNA binding to the complex.
  • AcrIIC3 also blocks DNA binding but uses a fourth and distinct mechanism - promoting dimerization of Cas9.
  • a 3.9- ⁇ resolution cryo-electron structure revealed that the Cas9-sgRNA- AcrIIA4 complex has AcrIIA4 bound to the PAM-interacting domain of Cas9, thus preventing the target DNA binding.
  • AcrIIA4 binds only to assembled Cas9- sgRNA complexes, not to Cas9 protein alone or to preformed Cas9-sgRNA-DNA complexes.
  • a subject “Acr protein” comprises the wild type (natural) sequence.
  • a subject Acr Protein is mutated.
  • the inventors have generated engineered Acr proteins (ErAcrs) that are not naturally occurring (see, e.g., SEQ ID NOs.: 126-132) and can be used to achieve a balance in which Cas proteins retain sufficient activity to perform a desired on-target nucleic acid functions (e.g., loading, complex assembling, binding and cleavage), but are inhibited to a degree that decreases off-target activity.
  • ErAcrs engineered Acr proteins
  • compositions and methods disclosed herein include an anti-CRISPR (Acr) polypeptide (an ErAcr), or a nucleic acid encoding the Acr polypeptide.
  • Acr anti-CRISPR
  • ErAcr engineered Acr proteins
  • Examples of engineered Acr proteins (ErAcrs) provided herein include SEQ ID NOs: 126-132 (see, e.g., the working examples). Additional examples of ErAcrs provided herein include SEQ ID NOs: 264-267 (see, e.g., example 7 below). Additional examples of Acr proteins provided herein include those set forth as SEQ ID NOs.165- 169.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132. In some cases, the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 126.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 126. In some cases, the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 126.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 126. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 126.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 127.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 127.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 127.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 127. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 127.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 128.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 128.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 128.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 128. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 128.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 129.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 129.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 129.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 129. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 129.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 130.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 130.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 130.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 130. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 130.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 131.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 131.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 131.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 131. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 131.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 132.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 132.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 132.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 132. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 132.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132 and 264-267.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132 and 264-267. In some cases, the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132 and 264-267.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132 and 264-267. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 126-132 and 264-267.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 264-267.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 264-267.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 264-267.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 264-267. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 264-267.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 264.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 264.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 264.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 264. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 264.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 265.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 265.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 265.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 265. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 265.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 266.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 266.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 266.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 266. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 266.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 267.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 267.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 267.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 267. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 267.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 165-169.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 165-169. In some cases, the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 165-169.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 165-169. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in any one of SEQ ID NOs: 165-169.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 165.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 165.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 165.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 165. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 165. [0075] In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 165, but has an amino acid variation at least one position between and including amino acid 2 and amino acid 159 of SEQ ID NO:165 (see, e.g., Table 6). In some cases, an Acr polypeptide includes one or more amino acid variations set forth in Table 6.
  • an Acr polypeptide includes one or more amino acid variations set forth in Table 7.
  • a subject Acr protein i.e., polypeptide
  • comprises an amino acid sequence having 75% or more sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 166.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 166.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 166. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 166.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 167.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 167.
  • the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 167.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 167. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 167.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 168.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 168. In some cases, the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 168.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 168. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 168.
  • a subject Acr protein i.e., polypeptide
  • a subject Acr protein comprises an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 169.
  • the subject Acr protein comprises an amino acid sequence having 85% or more sequence identity (e.g., 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 169. In some cases, the subject Acr protein comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 169.
  • the subject Acr protein comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the Acr protein amino acid sequence set forth in SEQ ID NO: 169. In some cases, the subject Acr protein comprises the Acr protein amino acid sequence set forth in SEQ ID NO: 169.
  • CRISPR Nucleases [0080] As with a subject Acr polypeptide, in some cases, a subject CRISPR nuclease comprises a wild type (natural) sequence.
  • the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with a wild type CRISPR nuclease (e.g., a Cas12a or a Nux protein).
  • sequence identity e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity
  • a wild type CRISPR nuclease e.g., a Cas12a or a Nux protein
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with a wild type CRISPR nuclease (e.g., a Cas12a or a Nux protein).
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with a wild type CRISPR nuclease (e.g., a Cas12a or a Nux protein).
  • the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 97% or more, 98% or more, 99% or more, or 100% sequence identity) with a wild type CRISPR nuclease (e.g., a Cas12a or a Nux protein).
  • the CRISPR nuclease comprises the amino acid sequence of a wild type CRISPR nuclease (e.g., a Cas12a or a Nux protein).
  • a subject CRISPR nuclease has been ‘evolved’ such that it has low overall sequence homology to the natural CRISPR nuclease, but retains the identifiable characteristic domain(s) of that protein.
  • the CRISPR nuclease is a Cas12a protein (see, e.g., SEQ ID NOs.: 175 and 245-262). In some cases, (e.g., in systems and/or methods disclosed herein), the CRISPR nuclease is a NUX protein (see, e.g., SEQ ID NOs.: 1-86 and 176-244). In some cases, the CRISPR nuclease is SEQ ID NO: 263.
  • the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:263.
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:263.
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:263. In some cases, the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:263. In some cases, the CRISPR nuclease comprises the amino acid sequence set forth in SEQ ID NO:263.
  • CRISPR nucleases include those of Table 1. [0083] Table 1. Examples of CRISPR nuclease proteins Name Sequence SEQ ID NO: (AsCas12a) MTQFEGFTNLYQVSKTLRFELIPQGKTLKHIQE 175 QGFIEEDKARNDHYKELKPIIDRIYKTYADQCL QLVQLDWENLSAAIDSYRKEKTEETRNALIEE QATYRNAIHDYFIGRTDNLTDAINKRHAEIYKG LFKAELFNGKVLKQLGTVTTTEHENALLRSFD KFTTYFSGFYENRKNVFSAEDISTAIPHRIVQD NFPKFKENCHIFTRLITAVPSLREHFENVKKAI GIFVSTSIEEVFSFPFYNQLLTQTQIDLYNQLLG GISREAGTEKIKGLNEVLNLAIQKNDETAHIIAS LPHRFIPLFKQILSDRNTLSFILEEFKSDEEVIQ SFCKYKTLLRNENV
  • the Cas effector protein i.e., CRISPR nuclease
  • is a variant (is modified/mutated) i.e., includes one or more amino acid mutations such as substitution(s), insertion(s), deletion(s) relative to a wildtype Cas effector protein).
  • the CRISPR nuclease has one or more amino acid mutations relative to a wild type protein.
  • Cas12a protein the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs:175 and 245-262.
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 175 and 245-262.
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 175 and 245-262.
  • the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 175 and 245-262. In some cases, the CRISPR nuclease comprises the amino acid sequence set forth in any one of SEQ ID NOs: 175 and 245-262.
  • the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 245-262.
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 245-262.
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 245-262. In some cases, the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 245-262.
  • the CRISPR nuclease comprises the amino acid sequence set forth in any one of SEQ ID NOs: 245-262. [0087] In some cases, the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:175.
  • sequence identity e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:175.
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:175.
  • the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:175. In some cases, the CRISPR nuclease comprises the amino acid sequence set forth in SEQ ID NO: 175.
  • NUX protein is used herein to mean a nuclease having Cas-like activity (an RNA-guided CRISPR effector protein) and having an amino acid sequence of any one of SEQ ID NOs: 1-86 and 176-244, or at least 70% identity, 80% identity, 85% identity, 90% identity, 95% identity, 98% percent identity, or 99% identity to an amino acid sequence of any one of SEQ ID NOs: 1-86 and 176-244.
  • the amino acid sequence of the Nux protein lacks identity or significant amino acid homology with certain known Cas nucleases.
  • the amino acid sequence of the NUX protein lacks identity or significant amino acid homology with Cas12a (Cpf1) protein.
  • the amino acid sequence of the NUX protein lacks identity or significant amino acid homology with a Cas9 such as SaCas9 or SpyCas9. In some embodiments, the amino acid sequence of the NUX protein has less than 50%, less than 48%, less than 45%, less than 40%, less than 35% or less than 34% with Cas12a (Cpf1) protein, SaCas9 or SpyCas9 proteins.
  • the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs:1- 86 and 176-244.
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 1-86 and 176-244.
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 1-86 and 176-244.
  • the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 1-86 and 176-244. In some cases, the CRISPR nuclease comprises the amino acid sequence set forth in any one of SEQ ID NOs: 1-86 and 176-244.
  • the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs:176-178.
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 176-178.
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 176-178. In some cases, the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in any one of SEQ ID NOs: 176-178.
  • the CRISPR nuclease comprises the amino acid sequence set forth in any one of SEQ ID NOs: 176-178. [0091] In some cases, the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:176.
  • 70% or more sequence identity e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:176. In some cases, the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:176.
  • the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:176. In some cases, the CRISPR nuclease comprises the amino acid sequence set forth in SEQ ID NO:176.
  • the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:177.
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:177.
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:177. In some cases, the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:177. In some cases, the CRISPR nuclease comprises the amino acid sequence set forth in SEQ ID NO:177.
  • the CRISPR nuclease comprises an amino acid sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:178.
  • the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:178.
  • the CRISPR nuclease comprises an amino acid sequence having 90% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:178. In some cases, the CRISPR nuclease comprises an amino acid sequence having 95% or more sequence identity (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the amino acid sequence set forth in SEQ ID NO:178. In some cases, the CRISPR nuclease comprises the amino acid sequence set forth in SEQ ID NO:178.
  • a subject composition, system, or method includes a guide RNA (or a nucleic acid encoding the guide RNA).
  • a subject composition, system, or method e.g., a vector or vector system
  • the promoter is an RNA polymerase III promoter (e.g. U6, H1), which can be used to express non-coding RNAs in eukaryotic cells.
  • a “guide RNA” is nucleic acid that binds to a Cas protein (e.g., a Class 2 CRISPR nuclease such as Cas12a), thus forming a CRISPR complex (a protein-RNA effector complex) – and can target the CRISPR complex to a specific ‘on-target’ target sequence within a target nucleic acid (e.g., genomic DNA, e.g., eukaryotic or prokaryotic genomic DNA).
  • a target nucleic acid e.g., genomic DNA, e.g., eukaryotic or prokaryotic genomic DNA.
  • a hybrid DNA/RNA can be made such that a guide RNA includes DNA bases in addition to RNA bases - but the term “guide RNA” is still used herein to encompass such hybrid molecules.
  • a guide RNA provides target specificity to the CRISPR complex by including a targeting segment, which includes a guide sequence (also referred to herein as a targeting sequence), which is a nucleotide sequence that is complementary to a sequence of a target nucleic acid.
  • a subject guide RNA includes (i) a guide sequence (also referred to as a “spacer” or “targeting sequence”) that hybridizes to a target sequence (also referred to as a “protospacer”) of a target nucleic acid, e.g., target DNA; and (ii) a constant region (e.g., a region that is adjacent to the guide sequence and binds to the Cas protein).
  • a “constant region” can also be referred to herein as a “protein-binding segment” or a “handle.”
  • an on-target event e.g., target DNA cleavage, transcription modulation, DNA methylation, histone modification
  • CRISPR complex mediated events that take place at a location that is not a 100% match with the guide sequence is referred to herein as an off-target event.
  • a guide RNA can be referred to by the protein to which it corresponds.
  • a guide RNA when the guide RNA binds to and guides a class 2 CRISPR/Cas effector protein, the guide RNA can be referred to as a “class 2 guide RNA.”
  • the class 2 CRISPR/Cas effector protein is a Cpf1 (Cas12a) protein
  • the corresponding guide RNA can be referred to as a “Cpf1 guide RNA” or “Cas12a guide RNA.”
  • a guide RNA includes two separate nucleic acid molecules: an “activator” (e.g., a tracrRNA) and a “targeter” (e.g., a crRNA) and is referred to herein as a “dual guide RNA”, a “double-molecule guide RNA”, a “two- molecule guide RNA”, or a “dgRNA.”
  • the guide RNA is one molecule.
  • the corresponding guide RNA is naturally a single molecule
  • the corresponding guide RNA is naturally two separate molecules (e.g., a crRNA and a tracrRNA) – and the two molecules (an activator, e.g., tracrRNA, and a targeter, e.g., a crRNA) can be covalently linked to one another, e.g., via chemical linkage or intervening nucleotides.
  • the guide RNA When the guide RNA is one molecule, the guide RNA can be referred to as a “single guide RNA”, a “single-molecule guide RNA,” a “one-molecule guide RNA”, or simply “sgRNA.” “Guide RNA” (or “gRNA”) is a generic term that encompasses dual guide and single guide formats. [0099] The guide sequence has complementarity with (hybridizes to) a target sequence of the target nucleic acid (e.g., target DNA).
  • a target sequence of the target nucleic acid e.g., target DNA
  • the guide sequence is 15- 28 nucleotides (nt) in length (e.g., 15-26, 15-24, 15-22, 15-20, 15-18, 16-28, 16-26, 16- 24, 16-22, 16-20, 16-18, 17-26, 17-24, 17-22, 17-21, 17-20, 17-19, 17-18, 18-26, 18-24, 18-22, 18-20, or 19-21 nt in length).
  • the guide sequence is 18-24 nucleotides (nt) in length.
  • the guide sequence is 17-18 nucleotides (nt) in length.
  • the guide sequence is at least 15 nt long (e.g., at least 16, 18, 20, or 22 nt long).
  • the guide sequence is at least 17 nt long. In some cases, the guide sequence is at least 18 nt long. In some cases, the guide sequence is at least 20 nt long. In some cases, the guide sequence is 20 nt long. [00100] In some cases, the constant region (also referred to as a scaffold) of a guide RNA is 15 or more nucleotides (nt) in length (e.g., 18 or more, 20 or more, 21 or more, 22 or more, 23 or more, 24 or more, 25 or more, 26 or more, 27 or more, 28 or more, 29 or more, 30 or more, 31 or more nt, 32 or more, 33 or more, 34 or more, 35 or more , 40 or more, 45 or more, 50 or more, 60 or more, 70 or more, 80 or more, 90 or more, or 100 or more nt in length).
  • nt nucleotides
  • the constant region of a guide RNA is 18 or more nt in length.
  • Guide RNAs with various modifications to increase efficiency relative to naturally existing guide RNAs are known in the art and are readily available to one of ordinary skill in art. See, e.g., Moon et al., Trends Biotechnol. 2019 Aug;37(8):870-881, “Improving CRISPR Genome Editing by Engineering Guide RNAs”.
  • guide RNA as used herein encompasses such modifications and any convenient guide RNA can be used with the methods and compositions disclosed herein (e.g., as part of a subject system – for example as RNA or as encoded by a subject nucleic acid).
  • Acr/CRISPR nuclease combinations – “Systems” [00102] In general, the term “system” is used herein when referring to the combination of an Acr polypeptide (described herein) (in protein and/or nucleic acid form) and a CRISPR nuclease (described herein) (in protein and/or nucleic acid form) that it inhibits.
  • a subject system further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide and the CRISPR nuclease do not naturally occur together (i.e., they are heterologous to one another).
  • the Acr polypeptide and/or the CRISPR nuclease includes a variant amino acid sequence (e.g., an engineered Acr polypeptide disclosed herein).
  • the Acr polypeptide and the CRISPR nuclease are naturally occurring proteins, but the two proteins do not exist together in nature (e.g., the Acr polypeptide may be from a phage that does not naturally infect the bacterial species which harbors the CRISPR nuclease).
  • the Acr polypeptide may be from a phage that does not naturally infect the bacterial species which harbors the CRISPR nuclease.
  • subject systems can include any combination thereof.
  • a subject system includes an Acr polypeptide or a nucleic acid (DNA or RNA) encoding the Acr polypeptide; and a CRISPR nuclease or a nucleic acid (DNA or RNA) encoding the CRISPR nuclease.
  • a subject system includes the CRISPR nuclease and a nucleic acid encoding the Acr polypeptide.
  • a subject system includes the Acr polypeptide and a nucleic acid encoding the CRISPR nuclease.
  • a subject system includes a nucleic acid encoding the Acr polypeptide and a nucleic acid encoding the CRISPR nuclease.
  • both the nucleotide sequence encoding the Acr polypeptide and the nucleotide sequence encoding CRISPR nuclease are on the same nucleic acid (e.g. the same vector).
  • the nucleotide sequence encoding the Acr polypeptide and the nucleotide sequence encoding CRISPR nuclease are on different nucleic acids (e.g., different vectors).
  • the Acr polypeptide and the CRISPR nuclease of a subject system can be any of those described herein, in any desired combination.
  • Acr protein (SEQ ID NOs: 165-169) plus CRISPR nuclease (Cas12a or NUX) [00105]
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 1-86 and 175-262.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 175-262.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-168; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 175-262.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 165; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 175-262.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 175 and 245-262.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 175 and 245-262.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-168; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 175 and 245-262.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 165; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 175 and 245-262.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes the amino acid sequence set forth as SEQ ID NO: 175.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 175.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-168; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 175.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 165; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 175.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 1-86 and 176-244.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 176-244.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-168; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 176-244.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 165; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 176-244.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 176- 178.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 176-178.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-168; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 176-178.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 165; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 176-178.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 176.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-167; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 176.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 165; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 176.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 165; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 177.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 165-169; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 178.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 178.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 165; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 178.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • a guide RNA that guides the CRISPR nuclease to a target sequence
  • a nucleic acid encoding the guide RNA.
  • Acrs SEQ ID NOs: 126-132
  • CRISPR nuclease [00113]
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 1-86 and 175-262.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 175-262.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 1-86 and 175-263.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 175-263.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 175 and 245-262.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 175 and 245-262.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes the amino acid sequence set forth as SEQ ID NO: 175.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 175.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 1-86 and 176-244.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 176-244.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 176- 178.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 176-178.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes the amino acid sequence set forth as SEQ ID NO: 263.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 126-132; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 263.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • a guide RNA that guides the CRISPR nuclease to a target sequence
  • a nucleic acid encoding the guide RNA.
  • Acrs SEQ ID NOs: 264-267) plus CRISPR nuclease [00120]
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 1-86 and 175-263.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 175-263.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 175 and 245-262.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 175 and 245-262.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes the amino acid sequence set forth as SEQ ID NO: 175.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 175.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 1-86 and 176-244.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86 and 176-244.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 1-86, 176-244, and 263.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 1-86, 176-244, and 263.
  • sequence identity e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes the amino acid sequence set forth set forth as any one of SEQ ID NOs.: 176- 178.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 176-178.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • the Acr polypeptide of a subject system includes the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes the amino acid sequence set forth as SEQ ID NO: 263.
  • the Acr polypeptide of a subject system includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as any one of SEQ ID NOs.: 264-267; and the CRISPR nuclease includes an amino acid sequence having 75% or more sequence identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence identity) with the amino acid sequence set forth as SEQ ID NO: 263.
  • a system described in this paragraph further includes a guide RNA (that guides the CRISPR nuclease to a target sequence) or a nucleic acid encoding the guide RNA.
  • a protein e.g., an Acr protein and/or a CRISPR nuclease such as Cas12a or a NUX protein
  • fusion partners e.g., one or more NLSs, a protein tag, and the like.
  • Suitable fusion partners include but are not limited to: (i) subcellular localization sequences (e.g., one or more, two or more, or three or more nuclear localization signals (NLSs) for targeting to the nucleus, a sequence to keep the fusion protein out of the nucleus, e.g., a nuclear export sequence (NES), a sequence to keep the fusion protein retained in the cytoplasm, a mitochondrial localization signal for targeting to the mitochondria, a chloroplast localization signal for targeting to a chloroplast, an ER retention signal, and the like); (ii) protein tags, e.g., for ease of tracking and/or purification (e.g., a fluorescent protein, such as green fluorescent protein (GFP), YFP, RFP, CFP, mCherry, tdTomato, and the like, MBP, CBP, strep tag, GST, HA, FLAG, poly(His), Myc, V5, Spot, NE, AviTag, and the like); and
  • a subject protein e.g., an Acr protein and/or a CRISPR nuclease such as Cas12a or a NUX protein
  • a subject protein can have a fusion partner that provides for tagging (e.g., GFP), and can also have a subcellular localization sequence (e.g., one or more NLSs).
  • a fusion protein might also have a tag for ease of tracking and/or purification.
  • a subject protein e.g., an Acr protein such as a subject engineered Acr protein
  • can have one or more NLSs e.g., two or more, three or more, four or more, five or more, 1, 2, 3, 4, or 5 NLSs.
  • a fusion partner is located at or near the C-terminus (e.g., within about 50 amino acids of the C-terminus), near the N- terminus (e.g., within about 50 amino acids of the N-terminus), or at both the N-terminus and C-terminus.
  • Non-limiting examples of NLSs include an NLS sequence derived from: the NLS of the SV40 virus large T-antigen, having the amino acid sequence PKKKRKV (SEQ ID NO: 87); the NLS from nucleoplasmin (e.g., the nucleoplasmin bipartite NLS with the sequence KRPAATKKAGQAKKKK (SEQ ID NO: 88)); the c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO: 89) or RQRRNELKRSP (SEQ ID NO: 90); the hRNPA1 M9 NLS having the sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 91); the sequence RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV (SEQ ID NO: 92) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO:
  • NLS has the sequence GRSSDDEATADSQHAAPPKKKRKV (SEQ ID NO: 125).
  • NLS or multiple NLSs are of sufficient strength to drive accumulation of the Cas protein in a detectable amount in the nucleus of a eukaryotic cell.
  • a fusion partner includes a "Protein Transduction Domain” or PTD (also known as a CPP – cell penetrating peptide), which refers to a polypeptide, polynucleotide, carbohydrate, or organic or inorganic compound that facilitates traversing a lipid bilayer, micelle, cell membrane, organelle membrane, or vesicle membrane.
  • PTD Protein Transduction Domain
  • a PTD attached to another molecule which can range from a small polar molecule to a large macromolecule and/or a nanoparticle, facilitates the molecule traversing a membrane, for example going from extracellular space to intracellular space, or cytosol to within an organelle.
  • a PTD is covalently linked to the amino terminus of a polypeptide and in some embodiments, a PTD is covalently linked to the carboxyl terminus of a polypeptide.
  • the PTD is inserted internally at a suitable insertion site.
  • a subject Cas protein includes (is conjugated to, is fused to) one or more PTDs (e.g., two or more, three or more, four or more PTDs).
  • PTDs include but are not limited to a minimal undecapeptide protein transduction domain (corresponding to residues 47-57 of HIV-1 TAT comprising YGRKKRRQRRR; SEQ ID NO:103); a polyarginine sequence comprising a number of arginines sufficient to direct entry into a cell (e.g., 3, 4, 5, 6, 7, 8, 9, 10, or 10-50 arginines); a VP22 domain (Zender et al. (2002) Cancer Gene Ther. 9(6):489-96); an Drosophila Antennapedia protein transduction domain (Noguchi et al. (2003) Diabetes 52(7):1732-1737); a truncated human calcitonin peptide (Trehin et al. (2004) Pharm.
  • a minimal undecapeptide protein transduction domain corresponding to residues 47-57 of HIV-1 TAT comprising YGRKKRRQRRR; SEQ ID NO:103
  • a polyarginine sequence comprising a number of arginines sufficient to
  • Exemplary PTDs include but are not limited to, YGRKKRRQRRR (SEQ ID NO:108), RKKRRQRRR (SEQ ID NO:109); an arginine homopolymer of from 3 arginine residues to 50 arginine residues;
  • Exemplary PTD domain amino acid sequences include, but are not limited to, any of the following: YGRKKRRQRRR (SEQ ID NO:110); RKKRRQRR (SEQ ID NO:111); YARAAARQARA (SEQ ID NO:112); THRLPRRRRRR (SEQ ID NO:113); and GGRRARRRRRR (SEQ ID NO:114).
  • the PTD is an activatable CPP (ACPP) (Aguilera et al. (2009) Integr Biol (Camb) June; 1(5-6): 371-381).
  • ACPPs comprise a polycationic CPP (e.g., Arg9 or “R9”) connected via a cleavable linker to a matching polyanion (e.g., Glu9 or “E9”), which reduces the net charge to nearly zero and thereby inhibits adhesion and uptake into cells.
  • a polyanion e.g., Glu9 or “E9”
  • Linkers (e.g., for fusion partners)
  • a subject Cas protein can fused to a fusion partner via a linker polypeptide (e.g., one or more linker polypeptides).
  • the linker polypeptide may have any of a variety of amino acid sequences. Proteins can be joined by a spacer peptide, generally of a flexible nature, although other chemical linkages are not excluded. Suitable linkers include polypeptides of between 4 amino acids and 40 amino acids in length, or between 4 amino acids and 25 amino acids in length. These linkers can be produced by using synthetic, linker-encoding oligonucleotides to couple the proteins, or can be encoded by a nucleic acid sequence encoding the fusion protein.
  • Peptide linkers with a degree of flexibility can be used.
  • the linking peptides may have virtually any amino acid sequence, bearing in mind that the preferred linkers will have a sequence that results in a generally flexible peptide.
  • the use of small amino acids, such as glycine and alanine, are of use in creating a flexible peptide. The creation of such sequences is routine to those of skill in the art.
  • a variety of different linkers are commercially available and are considered suitable for use.
  • linker polypeptides include glycine polymers (G)n, glycine-serine polymers (including, for example, (GS)n, GSGGSn (SEQ ID NO: 115), GGSGGSn (SEQ ID NO: 116), and GGGSn (SEQ ID NO: 117 where n is an integer of at least one), glycine-alanine polymers, alanine-serine polymers.
  • Exemplary linkers can comprise amino acid sequences including, but not limited to, GGSG (SEQ ID NO: 118), GGSGG (SEQ ID NO: 119), GSGSG (SEQ ID NO: 120), GSGGG (SEQ ID NO: 121), GGGSG (SEQ ID NO: 122), GSSSG (SEQ ID NO: 123), and the like.
  • GGSG SEQ ID NO: 118
  • GGSGG SEQ ID NO: 119
  • GSGSG SEQ ID NO: 120
  • GSGGG SEQ ID NO: 121
  • GGGSG SEQ ID NO: 122
  • GSSSG SEQ ID NO: 123
  • compositions, systems, and methods that include one or more nucleic acids, where the one or more nucleic acids encode an Acr protein and a CRISPR nuclease – where the Acr protein is an inhibitor of the Cas protein (e.g., Cas effector protein/CRISPR nuclease), such as inhibiting off-target editing more preferentially than on-target editing.
  • both coding sequences are present on the same nucleic acid (e.g., vector) and In some cases, they are present on separate nucleic acids (e.g., separate vectors).
  • Subject nucleic acids can include a translational control element that is operably linked to the Acr protein or the Cas protein (e.g., CRISPR nuclease such as the Cas12a, NUX protein, or variant thereof), or both - in order to achieve a desired balance (expression level ratio) between the two proteins.
  • a subject nucleic acid includes a translational control element that is operably linked to (and therefore regulates/modulates translation of) a sequence encoding the Acr protein.
  • a subject nucleic acid includes a translational control element that is operably linked to (and therefore regulates/modulates translation of) a sequence encoding the Cas protein.
  • sequence encoding the Acr protein and the sequence encoding the Cas protein are both operably linked to a translational control element, e.g., independently (in which case the two control elements can be the same or different).
  • sequence encoding the Acr protein and the sequence encoding the Cas protein are both operably linked to the same translational control element (e.g., IRES element, 2A peptide encoding sequence) such that the sequences are part of a polycistronic transcript.
  • a subject translational control element is a polycistronic linker.
  • a translational control element promotes (causes) the production of independent gene products (e.g., the Acr protein and the CRISPR nuclease) from the same transcript.
  • a translational control element links a first protein coding sequence (e.g., a sequence encoding an Acr protein) with a second protein coding sequence (e.g., a sequenced encoding a CRISPR nuclease) such that the first and second proteins (e.g., the Acr protein and CRISPR nuclease) are encoded by a polycistronic sequence.
  • both protein sequences would therefore both be operably linked to the same promotor, and the RNA transcribed therefrom would include both protein-coding sequences in addition to the sequence encoded by the translational control element.
  • more than one translational control element e.g., IRES element, 2A peptide, non-AUG start codon
  • a given protein e.g., an Acr protein, a CRISPR nuclease such as Cas12a or a Nux protein.
  • a given protein e.g., an Acr protein, a CRISPR nuclease such as Cas12a or a Nux protein. Any convenient combination of translational control elements can be used.
  • 2A peptide [00138]
  • a translational control element that can function as a polycistronic linker and facilitate the production of separate protein products (e.g., two separate proteins) from the same single RNA transcript is a 2A peptide sequence.
  • a “2A peptide” it is meant a small peptide sequence (usually 18–25 amino acids although several of such sequences can be placed in tandem) that allows for expression (translation) of discrete protein products from a single RNA transcript (e.g., through a self-“cleaving” event often referred to as “ribosome skipping” – although the disclosure herein does not rely on and is not bound by the mechanism of action), even though the separate proteins are encoding as part of the same open reading frame (ORF).
  • 2A peptides are readily identifiable by their consensus motif (DXEXNPGP, sometimes described as DVEXNPGP) and their ability to promote protein cleavage/skipping.
  • 2A peptide sequence may be used in a subject nucleic acid.
  • 2A peptides include, but are not limited to 2A peptides from a virus such as foot-and-mouth disease virus (F2A), equine Rhinitis A virus (E2A), porcine teschovirus-1 (P2A) or Thosea asigna virus (T2A). See, e.g., Szymczak-Workman, A. et al. “Design and Construction of 2A Peptide-Linked Multicistronic Vectors”.
  • F2A foot-and-mouth disease virus
  • E2A equine Rhinitis A virus
  • P2A porcine teschovirus-1
  • T2A Thosea asigna virus
  • a subject 2A peptide coding sequence will be positioned so as to regulate the expression (translation) of a subject protein (e.g., Acr protein or Cas protein) – and as such will be positioned 5’ of (usually immediately 5’ of) and in frame with the protein coding sequence which it regulates.
  • FIG.9 provides non-limiting illustrative examples of embodiments in which a 2A peptide coding sequence is positioned in different ways. In some cases, a 2A peptide sequence is position 5’ of (and usually immediately 5’ of) a Cas protein coding sequence. In some cases, a 2A peptide sequence is position 5’ of (and usually immediately 5’ of) an Acr protein coding sequence.
  • the Cas-encoding sequence and the Acr-encoding sequence are operably linked to the same promoter (encoded by a polycistronic sequence) and are positioned in tandem, with a 2A peptide sequence positioned between them (see, e.g., FIG.9A and 9B).
  • the Cas protein encoding sequence is positioned 5’ of the Acr encoding sequence, and the 2A peptide encoding sequence is therefore 3’ of the Cas sequence and 5’ of the Acr sequence.
  • the Acr protein encoding sequence is positioned 5’ of the Cas encoding sequence, and the 2A peptide encoding sequence is therefore 3’ of the Acr sequence and 5’ of the Cas sequence.
  • the Cas-encoding sequence and the Acr-encoding sequence are operably linked to a first promoter and a second promoter, respectively, such that they are transcribed as separate transcripts (see, e.g., FIG.9C and 9D).
  • the first and second promoters (labeled “P1” and “P2” in the figure) can be different from one another or can be the same (i.e., can be a copy of the same promoter).
  • the 2A peptide sequence regulates (and is therefore positioned 5’ of) the Acr sequence. In other cases, the 2A peptide sequence regulates (and is therefore positioned 5’ of) the Cas sequence. In some cases, in which the Cas-encoding sequence and the Acr-encoding sequence are transcribed as separate sequences, each one is regulated by a 2A peptide sequence (and each is therefore positioned 3’ of the 2A peptide sequence).
  • a “spacer” protein coding sequence is used 5’ of the 2A peptide sequence such that the ‘spacer’ sequence is transcribed as part of a polycistronic sequence with the protein sequence being regulated.
  • a Cas protein coding sequence and an Acr protein coding sequence are operably linked to different promoters (P1 and P2).
  • a spacer sequence (labeled as “X” in the figure) is positioned 5’ of a 2A peptide sequence, which is 5 of an Acr coding sequence and therefore the spacer sequence and the Acr sequence are transcribed as part of the same RNA.
  • a Cas protein coding sequence and an Acr protein coding sequence are again operably linked to different promoters (P1 and P2).
  • a spacer sequence (labeled as “X” in the figure) is positioned 5’ of a 2A peptide sequence, which is 5’ of a Cas coding sequence – and therefore the spacer sequence and the Cas sequence are transcribed as part of the same RNA.
  • presence of the 2A peptide sequence in this RNA results in the production of the Cas protein as a separate protein.
  • a ‘spacer’ protein can be any desired sequence – as its purpose is to simply provide a sequence to be translated that is 5’ of (N-terminal to) the 2A peptide sequence.
  • a spacer sequence can be any convenient length, from very short to encoding an entire protein sequence. In some cases, the spacer is 2 or more amino acids long (e.g., 3 or more, 4 or more 5 or more, 10 or more, or 20 or more amino acids).
  • the spacer has a length of from 1 to 100 amino acids (e.g., 1 to 80, 1 to 50, 1 to 40, 1 to 30, 1 to 20, 1 to 10, 2 to 100, 2 to 80, 2 to 50, 2 to 40, 2 to 30, 2 to 20, 2 to 10, 5 to 100, 5 to 80, 5 to 50, 5 to 40, 5 to 30, 5 to 20, or 5 to 10 amino acids).
  • 1 to 100 amino acids e.g., 1 to 80, 1 to 50, 1 to 40, 1 to 30, 1 to 20, 1 to 10, 2 to 100, 2 to 80, 2 to 50, 2 to 40, 2 to 30, 2 to 20, 2 to 10, 5 to 100, 5 to 80, 5 to 50, 5 to 40, 5 to 30, 5 to 20, or 5 to 10 amino acids.
  • spacer sequence examples include, but are not limited to: linker sequences, repeated single amino acids (e.g., AAAA), random sequences, fragments of proteins, and marker proteins (e.g., a fluorescent protein such as GFP, YFP, CFP, RFP, and the like, a drug selectable protein marker, an enzyme such as beta-galactosidase, etc.).
  • marker proteins e.g., a fluorescent protein such as GFP, YFP, CFP, RFP, and the like, a drug selectable protein marker, an enzyme such as beta-galactosidase, etc.
  • 2A peptide sequences include, but are not limited to those set forth as SEQ ID NOs: 133-138.
  • 2A peptide sequences can be used in tandem, and multiple different 2A peptide sequences can be positioned one after another, in any desired combination (see “E2A- F2A” and “T2A-E2A-F2A” above as non-limiting examples).
  • a 2A peptide sequence is selected from the group consisting of: P2A, F2A, E2A, T2A, and any combination thereof.
  • a translational control element encodes 2 or more 2A peptides in tandem (e.g.., 3 or more, 4 or more, or 5 or more).
  • a translational control element encodes 2, 3, 4, or 52A peptides in tandem.
  • a translational control element encodes one 2A peptide.
  • a translational control element is an internal ribosome entry site (IRES) sequence.
  • an “internal ribosome entry site,” or “IRES” it is meant a nucleotide sequence that allows for the initiation of protein translation within the interior of a messenger RNA (mRNA) sequence (i.e., downstream of the first start codon).
  • mRNA messenger RNA
  • IRES internal ribosome entry site
  • the protein located in the first cistron is synthesized by the cap-dependent initiation mechanism, while translation initiation of the second protein is directed by the IRES segment located in the intercistronic spacer region between the two protein coding regions.
  • IRESs have been isolated from viral genomes and cellular genomes. Artificially engineered IRESs are also known in the art.
  • sequences described herein as IRES sequences, which function as part of RNA molecules will have correlative sequences in the encoding DNA molecules, e.g., RNA sequence 5’-uuacuggc-3’ would correspond to DNA sequence 5’-ttactggc-3’, and vice versa”.
  • IRES sequence or simply “IRES” is used herein to refer to either sequence.
  • Any convenient IRES may be employed in the subject compositions, sytems, and methods. Examples of IRES sequences include but are not limited to those listed in FIG.13A-13D (SEQ ID NOs: 139-159).
  • IRES sequences include but are not limited to those listed in FIG.13A-13D (SEQ ID NOs: 139-159).
  • a subject system is to be used for expressing Cas and Acr proteins in non-animal cells (e.g., plants/plant cells), they should select a convenient IRES sequence appropriate for the desired cell type (e.g., an IRES from Triticum mosaic virus (TriMV)).
  • TriMV Triticum mosaic virus
  • an IRES sequence comprises a nucleotide sequence having 70% or more sequence identity (e.g., 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) with the nucleotide sequence set forth in any one of SEQ ID NOs: 139-159.
  • an IRES sequence is selected from the group consisting of the following IRES sequences: EMCV, BIP, CAT-1, c-myc, HCV, VCIP, Apaf-1, mEMCV-1, mEMCV-2, HRV, NRF, FGF-1, KMI1, KMI2, (GAAA)16, (PPT19)4, EMCV mutant 5 (SEQ ID NO: 140), EMCV mutant 10 (SEQ ID NO: 141) EMCV mutant 15 (SEQ ID NO: 142) and EMCV mutant 21 (SEQ ID NO: 143) [also see, e.g., international patent publication WO2022072673].
  • a subject IRES sequence will be positioned in a subject nucleic acid so as to regulate the expression (translation) of a subject protein (e.g., Acr protein or Cas protein) – and as such will be positioned 5’ of (usually immediately 5’ of) the protein coding sequence which it regulates.
  • FIG.10 provides non-limiting illustrative examples of embodiments in which and IRES sequence is positioned in different ways. In some cases, an IRES sequence is positioned 5’ of (and usually immediately 5’ of) a Cas protein coding sequence. In some cases, an IRES sequence is positioned 5’ of (and usually immediately 5’ of) an Acr protein coding sequence.
  • the Cas-encoding sequence and the Acr-encoding sequence are operably linked to the same promoter (encoded by a polycistronic sequence) and are positioned in tandem, with an IRES sequence positioned between them (see, e.g., FIG. 10, first and second examples).
  • the Cas protein encoding sequence is positioned 5’ of the Acr encoding sequence, and the IRES sequence is therefore 3’ of the Cas sequence and 5’ of the Acr sequence.
  • the Acr protein encoding sequence is positioned 5’ of the Cas encoding sequence, and the IRES sequence is therefore 3’ of the Acr sequence and 5’ of the Cas sequence.
  • the Cas-encoding sequence and the Acr-encoding sequence are operably linked to a first promoter and a second promoter, respectively, such that they are transcribed as separate transcripts (see, e.g., FIG.10,C-F).
  • the first and second promoters (labeled “P1” and “P2” in the figure) can be different from one another or can be the same (i.e., can be a copy of the same promoter).
  • the IRES sequence regulates (and is therefore positioned 5’ of) the Acr sequence. In other cases, the IRES sequence regulates (and is therefore positioned 5’ of) the Cas sequence.
  • each one is regulated by an IRES sequence (and each is therefore positioned 3’ of an IRES sequence).
  • a “spacer” protein coding sequence is used 5’ of the IRES sequence such that the ‘spacer’ sequence is transcribed as part of a polycistronic sequence with the protein sequence being regulated (see, e.g., the FIG.10 E and F).
  • a Cas protein coding sequence and an Acr protein coding sequence are operably linked to different promoters (P1 and P2).
  • a spacer sequence (labeled as “X” in the figure) is positioned 5’ of an IRES sequence, which is 5’ of an Acr coding sequence – and therefore the spacer sequence and the Acr sequence are transcribed as part of the same RNA, but the presence of the IRES sequence causes the Acr protein to be produced as a separate protein.
  • a Cas protein coding sequence and an Acr protein coding sequence are again operably linked to different promoters (P1 and P2).
  • a spacer sequence (labeled as “X” in the figure) is positioned 5’ of an IRES sequence, which is 5’ of a Cas coding sequence – and therefore the spacer sequence and the Cas sequence are transcribed as part of the same RNA, but the presence of the IRES sequence causes the Cas protein to be produced as a separate protein.
  • a ‘spacer’ protein can be any desired sequence – as its purpose is to simply provide a sequence to be translated that is 5’ of the IRES sequence.
  • a spacer sequence can be any convenient length, from very short to encoding an entire protein sequence.
  • the spacer is 2 or more amino acids long (e.g., 3 or more, 4 or more 5 or more, 10 or more, or 20 or more amino acids). In some cases, the spacer has a length of from 1 to 100 amino acids (e.g., 1 to 80, 1 to 50, 1 to 40, 1 to 30, 1 to 20, 1 to 10, 2 to 100, 2 to 80, 2 to 50, 2 to 40, 2 to 30, 2 to 20, 2 to 10, 5 to 100, 5 to 80, 5 to 50, 5 to 40, 5 to 30, 5 to 20, or 5 to 10 amino acids).
  • 1 to 100 amino acids e.g., 1 to 80, 1 to 50, 1 to 40, 1 to 30, 1 to 20, 1 to 10, 2 to 100, 2 to 80, 2 to 50, 2 to 40, 2 to 30, 2 to 20, 2 to 10, 5 to 100, 5 to 80, 5 to 50, 5 to 40, 5 to 30, 5 to 20, or 5 to 10 amino acids.
  • spacer sequence examples include, but are not limited to: linker sequences, repeated single amino acids (e.g., AAAA), random sequences, fragments of proteins, and marker proteins (e.g., a fluorescent protein such as GFP, YFP, CFP, RFP, and the like, a drug selectable protein marker, an enzyme such as beta-galactosidase, etc.).
  • linker sequences repeated single amino acids (e.g., AAAA)
  • marker proteins e.g., a fluorescent protein such as GFP, YFP, CFP, RFP, and the like, a drug selectable protein marker, an enzyme such as beta-galactosidase, etc.
  • a translational control element is a non-AUG start codon (also referred to as a non-AUG initiation codon).
  • non-AUG start codon or non-AUG initiation codon is meant to include any non-AUG polynucleotide (typically a triplet) that functions as a start site for translation initiation with reduced efficiency relative to that of an AUG start codon. Examples of naturally occurring alternate start codon usage are described for example in Kozak (1991) J. Cell Biol.115(4): 887-903; Mehdi et al. (1990) Gene 91:173-178; Kozak (1989) Mol. Cell.
  • non-AUG start codons have decreased translation efficiencies compared to that of an AUG.
  • a non-AUG start codon is used as the initiation codon for the sequence encoding the Acr protein.
  • a non-AUG start codon is used as the initiation codon for the sequence encoding the Cas protein (e.g., CRISPR nuclease).
  • a non-AUG start codon (used with the Acr sequence or with the Cas sequence) is any one of: CUG, GUG, ACG, AUA, UUG, GCG, AGG, AAG, AUC, or AUU.
  • a non-AUG start codon (used with the Acr sequence or with the Cas sequence) is any one of: CUG, GUG, ACG, AUA, or UUG.
  • a non-AUG start codon used with the Acr sequence is any one of: CUG, GUG, ACG, AUA, UUG, GCG, AGG, AAG, AUC, or AUU.
  • a non-AUG start codon used with the Acr sequence is any one of: CUG, GUG, ACG, AUA, or UUG.
  • a non-AUG start codon used with the Cas sequence is any one of: CUG, GUG, ACG, AUA, UUG, GCG, AGG, AAG, AUC, or AUU.
  • a non-AUG start codon used with the Cas sequence is any one of: CUG, GUG, ACG, AUA, or UUG.
  • a non-AUG start codon used with the Acr sequence is CUG.
  • a non-AUG start codon used with the Acr sequence is GUG.
  • a non-AUG start codon used with the Acr sequence is ACG.
  • a non-AUG start codon used with the Cas sequence is CUG.
  • a non-AUG start codon used with the Cas sequence is GUG. In some cases, a non-AUG start codon used with the Cas sequence is ACG.
  • the translation efficiency of a non-AUG start codon can also be affected by its sequence context; for example, in eukaryotic cells an optimal Kozak consensus sequence has been reported to have a positive effect on translation initiation at non- AUG start codons (Mehdi et al. (1990) Gene 91:173-178; Kozak (1989) Mol. Cell. Biol. 9(11): 5073-5080).
  • the complete Kozak DNA consensus sequence is GCCRCCATGG (SEQ ID NO:160), where the start codon ATG (AUG in RNA) is just prior to the final “G”, the A of the ATG start codon is designated as the +1 position, and "R" at position -3 is a purine (A or G).
  • the two most highly conserved positions are a purine, usually an A, at - 3 and a G at +4 (Kozak (1991) J Cell Biol 115(4): 887-903).
  • a subject non-AUG start codon e.g., any of those discussed above
  • an impaired Kozak sequence i.e., a Kozak sequence that does not conform to the consensus.
  • RNA sequence ATG would correspond to RNA sequence AUG, and vice versa.
  • a subject non-AUG initiation codon will be positioned in a subject nucleic acid so as to regulate the expression (translation initiation) of a subject protein (e.g., Acr protein or Cas protein) – and as such will be positioned 5’ of (usually immediately 5’ of) and in frame with the protein coding sequence which it regulates.
  • a subject protein e.g., Acr protein or Cas protein
  • the sequence encoding the Acr protein does not include its native AUG start codon. In some such cases, the sequence encoding the Acr protein does not include an AUG codon. In some cases, (e.g., when a non-AUG start codon is used as the initiation codon for the Cas encoding sequence), the sequence encoding the CRISPR nuclease does not include its native AUG start codon. In some cases, the sequence encoding the CRISPR nuclease does not include an AUG codon.
  • the sequence encoding the subject protein i.e., the CRISPR nuclease or the Acr protein
  • the sequence encoding the subject protein is codon optimized as a whole or in part, to avoid having an out-of-frame AUG that could direct translational machinery to an incorrect reading frame contained in the nucleic acid encoding the Cas or Acr protein (i.e., a reading frame that does not encode the subject protein).
  • the first 10, 15, 20, 25, 30, 35, 40, 45, 50 or more than 50 codons from the start of the subject protein are optimized so as not to include an out-of-frame AUG.
  • the Cas-encoding sequence and the Acr-encoding sequence are operably linked to a first promoter and a second promoter, respectively, such that they are transcribed as separate transcripts (see, e.g., FIG.11).
  • the first and second promoters (labeled “P1” and “P2” in the figure) can be different from one another or can be the same (i.e., can be a copy of the same promoter).
  • more than one translational control element can be used to control expression of a protein (e.g., Acr protein, CRISPR nuclease).
  • more than one (e.g., two, two or more, three) translational control elements are used to control expression of the Acr protein.
  • more than one (e.g., two, two or more, three) translational control elements are used to control expression of the CRISPR nuclease.
  • one or more (e.g., two, two or more, three) translational control elements e.g., 2A peptide. IRES, non-AUG start codon) are used to control expression of the Acr protein and/or the CRISPR nuclease.
  • one or more (e.g., two, two or more, three) translational control elements e.g., 2A peptide, IRES, non-AUG start codon
  • one or more (e.g., two, two or more, three) translational control elements e.g., 2A peptide. IRES, non-AUG start codon) are used to control expression of the CRISPR nuclease.
  • compositions, methods and systems of ACRs and nucleases provided herein include expression cassettes, such as on one or more vectors, which expression cassettes include promoters to drive expression of the genes encoding a Cas protein (e.g., a Class 2 effector protein/CRISPR nuclease) and/or an Acr protein, and/or a guide RNA.
  • a Cas protein e.g., a Class 2 effector protein/CRISPR nuclease
  • an Acr protein e.g., a Class 2 effector protein/CRISPR nuclease
  • an Acr protein e.g., a Class 2 effector protein/CRISPR nuclease
  • an Acr protein e.g., a Class 2 effector protein/CRISPR nuclease
  • an Acr protein e.g., a Class 2 effector protein/CRISPR nuclease
  • an Acr protein e.g., a Class 2 effector protein
  • an Acr coding sequence is operably linked to a first promoter and a CRISPR nuclease sequence is operably linked to a second promoter.
  • the first and second promoters are the same – such that the two protein coding sequences are transcribed as separate RNAs, but are controlled by the same promoter sequence (i.e., there are two copies of the same promoter – one controlling expression of one protein and another controlling expression of the other).
  • the first and second promoters are different promoters.
  • a promoter can be a constitutively active promoter (i.e., a promoter that is constitutively in an active/”ON” state), it may be an inducible promoter (i.e., a promoter whose state, active/”ON” or inactive/“OFF”, is controlled by an external stimulus, e.g., the presence of a particular temperature, compound, or protein.), it may be a spatially restricted promoter (i.e., transcriptional control element, enhancer, etc.)(e.g., tissue specific promoter, cell type specific promoter, etc.), and/or it may be a temporally restricted promoter, e.g., the promoter is in the “ON” state or “OFF” state during specific stages of embryonic development or during specific stages of a biological
  • Suitable promoters can be derived from viruses and can therefore be referred to as viral promoters, or they can be derived from any convenient organism. Suitable promoters can be derived from viruses and can therefore be referred to as viral promoters, or they can be derived from any organism, including prokaryotic or eukaryotic organisms.
  • Exemplary promoters include, but are not limited to the SV40 early promoter, mouse mammary tumor virus long terminal repeat (LTR) promoter; adenovirus major late promoter (Ad MLP); a herpes simplex virus (HSV) promoter, a cytomegalovirus (CMV) promoter such as the CMV immediate early promoter region (CMVIE), a rous sarcoma virus (RSV) promoter, a human U6 small nuclear promoter (U6) (Miyagishi et al.
  • LTR mouse mammary tumor virus long terminal repeat
  • Ad MLP adenovirus major late promoter
  • HSV herpes simplex virus
  • CMV cytomegalovirus
  • CMVIE CMV immediate early promoter region
  • RSV rous sarcoma virus
  • U6 small nuclear promoter U6 small nuclear promoter
  • an enhanced U6 promoter e.g., Xia et al., Nucleic Acids Res.2003 Sep 1;31(17)
  • a human H1 promoter H1
  • Pol III promoters such as U6, enhanced U6, and H1 are generally used to express non-coding RNAs such as guide RNAs.
  • inducible promoters include, but are not limited to, heat shock promoter, Tetracycline-regulated promoter, Steroid-regulated promoter, Metal-regulated promoter, estrogen receptor-regulated promoter, etc.
  • Inducible promoters can therefore be regulated by molecules including, but not limited to, doxycycline; an estrogen receptor; an estrogen receptor fusion; an estrogen analog; IPTG; etc.
  • Inducible promoters suitable for use include any inducible promoter described herein or known to one of ordinary skill in the art.
  • inducible promoters include, without limitation, chemically/biochemically-regulated and physically-regulated promoters such as alcohol-regulated promoters, tetracycline-regulated promoters (e.g., anhydrotetracycline (aTc)-responsive promoters and other tetracycline-responsive promoter systems, which include a tetracycline repressor protein (tetR), a tetracycline operator sequence (tetO) and a tetracycline transactivator fusion protein (tTA)), steroid- regulated promoters (e.g., promoters based on the rat glucocorticoid receptor, human estrogen receptor, moth ecdysone receptors, and promoters from the steroid/retinoid/thyroid receptor superfamily), metal-regulated promoters (e.g., promoters derived from metallothionein (proteins that bind and sequester metal ions) genes from yeast,
  • Suitable promoters include but are not limited to the following: Mammalian (Pol II) promoters (for nuclease and Acrs) • retroviral Rous sarcoma virus (RSV) • LTR promoter (optionally with the RSV enhancer), • cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) • SV40 promoter • dihydrofolate reductase promoter, • beta.-actin promoter, • phosphoglycerol kinase (PGK) promoter, • EF1.alpha. (EF1a) promoter.
  • RSV Rous sarcoma virus
  • CMV cytomegalovirus
  • SV40 promoter • dihydrofolate reductase promoter
  • beta.-actin promoter • phosphoglycerol kinase (PGK) promoter
  • PGK phosphoglycerol kinase
  • MMLV LTR promoters • HIV LTR promoters, MCMV LTR promoters, • MND, • Ubc, • CAG, • HSV TK promoter, • fos promoter, • E2F promoter • polyoma virus • adenovirus, fowlpox virus • bovine papilloma virus • avian sarcoma virus Eukaryotic tissue-specific • Bowman et al., 1995 Proc. Natl. Acad. Sci. USA 92,12115-12119 describe a brain-specific transferrin promoter; • synapsin I promoter is neuron specific (Schoch et al., 1996 J. Biol.
  • necdin promoter is post-mitotic neuron specific (Uetsuki et al., 1996 J. Biol. Chem.271, 918-924); • neurofilament light promoter is neuron specific (Charron et al., 1995 J. Biol. Chem.270, 30604-30610); • acetylcholine receptor promoter is neuron specific (Wood et al., 1995 J. Biol. Chem.270, 30933-30940); • potassium channel promoter is high-frequency firing neuron specific (Gan et al., 1996 J. Biol.
  • GLUT4 promoter is skeletal muscle specific (Olson and Pessin, 1995 J. Biol. Chem.270, 23491-23495); • Slow/fast troponins promoter is slow/fast twitch myofibre specific (Corin et al., 1995 Proc. Natl. Acad. Sci. USA 92, 6185-6189); • Actin promoter is smooth muscle specific (Shimizu et al., 1995 J. Biol. Chem. 270, 7631-7643); • Myosin heavy chain promoter is smooth muscle specific (Kallmeier et al., 1995 J. Biol.
  • E-cadherin promoter is epithelium specific (Hennig et al., 1996 J. Biol. Chem. 271, 595-602); • cytokeratins promoter is keratinocyte specific (Alexander et al., 1995 B. Hum. Mol. Genet.4, 993-999); • transglutaminase 3 promoter is keratinocyte specific (J. Lee et al., 1996 J. Biol. Chem.271, 4561-4568); • bullous pemphigoid antigen promoter is basal keratinocyte specific (Tamai et al., 1995 J. Biol.
  • keratin 6 promoter is proliferating epidermis specific (Ramirez et al., 1995 Proc. Natl. Acad. Sci. USA 92, 4783-4787); • collagen 1 promoter is hepatic stellate cell and skin/tendon fibroblast specific (Houglum et al., 1995 J. Clin. Invest.96, 2269-2276); • type X collagen promoter is hypertrophic chondrocyte specific (Long & Linsenmayer, 1995 Hum. Gene Ther.6, 419-428); • Factor VII promoter is liver specific (Greenberg et al., 1995 Proc. Natl. Acad. Sci.
  • fatty acid synthase promoter is liver and adipose tissue specific (Soncini et al., 1995 J. Biol. Chem.270, 30339-3034); • carbamoyl phosphate synthetase I promoter is portal vein hepatocyte and small intestine specific (Christoffels et al., 1995 J. Biol. Chem.270, 24932-24940); • the Na--K--Cl transporter promoter is kidney (loop of Henle) specific (Igarashi et al., 1996 J. Biol.
  • scavenger receptor A promoter is macrophages and foam cell specific (Horvai et al., 1995 Proc. Natl. Acad. Sci. USA 92, 5391-5395); • glycoprotein IIb promoter is megakaryocyte and platelet specific (Block & Poncz, 1995 Stem Cells 13, 135-145); • yc chain promoter is hematopoietic cell specific (Markiewicz et al., 1996 J. Biol. Chem.271, 14849-14855); • CD11b promoter is mature myeloid cell specific (Dziennis et al., 1995 Blood 85, 319-329).
  • Host cells can be ex vivo (e.g., fresh isolate - early passage), in vivo, or in culture in vitro (e.g., immortalized cell line).
  • the targeted nucleic acid is chromosomal (e.g., the host cell’s genome) and in some cases, the targeted nucleic acid is from a pathogen, e.g., the genome of a pathogen within the host cell.
  • Cells may be from established cell lines or they may be primary cells, where “primary cells”, “primary cell lines”, and “primary cultures” are used interchangeably herein to refer to cells and cells cultures that have been derived from a subject and allowed to grow in vitro for a limited number of passages, i.e. splittings, of the culture.
  • primary cultures are cultures that may have been passaged 0 times, 1 time, 2 times, 4 times, 5 times, 10 times, or 15 times, but not enough times go through the crisis stage.
  • the primary cell lines are maintained for fewer than 10 passages in culture.
  • Suitable host cells include, but are not limited to: a cell of a single-cell eukaryotic organism; a plant cell; an algal cell, e.g., Botryococcus braunii, Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens, C. agardh, and the like; a fungal cell (e.g., a yeast cell); an animal cell; a cell from an invertebrate animal (e.g.
  • a cell of an insect e.g., a mosquito; a bee; an agricultural pest; etc.
  • a cell of an arachnid e.g., a spider; a tick; etc.
  • a cell of a vertebrate animal e.g., a fish, an amphibian, a reptile, a bird, a mammal
  • a cell of a mammal e.g., a cell of a rodent; a cell of a human; a cell of a non- human mammal; a cell of a rodent (e.g., a mouse, a rat); a cell of a lagomorph (e.g., a rabbit); a cell of an ungulate (e.g., a cow, a horse, a camel, a llama, a vicu ⁇ a,
  • a stem cell e.g. an embryonic stem (ES) cell, an induced pluripotent stem (iPS) cell, a germ cell (e.g., an oocyte, a sperm, an oogonia, a spermatogonia, etc.), an adult stem cell, a somatic cell, e.g. a fibroblast, a hematopoietic cell, a neuron, a muscle cell, a bone cell, a hepatocyte, a pancreatic cell; an in vitro or in vivo embryonic cell of an embryo at any stage, e.g., a 1-cell, 2-cell, 4-cell, 8-cell, etc.
  • ES embryonic stem
  • iPS induced pluripotent stem
  • a germ cell e.g., an oocyte, a sperm, an oogonia, a spermatogonia, etc.
  • a somatic cell e.g. a fibroblast,
  • Suitable host cells include, but are not limited to: a bacterial cell; an archaeal cell; a cell of a single- cell eukaryotic organism; a plant cell; an algal cell, e.g., Botryococcus braunii, Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens, C. agardh, and the like; a fungal cell (e.g., a yeast cell); an animal cell; a cell from an invertebrate animal (e.g.
  • a cell of an insect e.g., a mosquito; a bee; an agricultural pest; etc.
  • a cell of an arachnid e.g., a spider; a tick; etc.
  • a cell of a vertebrate animal e.g., a fish, an amphibian, a reptile, a bird, a mammal
  • a cell of a mammal e.g., a cell of a rodent; a cell of a human; a cell of a non-human mammal; a cell of a rodent (e.g., a mouse, a rat); a cell of a lagomorph (e.g., a rabbit); a cell of an ungulate (e.g., a cow, a horse, a camel, a llama, a vicu ⁇ a,
  • a stem cell e.g. an embryonic stem (ES) cell, an induced pluripotent stem (iPS) cell, a germ cell (e.g., an oocyte, a sperm, an oogonia, a spermatogonia, etc.), an adult stem cell, a somatic cell, e.g. a fibroblast, a hematopoietic cell, a neuron, a muscle cell, a bone cell, a hepatocyte, a pancreatic cell; an in vitro or in vivo embryonic cell of an embryo at any stage, e.g., a 1-cell, 2-cell, 4- cell, 8-cell, etc.
  • ES embryonic stem
  • iPS induced pluripotent stem
  • a germ cell e.g., an oocyte, a sperm, an oogonia, a spermatogonia, etc.
  • a somatic cell e.g. a fibroblast,
  • Suitable cells include a stem cell (e.g. an embryonic stem (ES) cell, an induced pluripotent stem (iPS) cell; a germ cell (e.g., an oocyte, a sperm, an oogonia, a spermatogonia, etc.); a somatic cell, e.g. a fibroblast, an oligodendrocyte, a glial cell, a hematopoietic cell, a neuron, a muscle cell, a bone cell, a hepatocyte, a pancreatic cell, etc.
  • ES embryonic stem
  • iPS induced pluripotent stem
  • germ cell e.g., an oocyte, a sperm, an oogonia, a spermatogonia, etc.
  • a somatic cell e.g. a fibroblast, an oligodendrocyte, a glial cell, a hematopoietic cell,
  • Suitable cells include human embryonic stem cells, fetal cardiomyocytes, myofibroblasts, mesenchymal stem cells, autotransplated expanded cardiomyocytes, adipocytes, totipotent cells, pluripotent cells, blood stem cells, myoblasts, adult stem cells, bone marrow cells, mesenchymal cells, embryonic stem cells, parenchymal cells, epithelial cells, endothelial cells, mesothelial cells, fibroblasts, osteoblasts, chondrocytes, exogenous cells, endogenous cells, stem cells, hematopoietic stem cells, bone-marrow derived progenitor cells, myocardial cells, skeletal cells, fetal cells, undifferentiated cells, multi-potent progenitor cells, unipotent progenitor cells, monocytes, cardiac myoblasts, skeletal myoblasts, macrophages, capillary endothelial cells, xenogenic cells, allogenic cells, and
  • the cell is an immune cell, a neuron, an epithelial cell, and endothelial cell, or a stem cell.
  • the immune cell is a T cell, a B cell, a monocyte, a natural killer cell, a dendritic cell, or a macrophage.
  • the immune cell is a cytotoxic T cell.
  • the immune cell is a helper T cell.
  • the immune cell is a regulatory T cell (Treg).
  • the cell is a stem cell. Stem cells include adult stem cells. Adult stem cells are also referred to as somatic stem cells.
  • Adult stem cells are resident in differentiated tissue, but retain the properties of self-renewal and ability to give rise to multiple cell types, usually cell types typical of the tissue in which the stem cells are found.
  • somatic stem cells include muscle stem cells; hematopoietic stem cells; epithelial stem cells; neural stem cells; mesenchymal stem cells; mammary stem cells; intestinal stem cells; mesodermal stem cells; endothelial stem cells; olfactory stem cells; neural crest stem cells; and the like.
  • Stem cells of interest include mammalian stem cells, where the term “mammalian” refers to any animal classified as a mammal, including humans; non- human primates; domestic and farm animals; and zoo, laboratory, sports, or pet animals, such as dogs, horses, cats, cows, mice, rats, rabbits, etc.
  • the stem cell is a human stem cell.
  • the stem cell is a rodent (e.g., a mouse; a rat) stem cell.
  • the stem cell is a non-human primate stem cell.
  • the stem cell is a hematopoietic stem cell (HSC).
  • HSCs are mesoderm-derived cells that can be isolated from bone marrow, blood, cord blood, fetal liver and yolk sac. HSCs are characterized as CD34 + and CD3-. HSCs can repopulate the erythroid, neutrophil-macrophage, megakaryocyte and lymphoid hematopoietic cell lineages in vivo. In vitro, HSCs can be induced to undergo at least some self-renewing cell divisions and can be induced to differentiate to the same lineages as is seen in vivo. As such, HSCs can be induced to differentiate into one or more of erythroid cells, megakaryocytes, neutrophils, macrophages, and lymphoid cells.
  • the stem cell is a neural stem cell (NSC).
  • NSCs neural stem cells
  • a neural stem cell is a multipotent stem cell which is capable of multiple divisions, and under specific conditions can produce daughter cells which are neural stem cells, or neural progenitor cells that can be neuroblasts or glioblasts, e.g., cells committed to become one or more types of neurons and glial cells respectively. Methods of obtaining NSCs are known in the art.
  • the stem cell is a mesenchymal stem cell (MSC).
  • Promoters and translation control elements such as those described above can be used to achieve delivery of a subject Acr protein and/or a CRISPR nuclease at different levels relative to one another. For example, in some embodiments, the goal is to deliver the CRISPR nuclease and the Acr polypeptide at a 1:1 ratio.
  • the goal is to deliver the CRISPR nuclease and the Acr polypeptide at a ratio (CRISPR:Acr) in a range of from 1:1.25 to 1:10 (e.g., 1:2 to 1:10, 1:4 to 1:10, 1:5 to 1:10, 1:1.25 to 1:5, or 1:2 to 1:5).
  • a subject system can be configured (e.g., for use in a method) to deliver the CRISPR nuclease and the Acr polypeptide at a ratio (CRISPR:Acr) of, e.g., 1:1.25, 1:1.5, 1:2, 1:3, 1:4, 1:5, 1:6, 1:7, 1:8, 1:9, or 1:10.
  • the goal is to deliver the Acr polypeptide and the CRISPR nuclease at a ratio (Acr:CRISPR) in a range of from 1:1.25 to 1:10 (e.g., 1:2 to 1:10, 1:4 to 1:10, 1:5 to 1:10, 1:1.25 to 1:5, or 1:2 to 1:5).
  • a subject system can be configured (e.g., for use in a method) to deliver the Acr polypeptide and the CRISPR nuclease at a ratio (Acr:CRISPR) of, e.g., 1:1.25, 1:1.5, 1:2, 1:3, 1:4, 1:5, 1:6, 1:7, 1:8, 1:9, or 1:10.
  • a wild type CRISPR nuclease normally has nuclease activity that cleaves a target nucleic acid (e.g., a double stranded DNA (dsDNA)) at a target site defined by (i) the region of complementarity between the guide sequence of the guide RNA and the target nucleic acid; and (ii) a short motif referred to as the “protospacer adjacent motif” (PAM) in the target nucleic acid.
  • dsDNA double stranded DNA
  • CRISPR based programmable gene editing tools e.g., CRISPR/Cas RNA-guided proteins, CRISPR/Cas guide RNAs, and PAMs
  • CRISPR/Cas RNA-guided proteins CRISPR/Cas guide RNAs, and PAMs
  • Zetsche et al Cell.2015 Oct 22;163(3):759-71
  • Makarova et al Nat Rev Microbiol.2015 Nov;13(11):722-36
  • Shmakov et al. Mol Cell.2015 Nov 5;60(3):385-97
  • Shmakov et al. Nat Rev Microbiol.2017 Mar;15(3):169-182
  • Koonin et al. Curr Opin Microbiol.2017 Jun;37:67-78
  • Makarova et al. Nat Rev Microbiol.2020 Feb;18(2):67-83; all of which are hereby incorporated by reference in their entirety.
  • a "vector” or “expression vector” is a replicon, such as plasmid, phage, virus, or cosmid, to which another DNA segment, i.e. an “insert”, may be attached so as to bring about the replication and/or expression of the attached segment in a cell.
  • An “expression cassette” comprises a DNA sequence (coding or non-coding) operably linked to a promoter.
  • a subject vector is a viral vector (e.g., AAV, lentivirus, adenovirus).
  • a subject vector includes an origin of replication (e.g., can be a plasmid).
  • both an Acr protein and its target Cas protein (the protein that the Acr inhibits) is present in a single vector – which ensures that all cells receiving the Cas protein (e.g., an endonuclease such as Cas9, Cas12a, and the like) will also express the Acr “off-switch”.
  • the translation of one or both proteins can be regulated by a translational control element in order to achieve a proper balance (expression level ratio) between the two proteins.
  • Vectors may be provided directly to a target host cell (target cell).
  • the cells are contacted with vectors comprising the subject nucleic acids (e.g., recombinant expression vectors) such that the vectors are taken up by the cells.
  • vectors comprising the subject nucleic acids (e.g., recombinant expression vectors)
  • Methods for contacting cells with nucleic acid vectors that are plasmids include electroporation, calcium chloride transfection, microinjection, and lipofection are well known in the art.
  • cells can be contacted with viral particles comprising the subject viral expression vectors (e.g., adeno-associated virus (AAV)).
  • a subject vector is a viral construct, e.g., a recombinant adeno-associated virus construct (see, e.g., U.S.
  • Suitable expression vectors include, but are not limited to, viral vectors (e.g.
  • viral vectors based on vaccinia virus; poliovirus; adenovirus see, e.g., Li et al., Invest Opthalmol Vis Sci 35:25432549, 1994; Borras et al., Gene Ther 6:515524, 1999; Li and Davidson, PNAS 92:77007704, 1995; Sakamoto et al., H Gene Ther 5:10881097, 1999; WO 94/12649, WO 93/03769; WO 93/19191; WO 94/28938; WO 95/11984 and WO 95/00655); adeno-associated virus (see, e.g., Ali et al., Hum Gene Ther 9:8186, 1998, Flannery et al., PNAS 94:69166921, 1997; Bennett et al., Invest Opthalmol Vis Sci 38:28572863, 1997; Jomary et al., Gene Ther 4:683690
  • SV40 herpes simplex virus
  • human immunodeficiency virus see, e.g., Miyoshi et al., PNAS 94:10319 23, 1997; Takahashi et al., J Virol 73:78127816, 1999
  • a retroviral vector e.g., Murine Leukemia Virus, spleen necrosis virus, and vectors derived from retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, a lentivirus, human immunodeficiency virus, myeloproliferative sarcoma virus, and mammary tumor virus
  • retroviral vector e.g., Murine Leukemia Virus, spleen necrosis virus, and vectors derived from retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, a lentivirus, human immunodeficiency virus, myeloprol
  • a subject vector is an AAV vector.
  • AAV adeno-associated virus
  • AAV it is meant the virus itself or derivatives thereof.
  • the term covers all subtypes and both naturally occurring and recombinant forms, except where required otherwise, for example, AAV type 1 (AAV-1), AAV type 2 (AAV-2), AAV type 3 (AAV-3), AAV type 4 (AAV-4), AAV type 5 (AAV-5), AAV type 6 (AAV-6), AAV type 7 (AAV-7), AAV type 8 (AAV-8), AAV type 9 (AAV-9), AAV type 10 (AAV-10), AAV type 11 (AAV- 11), avian AAV, bovine AAV, canine AAV, equine AAV, primate AAV, non-primate AAV, ovine AAV, a hybrid AAV (i.e., an AAV comprising a capsid protein of one AAV subtype and genomic material of another subtype), an AAV comprising a mutant
  • a capsid protein with regions or domains or individual amino acids that are derived from two or more different serotypes of AAV, e.g. AAV-DJ, AAV-LK3, AAV-LK19).
  • Primary AAV refers to AAV that infect primates
  • non-primate AAV refers to AAV that infect non-primate mammals
  • bovine AAV refers to AAV that infect bovine mammals, etc.
  • a subject vector is an integrative vector, e.g., integrates into the genome of a target cell.
  • a “recombinant AAV vector”, or “rAAV vector” it is meant an AAV virus or AAV viral chromosomal material comprising a polynucleotide sequence not of AAV origin (i.e., a polynucleotide heterologous to AAV), typically a nucleic acid sequence of interest to be integrated into the cell following the subject methods.
  • the heterologous polynucleotide is flanked by at least one, and generally by two AAV inverted terminal repeat sequences (ITRs).
  • the recombinant viral vector also comprises viral genes important for the packaging of the recombinant viral vector material.
  • a “packaging” it is meant a series of intracellular events that result in the assembly and encapsidation of a viral particle, e.g. an AAV viral particle.
  • Examples of nucleic acid sequences important for AAV packaging include the AAV "rep” and “cap” genes, which encode for replication and encapsidation proteins of adeno-associated virus, respectively.
  • the term rAAV vector encompasses both rAAV vector particles and rAAV vector plasmids.
  • a “viral particle” refers to a single unit of virus comprising a capsid encapsidating a virus-based polynucleotide, e.g.
  • An "AAV viral particle” refers to a viral particle composed of at least one AAV capsid protein (typically by all of the capsid proteins of a wild-type AAV) and an encapsidated polynucleotide AAV vector. If the particle comprises a heterologous polynucleotide (i.e. a polynucleotide other than a wild-type AAV genome, such as a transgene to be delivered to a mammalian cell), it is typically referred to as an "rAAV vector particle” or simply an "rAAV vector”.
  • rAAV vector particle i.e. a polynucleotide other than a wild-type AAV genome, such as a transgene to be delivered to a mammalian cell
  • production of rAAV particle necessarily includes production of rAAV vector, as such a vector is contained within an rAAV particle.
  • a rAAV virion can be constructed using methods that are well known in the art. See, e.g., Koerber et al. (2009) Mol. Ther.17:2088; Koerber et al. (2008) Mol Ther.16:1703–1709; U.S. Patent Nos.7,439,065, 6,951,758, and 6,491,907.
  • the heterologous sequence(s) can be directly inserted into an AAV genome which has had the major AAV open reading frames ("ORFs") excised therefrom.
  • AAV genome can also be deleted, so long as a sufficient portion of the ITRs remain to allow for replication and packaging functions.
  • constructs can be designed using techniques well known in the art. See, e.g., U.S. Pat. Nos.5,173,414 and 5,139,941; International Publication Nos. WO 92/01070 (published Jan.23, 1992) and WO 93/03769 (published March 4, 1993); Lebkowski et al. (1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533-539; Muzyczka, N.
  • an AAV expression vector can be introduced into a suitable host cell using known techniques, such as by transfection.
  • transfection techniques are generally known in the art. See, e.g., Graham et al. (1973) Virology, 52:456, Sambrook et al.
  • transfection methods include calcium phosphate co-precipitation (Graham et al. (1973) Virol. 52:456-467), direct micro-injection into cultured cells (Capecchi, M. R. (1980) Cell 22:479-488), electroporation (Shigekawa et al. (1988) BioTechnigues 6:742-751), liposome mediated gene transfer (Mannino et al.
  • Suitable cells for producing rAAV virions include microorganisms, yeast cells, insect cells, and mammalian cells, that can be, or have been, used as recipients of a heterologous DNA molecule. Cells from the stable human cell line, 293 (readily available through, e.g., the American Type Culture Collection under Accession Number ATCC CRL1573) can be used.
  • the human cell line 293 is a human embryonic kidney cell line that has been transformed with adenovirus type-5 DNA fragments (Graham et al. (1977) J. Gen. Virol.36:59), and expresses the adenoviral E1a and E1b genes (Aiello et al. (1979) Virology 94:460).
  • the 293 cell line is readily transfected, and provides a convenient platform in which to produce rAAV virions.
  • Methods of producing an AAV virion in insect cells are known in the art, and can be used to produce a subject rAAV virion. See, e.g., U.S. Patent Publication No. 2009/0203071; U.S.
  • AAV virus that is produced may be replication competent or replication- incompetent.
  • a "replication-competent" virus e.g. a replication-competent AAV refers to a phenotypically wild-type virus that is infectious, and is also capable of being replicated in an infected cell (e.g., in the presence of a helper virus or helper virus functions).
  • replication competence generally requires the presence of functional AAV packaging genes.
  • rAAV vectors as described herein are replication-incompetent in mammalian cells (especially in human cells) by virtue of the lack of one or more AAV packaging genes.
  • Retroviruses for example, lentiviruses, are suitable for use in methods of the present disclosure.
  • Commonly used retroviral vectors are “defective”, i.e. unable to produce viral proteins required for productive infection. Rather, replication of the vector requires growth in a packaging cell line.
  • the retroviral nucleic acids comprising the nucleic acid are packaged into viral capsids by a packaging cell line.
  • Different packaging cell lines provide a different envelope protein (ecotropic, amphotropic or xenotropic) to be incorporated into the capsid, this envelope protein determining the specificity of the viral particle for the cells (ecotropic for murine and rat; amphotropic for most mammalian cell types including human, dog and mouse; and xenotropic for most mammalian cell types except murine cells).
  • the appropriate packaging cell line may be used to ensure that the cells are targeted by the packaged viral particles.
  • Methods of introducing subject vector expression vectors into packaging cell lines and of collecting the viral particles that are generated by the packaging lines are well known in the art. Nucleic acids can also introduced by direct micro-injection (e.g., injection of RNA).
  • proteins may instead be provided to cells as RNA (e.g., an RNA comprising the translational control element as discussed elsewhere herein).
  • Methods of introducing RNA into cells are known in the art and may include, for example, direct injection, transfection, or any other method used for the introduction of DNA.
  • one or more proteins e.g., a CRISPR nuclease and/or an Acr polypeptide
  • one protein coding sequence can be introduced as nucleic acid (RNA or DNA) where the protein coding sequence is operably linked to a translational control element; and the other protein (e.g., a Cas effector) is introduced as a polypeptide.
  • a polypeptide may optionally be fused to a polypeptide domain that increases solubility of the product.
  • the domain may be linked to the polypeptide through a defined protease cleavage site, e.g. a TEV sequence, which is cleaved by TEV protease.
  • the linker may also include one or more flexible sequences, e.g. from 1 to 10 glycine residues.
  • the cleavage of the fusion protein is performed in a buffer that maintains solubility of the product, e.g. in the presence of from 0.5 to 2 M urea, in the presence of polypeptides and/or polynucleotides that increase solubility, and the like.
  • Domains of interest include endosomolytic domains, e.g. influenza HA domain; and other polypeptides that aid in production, e.g. IF2 domain, GST domain, GRPE domain, and the like.
  • the polypeptide may be formulated for improved stability.
  • the peptides may be PEGylated, where the polyethyleneoxy group provides for enhanced lifetime in the blood stream.
  • M ETHODS [00205] The present disclosure provides a method for nucleic acid targeting (e.g., modifying a target nucleic acid, e.g., for cleaving DNA such as in genome editing applications), where the Acr polypeptide and the CRISPR nuclease are delivered to the target nucleic acid as a subject system (e.g., in nucleic acid or protein form). In some cases, the contact is in a cell-free environment in vitro.
  • a subject method includes introducing an Acr protein (or a nucleic acid encoding it) and a CRISPR nuclease (or a nucleic acid encoding it) into a host cell, whereby the ratio of on-target to off-target nucleic acid activity (e.g., cleavage) that results from said introducing is increased relative to the ratio of on-target to off-target nucleic acid targeting that would result in the absence of the Acr protein.
  • the ratio of on-target to off-target nucleic acid targeting that results is caused by an increase in on-target activity. In some cases, the ratio of on-target to off-target nucleic acid targeting that results is caused by a decrease in off-target activity. In some cases, the ratio of on-target to off-target nucleic acid targeting that results is caused by both an increase in on-target activity and a decrease in off-target activity.
  • the targeted cell can be any desired cell / cell type. Examples of suitable cells and promoters are described in detail elsewhere herein (see, e.g., the “promoter” section.
  • the cell is a prokaryotic cell, a plant cell, an insect cell, a vertebrate cell, an invertebrate cell, an animal cell, a mammalian cell, or a human cell.
  • the cell is a eukaryotic cell, a plant cell, an insect cell, a vertebrate cell, an invertebrate cell, an animal cell, a mammalian cell, or a human cell.
  • the cell is ex vivo.
  • the cell is in vivo.
  • the cell is in culture in vitro.
  • the nucleic acid targeted by the CRISPR complex is the genome of a pathogen (e.g., a virus) – In some cases, the pathogen is in the host cell. In some embodiments the nucleic acid targeted by the CRISPR complex (on-target events) is the genome of a pathogen (virus, bacteria, and the like) – In some cases, the pathogen is in the host cell. In some embodiments the nucleic acid targeted by the CRISPR complex (on-target events) is and RNA molecule. In some cases, the on-target nucleic acid targeting alters expression of a protein within the host cell (e.g., via decreasing transcription of the mRNA).
  • a pathogen e.g., a virus
  • the pathogen is in the host cell.
  • the nucleic acid targeted by the CRISPR complex (on-target events) is the genome of a pathogen (virus, bacteria, and the like) – In some cases, the pathogen is in the host cell. In some embodiments the nucleic acid
  • the on-target nucleic acid targeting alters expression of an RNA (e.g., a noncoding RNA, an mRNA, a microRNA, and the like) within the host cell.
  • an RNA e.g., a noncoding RNA, an mRNA, a microRNA, and the like
  • the on-target nucleic acid targeting activity of the CRISPR complex causes gene editing (e.g., correction of a genetic mutation in the host cell genome).
  • a subject method includes a step of measuring editing efficiency at an ON target site. In some embodiments, a subject method includes a step of measuring editing efficiency at one or more OFF target sites.
  • an on-target (“ON target”) event e.g., target DNA cleavage / editing
  • OFF target off-target
  • any convenient method can be used to measure on-target and off-target events and the selection of method will depend on the type of CRISPR complex used and desired outcome of the complex’s activity (e.g., when using a nickase protein, when performing double stranded target cleavage, when using a donor polynucleotide – which can edit the target by introducing known heterologous sequence; when not using a donor polynucleotide which can lead to numerous different indels, etc.)
  • suitable assays include but are not limited to: mismatch cleavage assays (e.g., surveyor assay, T7E1 mismatch assay), PCR assays; PCR/sequencing assays, direct sequencing assays such as next generation sequencing, and the like (and any combination thereof).
  • Sequencing assays or alternative expression assays such as qRT-PCR and/or microarray analysis can be used when the activity of the CRISPR complex results in an alteration of expression of a target sequence (e.g., when a promoter sequence is targeted, when a coding sequence is targeted and the new sequence is susceptible to nonsense-mediated decay, and the like).
  • Various assays exist to test for on-target and off-target activities and any desired assay or combination of assays can be used.
  • a desirable outcome e.g., a desirable balance between reduced ON target and OFF target CRISPR nuclease activity
  • a particular Acr protein e.g., see above for a description of subject Acr proteins.
  • the off-target sites are predicted and/or known sites, and In some cases, the off-target sites can be identified after the fact (e.g., based on a genome-wide hunt such as can be achieved using high throughput /next generation sequencing methods such as RNA or DNA sequencing methods).
  • the Acr polypeptide of the compositions, systems, or methods described herein decreases OFF target CRISPR nuclease activity (i.e., OFF target activity of the CRISPR nuclease).
  • the Acr polypeptide decreases OFF target CRISPR nuclease activity by 10% or more (e.g., 20% or more, 30% or more, 40% or more, 50% or more, 60% or more, 70% or more, 80% or more, 85% or more, 90% or more, 95% or more, or 99% or more) as compared with OFF target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the Acr polypeptide decreases OFF target CRISPR nuclease activity by 20% or more (e.g., 30% or more, 40% or more, 50% or more, 60% or more, 70% or more, 80% or more, 85% or more, 90% or more, 95% or more, or 99% or more) as compared with OFF target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the Acr polypeptide of the compositions, systems, or methods described herein decreases ON target CRISPR nuclease activity (i.e., ON target activity of the CRISPR nuclease) as compared with ON target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the Acr polypeptide decreases ON target CRISPR nuclease activity by no more than 50% (e.g., by no more than 40%, 30%, 20%, 10%, 5%, 3%, 2%, or 1%) as compared with ON target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the Acr polypeptide decreases ON target CRISPR nuclease activity by no more than 40% (e.g., by no more than 40%, 30%, 20%, 10%, 5%, 3%, 2%, or 1%) as compared with ON target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the Acr polypeptide of the compositions, systems, or methods described herein increases the ratio of ON target to OFF target CRISPR nuclease activity as compared with the ratio of ON target to OFF target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the Acr polypeptide increases the ratio of ON target to OFF target CRISPR nuclease activity by at least 1.25x (i.e., 1.25 times) (e.g., by at least 1.5x, 2x, 2.5x, 3x, 4x, 5x, 6x, 7x, 8x, 9x, or 10x) as compared the ratio of ON target to OFF target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the Acr polypeptide increases the ratio of ON target to OFF target CRISPR nuclease activity by at least 1.5x (e.g., by at least 2x, 2.5x, 3x, 4x, 5x, 6x, 7x, 8x, 9x, or 10x) as compared the ratio of ON target to OFF target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the Acr polypeptide increases the ratio of ON target to OFF target CRISPR nuclease activity by at least 2x (e.g., by at least 2.5x, 3x, 4x, 5x, 6x, 7x, 8x, 9x, or 10x) as compared the ratio of ON target to OFF target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • the editing efficiency of ON target CRISPR nuclease activity is at least 4 times (4x) greater than editing efficiency of OFF target CRISPR nuclease activity (e.g., at least 5x, 6x, 7x, 8x, 9x, 10x, 20x, 25x, 30x, 35x, 40x, 50x, or 100x).
  • the editing efficiency of ON target CRISPR nuclease activity is at least 6 times (6x) greater than editing efficiency of OFF target CRISPR nuclease activity (e.g., at least 7x, 8x, 9x, 10x, 20x, 25x, 30x, 35x, 40x, 50x, or 100x).
  • the editing efficiency of ON target CRISPR nuclease activity is at least 10 times (10x) greater than editing efficiency of OFF target CRISPR nuclease activity (e.g., at least 20x, 25x, 30x, 35x, 40x, 50x, or 100x).
  • the OFF target CRISPR nuclease activity is at an off- target site that comprises no more than 5 mismatches (e.g., no more than 4, no more than 3, no more than 2 or no more than 1) as compared to the on-target site (i.e., mismatches between the intended target site and the off-target site).
  • the OFF target CRISPR nuclease activity is at an off-target site that comprises 5 or less mismatches (e.g., 4 or less, 3 or less, 2 or less or 1 mismatch) as compared to the on-target site (i.e., mismatches between the intended target site and the off-target site).
  • the OFF target CRISPR nuclease activity is at an off-target site that comprises no more than 3 mismatches (e.g., more than 2 or no more than 1) as compared to the on-target site (i.e., mismatches between the intended target site and the off-target site).
  • the OFF target CRISPR nuclease activity is at an off-target site that comprises 3or less mismatches (e.g., 2 or less or 1 mismatch) as compared to the on-target site (i.e., mismatches between the intended target site and the off-target site).
  • a desirable outcome is one in which the off-target rate is less than 100 off-target events detected per cell population (e.g., off-target cleavage events such as insertion/deletions (indels) detected per cell population).
  • the number of cells in the cell population is in a range of from 10 4 to 10 6 (e.g., In some cases, the number of cells in the cell population is about 10 5 cells).
  • a desirable outcome is one in which the off-target rate is less than 90 off-target events detected per cell population (e.g., less than 80, less than 70, less than 60, less than 50, less than 40, less than 30, less than 20, less than 10, or less than 5 off-target events per cell). In some cases, a desirable outcome is one in which the off-target rate is less than 50 off-target events detected per cell population (e.g., less than 40, less than 30, less than 20, less than 10, or less than 5 off-target events per cell).
  • a desirable outcome is one in which the off-target rate is less than 100 off-target events detected per 10 5 cells. In some cases, a desirable outcome is one in which the off-target rate is less than 90 off-target events detected per 10 5 cells (e.g., less than 80, less than 70, less than 60, less than 50, less than 40, less than 30, less than 20, less than 10, or less than 5 off-target events per cell). In some cases, a desirable outcome is one in which the off-target rate is less than 50 off-target events detected per 10 5 cells (e.g., less than 40, less than 30, less than 20, less than 10, or less than 5 off-target events per cell).
  • a desirable outcome is one in which less than 50% (e.g., less than 45%, less than 40%, or less than 35%) of the total measured nucleic acid targeting events (e.g., cleavage) are off-target events.
  • the ratio of on-target to off-target events is greater than 1 (e.g., greater than 1.2, greater than 1.5, greater than 1.8, greater than 2, greater than 2.2, or greater than 2.5).
  • the events can be measured after passaging the host cell (e.g., In some cases, for 10 or more generations) after the Acr and Cas proteins are introduced.
  • a desirable outcome is an outcome in which, after passaging the host cell (e.g., for 10 or more generations) after the Acr and Cas proteins are introduced, less than 50% (e.g., less than 45%, less than 40%, or less than 35%) of the total measured nucleic acid targeting events (e.g., cleavage) are off-target events.
  • the ratio of on- target to off-target events is greater than 1 (e.g., greater than 1.2, greater than 1.5, greater than 1.8, greater than 2, greater than 2.2, or greater than 2.5).
  • off-target sites can In some cases, be predicted.
  • the rate (frequency) of off-target activity (e.g., cleavage/editing) will vary from site to site, e.g., when measuring rates of activity using a population of cells.
  • a desirable outcome is one in which the measured frequency of off-target events is less than 50% (e.g., less than 45%, less than 40%, less than 35%, less than 30%, less than 25%, less than 20%, less than 15%, less than 10%, less than 5%, less than 2%, or less than 1%) when compared to the off-target events measured (or expected) in the absence of the Acr protein.
  • on can measure the frequency of off-target events at one particular predicted or known off-target site (or at any number of off-target sites – predicted/known or not predict/known) in the presence of the Acr protein (meaning – when the experiment is performed in the present of the Acr protein) and in the absence of the Acr protein – and the number of off-target events when the Acr protein is present is less than 50% (e.g., less than 45%, less than 40%, or less than 35%, less than 30%, less than 25%, less than 20%, less than 15%, less than 10%, less than 5%, less than 2%, or less than 1%) compared to the number of off-target evens when the Acr protein is absent.
  • 50% e.g., less than 45%, less than 40%, or less than 35%, less than 30%, less than 25%, less than 20%, less than 15%, less than 10%, less than 5%, less than 2%, or less than 16% compared to the number of off-target evens when the Acr protein is absent.
  • both the Cas protein and the Acr protein will be delivered to a host cell as DNA and in some such cases the sequence encoding the two proteins will be present on the same nucleic acid (e.g., DNA vector) or on separate nucleic acids.
  • a subject protein e.g., Cas protein and/or Acr protein
  • either protein (or both) can be introduced into a host cell as RNA encoding the protein.
  • the RNA encoding the two proteins can be delivered in an appropriate ratio to achieve the desired affect (i.e., increased ratio of on-target to off-target CRISPR complex activity) – e.g., by decreasing off-target activity while retaining desirable on-target activity, and one or more translational control elements can be present on the RNAs.
  • either protein (or both) can be introduced into a host cell directly as proteins.
  • the Cas protein can be delivered as an RNP (ribonucleoprotein complex) in which it is already complexed with an appropriate guide RNA.
  • the other protein e.g., the Acr protein
  • the Cas protein and the Acr protein can be delivered in any desired format (DNA, RNA, protein).
  • the Cas protein can be delivered as DNA, RNA, or protein; if the Cas protein is delivered as RNA, the Acr protein can be delivered as DNA, RNA, or protein; and if the Cas protein is delivered as protein, the Acr protein can be delivered as DNA or RNA.
  • the Cas protein can be delivered as DNA, RNA, or protein; if the Acr protein is delivered as RNA, the Cas protein can be delivered as DNA, RNA, or protein; and if the Acr protein is delivered as protein, the Cas protein can be delivered as DNA or RNA.
  • nucleic acids and proteins can be delivered to cells using any convenient method.
  • Methods of introducing nucleic acids and/or proteins into a host cell e.g., prokaryotic cell, eukaryotic cell, plant cell, animal cell, insect cell, mammalian cell, human cell, and the like
  • a host cell e.g., prokaryotic cell, eukaryotic cell, plant cell, animal cell, insect cell, mammalian cell, human cell, and the like
  • Suitable methods include, e.g., viral infection (e.g., AAV, adenovirus, lentiviral), transfection, conjugation, protoplast fusion, lipofection, electroporation, calcium phosphate precipitation, polyethyleneimine (PEI)-mediated transfection, DEAE-dextran mediated transfection, liposome-mediated transfection, particle gun technology, calcium phosphate precipitation, direct micro injection, nanoparticle-mediated nucleic acid delivery (see, e.g., Panyam et., al Adv Drug Deliv Rev.2012 Sep 13. pii: S0169- 409X(12)00283-9), and the like.
  • viral infection e.g., AAV, adenovirus, lentiviral
  • transfection conjugation, protoplast fusion, lipofection, electroporation, calcium phosphate precipitation, polyethyleneimine (PEI)-mediated transfection, DEAE-dextran mediated transfection, liposome-mediated transfection, particle gun technology,
  • a protein of the present disclosure is provided as a nucleic acid (e.g., an mRNA, a DNA, a plasmid, an expression vector, a viral vector, etc.) that encodes the protein.
  • a subject protein is provided directly as a protein (e.g., without an associated guide RNA or with an associate guide RNA, i.e., as a ribonucleoprotein complex).
  • a subject protein can be introduced into a cell (provided to the cell) by any convenient method; such methods are known to those of ordinary skill in the art. As an illustrative example, a subject protein can be injected directly into a cell.
  • a subject protein can be introduced into a cell (e.g, eukaryotic cell) via nucleofection; via a protein transduction domain (PTD) conjugated to the protein, etc.
  • a subject protein is delivered to a cell (e.g., a target host cell) in a particle, or associated with a particle.
  • DOTAP 1,2-dioleoyl-3-trimethylammonium-propane
  • DMPC 1,2-ditetradecanoyl- sn-glycero-3-phosphocholine
  • the hydrophilic polymer comprises
  • a subject protein may be delivered using particles or lipid envelopes.
  • a biodegradable core-shell structured nanoparticle with a poly ( ⁇ -amino ester) (PBAE) core enveloped by a phospholipid bilayer shell can be used.
  • PBAE poly ( ⁇ -amino ester)
  • particles/nanoparticles based on self assembling bioadhesive polymers are used; such particles/nanoparticles may be applied to oral delivery of peptides, intravenous delivery of peptides and nasal delivery of peptides, e.g., to the brain.
  • Other embodiments, such as oral absorption and ocular delivery of hydrophobic drugs are also contemplated.
  • a molecular envelope technology which involves an engineered polymer envelope which is protected and delivered to the site of the disease, can be used.
  • Lipidoid compounds e.g., as described in US patent application 20110293703 are also useful in the administration of polynucleotides, and can be used to deliver a subject protein or nucleic acid (RNA or DNA).
  • the aminoalcohol lipidoid compounds are combined with an agent to be delivered to a cell or a subject to form microparticles, nanoparticles, liposomes, or micelles.
  • the aminoalcohol lipidoid compounds may be combined with other aminoalcohol lipidoid compounds, polymers (synthetic or natural), surfactants, cholesterol, carbohydrates, proteins, lipids, etc. to form the particles. These particles may then optionally be combined with a pharmaceutical excipient to form a pharmaceutical composition.
  • lipid nanoparticles are used to deliver a subject protein or nucleic acid to a target cell. Negatively charged polymers such as RNA may be loaded into LNPs at low pH values (e.g., pH 4) where the ionizable lipids display a positive charge. However, at physiological pH values, the LNPs exhibit a low surface charge compatible with longer circulation times.
  • Ionizable cationic lipids can include, but are not limited to: 1,2-dilineoyl-3-dimethylammonium-propane (DLinDAP), 1,2- dilinoleyloxy-3-N,N-dimethylaminopropane (DLinDMA), 1,2-dilinoleyloxy-keto-N,N- dimethyl-3-aminopropane (DLinKDMA), and 1,2-dilinoleyl-4-(2-dimethylaminoethyl)- [1,3]-dioxolane (DLinKC2-DMA).
  • LNPs 1,2-dilineoyl-3-dimethylammonium-propane
  • DLinDMA 1,2- dilinoleyloxy-3-N,N-dimethylaminopropane
  • DLinKDMA 1,2-dilinoleyloxy-keto-N,N- dimethyl-3-aminopropane
  • a nucleic acid may be encapsulated in LNPs containing DLinDAP, DLinDMA, DLinK-DMA, and DLinKC2-DMA (cationic lipid:DSPC:CHOL: PEGS-DMG or PEG-C-DOMG at 40:10:40:10 molar ratios). In some cases, 0.2% SP-DiOC18 is incorporated.
  • Suitable lipids include for LNPs include, but are not limited to, DLin-KC2-DMA4, C12-200 and colipids disteroylphosphatidyl choline, cholesterol, and PEG-DMG may be formulated with a subject protein or nucleic acid using a spontaneous vesicle formation procedure.
  • the component molar ratio may be about 50/10/38.5/1.5 (DLin-KC2-DMA or C12- 200/disteroylphosphatidyl choline/cholesterol/PEG-DMG).
  • the particles may be extruded up to three times through 80 nm membranes prior to adding, e.g., RNA such as guide RNA.
  • Particles containing the highly potent amino lipid 16 may be used, in which the molar ratio of the four lipid components 16, DSPC, cholesterol and PEG-lipid (50/10/38.5/1.5) which may be further optimized to enhance activity.
  • Spherical Nucleic Acid (SNATM) constructs and other nanoparticles (particularly gold nanoparticles) can be used to deliver a subject protein or nucleic acid to a target cell.
  • SNATM Spherical Nucleic Acid
  • Nanoparticles with RNA may be constructed with polyethyleneimine (PEI) that is PEGylated with an Arg-Gly-Asp (RGD) peptide ligand attached at the distal end of the polyethylene glycol (PEG).
  • PEI polyethyleneimine
  • RGD Arg-Gly-Asp
  • a “nanoparticle” refers to any particle having a diameter of less than 1000 nm.
  • nanoparticles suitable for use in delivering a subject protein or nucleic acid to a target cell have a diameter of 500 nm or less, e.g., from 25 nm to 35 nm, from 35 nm to 50 nm, from 50 nm to 75 nm, from 75 nm to 100 nm, from 100 nm to 150 nm, from 150 nm to 200 nm, from 200 nm to 300 nm, from 300 nm to 400 nm, or from 400 nm to 500 nm.
  • nanoparticles suitable for use in delivering a a subject protein or nucleic acid to a target cell have a diameter of from 25 nm to 200 nm.
  • Nanoparticles suitable for use in delivering a subject protein or nucleic acid to a target cell may be provided in different forms, e.g., as solid nanoparticles (e.g., metal such as silver, gold, iron, titanium), non-metal, lipid-based solids, polymers), suspensions of nanoparticles, or combinations thereof.
  • Metal, dielectric, and semiconductor nanoparticles may be prepared, as well as hybrid structures (e.g., core- shell nanoparticles).
  • Nanoparticles made of semiconducting material may also be labeled quantum dots if they are small enough (typically below 10 nm) that quantization of electronic energy levels occurs.
  • nanoscale particles are used in biomedical applications as drug carriers or imaging agents and may be adapted for similar purposes in the present disclosure.
  • Semi-solid and soft nanoparticles are also suitable for use in delivering a subject protein or nucleic acid to a target cell.
  • a prototype nanoparticle of semi-solid nature is the liposome.
  • a carrier/medium can include a microparticle.
  • Microparticles can include, but are not limited to, liposomes, nanoparticles, microspheres, nanospheres, microcapsules, and nanocapsules.
  • microparticle can include one or more of the following: a poly(lactide-co-glycolide), aliphatic polyesters including, but not limited to, poly-glycolic acid and poly-lactic acid, hyaluronic acid, modified polysacchrides, chitosan, cellulose, dextran, polyurethanes, polyacrylic acids, psuedo- poly(amino acids), polyhydroxybutrate-related copolymers, polyanhydrides, polymethylmethacrylate, poly(ethylene oxide), lecithin and phospholipids – in any combination thereof.
  • a poly(lactide-co-glycolide) aliphatic polyesters including, but not limited to, poly-glycolic acid and poly-lactic acid, hyaluronic acid, modified polysacchrides, chitosan, cellulose, dextran, polyurethanes, polyacrylic acids, psuedo- poly(amino acids), polyhydroxybutrate
  • a carrier/medium can include a liposome, e.g., one that is capable of attaching and releasing therapeutic agents (e.g., the subject nucleic acids and/or proteins).
  • Liposomes are microscopic spherical lipid bilayers surrounding an aqueous core that are made from amphiphilic molecules such as phospholipids.
  • a liposome may trap a therapeutic agent between the hydrophobic tails of the phospholipid micelle.
  • Water soluble agents can be entrapped in the core and lipid- soluble agents can be dissolved in the shell-like bilayer.
  • Liposomes have a special characteristic in that they enable water soluble and water insoluble chemicals to be used together in a medium without the use of surfactants or other emulsifiers. Liposomes can form spontaneously by forcefully mixing phosopholipids in aqueous media. Water soluble compounds are dissolved in an aqueous solution capable of hydrating phospholipids. Upon formation of the liposomes, therefore, these compounds are trapped within the aqueous liposomal center. The liposome wall, being a phospholipid membrane, holds fat soluble materials such as oils. Liposomes provide controlled release of incorporated compounds. In addition, liposomes can be coated with water soluble polymers, such as polyethylene glycol to increase the pharmacokinetic half-life.
  • water soluble polymers such as polyethylene glycol
  • Liposomes can be made from several different types of lipids; however, phospholipids are most commonly used to generate liposomes. Although liposome formation is spontaneous when a lipid film is mixed with an aqueous solution, it can also be expedited by applying force in the form of shaking by using a homogenizer, sonicator, or an extrusion apparatus. Several other additives may be added to liposomes in order to modify their structure and properties. For instance, either cholesterol or sphingomyelin may be added to the liposomal mixture in order to help stabilize the liposomal structure and to prevent the leakage of the liposomal inner cargo.
  • a liposome formulation may be mainly comprised of natural phospholipids and lipids such as 1,2-distearoryl-sn-glycero-3-phosphatidyl choline (DSPC), sphingomyelin, egg phosphatidylcholines and monosialoganglioside.
  • DSPC 1,2-distearoryl-sn-glycero-3-phosphatidyl choline
  • sphingomyelin sphingomyelin
  • egg phosphatidylcholines monosialoganglioside.
  • a cationic or anionic liposome is used as part of a subject composition or method, or liposomes having neutral lipids can also be used.
  • Cationic liposomes can include negatively-charged materials by mixing the materials and fatty acid liposomal components and allowing them to charge-associate.
  • a cationic or anionic liposome depends upon the desired pH of the final liposome mixture.
  • examples of cationic liposomes include but are not limited to: lipofectin, lipofectamine, and lipofectace.
  • a subject protein or nucleic acid may be delivered encapsulated in PLGA microspheres such as that further described in US published applications 20130252281 and 20130245107 and 20130244279.
  • an exosome is used to deliver a subject protein or nucleic acid to a target cell. Exosomes are endogenous nano-vesicles that transport RNAs and proteins, and which can deliver RNA to the brain and other target organs.
  • a poly(beta-amino alcohol) can be used to deliver a subject protein or nucleic acid to a target cell.
  • US Patent Publication No.20130302401 relates to a class of poly(beta-amino alcohols) (PBAAs) that has been prepared using combinatorial polymerization.
  • Sugar-based particles may be used, for example GalNAc, as described with reference to WO2014118272 (incorporated herein by reference) and Nair, J K et al., 2014, Journal of the American Chemical Society 136 (49), 16958-16961) can be used to deliver a subject protein or nucleic acid to a target cell.
  • Supercharged proteins can be used to deliver a subject protein or nucleic acid to a target cell.
  • Supercharged proteins are a class of engineered or naturally occurring proteins with unusually high positive or negative net theoretical charge. Both supernegatively and superpositively charged proteins exhibit the ability to withstand thermally or chemically induced aggregation. Superpositively charged proteins are also able to penetrate mammalian cells. Associating cargo with these proteins, such as plasmid DNA, RNA, or other proteins, can facilitate the functional delivery of these macromolecules into mammalian cells both in vitro and in vivo.
  • CPPs Cell Penetrating Peptides
  • CPPs typically have an amino acid composition that either contains a high relative abundance of positively charged amino acids such as lysine or arginine or has sequences that contain an alternating pattern of polar/charged amino acids and non-polar, hydrophobic amino acids.
  • Carriers or mediums contemplated include materials such as gelatin, collagen, cellulose esters, dextran sulfate, pentosan polysulfate, chitin, saccharides, albumin, fibrin sealants, synthetic polyvinyl pyrrolidone, polyethylene oxide, polypropylene oxide, block polymers of polyethylene oxide and polypropylene oxide, polyethylene glycol, acrylates, acrylamides, methacrylates including, but not limited to, 2-hydroxyethyl methacrylate, poly(ortho esters), cyanoacrylates, gelatin-resorcin-aldehyde type bioadhesives, polyacrylic acid and copolymers and block copolymers thereof.
  • a subject composition or method may include a donor polynucleotide.
  • a donor polynucleotide a nucleic acid comprising a donor sequence
  • a donor polynucleotide can also be provided to the cell.
  • a “donor sequence” or “donor polynucleotide” or “donor template” it is meant a nucleic acid sequence to be inserted at the site targeted by the CRISPR complex (e.g., after dsDNA cleavage, after nicking a target DNA, after dual nicking a target DNA, and the like).
  • the donor sequence is provided to the cell as single-stranded DNA.
  • the donor template is provided to the cell as double-stranded DNA. It may be introduced into a cell in linear or circular form. If introduced in linear form, the ends of the donor sequence may be protected (e.g., from exonucleolytic degradation) by any convenient method and such methods are known to those of skill in the art.
  • one or more dideoxynucleotide residues can be added to the 3' terminus of a linear molecule and/or self-complementary oligonucleotides can be ligated to one or both ends. See, for example, Chang et al. (1987) Proc. Natl. Acad Sci USA 84:4959-4963; Nehls et al. (1996) Science 272:886- 889.
  • a donor template can be introduced into a cell as part of a vector molecule having additional sequences such as, for example, replication origins, promoters and genes encoding antibiotic resistance.
  • donor template can be introduced as naked nucleic acid, as nucleic acid complexed with an agent such as a liposome or poloxamer, or can be delivered by viruses (e.g., adenovirus, AAV).
  • viruses e.g., adenovirus, AAV.
  • Set A An anti-CRISPR (Acr) polypeptide, or a nucleic acid encoding the Acr polypeptide, wherein the Acr polypeptide comprises an amino acid sequence having 75% or more sequence identity with the sequence set forth in any one of SEQ ID Nos.: 126-132. 2. The Acr polypeptide or nucleic acid of 1, wherein said amino acid sequence has 90% or more sequence identity with the sequence set forth in any one of SEQ ID Nos.126-132. 3. The Acr polypeptide or nucleic acid of 1, wherein the Acr polypeptide comprises the amino acid sequence set forth in any one of SEQ ID Nos.126-132. 4.
  • the Acr polypeptide or nucleic acid of any one of 1-3 wherein the Acr polypeptide decreases OFF target CRISPR nuclease activity by 10% or more as compared with OFF target CRISPR nuclease activity in the absence of the Acr polypeptide. 5.
  • the Acr polypeptide or nucleic acid of any one of 1-4 wherein the Acr polypeptide decreases ON target CRISPR nuclease activity by no more than 40% as compared with ON target CRISPR nuclease activity in the absence of the Acr polypeptide. 6.
  • the Acr polypeptide or nucleic acid of any one of 1-5 wherein the Acr polypeptide increases the ratio of ON target to OFF target CRISPR nuclease activity by at least 1.25 times as compared with the ratio of ON target to OFF target CRISPR nuclease activity in the absence of the Acr polypeptide.
  • NLS nuclear localization signal
  • a system comprising: (i) an Acr polypeptide or a nucleic acid encoding the Acr polypeptide, wherein the Acr polypeptide is the Acr polypeptide of any one of 1-10, or comprises an amino acid sequence having 80% or more sequence identity with the CRISPR nuclease amino acid sequence set forth in any one of SEQ ID NOs.: 165-169; and (ii) a CRISPR nuclease or a nucleic acid encoding the CRISPR nuclease. 12. The system of 11, wherein the CRISPR nuclease is a Cas12a nuclease. 13.
  • the system of 11, wherein the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity with the Cas12a nuclease amino acid sequence set forth in any one of SEQ ID NOs.: 175 and 245-262. 14. The system of 11, wherein the CRISPR nuclease comprises an amino acid sequence having 80% or more sequence identity with the NUX protein amino acid sequence set forth in any one of SEQ ID NOs.: 1-86 and 176-244. 15. The system of any one of 11-14, further comprising a guide RNA or a nucleic acid encoding the guide RNA. 16.
  • the system comprises the nucleic acid encoding the Acr polypeptide and/or the nucleic acid encoding the CRISPR nuclease. 17. The system of 16, wherein the system comprises a first nucleic acid encoding the Acr polypeptide and a second nucleic acid encoding the CRISPR nuclease. 18. The system of 16, wherein the nucleic acid encoding the Acr polypeptide and the nucleic acid encoding the CRISPR nuclease are the same nucleic acid such that the system comprises a nucleic acid that encodes both the Acr polypeptide and the CRISPR nuclease. 19.
  • a translation control element is operably linked to the Acr polypeptide coding sequence or to the CRISPR nuclease coding sequence.
  • the translation control element is selected from the group consisting of an IRES sequence, a 2A peptide encoding sequence, a non- canonical start codon or any combination thereof.
  • a first promoter is operably linked to the Acr polypeptide coding sequence and a second promoter is operably linked to the CRISPR nuclease coding sequence. 22. The system of 21, wherein the first promoter is a stronger promoter than the second promoter. 23.
  • the viral vector is an AAV vector.
  • the viral vector is an AAV vector.
  • the viral vector is an AAV vector.
  • one or more of said nucleic acids is comprised in a lipid nanoparticle (LNP).
  • LNP lipid nanoparticle
  • the system comprises the Acr polypeptide and/or the CRISPR nuclease in protein form.
  • the Acr polypeptide and/or the CRISPR nuclease is comprised in a lipid nanoparticle (LNP).
  • a method of modifying a target nucleic acid comprising: contacting the target nucleic acid with an Acr polypeptide and a CRISPR nuclease, wherein the Acr polypeptide and the CRISPR nuclease are delivered to the target nucleic acid as the system of any one of 11-35, wherein the contacting results in a modification to the nucleotide sequence of the target nucleic acid.
  • the modification results from deletion of one or more nucleotides.
  • the contacting step delivers the CRISPR nuclease and the Acr polypeptide at a 1:1 ratio.
  • the contacting step delivers the Acr polypeptide and the CRISPR nuclease at a ratio (Acr:CRISPR) in a range of from 1:1.25 to 1:10.
  • the method of 43 wherein the animal cell is a human cell. 45. The method of 42, wherein the eukaryotic cell is a stem cell. 46. The method of any of 36-45, wherein the target nucleic acid encodes a gene product. 47. The method of any one of 36-46, wherein editing efficiency of ON target CRISPR nuclease activity is at least 4 times greater than editing efficiency of OFF target CRISPR nuclease activity. 48. The method of 47, wherein said OFF target CRISPR nuclease activity is at an off- target site that comprises no more than 5 mismatches as compared to the on- target site. 49.
  • the 13 The Acr polypeptide or nucleic acid of any one of 8-10, wherein the CRISPR nuclease comprises an amino acid sequence having 80%, 85%, 90% or 95% or more sequence identity with the NUX protein amino acid sequence set forth in any one of SEQ ID NOs.: 1-86 and 176-244. 14.
  • a system comprising: (i) an Acr polypeptide or a nucleic acid encoding the Acr polypeptide, wherein the Acr polypeptide is the Acr polypeptide of any one of claims 1- 14, comprises an amino acid as set forth in any one of SEQ ID NOs.: 165-169, or comprises an amino acid sequence having 80%, 85%, 90% or 95% or more sequence identity with the Acr amino acid sequence set forth in any one of SEQ ID NOs.: 165-169; and (ii) a CRISPR nuclease or a nucleic acid encoding the CRISPR nuclease.
  • the system of 15, wherein the CRISPR nuclease is a Cas12a nuclease. 17. The system of 15, wherein the CRISPR nuclease comprises an amino acid sequence having 80%, 85%, 90% or 95% or more sequence identity with the Cas12a nuclease amino acid sequence set forth in any one of SEQ ID NOs.: 175 and 245-262. 18. The system of 15, wherein the CRISPR nuclease comprises an amino acid sequence as set forth in SEQ ID NO.263 or an amino acid sequence having 80%, 85%, 90% or 95% or more sequence identity with the CRISPR nuclease amino acid sequence set forth in SEQ ID NO.263 19.
  • the CRISPR nuclease comprises an amino acid sequence as set forth in any one of SEQ ID NOs.: 1-86 and 176-244 or an amino acid sequence having 80%, 85%, 90% or 95% or more sequence identity with the NUX protein amino acid sequence set forth in any one of SEQ ID NOs.: 1-86 and 176-244.
  • 20. The system of any one of 15-19, further comprising a guide RNA or a nucleic acid encoding the guide RNA.
  • 21. The system of any one of 15-20, wherein the system comprises the nucleic acid encoding the Acr polypeptide and/or the nucleic acid encoding the CRISPR nuclease. 22.
  • the nucleic acid encoding the Acr polypeptide and the nucleic acid encoding the CRISPR nuclease are the same nucleic acid such that the system comprises a nucleic acid that encodes both the Acr polypeptide and the CRISPR nuclease.
  • 24. The system of any one of 21-23, wherein a translation control element is operably linked to the Acr polypeptide coding sequence or to the CRISPR nuclease coding sequence. 25.
  • the system of 24, wherein the translation control element is selected from the group consisting of an IRES sequence, a 2A peptide encoding sequence, a non- canonical start codon or any combination thereof.
  • 26. The system of any one of 21-23, wherein a first promoter is operably linked to the Acr polypeptide coding sequence and a second promoter is operably linked to the CRISPR nuclease coding sequence. 27. The system of 26, wherein the first promoter is a stronger promoter than the second promoter. 28. The system of 26, wherein the second promoter is a stronger promoter than the first promoter. 29. The system of any one of 21-28, wherein one or more of said nucleic acids is a viral vector. 30.
  • the viral vector is an AAV vector.
  • 31. The system of any one of 21-30, wherein one or more of said nucleic acids is comprised in a lipid nanoparticle (LNP).
  • LNP lipid nanoparticle
  • 32. The system of any one of 15-20, wherein the system comprises the Acr polypeptide and/or the CRISPR nuclease in protein form.
  • 33. The system of 32, wherein the Acr polypeptide and/or the CRISPR nuclease is comprised in a lipid nanoparticle (LNP). 34.
  • a method of modifying a target nucleic acid comprising: contacting the target nucleic acid with an Acr polypeptide and a CRISPR nuclease, wherein the Acr polypeptide and the CRISPR nuclease are delivered to the target nucleic acid as the system of any one of 15-37, wherein the contacting results in a modification to the nucleotide sequence of the target nucleic acid.
  • 39. The method of 38, wherein the modification results from deletion of one or more nucleotides.
  • the contacting step delivers the CRISPR nuclease and the Acr polypeptide at a 1:1 ratio. 41.
  • Example 1 Vector Construction Nuclease expression vectors [00248] Codon-optimized genes encoding AsCas12a, NUX (SEQ ID NO: 176), NUX (SEQ ID NO: 177) and NUX (SEQ ID NO: 178) were synthesized and cloned into a mammalian expression vector under the CMV promoter.
  • Vector included an in-frame nuclear localization signal OPT NLS (GRSSDDEATADSQHAAPPKKKRKV) (SEQ ID NO: 125) followed by the linker (GGSGGSGGSGGSGGSGGSGGSGGS) (SEQ ID NO: 124) and followed by a 3x HA tag.
  • DR direct repeat
  • nucleases NUX SEQ ID NO: 176
  • NUX SEQ ID NO: 178
  • AsCas12a Table 2
  • DR19s is NUX (SEQ ID NO: 176)
  • NUX SEQ ID NO: 177)
  • NUX SEQ ID NO: 178
  • direct repeat and DR18s is AsCas12a direct repeat.
  • a spacer target sequence was placed downstream of the DR sequence.
  • the spacer sequence used for DNMT was CTGATGGTCCATGTCTGTTA (SEQ ID NO: 172) for which the PAM was TTTC.
  • the spacer sequence used for FANCF1 was GGCGGGGTCCAGTTCCGGGA (SEQ ID NO: 162) for which the PAM was TTTG. Mismatched guides were made by changing a single nucleotide on a given position into its counterpart such as A->T and G->C.
  • Table 2 Direct repeat sequences used in the guide vectors Direct Repeat Sequence SEQ ID NO: DR19s AATTTCTACTGTGTGTAGAT 163 DR18s AATTTCTACTCTTGTAGAT 164 Acr Vectors
  • Potential Anti-CRISPR (Acr) sequences for use with nucleases were identified in EBI and NCBI databases (Table 3).
  • Example 1 The vectors constructed in Example 1 were transfected into cells using Mirus Transit X2 reagent, MirusBio CAT MIR6003. Tests were performed in 96 well plates transfected with 100 ng of nuclease expression vector, 100 ng of Acr expression vector and 50 ng of targeting guide vector following the Mirus Transit X2 transfection recommendations. Samples were incubated for 72h and harvested with Quick Extract. Genomic DNA was amplified using genomic region-specific primers.
  • Target FWD Primer REV Primer DNMT1 CCAGAATGCACAAAGTACTGCAC GCCAAAGCCCGAGAGAGTGCC (SEQ ID NO: 170) (SEQ ID NO: 171)
  • Samples were checked on a 2% agarose gel for purity and sequenced by Sanger sequencing. TIDE analysis was performed following the method of Brinkman et al., 2014 as well as TIDE’s website (https://tide.nki.nl/) recommendations. TIDE output data on editing efficiency was plotted using Prism software. The Acrs were compared to AcrVA1 (ACX-137) (SEQ ID NO: 67) and AcrIIA4 (ACX-105) (SEQ ID NO: 35). Results are shown in Figure 1.
  • Acx-178 showed little inhibition and the inhibition activity of Acx-176 was less effective as compared to Acx-175, whereas Acx-176 was a stronger inhibitor than Acx- 175 with Cas12a.
  • Example 3 Dosage dependent nuclease inhibiting properties of Acx-175 [00255] In a series of dosing experiments, Acx-175 showed an intrinsic ErAcr property that inhibits off- target editing while leaving on-target activity substantially unchanged. [00256] The AsCas12a endonuclease was co-transfected with decreasing doses of Acx- 175 into HEK293T cells to observe editing activity targeting DNMT1. Transfection, incubation and analysis including measurements of editing efficiency were performed as described in Example 2.
  • Off target editing was measured using a guide RNA with a single base change at position 9 (MM9, CTGATGGTGCATGTCTGTTA: SEQ ID NO: 173). Results are shown in Figure 3.
  • the NUX (SEQ ID NO: 177) endonuclease was co-transfected with decreasing doses of Acx-175 to observe editing activity targeting DNMT1. Transfection, incubation and analysis protocols were performed as Example 2.
  • Off target editing was measured using a guide RNA with a single base change at position 9 (MM9) and a guide RNA with a single base change at position 5 (MM5, CTGAAGGTCCATGTCTGTTA: SEQ ID NO: 174). Results are shown in Figure 4.
  • Example 4 Generation of engineered Acrs
  • Strong nuclease inhibitor, Acx-175 (SEQ ID NO: 165) was mutagenized using a random mutagenesis kit (Genemorph II). Mutants were selected using the DEAD/ALIVE bacterial screen approach as described by Huimin Zhao in 2005; https://academic.oup.com/nar/article/33/18/e154/2401371.
  • Expression vectors were constructed using NUX (SEQ ID NO: 178) and the mutagenized Acx-175 library as described in Example 1.
  • the target was DNMT1 and a guide RNA with a single base change at position 8 (MM8, CTGATGGaCCATGTCTGTTA: SEQ ID NO: 161) was used to measure off-target editing.
  • the first round of the screen resulted in six Acx-175 variants (Table 4, except for Acx-315) that were transferred into a CMV expressing vector and tested in HEK293T cells with NUX (SEQ ID NO: 178) and DNMT1 for validation following the methods in Example 2. Results are shown in Figure 5.
  • WT amino acid WT AA Variants (amino acid ition) position

Landscapes

  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Zoology (AREA)
  • General Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Peptides Or Proteins (AREA)
EP23824718.3A 2022-06-15 2023-06-08 Manipulierte acr-proteine zur modulation der crispr-aktivität Pending EP4540376A2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202263352494P 2022-06-15 2022-06-15
PCT/US2023/068149 WO2023244934A2 (en) 2022-06-15 2023-06-08 Engineered acr proteins for modulating crispr activity

Publications (1)

Publication Number Publication Date
EP4540376A2 true EP4540376A2 (de) 2025-04-23

Family

ID=89191962

Family Applications (1)

Application Number Title Priority Date Filing Date
EP23824718.3A Pending EP4540376A2 (de) 2022-06-15 2023-06-08 Manipulierte acr-proteine zur modulation der crispr-aktivität

Country Status (4)

Country Link
EP (1) EP4540376A2 (de)
JP (1) JP2025520547A (de)
CN (1) CN120051567A (de)
WO (1) WO2023244934A2 (de)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4367229A4 (de) * 2021-07-09 2025-05-21 Acrigen Biosciences Zusammensetzungen und verfahren für nukleinsäuremodifikationen

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210363206A1 (en) * 2018-06-18 2021-11-25 The Regents Of The University Of California Proteins that inhibit cas12a (cpf1), a cripr-cas nuclease
US20230374502A1 (en) * 2020-10-02 2023-11-23 Acrigen Biosciences, Inc. Compositions and methods for enhanced nucleic acid targeting specificity

Also Published As

Publication number Publication date
JP2025520547A (ja) 2025-07-03
WO2023244934A2 (en) 2023-12-21
WO2023244934A3 (en) 2024-02-08
CN120051567A (zh) 2025-05-27

Similar Documents

Publication Publication Date Title
EP3585159B1 (de) Verfahren zum testen der fähigkeit einer crispr/cas9-nuklease, einen genomischen zielort in vivo zu modifizieren, unter verwendung einer cas-transgenen maus oder ratte, deren genom eine cas9-expressionskassette enthält
US11021719B2 (en) Methods and compositions for assessing CRISPER/Cas-mediated disruption or excision and CRISPR/Cas-induced recombination with an exogenous donor nucleic acid in vivo
US12385025B2 (en) Compositions and methods for nucleic acid modifications
US20190032156A1 (en) Methods and compositions for assessing crispr/cas-induced recombination with an exogenous donor nucleic acid in vivo
JP2023548860A (ja) Casとdnaポリメラーゼとの会合による、予測可能かつテンプレートフリーの遺伝子編集の強化
EP4540376A2 (de) Manipulierte acr-proteine zur modulation der crispr-aktivität
WO2023240229A2 (en) Compositions and methods for nucleic acid modifications
US20230374502A1 (en) Compositions and methods for enhanced nucleic acid targeting specificity
US20240309349A1 (en) Compositions and methods for nucleic acid modifications
WO2023147558A2 (en) Crispr methods for correcting bag3 gene mutations in vivo
US20230081547A1 (en) Non-human animals comprising a humanized klkb1 locus and methods of use
US20250135032A1 (en) Crispr methods for correcting bag3 gene mutations in vivo
WO2025076084A1 (en) Engineered nucleases and compositions and methods thereof
WO2023235725A2 (en) Crispr-based therapeutics for c9orf72 repeat expansion disease
WO2025014798A2 (en) Engineered nucleases
AU2024309884A1 (en) Methods and compositions for increasing homology-directed repair
EP4565603A1 (de) Aggregationsresistente varianten von tdp-43
HK40018000A (en) Method of testing the ability of a crispr/cas9 nuclease to modify a target genomic locus in vivo, by making use of a cas-transgenic mouse or rat which comprises a cas9 expression cassette in its genome
HK40018000B (en) Method of testing the ability of a crispr/cas9 nuclease to modify a target genomic locus in vivo, by making use of a cas-transgenic mouse or rat which comprises a cas9 expression cassette in its genome

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20241213

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40125390

Country of ref document: HK