WO2019185750A1 - A high-throughput platform to select for regulators of crispr-cas associated activity - Google Patents

A high-throughput platform to select for regulators of crispr-cas associated activity Download PDF

Info

Publication number
WO2019185750A1
WO2019185750A1 PCT/EP2019/057785 EP2019057785W WO2019185750A1 WO 2019185750 A1 WO2019185750 A1 WO 2019185750A1 EP 2019057785 W EP2019057785 W EP 2019057785W WO 2019185750 A1 WO2019185750 A1 WO 2019185750A1
Authority
WO
WIPO (PCT)
Prior art keywords
crispr
gene
polypeptide
rna
cells
Prior art date
Application number
PCT/EP2019/057785
Other languages
French (fr)
Inventor
Ruben VAZQUEZ-URIBE
Eric VAN DER HELM
Morten Otto Alexander SOMMER
Sang Woo Lee
Original Assignee
Danmarks Tekniske Universitet
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Danmarks Tekniske Universitet filed Critical Danmarks Tekniske Universitet
Publication of WO2019185750A1 publication Critical patent/WO2019185750A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1079Screening libraries by altering the phenotype or phenotypic trait of the host
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2521/00Reaction characterised by the enzymatic activity
    • C12Q2521/30Phosphoric diester hydrolysing, i.e. nuclease
    • C12Q2521/301Endonuclease

Definitions

  • TITLE A high-throughput platform to select for regulators of CRISPR-Cas associated activity
  • the invention relates to a method for in vivo detection of small molecules or polypeptides that act as regulators of CRISPR-Cas associated activity, in particular the method is for use in the screening of molecule libraries and non-host DNA libraries.
  • the method employs cells comprising genes expressing CRISPR-Cas associated activity and a reporter protein to allow the detection of said regulators.
  • the invention further relates to cells comprising genes expressing of CRISPR-Cas associated activity and a reporter protein for use in the in vivo screening and detection methods.
  • CRISPR-Cas systems originate from bacteria where they function as adaptive immune systems that cleave foreign DNA or RNA in a sequence specific manner.
  • Components of type II CRISPR-Cas systems such as Cas9 from Streptoccocus pyogenes, have been exploited for precise and programmable gene editing.
  • CRISPR-Cas technologies represent effective biotechnological tools that broadly impact life sciences. Applications range from treating human genetic diseases to eliminating pathogens using gene-drives. Beyond genome editing, CRISPR systems have several other applications, including programmable transcriptional regulators and ultra-sensitive detection devices.
  • ACR anti-CRISPR
  • step (I) introducing a self-replicating library of non-host DNA fragments into said cells; b. culturing the cells obtained from step (a) (I) or (II) on said defined culture medium for a period sufficient for uptake or expression of a candidate regulator of CRISPR-Cas associated activity; c. culturing the cells obtained from step (b) on said defined culture medium wherein said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity, and
  • step (c) detecting expression of a reporter gene present in cells obtained from step (c), wherein said cells comprise:
  • a first gene encoding a polypeptide or a first gene cluster encoding a polypeptide complex, said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said first gene or first gene cluster is operably linked a promoter having a measured strength of >0.65 on the
  • nucleic acid molecule operably linked to a promoter and whose expression is inducible by at least one of said one or more agents
  • nucleic acid molecule encodes one or more RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript; and wherein the polypeptide or polypeptide complex having RNA-guided endonuclease activity is capable of inactivating said reporter gene or its transcript.
  • a first gene encoding a polypeptide or a first gene cluster encoding a polypeptide complex, said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said first gene or first gene cluster is operably linked to a promoter having a measured stength of >0.65 on the Anderson scale and whose expression is inducible by at least one agent;
  • nucleic acid molecule operably linked to a promoter and whose expression is inducible by at least one agent
  • nucleic acid molecule encodes one or more RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript;
  • polypeptide or polypeptide complex having RNA-guided endonuclease activity is capable of inactivating said reporter gene or its transcript.
  • a third embodiment of the invention provides for the use of a biological cell according to the invention, for detecting a regulator of CRISPR-Cas associated activity.
  • a fourth embodiment of the invention provides a biological cell library of non-host DNA fragments, wherein each cell of the library is a cell according to the invention, and wherein each cell further comprises a self-replicating genetic element comprising a) a non-host DNA fragment, and
  • a fifth embodiment of the invention provides for a use of a biological cell library of non-host DNA fragments according to the invention, for detecting a regulator of CRISPR-Cas associated activity.
  • Figure 1 is a cartoon illustrating the method of the invention; comprising providing a biological cell comprising a gene or gene cluster encoding a protein or protein complex having RNA-guided endonuclease activity, a nucleic acid molecule encoding chimeric gRNA (sgRNA) or a CRISPR array, as well as a reporter gene encoding a selection marker, and then either contacting the cell with a molecule, or introducing a self- replicating library of non-host DNA into the cell, such as to allow detection of molecules or expressed polypeptides that inhibit inactivation of the reporter gene or its transcript by the RNA-guided endonuclease.
  • sgRNA chimeric gRNA
  • CRISPR array CRISPR array
  • Figure 2 is a cartoon illustrating the pCasens3 and pDual3 plasmids and interaction with a non-peptide molecule with anti-CRISPR activity.
  • Plasmid pCasens3 (to the left) comprises: the Streptococcus pyogenes Cas9 gene (spCas9) [SEQ ID No. : 150] cloned into the backbone of the plasmid pSEVA47. Additionally, a sigma70 constitutive promoter (P23100) [SEQ ID No. : 148], adjacent to a DNA molecule encoding a theophylline translational riboswitch [SEQ ID No.
  • pSEVA47 comprises a low copy number origin of replication, SC101 [SEQ ID No. : 155]; and the antibiotic resistance gene aadA [SEQ ID No. : 153] under the control of its native (constitutive) promoter and ribosomal binding site, conferring resistance against Spectinomycin.
  • Plasmid pDual3 (to the right) comprises a chloramphenicol resistance gene (CmR) [SEQ ID No.
  • plasmid further comprises an araC L-arabinose sensor gene [SEQ ID No. : 159] that regulates the pBAD promoter in response to the metabolite, L-arabinose.
  • pDual3 comprises a low copy number origin of replication, pl5A [SEQ ID No. : 163] .
  • Figure 3 is a cartoon illustrating (A) the negative control plasmid comprising a gene [SEQ ID No. : 140] encoding Green Fluorescent Protein (GFP) under the control of a constitutive Ptet promoter [SEQ ID No. : 139]; and (B) positive control plasmid comprising a gene [SEQ ID No. : 146] encoding AcrIIA2 protein under the control of a constitutive Ptet promoter [SEQ ID No. : 139] .
  • Both negative and positive control plasmids further comprise a kanamycin resistance gene [SEQ ID No. : 144] under the control of its native (constitutive) promoter and ribosomal binding site and a colEl origin of replication [SEQ ID No. : 143] .
  • Figure 4 is a histogram showing the number of colony forming units (CFU. ml 1 ) of E. coli cells comprising the pDual3 plasmid in combination with 3 alternative versions of the pCasens3 plasmid; following cultivation on selective media comprising chloramphicol, where the sgRNA and Cas9 genes of the genetic circuit are induced as indicated with arabinose and theophylline respectively.
  • CFU. ml 1 colony forming units
  • the 3 forms of pCasens3 comprise a Cas9 gene operably linked to either a medium strength promoter of ⁇ 0.50 on the Anderson scale (ProC); the medium strength promoter (ProC) together with a theophylline riboswitch 5' of the cas9 gene; or a strong promoter (BBa J23100 [SEQ ID No. : 172]) together with a theophylline riboswitch 5' of the cas9 gene.
  • a strong promoter BBa J23100 [SEQ ID No. : 172]
  • Figure 5 shows (A) a cartoon illustrating the molecular mechanism whereby in E. coli cells comprising the Genetic Circuit (£. coli- GC), the genetic circuit is induced to express the cas9 gene and sgRNA, the formation of a Cas9-sgRNA complex, and the endonuclease-mediated cleavage of the reporter gene (Cat), leading to cell death when exposed to chloramphenicol .
  • the cartoon further shows that co-expression of an acrIIA2 gene in the E. coli- GC cells can inhibit Cas9-sgRNA inactivation of the Cat gene, leading to cell survival under selective conditions.
  • (B) is a histogram showing the number of colony forming units (CFU. ml 1 ) of E. coli- GC cells comprising a negative control plasmid (expressing GFP) or the positive control plasmid (expressing AcrIIA2) following cultivation on selective media, where the genetic circuit is either induced or non-induced.
  • Figure 6 is a cartoon illustrating (A) a metagenomics DNA expression plasmid comprising a fragment of (a library of) metagenomics DNA fragments cloned down stream of a constitutive Ptet promoter [SEQ ID No. : 139]; a kanamycin resistance gene under the control of its native (constitutive) promoter and ribosomal binding site; together with the genetic circuit comprising plasmids pCasens3 and pDual3; (B) a positive control plasmid comprising a gene encoding AcrIIA2 protein and the genetic circuit plasmids pCasens3 and pDual3.
  • A a metagenomics DNA expression plasmid comprising a fragment of (a library of) metagenomics DNA fragments cloned down stream of a constitutive Ptet promoter [SEQ ID No. : 139]; a kanamycin resistance gene under the control of its native (cons
  • Figure 7 is a cartoon showing use of host cells comprising the genetic circuit to screen a metagenomics DNA expression library for anti-CRISPR proteins.
  • Figure 8 is a histogram showing the number of colony forming units (CFU.ml 1 ) of E. coli- GC cells comprising selected metagenomics inserts, designated AC11 - AC42; as compared to a negative control plasmid (expressing GFP) or the positive control plasmid (expressing AcrIIA2) following cultivation on selective media, where the genetic circuit is induced.
  • CFU.ml 1 colony forming units
  • Figure 9 is a histogram showing the number of colony forming units (CFU.ml -1 ) of E. coli- GC cells comprising plasmids pTypel-B-Cas in combination with pDual3; following sequential cultivation in growth media that are either non-inducing or inducing for Typel-B-Cas expression; and then on either non-selective or selective media comprising chloramphicol, wherein Typel-B-Cas expression was induced using 0 - lOOuM IPTG.
  • Figure 10 is a histogram showing the number of colony forming units (CFU.ml 1 ) of E.
  • coli- GC cells comprising plasmids pTypell-A-Cas in combination with pDual3; following sequential cultivation in growth media that are non-inducing or inducing for TypeII-A- Cas expression; and then on either non-selective or selective media comprising chloramphicol, wherein Typell-A-Cas expression was induced using 0 - lOOuM IPTG.
  • Figure 11 is a histogram showing the number of colony forming units (CFU.ml 1 ) of E. coli- GC cells comprising plasmids pTypeV-A-Cas in combination with pDual3; following sequential cultivation in growth media that are non-inducing or inducing for TypeV-A- Cas expression; and then on either non-selective or selective media comprising chloramphicol, wherein TypeV-A-Cas expression was induced using 2mM theophylline.
  • CFU.ml 1 colony forming units
  • Figure 12 is a cartoon showing a method for functional screening for anti-CRISPR molecules using biological cells comprising the genetic circuit according to the invention that is suitable for screening : a metagenomics library encoding candidate anti-CRISPR proteins;an isolated candidate anti-CRISPR protein; and an isolated anti- CRISPR small molecule.
  • the method comprises 6 steps:
  • the medium comprises nutrients essential for growth and when required one or more agents selective for maintenance of genes of the genetic circuit including the reporter gene, but devoid of agents required for, or capable of, inducing expression of the genetic circuit (i.e. the genes or gene clusters encoding an RNA-guided endonuclease and its cognate RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript).
  • step 2) An aliquot of the cell culture produced in step 1) is inoculated into said defined growth medium to produce a cell suspension; but wherein said medium is both devoid of agents required for, or capable of, inducing expression of said genetic circuit; and devoid of agents selective for maintenance of genes of the genetic circuit including the reporter gene;
  • cells in the cell suspension produced in step 2) are either:
  • step 3) i) is incubated to allow expression of a candidate anti-CRISPR protein encoded by a DNA molecule in the metagenomic library; or from - step 3) ii) is incubated to allow intracellular uptake of the candidate anti-CRISPR small molecule or candidate anti-CRISPR protein;
  • Agent(s) capable of inducing expression of said genetic circuit are added to the cell suspension(s) resulting from step 4; and the suspensions are further incubated under conditions suitable for inducing expression of genes of the genetic circuit;
  • Cells derived from the cell suspension(s) resulting from step 5) are plated on agar plates composed of a defined media comprising nutrients essential for growth and supplemented with agents required for, or capable of, inducing expression of said genetic circuit and as required agents selective for maintenance of genes of the genetic circuit including the reporter gene.
  • the plates are incubated under suitable conditions for a time sufficient to detect viable biological cells.
  • Figure 13 is a histogram showing the number of colony forming units (CFU .ml-1) of E. coli- GC cells comprising a CRISPR-Cas genetic circuit consisting of either: (1) pCasens3 (SP(II-A)) and pDUAL3; (2) pTypeV-A-Cas (FN(V-A) and pDual3; (3) pTypell-A-Cas (STl(II-A) and pDual3; or (4) pTypel-B-Cas (CD(I-B)) and pDual3; following incubation on liquid medium comprising either M9 minimal medium or yeast extract medium (2xYT); each medium with or without the addition of agents to induce expression of the respective CRISPR-Cas genetic circuit, and then respectively plated on agar medium comprising either M9 minimal medium or yeast extract (2xYT); each medium comprising chloramphenicol, and with or without addition of said inducing agents.
  • Figure 14 is a histogram showing the the number of colony forming units (CFU . ml 1 ) of E. coli cells comprising the pDual3 plasmid in combination with 3 alternative versions of the pCasens3 plasmid following culture in the presence of chloramphicol and in the presence, or absence, of arabinose and theophylline to simulateously induce expression of the sgRNA and Cas9 genes respectively.
  • the 3 forms of pCasens3 plasmid comprise a Cas9 gene operably linked to either a weak promoter (J23116 [SEQ ID No. : 176]) shown in the left-hand bar; medium promoter (J23111 [SEQ ID No.
  • Agent is a small molecule capable of inducing expression of one or more genes operably linked to a promoter in the genetic circuit of the invention .
  • Amino acid sequence identity indicates a quantitative measure of the degree of homology between two amino acid sequences of substantially equal length . The two sequences to be compared must be aligned to give a best possible fit, by means of the insertion of gaps or alternatively, truncation at the ends of the protein sequences.
  • sequence identity can be calculated as ((Nref-Ndif)100)/(Nref), wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences.
  • Sequence identity can alternatively be calculated by the BLAST program e.g. the BLASTP program (Pearson W.R and D.J. Lipman (1988)) (www.ncbi. nlm. nih.gov/cgi-bin/BLAST).
  • alignment is performed with the sequence alignment method ClustalW with default parameters as described by Thompson J., et al 1994, available at http://www2.ebi.ac.uk/clustalw/.
  • the numbers of substitutions, insertions, additions or deletions of one or more amino acid residues in the polypeptide as compared to its comparator polypeptide is limited, i .e. no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 substitutions, no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 insertions, no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 additions, and no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 deletions.
  • substitutions are conservative amino acid substitutions : limited to exchanges within members of group 1 : Glycine, Alanine, Valine, Leucine, Isoleucine; group 2 : Serine, Cysteine, Selenocysteine, Threonine, Methionine; group 3 : proline; group 4 : Phenylalanine, Tyrosine, Tryptophan; Group 5 : Aspartate, Glutamate, Asparagine, Glutamine.
  • Cis element is a sequence of DNA that is capable of conferring inducible expression of a gene that is functionally linked to a promoter; where the cis element may induce expression by regulating transcription of the gene or by regulating translation of transcript product of the gene.
  • Cognate is used to refer to interacting pairs of functional entities; more specifically that each protein or protein cluster having RNA-guided endonuclease activity has a cognate mat-crRNA, and in the case of Type II CRISPR-Cas systems both a cognate mat-crRNA and cognate tracrRNA, that are required for their activity.
  • CRISPR system is a CJustered R_egularly Interspaced S_hort P_alindromic R_epeats is bacterial immune system that cleave DNA or RNA in a sequence specific manner, mediated by a RNA guided nuclease.
  • E. coli ToplO is E. coli having chromosomal Genotype mcrA, A(mrr-hsdRMS- mcrBC), Phi80lacZ(del)M15, AlacX74, deoR, recAl, araD139, A(ara-leu)7697, galU, galK, rpsL(SmR), endAl, nupG
  • gi number (genlnfo identifier) is a unique integer which identifies a particular sequence, independent of the database source, which is assigned by NCBI to all sequences processed into Entrez, including nucleotide sequences from DDBJ/EMBL/GenBank, protein sequences from SWISS-PROT, PIR and many others.
  • Heterologous gene and heterologous DNA molecule have a different genetic origin from the recombinant biological cell in which they are expressed; and this also applies to the protein or transcript that they encode.
  • the nucleotide sequence of the heterologous gene or heterologous DNA molecule may be optimized (e.g. codon optimization) with respect to the recombinant biological cell in which they are expressed.
  • Heterologous gene and heterologous DNA molecule may be located on (and therefor be a part of) the chromosome or a episome of the recombinant cell, and may be inserted into this location by recombinant DNA cloning.
  • Promoter is a region of DNA that initiates transcription of a particular gene. Promoters are located near the transcription start sites of genes, on the same strand and upstream on the DNA (towards the 5' region of the sense strand). A promoter that is "functionally linked" to a gene is capable of driving expression of said gene. A promoter may be drive expression constitutively or only in response to an inducer.
  • Promoter activity is the measured strength i .e. measured relative activity of a promoter to drive expression of a reporter gene encoding Red Fluorescent Protein (RFP) in E. coli. Measurement is performed on cells of f.
  • RFP Red Fluorescent Protein
  • coli strain TGI [K-12 supE thi-1 A(lac-proAB) A(mcrB-hsdSM)5, (rK-mK-)] comprising the plasmid J61002 (as described in http://parts.igem .Org/Part: BBa_J61002) in which the tested promoter linked to the RFP reported gene is cloned, and where the cells are grown in LB media to saturation .
  • the range of measured promoter strengths is relative to the Anderson Promoter Collection Scale (Anderson scale) of 0.00 to 1.00, wherein promoter J23100 [SEQ ID No. : 172] has the highest measured relative strength of 1.00; and the promoter J23112 [SEQ ID No. : 171] has a measured relative strength of 0.00, corresponding to non-detectable promoter activity (as described by
  • a strong promoter is one having a measured strength of >0.65 on this Anderson scale.
  • RBS Ribosomal Binding Site is a sequence of nucleotides upstream of the start codon of an mRNA transcript that is responsible for the recruitment of a ribosome during the initiation of protein translation .
  • Small molecule library library of small organic molecules; preferably having a mol mass of less than 900 daltons; for example libraries available from Asinex (www.asinex.com).
  • a first embodiment of the invention provides a method for screening a molecule or library of molecules (in particular small molecules) or polypeptides for a candidate molecule or polypeptide capable of regulating CRISPR-Cas associated activity.
  • the regulator may be an inhibitor or an enhancer of CRISPR-Cas associated activity.
  • the method is based on an in vivo assay employing cells; where the cells comprise genes encoding and capable of expressing the essential components required for CRISPR-Cas associated activity ( Figure 1).
  • the method of the invention has broad application, since it can be used for screening for regulators of all types of CRISPR-Cas systems having CRISPR-Cas associated activity.
  • CRISPR-Cas systems encoded by a CRISPR locus, find their origin in adaptive immune systems in a wide range of archeae and bacteria, they share a common functional property of cleaving a reporter DNA molecule or its transcript; and thereby allowing genetic material to be added, removed, or altered at particular locations in the genome of a biological cell or in a DNA molecule.
  • a CRISPR locus consists of a CRISPR array, comprising short variable DNA sequences (called spacer sequence(s)) interspersed by short palindromic repeat sequences (called repeat sequence(s)), this being flanked by diverse cognate cas genes.
  • the CRISPR-Cas systems have been classified into 2 classes; whereby members of Class 1 comprise a multi-subunit subunit protein complex, each subunit encoded by a gene in the CRISPR locus; while members of Class 2 comprise a single multidomain protein encoded by a single gene.
  • Class 1 includes type I, III and IV CRISPR-Cas systems; while Class 2, includes type II, V and VI CRISPR-Cas systems.
  • Class 1 includes type I, III and IV CRISPR-Cas systems; while Class 2, includes type II, V and VI CRISPR-Cas systems.
  • Class 1 includes type I, III and IV CRISPR-Cas systems
  • Class 2 includes type II, V and VI CRISPR-Cas systems.
  • there are six types which have further been classified into over 28 subtypes; based on their protein structure, signature Cas genes and operon arrangements (Koonin, E. V. et al . , 2017).
  • the loci comprising Class 1, type I CRISPR-Cas systems share in common a signature gene cas3 (or its variant cas3 ' ) encoding a nuclease that both unwinds and cleaves target double-stranded DNA (dsDNA) and RNA-DNA duplexes.
  • Type I systems comprise seven subtypes, I-A to I-F, and I-U.
  • the loci comprising Class 1, type III CRISPR-Cas systems all contain the signature gene caslO, as well genes encoding Cas5 gene and several paralogues of Cas7 that co-transcriptionally target RNA and DNA.
  • Type III systems comprise four subtypes, III-A to III-D.
  • the loci comprising Class 1, type IV CRISPR-Cas system comprise two subtypes, IV-A and IV-B; where the Csfl can serve as as signature gene.
  • the loci for the Class 2, type II CRISPR-Cas systems comprise the signature Cas9 gene encoding a multidomain protein that combines the functions of a crRNA-effector complex and cleaves target DNA. All Type II loci are characterised by a nucleic acid sequence encoding a tracrRNA, which is partially complementary to the repeats within the respective CRISPR array. Type II systems comprise three subtypes, II-A to II-C. Class 2, type V CRISPR-Cas systems comprise the signature gene Cpfl, also encoding a single multidomain protein. Unlike Cas9, the Cpfl gene is commonly located outside a CRISPR-Cas locus. Type V systems are currently divided into five subtypes, V-A to V-E.
  • Class 2 typell CRISPR-Cas systems For the purpose of expression of members of the Class 2 typell CRISPR-Cas systems in a cell, it is sufficient to express the signature Cas gene encoding the multidomain protein having RNA guided-endonuclease activity. Expression of all other Class 1 and 2 CRISPR-Cas systems can be achieved by co-expressing the several genes encoding the Cas proteins that constitute the multi-subunit subunit protein complex having RNA guided-endonuclease activity.
  • cognate single or multi-subunit Cas protein(s) effector complex
  • Processing of the pre-crRNA is mediated by either an endonuclease subunit of the effector complex; or in the case of type II CRISPR-cas system by a combination of a RNase III protein together with a trans-activating CRISPR RNA (tracrRNA); where tracrRNA serves to recruit the components of the type II effector complex.
  • endonuclease subunit of the effector complex or in the case of type II CRISPR-cas system by a combination of a RNase III protein together with a trans-activating CRISPR RNA (tracrRNA); where tracrRNA serves to recruit the components of the type II effector complex.
  • tracrRNA trans-activating CRISPR RNA
  • an individual mat-crRNA comprising at least one spacer sequence, flanked at least at one end (preferably at both ends) by a palindromic repeat sequence, is sufficient to specifically guide the cognate effector complex to cleave a reporter nucleic acid molecule (DNA or RNA).
  • a complete CRISPR array pre-crRNA
  • the reporter nucleic acid molecule comprises a sequence that is complementary to the spacer sequence.
  • a tracrRNA is additionally necessary of the expression of a type II CRISPR-cas system.
  • Synthetic DNA molecules can conveniently be engineered to express a RNA transcript comprising a spacer sequence flanked, at least at one end, by a palindromic repeat sequence (i.e. a mat-RNA), for use in the expression of the Class 1 or 2 CRISPR-Cas systems.
  • a palindromic repeat sequence i.e. a mat-RNA
  • the RNA transcript of the synthetic DNA molecule can comprise a a mat-RNA fused with a tracrRNA sequence, also called single-guide RNA (sgRNA).
  • a second embodiment of the invention provides cells for use in screening molecules or polypeptides (or a library thereof) for a candidate molecule or polypeptide capable of regulating CRISPR-Cas associated activity ( Figure 1).
  • the cells, employed in the method of screening comprise a genetic circuit designed for detecting molecules or polypeptides for regulators (inhibitors or enhancers) of CRISPR-Cas associated activity.
  • the genetic circuit is engineered to operate in a host biological cell and provide an in vivo based screen for molecules or polypeptides having anti-CRISPR associated activity.
  • the genetic circuit comprises genes or cluster of genes encoding a RNA-guided CRISPR-Cas system and its cognate RNA(s); wherein said CRISPR-Cas system comprises either a multi-subunit subunit protein complex (Class 1), each subunit being encoded by a gene derivable from (or present in) the corresponding native CRISPR locus; or a single multidomain protein (Class 2) encoded by a single gene.
  • the genetic circuit further comprises a reporter gene that functions as a marker for CRISPR activity.
  • Each gene or gene cluster of the genetic circuit, present in a cell is functionally-linked to a promoter and RBSs.
  • the biological cell may further comprise a cis element (DNA) in combination with the promoter for regulating the expression of one or more of the genes or gene clusters of the genetic circuit.
  • a cis element DNA
  • Each of said one or more of said genes or gene clusters of the genetic circuit may be located on an episome or on the cell's chromosome.
  • the genes and DNA molecules encoding the components of the Class 1 or 2 CRISPR-Cas systems and their respective promoters are cloned into a suitable host cell; more preferably integrated into plasmids.
  • These genes and DNA molecules may be heterologous in origin with respect to the host cell; and are preferably heterologous with respect to their cognate promoter.
  • the host biological cell is preferably a bacterial cell.
  • the cognate RNA serves to guide the system to a reporter gene, such that its endonuclease activity cleaves and inactivates the reporter gene or its transcript. Loss of expression of the inactivated reporter gene is detectable in a cell comprising the genetic circuit.
  • a molecule or polypeptide that is capable of preventing inactivation of the reporter gene in a cell comprising the genetic circuit is detectable, either by means of detection of a product of the reporter gene itself, or by the survival and growth of cells that continue to express the reporter gene under selective growth conditions.
  • Cells comprising the genetic circuit are both suitable for screening molecules entering the cell, as well as polypeptides that are recombinantly expressed within the cell.
  • Non-host cell DNA libraries encoding polypeptides can be introduced into recombinant cells comprising the genetic circuit and rapidly screened for regulators of CRISPR-Cas associated activity. I Method for in vivo detection of small molecules or polypeptides that act as regulators of CRISPR-Cas associated activity
  • step (b) culturing the cells obtained from step (a) (I) or (II) on said defined culture medium for a period sufficient for uptake or expression of said a candidate regulator of CRISPR-Cas associated activity;
  • step (b) culturing the cells obtained from step (b) on said defined culture medium
  • said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity
  • step (c) detecting expression of a reporter gene present in cells obtained from step (c), wherein said recombinant biological cells comprise:
  • a first gene encoding a polypeptide or a first gene cluster encoding a polypeptide complex, said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said gene or gene cluster is operably linked a first inducible promoter having a measured strength of >0.65 on the Anderson scale, or a combination of a first promoter having a measured strength of >0.65 on the Anderson scale and a first cis element conferring inducible expression, wherein said inducible expression is induced by at least one of said one or more agents;
  • a reporter gene operably linked to a constitutive promoter
  • nucleic acid molecule encoding any one of:
  • pre-crRNA molecule encoded by a DNA molecule comprising a CRISPR array
  • pre-crRNA comprises a spacer sequence complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript
  • a mat-crRNA molecule comprising at least one spacer sequence flanked by at least one palindromic repeat sequence, wherein the spacer sequence is complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript;
  • nucleic acid molecule is operably linked a second inducible promoter having a measured strength of >0.65 on the Anderson Scale, or a combination of a second promoter and second cis element conferring inducible 10 expression, wherein said inducible expression is induced by at least one of said one or more agents;
  • palindromic repeat sequence and the trans-activating crRNA are cognate with respect to the polypeptide or polypeptide complex having RNA- 15 guided endonuclease activity;
  • RNA-guided endonuclease activity is capable of inactivating said heterologous reporter gene or its transcript.
  • said reporter gene is comprised on a plasmid devoid of genes capable of expressing CRISPR-Cas associated activity.
  • the first gene or first gene cluster encoding a polypeptide or polypeptide complex, respectively having RNA-guided endonuclease activity may be selected from any gene or gene cluster encoding cas protein(s) having CRISPR associated activity.
  • cas proteins or cas protein complexes suitable for expression in 25 recombinant cells of the invention include those belonging to the Type 1, Type II, Type III; Type IV, Type V, and Type VI and their subtypes.
  • the first gene or first gene cluster and its respective encoded polypeptide or polypeptide complex will be heterologous in origin with respect to both the promoter to which it is operably linked, and to the recombinant biological cell in which it is expressed.
  • Examples of the 30 components of cas proteins or cas protein complexes of the various RNA-guided CRISPR-Cas system are listed in Table 1 below together with their DNA coding sequences and respective amino acid sequences:
  • N/A not applicable or required; * non-essential Cas component of Type IB.
  • the nucleic acid molecule in the recombinant biological cell may comprise a sequence encoding a mature crRNA (mat-crRNA) comprising at least one spacer sequence 5 flanked at least at one end (preferably at both ends) by a palindromic repeat sequence.
  • the spacer sequence is complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript.
  • This complementary sequence guides the polypeptide or polypeptide complex having RNA- guided endonuclease activity to the site of cleavage in the reporter gene or its 10 transcript. Accordingly, the complementary sequence of the spacer is selected such that cleavage of the reporter gene or its transcript prevents expression of the reporter gene or its transcript.
  • the nucleic acid molecule in the recombinant biological cell may further comprise a sequence encoding a trans-activating crRNA (tracrRNA).
  • tracrRNA trans-activating crRNA
  • the polypeptide or polypeptide complex having RNA-guided endonuclease activity selectively associates with its cognate palindromic repeat sequence and, where applicable also its cognate tracrRNA.
  • the cognate palindromic repeat sequence and cognate tracrRNA are encoded by the nucleotide sequences listed in Table 1.
  • the nucleic acid molecule in the recombinant biological cell comprises a sequence encoding a single RNA transcript comprising a fusion of the mat- crRNA and the tracrRNA.
  • the amino acid sequence of the polypeptide or polypeptides of a protein complex having RNA-guided endonuclease activity has at least 70, 75, 80, 81, 82, 83, 84, 85, 86, 87, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% amino acid sequence identity to an amino acid sequence selected from Table 1.
  • the at least one cognate palindromic repeat sequence is encoded by a nucleic acid sequence at the corresponding position in the list of Table 1; while the cognate trans-activating crRNA is encoded by a nucleotide sequence at the corresponding position in the list of Table 1.
  • the reporter gene in the recombinant biological cell encodes a reporter polypeptide, whose expression in the cell is detectable; for example either by virtue of the inherent properties of the polypeptide (e.g. a polypeptide with fluorescent properties, such as GFP), or by means of the catalytic activity of the polypeptide (e.g . a polypeptide that confers resistance to a growth inhibitor (e.g. a protein conferring resistance to an antibiotic such as tetracycline, kanamycin, neomycin, tobramycin, chloramphenicol, spectinomycin, gentamycin, ampicillin and carbenicillin, as exemplified in Example 1.1) or whose activity complements an auxotrophic phenotype (e.g .
  • a growth inhibitor e.g. a protein conferring resistance to an antibiotic such as tetracycline, kanamycin, neomycin, tobramycin, chloramphenicol, spectinomycin, gentamycin, amp
  • the reporter gene or its transcript is the target for the polypeptide or polypeptide complex having RNA-guided endonuclease activity.
  • the reporter gene comprises a protospacer adjacent motif (PAM) located adjacent to the sequence complementary to the mat-crRNA spacer sequence, in particular where the RNA- guided endonuclease polypeptide is a Type II A Cas (e.g. Cas9).
  • a canonical PAM is the sequence 5'-NGG-3' where "N" is any nucleobase.
  • RNA-guided endonuclease protein or protein complex and respective cognate pre-crRNA, mat-crRNA and/or tracrRNA molecules e.g. sgRNA
  • sgRNA tracrRNA molecules
  • the first inducible promoter, or a combination of the first promoter and the first cis element conferring inducible expression is both structurally and functionally different from the second inducible promoter, or a combination of the second promoter and the second cis element; such that expression of the protein or protein complex having RNA-guided endonuclease activity and its cognate pre-crRNA, mat-crRNA and/or tracrRNA molecules (e.g . sgRNA) can be regulated independently.
  • a suitable inducible promoter includes an IPTG-inducible T5 or T7 promoter.
  • inducible expression may be conferred by a promoter regulated by a cis element encoding a biosensor transcription factor, for example the araBAD promoter pBAD, whose expression is regulated by metabolites L-arabinose, and mevalonate (as exemplified in Example 1.1).
  • expression of a gene is regulated translationally by means by a constitutive promoter in combination with a cis element encoding a riboswitch, for example a theophyllin riboswitch (as described in Example 1.1).
  • the selectivity of the screening method is improved by employing a strong inducible promoter; for example a constitutive promoter selected from the Anderson promoter collection having a measured strength (activity) of >0.65, or 0.7 or above f http://parts.ioem.oro/Promoters/Cata loo/Anderson ' ) such as J23100 [SEQ ID No. : 172], J23101 [SEQ ID No. : 173], J23102 [SEQ ID No. : 174], and J23104 [SEQ ID No. : 175], having a measured strength of 1.00; 0.70; 0.86; and 0.72 respectively (see Examples 2.2 and 6).
  • a strong inducible promoter for example a constitutive promoter selected from the Anderson promoter collection having a measured strength (activity) of >0.65, or 0.7 or above f http://parts.ioem.oro/Promoters/Cata loo/Anderson ' ) such as J23100 [SEQ ID No. :
  • Expression of the reporter gene in a recombinant biological cell is preferably driven by a constitutive promoter, (e.g . Ptet promoter, without the repressor promoter or a sigma70 promoter from the Anderson promoter collection.
  • a constitutive promoter e.g . Ptet promoter, without the repressor promoter or a sigma70 promoter from the Anderson promoter collection.
  • Each of said first gene, first gene cluster, reporter gene and nucleic acid molecule may independently be a component of an episome that is stably replicated in the recombinant biological cell or an integrated component of its chromosome.
  • the episome is a plasmid comprising an origin of replication (ori) conferring a low copy number, such as pSClOl and pl5A described in Example 1.1 ( Figure 2).
  • the use of a low copy number plasmid is advantageous, since it reduces variability between the recombinant cells; which in turn enhances the stringency of the screening assay.
  • the functional operation of the method of the invention involves the inactivation of a reporter gene such that a reporter polypeptide is no longer expressed. Inactivation of the reporter gene by DNA cleavage also leads to a progressive elimination of the episome or chromosome comprising the reporter gene.
  • the reporter gene is located on a plasmid, separate from the location of the other first genes and nucleic acid molecule required for expression of the CRISPR associated activity.
  • the recombinant cells continue to express inducible CRISPR associated activity independently of the reporter gene, ensuring complete inactivation of the reporter gene, thereby limiting the frequency of false positives.
  • the recombinant biological cell is preferably bacterial, belonging to a genus selected from the group consisting of: Escherichia spp., (e.g. E. coli ); Bacillus spp., (e.g. B. subtilis ); Pseudomonas spp., (e.g . P. fluorescens, P. putida ); Corynebacterium spp., (e.g. C. giutamicum ); Agrobacterium spp., (e.g. A. tumefaciens ); Cauiobacter spp., (e.g. C.
  • the host biological cell is species of the genus Escherichia spp., in particular E. coli.
  • the method preferably comprises the following steps:
  • step (c) providing each compartment of step (c) with said defined culture medium and culturing for a period sufficient to induce uptake of a candidate regulator of CRISPR-Cas associated activity;
  • step (d) e. culturing the cells obtained from step (d) on said defined culture medium
  • said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity
  • step (d) detecting cells expressing the reporter gene in the one or more compartments of said multi-compartment container obtained in step (d). wherein said recombinant biological cells are according to the first embodiment as defined in section I.
  • the small molecule library is typically composed of small molecules, including peptidomimetics, natural or synthetic compounds, for example libraries of diverse macrocycle compounds.
  • the growth medium in step (d) is either solid or liquid; and its composition is chosen to be compatible with the properties of the reporter protein expressed the the reporter gene in the recombinant cells.
  • the reporter polypeptide is detectable by virtue of the inherent properties of the polypeptide (e.g. a polypeptide with fluorescent properties, such as GFP)
  • the reporter polypeptide is detectable by virtue of its catalytic activity (e.g. a polypeptide that confers resistance to a growth inhibitor (antibiotic resistance protein, as exemplified in Example 1.1) or whose activity complements an auxotrophic phenotype
  • the composition of the growth medium should facilitate the mode of detection.
  • the growth medium should be supplemented with growth restrictive concentrations of the same growth inhibitor.
  • the polypeptide complements an auxotrophic phenotype, the nutrient required for growth of the auxotroph should be excluded from the growth medium.
  • Potential regulators of CRISPR-Cas associated activity are identified by detecting recombinant cells expressing the reporter polypeptide in the one or more compartments of said multi-compartment container in step (e).
  • the means for detection is compatible with the properties of the reporter polypeptide.
  • fluorescence detection tools are available for detection of expressed fluorescent polypeptides; while detection of cell growth suffices to detect recombinant cells cultured on restrictive growth medium .
  • the method preferably comprises the following steps:
  • step (b) culturing the one or more aliquots of cells obtained from step (b) on said
  • step (c) culturing the cells obtained from step (c) on said defined culture medium wherein said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity, and
  • composition of the growth medium is one that facilitates the mode of selection provided by the reporter polypeptide, as described in section II.
  • potential regulators of CRISPR-Cas associated activity encoded by genes in the library of DNA fragments are identified by detecting recombinant cells expressing the reporter polypeptide, as described in section II.
  • the recombinant cells produced in step (b), that are cultured in step (c) are cultured on a growth medium that is solid, at a plating density that ensures the growth of single colonies.
  • a suitable plating density is about 0.5 to 1.5* 106 cells/cm2; which generates an overgrown plate during growth under non-selective growth medium.
  • Single colonies detected in step (d) can be selected; and the non-host DNA fragment cloned in a self-replicating plasmid of the library (and present in cells of the selected colony) can be isolated, amplified and sequenced in order to identify the candidate polypeptides as regulators of CRISPR-Cas associated activity.
  • a recombinant biological cell comprising a genetic circuit for detecting regulators of CRISPR-Cas associated activity
  • the invention according to the second embodiment provides recombinant bacterial cells comprising a genetic system for detecting a regulator of CRISPR-Cas associated activity ( Figures 1 and 2), wherein the genetic circuit preferably comprises:
  • a) a 1 st self-replicating genetic element comprising,
  • polypeptide complex said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said gene or gene cluster is operably linked a first inducible promoter having a measured strength of >0.65 on the Anderson Scale, or a combination of a first promoter having a measured strength of >0.65 on the Anderson Scale and a first cis element conferring inducible expression wherein said inducible expression is induced by at least one agent;
  • said 1 st self-replicating genetic element or said 2 nd self-replicating genetic element further comprises :
  • nucleic acid molecule encoding any one of:
  • pre-crRNA molecule encoded by a DNA molecule comprising a CRISPR array
  • pre-crRNA comprises a spacer sequence complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript
  • a mat-crRNA molecule comprising at least one spacer sequence flanked by at least one palindromic repeat sequence, wherein the spacer sequence is complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript;
  • polypeptide or polypeptide complex having RNA-guided endonuclease activity is capable of inactivating said heterologous reporter gene or its transcript
  • nucleic acid molecule is operably linked an second inducible promoter measured strength of >0.65, or a combination of a second promoter and second cis element conferring inducible expression, wherein said inducible expression is induced by at least one agent.
  • said reporter gene is comprised on a plasmid devoid of genes capable of expressing CRISPR-Cas associated activity.
  • the first gene or first gene cluster encoding a polypeptide or polypeptide complex, respectively having RNA-guided endonuclease activity include those belonging to the Type 1, Type II, Type III; Type IV, Type V, and Type VI and their subtypes; as exemplified in Table 1.
  • the first gene or first gene cluster and their respective encoded proteins are heterologous with respect to the host cell in which they are expressed.
  • the CRISPR associated protein is a Type II subtype A protein, in particular a Cas9 protein .
  • the CRISPR associated protein is preferably a class 2 type V system comprising the signature gene Cpfl encoding a Cpfl protein (e.g. WP_014550095.1).
  • the CRISPR associated protein is preferably a Class I type 1 system comprising the signature gene Cas3 encoding a Cas3 protein.
  • the nucleic acid molecule in the genetic circuit encodes a pre-crRNA or a mat-crRNA molecule comprising a palindromic repeat sequence that is cognate to the polypeptide or polypeptide complex having RNA-guided endonuclease activity.
  • the repeat and tracrRNA preferably have a cognate sequence listed in Table 1.
  • the nucleic acid molecule in the genetic circuit comprises a sequence encoding a single RNA transcript comprising a fusion of the gRNA and tracrRNA, and is heterologous with respect to the host cell in which it is expressed.
  • the first reporter gene encoding the selectable marker polypeptide in the 1 st self- replicating genetic element allows the selection of recombinant cells comprising the 1 st self-replicating genetic element and its maintenance therein .
  • the selectable marker polypeptide encoded by the first reporter gene and the reporter polypeptide encoded by the second reporter gene must be different, to allow plasmid selection and reporter detection to be independent of each other.
  • Suitable marker polypeptides include those conferring resistance to a growth inhibitor (for a protein conferring resistance to an antibiotic such as tetracycline, kanamycin, neomycin, tobramycin, chloramphenicol, spectinomycin, gentamycin, ampicillin and carbenicillin, as exemplified in Example 1.1).
  • heterologous gene encoding the selectable marker polypeptide is preferably driven by a constitutive promoter, (e.g . Ptet promoter, without the repressor promoter or a sigma70 constitutive promoter (http://parts.igem .org/Promoters/Catalog/Anderson).
  • the reporter gene in the genetic circuit encodes a reporter polypeptide having the same properties as the reporter gene in recombinant cells described in Section I.
  • Induced expression of CRISPR-Cas associated activity in recombinant cell comprising the genetic circuit leads to a rapid cleavage of the reporter gene or it transcript. For this reason, expression of the RNA-guided endonuclease protein or protein complex and its cognate gRNA in a host cell is tightly regulated. Inducible expression of the component genes in the genetic circuit is regulated in the same manner as the corresponding genes in the recombinant cells described in Section I.
  • the 1 st and 2 nd self-replicating genetic element may be any episome that is stably replicated in a host cell, for example a plasmid. Suitable plasmids for use in the genetic circuit are the same as the corresponding plasmids for use in the recombinant cells described in Section I.
  • Self-replicating genetic elements e.g. plasmids
  • methods for assembling the genetic circuit therein; and their subsequent cloning in a suitable host cell are illustrated in Example 1, using commercially available methods and tools that are known to those skilled in the art (see, e.g., Sambrook et al . , Molecular Cloning : A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989).
  • Suitable recombinant biological cells comprising the genetic circuit according to the second embodiment of the invention circuit of the invention may be obtained using the same host cells as those used to obtain the recombinant biological cells described in Section I.
  • the genetic circuit of the invention may be transformed into the chosen biological host cell employing standard transformation protocols, including electroporation as illustrated in Example 1.3.
  • V A recombinant biological cell library comprising a genetic circuit for detecting regulators of CRISPR-Cas associated activity
  • the invention provides a recombinant biological cell library of non-host DNA fragments wherein each cell of the library comprises the genetic circuit described in section IV.
  • the library is a metagenomics DNA library where the DNA fragments of metagenomic DNA are cloned into a self-replicating vector (e.g . plasmid or cosmid comprising an origin of replication).
  • a self-replicating vector e.g . plasmid or cosmid comprising an origin of replication.
  • the average size of the metagenomic DNA insert in a plasmid is about 1.5 kb; but using a cosmid can be up to about 40 kb.
  • expression of open reading frames in the cloned DNA fragments may be driven by endogenous promoters within the DNA fragments; a suitable constitutive promoter may be provided upstream of the DNA insert cloning site in the self-replicating vector.
  • the self-replicating vector into which the metagenomic DNA is cloned is provided with a gene encoding a selectable marker polypeptide, which is different from the marker polypeptides encoded by the reporter gene and the heterologous gene encoding a marker polypeptide in the genetic circuit.
  • the recombinant biological cell library is conveniently constructed in the recombinant bacterial cells comprising the genetic circuit according to the invention.
  • the DNA fragments of metagenomic DNA can be derived from human, cow and pig fecal samples and soil samples, and the library can be constructed as described in Example 4.1 and 4.2.
  • the efficiency of the method for functional screening for anti-CRISPR molecules is improved by employing biological cells comprising the genetic circuit according to the invention, wherein expression of the RNA guided CRISPR-Cas activity is controlled by a a strong promoter and inducible expression, and a selectable plasmid having the reporter gene (see section I, IV and Example 2).
  • the frequency of false positives is further reduced by using defined media to precisely regulate the expression of the genetic circuit. This is achieved by using the following method, which is suitable for screening either a metagenomics library encoding candidate anti-CRISPR proteins; or one or more isolated candidate anti-CRISPR proteins; or one or more isolated anti- CRISPR small molecules.
  • the method comprises 6 steps:
  • the medium is selective for recombinant biological cells comprising the gene or gene cluster encoding an RNA-guided endonuclease that confers said inducible CRISPR-Cas associated activity and the nucleic acid molecule encoding its cognate RNA molecule that is capable of guiding said RNA-guided endonuclease.
  • the medium comprises both nutrients essential for growth and antibiotics selective for cells maintaining the respective plasmids.
  • the defined medium is devoid of agents required for, or capable of, inducing expression of the genes or gene clusters encoding an RNA- guided endonuclease and its cognate RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript.
  • step 2) An aliquot of the cell culture produced in step 1) is inoculated into said defined growth medium comprising nutrients essential for growth to produce a cell suspension; said medium being both devoid of agents required for, or capable of, inducing expression of the genes or gene clusters encoding an RNA-guided endonuclease and its cognate RNA molecule; and devoid of antibiotics selective for cells maintaining the respective plasmids;
  • cells in the cell suspension produced in step 2) are either:
  • a cell suspension resulting from step 3) i) is incubated (preferably between 0.25 -2 h) to allow expression of candidate anti-CRISPR protein encoded by DNA molecules in the metagenomic library; or 3) ii) is incubated (preferably between 0.25 -2 h) to allow intracellular uptake of the candidate anti-CRISPR small molecule or candidate anti- CRISPR protein;
  • Agents are added to the incubated cell suspension(s) resulting from step 4; and the suspensions are incubated under conditions suitable for said agents to induce expression of the genes or gene clusters encoding an RNA-guided endonuclease and its cognate RNA molecule present in the biological cell provided in step 1); whereby said expressed endonuclease activity is capable of inactivating the reporter gene or its transcript present in the biological cell during said incubation (preferably between 1-3 h);
  • Cells derived from the incubated cell suspension(s) resulting from step 5) are plated on agar plates composed of a defined media comprising nutrients essential for growth and supplemented with agents required for, or capable of, inducing expression of the genes or gene clusters encoding an RNA-guided endonuclease and nucleic acid molecule(s) encoding its cognate RNA molecule(s).
  • the medium further comprises antibiotics selective for cells maintaining the respective plasmids.
  • the reporter gene or its transcript encodes an antibiotic resistance protein
  • the defined media agar is further supplemented with the respective antibiotic. The plates are incubated under suitable conditions for a time sufficient to detect viable biological cells.
  • Plasmid pCasens3 containing the Streptococcus pyogenes Cas9 gene (spCas9) [SEQ ID No. : 150] was constructed in a single step by using USER cloning (Genee et al. 2014).
  • a DNA fragment containing spCas9 gene [SEQ ID No. : 150] and its endogenous terminator [SEQ ID No. : 152] was PCR amplified from a plasmid DS-SPcas (supplied by addgene: Plasmid #48645) (Esvelt et al .
  • the plasmid pSEVA47 contains: the low copy number origin of replication pSClOl [SEQ ID No. : 155], and the antibiotic resistance gene aadA [SEQ ID No. : 153] that confers resistance against Spectinomycin and a restriction site for insertion of a gene.
  • a DNA molecule [SEQ ID No.
  • Plasmid pDual3 for arabinose inducible sgRNA expression was constructed by using USER cloning (Genee et al . 2014) to insert the respective genes into the backbone of the plasmid was from pSEVA3610, described by Martinez-Garcia et al., 2015.
  • the chimeric gRNA encoding sequence [SEQ ID No. : 161] with a TrrnB terminator [SEQ ID No. : 162] and a chloramphenicol resistance gene (CmR) [SEQ ID No. : 164] were cloned downstream of the inducible pBAD promoter [SEQ ID No. : 158] and araC gene [SEQ ID No.
  • plasmid comprises a low copy number origin of replication, pl5A [SEQ ID No. : 163] .
  • the first twenty nucleotide sequence of the encoded gRNA transcript is complementary to nucleotides 96 : 115 of the + strand in the CmR gene having [SEQ ID No. : 164]
  • Control plasmids for testing the genetic circuit include a negative control plasmid comprising a gfp gene [SEQ ID No. : 140] with rrnB T1 terminator [SEQ ID No. : 142] capable of expression of GFP (A); and a positive control plasmid comprising an acrlla2 gene [SEQ ID No. : 159] capable of expression of the anti-CRISPR protein, AcrIIA2, each under the control of a constitutive Ptet promoter (lacking the TET repressor) and RBS [SEQ ID No. : 139] (B). Both plasmids have a ColEl ori [SEQ ID No.
  • control plasmids are transformed into host cells, comprising the genetic circuit (i.e. pCasens3 and pDual3 plasmids as described in Example 1.1); the expression of the anti-CRISPR protein, AcrIIA2 is capable of inhibiting Cas9 mediated inactivation of the CmR gene.
  • the genetic circuit i.e. pCasens3 and pDual3 plasmids as described in Example 1.1
  • the expression of the anti-CRISPR protein, AcrIIA2 is capable of inhibiting Cas9 mediated inactivation of the CmR gene.
  • E.coli TOPIO were made electro-competent and transformed with the genetic circuit comprising the plasmids : pCasens3 and pDual3 (obtained in example 1.1) as follows : i. E.coli TOPIO cells (supplied by ThermoFischer) were cultured in 3 ml of 2xYT media for 16-18 hours at 37°C and 250 rpm;
  • the culture from (i) was inoculated 1 : 100 3 ml of pre-warmed 2xYT media; and cultured at 37°C at 250 rpm until the cell density reached 0.5-0.7 OD600; iii. the cultures from (ii) were cooled on ice; and the cells harvested by
  • the cells were washed by re-suspension in cold water and then harvested
  • microcentrifuge tubes to which a 1 m I sample of pCasens3 and pDual3 DNA was added; and this DNA-cell mixture was then transferred to a cold cuvette and electroporated with a pulse having 1.8 kv, 200 ohms and 25 pF.
  • the recovered transformed cells were diluted 10 L 0 - 10 L 8; plated on
  • selective solid media comprising Luria Broth agar supplemented with : 50 pg/mL spectinomycin; 30 pg/mL chloramphenicol and 50 pg/mL kanamycin; and incubate overnight at 37°C.
  • Transformed cell colonies comprising the pCasens3 and pDual3 plasmids (£. coli genetic circuit [E. coli-GC ]) were selected; amplified and stored.
  • E.coli TOPIO containing the genetic circuit comprising the plasmids : pCasens3 and pDual3 were made electro-competent and transformed with either the negative control plasmid expressing GFP or positive control comprised a plasmid expressing AcrIIA2, as described in Example 1.2, employing the transformation protocol described in Example 1.3, but where the host cells were E. coli-GC.
  • Example 2 Optimisation of the genetic circuit
  • RNA-guided endonuclease The use of host cells comprising the genetic circuit to screen for regulators of RNA- guided endonucleases relies on the stringency of the genetic circuit; such that the activity of a candidate regulator can be clearly detected.
  • the importance of regulation of the expression of the RNA-guided endonuclease was investigated.
  • the pCasens3 plasmid was modified to facilitate the comparison of 3 plasmids, differing only in the promoter region operably linked to the Cas9 gene. These modifications allowed an analysis of the impact of both the strength and on-off regulation of the promoter to be assessed.
  • E.coli TOPIO cells Three samples of E.coli TOPIO cells were transformed (as described in Example 1.3) with the genetic circuit comprising the pDual3 plasmid together with one of 3 versions of the pCasens3 plasmid; and the transformed cells were then duplicate plated on LB agar comprising the antibiotics (50 pg/mL spectinomycin; 30 pg/mL chloramphenicol and 50 pg/mL kanamycin); where one set of plates was further supplemented with 1% arabinose to induce sgRNA expression (alone) or together with 2mM theophylline for inducible Cas9 expression.
  • the antibiotics 50 pg/mL spectinomycin; 30 pg/mL chloramphenicol and 50 pg/mL kanamycin
  • E.coli TOPIO cells transformed with genetic circuits comprising the pDual3 plasmid and each of the 3 versions of the pCasens3 plasmid, are dependent on expression of a functional camR gene in order to grow on the LB plates comprising chloramphenicol. Effective inactivation of the camR gene, by the Cas9-sgRNA can be determined by detecting the loss of cell growth under these conditions.
  • E.coli TOPIO cells comprising the first version of the pCasens3 plasmid comprising a Cas9 gene regulated by a medium strength constitutive promoter (ProC promoter [SEQ ID No. : 167]; Esvelt, K. M., 2013) (see Figure 4) showed slower growth than cells comprising the second and third versions of the pCasens3 plasmid when grown on growth medium lacking both theophylline and arabinose required for inducing expression of the Cas9 and sgRNA genes respectively.
  • the observed growth depression may be attributable to a potential fitness cost associated with the expression of Cas9, since it is a large protein (> 1300 amino acids in length).
  • E.coli TOPIO cells comprising the second version of the pCasens3 plasmid comprising a Cas9 gene regulated by the medium strength promoter (ProC promoter [SEQ ID No. : 167 ] in combination with the inducible theophylllin riboswitch (see Figure 4) showed a faster growth than the first version, which correspondingly may be attributed to the "off" state of cas9 expression, and absence of its fitness cost.
  • the log reduction in growth was limited, indicating that the medium strength promoter following induction, failed to drive sufficient expression of the Cas9 gen in order to obtain optimal inactivation of the camR gene.
  • E.coli TOPIO cells comprising the third version of the pCasens3 plasmid comprising a Cas9 gene regulated by the strong promoter (BBa J23100) combined with the inducible theophylllin riboswitch (see Figure 4) also showed a faster growth than the first version, which correspondingly may be attributed to the "off" state of cas9 expression, and absence of its fitness cost. Additionally, when plated on media inducing expression of the Cas9-sgRNA, the log reduction in growth was substantial, indicating that a tighly regulated and strong promoter controlling expression of the Cas9 gene is important in order to obtain optimal inactivation of the camR gene; and an effective screening genetic circuit.
  • E.coli TOPIO cells comprising a fourth version of the pCasens3 plasmid comprising a Cas9 gene operably linked a non-regulated strong constitutive promoter (BBa J23100) is additionally tested to confirm the importance of promoter regulation.
  • BBa J23100 non-regulated strong constitutive promoter
  • Optimised genetic circuit locates reporter gene for CRISPR-Cas on a separate plasmid
  • the effect of the genetic location of the reporter gene on the efficacy of CRISPR-Cas is tested as follows:
  • the pCasens3 plasmid is modified by insertion of the arabinose inducible sgRNA expression cassette, comprising the chimeric gRNA encoding sequence [SEQ ID No. : 161] with a TrrnB terminator [SEQ ID No. : 162] cloned downstream of the inducible pBAD promoter [SEQ ID No. : 158] and araC gene [SEQ ID No. : 159] encoding an L-arabinose-inducible transcription factor, to create pCasens4.
  • a chloramphenicol resistance gene (CmR) [SEQ ID No. : 164] is inserted into a second plasmid having the low copy number origin of replication, pl5A [SEQ ID No. : 163] to create pCmR.
  • E.coli TOPIO cells transformed either pCasens4 and pCmR, or pCasens3 and pDual3 (see Example 1.1) are cultivated under aerobic conditions in M9 minimal medium (http://cshprotocols.cshlp.Org/content/2010/8/pdb. recl2295.short), supplemented with the antibiotics (spectinomycin 50ug/mL and chloramphenicol 30ug/mL) and the two culturare incubated at 37°C; 250RPM .
  • M9 minimal medium http://cshprotocols.cshlp.Org/content/2010/8/pdb. recl2295.short
  • the antibiotics spectinomycin 50ug/mL and chloramphenicol 30ug/mL
  • the two transformed cell suspensions are then supplemented with 2mM theophylline; 1% arabinose; and incubated at 37°C; 250RPM for a period of ⁇ 2h, to allow induced expression of the RNA-guided CRISPR- Cas genes, and inactivation of the CmR reporter gene.
  • serial dilutions of the cells are plated in solid M9-agar plates supplemented with 50 ug/ml Spectomycin; 50 ug/ml Kanamycin; 30ug/ml chloramphenicol; and the inducing agents, 2mM theophylline; 1% arabinose.
  • reporter gene in this example the CmR reporter gene
  • the reporter gene is located on a plasmid, separate from the location of the other first genes and nucleic acid molecule required for expression of the CRISPR associated activity this serves to enhance inactivation of the reporter gene, thereby limiting the frequency of false positives.
  • Example 3 Use of host cells comprising the genetic circuit to screen for anti- CRISPR molecules
  • E. coli-GC cells transformed with additional plasmids were employed as controls: E. coli-GC cells acting as a negative control comprised a plasmid expressing GFP; and E. coli-GC cells acting as a positive control comprised a plasmid expressing AcrIIA2.
  • E. coli-GC (see Example 1.2) were prepared in filtered sterilized minimal media supplemented with 50 pg/mL spectinomycin and 30 pg/mL chloramphenicol to maintain the genetic circuit, and comprising glucose as the sole carbon source.
  • the measured OD600 values were normalized to the positive and negative controls in each plate to determine a normalized percent of inhibitory activity of each compound .
  • Z scores per plate per replicate were calculated, based on the normalized OD600 values. A positive hit was given for a Z score above 2.5.
  • E. coli-GC host cells transformed with the negative control expressing GFP ( Figure 3A); and the positive control plasmid expressing AcrIIA2 ( Figure 3B and 6B), were screened for anti-CRISPR activity as follows :
  • 1 st set of plates used to determine cell death comprised LB agar supplemented with : 2mM theophylline; 1% arabinose and antibiotics (50 pg/mL
  • Host cells comprising the genetic circuit have been screened using a wide range of small molecules, including peptidomimetics, natural or synthetic compounds, for example libraries of diverse macrocycle compounds.
  • Circa 20 positive hits were obtained from screening a libra ry of diverse macrocycles (Asinex Macrocycle libra ry) providing evidence that the screening method is suitable for detecting molecules having anti-CRISPR activity.
  • the sheared DNA (200mI) was size-selected by separation on a 1 % agarose gel with DNA dye and DNA migrating in the 3-6 kb region was extracted using a Qiagen gel extaction kit;
  • the library was amplified by inoculation into 10 ml LB+Kan 50 and grown over night; 5 ml 50 % glycerol is added and the library is aliquoted into 1 ml portions and frozen at -80 C.
  • plasmid DNA comprising the amplified metagenomic library was extracted and recovered using Machery-Negel plasmid DNA purification kit as follows :
  • An overnight culture of bacteria comprising the metagenomic library was grow in 3 ml LB medium supplemented with 50 pg/mL kanamycin;
  • the culture was centrifuged, and the pelleted bacteria were resuspend the bacteria in buffer, to which a denaturing solution was added;
  • plasmid-containing supernatant was recovered and purified through the Nucleospin ® plasmid column on the kit; and bound plasmid DNA was eluted from the column using water or a neutral buffer such as Tris : EDTA, to provide a plasmid DNA comprising the metagenomic DNA expression library.
  • E.coli TOP10 containing the genetic circuit comprising the plasmids: pCasens3 and pDual3 (E. coli-GC) were made electro-competent and transformed with plasmids for expression of each metagenomic library (obtained in example 4.1) as follows:
  • pCasens3 and pDual3 were cultured in 3 ml of 2xYT media with 50 pg/mL spectinomycin and 30 pg/mL chloramphenicol for 16-18 hours at 37°C and 250 rpm;
  • the culture from (i) was inoculated 1 : 100 in 3 ml of pre-warmed 2xYT media with 50 pg/mL spectinomycin and 30 pg/mL chloramphenicol; and cultured at 37°C at 250 rpm until the cell density reached 0.5-0.7 OD600;
  • Colonies growing on the plates were collected by adding 1 ml H 2 0, after which the colonies were scraped off the plate with a sterile loop. The bacterial cells were then pelleted and washed in water by centrifugation and the plasmid DNA in collected bacterial cells was extracted using a Machery-Negel plasmid DNA purification kit.
  • Example 5 A genetic circuit for screening for anti-CRISPR molecules of Class 1 (Type 1-B) and Class 2 (Type II-A and V-A) CRISPR-Cas systems
  • the genetic circuit for use in screening for anti-CRISPR molecules comprising gene(s) encoding a Class 1 or 2 CRISPR-Cas system, in combination with a nucleic acid molecule encoding a cognate guide RNA (cognate palindromic repeat sequence and as required a tracRNA) were validated, as demonstated below.
  • the cognate RNA was expressed from the pDual plasmid which comprises a gene encoding cognate guide RNA targeting the chloramphenicol resistance gene (CmR), under the control of an arabinose-inducible promoter, a chloramphenicol resistance gene (cat), and a pl5a origin of replication (see right-hand pDua!3 plasmid in Figure 2).
  • CmR chloramphenicol resistance gene
  • cat chloramphenicol resistance gene
  • pl5a origin of replication see right-hand pDua!3 plasmid in Figure 2.
  • type I CRISPR-Cas systems are composed of a multi-subunit subunit protein complex, each subunit encoded by its respective gene.
  • the endogenous promoter of the type I-B CRISPR-Cas operon was replaced with the IPTG inducible strong Ptac promoter.
  • type II CRISPR-Cas systems comprise a single multidomain protein encoded by a single gene.
  • the type II-A CRISPR-Cas gene from Streptococcus thermophilus encoding the S. thermophilus Cas9 protein (SEQ ID No: 168), was cloned in a plasmid ( Esvelt et a I . , 2013), such that expression of Cas9 was placed under control of an
  • IPTG-inducible Ptac promoter creating plasmid pTypell-A-Cas.
  • type V CRISPR-Cas systems also comprise a single multidomain protein encoded by a single gene.
  • the type V-A CRISPR-Cas from Franciseiia novicida strain ul l2 comprises a Casl2 gene (formerly designated Cpfl), encoding the Franciseiia novicida Casl2 effector protein (WP_014550095.1; SEQ ID No : 91), was cloned in a medium copy number plasmid having a cloDF13 origin of replication, and
  • aadA gene spectinomycin resistance (aadA gene) (as described by Zetsche et al ., 2015).
  • the plasmid was modified to place expression of Casl2 gene under control of the strong constitutive promoter J23100 [SEQ ID No. : 172] (as defined by
  • E.coli TOPIO were made electro-competent and transformed with plasmids comprising :
  • pDual5 comprising Cognate palindromic repeat SEQ ID No. : 169 and tracRNA gene having SEQ ID No : 170
  • pTypeV-A-Cas comprising coding sequence for Casl2; SEQ ID No : 91
  • pDual6 comprising Cognate palindromic repeat SEQ ID No. : 125
  • Transformants were cultured overnight at 37°C with shaking (250RPM) in a growth medium supplemented with antibiotics (spectinomycin 50ug/mL, kanamycin 50ug/mL, and chloramphenicol 30ug/mL); but devoid of inducers of CRISPR-Cas expression.
  • antibiotics spectinomycin 50ug/mL, kanamycin 50ug/mL, and chloramphenicol 30ug/mL
  • the resulting cultures were serially diluted from 10 1 to 10 7 and then plated on LB-agar with and without chloramphenicol.
  • the frequency of false positives detected using the method for functional screening for anti-CRISPR molecules is reduced by employing biological cells comprising the genetic circuit according to the invention, wherein expression of the RNA guided CRISPR-Cas activity is controlled by a inducible strong promoter, and a selectable plasmid having the reporter gene, in combination with the use of a defined growth medium, such that induction of RNA guided CRISPR-Cas activity and reporter gene selection is retricted to defined steps of the method .
  • the method (with reference to Figure 12) was tested by:
  • E. coli cells comprising the genetic circuit on pCasens3 and pDual3; by cultivating the cells under aerobic conditions in M9 minimal medium (http://cshprotocols.cshlp.Org/content/2010/8/pdb. recl2295.short), supplemented with the antibiotics (spectinomycin 50ug/mL and chloramphenicol 30ug/mL) and incubating the culture at 37°C; 250RPM without inducers(figure 12, step 1). Since the medium is devoid of the inducing agents, theophylline and arabinose, E. coli cells comprising the genetic circuit are selected without inducing expression of the RNA- guided CRISPR-Cas genes.
  • step 2) An aliquot of the cell culture produced in step 1) is inoculated into M9 minimal medium; said medium being both devoid of the inducing agents (theophylline and arabinose) and antibiotics (spectinomycin and chloramphenicol) to produce a cell culture.
  • inducing agents theophylline and arabinose
  • antibiotics spectinomycin and chloramphenicol
  • Cells derived from the cell culture produced in step 2, are transformed with the control plasmid comprising a gene [SEQ ID No. : 140] encoding Green Fluorescent Protein (GFP) under the control of a constitutive Ptet promoter [SEQ ID No. : 139] .
  • GFP Green Fluorescent Protein
  • Transformed cells obtained in step 3, are incubated at 37°C for a period of ⁇ lh, in M9 minimal medium, sufficient to allow expression of the GFP protein, and providing a transformed cell culture.
  • step 4 The transformed cell culture obtained in step 4 is supplemented with 2mM theophylline; 1% arabinose; and the suspensions are incubated at 37°C; 250RPM for a period of ⁇ 2h, to allow induced expression of the RNA-guided CRISPR-Cas genes, and inactivation of the CmR reporter gene in cells. Since the growth medium is devoid of chloramphenicol, growth conditions are non-selective for functional expression of this reporter gene.
  • Cells derived from the incubated cell suspension resulting from step 5) are plated on a defined media agar supplemented with both 2mM theophylline; 1% arabinose;and the antibiotics: spectinomycin 50ug/mL, kanamycin 50 pg/mL, and chloramphenicol 30ug/mL) and incubated at 37°C for a time sufficient to detect viable biological cells.
  • the effect of the strength of the promoter used to control inducible expression of RNA guided CRISPR-Cas activity on the efficiency of inactivation of the reporter gene (camR) gene cleavage, measured as cell survivors, and corresponding reduction in false-positives using the screening assay is demonstated in Figure 14.
  • the three types of cells tested comprised the CRISPR-Cas9 genetic cicuit, differing only in that their respective Cas9 gene was operably linked to a promoter having promoter strengths of 0.16; 0.58 or 1.00 on the Anderson scale, each combined with a theophylline riboswitch 5' of the cas9 gene.
  • the cells were tested according to steps 1, 2, 5 and 6, and omitting transformation steps 3 and 4 ( Figure 12).

Abstract

The invention relates to a method for in vivo detection of small molecules or polypeptides that act as regulators of CRISPR-Cas associated activity, in particular the method is for use in the screening of molecule libraries and non-host DNA libraries. The method employs cells comprising genes expressing CRISPR-Cas associated activity and a reporter protein to allow the detection of said regulators. The invention further relates to cells comprising genes expressing of CRISPR-Cas associated activity and a reporter protein for use in said in vivo screening and detection methods.

Description

TITLE: A high-throughput platform to select for regulators of CRISPR-Cas associated activity
Technical field of the invention
The invention relates to a method for in vivo detection of small molecules or polypeptides that act as regulators of CRISPR-Cas associated activity, in particular the method is for use in the screening of molecule libraries and non-host DNA libraries. The method employs cells comprising genes expressing CRISPR-Cas associated activity and a reporter protein to allow the detection of said regulators. The invention further relates to cells comprising genes expressing of CRISPR-Cas associated activity and a reporter protein for use in the in vivo screening and detection methods.
Background of the invention
CRISPR-Cas systems originate from bacteria where they function as adaptive immune systems that cleave foreign DNA or RNA in a sequence specific manner. Components of type II CRISPR-Cas systems, such as Cas9 from Streptoccocus pyogenes, have been exploited for precise and programmable gene editing. Given the significant impact of CRISPR-Cas systems, efforts have been made to discover new systems in cultured and recently uncultured bacterial species. CRISPR-Cas technologies represent effective biotechnological tools that broadly impact life sciences. Applications range from treating human genetic diseases to eliminating pathogens using gene-drives. Beyond genome editing, CRISPR systems have several other applications, including programmable transcriptional regulators and ultra-sensitive detection devices. However, there is increasing concern about the safety and precision of CRISPR-Cas. Indeed, the 2016 worldwide threat assessment of U.S. intelligence listed genome editing technologies as a potential weapon of mass destruction and proliferation. Accordingly, more research and tools are needed to control the CRISPR-Cas technology in order to reduce potential risks of genome editing, or to contain their spread and potential harm to other organisms.
A promising method to control CRISPR activity is via anti-CRISPR (ACR) proteins. These naturally occurring inhibitors can potentially inactivate CRISPR systems when needed, or decrease off-target effects, without significantly affecting on-target efficiency. ACRs were initially discovered for type I CRISPR systems. Subsequent bioinformatic mining identified ACR activity against type II-C Cas9 from Neisseria meningitidis and type II-A Cas9 from Streptococcus pyogenes (spCas9). More recently an additional ACR against spCas9 was identified by cloning and testing multiple genes from a phage that was able to escape CRISPR-based immunity from Streptococcus thermophillus, reflecting the biological arms race occurring between phages and CRISPR systems. Given the abundance of CRISPR systems in bacteria as well as the abundance of uncharacterized phages, it is likely that we currently have only revealed a minute proportion of ACRs in the environment. Discovery of novel genes with this functionality is, however, complicated both by a lack of abundant reference ACR families as well as the limited availability of phage genomic information . Accordingly, there exists a need for better methods and genetic tools for identifying further ACRs, both those found in nature as well as small molecules exhibiting the functional properties of ACRs.
Summary of the invention
A first embodiment of the invention provides a method for in vivo detection of regulators of CRISPR-Cas associated activity comprising :
a. providing cells capable of inducible CRISPR-Cas associated activity wherein said cells are prepared on a defined medium devoid of one or more agents for inducing expression of said activity, and either
I. contacting said cells with a molecule or library of molecules; or
II. introducing a self-replicating library of non-host DNA fragments into said cells; b. culturing the cells obtained from step (a) (I) or (II) on said defined culture medium for a period sufficient for uptake or expression of a candidate regulator of CRISPR-Cas associated activity; c. culturing the cells obtained from step (b) on said defined culture medium wherein said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity, and
d. detecting expression of a reporter gene present in cells obtained from step (c), wherein said cells comprise:
i. a first gene encoding a polypeptide or a first gene cluster encoding a polypeptide complex, said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said first gene or first gene cluster is operably linked a promoter having a measured strength of >0.65 on the
Anderson scale and whose expression is inducible by at least one of said one or more agents;
ii . a reporter gene operably linked to a promoter;
iii . a nucleic acid molecule operably linked to a promoter and whose expression is inducible by at least one of said one or more agents,
wherein said nucleic acid molecule encodes one or more RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript; and wherein the polypeptide or polypeptide complex having RNA-guided endonuclease activity is capable of inactivating said reporter gene or its transcript.
A second embodiment of the invention provides a biological cell comprising :
i. a first gene encoding a polypeptide or a first gene cluster encoding a polypeptide complex, said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said first gene or first gene cluster is operably linked to a promoter having a measured stength of >0.65 on the Anderson scale and whose expression is inducible by at least one agent;
ii. a reporter gene operably linked to a promoter,
iii. a nucleic acid molecule operably linked to a promoter and whose expression is inducible by at least one agent,
wherein said nucleic acid molecule encodes one or more RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript; and
wherein the polypeptide or polypeptide complex having RNA-guided endonuclease activity is capable of inactivating said reporter gene or its transcript.
A third embodiment of the invention provides for the use of a biological cell according to the invention, for detecting a regulator of CRISPR-Cas associated activity.
A fourth embodiment of the invention provides a biological cell library of non-host DNA fragments, wherein each cell of the library is a cell according to the invention, and wherein each cell further comprises a self-replicating genetic element comprising a) a non-host DNA fragment, and
b) a gene encoding a selectable marker polypeptide and operably linked to a promoter.
A fifth embodiment of the invention provides for a use of a biological cell library of non-host DNA fragments according to the invention, for detecting a regulator of CRISPR-Cas associated activity.
Description of the invention
Figures:
Figure 1 is a cartoon illustrating the method of the invention; comprising providing a biological cell comprising a gene or gene cluster encoding a protein or protein complex having RNA-guided endonuclease activity, a nucleic acid molecule encoding chimeric gRNA (sgRNA) or a CRISPR array, as well as a reporter gene encoding a selection marker, and then either contacting the cell with a molecule, or introducing a self- replicating library of non-host DNA into the cell, such as to allow detection of molecules or expressed polypeptides that inhibit inactivation of the reporter gene or its transcript by the RNA-guided endonuclease. Figure 2 is a cartoon illustrating the pCasens3 and pDual3 plasmids and interaction with a non-peptide molecule with anti-CRISPR activity. Plasmid pCasens3 (to the left) comprises: the Streptococcus pyogenes Cas9 gene (spCas9) [SEQ ID No. : 150] cloned into the backbone of the plasmid pSEVA47. Additionally, a sigma70 constitutive promoter (P23100) [SEQ ID No. : 148], adjacent to a DNA molecule encoding a theophylline translational riboswitch [SEQ ID No. : 149] were inserted upstream of the spCas9 gene in pSEVA47, in order to control its expression . pSEVA47 comprises a low copy number origin of replication, SC101 [SEQ ID No. : 155]; and the antibiotic resistance gene aadA [SEQ ID No. : 153] under the control of its native (constitutive) promoter and ribosomal binding site, conferring resistance against Spectinomycin. Plasmid pDual3 (to the right) comprises a chloramphenicol resistance gene (CmR) [SEQ ID No. : 164] operably linked to an upstream constitutive promoter and RBS; and a DNA sequence encoding a sgRNA [SEQ ID No. : 161] whose transcription/expression is regulated by the inducible pBAD promoter [SEQ ID No. : 158], located upstream of the respective sequences. The plasmid further comprises an araC L-arabinose sensor gene [SEQ ID No. : 159] that regulates the pBAD promoter in response to the metabolite, L-arabinose. pDual3 comprises a low copy number origin of replication, pl5A [SEQ ID No. : 163] .
Figure 3 is a cartoon illustrating (A) the negative control plasmid comprising a gene [SEQ ID No. : 140] encoding Green Fluorescent Protein (GFP) under the control of a constitutive Ptet promoter [SEQ ID No. : 139]; and (B) positive control plasmid comprising a gene [SEQ ID No. : 146] encoding AcrIIA2 protein under the control of a constitutive Ptet promoter [SEQ ID No. : 139] . Both negative and positive control plasmids further comprise a kanamycin resistance gene [SEQ ID No. : 144] under the control of its native (constitutive) promoter and ribosomal binding site and a colEl origin of replication [SEQ ID No. : 143] .
Figure 4 is a histogram showing the number of colony forming units (CFU. ml 1) of E. coli cells comprising the pDual3 plasmid in combination with 3 alternative versions of the pCasens3 plasmid; following cultivation on selective media comprising chloramphicol, where the sgRNA and Cas9 genes of the genetic circuit are induced as indicated with arabinose and theophylline respectively. The 3 forms of pCasens3 comprise a Cas9 gene operably linked to either a medium strength promoter of <0.50 on the Anderson scale (ProC); the medium strength promoter (ProC) together with a theophylline riboswitch 5' of the cas9 gene; or a strong promoter (BBa J23100 [SEQ ID No. : 172]) together with a theophylline riboswitch 5' of the cas9 gene.
Figure 5 shows (A) a cartoon illustrating the molecular mechanism whereby in E. coli cells comprising the Genetic Circuit (£. coli- GC), the genetic circuit is induced to express the cas9 gene and sgRNA, the formation of a Cas9-sgRNA complex, and the endonuclease-mediated cleavage of the reporter gene (Cat), leading to cell death when exposed to chloramphenicol . The cartoon further shows that co-expression of an acrIIA2 gene in the E. coli- GC cells can inhibit Cas9-sgRNA inactivation of the Cat gene, leading to cell survival under selective conditions. (B) is a histogram showing the number of colony forming units (CFU. ml 1) of E. coli- GC cells comprising a negative control plasmid (expressing GFP) or the positive control plasmid (expressing AcrIIA2) following cultivation on selective media, where the genetic circuit is either induced or non-induced.
Figure 6 is a cartoon illustrating (A) a metagenomics DNA expression plasmid comprising a fragment of (a library of) metagenomics DNA fragments cloned down stream of a constitutive Ptet promoter [SEQ ID No. : 139]; a kanamycin resistance gene under the control of its native (constitutive) promoter and ribosomal binding site; together with the genetic circuit comprising plasmids pCasens3 and pDual3; (B) a positive control plasmid comprising a gene encoding AcrIIA2 protein and the genetic circuit plasmids pCasens3 and pDual3.
Figure 7 is a cartoon showing use of host cells comprising the genetic circuit to screen a metagenomics DNA expression library for anti-CRISPR proteins.
Figure 8 is a histogram showing the number of colony forming units (CFU.ml 1) of E. coli- GC cells comprising selected metagenomics inserts, designated AC11 - AC42; as compared to a negative control plasmid (expressing GFP) or the positive control plasmid (expressing AcrIIA2) following cultivation on selective media, where the genetic circuit is induced.
Figure 9 is a histogram showing the number of colony forming units (CFU.ml-1) of E. coli- GC cells comprising plasmids pTypel-B-Cas in combination with pDual3; following sequential cultivation in growth media that are either non-inducing or inducing for Typel-B-Cas expression; and then on either non-selective or selective media comprising chloramphicol, wherein Typel-B-Cas expression was induced using 0 - lOOuM IPTG. Figure 10 is a histogram showing the number of colony forming units (CFU.ml 1) of E. coli- GC cells comprising plasmids pTypell-A-Cas in combination with pDual3; following sequential cultivation in growth media that are non-inducing or inducing for TypeII-A- Cas expression; and then on either non-selective or selective media comprising chloramphicol, wherein Typell-A-Cas expression was induced using 0 - lOOuM IPTG.
Figure 11 is a histogram showing the number of colony forming units (CFU.ml 1) of E. coli- GC cells comprising plasmids pTypeV-A-Cas in combination with pDual3; following sequential cultivation in growth media that are non-inducing or inducing for TypeV-A- Cas expression; and then on either non-selective or selective media comprising chloramphicol, wherein TypeV-A-Cas expression was induced using 2mM theophylline.
Figure 12 is a cartoon showing a method for functional screening for anti-CRISPR molecules using biological cells comprising the genetic circuit according to the invention that is suitable for screening : a metagenomics library encoding candidate anti-CRISPR proteins;an isolated candidate anti-CRISPR protein; and an isolated anti- CRISPR small molecule. The method comprises 6 steps:
1) Cultivation of biological cells, comprising the genetic circuit according to the invention, on a defined growth medium. The medium comprises nutrients essential for growth and when required one or more agents selective for maintenance of genes of the genetic circuit including the reporter gene, but devoid of agents required for, or capable of, inducing expression of the genetic circuit (i.e. the genes or gene clusters encoding an RNA-guided endonuclease and its cognate RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript).
2) An aliquot of the cell culture produced in step 1) is inoculated into said defined growth medium to produce a cell suspension; but wherein said medium is both devoid of agents required for, or capable of, inducing expression of said genetic circuit; and devoid of agents selective for maintenance of genes of the genetic circuit including the reporter gene;
3) cells in the cell suspension produced in step 2) are either:
i) transformed with a metagenomic library to produce a transformed cell suspension, or
ii) contacted with a candidate anti-CRISPR small molecule or candidate anti-CRISPR protein by their addition to said cell suspension;
4) the cell suspension resulting from :
- step 3) i) is incubated to allow expression of a candidate anti-CRISPR protein encoded by a DNA molecule in the metagenomic library; or from - step 3) ii) is incubated to allow intracellular uptake of the candidate anti-CRISPR small molecule or candidate anti-CRISPR protein;
5) Agent(s) capable of inducing expression of said genetic circuit are added to the cell suspension(s) resulting from step 4; and the suspensions are further incubated under conditions suitable for inducing expression of genes of the genetic circuit;
6) Cells derived from the cell suspension(s) resulting from step 5) are plated on agar plates composed of a defined media comprising nutrients essential for growth and supplemented with agents required for, or capable of, inducing expression of said genetic circuit and as required agents selective for maintenance of genes of the genetic circuit including the reporter gene. The plates are incubated under suitable conditions for a time sufficient to detect viable biological cells.
Figure 13 is a histogram showing the number of colony forming units (CFU .ml-1) of E. coli- GC cells comprising a CRISPR-Cas genetic circuit consisting of either: (1) pCasens3 (SP(II-A)) and pDUAL3; (2) pTypeV-A-Cas (FN(V-A) and pDual3; (3) pTypell-A-Cas (STl(II-A) and pDual3; or (4) pTypel-B-Cas (CD(I-B)) and pDual3; following incubation on liquid medium comprising either M9 minimal medium or yeast extract medium (2xYT); each medium with or without the addition of agents to induce expression of the respective CRISPR-Cas genetic circuit, and then respectively plated on agar medium comprising either M9 minimal medium or yeast extract (2xYT); each medium comprising chloramphenicol, and with or without addition of said inducing agents.
Figure 14 is a histogram showing the the number of colony forming units (CFU . ml 1) of E. coli cells comprising the pDual3 plasmid in combination with 3 alternative versions of the pCasens3 plasmid following culture in the presence of chloramphicol and in the presence, or absence, of arabinose and theophylline to simulateously induce expression of the sgRNA and Cas9 genes respectively. The 3 forms of pCasens3 plasmid comprise a Cas9 gene operably linked to either a weak promoter (J23116 [SEQ ID No. : 176]) shown in the left-hand bar; medium promoter (J23111 [SEQ ID No. : 177]) shown in the central bar; or strong promoter (J23100 [SEQ ID No. : 172]) shown in the right-hand bar; having increasing promoter strengths of 0.16; 0.58 and 1.00 on the Anderson scale, each combined with a theophylline riboswitch 5' of the cas9 gene.
Abbreviations and definition of terms:
Agent: is a small molecule capable of inducing expression of one or more genes operably linked to a promoter in the genetic circuit of the invention . Amino acid sequence identity: The term "sequence identity" as used herein, indicates a quantitative measure of the degree of homology between two amino acid sequences of substantially equal length . The two sequences to be compared must be aligned to give a best possible fit, by means of the insertion of gaps or alternatively, truncation at the ends of the protein sequences. The sequence identity can be calculated as ((Nref-Ndif)100)/(Nref), wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences. Sequence identity can alternatively be calculated by the BLAST program e.g. the BLASTP program (Pearson W.R and D.J. Lipman (1988)) (www.ncbi. nlm. nih.gov/cgi-bin/BLAST). In one embodiment of the invention, alignment is performed with the sequence alignment method ClustalW with default parameters as described by Thompson J., et al 1994, available at http://www2.ebi.ac.uk/clustalw/.
Preferably, the numbers of substitutions, insertions, additions or deletions of one or more amino acid residues in the polypeptide as compared to its comparator polypeptide is limited, i .e. no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 substitutions, no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 insertions, no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 additions, and no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 deletions. Preferably the substitutions are conservative amino acid substitutions : limited to exchanges within members of group 1 : Glycine, Alanine, Valine, Leucine, Isoleucine; group 2 : Serine, Cysteine, Selenocysteine, Threonine, Methionine; group 3 : proline; group 4 : Phenylalanine, Tyrosine, Tryptophan; Group 5 : Aspartate, Glutamate, Asparagine, Glutamine.
Cis element: is a sequence of DNA that is capable of conferring inducible expression of a gene that is functionally linked to a promoter; where the cis element may induce expression by regulating transcription of the gene or by regulating translation of transcript product of the gene.
Cognate: is used to refer to interacting pairs of functional entities; more specifically that each protein or protein cluster having RNA-guided endonuclease activity has a cognate mat-crRNA, and in the case of Type II CRISPR-Cas systems both a cognate mat-crRNA and cognate tracrRNA, that are required for their activity.
CRISPR system: is a CJustered R_egularly Interspaced S_hort P_alindromic R_epeats is bacterial immune system that cleave DNA or RNA in a sequence specific manner, mediated by a RNA guided nuclease.
E. coli ToplO : is E. coli having chromosomal Genotype mcrA, A(mrr-hsdRMS- mcrBC), Phi80lacZ(del)M15, AlacX74, deoR, recAl, araD139, A(ara-leu)7697, galU, galK, rpsL(SmR), endAl, nupG
gi number: (genlnfo identifier) is a unique integer which identifies a particular sequence, independent of the database source, which is assigned by NCBI to all sequences processed into Entrez, including nucleotide sequences from DDBJ/EMBL/GenBank, protein sequences from SWISS-PROT, PIR and many others. Heterologous gene and heterologous DNA molecule: have a different genetic origin from the recombinant biological cell in which they are expressed; and this also applies to the protein or transcript that they encode. The nucleotide sequence of the heterologous gene or heterologous DNA molecule may be optimized (e.g. codon optimization) with respect to the recombinant biological cell in which they are expressed. Heterologous gene and heterologous DNA molecule may be located on (and therefor be a part of) the chromosome or a episome of the recombinant cell, and may be inserted into this location by recombinant DNA cloning.
Promoter: is a region of DNA that initiates transcription of a particular gene. Promoters are located near the transcription start sites of genes, on the same strand and upstream on the DNA (towards the 5' region of the sense strand). A promoter that is "functionally linked" to a gene is capable of driving expression of said gene. A promoter may be drive expression constitutively or only in response to an inducer. Promoter activity: is the measured strength i .e. measured relative activity of a promoter to drive expression of a reporter gene encoding Red Fluorescent Protein (RFP) in E. coli. Measurement is performed on cells of f. coli strain TGI [K-12 supE thi-1 A(lac-proAB) A(mcrB-hsdSM)5, (rK-mK-)] comprising the plasmid J61002 (as described in http://parts.igem .Org/Part: BBa_J61002) in which the tested promoter linked to the RFP reported gene is cloned, and where the cells are grown in LB media to saturation . The range of measured promoter strengths is relative to the Anderson Promoter Collection Scale (Anderson scale) of 0.00 to 1.00, wherein promoter J23100 [SEQ ID No. : 172] has the highest measured relative strength of 1.00; and the promoter J23112 [SEQ ID No. : 171] has a measured relative strength of 0.00, corresponding to non-detectable promoter activity (as described by
http://parts.iaem.orQ/Promoters/CataloQ/Anderson'). As defined herein, a strong promoter is one having a measured strength of >0.65 on this Anderson scale.
RBS : Ribosomal Binding Site is a sequence of nucleotides upstream of the start codon of an mRNA transcript that is responsible for the recruitment of a ribosome during the initiation of protein translation .
Small molecule library: library of small organic molecules; preferably having a mol mass of less than 900 daltons; for example libraries available from Asinex (www.asinex.com).
Detailed description of the invention
A first embodiment of the invention provides a method for screening a molecule or library of molecules (in particular small molecules) or polypeptides for a candidate molecule or polypeptide capable of regulating CRISPR-Cas associated activity. The regulator may be an inhibitor or an enhancer of CRISPR-Cas associated activity. The method is based on an in vivo assay employing cells; where the cells comprise genes encoding and capable of expressing the essential components required for CRISPR-Cas associated activity (Figure 1). The method of the invention has broad application, since it can be used for screening for regulators of all types of CRISPR-Cas systems having CRISPR-Cas associated activity.
While CRISPR-Cas systems, encoded by a CRISPR locus, find their origin in adaptive immune systems in a wide range of archeae and bacteria, they share a common functional property of cleaving a reporter DNA molecule or its transcript; and thereby allowing genetic material to be added, removed, or altered at particular locations in the genome of a biological cell or in a DNA molecule. In general terms, a CRISPR locus consists of a CRISPR array, comprising short variable DNA sequences (called spacer sequence(s)) interspersed by short palindromic repeat sequences (called repeat sequence(s)), this being flanked by diverse cognate cas genes. The CRISPR-Cas systems have been classified into 2 classes; whereby members of Class 1 comprise a multi-subunit subunit protein complex, each subunit encoded by a gene in the CRISPR locus; while members of Class 2 comprise a single multidomain protein encoded by a single gene. Class 1, includes type I, III and IV CRISPR-Cas systems; while Class 2, includes type II, V and VI CRISPR-Cas systems. Thus, in total there are six types, which have further been classified into over 28 subtypes; based on their protein structure, signature Cas genes and operon arrangements (Koonin, E. V. et al . , 2017). More specifically: the loci comprising Class 1, type I CRISPR-Cas systems share in common a signature gene cas3 (or its variant cas3') encoding a nuclease that both unwinds and cleaves target double-stranded DNA (dsDNA) and RNA-DNA duplexes. Type I systems comprise seven subtypes, I-A to I-F, and I-U. The loci comprising Class 1, type III CRISPR-Cas systems all contain the signature gene caslO, as well genes encoding Cas5 gene and several paralogues of Cas7 that co-transcriptionally target RNA and DNA. Type III systems comprise four subtypes, III-A to III-D. The loci comprising Class 1, type IV CRISPR-Cas system comprise two subtypes, IV-A and IV-B; where the Csfl can serve as as signature gene.
The loci for the Class 2, type II CRISPR-Cas systems, comprise the signature Cas9 gene encoding a multidomain protein that combines the functions of a crRNA-effector complex and cleaves target DNA. All Type II loci are characterised by a nucleic acid sequence encoding a tracrRNA, which is partially complementary to the repeats within the respective CRISPR array. Type II systems comprise three subtypes, II-A to II-C. Class 2, type V CRISPR-Cas systems comprise the signature gene Cpfl, also encoding a single multidomain protein. Unlike Cas9, the Cpfl gene is commonly located outside a CRISPR-Cas locus. Type V systems are currently divided into five subtypes, V-A to V-E.
For the purpose of expression of members of the Class 2 typell CRISPR-Cas systems in a cell, it is sufficient to express the signature Cas gene encoding the multidomain protein having RNA guided-endonuclease activity. Expression of all other Class 1 and 2 CRISPR-Cas systems can be achieved by co-expressing the several genes encoding the Cas proteins that constitute the multi-subunit subunit protein complex having RNA guided-endonuclease activity.
When expression of the CRISPR locus is activated, transcription of the CRISPR array yields a pre-crRNA, that is processed into individual mature crRNAs (mat-crRNA). Each mat-crRNA sequence, aided by its cognate single or multi-subunit Cas protein(s) (effector complex), functions as a guide to specifically target the effector complex to cleave a reporter nucleic acid molecule (DNA or RNA). Processing of the pre-crRNA is mediated by either an endonuclease subunit of the effector complex; or in the case of type II CRISPR-cas system by a combination of a RNase III protein together with a trans-activating CRISPR RNA (tracrRNA); where tracrRNA serves to recruit the components of the type II effector complex.
For the purpose of expressing of any member of the Class 1 or 2 CRISPR-Cas systems in a cell, an individual mat-crRNA comprising at least one spacer sequence, flanked at least at one end (preferably at both ends) by a palindromic repeat sequence, is sufficient to specifically guide the cognate effector complex to cleave a reporter nucleic acid molecule (DNA or RNA). Alternatively, a complete CRISPR array (pre-crRNA) can be expressed. The reporter nucleic acid molecule comprises a sequence that is complementary to the spacer sequence. A tracrRNA is additionally necessary of the expression of a type II CRISPR-cas system. Synthetic DNA molecules can conveniently be engineered to express a RNA transcript comprising a spacer sequence flanked, at least at one end, by a palindromic repeat sequence (i.e. a mat-RNA), for use in the expression of the Class 1 or 2 CRISPR-Cas systems. Flowever, in the case a type II CRISPR-cas system, the RNA transcript of the synthetic DNA molecule can comprise a a mat-RNA fused with a tracrRNA sequence, also called single-guide RNA (sgRNA).
A second embodiment of the invention provides cells for use in screening molecules or polypeptides (or a library thereof) for a candidate molecule or polypeptide capable of regulating CRISPR-Cas associated activity (Figure 1). The cells, employed in the method of screening, comprise a genetic circuit designed for detecting molecules or polypeptides for regulators (inhibitors or enhancers) of CRISPR-Cas associated activity. The genetic circuit is engineered to operate in a host biological cell and provide an in vivo based screen for molecules or polypeptides having anti-CRISPR associated activity. In general terms, the genetic circuit comprises genes or cluster of genes encoding a RNA-guided CRISPR-Cas system and its cognate RNA(s); wherein said CRISPR-Cas system comprises either a multi-subunit subunit protein complex (Class 1), each subunit being encoded by a gene derivable from (or present in) the corresponding native CRISPR locus; or a single multidomain protein (Class 2) encoded by a single gene. The genetic circuit further comprises a reporter gene that functions as a marker for CRISPR activity. Each gene or gene cluster of the genetic circuit, present in a cell, is functionally-linked to a promoter and RBSs. The biological cell may further comprise a cis element (DNA) in combination with the promoter for regulating the expression of one or more of the genes or gene clusters of the genetic circuit. Each of said one or more of said genes or gene clusters of the genetic circuit may be located on an episome or on the cell's chromosome. Preferably, the genes and DNA molecules encoding the components of the Class 1 or 2 CRISPR-Cas systems and their respective promoters are cloned into a suitable host cell; more preferably integrated into plasmids. These genes and DNA molecules may be heterologous in origin with respect to the host cell; and are preferably heterologous with respect to their cognate promoter. The host biological cell (recombinant host cell) is preferably a bacterial cell.
When the genetic circuit encoding RNA-guided CRISPR-Cas system and its cognate RNA(s) is expressed in the biological cell, the cognate RNA serves to guide the system to a reporter gene, such that its endonuclease activity cleaves and inactivates the reporter gene or its transcript. Loss of expression of the inactivated reporter gene is detectable in a cell comprising the genetic circuit. Correspondingly, a molecule or polypeptide that is capable of preventing inactivation of the reporter gene in a cell comprising the genetic circuit is detectable, either by means of detection of a product of the reporter gene itself, or by the survival and growth of cells that continue to express the reporter gene under selective growth conditions. Cells comprising the genetic circuit are both suitable for screening molecules entering the cell, as well as polypeptides that are recombinantly expressed within the cell. Non-host cell DNA libraries encoding polypeptides can be introduced into recombinant cells comprising the genetic circuit and rapidly screened for regulators of CRISPR-Cas associated activity. I Method for in vivo detection of small molecules or polypeptides that act as regulators of CRISPR-Cas associated activity
The invention according to the first embodiment provides a method for in vivo detection of regulators of CRISPR-Cas associated activity preferably comprising the steps of:
a. providing recombinant biological cells having inducible CRISPR-Cas associated activity wherein said cells are prepared on a defined medium devoid of one or more agents for inducing expression of said activity, and either
I. contacting each of one or more aliquots of said recombinant biological cells with a small molecule or an individual molecule of a small molecule library; or
II. introducing a self-replicating library of non-host DNA fragments into said recombinant biological cells;
b. culturing the cells obtained from step (a) (I) or (II) on said defined culture medium for a period sufficient for uptake or expression of said a candidate regulator of CRISPR-Cas associated activity;
c. culturing the cells obtained from step (b) on said defined culture medium
wherein said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity; and
d. detecting expression of a reporter gene present in cells obtained from step (c), wherein said recombinant biological cells comprise:
i. a first gene encoding a polypeptide or a first gene cluster encoding a polypeptide complex, said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said gene or gene cluster is operably linked a first inducible promoter having a measured strength of >0.65 on the Anderson scale, or a combination of a first promoter having a measured strength of >0.65 on the Anderson scale and a first cis element conferring inducible expression, wherein said inducible expression is induced by at least one of said one or more agents;
ii. a reporter gene operably linked to a constitutive promoter;
iii. a nucleic acid molecule encoding any one of:
o a pre-crRNA molecule (encoded by a DNA molecule comprising a CRISPR array) wherein said pre-crRNA comprises a spacer sequence complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript;
o a mat-crRNA molecule comprising at least one spacer sequence flanked by at least one palindromic repeat sequence, wherein the spacer sequence is complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript;
o the pre-crRNA molecule and a trans-activating crRNA; or
5 o the mat-crRNA molecule and a trans-activating crRNA (or a fusion thereof),
wherein said nucleic acid molecule is operably linked a second inducible promoter having a measured strength of >0.65 on the Anderson Scale, or a combination of a second promoter and second cis element conferring inducible 10 expression, wherein said inducible expression is induced by at least one of said one or more agents;
and
wherein the palindromic repeat sequence and the trans-activating crRNA are cognate with respect to the polypeptide or polypeptide complex having RNA- 15 guided endonuclease activity; and
wherein said RNA-guided endonuclease activity is capable of inactivating said heterologous reporter gene or its transcript.
Preferably, said reporter gene is comprised on a plasmid devoid of genes capable of expressing CRISPR-Cas associated activity.
20
The first gene or first gene cluster encoding a polypeptide or polypeptide complex, respectively having RNA-guided endonuclease activity, may be selected from any gene or gene cluster encoding cas protein(s) having CRISPR associated activity. Examples of cas proteins or cas protein complexes, suitable for expression in 25 recombinant cells of the invention include those belonging to the Type 1, Type II, Type III; Type IV, Type V, and Type VI and their subtypes. In most cases the first gene or first gene cluster and its respective encoded polypeptide or polypeptide complex will be heterologous in origin with respect to both the promoter to which it is operably linked, and to the recombinant biological cell in which it is expressed. Examples of the 30 components of cas proteins or cas protein complexes of the various RNA-guided CRISPR-Cas system are listed in Table 1 below together with their DNA coding sequences and respective amino acid sequences:
Table 1
Figure imgf000015_0001
Figure imgf000016_0001
Figure imgf000017_0001
Figure imgf000018_0001
Figure imgf000019_0001
N/A = not applicable or required; * non-essential Cas component of Type IB.
The nucleic acid molecule in the recombinant biological cell may comprise a sequence encoding a mature crRNA (mat-crRNA) comprising at least one spacer sequence 5 flanked at least at one end (preferably at both ends) by a palindromic repeat sequence. The spacer sequence is complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript. This complementary sequence guides the polypeptide or polypeptide complex having RNA- guided endonuclease activity to the site of cleavage in the reporter gene or its 10 transcript. Accordingly, the complementary sequence of the spacer is selected such that cleavage of the reporter gene or its transcript prevents expression of the reporter gene or its transcript. The nucleic acid molecule in the recombinant biological cell may further comprise a sequence encoding a trans-activating crRNA (tracrRNA). The polypeptide or polypeptide complex having RNA-guided endonuclease activity selectively associates with its cognate palindromic repeat sequence and, where applicable also its cognate tracrRNA. Thus, when the polypeptide or polypeptide complex is selected from Table 1, then the cognate palindromic repeat sequence and cognate tracrRNA are encoded by the nucleotide sequences listed in Table 1.
In one embodiment, the nucleic acid molecule in the recombinant biological cell comprises a sequence encoding a single RNA transcript comprising a fusion of the mat- crRNA and the tracrRNA.
In a preferred embodiment, the amino acid sequence of the polypeptide or polypeptides of a protein complex having RNA-guided endonuclease activity has at least 70, 75, 80, 81, 82, 83, 84, 85, 86, 87, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% amino acid sequence identity to an amino acid sequence selected from Table 1. In this embodiment the at least one cognate palindromic repeat sequence is encoded by a nucleic acid sequence at the corresponding position in the list of Table 1; while the cognate trans-activating crRNA is encoded by a nucleotide sequence at the corresponding position in the list of Table 1.
The reporter gene in the recombinant biological cell encodes a reporter polypeptide, whose expression in the cell is detectable; for example either by virtue of the inherent properties of the polypeptide (e.g. a polypeptide with fluorescent properties, such as GFP), or by means of the catalytic activity of the polypeptide (e.g . a polypeptide that confers resistance to a growth inhibitor (e.g. a protein conferring resistance to an antibiotic such as tetracycline, kanamycin, neomycin, tobramycin, chloramphenicol, spectinomycin, gentamycin, ampicillin and carbenicillin, as exemplified in Example 1.1) or whose activity complements an auxotrophic phenotype (e.g . vitamin auxotrophy). The reporter gene or its transcript is the target for the polypeptide or polypeptide complex having RNA-guided endonuclease activity. In one embodiment the reporter gene comprises a protospacer adjacent motif (PAM) located adjacent to the sequence complementary to the mat-crRNA spacer sequence, in particular where the RNA- guided endonuclease polypeptide is a Type II A Cas (e.g. Cas9). A canonical PAM is the sequence 5'-NGG-3' where "N" is any nucleobase.
Expression of the RNA-guided endonuclease protein or protein complex and respective cognate pre-crRNA, mat-crRNA and/or tracrRNA molecules (e.g. sgRNA) in the recombinant biological cell leads to a rapid inactivation of the reporter gene or its transcript. For this reason, their expression should be tightly regulated. Preferably the first inducible promoter, or a combination of the first promoter and the first cis element conferring inducible expression is both structurally and functionally different from the second inducible promoter, or a combination of the second promoter and the second cis element; such that expression of the protein or protein complex having RNA-guided endonuclease activity and its cognate pre-crRNA, mat-crRNA and/or tracrRNA molecules (e.g . sgRNA) can be regulated independently. A suitable inducible promoter includes an IPTG-inducible T5 or T7 promoter. In a preferred embodiment, inducible expression may be conferred by a promoter regulated by a cis element encoding a biosensor transcription factor, for example the araBAD promoter pBAD, whose expression is regulated by metabolites L-arabinose, and mevalonate (as exemplified in Example 1.1). In another embodiment, expression of a gene is regulated translationally by means by a constitutive promoter in combination with a cis element encoding a riboswitch, for example a theophyllin riboswitch (as described in Example 1.1). The selectivity of the screening method is improved by employing a strong inducible promoter; for example a constitutive promoter selected from the Anderson promoter collection having a measured strength (activity) of >0.65, or 0.7 or above f http://parts.ioem.oro/Promoters/Cata loo/Anderson') such as J23100 [SEQ ID No. : 172], J23101 [SEQ ID No. : 173], J23102 [SEQ ID No. : 174], and J23104 [SEQ ID No. : 175], having a measured strength of 1.00; 0.70; 0.86; and 0.72 respectively (see Examples 2.2 and 6). Expression of the reporter gene in a recombinant biological cell is preferably driven by a constitutive promoter, (e.g . Ptet promoter, without the repressor promoter or a sigma70 promoter from the Anderson promoter collection. Each of said first gene, first gene cluster, reporter gene and nucleic acid molecule may independently be a component of an episome that is stably replicated in the recombinant biological cell or an integrated component of its chromosome. Preferably, the episome is a plasmid comprising an origin of replication (ori) conferring a low copy number, such as pSClOl and pl5A described in Example 1.1 (Figure 2). When a population of recombinant cells of the invention are used to screen for regulators of a CRISPR-Cas associated activity; the use of a low copy number plasmid is advantageous, since it reduces variability between the recombinant cells; which in turn enhances the stringency of the screening assay. The functional operation of the method of the invention involves the inactivation of a reporter gene such that a reporter polypeptide is no longer expressed. Inactivation of the reporter gene by DNA cleavage also leads to a progressive elimination of the episome or chromosome comprising the reporter gene. Accordingly, in a one embodiment, the reporter gene is located on a plasmid, separate from the location of the other first genes and nucleic acid molecule required for expression of the CRISPR associated activity. In this manner, the recombinant cells continue to express inducible CRISPR associated activity independently of the reporter gene, ensuring complete inactivation of the reporter gene, thereby limiting the frequency of false positives.
The recombinant biological cell is preferably bacterial, belonging to a genus selected from the group consisting of: Escherichia spp., (e.g. E. coli ); Bacillus spp., (e.g. B. subtilis ); Pseudomonas spp., (e.g . P. fluorescens, P. putida ); Corynebacterium spp., (e.g. C. giutamicum ); Agrobacterium spp., (e.g. A. tumefaciens ); Cauiobacter spp., (e.g. C. vibrioides ); Burkholderia spp., (e.g . B. graminis ); Rhizobium spp., (e.g. R. leguminosarum ), and Raistonia spp., (e.g. R. metaiiidurans). Preferably the host biological cell is species of the genus Escherichia spp., in particular E. coli.
II The method for screening a small molecule library for a regulator of CRISPR-Cas associated activity
When the method according to the first embodiment is employed to screen a small molecule library for a regulator of CRISPR-Cas associated activity, the method preferably comprises the following steps:
a. providing one or more aliquots of recombinant biological cells having inducible CRISPR-Cas associated activity wherein said cells are prepared on a defined medium devoid of one or more agents for inducing expression of said activity; b. providing a multi-compartment container suitable for cell culture;
c. delivering said aliquots of the recombinant cells into each of one or more
compartments of said multi-compartment container; whereby each aliquot of cells is brought into contact with a single small molecule of a small molecule library;
d. providing each compartment of step (c) with said defined culture medium and culturing for a period sufficient to induce uptake of a candidate regulator of CRISPR-Cas associated activity;
e. culturing the cells obtained from step (d) on said defined culture medium
wherein said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity and
f. detecting cells expressing the reporter gene in the one or more compartments of said multi-compartment container obtained in step (d). wherein said recombinant biological cells are according to the first embodiment as defined in section I.
The small molecule library is typically composed of small molecules, including peptidomimetics, natural or synthetic compounds, for example libraries of diverse macrocycle compounds.
The growth medium in step (d) is either solid or liquid; and its composition is chosen to be compatible with the properties of the reporter protein expressed the the reporter gene in the recombinant cells. Thus, when the reporter polypeptide is detectable by virtue of the inherent properties of the polypeptide (e.g. a polypeptide with fluorescent properties, such as GFP), it is sufficient to employ a growth medium that supports growth of the recombinant cells. However, when the reporter polypeptide is detectable by virtue of its catalytic activity (e.g. a polypeptide that confers resistance to a growth inhibitor (antibiotic resistance protein, as exemplified in Example 1.1) or whose activity complements an auxotrophic phenotype, then the composition of the growth medium should facilitate the mode of detection. Thus, where the polypeptide confers resistance to a growth inhibitor, the growth medium should be supplemented with growth restrictive concentrations of the same growth inhibitor. Equally, where the polypeptide complements an auxotrophic phenotype, the nutrient required for growth of the auxotroph should be excluded from the growth medium.
Potential regulators of CRISPR-Cas associated activity are identified by detecting recombinant cells expressing the reporter polypeptide in the one or more compartments of said multi-compartment container in step (e). The means for detection is compatible with the properties of the reporter polypeptide. For example, fluorescence detection tools are available for detection of expressed fluorescent polypeptides; while detection of cell growth suffices to detect recombinant cells cultured on restrictive growth medium .
Ill The methods for screening a library of non-host DNA fragments for a regulator of CRISPR-Cas associated activity
When the method according to the first embodiment is employed to screen a library of non-host DNA fragments for genes encoding a regulator of CRISPR-Cas associated activity, the method preferably comprises the following steps:
a. providing one or more aliquots of recombinant biological cells having inducible CRISPR-Cas associated activity, wherein said cells are prepared on a defined medium devoid of one or more agents for inducing expression of said activity; b. introducing a self-replicating library of non-host DNA fragments into one or more aliquots of said recombinant biological cells;
c. culturing the one or more aliquots of cells obtained from step (b) on said
defined culture medium for a period sufficient to induce expression of a candidate regulator of CRISPR-Cas associated activity; d. culturing the cells obtained from step (c) on said defined culture medium wherein said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity, and
e. detecting expression of a reporter gene present in cells obtained from step iii. f. Optionally, individual self-replicating library members in cells detected in step
(e) are isolated and re-transformed into recombinant biological cells having inducible CRISPR-Cas associated activity, and tested by repeating steps (c) to (e). The construction of the library of non-host DNA fragments and its introduction into the recombinant cells is further detailed in Section V and Example 4.
The composition of the growth medium is one that facilitates the mode of selection provided by the reporter polypeptide, as described in section II. Similarly, potential regulators of CRISPR-Cas associated activity encoded by genes in the library of DNA fragments are identified by detecting recombinant cells expressing the reporter polypeptide, as described in section II.
Preferably, the recombinant cells produced in step (b), that are cultured in step (c) are cultured on a growth medium that is solid, at a plating density that ensures the growth of single colonies. A suitable plating density is about 0.5 to 1.5* 106 cells/cm2; which generates an overgrown plate during growth under non-selective growth medium.
Single colonies detected in step (d) can be selected; and the non-host DNA fragment cloned in a self-replicating plasmid of the library (and present in cells of the selected colony) can be isolated, amplified and sequenced in order to identify the candidate polypeptides as regulators of CRISPR-Cas associated activity.
IV A recombinant biological cell comprising a genetic circuit for detecting regulators of CRISPR-Cas associated activity
The invention according to the second embodiment provides recombinant bacterial cells comprising a genetic system for detecting a regulator of CRISPR-Cas associated activity (Figures 1 and 2), wherein the genetic circuit preferably comprises:
a) a 1st self-replicating genetic element comprising,
i. a first gene encoding a polypeptide or a first gene cluster encoding
polypeptide complex, said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said gene or gene cluster is operably linked a first inducible promoter having a measured strength of >0.65 on the Anderson Scale, or a combination of a first promoter having a measured strength of >0.65 on the Anderson Scale and a first cis element conferring inducible expression wherein said inducible expression is induced by at least one agent;
ii. a first reporter gene encoding a selectable marker polypeptide and
operably linked to a constitutive promoter and
b) a 2nd self-replicating genetic element comprising,
i. a second reporter gene operably linked to a constitutive promoter; wherein either said 1st self-replicating genetic element or said 2nd self-replicating genetic element further comprises :
a nucleic acid molecule encoding any one of:
o a pre-crRNA molecule (encoded by a DNA molecule comprising a CRISPR array) wherein said pre-crRNA comprises a spacer sequence complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript;
o a mat-crRNA molecule comprising at least one spacer sequence flanked by at least one palindromic repeat sequence, wherein the spacer sequence is complementary to a nucleic acid sequence of at least 15 nucleotides of at least one strand of said reporter gene or its transcript;
o said pre-crRNA molecule and a trans-activating crRNA; or
o said mat-crRNA molecule and a trans-activating crRNA (or a fusion thereof),
wherein said polypeptide or polypeptide complex having RNA-guided endonuclease activity is capable of inactivating said heterologous reporter gene or its transcript, and
wherein said nucleic acid molecule is operably linked an second inducible promoter measured strength of >0.65, or a combination of a second promoter and second cis element conferring inducible expression, wherein said inducible expression is induced by at least one agent.
Preferably, said reporter gene is comprised on a plasmid devoid of genes capable of expressing CRISPR-Cas associated activity.
The first gene or first gene cluster encoding a polypeptide or polypeptide complex, respectively having RNA-guided endonuclease activity, include those belonging to the Type 1, Type II, Type III; Type IV, Type V, and Type VI and their subtypes; as exemplified in Table 1. In most cases the first gene or first gene cluster and their respective encoded proteins are heterologous with respect to the host cell in which they are expressed. In one preferred example the CRISPR associated protein is a Type II subtype A protein, in particular a Cas9 protein . Alternatively, the CRISPR associated protein is preferably a class 2 type V system comprising the signature gene Cpfl encoding a Cpfl protein (e.g. WP_014550095.1). Alternatively, the CRISPR associated protein is preferably a Class I type 1 system comprising the signature gene Cas3 encoding a Cas3 protein.
The nucleic acid molecule in the genetic circuit encodes a pre-crRNA or a mat-crRNA molecule comprising a palindromic repeat sequence that is cognate to the polypeptide or polypeptide complex having RNA-guided endonuclease activity. When the polypeptide of polypeptide complex is selected from Table 1, then the repeat and tracrRNA preferably have a cognate sequence listed in Table 1. In one embodiment, the nucleic acid molecule in the genetic circuit comprises a sequence encoding a single RNA transcript comprising a fusion of the gRNA and tracrRNA, and is heterologous with respect to the host cell in which it is expressed.
The first reporter gene encoding the selectable marker polypeptide in the 1st self- replicating genetic element allows the selection of recombinant cells comprising the 1st self-replicating genetic element and its maintenance therein . The selectable marker polypeptide encoded by the first reporter gene and the reporter polypeptide encoded by the second reporter gene must be different, to allow plasmid selection and reporter detection to be independent of each other. Suitable marker polypeptides include those conferring resistance to a growth inhibitor (for a protein conferring resistance to an antibiotic such as tetracycline, kanamycin, neomycin, tobramycin, chloramphenicol, spectinomycin, gentamycin, ampicillin and carbenicillin, as exemplified in Example 1.1). Expression of the heterologous gene encoding the selectable marker polypeptide is preferably driven by a constitutive promoter, (e.g . Ptet promoter, without the repressor promoter or a sigma70 constitutive promoter (http://parts.igem .org/Promoters/Catalog/Anderson). The reporter gene in the genetic circuit encodes a reporter polypeptide having the same properties as the reporter gene in recombinant cells described in Section I.
Induced expression of CRISPR-Cas associated activity in recombinant cell comprising the genetic circuit leads to a rapid cleavage of the reporter gene or it transcript. For this reason, expression of the RNA-guided endonuclease protein or protein complex and its cognate gRNA in a host cell is tightly regulated. Inducible expression of the component genes in the genetic circuit is regulated in the same manner as the corresponding genes in the recombinant cells described in Section I. The 1st and 2nd self-replicating genetic element may be any episome that is stably replicated in a host cell, for example a plasmid. Suitable plasmids for use in the genetic circuit are the same as the corresponding plasmids for use in the recombinant cells described in Section I.
Self-replicating genetic elements (e.g. plasmids); methods for assembling the genetic circuit therein; and their subsequent cloning in a suitable host cell, are illustrated in Example 1, using commercially available methods and tools that are known to those skilled in the art (see, e.g., Sambrook et al . , Molecular Cloning : A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989).
Suitable recombinant biological cells comprising the genetic circuit according to the second embodiment of the invention circuit of the invention may be obtained using the same host cells as those used to obtain the recombinant biological cells described in Section I.
The genetic circuit of the invention may be transformed into the chosen biological host cell employing standard transformation protocols, including electroporation as illustrated in Example 1.3. V A recombinant biological cell library comprising a genetic circuit for detecting regulators of CRISPR-Cas associated activity
The invention, according to a third embodiment, provides a recombinant biological cell library of non-host DNA fragments wherein each cell of the library comprises the genetic circuit described in section IV.
Preferably the library is a metagenomics DNA library where the DNA fragments of metagenomic DNA are cloned into a self-replicating vector (e.g . plasmid or cosmid comprising an origin of replication). The average size of the metagenomic DNA insert in a plasmid is about 1.5 kb; but using a cosmid can be up to about 40 kb. While expression of open reading frames in the cloned DNA fragments may be driven by endogenous promoters within the DNA fragments; a suitable constitutive promoter may be provided upstream of the DNA insert cloning site in the self-replicating vector. Preferably the self-replicating vector into which the metagenomic DNA is cloned is provided with a gene encoding a selectable marker polypeptide, which is different from the marker polypeptides encoded by the reporter gene and the heterologous gene encoding a marker polypeptide in the genetic circuit. The recombinant biological cell library is conveniently constructed in the recombinant bacterial cells comprising the genetic circuit according to the invention. The DNA fragments of metagenomic DNA can be derived from human, cow and pig fecal samples and soil samples, and the library can be constructed as described in Example 4.1 and 4.2.
VI Enhancing the efficiency of the method for in vivo detection of anti-CRISPR polypeptides or small molecules using a recombinant biological cell of the invention.
The efficiency of the method for functional screening for anti-CRISPR molecules is improved by employing biological cells comprising the genetic circuit according to the invention, wherein expression of the RNA guided CRISPR-Cas activity is controlled by a a strong promoter and inducible expression, and a selectable plasmid having the reporter gene (see section I, IV and Example 2). The frequency of false positives is further reduced by using defined media to precisely regulate the expression of the genetic circuit. This is achieved by using the following method, which is suitable for screening either a metagenomics library encoding candidate anti-CRISPR proteins; or one or more isolated candidate anti-CRISPR proteins; or one or more isolated anti- CRISPR small molecules. The method comprises 6 steps:
1) Provision of recombinant biological cells capable of inducible CRISPR-Cas associated activity and a reporter gene according to the invention; and the cultivation of these cells under aerobic conditions in a defined growth medium (such as M9 minimal medium for prokaryotic cells), comprising only those nutrients essential for growth, to produce a multiplied cell culture. Preferably, the medium is selective for recombinant biological cells comprising the gene or gene cluster encoding an RNA-guided endonuclease that confers said inducible CRISPR-Cas associated activity and the nucleic acid molecule encoding its cognate RNA molecule that is capable of guiding said RNA-guided endonuclease. Thus, when said gene or gene cluster and said nucleic acid molecule is provided on one or more plasmid, then the medium comprises both nutrients essential for growth and antibiotics selective for cells maintaining the respective plasmids. Importantly, the defined medium is devoid of agents required for, or capable of, inducing expression of the genes or gene clusters encoding an RNA- guided endonuclease and its cognate RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript.
2) An aliquot of the cell culture produced in step 1) is inoculated into said defined growth medium comprising nutrients essential for growth to produce a cell suspension; said medium being both devoid of agents required for, or capable of, inducing expression of the genes or gene clusters encoding an RNA-guided endonuclease and its cognate RNA molecule; and devoid of antibiotics selective for cells maintaining the respective plasmids;
3) cells in the cell suspension produced in step 2) are either:
i) transformed with a metagenomic library to produce a transformed cell suspension, or
ii) contacted with a candidate anti-CRISPR small molecule or a candidate anti-CRISPR protein by their addition to said cell suspension;
4) a cell suspension resulting from step 3) i) is incubated (preferably between 0.25 -2 h) to allow expression of candidate anti-CRISPR protein encoded by DNA molecules in the metagenomic library; or 3) ii) is incubated (preferably between 0.25 -2 h) to allow intracellular uptake of the candidate anti-CRISPR small molecule or candidate anti- CRISPR protein;
5) Agents are added to the incubated cell suspension(s) resulting from step 4; and the suspensions are incubated under conditions suitable for said agents to induce expression of the genes or gene clusters encoding an RNA-guided endonuclease and its cognate RNA molecule present in the biological cell provided in step 1); whereby said expressed endonuclease activity is capable of inactivating the reporter gene or its transcript present in the biological cell during said incubation (preferably between 1-3 h);
6) Cells derived from the incubated cell suspension(s) resulting from step 5) are plated on agar plates composed of a defined media comprising nutrients essential for growth and supplemented with agents required for, or capable of, inducing expression of the genes or gene clusters encoding an RNA-guided endonuclease and nucleic acid molecule(s) encoding its cognate RNA molecule(s). When said gene or gene cluster and said nucleic acid molecule(s) are provided on one or more plasmids, then the medium further comprises antibiotics selective for cells maintaining the respective plasmids. In the case that the reporter gene or its transcript encodes an antibiotic resistance protein, then the defined media agar is further supplemented with the respective antibiotic. The plates are incubated under suitable conditions for a time sufficient to detect viable biological cells.
EXAMPLES
Example 1 Construction and cloning of a genetic circuit for screening for anti- CRISPR molecules
1.1 Construction of pCasens3 and pDual3 plasmids
The pCasens3 and pDual3 plasmids, illustrated in Figure 2, were constructed as follows:
Plasmid pCasens3 containing the Streptococcus pyogenes Cas9 gene (spCas9) [SEQ ID No. : 150] was constructed in a single step by using USER cloning (Genee et al. 2014). A DNA fragment containing spCas9 gene [SEQ ID No. : 150] and its endogenous terminator [SEQ ID No. : 152] was PCR amplified from a plasmid DS-SPcas (supplied by addgene: Plasmid #48645) (Esvelt et al . 2013); and the amplified DNA fragment was cloned into the backbone of the plasmid pSEVA47 (as described by Martinez-Garcia et al. 2014). The plasmid pSEVA47 contains: the low copy number origin of replication pSClOl [SEQ ID No. : 155], and the antibiotic resistance gene aadA [SEQ ID No. : 153] that confers resistance against Spectinomycin and a restriction site for insertion of a gene. A DNA molecule [SEQ ID No. : 149] encoding a theophylline translational riboswitch was placed in front of Cas9 using a long forward primer: 5'AAGTCTAGCGAACCGCACTTAATACGACTCACTATAGGTACCGGTGATACCAGCATCGTCTT GATGCCCTTGGCAGCACCCTGCT AAGGT AACAACAAGATGATGGAT AAGAAAT ACT CAAT AGG CTTAGATATCGGCAC-3' [SEQ ID No. : 156]). Additionally, a sigma70 constitutive promoter (J23100 [SEQ ID No. : 172]; as defined by http://parts.igem.org/Promoters/Catalog/Anderson) was also introduced using a reverse primer in substitution for the endogenous of spCas9 promoter: 5'-ctctagTagctagcactgtacctaggactgagctagccgtcaaGTTAGCTGTGCTCTAGAAGCTAGCAG-3' [SEQ ID No. : 157]
Plasmid pDual3 for arabinose inducible sgRNA expression was constructed by using USER cloning (Genee et al . 2014) to insert the respective genes into the backbone of the plasmid was from pSEVA3610, described by Martinez-Garcia et al., 2015. The chimeric gRNA encoding sequence [SEQ ID No. : 161] with a TrrnB terminator [SEQ ID No. : 162] and a chloramphenicol resistance gene (CmR) [SEQ ID No. : 164] were cloned downstream of the inducible pBAD promoter [SEQ ID No. : 158] and araC gene [SEQ ID No. : 159] encoding an L-arabinose-inducible transcription factor. In the absence of L-arabinose, the AraC protein binds to operator sites within pBAD effectively repressing transcription . On addition of L-arabinose, AraC protein binds to L-arabinose to form a complex having a DNA-binding conformation that activates pBAD leading to induction of transcription of its cognate genes, namely sgRNA and CmR. The plasmid comprises a low copy number origin of replication, pl5A [SEQ ID No. : 163] . The first twenty nucleotide sequence of the encoded gRNA transcript is complementary to nucleotides 96 : 115 of the + strand in the CmR gene having [SEQ ID No. : 164]
Positive strand nucleotides 96 : 115 : CTATAACCAGACCGTTCAGC [SEQ ID No. : 166]
1.2 Construction of positive and negative control plasmids for testing the genetic circuit
Control plasmids for testing the genetic circuit (Figure 3) include a negative control plasmid comprising a gfp gene [SEQ ID No. : 140] with rrnB T1 terminator [SEQ ID No. : 142] capable of expression of GFP (A); and a positive control plasmid comprising an acrlla2 gene [SEQ ID No. : 159] capable of expression of the anti-CRISPR protein, AcrIIA2, each under the control of a constitutive Ptet promoter (lacking the TET repressor) and RBS [SEQ ID No. : 139] (B). Both plasmids have a ColEl ori [SEQ ID No. : 143] and kanamycin resistance gene [SEQ ID No. : 144] When the control plasmids are transformed into host cells, comprising the genetic circuit (i.e. pCasens3 and pDual3 plasmids as described in Example 1.1); the expression of the anti-CRISPR protein, AcrIIA2 is capable of inhibiting Cas9 mediated inactivation of the CmR gene.
1.3 Host cells transformed with pCasens3 and pDual3 plasmids
E.coli TOPIO were made electro-competent and transformed with the genetic circuit comprising the plasmids : pCasens3 and pDual3 (obtained in example 1.1) as follows : i. E.coli TOPIO cells (supplied by ThermoFischer) were cultured in 3 ml of 2xYT media for 16-18 hours at 37°C and 250 rpm;
ii. the culture from (i) was inoculated 1 : 100 3 ml of pre-warmed 2xYT media; and cultured at 37°C at 250 rpm until the cell density reached 0.5-0.7 OD600; iii. the cultures from (ii) were cooled on ice; and the cells harvested by
centrifugation at 10 000 xg for 1 min;
iv. the cells were washed by re-suspension in cold water and then harvested
again by centrifugation; then the cells were re-suspended in 1 ml glycerol and harvested by centrifugation; and finally the cells were re-suspended in the residual water and then placed on ice, ready for electroporation .
v. A 40 pi sample of the electrocompetant cells were transferred to
microcentrifuge tubes; to which a 1 m I sample of pCasens3 and pDual3 DNA was added; and this DNA-cell mixture was then transferred to a cold cuvette and electroporated with a pulse having 1.8 kv, 200 ohms and 25 pF.
vi. Immediately thereafter, a 975 mI volume of 37°C SOC [Ausubel F.M . et al.
(1987)] was added to the electroporated cells, which were mixed and transferred to a 15 ml- tube; and incubated on rollers at 37°C for 1 h;
vii . The recovered transformed cells were diluted 10L0 - 10L8; plated on
selective solid media comprising Luria Broth agar supplemented with : 50 pg/mL spectinomycin; 30 pg/mL chloramphenicol and 50 pg/mL kanamycin; and incubate overnight at 37°C.
viii . Transformed cell colonies comprising the pCasens3 and pDual3 plasmids (£. coli genetic circuit [E. coli-GC ]) were selected; amplified and stored.
1.4 E. coli-GC cells transformed with control plasmids
E.coli TOPIO containing the genetic circuit comprising the plasmids : pCasens3 and pDual3 (E. coli-GC) were made electro-competent and transformed with either the negative control plasmid expressing GFP or positive control comprised a plasmid expressing AcrIIA2, as described in Example 1.2, employing the transformation protocol described in Example 1.3, but where the host cells were E. coli-GC. Example 2 Optimisation of the genetic circuit
The use of host cells comprising the genetic circuit to screen for regulators of RNA- guided endonucleases relies on the stringency of the genetic circuit; such that the activity of a candidate regulator can be clearly detected. As a first step in optimizing the circuit, the importance of regulation of the expression of the RNA-guided endonuclease was investigated.
The pCasens3 plasmid was modified to facilitate the comparison of 3 plasmids, differing only in the promoter region operably linked to the Cas9 gene. These modifications allowed an analysis of the impact of both the strength and on-off regulation of the promoter to be assessed.
2.1 Methodology
Three samples of E.coli TOPIO cells were transformed (as described in Example 1.3) with the genetic circuit comprising the pDual3 plasmid together with one of 3 versions of the pCasens3 plasmid; and the transformed cells were then duplicate plated on LB agar comprising the antibiotics (50 pg/mL spectinomycin; 30 pg/mL chloramphenicol and 50 pg/mL kanamycin); where one set of plates was further supplemented with 1% arabinose to induce sgRNA expression (alone) or together with 2mM theophylline for inducible Cas9 expression.
2.2 Optimised genetic circuit requires regulated and strong expression of the Cas9 gene
E.coli TOPIO cells, transformed with genetic circuits comprising the pDual3 plasmid and each of the 3 versions of the pCasens3 plasmid, are dependent on expression of a functional camR gene in order to grow on the LB plates comprising chloramphenicol. Effective inactivation of the camR gene, by the Cas9-sgRNA can be determined by detecting the loss of cell growth under these conditions.
E.coli TOPIO cells comprising the first version of the pCasens3 plasmid comprising a Cas9 gene regulated by a medium strength constitutive promoter (ProC promoter [SEQ ID No. : 167]; Esvelt, K. M., 2013) (see Figure 4) showed slower growth than cells comprising the second and third versions of the pCasens3 plasmid when grown on growth medium lacking both theophylline and arabinose required for inducing expression of the Cas9 and sgRNA genes respectively. The observed growth depression may be attributable to a potential fitness cost associated with the expression of Cas9, since it is a large protein (> 1300 amino acids in length). When expression of sgRNA was induced in these cells with arabinose, in combination with the constitutively expressed Cas9, the growth rate fell as expected; indicating the inactivation of the camR gene. However, the difference in growth rate between non- induced and induced states of the cas9-sgRNA was small.
E.coli TOPIO cells comprising the second version of the pCasens3 plasmid comprising a Cas9 gene regulated by the medium strength promoter (ProC promoter [SEQ ID No. : 167 ] in combination with the inducible theophylllin riboswitch (see Figure 4) showed a faster growth than the first version, which correspondingly may be attributed to the "off" state of cas9 expression, and absence of its fitness cost. However, when plated on media inducing expression of the Cas9-sgRNA, the log reduction in growth was limited, indicating that the medium strength promoter following induction, failed to drive sufficient expression of the Cas9 gen in order to obtain optimal inactivation of the camR gene.
E.coli TOPIO cells comprising the third version of the pCasens3 plasmid comprising a Cas9 gene regulated by the strong promoter (BBa J23100) combined with the inducible theophylllin riboswitch (see Figure 4) also showed a faster growth than the first version, which correspondingly may be attributed to the "off" state of cas9 expression, and absence of its fitness cost. Additionally, when plated on media inducing expression of the Cas9-sgRNA, the log reduction in growth was substantial, indicating that a tighly regulated and strong promoter controlling expression of the Cas9 gene is important in order to obtain optimal inactivation of the camR gene; and an effective screening genetic circuit.
E.coli TOPIO cells comprising a fourth version of the pCasens3 plasmid comprising a Cas9 gene operably linked a non-regulated strong constitutive promoter (BBa J23100) is additionally tested to confirm the importance of promoter regulation.
2.3 Optimised genetic circuit locates reporter gene for CRISPR-Cas on a separate plasmid The effect of the genetic location of the reporter gene on the efficacy of CRISPR-Cas (when located on the same or on a separate genetic element) is tested as follows:
The pCasens3 plasmid is modified by insertion of the arabinose inducible sgRNA expression cassette, comprising the chimeric gRNA encoding sequence [SEQ ID No. : 161] with a TrrnB terminator [SEQ ID No. : 162] cloned downstream of the inducible pBAD promoter [SEQ ID No. : 158] and araC gene [SEQ ID No. : 159] encoding an L-arabinose-inducible transcription factor, to create pCasens4. A chloramphenicol resistance gene (CmR) [SEQ ID No. : 164] is inserted into a second plasmid having the low copy number origin of replication, pl5A [SEQ ID No. : 163] to create pCmR.
E.coli TOPIO cells, transformed either pCasens4 and pCmR, or pCasens3 and pDual3 (see Example 1.1) are cultivated under aerobic conditions in M9 minimal medium (http://cshprotocols.cshlp.Org/content/2010/8/pdb. recl2295.short), supplemented with the antibiotics (spectinomycin 50ug/mL and chloramphenicol 30ug/mL) and the two culturare incubated at 37°C; 250RPM . An aliquot of each of the resulting cultures is transferred to fresh M9 minimal medium (devoid of said antibiotics), and cells in the resulting two cell suspensions are transformed with the negative control plasmid expressing GFP (see Example 1.2 and Figure 3B) and then incubated for at 37°C for a period of ~ lh, in M9 minimal medium, sufficient to allow expression of plasmid encoded genes, e.g. GFP (this negative control plasmid substitutes for a plasmid expressing a candidate anti-CRISPR protein, where a corresponding period for expression of plasmid genes is required). The two transformed cell suspensions are then supplemented with 2mM theophylline; 1% arabinose; and incubated at 37°C; 250RPM for a period of ~2h, to allow induced expression of the RNA-guided CRISPR- Cas genes, and inactivation of the CmR reporter gene. Finally, serial dilutions of the cells are plated in solid M9-agar plates supplemented with 50 ug/ml Spectomycin; 50 ug/ml Kanamycin; 30ug/ml chloramphenicol; and the inducing agents, 2mM theophylline; 1% arabinose.
Further tests demonstrate that when the reporter gene (in this example the CmR reporter gene) is located on a plasmid, separate from the location of the other first genes and nucleic acid molecule required for expression of the CRISPR associated activity this serves to enhance inactivation of the reporter gene, thereby limiting the frequency of false positives.
Example 3 Use of host cells comprising the genetic circuit to screen for anti- CRISPR molecules
Growth of host cells comprising the genetic circuit, E. coli-GC (prepared as described in Example 1.3) on a medium comprising theophylline and arabinose leads to induction of expression of the cas9 endonuclease and sgRNA, respectively. Since the cas9 endonuclease, when complexed with the sgRNA transcript, will then selectively base- pair and cleave the chloramphenicol resistance gene (CmR) in the pDual3 plasmid, this results in a loss of chloramphenicol resistance, and a progressive elimination of this plasmid . Thus when expression of the cas9 endonuclease and sgRNA are co-induced, growth of the cells on a medium comprising chloramphenicol is inhibited . Anti-CRISPR molecules capable of being imported into the host cells and inhibiting cas9 mediated inactivation of the CmR gene are detected by the survival and grow of the host cells on a medium comprising chloramphenicol. In brief, E. coli-GC cells from an overnight culture are diluted and plated in microtiter wells, into which small molecules are introduced (pin-transferred) and cells are then incubated at 37°C. Cell growth can be quantitated 24 h later via OD600 (or 630) readings on in Envision. Wells showing cell growth, detected by an increase OD600, identify molecules having anti-CRISPR activity. E. coli-GC cells transformed with additional plasmids were employed as controls: E. coli-GC cells acting as a negative control comprised a plasmid expressing GFP; and E. coli-GC cells acting as a positive control comprised a plasmid expressing AcrIIA2.
3.1 Procedure for screening for anti-CRISPR molecules i. Overnight cultures of E. coli comprising the genetic circuit, E. coli-GC (see Example 1.2) were prepared in filtered sterilized minimal media supplemented with 50 pg/mL spectinomycin and 30 pg/mL chloramphenicol to maintain the genetic circuit, and comprising glucose as the sole carbon source.
ii. Screening was performed on multiwell assay plates (Corning 3710), in a
minimal media further containing 2mM theophylline to induce Cas9
endonuclease expression; and 1% arabinose to induce gRNA transcription as follows:
a. columns 1-23 of the multiwell assay plates were prefilled with 30 mI_ of minimal media comprising E. coli-GC cells of the strain carrying the negative control plasmid expressing GFP (the negative control), and column 24 of the multiwell assay plates were prefilled with 30 mI_ of minimal media comprising E. coli-GC cells of the strain carrying the positive plasmid expressing the inhibitor protein, AcrIIA2. b. 300 nL of each candidate anti-CRISPR molecule were pin-transferred to each well of each plate. All molecules were tested in duplicate on duplicate multiwall assay plates under the same conditions. The optical density (OD 600nm) of each well was measure at t=0 in order to correct for compounds having an inherent absorption at 600nm. The multiwell assay plates were sealed and incubated at 37°C for 18 h. iii. Cell growth in each well of the multi-well assay plates was determined by
measuring cell density using a PerkinElmer Envision (600 nm filter) and comparing OD600 of each well and subtracting the value of t=0h from t= 18h in order to correct for compounds having an inherent absorption at 600nm.
The measured OD600 values were normalized to the positive and negative controls in each plate to determine a normalized percent of inhibitory activity of each compound . Z scores per plate per replicate were calculated, based on the normalized OD600 values. A positive hit was given for a Z score above 2.5.
All hits were annotated based on the hit scores as Weak = W (<3.5), medium = M (3.5-4.5) and Strong=S (>4.5). The compounds were further ranked based on the number of hits per library and those hits having the strongest anti-CRISPR activity in each library.
3.2 E. coli-GC host cells detect proteins having anti-CRISPR activity
E. coli-GC host cells transformed with the negative control expressing GFP (Figure 3A); and the positive control plasmid expressing AcrIIA2 (Figure 3B and 6B), were screened for anti-CRISPR activity as follows :
i. Overnight cultures of said E.coli host cells were prepared by culture in 3ml 2xYT media supplemented with the following antibiotics (50 pg/mL
spectinomycin; 30 pg/mL chloramphenicol and 50 pg/mL kanamycin) at 37°C and 250RPM ;
ii. Said cultures were used to prepared dilutions from 10L0-10L7 and 5mI
volumes of each dilution was spotted onto a 1st and 2nd set of agar plates, in order to determine both cell death and cell viability, having the following agar composition :
1st set of plates used to determine cell death comprised LB agar supplemented with : 2mM theophylline; 1% arabinose and antibiotics (50 pg/mL
spectinomycin; 30 pg/mL chloramphenicol and 50 pg/mL kanamycin);
2nd set of plates used to determine cell viability comprised LB agar
supplemented with : 2mM theophylline; 1% arabinose; and antibiotics (50 pg/mL spectinomycin; and 50 pg/mL kanamycin).
The survival of E. coli-GC host cells comprising the negative control plasmid, and grown under conditions that induce expression of the genetic circuit, is significantly reduced (see Figure 5B). However, expression of the, AcrIIA2 is seen to enable cell growth and survival, consistent with the known anti-CRISPR properties of this protein (Figure 5A and B). 3.3 Small molecule libraries screening using the genetic circuit
Host cells comprising the genetic circuit (e.g . E. coli-GC) have been screened using a wide range of small molecules, including peptidomimetics, natural or synthetic compounds, for example libraries of diverse macrocycle compounds. Circa 20 positive hits were obtained from screening a libra ry of diverse macrocycles (Asinex Macrocycle libra ry) providing evidence that the screening method is suitable for detecting molecules having anti-CRISPR activity.
Example 4 Construction and screening of Metagenomic Libraries for anti-
CRISPR proteins
4.1 Construction of expression metagenome libraries from diverse sources
Functional metagenomic libraries from human, cow and pig fecal samples and soil samples were constructed as described previously [Sommer et al. 2009; Genee et a I . ,
2016] . The steps of the procedure were as follows:
i) total DNA was isolated from each of 5 g of fecal samples and a soil sample using the PowerMax Soil DNA Isolation Kit (Mobio Laboratories Inc.); where after total DNA isolated from each sample was treated as follows :
ii) the extracted DNA, in a total sample volume of 200pl in minitubes, was
fragmented by sonication into fragments of an average size of 4-5 kb using a Covaris E210 (Massachusetts, USA);
iii) the sheared DNA (200mI) was size-selected by separation on a 1 % agarose gel with DNA dye and DNA migrating in the 3-6 kb region was extracted using a Qiagen gel extaction kit;
iv) the gel-extracted DNA fragments were end-repaired using an End-It end
repair kit (Epicentre) and selected using PCR clean up column; and v) the end-repaired fragments were blunt-end cloned into the Hindi site, down stream of a constitutive Ptet promoter (lacking the tet repressor) and RBS [SEQ ID No. : 139] in the expression 100-200 ng pZE21 plasmid in a 1 : 5 ratio [Lutz and Bujard, 1997] using the Fast Link ligation kit (Epicentre) over-night and the reaction was cleaned with a spin column and eluted in a 15 mI twice; vi) 2 mI cleaned ligation mix was transformation into 50 mI E. coli toplO cells by electroporation; and cells in the transformation mix were recovered in 1 ml SOC medium for 2 h; using multiple electroporations to increase library size; vii) the transformation mix were plated in 1 : 100 and 1 : 1000 dilutions on
kanamycin 50 pg/ml plates to create a library;
viii) the library was amplified by inoculation into 10 ml LB+Kan 50 and grown over night; 5 ml 50 % glycerol is added and the library is aliquoted into 1 ml portions and frozen at -80 C. In brief, plasmid DNA, comprising the amplified metagenomic library was extracted and recovered using Machery-Negel plasmid DNA purification kit as follows :
1. An overnight culture of bacteria comprising the metagenomic library was grow in 3 ml LB medium supplemented with 50 pg/mL kanamycin;
2 the culture was centrifuged, and the pelleted bacteria were resuspend the bacteria in buffer, to which a denaturing solution was added;
3. the proteins and genomic DNA were pelleted by centrifugation, and the
plasmid-containing supernatant was recovered and purified through the Nucleospin ® plasmid column on the kit; and bound plasmid DNA was eluted from the column using water or a neutral buffer such as Tris : EDTA, to provide a plasmid DNA comprising the metagenomic DNA expression library.
4.2 Transformation and screening of Metagenomic DNA expression library into Host cells, comprising genetic circuit
E.coli TOP10 containing the genetic circuit comprising the plasmids: pCasens3 and pDual3 (E. coli-GC) were made electro-competent and transformed with plasmids for expression of each metagenomic library (obtained in example 4.1) as follows:
i. E.coli TOP10 cells (supplied by ThermoFischer) containing the plasmids :
pCasens3 and pDual3 were cultured in 3 ml of 2xYT media with 50 pg/mL spectinomycin and 30 pg/mL chloramphenicol for 16-18 hours at 37°C and 250 rpm;
ii. the culture from (i) was inoculated 1 : 100 in 3 ml of pre-warmed 2xYT media with 50 pg/mL spectinomycin and 30 pg/mL chloramphenicol; and cultured at 37°C at 250 rpm until the cell density reached 0.5-0.7 OD600;
iii. the cultures from (ii) were cooled on ice; and the cells harvested by
centrifugation at 10 000 xg for 1 min;
iv. the cells from (iii) were washed by re-suspension in cold water and then
harvested again by centrifugation; then the cells were re-suspended in 1 ml glycerol and harvested by centrifugation; and finally the cells were re suspended in the residual water and then placed on ice, ready for electroporation .
v. A 40 p i sample of the electrocompetant cells from (iv) were transferred to microcentrifuge tubes; to which a 1 m I sample of the metagenomic library was added; and this DNA-cell mixture was then transferred to a cold cuvette and electroporated with a pulse having 1.8 kv, 200 ohms and 25 pF. vi. Immediately thereafter, a 975 mI volume of 37°C SOC [Ausubel F.M. et al. (1987)] was added to the electroporated cells from (v), which were mixed and transferred to a 15 ml- tube; and incubated on rollers at 37°C for 1 h;
vii. serial dilutions from 10°- 108 from the recovered cells from (vi) were made; and lOOul of each dilution were plated on selective solid media comprising
Luria Broth agar supplemented with : 2mM theophylline; 1% arabinose; 50 pg/mL spectinomycin; 30 pg/mL chloramphenicol and 50 pg/mL kanamycin; and incubated overnight at 37°C.
viii. Colonies growing on the plates were collected by adding 1 ml H20, after which the colonies were scraped off the plate with a sterile loop. The bacterial cells were then pelleted and washed in water by centrifugation and the plasmid DNA in collected bacterial cells was extracted using a Machery-Negel plasmid DNA purification kit.
4.3 Detection of metagenomic DNA encoding proteins having anti-CRISPR activity
E.coli host cells containing the genetic circuit, transformed with the library of metagenomic DNA expression plasmids (Figure 6A) prepared as described in Example 2.2, were screened for expression of proteins having anti-CRISPR activity (as describe in Example 4.2 and figure 7). Colonies that appeared on the selective growth medium were selected, and their respective metagenomic DNA insert were each re-cloned in the metagenomics DNA expression plasmid (Figure 6A, plasmid to the left). 11 inserts showed anti-CRISPR activity above background in the selection assay, where in several cases the detected activity was similar to that of AcrIIA2 (Figure 8); confirming that the genetic circuit provides an efficient genetic tool for selecting new candidate anti- CRISPR polypeptides from diverse genetic sources.
Example 5 A genetic circuit for screening for anti-CRISPR molecules of Class 1 (Type 1-B) and Class 2 (Type II-A and V-A) CRISPR-Cas systems
The genetic circuit for use in screening for anti-CRISPR molecules, comprising gene(s) encoding a Class 1 or 2 CRISPR-Cas system, in combination with a nucleic acid molecule encoding a cognate guide RNA (cognate palindromic repeat sequence and as required a tracRNA) were validated, as demonstated below. In each case, the cognate RNA was expressed from the pDual plasmid which comprises a gene encoding cognate guide RNA targeting the chloramphenicol resistance gene (CmR), under the control of an arabinose-inducible promoter, a chloramphenicol resistance gene (cat), and a pl5a origin of replication (see right-hand pDua!3 plasmid in Figure 2). 5.1 Construction of a Type I-B CRISPR-Cas screening system
Class 1, type I CRISPR-Cas systems are composed of a multi-subunit subunit protein complex, each subunit encoded by its respective gene. The type I-B CRISPR-Cas operon from Clostridium difficile, encoding the effector protein cas3 and cascade genes cas6, cas7, cas8 and cas5, was cloned into a medium copy number plasmid having a cloDF13 origin of replication; and spectinomycin resistance (aadA gene) to create pTypel-B-Cas. The endogenous promoter of the type I-B CRISPR-Cas operon was replaced with the IPTG inducible strong Ptac promoter.
5.2 Construction of a Type IIA CRISPR-Cas system screening system
Class 2, type II CRISPR-Cas systems comprise a single multidomain protein encoded by a single gene. The type II-A CRISPR-Cas gene from Streptococcus thermophilus , encoding the S. thermophilus Cas9 protein (SEQ ID No: 168), was cloned in a plasmid ( Esvelt et a I . , 2013), such that expression of Cas9 was placed under control of an
IPTG-inducible Ptac promoter, creating plasmid pTypell-A-Cas.
5.3 Construction of a Type V-A CRISPR-Cas system screening system
Class 2, type V CRISPR-Cas systems also comprise a single multidomain protein encoded by a single gene. The type V-A CRISPR-Cas from Franciseiia novicida strain ul l2 comprises a Casl2 gene (formerly designated Cpfl), encoding the Franciseiia novicida Casl2 effector protein (WP_014550095.1; SEQ ID No : 91), was cloned in a medium copy number plasmid having a cloDF13 origin of replication, and
spectinomycin resistance (aadA gene) (as described by Zetsche et al ., 2015). The plasmid was modified to place expression of Casl2 gene under control of the strong constitutive promoter J23100 [SEQ ID No. : 172] (as defined by
http://parts.igem.org/Promoters/Catalog/Anderson), regulated by a DNA sequence [SEQ ID No. : 149] encoding a theophylline riboswitch; creating plasmid pTypeV-A-Cas. 5.4 Testing host cells transformed with genetic circuits expressing Class 1
(Type 1-B) and Class 2 (Type II-A and V-A) CRISPR-Cas systems
E.coli TOPIO were made electro-competent and transformed with plasmids comprising :
• pTypel-B-Cas (comprising coding sequences for cas3 [SEQ ID No. : 12] , cas6 [SEQ ID No. : 13], cas7[SEQ ID No. : 14], cas8 [SEQ ID No. : 15] and cas5[SEQ ID No. : 16]) and pDual4 (comprising Cognate palindromic repeat SEQ ID
No. : 110); or
• pTypell-A-Cas (comprising coding sequence for Cas9 2 having SEQ ID No :
168) and pDual5 (comprising Cognate palindromic repeat SEQ ID No. : 169 and tracRNA gene having SEQ ID No : 170); • pTypeV-A-Cas (comprising coding sequence for Casl2; SEQ ID No : 91) and pDual6 (comprising Cognate palindromic repeat SEQ ID No. : 125)
as described in example 1.3. Transformants were cultured overnight at 37°C with shaking (250RPM) in a growth medium supplemented with antibiotics (spectinomycin 50ug/mL, kanamycin 50ug/mL, and chloramphenicol 30ug/mL); but devoid of inducers of CRISPR-Cas expression. A subculture, derived from the over-night culture, was incubated under aerobic condition at 37C for ~6hrs in a growth medium devoid of antibiotics, but, supplemented with 1% arabinose and either 0, 10, 50 or lOOuM IPTG (or 2mM theophylline as inducer for pTypeV-A-Cas) to activate expression of the sgRNA and CRISPR-Cas system respectively. The resulting cultures were serially diluted from 10 1 to 10 7 and then plated on LB-agar with and without chloramphenicol.
As demonstated in Figures 9, 10 and 11, host E. coli cells transformed with each of the 3 tested CRISPR-Cas systems were able to grow under both non-selective and selective conditions (chloramphenicol media) provided that the genetic circuit encoding the CRISPR-Cas was not induced . Addition of as little as lOuM IPTG to induce expression of the genetic system, was for all 3 CRISPR-Cas systems sufficient to lead to a detectable loss of cell viability when grown under selective conditions. Example 6.0 Enhancing the efficiency of the method for in vivo detection of anti-CRISPR small molecules or polypeptides using recombinant biological cell of the invention.
The frequency of false positives detected using the method for functional screening for anti-CRISPR molecules is reduced by employing biological cells comprising the genetic circuit according to the invention, wherein expression of the RNA guided CRISPR-Cas activity is controlled by a inducible strong promoter, and a selectable plasmid having the reporter gene, in combination with the use of a defined growth medium, such that induction of RNA guided CRISPR-Cas activity and reporter gene selection is retricted to defined steps of the method . The method (with reference to Figure 12) was tested by:
1) Providing E. coli cells, comprising the genetic circuit on pCasens3 and pDual3; by cultivating the cells under aerobic conditions in M9 minimal medium (http://cshprotocols.cshlp.Org/content/2010/8/pdb. recl2295.short), supplemented with the antibiotics (spectinomycin 50ug/mL and chloramphenicol 30ug/mL) and incubating the culture at 37°C; 250RPM without inducers(figure 12, step 1). Since the medium is devoid of the inducing agents, theophylline and arabinose, E. coli cells comprising the genetic circuit are selected without inducing expression of the RNA- guided CRISPR-Cas genes. 2) An aliquot of the cell culture produced in step 1) is inoculated into M9 minimal medium; said medium being both devoid of the inducing agents (theophylline and arabinose) and antibiotics (spectinomycin and chloramphenicol) to produce a cell culture.
3) Cells, derived from the cell culture produced in step 2, are transformed with the control plasmid comprising a gene [SEQ ID No. : 140] encoding Green Fluorescent Protein (GFP) under the control of a constitutive Ptet promoter [SEQ ID No. : 139] .
4) Transformed cells, obtained in step 3, are incubated at 37°C for a period of ~ lh, in M9 minimal medium, sufficient to allow expression of the GFP protein, and providing a transformed cell culture.
5) The transformed cell culture obtained in step 4 is supplemented with 2mM theophylline; 1% arabinose; and the suspensions are incubated at 37°C; 250RPM for a period of ~2h, to allow induced expression of the RNA-guided CRISPR-Cas genes, and inactivation of the CmR reporter gene in cells. Since the growth medium is devoid of chloramphenicol, growth conditions are non-selective for functional expression of this reporter gene.
6) Cells derived from the incubated cell suspension resulting from step 5) are plated on a defined media agar supplemented with both 2mM theophylline; 1% arabinose;and the antibiotics: spectinomycin 50ug/mL, kanamycin 50 pg/mL, and chloramphenicol 30ug/mL) and incubated at 37°C for a time sufficient to detect viable biological cells.
The use of a defined growth medium for using the CRISPR-Cas screening method of the invention is shown to reduce the number of false positives for each of 5 different types of CRISPR-Cas (Figure 13).
The effect of the strength of the promoter used to control inducible expression of RNA guided CRISPR-Cas activity on the efficiency of inactivation of the reporter gene (camR) gene cleavage, measured as cell survivors, and corresponding reduction in false-positives using the screening assay is demonstated in Figure 14. The three types of cells tested comprised the CRISPR-Cas9 genetic cicuit, differing only in that their respective Cas9 gene was operably linked to a promoter having promoter strengths of 0.16; 0.58 or 1.00 on the Anderson scale, each combined with a theophylline riboswitch 5' of the cas9 gene. The cells were tested according to steps 1, 2, 5 and 6, and omitting transformation steps 3 and 4 (Figure 12). References
Ausubel F.M. et al. (1987) Current Protocols in Molecular Biology ;
Esvelt, K. M ., Mali, P., Braff, J . L, Moosburner, M ., Yaung, S. J., & Church, G. M . (2013). Orthogonal Cas9 proteins for RNA-guided gene regulation and editing . Nature Methods, 10(11), 1116-21. https://doi.org/10.1038/nmeth.2681
Genee, H. J., Bonde, M. T., Bagger, F. O., Jespersen, J . B., Sommer, M . O. a, Wernersson, R., & Olsen, L. R. (2014) . Software-supported USER cloning strategies for site-directed mutagenesis and DNA assembly. ACS Synthetic Biology, 4(3), 342- 349. https ://doi .org/10.1021/sb500194z
Genee H J, Bali A P, Petersen SD, Siedler, S, Bonde MT, Gronenberg LS, Kristensen, M and Harrison SJ and MOA Sommer (2016) Functional mining of transporters using synthetic selections. Nature Chemical Biology 12, 1015-1022;
Koonin, E. V., Makarova, K. S., & Zhang, F. (2017). Diversity, classification and evolution of CRISPR-Cas systems. Current Opinion in Microbiology, 37, 67-78.
https://doi .Org/10.1016/j. mib.2017.05.008
Lutz, R., Bujard, H., 1997. Independent and tight regulation of transcriptional units in Escherichia coli via the LacR / O , the TetR / O and AraC / I 1 -I 2 regulatory elements. Pharmacia 25: 1203-1210.
Martinez-Garcia, E., Aparicio, T., Gohi-Moreno, A., Fraile, S., & De Lorenzo, V.
(2015). SEVA 2.0: An update of the Standard European Vector Architecture for de-
/re-construction of bacterial functionalities. Nucleic Acids Research, 43(D1), D1183- D1189. https ://doi.org/10.1093/nar/gkul l l4
Sommer, M.O., Dantas, G. & Church, G. M. (2009) Functional characterization of the antibiotic resistance reservoir in the human microflora . Science 325, 1128-1131. Zetsche, B., Gootenberg, J. S., Abudayyeh, O. O., Slaymaker, I. M ., Makarova, K. S., Essletzbichler, P., ... Zhang, F. (2015). Cpfl Is a Single RNA-Guided Endonuclease of a Class 2 CRISPR-Cas System. Cell. https ://doi.Org/10.1016/j.cell .2015.09.038

Claims

Claims
1. A method for in vivo detection of regulators of CRISPR-Cas associated activity comprising :
a. providing biological cells capable of CRISPR-Cas associated activity wherein said cells are prepared on a defined medium devoid of one or more agents for inducing expression of said activity, and either
I. contacting said cells with a molecule or library of molecules; or II. introducing a self-replicating library of non-host DNA
fragments into said cells;
b. culturing the cells obtained from step (a) (I) or (II) on said defined culture medium for a period sufficient for uptake or expression of a candidate regulator of CRISPR-Cas associated activity; c. culturing the cells obtained from step (b) on said defined culture
medium, wherein said medium is supplemented with one or more agents to induce expression of said CRISPR-Cas associated activity and
d. detecting expression of a reporter gene present in cultured cells
obtained from step (c),
wherein said cells comprise:
i. a first gene encoding a polypeptide or a first gene cluster encoding a polypeptide complex, said polypeptide or polypeptide complex having RNA-guided endonuclease activity, wherein said gene or gene cluster is operably linked to a promoter having a measured strength of >0.65 on the Anderson scale, and whose expression is inducible by at least one of said one or more agents;
ii . a reporter gene operably linked to a promoter;
iii. a nucleic acid molecule operably linked to a promoter and whose
expression is inducible by at least one of said one or more agents, wherein said nucleic acid molecule encodes one or more RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA-guided endonuclease activity to said reporter gene or its transcript, and
wherein the polypeptide or polypeptide complex having RNA-guided endonuclease activity is capable of inactivating said reporter gene or its transcript.
2. A method for in vivo detection of regulators of CRISPR-Cas associated activity of claim 1, wherein said polypeptide or polypeptide complex having RNA- guided endonuclease activity is selected from a group of CRISPR-Cas systems consisting of Type 1, Type II, Type III; Type IV, Type V, and Type VI.
3. A method for in vivo detection of regulators of CRISPR-Cas associated activity of claims 1 or 2, wherein the amino acid sequence(s) of said polypeptide or each member of said polypeptide complex having RNA-guided endonuclease activity has at least 80% amino acid sequence identity to an amino acid sequence or a corresponding member of a set of amino acid sequences selected from among Group A; and
wherein one of said RNA molecules comprises at least one cognate palindromic repeat sequence and one of said RNA molecules comprises a cognate trans activating crRNA or one of said RNA molecules comprises a fusion at least one cognate palindromic repeat sequence and one cognate trans-activating crRNA, wherein said repeat sequence is encoded by a nucleotide sequence at the corresponding position in the list of Group B; and wherein said cognate trans activating crRNA is encoded by a nucleotide sequence at the corresponding position in the list of Group C
Figure imgf000045_0001
4. A method for in vivo detection of regulators of CRISPR-Cas associated activity of claims 1 or 2, wherein the amino acid sequence of each member of said polypeptide complex having RNA-guided endonuclease activity has at least 80% amino acid sequence identity to a corresponding member of a set of amino acid sequences selected from among Group A;
wherein said one or more RNA molecules comprises at least one cognate palindromic repeat sequence, wherein said repeat sequence is encoded by a nucleotide sequence at the corresponding position in the list of Group B
Figure imgf000045_0002
Figure imgf000046_0001
5. A method for in vivo detection of regulators of CRISPR-Cas associated activity according to any one of claims 1-3, wherein said first gene encodes a Cas9 polypeptide having RNA-guided endonuclease activity, and
wherein one of said RNA molecules is a mat-crRNA and one of said RNA molecules is a cognate trans-activating crRNA, or said one RNA molecule is a fusion of a mat-crRNA and a cognate trans-activating crRNA.
6. A method for in vivo detection of regulators of CRISPR-Cas associated activity according to any one of claims 1-5, wherein the non-host DNA fragments of said self-replicating library are synthetic DNA or metagenomic DNA.
7. A method for in vivo detection of regulators of CRISPR-Cas associated activity according to any one of claims 1-6, wherein said biological cell is a bacterial cell.
8. A biological cell comprising :
i. a first gene encoding a polypeptide or a first gene cluster encoding a polypeptide complex, said polypeptide or polypeptide complex having RNA- guided endonuclease activity, wherein said first gene or first gene cluster is operably linked to a promoter having a measured strength of >0.65 on the Anderson scale, and whose expression is inducible by at least one agent; ii . a reporter gene operably linked to a promoter,
iii . a nucleic acid molecule operably linked to a promoter and whose
expression is inducible by at least one agent,
wherein said nucleic acid molecule encodes one or more RNA molecules capable of guiding said polypeptide or polypeptide complex having RNA- guided endonuclease activity to said reporter gene or its transcript and wherein the polypeptide or polypeptide complex having RNA-guided endonuclease activity is capable of inactivating said reporter gene or its transcript.
9. A biological cell according to claim 8, where said first gene or said nucleic acid molecule is operably linked to a constitutive promoter and a riboswitch capable of regulating translation of said gene or gene cluster in response to said at least one agent.
10. A biological cell according to claims 8 or 9, wherein said first gene or said nucleic acid molecule is operably linked to an inducible promoter and a cis element encoding a transcription factor capable of inducing expression of said a nucleic acid molecule in response to said at least one agent.
11. A biological cell according to any one of claims 8 to 10, wherein said first gene encodes a Cas9 polypeptide having RNA-guided endonuclease activity, and wherein one of said RNA molecules is a mat-crRNA and one of said RNA molecules is a cognate trans-activating crRNA, or said one RNA molecule is a fusion of a mat-crRNA and a cognate trans-activating crRNA.
12. A biological cell according to any one of claims 8 to 11, wherein said cell is a bacterial cell .
13. Use of a biological cell according to any one of claims 8 to 12, for detecting a regulator of CRISPR-Cas associated activity.
14. A biological cell library of non-host DNA fragments, wherein each cell of the library is a cell according to any one of claims 8 to 12, and wherein each cell further comprises a self-replicating genetic element comprising
a. a non-host DNA fragment, and b. a gene encoding a selectable marker polypeptide and operably linked to a promoter.
15. Use of a biological cell library of non-host DNA fragments according to claim 14, for detecting a regulator of CRISPR-Cas associated activity.
PCT/EP2019/057785 2018-03-27 2019-03-27 A high-throughput platform to select for regulators of crispr-cas associated activity WO2019185750A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP18164235 2018-03-27
EP18164235.6 2018-03-27
EP18209315 2018-11-29
EP18209315.3 2018-11-29

Publications (1)

Publication Number Publication Date
WO2019185750A1 true WO2019185750A1 (en) 2019-10-03

Family

ID=65818536

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2019/057785 WO2019185750A1 (en) 2018-03-27 2019-03-27 A high-throughput platform to select for regulators of crispr-cas associated activity

Country Status (1)

Country Link
WO (1) WO2019185750A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3831940A1 (en) * 2019-12-05 2021-06-09 Danmarks Tekniske Universitet Small molecule inhibitors of crispr-cas associated activity

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016011080A2 (en) * 2014-07-14 2016-01-21 The Regents Of The University Of California Crispr/cas transcriptional modulation
WO2016108926A1 (en) * 2014-12-30 2016-07-07 The Broad Institute Inc. Crispr mediated in vivo modeling and genetic screening of tumor growth and metastasis
WO2017160689A1 (en) * 2016-03-15 2017-09-21 University Of Massachusetts Anti-crispr compounds and methods of use
WO2017186854A1 (en) * 2016-04-27 2017-11-02 Danmarks Tekniske Universitet High throughput in vivo protease inhibitor selection platform

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016011080A2 (en) * 2014-07-14 2016-01-21 The Regents Of The University Of California Crispr/cas transcriptional modulation
WO2016108926A1 (en) * 2014-12-30 2016-07-07 The Broad Institute Inc. Crispr mediated in vivo modeling and genetic screening of tumor growth and metastasis
WO2017160689A1 (en) * 2016-03-15 2017-09-21 University Of Massachusetts Anti-crispr compounds and methods of use
WO2017186854A1 (en) * 2016-04-27 2017-11-02 Danmarks Tekniske Universitet High throughput in vivo protease inhibitor selection platform

Non-Patent Citations (19)

* Cited by examiner, † Cited by third party
Title
A. PAWLUK ET AL: "A New Group of Phage Anti-CRISPR Genes Inhibits the Type I-E CRISPR-Cas System of Pseudomonas aeruginosa", MBIO, vol. 5, no. 2, 15 April 2014 (2014-04-15), pages e00896 - 14, XP055474866, DOI: 10.1128/mBio.00896-14 *
APRIL PAWLUK ET AL: "Anti-CRISPR: discovery, mechanism and function", NATURE REVIEWS. MICROBIOLOGY, vol. 16, no. 1, 24 October 2017 (2017-10-24), GB, pages 12 - 17, XP055475543, ISSN: 1740-1526, DOI: 10.1038/nrmicro.2017.120 *
APRIL PAWLUK ET AL: "Inactivation of CRISPR-Cas systems by anti-CRISPR proteins in diverse bacterial species", NATURE MICROBIOLOGY, vol. 1, no. 8, 13 June 2016 (2016-06-13), XP055474868, DOI: 10.1038/nmicrobiol.2016.85 *
AUSUBEL F.M. ET AL., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, 1987
CHUAN DONG ET AL: "Anti-CRISPRdb: a comprehensive online resource for anti-CRISPR proteins", NUCLEIC ACIDS RESEARCH, vol. 46, no. D1, 25 September 2017 (2017-09-25), pages D393 - D398, XP055475568, ISSN: 0305-1048, DOI: 10.1093/nar/gkx835 *
DE DONG ET AL: "Structural basis of CRISPR-SpyCas9 inhibition by an anti-CRISPR protein", NATURE, vol. 546, no. 7658, 27 April 2017 (2017-04-27), GB, pages 436 - 439, XP055452726, ISSN: 0028-0836, DOI: 10.1038/nature22377 *
ESVELT, K. M.; MALI, P.; BRAFF, J. L.; MOOSBURNER, M.; YAUNG, S. J.; CHURCH, G. M.: "rthogonal Cas9 proteins for RNA-guided gene regulation and editing", NATURE METHODS, vol. 10, no. 11, 2013, pages 1116 - 21, XP055128928, Retrieved from the Internet <URL:https://doi.org/10.1038/nmeth.2681> DOI: doi:10.1038/nmeth.2681
GENEE H J; BALI A P; PETERSEN SD; SIEDLER, S; BONDE MT; GRONENBERG LS; KRISTENSEN, M; HARRISON SJ; MOA SOMMER: "Functional mining of transporters using synthetic selections", NATURE CHEMICAL BIOLOGY, vol. 12, 2016, pages 1015 - 1022
GENEE, H. J.; BONDE, M. T.; BAGGER, F. O.; JESPERSEN, J. B.; SOMMER, M. O. A; WERNERSSON, R.; OLSEN, L. R.: "Software-supported USER cloning strategies for site-directed mutagenesis and DNA assembly", ACS SYNTHETIC BIOLOGY, vol. 4, no. 3, 2014, pages 342 - 349, Retrieved from the Internet <URL:https://doi.org/10.1021/sb500194z>
JOSEPH BONDY-DENOMY ET AL: "Multiple mechanisms for CRISPR-Cas inhibition by anti-CRISPR proteins", NATURE, vol. 526, no. 7571, 23 September 2015 (2015-09-23), GB, pages 136 - 139, XP055474577, ISSN: 0028-0836, DOI: 10.1038/nature15254 *
KOONIN, E. V.; MAKAROVA, K. S.; ZHANG, F.: "Diversity, classification and evolution of CRISPR-Cas systems", CURRENT OPINION IN MICROBIOLOGY, vol. 37, 2017, pages 67 - 78, XP085276922, Retrieved from the Internet <URL:https://doi.org/10.1016/j.mib.2017.05.008> DOI: doi:10.1016/j.mib.2017.05.008
LUTZ, R.; BUJARD, H.: "Independent and tight regulation of transcriptional units in Escherichia coli via the LacR / O , the TetR / O and AraC / I 1 -I 2 regulatory elements", PHARMACIA, vol. 25, 1997, pages 1203 - 1210, XP001084137, DOI: doi:10.1093/nar/25.6.1203
MARTINEZ-GARCIA, E.; APARICIO, T.; GONI-MORENO, A.; FRAILE, S.; DE LORENZO, V.: "SEVA 2.0: An update of the Standard European Vector Architecture for de-/re-construction of bacterial functionalities", NUCLEIC ACIDS RESEARCH, vol. 43, no. D1, 2015, pages D1183 - D1189, Retrieved from the Internet <URL:https://doi.org/10.1093/nar/gkuII14>
RAUCH BENJAMIN J ET AL: "Inhibition of CRISPR-Cas9 with Bacteriophage Proteins", CELL, CELL PRESS, AMSTERDAM, NL, vol. 168, no. 1, 29 December 2016 (2016-12-29), pages 150, XP029882136, ISSN: 0092-8674, DOI: 10.1016/J.CELL.2016.12.009 *
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS
SOMMER, M.O.; DANTAS, G.; CHURCH, G.M.: "Functional characterization of the antibiotic resistance reservoir in the human microflora", SCIENCE, vol. 325, 2009, pages 1128 - 1131, XP055130442, DOI: doi:10.1126/science.1176950
YANG HUI ET AL: "Inhibition Mechanism of an Anti-CRISPR Suppressor AcrIIA4 Targeting SpyCas9", MOLECULAR CELL, ELSEVIER, AMSTERDAM, NL, vol. 67, no. 1, 9 June 2017 (2017-06-09), pages 117, XP085122090, ISSN: 1097-2765, DOI: 10.1016/J.MOLCEL.2017.05.024 *
YUWEI ZHU ET AL: "Structural insights into the inactivation of CRISPR-Cas systems by diverse anti-CRISPR proteins", BMC BIOLOGY, vol. 16, no. 1, 19 March 2018 (2018-03-19), XP055474878, DOI: 10.1186/s12915-018-0504-9 *
ZETSCHE, B.; GOOTENBERG, J. S.; ABUDAYYEH, O. O.; SLAYMAKER, I. M.; MAKAROVA, K. S.; ESSLETZBICHLER, P.; ZHANG, F.: "Cpfl Is a Single RNA-Guided Endonuclease of a Class 2 CRISPR-Cas System", CELL, 2015, Retrieved from the Internet <URL:https://doi.org/10.1016/j.ce.2015.09.038>

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3831940A1 (en) * 2019-12-05 2021-06-09 Danmarks Tekniske Universitet Small molecule inhibitors of crispr-cas associated activity

Similar Documents

Publication Publication Date Title
Wang et al. Improved seamless mutagenesis by recombineering using ccdB for counterselection
Yim et al. Isolation of fully synthetic promoters for high‐level gene expression in Corynebacterium glutamicum
Olsen et al. Function-based isolation of novel enzymes from a large library
Fernández‐Cabezón et al. Evolutionary approaches for engineering industrially relevant phenotypes in bacterial cell factories
Liang et al. Genome engineering of E. coli for improved styrene production
EP1501947A2 (en) Method of creating a library of bacterial clones with varying levels of gene expression
Baltz Genetic methods and strategies for secondary metabolite yield improvement in actinomycetes
WO2019185751A1 (en) Inhibitors of crispr-cas associated activity
US20140170708A1 (en) Compositions and methods for creating altered and improved cells and organisms
WO2019185750A1 (en) A high-throughput platform to select for regulators of crispr-cas associated activity
O'connor et al. Isolation of spectinomycin resistance mutations in the 16S rRNA of Salmonella enterica serovar Typhimurium and expression in Escherichia coli and Salmonella
JP5757608B2 (en) Method of selecting gene switch and gene circuit
WO2014093402A2 (en) A synthetic transcription factor and uses thereof
US20210123065A1 (en) Recombination systems for high-throughput chromosomal engineering of bacteria
EP2935582B1 (en) Compositions and methods for creating altered and improved cells and organisms
WO2020036181A1 (en) Method for isolating or identifying cell, and cell mass
KR101411716B1 (en) Method for Screening Microorganism with High Lysine Productivity Using Riboswitch
KR20130142942A (en) Method for screening microorganism with high l-tryptophan productivity using riboswitch
WO2020081958A2 (en) Compositions and methods for identifying mutations of genes of multi-gene systems having improved function
EP1386966A1 (en) Method for the expression of unknown environmental DNA into adapted host cells
CN113677795B (en) Novel DAHP synthetase
CN116179513B (en) Cpf1 protein and application thereof in gene editing
WO2003064623A2 (en) Methods and vectors for facilitating site-specific recombination
Liang et al. Genome engineering of E. coli for improved styrene production and polymerization
WO2024038003A1 (en) Methods and systems for generating nucleic acid diversity in crispr-associated genes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19712242

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19712242

Country of ref document: EP

Kind code of ref document: A1