EP4069716A1 - Particle delivery systems - Google Patents
Particle delivery systemsInfo
- Publication number
- EP4069716A1 EP4069716A1 EP20829466.0A EP20829466A EP4069716A1 EP 4069716 A1 EP4069716 A1 EP 4069716A1 EP 20829466 A EP20829466 A EP 20829466A EP 4069716 A1 EP4069716 A1 EP 4069716A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- xdp
- sequence
- seq
- protein
- components
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/111—General methods applicable to biologically active non-coding nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2320/00—Applications; Uses
- C12N2320/30—Special therapeutic applications
- C12N2320/32—Special delivery means, e.g. tissue-specific
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16023—Virus like particles [VLP]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16041—Use of virus, viral particle or viral elements as a vector
- C12N2740/16043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16211—Human Immunodeficiency Virus, HIV concerning HIV gagpol
- C12N2740/16222—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Definitions
- the Retroviridae family of viruses encompass several genera of viruses that cause chronic and deadly diseases characterized by long incubation periods, in humans and other mammalian species.
- the Retroviridae family includes Othoretrovirinae (Lentivirus, Alpharetrovirus, Betaretrovirus, Deltaretrovirus, Epsilonretrovirus, Gammaretrovirus), and Spumaretrovirinae .
- the best known lentivirus is the Human Immunodeficiency Virus (HIV), which causes acquired immune deficiency syndrome (AIDS).
- HIV Human Immunodeficiency Virus
- lentiviruses have gag, pol and env genes, coding for viral proteins in the order: 5'-gag-pol-env- 3'.
- the lentivirus system has been adapted to introduce gene editing systems into human or animal cells by the creation of virus-like particles (VLP) containing the gene editing systems.
- Retroviral systems have advantages over other gene-therapy methods, including high-efficiency infection of dividing and non-dividing cells, long-term stable expression of a transgene, and low immunogenicity.
- Lentiviruses have been successfully used for transduction of diabetic mice with the gene encoding PDGF (platelet-derived growth factor), a therapy being considered for use in humans (Lee JA, et al. Lentiviral transfection with the PDGF-B gene improves diabetic wound healing. Plast. Reconstr. Surg. 116 (2): 532 (2005)).
- VLP VLP-like therapeutics
- CRISPR nucleases CRISPR nucleases
- the present disclosure provides delivery particle (XDP) systems for the delivery of therapeutic payloads, including proteins, nucleic acids, small molecules and the like to target cells and tissues.
- XDP delivery particle
- the XDP system comprises nucleic acids encoding components selected from all or a portion of a retroviral gag polyprotein, a therapeutic payload, and a tropism factor, wherein the tropism factor is selected from the group consisting of a glycoprotein, an antibody fragment, a receptor, and a ligand to a target cell marker.
- the tropism factor is a glycoprotein having a sequence selected from the group of sequences consisting of SEQ ID NOS: 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494, 496,
- the glycoprotein is VSV-G.
- the glycoprotein comprises a sequence of SEQ ID NO: 438.
- the therapeutic payload can be a protein, a nucleic acid, or both a protein and a nucleic acid.
- the protein payload is selected from the group consisting of a cytokine, an interleukin, an enzyme, a receptor, a microprotein, a hormone, erythropoietin, a ribonuclease (RNAse), a deoxyribonuclease (DNAse), a blood clotting factor, an anticoagulant, a bone morphogenetic protein, an engineered protein scaffold, a thrombolytic protein, a CRISPR protein, and an anti-cancer modality.
- the therapeutic payload is a Class 1 or Class 2 CRISPR protein, wherein the Class 2 CRISPR protein selected from the group consisting of a Type II, Type V, or Type VI protein.
- the Class 2 CRISPR Type V protein is selected from the group consisting of Casl2a, Casl2b, Casl2c, Casl2d (CasY), Casl2j and CasX, wherein the CasX comprises a sequence of SEQ ID NOS: 21-233, 343-345, 350-353, 355-367 or 388-397 as set forth in Tables 1, 7, 8, 9, or 11, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% sequence identity thereto.
- the CasX comprises a sequence of SEQ ID NOS: 21-233, 343-345, 350-353, 355-367 or 388- 397.
- the therapeutic payload is a nucleic acid selected from the group consisting of a single-stranded antisense oligonucleotide (ASOs), a double-stranded RNA interference (RNAi) molecule, a DNA aptamer, and a CRISPR guide nucleic acid, wherein the CRISPR guide nucleic acid is a single-molecule guide RNA comprising a scaffold sequence and a targeting sequence, wherein the targeting sequence comprises between 14 and 30 nucleotides and is complementary to a target nucleic acid sequence, and wherein the scaffold sequence comprises a sequence of SEQ ID NOS: 597-781 as set forth in Table 3, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at
- the XDP system further comprises nucleic acids encoding one or more components selected from one or more protease cleavage sites, a gag-transframe region- pol protease polyprotein (gag-TFR-PR), a retroviral gag-pol polyprotein, and a non-retroviral protease capable of cleaving the protease cleavage sites.
- a gag-transframe region- pol protease polyprotein gag-TFR-PR
- retroviral gag-pol polyprotein a retroviral gag-pol polyprotein
- non-retroviral protease capable of cleaving the protease cleavage sites.
- the retroviral components of the XDP system are derived from a Orthoretrovirinae virus or a Spumaretrovirinae virus wherein the Orthoretrovirinae virus is selected from the group consisting of Alpharetrovirus, Betaretrovirus, Deltaretrovirus, Epsilonretrovirus, Gammaretrovirus, and Lentivirus, and the Spumaretrovirinae virus is selected from the group consisting of Bovispumavirus, Equispumavirus, Felispumavirus, Prosimiispumavirus, Simiispumavirus, and Spumavirus.
- Orthoretrovirinae virus is selected from the group consisting of Alpharetrovirus, Betaretrovirus, Deltaretrovirus, Epsilonretrovirus, Gammaretrovirus, and Lentivirus
- the Spumaretrovirinae virus is selected from the group consisting of Bovispumavirus, Equispumavirus, Felispumavirus, Prosimiispumavirus,
- the components of the XDP system are encoded on a single nucleic acid, on two nucleic acids, on three nucleic acids, on four nucleic acids, or on five nucleic acids, and the nucleic acids are configured according to any one of FIGS. 36-68.
- the components of the XDP system are encoded by nucleic acids selected from the group of sequences of SEQ ID NOS: 192, 193, 195, 196, 198-201, 782, and 234-339 as set forth in Table 5.
- the components of the XDP system are capable of self assembling into an XDP when the one or more nucleic acids are introduced into a eukaryotic host cell and are expressed.
- the therapeutic payload is encapsidated within the XDP upon self-assembly of the XDP.
- the therapeutic payload comprises a CasX and a guide RNA
- the CasX and guide RNA are complexed as a ribonucleoprotein complex (RNP) and, optionally, a donor template is also encapsidated in the XDP.
- RNP ribonucleoprotein complex
- the tropism factor is incorporated on the XDP surface upon self-assembly of the XDP.
- the nucleic acids encoding the retroviral components are all or a portion of an Alpharetrovirus gag polyprotein, wherein the gag polyprotein comprises one or more components selected from the group consisting of a matrix polypeptide (MA), a P2A peptide, a P2B peptide, a P10 peptide, a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC).
- MA matrix polypeptide
- P2A peptide a P2A peptide
- P2B peptide a P10 peptide
- CA capsid polypeptide
- NC nucleocapsid polypeptide
- the nucleic acids further comprise sequences encoding one or more components selected from an HIV pi peptide, an HIV p6 peptide, a Gag-Pol polyprotein, one or more protease cleavage sites, a non-retroviral, heterologous protease capable of cleaving the cleavage sites, and a gag-transframe region-pol protease polyprotein.
- the nucleic acids encoding the retroviral components are all or a portion of an Betaretrovirus gag polyprotein, wherein the gag polyprotein comprises one or more components selected from the group consisting of a matrix polypeptide (MA), a PP21/24 peptide, a P12/P3/P8 peptide, a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC).
- MA matrix polypeptide
- PP21/24 peptide a PP21/24 peptide
- P12/P3/P8 peptide a capsid polypeptide
- CA capsid polypeptide
- NC nucleocapsid polypeptide
- the nucleic acids further comprise sequences encoding one or more components selected from an HIV pi peptide, an HIV p6 peptide, a Gag-Pol polyprotein, one or more protease cleavage sites, a non-retroviral, heterologous protease capable of cleaving the cleavage sites, and a gag-transframe region-pol protease polyprotein.
- the nucleic acids encoding the retroviral components are all or a portion of a Deltaretrovirus gag polyprotein, wherein the gag polyprotein comprises one or more components selected from the group consisting of a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC).
- MA matrix polypeptide
- CA capsid polypeptide
- NC nucleocapsid polypeptide
- the nucleic acids further comprise sequences encoding one or more components selected from an HIV pi peptide, an HIV p6 peptide, a Gag-Pol polyprotein, one or more protease cleavage sites, a non-retroviral, heterologous protease capable of cleaving the cleavage sites, and a gag-transframe region-pol protease polyprotein.
- the nucleic acids encoding the retroviral components are all or a portion of a Epsilonretrovirus gag polyprotein, wherein the gag polyprotein comprises one or more components selected from the group consisting of a matrix polypeptide (MA), a p20 peptide, a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC).
- MA matrix polypeptide
- CA capsid polypeptide
- NC nucleocapsid polypeptide
- the nucleic acids further comprise sequences encoding one or more components selected from an HIV pi peptide, an HIV p6 peptide, a Gag- Pol polyprotein, one or more protease cleavage sites, a non-retroviral, heterologous protease capable of cleaving the cleavage sites, and a gag-transframe region-pol protease polyprotein.
- the nucleic acids encoding the retroviral components are all or a portion of a Gammanretrovirus gag polyprotein, wherein the gag polyprotein comprises one or more components selected from the group consisting of a matrix polypeptide (MA), a pl2 peptide, a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC).
- MA matrix polypeptide
- CA capsid polypeptide
- NC nucleocapsid polypeptide
- the nucleic acids further comprise sequences encoding one or more components selected from an HIV pi peptide, an HIV p6 peptide, a Gag- Pol polyprotein, one or more protease cleavage sites, a non-retroviral, heterologous protease capable of cleaving the cleavage sites, and a gag-transframe region-pol protease polyprotein.
- the nucleic acids encoding the retroviral components are all or a portion of a Lentivirus gag polyprotein, wherein the gag polyprotein comprises one or more components selected from the group consisting of a matrix polypeptide (MA), a capsid polypeptide (CA), a p2 peptide, a nucleocapsid polypeptide (NC), a pi peptide, and a p6 peptide.
- MA matrix polypeptide
- CA capsid polypeptide
- NC nucleocapsid polypeptide
- pi peptide a p6 peptide
- the nucleic acids further comprise sequences encoding one or more components selected from a Gag-Pol polyprotein, one or more protease cleavage sites, a non-retroviral, heterologous protease capable of cleaving the cleavage sites, and a gag-transframe region-pol protease polyprotein.
- the nucleic acids encoding the retroviral components are all or a portion of a Spumaretrovirinae gag polyprotein, wherein the gag polyprotein comprises one or more components selected from the group consisting of a p68 Gag polypeptide and a p3 Gag polypeptide.
- the nucleic acids further comprise sequences encoding one or more components selected from an HIV pi peptide, an HIV p6 peptide, a Gag-Pol polyprotein, one or more protease cleavage sites, a non- retroviral, heterologous protease capable of cleaving the cleavage sites, and a gag-transframe region-pol protease polyprotein.
- the CasX further comprises one or more NLS selected from the group of sequences consisting of PKKKRKV (SEQ ID NO: 130), KRPAATKKAGQAKKKK (SEQ ID NO: 131), PAAKRVKLD (SEQ ID NO: 132), RQRRNELKRSP (SEQ ID NO: 133),
- the non-retroviral, heterologous protease is selected from the group consisting of tobacco etch virus protease (TEV), potyvirus HC protease, potyvirus PI protease, PreScission (HRV3C protease), b virus NIa protease, B virus RNA-2- encoded protease, aphthovirus L protease, enterovirus 2A protease, rhinovirus 2A protease, picorna 3C protease, comovirus 24K protease, nepovirus 24K protease, RTSV (rice tungro spherical vims) 3C-like protease, parsnip yellow fleck vims protease, 3C-like protease, heparin, cathepsin, thrombin, factor Xa, metalloproteinase, and enterokinase.
- TSV tobacco etch virus protease
- the present disclosure provides eukaryotic cells comprising the XDP system of any one of the foregoing embodiments, wherein the cell is a packaging cell.
- the eukaryotic cell is selected from the group consisting of HEK293 cells, Lenti-X 293T cells, BHK cells, HepG2, Saos-2, HuH7, NSO cells, SP2/0 cells, YO myeloma cells, A549 cells, P3X63 mouse myeloma cells, PER cells, PER.C6 cells, hybridoma cells, VERO, NIH3T3 cells, COS, WI38, MRC5, A549, HeLa cells, CHO cells, and HT1080 cells.
- the present disclosure provides methods of making an XDP comprising a therapeutic payload, the method comprising propagating the packaging cell of any of the embodiments under conditions such that XDPs are produced, and harvesting the XDPs produced by the packaging cell.
- the present disclosure further provides an XDP produced by the foregoing methods.
- the XDP comprises a therapeutic payload of an RNP of a CasX and guide RNA and, optionally, a donor template of any of the embodiments disclosed herein.
- the present disclosure provides methods of modifying a target nucleic acid sequence in a cell, the methods comprising contacting the cell with the XDP comprising an RNP of any of the embodiments disclosed herein, wherein said contacting comprises introducing into the cell the RNP comprising the CasX protein, the guide RNA, and, optionally, the donor template nucleic acid sequence, resulting in modification of the target nucleic acid sequence.
- the modification comprises introducing one or more single-stranded breaks in the target nucleic acid sequence.
- the modification comprises introducing one or more double-stranded breaks in the target nucleic acid sequence.
- the modification comprises insertion of the donor template into the target nucleic acid sequence.
- the cell is modified in vitro or ex vivo. In another embodiment, the cell is modified in vivo.
- the XDP is administered to a subject at a therapeutically effective dose, wherein the subject is the subject is selected from the group consisting of mouse, rat, pig, non-human primate, and human.
- the XDP is administered by a route of administration selected from the group consisting of subcutaneous, intradermal, intraneural, intranodal, intramedullary, intramuscular, intravenous, intracerebroventricular, intracisternal, intrathecal, intracranial, intralumbar, intratracheal, intraosseous, inhalatory, intracontralateral striatum, intraocular, intravitreal, intralymphatical, intraperitoneal routes and sub-retinal routes.
- a route of administration selected from the group consisting of subcutaneous, intradermal, intraneural, intranodal, intramedullary, intramuscular, intravenous, intracerebroventricular, intracisternal, intrathecal, intracranial, intralumbar, intratracheal, intraosseous, inhalatory, intracontralateral striatum, intraocular, intravitreal, intralymphatical, intraperitoneal routes and sub-retinal routes.
- the therapeutically effective dose is at least about 1 x 10 5 particles/kg, or at least about 1 x 10 6 particles/kg, or at least about 1 x 10 7 particles/kg, or at least about 1 x 10 8 particles/kg, or at least about 1 x 10 9 particles/kg, or at least about 1 x 10 10 particles/kg, or at least about 1 x 10 11 particles/kg, or at least about 1 x 10 12 particles/kg, or at least about 1 x 10 13 particles/kg, or at least about 1 x 10 14 particles/kg, or at least about 1 x 10 15 particles/kg, or at least about 1 x 10 16 particles/kg.
- the XDP is administered to the subject according to a treatment regimen comprising one or more consecutive doses using a therapeutically effective dose of the XDP.
- the therapeutically effective dose is administered to the subject as two or more doses over a period of at least two weeks, or at least one month, or at least two months, or at least three months, or at least four months, or at least five months, or at least six months, or once a year, or every 2 or 3 years.
- XDP particles and XDP systems, for use as a medicament for the treatment of a subject having a disease.
- FIG. 1 shows an SDS-PAGE gel of StX2 purification fractions visualized by colloidal Coomassie staining, as described in Example 1.
- FIG. 2 shows the chromatogram from a size exclusion chromatography assay of the StX2, using of Superdex 200 16/600 pg Gel Filtration, as described in Example 1.
- FIG. 3 shows an SDS-PAGE gel of StX2 purification fractions visualized by colloidal Coomassie staining, as described in Example 1.
- FIG. 4 is a schematic showing the organization of the components in the pSTX34 plasmid used to assemble the CasX constructs, as described in Example 2.
- FIG. 5 is a schematic showing the steps of generating the CasX 119 variant, as described in Example 2.
- FIG. 6 shows an SDS-PAGE gel of purification samples, visualized on a Bio-Rad Stain-FreeTM gel, as described in Example 2.
- FIG. 7 shows the chromatogram of Superdex 200 16/600 pg Gel Filtration, as described in Example 2.
- FIG. 8 shows an SDS-PAGE gel of gel filtration samples, stained with colloidal Coomassie, as described in Example 2.
- FIG. 9 shows an SDS-PAGE gel of purification samples of CasX 438, visualized on a Bio-Rad Stain-FreeTM gel, as described in Example 2.
- FIG. 10 shows the chromatogram from a size exclusion chromatography assay of the CasX 438, using of Superdex 200 16/600 pg gel filtration, as described in Example 2.
- FIG. 11 shows an SDS-PAGE gel of CasX 438 purification fractions visualized by colloidal Coomassie staining, as described in Example, as described in Example 2.
- FIG. 12 shows an SDS-PAGE gel of purification samples of CasX 457, visualized on a Bio-Rad Stain-FreeTM gel, as described in Example 2.
- FIG. 13 shows the chromatogram from a size exclusion chromatography assay of the CasX 457, using of Superdex 200 16/600 pg gel filtration, as described in Example 2.
- FIG. 14 shows an SDS-PAGE gel of CasX 457 purification fractions visualized by colloidal Coomassie staining, as described in Example 2.
- FIG. 15 is a graph of the results of an assay for the quantification of active fractions of RNP formed by sgRNA174 and the CasX variants, as described in Example 9. Equimolar amounts of RNP and target were co-incubated and the amount of cleaved target was determined at the indicated timepoints. Mean and standard deviation of three independent replicates are shown for each timepoint. The biphasic fit of the combined replicates is shown. "2" refers to the reference CasX protein of SEQ ID NO:2.
- FIG. 16 shows the quantification of active fractions of RNP formed by CasX2 (reference CasX protein of SEQ ID NO:2) and the modified sgRNAs, as described in Example 9.
- FIG. 17 shows the quantification of active fractions of RNP formed by CasX 491 and the modified sgRNAs under guide-limiting conditions, as described in Example 9. Equimolar amounts of RNP and target were co-incubated and the amount of cleaved target was determined at the indicated timepoints. The biphasic fit of the data is shown.
- FIG. 18 shows the quantification of cleavage rates of RNP formed by sgRNA174 and the CasX variants, as described in Example 9.
- Target DNA was incubated with a 20-fold excess of the indicated RNP and the amount of cleaved target was determined at the indicated time points. Mean and standard deviation of three independent replicates are shown for each timepoint, except for 488 and 491 where a single replicate is shown. The monophasic fit of the combined replicates is shown.
- FIG. 19 shows the quantification of cleavage rates of RNP formed by CasX2 and the sgRNA variants, as described in Example 9.
- Target DNA was incubated with a 20-fold excess of the indicated RNP and the amount of cleaved target was determined at the indicated time points. Mean and standard deviation of three independent replicates are shown for each timepoint. The monophasic fit of the combined replicates is shown.
- FIG. 20 shows the quantification of initial velocities of RNP formed by CasX2 and the sgRNA variants, as described in Example 9. The first two time-points of the previous cleavage experiment were fit with a linear model to determine the initial cleavage velocity.
- FIG. 21 shows the quantification of cleavage rates of RNP formed by CasX491 and the sgRNA variants, as described in Example 9.
- Target DNA was incubated with a 20-fold excess of the indicated RNP at 10°C and the amount of cleaved target was determined at the indicated time points. The monophasic fit of the timepoints is shown.
- FIGS. 22A-D shows the quantification of cleavage rates of CasX variants on NTC PAMs, as described in Example 10.
- Target DNA substrates with identical spacers and the indicated PAM sequence were incubated with a 20-fold excess of the indicated RNP at 37°C and the amount of cleaved target was determined at the indicated time points. Monophasic fit of a single replicate is shown.
- FIG. 22A shows the results for sequences having a TTC PAM.
- FIG. 22B shows the results for sequences having a CTC PAM.
- FIG. 22C shows the results for sequences having a GTC PAM.
- FIG. 22D shows the results for sequences having a ATC PAM.
- FIG. 23 depicts the plasmids utilized in the creation of XDP comprising CasX, gNA, and pseudotyping proteins, as described in Example 13.
- FIG. 24 is a schematic of the steps using in the creation of XDP, as described in Example 13.
- FIG. 25 is a graph of the results of the editing of the dtTomato assay, as described in Example 16.
- FIG. 26A shows the results of percentage editing in mouse tdTomato neural progenitor cells (NPCs) with XDPs pseudotyped with serial concentrations of VSV-G, as described in Example 17.
- FIG. 26B shows the XDP titers determined by a commercially available Lenti-X p24 ELISA kit, as described in Example 17.
- FIG. 27 shows the percentage of editing in mouse tdTomato NPCs with XDPs pseudotyped with different glycoproteins, as described in Example 17.
- FIG. 28A shows the results of size distributions and viral titer comparisons of VSV-G pseudotyped XDP (both IX and 10X concentrated), rabies pseudotyped XDP and lentivirus (LV), as described in Example 17.
- FIG. 28B shows the size comparisons between VSV-G XDP, LV and Rabies XDP, as described in Example 17.
- FIG. 29 shows the results of percentage editing in mouse tdTomato NPCs with VSV-G pseudotyped XDPs carrying different RNPs, as described in Example 18.
- FIG. 30 shows the percentage editing in mouse tdTomato NPCs with VSV-G pseudotyped XDPs with titrated amounts of Gag-Pol vs Gag-Stx (Stx construct), as described in Example 19.
- FIG. 31 shows the titers for these different XDPs with varying amounts of Gag-Pol vs Gag-Stx constructs, as described in Example 19.
- FIG. 32 shows the amount of guide RNA per XDP titer for different constructs as assessed by QPCR, as described in Example 19.
- FIG. 33 shows the results of the relative knockout rates of B2M by XDPs containing two different B2M targeting spacers and one non targeting spacer, as described in Example 20.
- FIG. 34 shows representative SDS-PAGE and Western blot images of samples taken from throughout the centrifugation purification process for XDP particles, as described in Example 14.
- FIG. 35 shows the results of an editing assay for XDP configured as version 7, version 122 and version 123, as described in Example 21.
- FIG. 36A shows the schematic for the configuration of the components for version 1 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 36B shows the schematic for the configuration of the components for version 2 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 37A shows the schematic for the configuration of the components for version 3 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 37B shows the schematic for the configuration of the components for version 4 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 38A shows the schematic for the configuration of the components for version 5 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 38B shows the schematic for the configuration of the components for version 6 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 39A shows the schematic for the configuration of the components for version 7 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 39B shows the schematic for the configuration of the components for version 8 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 40 A shows the schematic for the configuration of the components for version 9 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 40B shows the schematic for the configuration of the components for version 10 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 41 A shows the schematic for the configuration of the components for version 11 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 41B shows the schematic for the configuration of the components for version 12 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 42A shows the schematic for the configuration of the components for version 13 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 42B shows the schematic for the configuration of the components for version 14 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 43 A shows the schematic for the configuration of the components for version 15 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 43B shows the schematic for the configuration of the components for version 16 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 44A shows the schematic for the configuration of the components for version 24 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 44B shows the schematic for the configuration of the components for version 25 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 45A shows the schematic for the configuration of the components for version 26 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 45B shows the schematic for the configuration of the components for version 27 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 46A shows the schematic for the configuration of the components for version 31 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 46B shows the schematic for the configuration of the components for version 32 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 47 A shows the schematic for the configuration of the components for version 33 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 47B shows the schematic for the configuration of the components for version 34 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 48 A shows the schematic for the configuration of the components for version 35 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 48B shows the schematic for the configuration of the components for version 36 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 49 A shows the schematic for the configuration of the components for version 37 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 49B shows the schematic for the configuration of the components for version 38 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 50A shows the schematic for the configuration of the components for version 39 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 50B shows the schematic for the configuration of the components for version 40 XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 51 A shows the schematic for the configuration of the components for version 17 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 5 IB shows the schematic for the configuration of the components for version 18 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 52A shows the schematic for the configuration of the components for versions 44 and 45 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 52B shows the schematic for the configuration of the components for versions 46, 47, 62, and 90 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 53 A shows the schematic for the configuration of the components for versions 48, 49, and 63 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 53B shows the schematic for the configuration of the components for version 50 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 54A shows the schematic for the configuration of the components for versions 51 and 52 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 54B shows the schematic for the configuration of the components for versions 53, 54, 55 and 91 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 55A shows the schematic for the configuration of the components for versions 56- 61 and 92 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 55B shows the schematic for the configuration of the components for versions 66a and 67a XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 56A shows the schematic for the configuration of the components for versions 66b and 67b XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 56B shows the schematic for the configuration of the components for versions 68a, 69a, 70a and 87a XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 57A shows the schematic for the configuration of the components for versions 68b, 69b, 70b and 87b XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 57B shows the schematic for the configuration of the components for versions 71a, 72a and 88a XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 58A shows the schematic for the configuration of the components for versions 71b, 72b and 88b XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 58B shows the schematic for the configuration of the components for versions 73a XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 59A shows the schematic for the configuration of the components for version 73b XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 59B shows the schematic for the configuration of the components for versions 74a and 75a XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 60A shows the schematic for the configuration of the components for versions 74b and 75b XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 60B shows the schematic for the configuration of the components for versions 76a, 77a, 78a, and 79a XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 61 A shows the schematic for the configuration of the components for versions 76b, 77b, 78b, and 79b XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 6 IB shows the schematic for the configuration of the components for versions 80a, 81a, 82a, 83a, 84a, 85a and 86a XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 62A shows the schematic for the configuration of the components for versions 80b, 81b, 82b, 83b, 84b, 85b, and 86b XDP and the four plasmids used in the transfection to create the XDP.
- FIG. 62B shows the schematic for the configuration of the components for versions 102 and 114 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 63 A shows the schematic for the configuration of the components for versions
- FIG. 63B shows the schematic for the configuration of the components for versions
- FIG. 64A shows the schematic for the configuration of the components for versions 106, 111, 112, 83b and 113 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 64B shows the schematic for the configuration of the components for versions 107 and 110 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 65 shows the schematic for the configuration of the components for version 118 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 66A shows the schematic for the configuration of the components for version 122 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 66B shows the schematic for the configuration of the components for version 103 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 67A shows the schematic for the configuration of the components for version 124 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 67B shows the schematic for the configuration of the components for version 126 XDP and the three plasmids used in the transfection to create the XDP.
- FIG. 68 shows the schematic for the configuration of the components for versions 128 XDP and the three plasmids used in the transfection to create the XDP.
- FIGS. 69A and 69B show the results of editing assays of the various XDP versions, as described in Example 22.
- FIG. 70 shows the results of editing assays of the various XDP versions, as described in Example 22.
- FIGS. 71 A and 71B shows the results of editing assays of the various XDP versions, as described in Example 23.
- FIG. 72 shows the results of editing assays of the various XDP versions, as described in Example 23.
- FIGS. 73 A and 73B shows the results of editing assays of the various XDP versions, as described in Example 23.
- FIG. 74 shows the results of editing assays of the various XDP versions, as described in Example 23.
- FIGS. 75A and 75B shows the results of editing assays of the various XDP versions, as described in Example 25.
- FIG. 76 shows the results of editing assays of the various XDP versions, as described in Example 25.
- FIG. 77 shows the results of editing assays of the various XDP versions, as described in Example 26.
- FIG. 78 shows the results of editing assays of the various XDP versions, as described in Example 26. DETAILED DESCRIPTION
- polynucleotide and “nucleic acid,” used interchangeably herein, refer to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides.
- polynucleotide and “nucleic acid” encompass single-stranded DNA; double- stranded DNA; multi -stranded DNA; single-stranded RNA; double-stranded RNA; multi- stranded RNA; genomic DNA; cDNA; DNA-RNA hybrids; and a polymer comprising purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural, or derivatized nucleotide bases.
- Hybridizable or “complementary” are used interchangeably to mean that a nucleic acid (e.g., RNA, DNA) comprises a sequence of nucleotides that enables it to non-covalently bind, i.e., form Watson-Crick base pairs and/or G/U base pairs, “anneal”, or “hybridize,” to another nucleic acid in a sequence-specific, antiparallel, manner (i.e., a nucleic acid specifically binds to a complementary nucleic acid) under the appropriate in vitro and/or in vivo conditions of temperature and solution ionic strength.
- a nucleic acid e.g., RNA, DNA
- anneal i.e., antiparallel
- sequence of a polynucleotide need not be 100% complementary to that of its target nucleic acid to be specifically hybridizable; it can have at least about 70%, at least about 80%, or at least about 90%, or at least about 95% sequence identity and still hybridize to the target nucleic acid.
- a polynucleotide may hybridize over one or more segments such that intervening or adjacent segments are not involved in the hybridization event (e.g., a loop structure or hairpin structure, a 'bulge', ‘bubble’ and the like).
- a gene may include regulatory element sequences including, but not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions.
- Coding sequences encode a gene product upon transcription or transcription and translation; the coding sequences of the disclosure may comprise fragments and need not contain a full-length open reading frame.
- a gene can include both the strand that is transcribed as well as the complementary strand containing the anticodons.
- downstream refers to a nucleotide sequence that is located 3' to a reference nucleotide sequence.
- downstream nucleotide sequences relate to sequences that follow the starting point of transcription. For example, the translation initiation codon of a gene is located downstream of the start site of transcription.
- upstream refers to a nucleotide sequence that is located 5' to a reference nucleotide sequence.
- upstream nucleotide sequences relate to sequences that are located on the 5' side of a coding region or starting point of transcription. For example, most promoters are located upstream of the start site of transcription.
- regulatory element is used interchangeably herein with the term “regulatory sequence,” and is intended to include promoters, enhancers, and other expression regulatory elements (e.g. transcription termination signals, such as polyadenylation signals and poly-U sequences).
- regulatory elements include a transcription promoter such as, but not limited to, CMV, CMV+intron A, SV40, RSV, HIV-Ltr, elongation factor 1 alpha (EFla), MMLV-ltr, internal ribosome entry site (IRES) or P2A peptide to permit translation of multiple genes from a single transcript, metallothionein, a transcription enhancer element, a transcription termination signal, polyadenylation sequences, sequences for optimization of initiation of translation, and translation termination sequences.
- regulatory elements include exonic splicing enhancers.
- the choice of the appropriate regulatory element will depend on the encoded component to be expressed (e.g., protein or RNA) or whether the nucleic acid comprises multiple components that require different polymerases or are not intended to be expressed as a fusion protein.
- promoter refers to a DNA sequence that contains an RNA polymerase binding site, transcription start site, TATA box, and/or B recognition element and assists or promotes the transcription and expression of an associated transcribable polynucleotide sequence and/or gene (or transgene).
- a promoter can be synthetically produced or can be derived from a known or naturally occurring promoter sequence or another promoter sequence.
- a promoter can be proximal or distal to the gene to be transcribed.
- a promoter can also include a chimeric promoter comprising a combination of two or more heterologous sequences to confer certain properties.
- a promoter of the present disclosure can include variants of promoter sequences that are similar in composition, but not identical to, other promoter sequence(s) known or provided herein.
- a promoter can be classified according to criteria relating to the pattern of expression of an associated coding or transcribable sequence or gene operably linked to the promoter, such as constitutive, developmental, tissue-specific, inducible, etc.
- Enhancers refers to regulatory DNA sequences that, when bound by specific proteins called transcription factors, regulate the expression of an associated gene. Enhancers may be located in the intron of the gene, or 5’ or 3’ of the coding sequence of the gene. Enhancers may be proximal to the gene (i.e., within a few tens or hundreds of base pairs (bp) of the promoter), or may be located distal to the gene (i.e., thousands of bp, hundreds of thousands of bp, or even millions of bp away from the promoter). A single gene may be regulated by more than one enhancer, all of which are envisaged as within the scope of the instant disclosure.
- Recombinant means that a particular nucleic acid (DNA or RNA) is the product of various combinations of cloning, restriction, and/or ligation steps resulting in a construct having a structural coding or non-coding sequence distinguishable from endogenous nucleic acids found in natural systems.
- DNA sequences encoding the structural coding sequence can be assembled from cDNA fragments and short oligonucleotide linkers, or from a series of synthetic oligonucleotides, to provide a synthetic nucleic acid which is capable of being expressed from a recombinant transcriptional unit contained in a cell or in a cell-free transcription and translation system.
- sequences can be provided in the form of an open reading frame uninterrupted by internal non-translated sequences, or introns, which are typically present in eukaryotic genes.
- Genomic DNA comprising the relevant sequences can also be used in the formation of a recombinant gene or transcriptional unit. Sequences of non-translated DNA may be present 5’ or 3’ from the open reading frame, where such sequences do not interfere with manipulation or expression of the coding regions, and may indeed act to modulate production of a desired product by various mechanisms (see “enhancers” and “promoters”, above).
- recombinant polynucleotide or “recombinant nucleic acid” refers to one which is not naturally occurring, e.g., is made by the artificial combination of two otherwise separated segments of sequence through human intervention.
- This artificial combination is often accomplished by either chemical synthesis means, or by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques. Such is usually done to replace a codon with a redundant codon encoding the same or a conservative amino acid, while typically introducing or removing a sequence recognition site. Alternatively, it is performed to join together nucleic acid segments of desired functions to generate a desired combination of functions.
- This artificial combination is often accomplished by either chemical synthesis means, or by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques.
- recombinant polypeptide or “recombinant protein” refers to a polypeptide or protein which is not naturally occurring, e.g., is made by the artificial combination of two otherwise separated segments of amino sequence through human intervention.
- a protein that comprises a heterologous amino acid sequence is recombinant.
- contacting means establishing a physical connection between two or more entities.
- contacting a target nucleic acid with a guide nucleic acid means that the target nucleic acid and the guide nucleic acid are made to share a physical connection; e.g., can hybridize if the sequences share sequence similarity.
- modifying includes but is not limited to cleaving, nicking, editing, deleting, knocking in, knocking out, and the like.
- knock-out refers to the elimination of a gene or the expression of a gene.
- a gene can be knocked out by either a deletion or an addition of a nucleotide sequence that leads to a disruption of the reading frame.
- a gene may be knocked out by replacing a part of the gene with an irrelevant sequence.
- knock-down refers to reduction in the expression of a gene or its gene product(s). As a result of a gene knock-down, the protein activity or function may be attenuated or the protein levels may be reduced or eliminated.
- HDR homology-directed repair
- This process requires nucleotide sequence homology, and uses a donor template to repair or knock-out a target DNA, and leads to the transfer of genetic information from the donor to the target.
- Homology-directed repair can result in an alteration of the sequence of the target sequence by insertion, deletion, or mutation if the donor template differs from the target DNA sequence and part or all of the sequence of the donor template is incorporated into the target DNA.
- non-homologous end joining refers to the repair of double strand breaks in DNA by direct ligation of the break ends to one another without the need for a homologous template (in contrast to homology-directed repair, which requires a homologous sequence to guide repair). NHEJ often results in the loss (deletion) of nucleotide sequence near the site of the double- strand break.
- micro-homology mediated end joining refers to a mutagenic DSB repair mechanism, which always associates with deletions flanking the break sites without the need for a homologous template (in contrast to homology-directed repair, which requires a homologous sequence to guide repair). MMEJ often results in the loss (deletion) of nucleotide sequence near the site of the double- strand break.
- a polynucleotide or polypeptide has a certain percent "sequence similarity" or “sequence identity” to another polynucleotide or polypeptide, meaning that, when aligned, that percentage of bases or amino acids are the same, and in the same relative position, when comparing the two sequences.
- Sequence similarity (sometimes referred to as percent similarity, percent identity, or homology) can be determined in a number of different manners. To determine sequence similarity, sequences can be aligned using the methods and computer programs that are known in the art, including BLAST, available over the world wide web at ncbi.nlm.nih.gov/BLAST.
- Percent complementarity between particular stretches of nucleic acid sequences within nucleic acids can be determined using any convenient method.
- Example methods include BLAST programs (basic local alignment search tools) and PowerBLAST programs (Altschul et al., J. Mol. Biol., 1990, 215, 403-410; Zhang and Madden, Genome Res., 1997, 7, 649-656) or by using the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, Madison Wis.), e.g., using default settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 482-489).
- polypeptide and “protein” are used interchangeably herein, and refer to a polymeric form of amino acids of any length, which can include coded and non-coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified peptide backbones.
- the term includes fusion proteins, including, but not limited to, fusion proteins with a heterologous amino acid sequence.
- a “vector” or “expression vector” is a replicon, such as plasmid, phage, virus, or cosmid, to which another DNA segment, i.e., an “insert”, may be attached so as to bring about the replication or expression of the attached segment in a cell.
- nucleic acid refers to a nucleic acid, polypeptide, cell, or organism that is found in nature.
- a “mutation” refers to an insertion, deletion, substitution, duplication, or inversion of one or more amino acids or nucleotides as compared to a wild-type or reference amino acid sequence or to a wild-type or reference nucleotide sequence.
- isolated is meant to describe a polynucleotide, a polypeptide, or a cell that is in an environment different from that in which the polynucleotide, the polypeptide, or the cell naturally occurs.
- An isolated genetically modified host cell may be present in a mixed population of genetically modified host cells.
- a “host cell,” as used herein, denotes a eukaryotic cell, a prokaryotic cell, or a cell from a multicellular organism (e.g., a cell line) cultured as a unicellular entity, which eukaryotic or prokaryotic cells are used as recipients for a nucleic acid (e.g., an expression vector), and include the progeny of the original cell which has been genetically modified by the nucleic acid. It is understood that the progeny of a single cell may not necessarily be completely identical in morphology or in genomic or total DNA complement as the original parent, due to natural, accidental, or deliberate mutation.
- a “recombinant host cell” (also referred to as a “genetically modified host cell”) is a host cell into which has been introduced a heterologous nucleic acid, e.g., an expression vector.
- tropism refers to preferential entry of the XDP into certain cell or tissue type(s) and/or preferential interaction with the cell surface that facilitates entry into certain cell or tissue types, optionally and preferably followed by expression (e.g., transcription and, optionally, translation) of sequences carried by the XDP into the cell.
- HIV can be pseudotyped with vesicular stomatitis virus G-protein (VSV-G) envelope proteins (amongst others, described herein, below), which allows HIV to infect a wider range of cells because HIV envelope proteins target the virus mainly to CD4+ presenting cells.
- VSV-G vesicular stomatitis virus G-protein
- tropism factor refers to components integrated into the surface of an XDP that provides tropism for a certain cell or tissue type.
- Non-limiting examples of tropism factors include glycoproteins, antibody fragments (e.g., scFv, nanobodies, linear antibodies, etc.), receptors and ligands to target cell markers.
- a “target cell marker” refers to a molecule expressed by a target cell including but not limited to cell-surface receptors, cytokine receptors, antigens, tumor-associated antigens, glycoproteins, oligonucleotides, enzymatic substrates, antigenic determinants, or binding sites that may be present in the on the surface of a target tissue or cell that may serve as ligands for a tropism factor.
- an “antibody fragment” refers to a molecule other than an intact antibody that comprises a portion of an intact antibody and that binds the antigen to which the intact antibody binds.
- antibody fragments include but are not limited to Fv, Fab, Fab', Fab'-SH, F(ab')2, diabodies, single chain diabodies, linear antibodies, a single domain antibody, a single domain camelid antibody, single-chain variable fragment (scFv) antibody molecules, and multispecific antibodies formed from antibody fragments.
- a group of amino acids having aliphatic side chains consists of glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains consists of serine and threonine; a group of amino acids having amide-containing side chains consists of asparagine and glutamine; a group of amino acids having aromatic side chains consists of phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains consists of lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains consists of cysteine and methionine.
- Exemplary conservative amino acid substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine,
- treatment or “treating,” are used interchangeably herein and refer to an approach for obtaining beneficial or desired results, including but not limited to a therapeutic benefit and/or a prophylactic benefit.
- therapeutic benefit is meant eradication or amelioration of the underlying disorder or disease being treated.
- a therapeutic benefit can also be achieved with the eradication or amelioration of one or more of the symptoms or an improvement in one or more clinical parameters associated with the underlying disease such that an improvement is observed in the subject, notwithstanding that the subject may still be afflicted with the underlying disorder.
- terapéuticaally effective amount refers to an amount of a drug or a biologic, alone or as a part of a composition, that is capable of having any detectable, beneficial effect on any symptom, aspect, measured parameter or characteristics of a disease state or condition when administered in one or repeated doses to a subject such as a human or an experimental animal. Such effect need not be absolute to be beneficial.
- administering means a method of giving a dosage of a compound (e.g., a composition of the disclosure) or a composition (e.g., a pharmaceutical composition) to a subject.
- a “subject” is a mammal. Mammals include, but are not limited to, domesticated animals, non-human primates, humans, dogs, rabbits, mice, rats and other rodents.
- the present disclosure relates to particle delivery systems (XDP) designed to self-assemble particles comprising therapeutic payloads wherein the particles are designed for selective delivery to targeted cells.
- XDP particle delivery systems
- the term “XDP” refers to a non replicating, self-assembling, non-naturally occurring multicomponent structure composed of one or more viral proteins, polyproteins, virally-derived peptides or polypeptides, such as, but not limited to, capsid, coat, shell, as well as tropism factors such as envelope glycoproteins derived from viruses, antibody fragments, receptors or ligand utilized for tropism to direct the XDP to target cells or tissues, with a lipid layer (derived from the host cell), wherein the XDP are capable of self-assembly in a host cell and encapsidating or encompassing a therapeutic payload.
- the XDP of present disclosure can be utilized to specifically and selectively deliver therapeutic payloads to target cells or tissues.
- the XDP of the disclosure have utility in a variety of methods, including, but not limited to, use in delivering a therapeutic in a selective fashion to a target cell or organ for the treatment of a disease.
- the present disclosure provides XDP systems comprising one or more nucleic acids comprising sequences encoding the components of the XDP, the therapeutic payload, and tropism factors that, that, when introduced into an appropriate eukaryotic host cell, result in the expression of the individual XDP structural components, processing proteins, therapeutic payloads, and tropism factors that self-assemble into XDP particles that encapsidate the therapeutic payload, and that can be collected and purified for the methods and uses described herein.
- the therapeutic payloads packaged within the XDP comprise therapeutic proteins, described more fully below.
- the therapeutic payloads packaged within XDP comprise therapeutic nucleic acids or nucleic acids that encode therapeutic proteins.
- the XDP comprise therapeutic proteins and nucleic acids.
- the therapeutic payloads include gene editing systems such as CRISPR nucleases and guide RNA or zinc finger proteins useful for the editing of nucleic acids in target cells.
- the therapeutic payloads include Class 2 CRISPR-Cas systems.
- Class 2 systems are distinguished from Class 1 systems in that they have a single multi- domain effector protein and are further divided into a Type II, Type V, or Type VI system, described in Makarova, et al. Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants. Nature Rev. Microbiol. 18:67 (2020), incorporated herein by reference.
- the nucleases include Class 2, Type II CRISPR/Cas effector polypeptides such as Cas9.
- the nucleases include Class 2, Type V CRISPR/Cas effector polypeptides such as a Casl2a, Casl2b, Casl2c, Casl2d (CasY), Casl2J, and CasX wherein the CRISPR nuclease and guide system can do one or more of the following: (i) modify (e.g., edit) a target ssDNA, dsDNA or RNA (e.g., cleave, nick, or methylate); (ii) modulate transcription of the target nucleic acid; (iii) bind the target nucleic acid (e.g., for purposes of isolation, blocking transcription, labeling, or imaging, etc.); or (v) modify a polypeptide associated with a target nucleic acid.
- modify e.g., edit
- RNA e.g., cleave, nick, or methylate
- the present disclosure provides XDP compositions, and methods to make the XDP compositions, designed to package ribonucleic acid particles (RNP) comprising CasX and guide RNA systems (CasX:gNA system) useful for the editing of nucleic acids in target cells, described more fully, below.
- RNP ribonucleic acid particles
- CasX:gNA system CasX:gNA system
- the present disclosure provides XDP compositions, nucleic acids that encode the components of the XDP (both structural as well as gene-editing components), as well as methods of making and using the XDP.
- the nucleic acids, the components of the compositions, and the methods of making and using them, are described herein, below. a. XDP Components
- XDP can be created in multiple forms and configurations (see, e.g., FIGS. 36-68) utilizing components derived from various sources and in different combinations.
- the structural components of the XDP of the present disclosure are derived from members of the Retroviridae family of viruses, described more fully, below.
- the major structural component of retroviruses is the polyprotein Gag, which also typically contain protease cleavage sites that, upon action by the viral protease, processes the Gag into subcomponents that, in the case of the replication of the source virus, then self-assemble in the host cell to make the core inner shell of the virus.
- the expression of Gag alone is sufficient to mediate the assembly and release of virus-like particles (VLPs) from host cells.
- VLPs virus-like particles
- Gag proteins from all retroviruses contain an N-terminal membrane-binding matrix (MA) domain, a capsid (CA) domain (with two subdomains), and a nucleocapsid (NC) domain that are structurally similar across retroviral genera but differ greatly in sequence. Outside these core domains, Gag proteins vary among retroviruses, and other linkers and domains may be present (Shur, F., et al. The Structure of Immature Virus-Like Rous Sarcoma Virus Gag Particles Reveals a Structural Role for the plO Domain in Assembly. J Virol. 89(20): 10294 (2015)).
- MA N-terminal membrane-binding matrix
- CA capsid
- NC nucleocapsid
- the assembly pathway of Gag into immature particles in the host cell is mediated by interactions between MA (which is responsible for targeting Gag polyprotein to the plasma membrane), between NC and RNA, and between CA domains (which, in the context of the present disclosure, assemble into the XDP capsid).
- MA which is responsible for targeting Gag polyprotein to the plasma membrane
- NC and RNA which, in the context of the present disclosure, assemble into the XDP capsid.
- CA domains which, in the context of the present disclosure, assemble into the XDP capsid.
- assembly takes place on the plasma membrane, but for betaretroviruses the particles are assembled in the cytoplasm and then transported to the plasma membrane.
- concomitant with, or shortly after, particle release, cleavage of Gag by the viral protease (PR) gives rise to separate MA, CA, and NC proteins, inducing a rearrangement of the internal viral structure, with CA forming the shell of the mature viral core.
- PR viral protease
- the present disclosure provides XDP comprising one or more structural components derived from a Retroviridae virus, a therapeutic payload (described more fully, below), and a tropism factor (described more fully, below).
- the virus structural components are derived from a Orthoretrovirinae virus.
- the Orthoretrovirinae virus is an Alpharetrovirus, a Betaretrovirus, a Deltaretrovirus, an Epsilonretrovirus , a Gammaretrovirus or a Lentivirus.
- the virus structural components are derived from a Spumaretrovirinae virus.
- the Spumaretrovirinae virus is a B ovaspumavirus, an Equispumavirus , a Felispumavirus , a Prosimiispumavirus or a Simiispumavirus .
- Retroviridae family of viruses have different subfamilies, including Orthoretrovirinae, Spumaretrovirinae , and unclassified Retroviridae .
- Many retroviruses cause serious diseases in humans, other mammals, and birds.
- Human retroviruses include Human Immunodeficiency Virus 1 (HIV-1) and HIV-2, the cause of the disease AIDS, and human T- lymphotropic virus (HTLV) also cause disease in humans.
- the subfamily Orthoretrovirinae include the genera Alpharetrovirus, Betaretrovirus, Deltaretrovirus, Epsilonretrovirus, Gammaretrovirus , and Lentivirus.
- Betaretrovirus examples include mouse mammary tumor virus, Mason-Pfizer monkey virus, and enzootic nasal tumor virus.
- Deltaretrovirus examples include the bovine leukemia virus and the human T-lymphotropic viruses.
- Epsilonretrovirus include Walleye dermal sarcoma virus, and Walleye epidermal hyperplasia virus 1 and 2.
- Gammaretrovirus include murine leukemia virus, Maloney murine leukemia virus, and feline leukemia virus, as well as viruses that infect other animal species.
- Lentivirus is a genus of retroviruses that cause chronic and deadly diseases, including HIV-1 and HIV-2, the cause of the disease AIDS, and also includes Simian immunodeficiency virus.
- the subfamily Spumaretrovirinae include the genera Bovispumavirus, Equispumavirus, Felispumavirus, Prosimiispumavirus,
- Retroviridae Simiispumavirus, and Spumavirus.
- Members of the Retroviridae have provided valuable research tools in molecular biology, and, in the context of the present disclosure, have been used in the generation of XDP for delivery systems. It has been discovered that the retroviral-derived structural components of XDP can be derived from each of the genera of Retroviridae , and that the resulting XDP are capable self-assembly in a host cell and encapsidating (or encompassing) therapeutic payloads that have utility in the targeted and selective delivery of the therapeutic payloads to target cells and tissues.
- the XDP retroviral components are derived from Alpharetrovirus , including but not limited to avian leukosis virus (ALV) and Rous sarcoma virus (RSV).
- the present disclosure provides XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a p2A spacer peptide; ap2B spacer peptide; a plO spacer peptide; a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), p2A, p2B, plO, a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-Pol polyprotein; a Gag-transframe region-Pol protease polyprotein; a cleavage site
- Gag components e.g., MA, CA, p2A, p2B, plO, and NC
- the cleavage site and protease are derived from an Alpharetrovirus , including but not limited to Avian leukosis virus and Rous sarcoma virus.
- the encoding sequences for these components are provided in Table 5, and the methods to create the encoding plasmids and produce the XDP in host cells are described herein, below.
- the XDP comprises one or mor Q Alpharetrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 192, 193, 195, 196, 198-201, 782, and 234 as set forth in Table 5, or a sequence having at least 80%, at least 90%, at least 95%, at least 95%, at least 97%, at least 98%, or at least 99% identity thereto.
- the XDP comprises one or more Alpharetrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 192, 193, 195, 196, 198-201, 782, 234 as set forth in Table 5.
- the XDP having Alpharetrovirus components can be designed in various configurations, including the configurations of FIGS. 36-68, and may be encoded by one, two, three or four nucleic acids, described more fully, below.
- the XDP comprise a subset of the components listed supra , such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads. These alternative configurations are described more fully, below, as well as in the Examples.
- the therapeutic payload is an RNP of a complexed CasX and gNA embodiment described herein, while the tropism factor is a viral glycoprotein embodiment described herein.
- the XDP viral components are derived from Betaretrovirus, including but not limited to mouse mammary tumor virus (MMTV), Mason-Pfizer monkey virus (MPMV), and enzootic nasal tumor virus (ENTV).
- the present disclosure provides XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a pp21/24 spacer peptide; a p3-p8/pl2 spacer peptide; a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), pp21/24, p3-p8/pl2, a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-Pol polyprotein; a Gag-transframe region- Pol protease polyprotein;
- Gag components e.g., MA, CA, pp2124 spacer, p3-p8/pl2 spacer, andNC
- the cleavage site and protease are derived from an Betaretrovirus , including but not limited to mouse mammary tumor virus, Mason-Pfizer monkey virus, and enzootic nasal tumor virus.
- the encoding sequences for these components are provided in Table 5, and the methods to create the encoding plasmids and produce the XDP in host cells are described herein, below.
- the XDP comprises one or more Betaretrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 235-257 as set forth in Table 5, or a sequence having at least 80%, at least 90%, at least 95%, at least 95%, at least 97%, at least 98%, or at least 99% identity thereto.
- the XDP comprises one or more Betaretrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 235-257 as set forth in Table 5.
- the XDP having Betaretrovirus components can be designed in various configurations, including the configurations of FIGS. 36-68, and may be encoded by one, two, three or four nucleic acids, described more fully, below.
- the XDP comprise a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- the therapeutic payload is an RNP of a complexed CasX and gNA embodiment described herein, while the tropism factor is a viral glycoprotein embodiment described herein.
- the XDP viral components are derived from Deltaretrovirus, including but not limited to bovine leukemia virus (BLV) and the human T-lymphotropic viruses (HTLV1).
- BLV bovine leukemia virus
- HTLV1 human T-lymphotropic viruses
- the present disclosure provides XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-Pol polyprotein; a Gag-transframe region-Pol protease polyprotein; a cleavage site(s); and a non-retroviral, heterologous protease capable of cleaving the protease cleavage sites
- Gag components e.g., MA, CA, and NC
- the cleavage site and protease are derived from an Deltaretrovirus , including but not limited to bovine leukemia virus and the human T- lymphotropic viruses.
- the encoding sequences for these components are provided in Table 5, and the methods to create the encoding plasmids and produce the XDP in host cells are described herein, below.
- the XDP comprises one or mor Q Deltaretrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 258-272 as set forth in Table 5, or a sequence having at least 80%, at least 90%, at least 95%, at least 95%, at least 97%, at least 98%, or at least 99% identity thereto.
- the XDP comprises one or more Deltaretrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 258-272 as set forth in Table 5.
- the XDP having Deltaretrovirus components can be designed in various configurations, including the configurations of FIGS. 36-68, and may be encoded by one, two, three or four nucleic acids, described more fully, below.
- the XDP comprise a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- the therapeutic payload is an RNP of a complexed CasX and gNA embodiment described herein, while the tropism factor is a viral glycoprotein embodiment described herein.
- the XDP viral components are derived from Epsilonretrovirus , including but not limited to Walleye dermal sarcoma virus (WDSV), and Walleye epidermal hyperplasia virus 1 and 2.
- the present disclosure provides XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a p20 spacer peptide; a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), p20, a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-Pol polyprotein; a Gag-transframe region-Pol protease polyprotein; a cleavage site(s); and a non-retroviral, heterologous protease capable of cleaving the group consisting of: a matrix polypeptide
- Gag components e.g., MA, CA, p20, and NC
- the cleavage site and protease are derived from an Epsilonretrovirus , including but not limited to Walleye dermal sarcoma virus, and Walleye epidermal hyperplasia virus 1 and 2.
- Epsilonretrovirus including but not limited to Walleye dermal sarcoma virus, and Walleye epidermal hyperplasia virus 1 and 2.
- the encoding sequences for these components are provided in Table 5, and the methods to create the encoding plasmids and produce the XDP in host cells are described herein, below.
- the XDP comprises one or more Epsilonretrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 273-277 as set forth in Table 5, or a sequence having at least 80%, at least 90%, at least 95%, at least 95%, at least 97%, at least 98%, or at least 99% identity thereto.
- the XDP comprises one or more Epsilonretrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 273-277 as set forth in Table 5.
- the XDP having Epsilonretrovirus components can be designed in various configurations, including the configurations of FIGS.
- the XDP comprise a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads. These alternative configurations are described more fully, below, as well as in the Examples.
- the therapeutic payload is an RNP of a complexed CasX and gNA embodiment described herein, while the tropism factor is a viral glycoprotein embodiment described herein.
- the XDP viral components are derived from Gammaretrovirus, including but not limited to murine leukemia virus (MLV), Maloney murine leukemia virus (MMLV), and feline leukemia virus (FLV).
- the present disclosure provides XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a ppl2 spacer peptide; a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a ppl2 spacer, a capsid polypeptide (CA), a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-Pol polyprotein; a Gag-transframe region-Pol protease polyprotein; a cleavage site(s); and a non-retroviral, heterologous proteas
- Gag components e.g., MA, ppl2, CA, and NC
- the cleavage site and protease are derived from an Gammaretrovirus , including but not limited to Walleye dermal sarcoma virus, and Walleye epidermal hyperplasia virus 1 and 2.
- the encoding sequences for these components are provided in Table 5, and the methods to create the encoding plasmids and produce the XDP in host cells are described herein, below.
- the XDP comprises one or more Gammaretrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 278-287 as set forth in Table 5, or a sequence having at least 80%, at least 90%, at least 95%, at least 95%, at least 97%, at least 98%, or at least 99% identity thereto.
- the XDP comprises one or more Gammaretrovirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 278-287 as set forth in Table 5.
- the XDP having Gammaretrovirus components can be designed in various configurations, including the configurations of FIGS. 36-68, and may be encoded by one, two, three or four nucleic acids, described more fully, below.
- the XDP comprise a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- the therapeutic payload is an RNP of a complexed CasX and gNA embodiment described herein, while the tropism factor is a viral glycoprotein embodiment described herein.
- the XDP viral components are derived from Lentivirus , including but not limited to HIV-1 and HIV-2, and Simian immunodeficiency virus (SIV).
- the present disclosure provides XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a capsid (CA), a p2 spacer peptide, a nucleocapsid (NC), a pl/p6 spacer peptide; ); a Gag polyprotein comprising a matrix polypeptide (MA), CA, p2, NC, and pl/p6; a therapeutic payload; a tropism factor; a Gag-Pol polyprotein; a Gag-transframe region-Pol protease polyprotein; a cleavage site(s); and a non-retroviral, heterologous protease capable of cleaving the protease cleavage sites.
- MA matrix polypeptide
- CA capsid
- NC nucleocaps
- Gag components e.g., MA, CA, NC, and pl/p6
- the cleavage site and protease are derived from an Lentivirus , including but not limited to HIV-1, HIV-2, and Simian immunodeficiency virus (SIV).
- SIV Simian immunodeficiency virus
- the XDP comprises one or more Lentivirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 288-312 and 334-339 as set forth in Table 5, or a sequence having at least 80%, at least 90%, at least 95%, at least 95%, at least 97%, at least 98%, or at least 99% identity thereto.
- the XDP comprises one or more Lentivirus structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 288-312 and 334-339 as set forth in Table 5.
- the XDP having Lentivirus components can be designed in various configurations, including the configurations of FIGS.
- the XDP comprise a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads. These alternative configurations are described more fully, below, as well as in the Examples.
- the therapeutic payload is an RNP of a complexed CasX and gNA embodiment described herein, while the tropism factor is a viral glycoprotein embodiment described herein.
- the XDP viral components are derived from Spumaretrovirinae, including but not limited to Bovispumavirus, Equispumavirus, Felispumavirus, Prosimiispumavirus, Simiispumavirus, and Spumavirus.
- the present disclosure provides XDP wherein the XDP comprises components selected from the group consisting of: p68 Gag; a p3 Gag; a Gag polyprotein comprising of p68 Gag and p3 gag; a therapeutic payload; a tropism factor; a Gag-Pol polyprotein; a Gag-transframe region-Pol protease polyprotein; a cleavage site(s); and a non-retroviral, heterologous protease capable of cleaving the protease cleavage sites.
- Gag components e.g., p68 AND p3p20
- the cleavage site and protease are derived from an Spumaretrovirinae including but not limited to Bovispumavirus, Equispumavirus, Felispumavirus, Prosimiispumavirus, Simiispumavirus, and Spumavirus.
- the encoding sequences for these components are provided in Table 5, and the methods to create the encoding plasmids and produce the XDP in host cells are described herein, below.
- the XDP comprises one or more Spumaretrovirinae structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 313-333 as set forth in Table 5, or a sequence having at least 80%, at least 90%, at least 95%, at least 95%, at least 97%, at least 98%, or at least 99% identity thereto.
- the XDP comprises one or more Spumaretrovirinae structural components encoded by the sequences selected from the group consisting SEQ ID NOS: 313- 333 as set forth in Table 5.
- the XDP having Spumaretrovirus components can be designed in various configurations, including the configurations of FIGS.
- the XDP comprise a subset of the components listed in the paragraph, such as depicted in FIGS. 36- 68, which depict CasX and gNA as the therapeutic payloads. These alternative configurations are described more fully, below, as well as in the Examples.
- the therapeutic payload is an RNP of a complexed CasX and gNA embodiment described herein, while the tropism factor is a viral glycoprotein embodiment described herein.
- the present disclosure provides XDP wherein the retroviral components of the XDP are selected from different genera of the Retroviridae.
- the XDP can comprise two or more components selected from a matrix polypeptide (MA), a p2A spacer peptide, a p2B spacer peptide; a plO spacer peptide, a capsid polypeptide (CA), a nucleocapsid polypeptide (NC), a pp21/24 spacer peptide, a p3-P8 spacer peptide, a ppl2 spacer peptide, a p20 spacer peptide, a pl/p6 spacer peptide, a p68 Gag, a p3 Gag, a cleavage site(s), a Gag-Pol polyprotein; a Gag-transframe region-Pol protease polyprotein; and a non-retroviral, heterologous protease capable
- the accessory protein integrase (or its encoding nucleic acid) can be omitted from the XDP systems, as well as the HIV functional accessory genes vpr, vpx (HIV-2), which are dispensable for viral replication in vitro. Additionally, the nucleic acids of the XDP system do not require reverse transcriptase for the creation of the XDP compositions of the embodiments.
- the HIV-1 Gag-Pol component of the XDP can be truncated to Gag linked to the transframe region (TFR) composed of the transframe octapeptide (TFP) and 48 amino acids of the p6pol, separated by a protease cleavage site, hereinafter referred to as Gag-TFR-PR, described more fully, below.
- TFR transframe region
- TFP transframe octapeptide
- Gag-TFR-PR protease cleavage site
- the protease capable of cleaving the protease cleavage sites is selected from a retroviral protease, including any of the genera of the Retroviridae.
- the protease can be encoded by a sequence selected from the group consisting of SEQ ID NOS: 198, 234, 239, 245, 251, 257, 261, 266, 271, 276, 282, 287, 291,
- the protease capable of cleaving the protease cleavage sites is a non-retroviral, heterologous protease selected from the group of proteases consisting of tobacco etch virus protease (TEV), potyvirus HC protease, potyvirus Plprotease, PreScission (HRV3C protease), b virus NIa protease, B virus RNA-2-encoded protease, aphthovirus L protease, enterovirus 2A protease, rhinovirus 2 A protease, picoma 3C protease, comovirus 24K protease, nepovirus 24K protease, RTSV (rice tungro spherical
- the protease capable of cleaving the protease cleavage sites is PreScission Protease; a fusion protein of human rhinovirus (HRV) 3C protease and glutathione S-transferase (GST).
- HRV human rhinovirus
- GST glutathione S-transferase
- the protease capable of cleaving the protease cleavage sites is tobacco etch virus protease (TEV)
- TSV tobacco etch virus protease
- the protease capable of cleaving the protease cleavage sites is HIV-1 protease.
- the 99-amino acid protease (PR) of the precursor Gag— Pol polyprotein (which are encoded by overlapping open reading frames such that the synthesis of the of the Gag— Pol precursor results from a -1 frameshifting event) is flanked at its N-terminus by a transframe region (TFR) composed of the transframe octapeptide (TFP) and 48 amino acids of the p6pol, separated by a protease cleavage site.
- TFR transframe region
- TFP transframe octapeptide
- the Gag-Pol sequence comprises the encoded TFR-PR to facilitate the-1 frameshifting event.
- the XDP system utilizes a component comprised of the Gag polyprotein and a portion of the pol polyprotein comprising the TFR and the protease
- the component is referred to herein as “Gag-TFR-PR”, wherein the capability to facilitate the -1 frameshifting event is retained, along with the capability to produce the encoded protease.
- Gag-TFR-PR the component that facilitates the -1 frameshifting event is retained, along with the capability to produce the encoded protease.
- protease cleavage sites utilized in the encoded proteins of the XDPs and their encoding sequences in the nucleic acids will correlate with the protease that is incorporated into the XDP system.
- the protease cleavage site of the XDP component comprising all or a portion of a Gag polyprotein is located between the Gag polyprotein and the therapeutic payload such that upon maturation of the XDP particle, the therapeutic payload is not tethered to any component of the Gag polyprotein.
- the protease cleavage site is incorporated between the individual components of the Gag polyprotein as well as between the Gag polyprotein and the therapeutic payload.
- the encoded TEV protease cleavage sites can have the sequences EXXYXQ(G/S) (SEQ ID NO: 17), ENLYFQG (SEQ ID NO: 18) or ENLYFQS (SEQ ID NO: 19), wherein X represents any amino acid and cleavage by TEV occurs between Q and G or Q and S.
- the encoded HIV-1 cleavage sites can have the sequence SQNYPIVQ (SEQ ID NO: 20).
- the protease is PreScission
- the protease cleavage sites include the core amino acid sequence Leu-Phe-Gln/Gly-Pro (SEQ ID NO: 1010), cleaving between the Gin and Gly residues.
- the XDP comprising cleavage sites have protease cleavage sites that are identical.
- the XDP comprising cleavage sites have protease cleavage sites that are different and are substrates for different proteases.
- the XDP system can comprise a cleavage sequence that is susceptible to cleavage by two different proteases; e.g., HIV-1 and PreScission protease.
- the nucleic acids encoding the XDP would include encoding sequences for both proteases.
- Additional protease cleavage sites are envisaged as within the scope of the XDP of the instant invention, and include, inter alia , SEQ ID NOS: 874-897, and 934-946. d.
- Protein therapeutic payloads suitable for inclusion in the XDP of the present disclosure include a diversity of categories of protein-based therapeutics, including, but not limited to cytokines (e.g., IFNs a, b, and g, TNF-a, G-CSF, GM-CSF)), interleukins (e.g., IL-1 to IL-40), growth factors (e.g., VEGF, PDGF, IGF-1, EGF, and TGF-b), enzymes, receptors, microproteins, hormones (e.g., growth hormone, insulin), erythropoietin, RNAse, DNAse, blood clotting factors (e.g.
- cytokines e.g., IFNs a, b, and g, TNF-a, G-CSF, GM-CSF
- interleukins e.g., IL-1 to IL-40
- growth factors e.g., VEGF, PDGF, IGF
- FVII, FVIII, FIX, FX anticoagulants
- bone morphogenetic proteins engineered protein scaffolds, thrombolytics (e.g., streptokinase, tissue plasminogen activator, plasminogen, and plasmid), CRISPR proteins (Class 1 and Class 2 Type II, Type V, or Type VI) as well as engineered proteins such as anti-cancer modalities or biologies intended to treat diseases such as neurologic, metabolic, cardiovascular, liver, renal, or endocrine diseases and disorders.
- thrombolytics e.g., streptokinase, tissue plasminogen activator, plasminogen, and plasmid
- CRISPR proteins Class 1 and Class 2 Type II, Type V, or Type VI
- engineered proteins such as anti-cancer modalities or biologies intended to treat diseases such as neurologic, metabolic, cardiovascular, liver, renal, or endocrine diseases and disorders.
- Nucleic acid payloads suitable for inclusion in the XDP of the present disclosure include a diversity of categories, including sequences encoding the foregoing protein therapeutic payloads, as well as single-stranded antisense oligonucleotides (ASOs), double-stranded RNA interference (RNAi) molecules, DNA aptamers, nucleic acids utilized in gene therapy (e.g., guide RNAs utilized in CRISPR systems and donor templates), micro RNAs, ribozymes, RNA decoys and circular RNAs.
- ASOs single-stranded antisense oligonucleotides
- RNAi double-stranded RNA interference
- DNA aptamers DNA aptamers
- nucleic acids utilized in gene therapy e.g., guide RNAs utilized in CRISPR systems and donor templates
- micro RNAs e.g., ribozymes, RNA decoys and circular RNAs.
- the protein payload of the XDP comprises a CasX variant protein of any of the embodiments described herein, including the CasX variants of SEQ ID NOS: 21-233, 343-345, 350-353, 355-367 and 388-397 as set forth in Tables 1, 7, 8, 9 and 11, while the nucleic acid payload comprises one or more guide RNAs of any of the embodiments described herein, including the gNA variants with a scaffold sequence of SEQ ID NOS: 597-781 as set forth in Table 3 and, optionally, a donor template.
- the present disclosure provides XDP compositions and systems comprising a CRISPR nuclease and one or more guide nucleic acids engineered to bind target nucleic acid that have utility in genome editing of eukaryotic cells.
- the CRISPR nuclease employed in the XDP systems is a Class 2 nuclease.
- the CRISPR nuclease is a Class 2, Type V nuclease.
- members of Class 2, Type V CRISPR-Cas systems have differences, they share some common characteristics that distinguish them from the Cas9 systems.
- Type V nucleases possess a single RNA-guided RuvC domain-containing effector but no HNH domain, and they recognize T-rich PAM 5’ upstream to the target region on the non-targeted strand, which is different from Cas9 systems which rely on G-rich PAM at 3’ side of target sequences.
- Type V nucleases generate staggered double-stranded breaks distal to the PAM sequence, unlike Cas9, which generates a blunt end in the proximal site close to the PAM.
- Type V nucleases degrade ssDNA in trans when activated by target dsDNA or ssDNA binding in cis.
- the Type V nucleases utilized in the XDP embodiments recognize a 5’ TC PAM motif and produce staggered ends cleaved solely by the RuvC domain.
- the XDP comprise a Class 2, Type V nuclease selected from the group consisting of Casl2a, Casl2b, Casl2c, Casl2d (CasY), Casl2j and CasX.
- the present disclosure provides XDP comprising a ribonucleoprotein (RNP) of a complexed CasX protein and one or more guide nucleic acids (gNA) that are specifically designed to modify a target nucleic acid sequence in eukaryotic cells.
- RNP ribonucleoprotein
- gNA guide nucleic acids
- CasX protein refers to a family of proteins, and encompasses all naturally occurring CasX proteins (also referred to herein as a “wild-type” or “reference” CasX), as well as CasX variants with one or more modifications in at least one domain relative to a naturally-occurring reference CasX protein.
- Reference CasX proteins include, but are not limited to those isolated or derived from Deltaproteobacter , Planctomycetes, or Candidatus (as described in US20180346927A1 and WO2018064371A1, incorporated herein by reference). Exemplary embodiments of CasX variants envisaged as being within the scope of the disclosure are described herein, below.
- a Type V reference CasX protein is isolated or derived from Deltaproteobacteria.
- a CasX protein comprises a sequence at least 50% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 81% identical, at least 82% identical, at least 83% identical, at least 84% identical, at least 85% identical, at least 86% identical, at least 86% identical, at least 87% identical, at least 88% identical, at least 89% identical, at least 89% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, at least 99% identical, at least 99.5% identical or 100% identical to a sequence of:
- a Type V reference CasX protein is isolated or derived from Planctomycetes.
- a CasX protein comprises a sequence at least 50% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 81% identical, at least 82% identical, at least 83% identical, at least 84% identical, at least 85% identical, at least 86% identical, at least 86% identical, at least 87% identical, at least 88% identical, at least 89% identical, at least 89% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, at least 99% identical, at least 99.5% identical or 100% identical to a sequence of: 1 MQEIKRINKI RRRLVKDSNT KKAGKTGPMK TLLVRVMTPD LRERLENLRK KPENIP
- a Type V reference CasX protein is isolated or derived from Candidatus Sungbacteria.
- a CasX protein comprises a sequence at least 50% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 81% identical, at least 82% identical, at least 83% identical, at least 84% identical, at least 85% identical, at least 86% identical, at least 86% identical, at least 87% identical, at least 88% identical, at least 89% identical, at least 89% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, at least 99% identical, at least 99.5% identical or 100% identical to a sequence of 1 MDNANKPSTK SLVNTTRISD HFGVTPGQVT RVFSFGIIPT KRQYAIIERW FAAVEAARER
- the disclosure provides CasX variant proteins for use in the XDP comprising a sequence that has at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 20, at least 30, at least 40 or at least 50 or more individual or sequential mutations relative to the sequence of a reference CasX protein of SEQ ID NO: 1, SEQ ID NO:2, or SEQ ID NO:3.
- These mutations can be insertions, deletions, amino acid substitutions, or any combinations thereof.
- a CasX variant in addition to the aforementioned mutations, can further comprise a substitution of a portion or all of a domain from a heterologous reference CasX, and the substituted domain can further comprise one or more mutations.
- Suitable mutagenesis methods for generating CasX variant proteins of the disclosure may include, for example, Deep Mutational Evolution (DME), deep mutational scanning (DMS), error prone PCR, cassette mutagenesis, random mutagenesis, staggered extension PCR, gene shuffling, or domain swapping.
- the CasX variants are designed, for example by selecting one or more desired mutations in a reference CasX. Any amino acid can be substituted for any other amino acid in the substitutions described herein.
- the substitution can be a conservative substitution (e.g., a basic amino acid is substituted for another basic amino acid).
- the substitution can be a non-conservative substitution (e.g., a basic amino acid is substituted for an acidic amino acid or vice versa).
- a proline in a reference CasX protein can be substituted for any of arginine, histidine, lysine, aspartic acid, glutamic acid, serine, threonine, asparagine, glutamine, cysteine, glycine, alanine, isoleucine, leucine, methionine, phenylalanine, tryptophan, tyrosine or valine to generate a CasX variant protein of the disclosure.
- the activity of a reference CasX protein is used as a benchmark against which the activity of one or more CasX variants are compared, thereby measuring improvements in function of the CasX variants.
- a CasX variant protein comprises at least one amino acid deletion relative to a reference CasX protein.
- a CasX variant protein comprises a deletion of 1-4 amino acids, 1-10 amino acids, 1-20 amino acids, 1-30 amino acids, 1-40 amino acids, 1-50 amino acids, 1-60 amino acids, 1-70 amino acids, 1-80 amino acids, 1-90 amino acids, 1-100 amino acids, 2-10 amino acids, 2-20 amino acids, 2-30 amino acids, 3-10 amino acids, 3-20 amino acids, 3-30 amino acids, 4-10 amino acids, 4-20 amino acids, 3-300 amino acids, 5-10 amino acids, 5-20 amino acids, 5-30 amino acids, 10-50 amino acids or 20-50 amino acids relative to a reference CasX protein.
- a CasX protein comprises a deletion of at least about 100 consecutive amino acids relative to a reference CasX protein. In some embodiments, a CasX variant protein comprises a deletion of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50 or 100 consecutive amino acids relative to a reference CasX protein. In some embodiments, a CasX variant protein comprises a deletion of 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 consecutive amino acids.
- a CasX variant protein comprises two or more deletions relative to a reference CasX protein, and the two or more deletions are not consecutive amino acids.
- a first deletion may be in a first domain of the reference CasX protein
- a second deletion may be in a second domain of the reference CasX protein.
- a CasX variant protein comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 non-consecutive deletions relative to a reference CasX protein.
- a CasX variant protein comprises at least 20 non-consecutive deletions relative to a reference CasX protein. Each non-consecutive deletion may be of any length of amino acids described herein, e.g., 1-4 amino acids, 1-10 amino acids, and the like.
- the CasX variant protein comprises one or more amino acid insertions relative to the sequence of SEQ ID NOS: 1, 2, or 3.
- a CasX variant protein comprises an insertion of 1 amino acid, an insertion of 2-3 consecutive or non- consecutive amino acids, 2-4 consecutive or non-consecutive amino acids, 2-5 consecutive or non-consecutive amino acids, 2-6 consecutive or non-consecutive amino acids, 2-7 consecutive or non-consecutive amino acids, 2-8 consecutive or non-consecutive amino acids, 2-9 consecutive or non-consecutive amino acids, 2-10 consecutive or non-consecutive amino acids,
- 2-20 consecutive or non-consecutive amino acids 2-30 consecutive or non-consecutive amino acids, 2-40 consecutive or non-consecutive amino acids, 2-50 consecutive or non-consecutive amino acids, 2-60 consecutive or non-consecutive amino acids, 2-70 consecutive or non- consecutive amino acids, 2-80 consecutive or non-consecutive amino acids, 2-90 consecutive or non-consecutive amino acids, 2-100 consecutive or non-consecutive amino acids, 3-10 consecutive or non-consecutive amino acids, 3-20 consecutive or non-consecutive amino acids,
- the CasX variant protein comprises an insertion of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 consecutive or non-consecutive amino acids.
- a CasX variant protein comprises an insertion of at least about 100 consecutive or non-consecutive amino acids. Any amino acid, or combination of amino acids, can be inserted in the insertions described herein to generate a CasX variant protein.
- a CasX variant protein can comprise at least one substitution and at least one deletion relative to a reference CasX protein sequence, at least one substitution and at least one insertion relative to a reference CasX protein sequence, at least one insertion and at least one deletion relative to a reference CasX protein sequence, or at least one substitution, one insertion and one deletion relative to a reference CasX protein sequence.
- a CasX variant comprises some or all of the following domains: a non-target strand binding (NTSB) domain, a target strand loading (TSL) domain, a helical I domain, a helical II domain, an oligonucleotide binding domain (OBD), and a RuvC DNA cleavage domain (the latter which may be deleted in a catalytically dead CasX variant), described more fully, below.
- NTSB non-target strand binding
- TSL target strand loading
- OBD oligonucleotide binding domain
- RuvC DNA cleavage domain the latter which may be deleted in a catalytically dead CasX variant
- the at least one modification of the CasX variant protein comprises a deletion of at least a portion of one domain of the reference CasX protein, including the sequences of SEQ ID NOS: 1-3. In some embodiments, the deletion is in the NTSBD, TSLD, Helical I domain, Helical II domain, OBD, or RuvC DNA cleavage domain. In some embodiments, the CasX variant comprises at least one modification in the NTSB domain. In some embodiments, the CasX variant comprises at least one modification in the TSL domain. In some embodiments, the at least one modification in the TSL domain comprises an amino acid substitution of one or more of amino acids Y857, S890, or S932 of SEQ ID NO:2.
- the CasX variant comprises at least one modification in the helical I domain. In some embodiments, the at least one modification in the helical I domain comprises an amino acid substitution of one or more of amino acids S219, L249, E259, Q252, E292, L307, or D318 of SEQ ID NO:2. In some embodiments, the CasX variant comprises at least one modification in the helical II domain. In some embodiments, the at least one modification in the helical II domain comprises an amino acid substitution of one or more of amino acids D361, L379, E385, E386, D387, F399, L404, R458, C477, or D489 of SEQ ID NO:2.
- the CasX variant comprises at least one modification in the OBD domain.
- the at least one modification in the OBD comprises an amino acid substitution of one or more of amino acids F536, E552, T620, or 1658 of SEQ ID NO:2.
- the CasX variant comprises at least one modification in the RuvC DNA cleavage domain.
- the at least one modification in the RuvC DNA cleavage domain comprises an amino acid substitution of one or more of amino acids K682, G695, A708, V711, D732, A739, D733, L742, V747, F755, M771, M779, W782, A788, G791, L792, P793, Y797, M799, Q804, S819, or Y857 or a deletion of amino acid P793 of SEQ ID NO:2.
- the CasX variant comprises at least one modification compared to the reference CasX sequence of SEQ ID NO:2 is selected from one or more of: (a) an amino acid substitution of L379R; (b) an amino acid substitution of A708K; (c) an amino acid substitution of T620P; (d) an amino acid substitution of E385P; (e) an amino acid substitution of Y857R; (f) an amino acid substitution of I658V; (g) an amino acid substitution of F399L; (h) an amino acid substitution of Q252K; (i) an amino acid substitution of L404K; and (j) an amino acid deletion of P793.
- the CasX variant proteins of the disclosure have an enhanced ability to efficiently edit and/or bind target DNA, when complexed with a gNA as an RNP, utilizing PAM TC motif, including PAM sequences selected from TTC, ATC, GTC, or CTC, compared to an RNP of a reference CasX protein and reference gNA.
- the PAM sequence is located at least 1 nucleotide 5’ to the non-target strand of the protospacer having identity with the targeting sequence of the gNA in a assay system compared to the editing efficiency and/or binding of an RNP comprising a reference CasX protein and reference gNA in a comparable assay system.
- an RNP of a CasX variant and gNA variant exhibits greater editing efficiency and/or binding of a target sequence in the target DNA compared to an RNP comprising a reference CasX protein and a reference gNA in a comparable assay system, wherein the PAM sequence of the target DNA is TTC.
- an RNP of a CasX variant and gNA variant exhibits greater editing efficiency and/or binding of a target sequence in the target DNA compared to an RNP comprising a reference CasX protein and a reference gNA in a comparable assay system, wherein the PAM sequence of the target DNA is ATC.
- an RNP of a CasX variant and gNA variant exhibits greater editing efficiency and/or binding of a target sequence in the target DNA compared to an RNP comprising a reference CasX protein and a reference gNA in a comparable assay system, wherein the PAM sequence of the target DNA is CTC.
- an RNP of a CasX variant and gNA variant exhibits greater editing efficiency and/or binding of a target sequence in the target DNA compared to an RNP comprising a reference CasX protein and a reference gNA in a comparable assay system, wherein the PAM sequence of the target DNA is GTC.
- the increased editing efficiency and/or binding affinity for the one or more PAM sequences is at least 1.5-fold greater or more compared to the editing efficiency and/or binding affinity of an RNP of any one of the CasX proteins of SEQ ID NOS: 1-3 and the gNA of Table 2 for the PAM sequences.
- All variants that improve one or more functions or characteristics of the CasX variant protein when compared to a reference CasX protein described herein are envisaged as being within the scope of the disclosure.
- Exemplary improved characteristics of the CasX variant embodiments include, but are not limited to improved folding of the variant, improved binding affinity to the gNA, improved binding affinity to the target nucleic acid, improved ability to utilize a greater spectrum of PAM sequences in the editing and/or binding of target DNA, improved unwinding of the target DNA, increased editing activity, improved editing efficiency, improved editing specificity, increased percentage of a eukaryotic genome that can be efficiently edited, increased activity of the nuclease, increased target strand loading for double strand cleavage, decreased target strand loading for single strand nicking, decreased off-target cleavage, improved binding of the non-target strand of DNA, improved protein stability, improved proteimgNA (RNP) complex stability, improved protein solubility, improved proteimgNA (RNP) complex solubility, improved protein yield, improved
- the RNP of the CasX variant and the gNA variant exhibit one or more of the improved characteristics that are at least about 1.1 to about 100,000-fold improved relative to an RNP of the reference CasX protein of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3 and the gNA of Table 2, when assayed in a comparable fashion.
- the one or more improved characteristics of an RNP of the CasX variant and the gNA variant are at least about 1.1, at least about 10, at least about 100, at least about 1000, at least about 10,000, at least about 100,000-fold or more improved relative to an RNP of the reference CasX protein of SEQ ID NO: 1, SEQ ID NO:2, or SEQ ID NO:3 and the gNA of Table 2.
- the one or more of the improved characteristics of an RNP of the CasX variant and the gNA variant are about 1.1 to 100,00-fold, about 1.1 to 10,00-fold, about 1.1 to 1,000-fold, about 1.1 to 500-fold, about 1.1 to 100-fold, about 1.1 to 50-fold, about 1.1 to 20-fold, about 10 to 100,00-fold, about 10 to 10,00-fold, about 10 to 1,000-fold, about 10 to 500- fold, about 10 to 100-fold, about 10 to 50-fold, about 10 to 20-fold, about 2 to 70-fold, about 2 to 50-fold, about 2 to 30-fold, about 2 to 20-fold, about 2 to 10-fold, about 5 to 50-fold, about 5 to 30-fold, about 5 to 10-fold, about 100 to 100,00-fold, about 100 to 10,00-fold, about 100 to 1,000-fold, about 100 to 500-fold, about 500 to 100,00-fold, about 500 to 10,00-fold, about 500 to 1,000-fold, about 500 to 750-fold, about 1,000 to 10
- the one or more improved characteristics of an RNP of the CasX variant and the gNA variant are about 1.1 -fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold, 11-fold, 12-fold, 13-fold, 14-fold, 15-fold, 16-fold, 17-fold, 18-fold, 19-fold, 20-fold, 25-fold, 30-fold, 40-fold, 45-fold, 50-fold, 55-fold, 60-fold, 70-fold, 80-fold, 90-fold, 100-fold, 110-fold, 120-fold, 130-fold, 140-fold, 150-fold, 160-fold, 170-fold, 180-fold, 190-fold, 200-fold, 210-fold, 220-fold, 230-fold, 240-fold, 250-fold, 260- fold,
- an RNP comprising a CasX variant protein and a gNA of the disclosure at a concentration of 20 pM or less, is capable of cleaving a double stranded DNA target with an efficiency of at least 80%.
- the RNP at a concentration of 20 pM or less is capable of cleaving a double stranded DNA target with an efficiency of at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90% or at least 95%.
- the RNP at a concentration of 50 pM or less, 40 pM or less, 30 pM or less, 20 pM or less, 10 pM or less, or 5 pM or less is capable of cleaving a double stranded DNA target with an efficiency of at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90% or at least 95%.
- CasX variant is inclusive of variants that are fusion proteins; i.e., the CasX is “fused to” a heterologous sequence. This includes CasX variants comprising CasX variant sequences and N-terminal, C-terminal, or internal fusions of the CasX to a heterologous protein or domain thereof.
- the CasX variant protein comprises between 400 and 2000 amino acids, between 500 and 1500 amino acids, between 700 and 1200 amino acids, between 800 and 1100 amino acids or between 900 and 1000 amino acids.
- the CasX variant protein comprises one or more modifications comprising a region of non-contiguous residues that form a channel in which gNA:target DNA complexing occurs. In some embodiments, the CasX variant protein comprises one or more modifications comprising a region of non-contiguous residues that form an interface which binds with the gNA.
- the helical I, helical II and OBD domains all contact or are in proximity to the gNA:target DNA complex, and one or more modifications to non-contiguous residues within any of these domains may improve function of the CasX variant protein.
- the CasX variant protein comprises one or more modifications comprising a region of non-contiguous residues that form a channel which binds with the non target strand DNA.
- a CasX variant protein can comprise one or more modifications to non-contiguous residues of the NTSBD.
- the CasX variant protein comprises one or more modifications comprising a region of non-contiguous residues that form an interface which binds with the PAM.
- a CasX variant protein can comprise one or more modifications to non-contiguous residues of the helical I domain or OBD.
- the CasX variant protein comprises one or more modifications comprising a region of non-contiguous surface-exposed residues.
- surface-exposed residues refers to amino acids on the surface of the CasX protein, or amino acids in which at least a portion of the amino acid, such as the backbone or a part of the side chain is on the surface of the protein.
- Surface exposed residues of cellular proteins such as CasX which are exposed to an aqueous intracellular environment, are frequently selected from positively charged hydrophilic amino acids, for example arginine, asparagine, aspartate, glutamine, glutamate, histidine, lysine, serine, and threonine.
- a region of surface exposed residues comprises one or more insertions, deletions, or substitutions compared to a reference CasX protein.
- one or more positively charged residues are substituted for one or more other positively charged residues, or negatively charged residues, or uncharged residues, or any combinations thereof.
- one or more amino acids residues for substitution are near bound nucleic acid, for example residues in the RuvC domain or helical I domain that contact target DNA, or residues in the OBD or helical II domain that bind the gNA, can be substituted for one or more positively charged or polar amino acids.
- the CasX variant protein comprises one or more modifications comprising a region of non-contiguous residues that form a core through hydrophobic packing in a domain of the reference CasX protein.
- regions that form cores through hydrophobic packing are rich in hydrophobic amino acids such as valine, isoleucine, leucine, methionine, phenylalanine, tryptophan, and cysteine.
- RuvC domains comprise a hydrophobic pocket adjacent to the active site. In some embodiments, between 2 to 15 residues of the region are charged, polar, or base stacking.
- Charged amino acids may include, for example, arginine, lysine, aspartic acid, and glutamic acid, and the side chains of these amino acids may form salt bridges provided a bridge partner is also present.
- Polar amino acids may include, for example, glutamine, asparagine, histidine, serine, threonine, tyrosine, and cysteine. Polar amino acids can, in some embodiments, form hydrogen bonds as proton donors or acceptors, depending on the identity of their side chains.
- base-stacking includes the interaction of aromatic side chains of an amino acid residue (such as tryptophan, tyrosine, phenylalanine, or histidine) with stacked nucleotide bases in a nucleic acid. Any modification to a region of non-contiguous amino acids that are in close spatial proximity to form a functional part of the CasX variant protein is envisaged as within the scope of the disclosure.
- XDP comprising chimeric CasX proteins comprising protein domains from two or more different CasX proteins, such as two or more naturally occurring CasX proteins, or two or more CasX variant protein sequences as described herein.
- a “chimeric CasX protein” refers to a CasX containing at least two domains isolated or derived from different sources, such as two naturally occurring proteins, which may, in some embodiments, be isolated from different species.
- a chimeric CasX protein comprises a first domain from a first CasX protein and a second domain from a second, different CasX protein.
- the first domain can be selected from the group consisting of the NTSB, TSL, helical I, helical II, OBD and RuvC domains.
- the second domain is selected from the group consisting of the NTSB, TSL, helical I, helical II, OBD and RuvC domains with the second domain being different from the foregoing first domain.
- a chimeric CasX protein may comprise an NTSB, TSL, helical I, helical II, OBD domains from a CasX protein of SEQ ID NO: 2, and a RuvC domain from a CasX protein of SEQ ID NO: 1, or vice versa.
- a chimeric CasX protein may comprise an NTSB, TSL, helical II, OBD and RuvC domain from CasX protein of SEQ ID NO: 2, and a helical I domain from a CasX protein of SEQ ID NO: 1, or vice versa.
- a chimeric CasX protein may comprise an NTSB, TSL, helical II, OBD and RuvC domain from a first CasX protein, and a helical I domain from a second CasX protein.
- the domains of the first CasX protein are derived from the sequences of SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3
- the domains of the second CasX protein are derived from the sequences of SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3
- the first and second CasX proteins are not the same.
- domains of the first CasX protein comprise sequences derived from SEQ ID NO: 1 and domains of the second CasX protein comprise sequences derived from SEQ ID NO: 2.
- domains of the first CasX protein comprise sequences derived from SEQ ID NO: 1 and domains of the second CasX protein comprise sequences derived from SEQ ID NO: 3.
- domains of the first CasX protein comprise sequences derived from SEQ ID NO: 2 and domains of the second CasX protein comprise sequences derived from SEQ ID NO: 3.
- the CasX variant is selected of group consisting of CasX variants with sequences of SEQ ID NO: 102,
- a CasX variant protein comprises at least one chimeric domain comprising a first part from a first CasX protein and a second part from a second, different CasX protein.
- a “chimeric domain” refers to a domain containing at least two parts isolated or derived from different sources, such as two naturally occurring proteins or portions of domains from two reference CasX proteins.
- the at least one chimeric domain can be any of the NTSB, TSL, helical I, helical II, OBD or RuvC domains as described herein.
- the first portion of a CasX domain comprises a sequence of SEQ ID NO: 1 and the second portion of a CasX domain comprises a sequence of SEQ ID NO: 2. In some embodiments, the first portion of the CasX domain comprises a sequence of SEQ ID NO: 1 and the second portion of the CasX domain comprises a sequence of SEQ ID NO: 3. In some embodiments, the first portion of the CasX domain comprises a sequence of SEQ ID NO: 2 and the second portion of the CasX domain comprises a sequence of SEQ ID NO: 3. In some embodiments, the at least one chimeric domain comprises a chimeric RuvC domain.
- a chimeric RuvC domain comprises amino acids 661 to 824 of SEQ ID NO: 1 and amino acids 922 to 978 of SEQ ID NO: 2.
- a chimeric RuvC domain comprises amino acids 648 to 812 of SEQ ID NO: 2 and amino acids 935 to 986 of SEQ ID NO: 1.
- a CasX protein comprises a first domain from a first CasX protein and a second domain from a second CasX protein, and at least one chimeric domain comprising at least two parts isolated from different CasX proteins using the approach of the embodiments described in this paragraph.
- the chimeric CasX proteins having domains or portions of domains derived from SEQ ID NOS: 1, 2 and 3 can further comprise amino acid insertions, deletions, or substitutions of any of the embodiments disclosed herein.
- a CasX variant protein comprises a sequence of SEQ ID NOS: 21-233, 343-345, 350-353, 355-367 or 388-397 as set forth in Tables
- a CasX variant protein consists of a sequence of SEQ ID NO: 1
- a CasX variant protein comprises a sequence at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 81% identical, at least 82% identical, at least 83% identical, at least 84% identical, at least 85% identical, at least 86% identical, at least 86% identical, at least 87% identical, at least 88% identical, at least 89% identical, at least 89% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, at least 99% identical, at least 99.5% identical to a sequence of SEQ ID NOS: 21-233, 343-345, 350-353, 355-367 or 388-397 as set forth in Tables 1, 7, 8, 9 or
- a CasX variant protein comprises a sequence set forth in Table 1, and further comprises one or more NLS disclosed herein at or near either the N-terminus, the C- terminus, or both. It will be understood that in some cases, the N-terminal methionine of the CasX variants of the Tables is removed from the expressed CasX variant during post- translational modification.
- XDP comprising CasX variant proteins comprising a heterologous protein fused to the CasX.
- the CasX variant protein is fused to one or more proteins or domains thereof that has a different activity of interest, resulting in a fusion protein.
- the CasX variant protein is fused to a protein (or domain thereof) that inhibits transcription, modifies a target nucleic acid, or modifies a polypeptide associated with a nucleic acid (e.g., histone modification).
- a heterologous polypeptide (or heterologous amino acid such as a cysteine residue or a non-natural amino acid) can be inserted at one or more positions within a CasX protein to generate a CasX fusion protein utilized in the XDP systems.
- a cysteine residue can be inserted at one or more positions within a CasX protein followed by conjugation of a heterologous polypeptide described below.
- a heterologous polypeptide or heterologous amino acid can be added at the N- or C-terminus of the CasX variant protein.
- a heterologous polypeptide or heterologous amino acid can be inserted internally within the sequence of the CasX protein.
- a variety of heterologous polypeptides are suitable for inclusion in a CasX variant fusion protein utilized in the XDP systems of the disclosure.
- the fusion partner can modulate transcription (e.g., inhibit transcription, increase transcription) of a target DNA.
- the fusion partner is a protein (or a domain from a protein) that inhibits transcription (e.g., a transcriptional repressor, a protein that functions via recruitment of transcription inhibitor proteins, modification of target DNA such as methylation, recruitment of a DNA modifier, modulation of histones associated with target DNA, recruitment of a histone modifier such as those that modify acetylation and/or methylation of histones, and the like).
- a transcriptional repressor a protein that functions via recruitment of transcription inhibitor proteins, modification of target DNA such as methylation, recruitment of a DNA modifier, modulation of histones associated with target DNA, recruitment of a histone modifier such as those that modify acetylation and/or methylation of histones, and the like.
- the fusion partner is a protein (or a domain from a protein) that increases transcription (e.g., a transcription activator, a protein that acts via recruitment of transcription activator proteins, modification of target DNA such as demethylation, recruitment of a DNA modifier, modulation of histones associated with target DNA, recruitment of a histone modifier such as those that modify acetylation and/or methylation of histones, and the like).
- a transcription activator e.g., a transcription activator, a protein that acts via recruitment of transcription activator proteins, modification of target DNA such as demethylation, recruitment of a DNA modifier, modulation of histones associated with target DNA, recruitment of a histone modifier such as those that modify acetylation and/or methylation of histones, and the like.
- a CasX fusion partner utilized in the XDP systems has enzymatic activity that modifies a target nucleic acid (e.g., nuclease activity, methyltransf erase activity, demethylase activity, DNA repair activity, DNA damage activity, deamination activity, dismutase activity, alkylation activity, depurination activity, oxidation activity, pyrimidine dimer forming activity, integrase activity, transposase activity, recombinase activity, polymerase activity, ligase activity, helicase activity, photolyase activity or glycosylase activity).
- a target nucleic acid e.g., nuclease activity, methyltransf erase activity, demethylase activity, DNA repair activity, DNA damage activity, deamination activity, dismutase activity, alkylation activity, depurination activity, oxidation activity, pyrimidine dimer forming activity, integrase activity, transposase activity, re
- a CasX fusion partner utilized in the XDP systems has enzymatic activity that modifies a polypeptide (e.g., a histone) associated with a target nucleic acid (e.g., methyltransferase activity, demethylase activity, acetyltransferase activity, deacetylase activity, kinase activity, phosphatase activity, ubiquitin ligase activity, deubiquitinating activity, adenylation activity, deadenylation activity, SUMOylating activity, deSUMOylating activity, ribosylation activity, deribosylation activity, myristoylation activity or demyristoylation activity).
- a polypeptide e.g., a histone
- a target nucleic acid e.g., methyltransferase activity, demethylase activity, acetyltransferase activity, deacetylase activity, kinase activity,
- proteins (or fragments thereof) that can be used as a CasX fusion partner utilized in the XDP systems to increase transcription include but are not limited to: transcriptional activators such as VP 16, VP64, VP48, VP160, p65 subdomain (e.g., from NFkB), and activation domain of EDLL and/or TAL activation domain (e.g., for activity in plants); histone lysine methyltransferases such as SET1A, SET1B, MLL1 to 5, ASH1, SYMD2, NSD1, and the like; histone lysine demethylases such as JHDM2a/b, UTX, JMJD3, and the like; histone acetyltransferases such as GCN5, PCAF, CBP, p300, TAF1, TIP60/PLIP, MOZ/MYST3, MORF/MYST4, SRC1, ACTR, PI 60, CLOCK, and the like; and DNA demethylation domains such as
- ROS1 ROS1, and the like.
- proteins (or fragments thereof) that can be used as a CasX fusion partner in an XDP to decrease transcription include but are not limited to: transcriptional repressors such as the Kruppel associated box (KRAB or SKD); KOX1 repression domain; the Mad mSIN3 interaction domain (SID); the ERF repressor domain (ERD), the SRDX repression domain (e.g., for repression in plants), and the like; histone lysine methyltransferases such as Pr-SET7/8, SUV4- 20H1, RIZ1, and the like; histone lysine demethylases such as JMJD2A/JHDM3A, JMJD2B, JMJD2C/GASC1, JMJD2D, JARID 1 A/RBP2, JARIDlB/PLU-1, JARID 1C/SMCX, JARIDID/SMCY, and the like; histone lysine deacetylase
- the CasX fusion partner utilized in the XDP systems has enzymatic activity that modifies the target nucleic acid (e.g., ssRNA, dsRNA, ssDNA, dsDNA).
- target nucleic acid e.g., ssRNA, dsRNA, ssDNA, dsDNA.
- enzymatic activity examples include but are not limited to: nuclease activity such as that provided by a restriction enzyme (e.g., Fokl nuclease), methyltransferase activity such as that provided by a methyltransferase (e.g., Hhal DNA m5c- methyltransf erase (M.Hhal), DNA methyltransferase 1 (DNMT1), DNA methyltransferase 3a (DNMT3a), DNA methyltransferase 3b (DNMT3b), METI, DRM3 (plants), ZMET2, CMT1, CMT2 (plants), and the like); demethylase activity such as that provided by a demethylase (e.g., Ten-Eleven Translocation (TET) di oxygenase 1 (TET 1 CD), TET1, DME, DMLl, DML2, ROS1, and the like), DNA repair activity, DNA damage activity, deamination activity such as that provided by a restriction enzyme (e.g
- CasX variant protein of the present disclosure utilized in the XDP systems is fused to a polypeptide selected from: a domain for increasing transcription (e.g., a VP 16 domain, a VP64 domain), a domain for decreasing transcription (e.g., a KRAB domain, e.g., from the Koxl protein), a core catalytic domain of a histone acetyltransferase (e.g., histone acetyltransferase p300), a protein/domain that provides a detectable signal (e.g., a fluorescent protein such as GFP), a nuclease domain (e.g., a Fokl nuclease), and a base editor (e.g., cytidine deaminase such as APOBECl).
- a domain for increasing transcription e.g., a VP 16 domain, a VP64 domain
- a domain for decreasing transcription e.g., from
- the CasX fusion partner utilized in the XDP systems has enzymatic activity that modifies a protein associated with the target nucleic acid (e.g., ssRNA, dsRNA, ssDNA, dsDNA) (e.g., a histone, an RNA binding protein, a DNA binding protein, and the like).
- a protein associated with the target nucleic acid e.g., ssRNA, dsRNA, ssDNA, dsDNA
- a histone e.g., an RNA binding protein, a DNA binding protein, and the like.
- enzymatic activity that modifies a protein associated with a target nucleic acid
- enzymatic activity that modifies a protein associated with a target nucleic acid
- HMT histone methyltransferase
- KDM1A histone demethylase 1A
- JHDM2a/b histone demethylase 1A
- Suitable chloroplast transit peptides include, but are not limited to:
- a CasX variant polypeptide of the present disclosure can include an endosomal escape peptide.
- an endosomal escape polypeptide comprises the amino acid sequence GLFXALLXLLXSLWXLLLXA (SEQ ID NO: 127), wherein each X is independently selected from lysine, histidine, and arginine.
- an endosomal escape polypeptide comprises the amino acid sequence GLFHALLHLLHSLWHLLLHA (SEQ ID NO: 128), or HHHHHHHHH (SEQ ID NO: 129).
- Non-limiting examples of CasX fusion partners for use when targeting ssRNA target nucleic acids include (but are not limited to): splicing factors (e.g., RS domains); protein translation components (e.g., translation initiation, elongation, and/or release factors; e.g., eIF4G); RNA methylases; RNA editing enzymes (e.g., RNA deaminases, e.g., adenosine deaminase acting on RNA (ADAR), including A to I and/or C to U editing enzymes); helicases; RNA-binding proteins; and the like. It is understood that a heterologous polypeptide can include the entire protein or in some cases can include a fragment of the protein (e.g., a functional domain).
- splicing factors e.g., RS domains
- protein translation components e.g., translation initiation, elongation, and/or release factors; e.g
- a fusion partner can be any domain capable of interacting with ssRNA (which, for the purposes of this disclosure, includes intramolecular and/or intermolecular secondary structures, e.g., double-stranded RNA duplexes such as hairpins, stem-loops, etc.), whether transiently or irreversibly, directly or indirectly, including but not limited to an effector domain selected from the group comprising; endonucleases (for example RNase III, the CRR22 DYW domain, Dicer, and PIN (PilT N-terminus) domains from proteins such as SMG5 and SMG6); proteins and protein domains responsible for stimulating RNA cleavage (for example CPSF, CstF, CFIm and CFIIm); exonucleases (for example XRN-1 or Exonuclease T); deadenylases (for example HNT3); proteins and protein domains responsible for nonsense mediated RNA decay (for example UPF1, UPF2, UPF3, U
- the effector domain may be selected from the group comprising endonucleases; proteins and protein domains capable of stimulating RNA cleavage; exonucleases; deadenylases; proteins and protein domains having nonsense mediated RNA decay activity; proteins and protein domains capable of stabilizing RNA; proteins and protein domains capable of repressing translation; proteins and protein domains capable of stimulating translation; proteins and protein domains capable of modulating translation (e.g., translation factors such as initiation factors, elongation factors, release factors, etc., e.g., eIF4G); proteins and protein domains capable of polyadenylation of RNA; proteins and protein domains capable of polyuridinylation of RNA; proteins and protein domains having RNA localization activity; proteins and protein domains capable of nuclear retention of RNA; proteins and protein domains having RNA nuclear export activity; proteins and protein domains capable of repression of RNA splicing; proteins and protein domains capable of stimulation of RNA splicing; proteins and protein domain
- RNA splicing factors that can be used (in whole or as fragments thereof) as a CasX fusion partners in the XDP systems have modular organization, with separate sequence- specific RNA binding modules and splicing effector domains.
- members of the serine/arginine-rich (SR) protein family contain N-terminal RNA recognition motifs (RRMs) that bind to exonic splicing enhancers (ESEs) in pre-mRNAs and C-terminal RS domains that promote exon inclusion.
- RRMs N-terminal RNA recognition motifs
- ESEs exonic splicing enhancers
- the hnRNP protein hnRNP A1 binds to exonic splicing silencers (ESSs) through its RRM domains and inhibits exon inclusion through a C- terminal glycine-rich domain.
- Some splicing factors can regulate alternative use of splice site (ss) by binding to regulatory sequences between the two alternative sites.
- ASF/SF2 can recognize ESEs and promote the use of intron proximal sites, whereas hnRNP A1 can bind to ESSs and shift splicing towards the use of intron distal sites.
- One application for such factors is to generate ESFs that modulate alternative splicing of endogenous genes, particularly disease associated genes.
- Bcl-x pre-mRNA produces two splicing isoforms with two alternative 5' splice sites to encode proteins of opposite functions.
- the long splicing isoform Bcl- xL is a potent apoptosis inhibitor expressed in long-lived post mitotic cells and is up-regulated in many cancer cells, protecting cells against apoptotic signals.
- the short isoform Bcl-xS is a pro- apoptotic isoform and expressed at high levels in cells with a high turnover rate (e.g., developing lymphocytes).
- the ratio of the two Bcl-x splicing isoforms is regulated by multiple cis -elements that are located in either the core exon region or the exon extension region (i.e., between the two alternative 5' splice sites).
- W02010075303 which is hereby incorporated by reference in its entirety.
- CasX fusion partners utilized in the XDP systems include, but are not limited to, proteins (or fragments thereof) that are boundary elements (e.g., CTCF), proteins and fragments thereof that provide periphery recruitment (e.g., Lamin A, Lamin B, etc.), and protein docking elements (e.g., FKBP/FRB, Pill/Abyl, etc.).
- boundary elements e.g., CTCF
- proteins and fragments thereof that provide periphery recruitment e.g., Lamin A, Lamin B, etc.
- protein docking elements e.g., FKBP/FRB, Pill/Abyl, etc.
- a heterologous polypeptide (a fusion partner) provides for subcellular localization of the CasX to which it is fused, i.e., the heterologous polypeptide contains a subcellular localization sequence (e.g., a nuclear localization signal (NLS) for targeting to the nucleus, a sequence to keep the fusion protein out of the nucleus, e.g., a nuclear export sequence (NES), a sequence to keep the fusion protein retained in the cytoplasm, a mitochondrial localization signal for targeting to the mitochondria, a chloroplast localization signal for targeting to a chloroplast, an ER retention signal, and the like).
- a subcellular localization sequence e.g., a nuclear localization signal (NLS) for targeting to the nucleus
- NES nuclear export sequence
- a sequence to keep the fusion protein retained in the cytoplasm e.g., a mitochondrial localization signal for targeting to the mitochondria, a chloroplast localization signal for targeting to a chloroplast
- a subject RNA-guided polypeptide does not include a NLS so that the protein is not targeted to the nucleus (which can be advantageous, e.g., when the target nucleic acid is an RNA that is present in the cytosol).
- a fusion partner can provide a tag (i.e., the heterologous polypeptide is a detectable label) for ease of tracking and/or purification (e.g., a fluorescent protein, e.g., green fluorescent protein (GFP), yellow fluorescent protein (YFP), red fluorescent protein (RFP), cyan fluorescent protein (CFP), mCherry, tdTomato, and the like; a histidine tag, e.g., a 6XHis tag; a hemagglutinin (HA) tag; a FLAG tag; a Myc tag; and the like).
- a fluorescent protein e.g., green fluorescent protein (GFP), yellow fluorescent protein (YFP), red fluorescent protein (RFP), cyan fluorescent protein (CFP), mCherry, tdTomato, and the like
- a histidine tag e.g., a 6XHis tag
- HA hemagglutinin
- FLAG tag a FLAG tag
- a CasX variant protein for use in the XDP systems includes (is fused to) a nuclear localization signal (NLS).
- NLS nuclear localization signal
- a CasX variant protein is fused to 2 or more, 3 or more, 4 or more, or 5 or more 6 or more, 7 or more, 8 or more NLSs.
- one or more NLSs (2 or more, 3 or more, 4 or more, or 5 or more NLSs) are positioned at or near (e.g., within 50 amino acids of) the N-terminus and/or the C-terminus.
- one or more NLSs (2 or more, 3 or more, 4 or more, or 5 or more NLSs) are positioned at or near (e.g., within 50 amino acids of) the N-terminus. In some cases, one or more NLSs (2 or more, 3 or more, 4 or more, or 5 or more NLSs) are positioned at or near (e.g., within 50 amino acids of) the C- terminus. In some cases, one or more NLSs (3 or more, 4 or more, or 5 or more NLSs) are positioned at or near (e.g., within 50 amino acids of) both the N-terminus and the C-terminus.
- an NLS is positioned at the N-terminus and an NLS is positioned at the C-terminus.
- a CasX variant protein includes (is fused to) between 1 and 10 NLSs (e.g., 1-9, 1- 8, 1-7, 1-6, 1-5, 2-10, 2-9, 2-8, 2-7, 2- 6, or 2-5 NLSs).
- a CasX variant protein includes (is fused to) between 2 and 5 NLSs (e.g., 2-4, or 2-3 NLSs).
- Non-limiting examples of NLSs include sequences derived from: the NLS of the SV40 virus large T-antigen, having the amino acid sequence PKKKRKV (SEQ ID NO: 130); the NLS from nucleoplasmin (e.g., the nucleoplasmin bipartite NLS with the sequence KRPAATKKAGQAKKKK (SEQ ID NO: 131); the c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO: 132) or RQRRNELKRSP (SEQ ID NO: 133); the hRNPAl M9 NLS having the sequence NQ S SNF GPMKGGNF GGRS S GP Y GGGGQ YF AKPRN Q GGY (SEQ ID NO: 134); the sequence
- RMRIZFKNKGKDTAELRRRRVEV S VELRKAKKDEQILKRRNV SEQ ID NO: 135) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO: 136) and PPKKARED (SEQ ID NO: 137) of the myoma T protein; the sequence PQPKKKPL (SEQ ID NO: 138) of human p53; the sequence SALIKKKKKMAP (SEQ ID NO: 139) of mouse c-abl IV; the sequences DRLRR (SEQ ID NO: 140) and PKQKKRK (SEQ ID NO: 141) of the influenza virus NS1; the sequence RKLKKKIKKL (SEQ ID NO: 142) of the Hepatitis virus delta antigen; the sequence REKKKFLKRR (SEQ ID NO: 143) of the mouse Mxl protein; the sequence KRKGDE VDGVDE V AKKK SKK (SEQ ID NO: 144) of the human poly(ADP-rib
- PKKK SRKPKKK SRK (SEQ ID NO: 160), HKKKHPD AS VNF SEF SK (SEQ ID NO: 161), QRPGPYDRPQRPGPYDRP (SEQ ID NO: 162), LSPSLSPLLSPSLSPL (SEQ ID NO: 163), RGKGGKGLGKGGAKRHRK (SEQ ID NO: 164), PKRGRGRPKRGRGR (SEQ ID NO: 165), and PKKKRKVPPPPKKKRKV (SEQ ID NO: 166).
- NLS or multiple NLSs are of sufficient strength to drive accumulation of a reference or CasX variant fusion protein in the nucleus of a eukaryotic cell.
- Detection of accumulation in the nucleus may be performed by any suitable technique.
- a detectable marker may be fused to a reference or CasX variant fusion protein such that location within a cell may be visualized.
- Cell nuclei may also be isolated from cells, the contents of which may then be analyzed by any suitable process for detecting protein, such as immunohistochemistry, Western blot, or enzyme activity assay. Accumulation in the nucleus may also be determined.
- a reference or CasX variant fusion protein includes a "Protein Transduction Domain” or PTD (also known as a CPP - cell penetrating peptide), which refers to a protein, polynucleotide, carbohydrate, or organic or inorganic compound that facilitates traversing a lipid bilayer, micelle, cell membrane, organelle membrane, or vesicle membrane.
- PTD Protein Transduction Domain
- a PTD attached to another molecule which can range from a small polar molecule to a large macromolecule and/or a nanoparticle, facilitates the molecule traversing a membrane, for example going from an extracellular space to an intracellular space, or from the cytosol to within an organelle.
- a PTD is covalently linked to the amino terminus of a reference or CasX variant fusion protein. In some embodiments, a PTD is covalently linked to the carboxyl terminus of a reference or CasX variant fusion protein. In some cases, the PTD is inserted internally in the sequence of a reference or CasX variant fusion protein at a suitable insertion site. In some cases, a reference or CasX variant fusion protein includes (is conjugated to, is fused to) one or more PTDs (e.g., two or more, three or more, four or more PTDs). In some cases, a PTD includes one or more nuclear localization signals (NLS).
- NLS nuclear localization signals
- PTDs include but are not limited to peptide transduction domain of HIV TAT comprising Y GRKKRRQRRR (SEQ ID NO: 167), RKKRRQRR (SEQ ID NO: 168); YARAAARQARA (SEQ ID NO: 169); THRLPRRRRRR (SEQ ID NO: 170); and GGRRARRRRRR (SEQ ID NO: 171); a polyarginine sequence comprising a number of arginines sufficient to direct entry into a cell (e.g., 3, 4, 5, 6, 7, 8, 9, 10, or 10-50 arginines (SEQ ID NO: 172)); a VP22 domain (Zender et al. (2002) Cancer Gene Ther.
- the PTD is an activatable CPP (ACPP) (Aguilera et al. (2009) Integr Biol (Camb) June; 1(5-6): 371-381).
- ACPPs comprise a polycationic CPP (e.g., Arg9 or "R9") connected via a cleavable linker to a matching polyanion (e.g., Glu9 or "E9”), which reduces the net charge to nearly zero and thereby inhibits adhesion and uptake into cells.
- a reference or CasX variant fusion protein can include a CasX protein that is linked to an internally inserted heterologous amino acid or heterologous polypeptide (a heterologous amino acid sequence) via a linker polypeptide (e.g., one or more linker polypeptides).
- a reference or CasX variant fusion protein can be linked at the C-terminal and/or N-terminal end to a heterologous polypeptide (fusion partner) via a linker polypeptide (e.g., one or more linker polypeptides)
- the linker polypeptide may have any of a variety of amino acid sequences. Proteins can be joined by a spacer peptide, generally of a flexible nature, although other chemical linkages are not excluded. Suitable linkers include polypeptides of between 4 amino acids and 40 amino acids in length, or between 4 amino acids and 25 amino acids in length. These linkers are generally produced by using synthetic, linker encoding oligonucleotides to couple the proteins.
- Peptide linkers with a degree of flexibility can be used.
- the linking peptides may have virtually any amino acid sequence, bearing in mind that the preferred linkers will have a sequence that results in a generally flexible peptide.
- the use of small amino acids, such as glycine and alanine, are of use in creating a flexible peptide. The creation of such sequences is routine to those of skill in the art.
- a variety of different linkers are commercially available and are considered suitable for use.
- Example linker polypeptides include glycine polymers (G)n, glycine-serine polymer (including, for example, (GS)n, GSGGSn (SEQ ID NO: 177), GGSGGSn (SEQ ID NO: 178), and GGGSn (SEQ ID NO: 179), where n is an integer of at least one), glycine-alanine polymers, alanine-serine polymers, glycine-proline polymers, proline polymers and proline-alanine polymers.
- G glycine polymers
- glycine-serine polymer including, for example, (GS)n, GSGGSn (SEQ ID NO: 177), GGSGGSn (SEQ ID NO: 178), and GGGSn (SEQ ID NO: 179), where n is an integer of at least one
- glycine-alanine polymers glycine-alanine polymers
- alanine-serine polymers
- Example linkers can comprise amino acid sequences including, but not limited to, GGSG (SEQ ID NO: 180), GGSGG (SEQ ID NO: 181), GSGSG (SEQ ID NO: 182), GSGGG (SEQ ID NO: 183), GGGSG (SEQ ID NO: 184), GSSSG (SEQ ID NO: 185),GPGP (SEQ ID NO: 186), GGP, PPP, PPAPPA (SEQ ID NO: 187), PPPGPPP (SEQ ID NO: 188) and the like.
- linkers that are all or partially flexible, such that the linker can include a flexible linker as well as one or more portions that confer less flexible structure.
- the disclosure relates to XDP system components that encode or incorporate guide nucleic acids (gNA) of the CasX:gNA systems wherein the gNA comprises a targeting sequence engineered to be complementary to a target nucleic acid sequence to be edited.
- the gNA is capable of forming a complex with a CRISPR protein that has specificity to a protospacer adjacent motif (PAM) sequence comprising a TC motif in the complementary non-target strand, and wherein the PAM sequence is located 1 nucleotide 5’ of the sequence in the non-target strand that is complementary to the target nucleic acid sequence in the target strand of the target nucleic acid.
- the gNA is capable of forming a complex with a Class 2, Type V CRISPR nuclease.
- the gNA is capable of forming a complex with a CasX nuclease.
- Reference, or naturally-occurring gNA include, but are not limited to those isolated or derived from Deltaproteobacter , Planctomycetes, or Candidatus (as described in US20180346927A1 and WO2018064371A1, incorporated herein by reference), including the sequences of Table 2.
- the disclosure provides gNA variants having one or more modifications relative to a naturally-occurring gNA, the modified gNA hereinafter referred to as a “gNA variant”.
- the encoded gNA variant comprises or consists of a sequence that has at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, or at least 20, or at least 21, or at least 22, or at least 23, or at least 24, or at least 25 mutations relative to the sequence of a reference gNA.
- the gNA variant is a ribonucleic acid molecule (“gRNA”).
- the gNA variant is a deoxyribonucleic acid molecule (“gDNA”) in which uridine nucleotides have been replaced with thymidine.
- the gNA is a chimera, and comprises both DNA and RNA.
- multiple gNAs are delivered to the target cells or tissues in the XDP particles for the modification of a target nucleic acid.
- a pair of gNAs with targeting sequences to different regions of the target nucleic acid can be used in order to bind and cleave at two different sites within the gene or regulatory element, which is then edited by non-homologous end joining (NHEJ), homology-directed repair (HDR), homology-independent targeted integration (HITI), micro-homology mediated end joining (MMEJ), single strand annealing (SSA) or base excision repair (BER).
- NHEJ non-homologous end joining
- HDR homology-directed repair
- HITI homology-independent targeted integration
- MMEJ micro-homology mediated end joining
- SSA single strand annealing
- BER base excision repair
- a pair of gNAs can be incorporated into the XDP such that the CRISPR nuclease can bind and cleave at two different sites 5’ and 3’ of the exon(s) bearing the mutation(s) within the gene.
- cleavage refers to the breakage of the covalent backbone of a nucleic acid molecule; either DNA or RNA, by the nuclease.
- small indels introduced by the CasX:gNA systems of the embodiments described herein and cellular repair systems can restore the protein reading frame of the mutant gene (“refraining” strategy).
- the cells may be contacted with a single gNA.
- the disclosure contemplates use of targeting sequences that flank the segment 5’ and 3’ such that it can be deleted or replaced with a donor template having the correct sequence.
- a pair of gNAs with targeting sequences to different or overlapping regions of the target nucleic acid sequence can be used in order to bind and the CasX to cleave at two different or overlapping sites within or proximal to the exon or regulatory element of the gene, which is then edited by non-homologous end joining (NHEJ), homology-directed repair (HDR, which can include, for example, insertion of a donor template to replace all or a portion of an HTT exon), homology-independent targeted integration (HITI), micro-homology mediated end joining (MMEJ), single strand annealing (SSA) or base excision repair (BER).
- NHEJ non-homologous end joining
- HDR homology-directed repair
- HITI homology-independent targeted integration
- MMEJ micro-homology mediated end joining
- SSA single strand annealing
- BER base excision repair
- the gNA variants of the disclosure can be designed and created by a number of mutagenesis methods, which may include Deep Mutational Evolution (DME) (as described in U.S. patent application serial number PCT/US20/36506, incorporated by reference, herein), deep mutational scanning (DMS), error prone PCR, cassette mutagenesis, random mutagenesis, staggered extension PCR, gene shuffling, or domain swapping, in order to generate one or more gNA variants with enhanced or varied properties relative to the reference gNA.
- DME Deep Mutational Evolution
- DMS deep mutational scanning
- error prone PCR cassette mutagenesis
- random mutagenesis random mutagenesis
- staggered extension PCR staggered extension PCR
- gene shuffling gene shuffling
- domain swapping in order to generate one or more gNA variants with enhanced or varied properties relative to the reference gNA.
- the activity of reference gNAs may be used as a benchmark against which the activity of g
- a reference gNA may be subjected to one or more deliberate, targeted mutations in order to produce a gNA variant, for example a rationally designed variant.
- the gNAs of the disclosure comprise two segments: a targeting sequence and a protein-binding segment.
- the targeting segment of a gNA includes a nucleotide sequence (referred to interchangeably as a guide sequence, a spacer, a targeter, or a targeting sequence) that is complementary to (and therefore hybridizes with) a specific sequence (a target site) within the target nucleic acid sequence (e.g., a target ssRNA, a target ssDNA, a strand of a double stranded target DNA, etc.), described more fully below.
- the targeting sequence of a gNA is capable of binding to a target nucleic acid sequence, including a coding sequence, a complement of a coding sequence, a non-coding sequence, and to regulatory elements.
- the protein-binding segment (or “activator” or “protein-binding sequence”) interacts with (e.g., binds to) a CasX protein as a complex, forming an RNP (described more fully, below).
- the protein-binding segment is alternatively referred to herein as a “scaffold”, which is comprised of several regions, described more fully, below.
- the targeter and the activator portions each have a duplex-forming segment, where the duplex forming segment of the targeter and the duplex-forming segment of the activator have complementarity with one another and hybridize to one another to form a double stranded duplex (dsRNA duplex for a gRNA).
- dsRNA duplex for a gRNA double stranded duplex
- a targeter or “targeter RNA” is used herein to refer to a crRNA-like molecule (crRNA: "CRISPR RNA”) of a CasX dual guide RNA (and therefore of a CasX single guide RNA when the “activator” and the “targeter” are linked together; e.g., by intervening nucleotides).
- the crRNA has a 5' region that anneals with the tracrRNA followed by the nucleotides of the targeting sequence.
- a guide RNA (dgRNA or sgRNA) comprises a guide sequence and a duplex-forming segment of a crRNA, which can also be referred to as a crRNA repeat.
- a corresponding tracrRNA-like molecule also comprises a duplex-forming stretch of nucleotides that forms the other half of the dsRNA duplex of the protein-binding segment of the guide RNA.
- a targeter and an activator hybridize to form a dual guide NA, referred to herein as a “dual guide NA”, a “dual-molecule gNA”, a “dgNA”, a “double-molecule guide NA”, or a “two-molecule guide NA”.
- Site-specific binding and/or cleavage of a target nucleic acid sequence (e.g., genomic DNA) by the CasX protein can occur at one or more locations (e.g., a sequence of a target nucleic acid) determined by base-pairing complementarity between the targeting sequence of the gNA and the target nucleic acid sequence.
- the gNA of the disclosure have sequences complementarity to and therefore can hybridize with the target nucleic acid that is adjacent to a sequence complementary to a TC PAM motif or a PAM sequence, such as ATC, CTC, GTC, or TTC.
- a targeter can be modified by a user to hybridize with a specific target nucleic acid sequence, so long as the location of the PAM sequence is considered.
- the sequence of a targeter may be a non-naturally occurring sequence.
- the sequence of a targeter may be a naturally-occurring sequence, derived from the gene to be edited.
- the activator and targeter of the gNA are covalently linked to one another (rather than hybridizing to one another) and comprise a single molecule, referred to herein as a “single-molecule gNA,” “one-molecule guide NA,” “single guide NA”, “single guide RNA”, a “single-molecule guide RNA,” a “one-molecule guide RNA”, a “single guide DNA”, a “single-molecule DNA”, or a “one-molecule guide DNA”, (“sgNA”, “sgRNA”, or a “sgDNA”).
- the sgNA includes an “activator” or a “targeter” and thus can be an “activator-RNA” and a “targeter-RNA,” respectively.
- the assembled gNAs of the disclosure comprise four distinct regions, or domains: the RNA triplex, the scaffold stem, the extended stem, and the targeting sequence that, in the embodiments of the disclosure is specific for a target nucleic acid and is located on the 3’ end of the gNA.
- the RNA triplex, the scaffold stem, and the extended stem, together, are referred to as the “scaffold” of the gNA. i. RNA Triplex
- the RNA triplex comprises the sequence of a UUU— nX( ⁇ 4-15)— UUU stem loop (SEQ ID NO: 189) that ends with an AAAG after 2 intervening stem loops (the scaffold stem loop and the extended stem loop), forming a pseudoknot that may also extend past the triplex into a duplex pseudoknot.
- the UU-UUU-AAA sequence of the triplex forms as a nexus between the spacer, scaffold stem, and extended stem.
- the UUU-loop-UUU region is coded for first, then the scaffold stem loop, and then the extended stem loop, which is linked by the tetraloop, and then an AAAG closes off the triplex before becoming the spacer.
- an AAAG closes off the triplex before becoming the spacer.
- the triplex region is followed by the scaffold stem loop.
- the scaffold stem loop is a region of the gNA that is bound by CasX protein (such as a reference or CasX variant protein).
- the scaffold stem loop is a fairly short and stable stem loop. In some cases, the scaffold stem loop does not tolerate many changes, and requires some form of an RNA bubble. In some embodiments, the scaffold stem is necessary for CasX sgNA function.
- the scaffold stem of a CasX sgNA has a necessary bulge (RNA bubble) that is different from many other stem loops found in CRISPR/Cas systems. In some embodiments, the presence of this bulge is conserved across sgNA that interact with different CasX proteins.
- An exemplary sequence encoding a scaffold stem loop sequence of a gNA comprises the sequence CCAGCGACTATGTCGTATGG (SEQ ID NO: 190).
- the disclosure provides gNA variants wherein the scaffold stem loop is replaced with an RNA stem loop sequence from a heterologous RNA source with proximal 5' and 3' ends, such as, but not limited to stem loop sequences designated as MS2, Q b, U1 hairpin II, Uvsx, or PP7 stem loops, which can be used, in some cases, to facilitate transport out of the host cell nucleus.
- the heterologous RNA stem loop of the gNA is capable of binding a protein, an RNA structure, a DNA sequence, or a small molecule, which can facilitate the binding of gNA to CasX. k. Extended Stem Loop
- the scaffold stem loop is followed by the extended stem loop.
- the extended stem comprises a synthetic tracr and crRNA fusion that is largely unbound by the CasX protein.
- the extended stem loop can be highly malleable.
- a single guide gRNA is made with a GAAA tetraloop linker or a GAGAAA linker between the tracr and crRNA in the extended stem loop.
- the targeter and activator of a CasX sgNA are linked to one another by intervening nucleotides and the linker can have a length of from 3 to 20 nucleotides.
- the extended stem is a large 32-bp loop that sits outside of the CasX protein in the ribonucleoprotein complex.
- An exemplary sequence encoding an extended stem loop sequence of a sgNA comprises GCGCTT ATTT ATCGGAGAGAAATCCGAT AAAT AAGAAGC (SEQ ID NO: 191).
- the extended stem loop comprises a GAGAAA spacer sequence.
- the disclosure provides gNA variants wherein the extended stem loop is replaced with an RNA stem loop sequence from a heterologous RNA source with proximal 5’ and 3’ ends, such as, but not limited to stem loop sequences designated MS2, QP, U1 hairpin II, Uvsx, or PP7 stem loops.
- the heterologous RNA stem loop increases the stability of the gNA.
- the disclosure provides gNA variants having an extended stem loop region comprising at least 10, at least 100, at least 500, at least 1000, or at least 10,000 nucleotides, or at least 10-10,000, at least 10-1000, or at least 10-100 nucleotides.
- the extended stem loop comprises a GAGAAA spacer sequence. 1. Targeting Sequence (a.k.a. Spacer)
- the extended stem loop is followed by a region that forms part of the triplex, and then the targeting sequence (or “spacer”) at the 3’ end of the gNA.
- the targeting sequence targets the CasX ribonucleoprotein holo complex to a specific region of the target nucleic acid sequence of the gene to be modified.
- gNA targeting sequences of the disclosure have sequences complementarity to, and therefore can hybridize to, a portion of the HTT gene in a nucleic acid in a eukaryotic cell (e.g., a eukaryotic chromosome, chromosomal sequence, a eukaryotic RNA, etc.) as a component of the RNP when the TC PAM motif or any one of the PAM sequences TTC, ATC, GTC, or CTC is located 1 nucleotide 5’ to the non-target strand sequence complementary to the target sequence.
- a eukaryotic cell e.g., a eukaryotic chromosome, chromosomal sequence, a eukaryotic RNA, etc.
- the targeting sequence of a gNA can be modified so that the gNA can target a desired sequence of any desired target nucleic acid sequence, so long as the PAM sequence location is taken into consideration.
- the gNA scaffold is 5’ of the targeting sequence, with the targeting sequence on the 3’ end of the gNA.
- the PAM motif sequence recognized by the nuclease of the RNP is TC. In other embodiments, the PAM sequence recognized by the nuclease of the RNP is NTC.
- the gNA of the XDP systems comprises a targeting sequence (a) complementary to a nucleic acid sequence encoding i) a target protein, which may be a wild-type sequence or may comprise one or more mutations or ii) the regulatory element of the protein, which may be a wild-type sequence; or (b) complementary to a complement of a nucleic acid sequence encoding a protein or its regulatory element, which may comprise one or more mutations.
- the targeting sequence of the gNA is specific for a portion of a gene encoding a target protein comprising one or more mutations.
- the targeting sequence of a gNA is specific for a target gene exon.
- the targeting sequence of a gNA is specific for a target gene intron. In some embodiments, the targeting sequence of the gNA is specific for a target gene intron-exon junction. In some embodiments, the targeting sequence of the gNA is complementary to a sequence comprising one or more single nucleotide polymorphisms (SNPs) of the target gene or its complement. In other embodiments, the targeting sequence of the gNA is complementary to a sequence of an intergenic region of the target gene or a sequence complementary to an intergenic region of the target gene. [00253] In some embodiments, the targeting sequence of a gNA is specific for a regulatory element that regulates expression of a target gene.
- SNPs single nucleotide polymorphisms
- Such regulatory elements include, but are not limited to promoter regions, enhancer regions, intergenic regions, 5’ untranslated regions (5’ UTR), 3’ untranslated regions (3’ UTR), intergenic regions, gene enhancer elements, conserved elements, and regions comprising cis-regulatory elements.
- the promoter region is intended to encompass nucleotides within 5 kb of the target gene initiation point or, in the case of gene enhancer elements or conserved elements, can be 1 Mb or more distal to the target gene.
- the disclosure provides a gNA with a targeting sequence that hybridizes with target gene regulatory element.
- the targets are those in which the encoding gene of the target is intended to be knocked out or knocked down such that the target protein comprising mutations is not expressed or is expressed at a lower level in a cell.
- the disclosure provides a CasX:gNA system wherein the targeting sequence (or spacer) of the gNA is complementary to a nucleic acid sequence encoding the target protein, a portion of the target protein, a portion of a regulatory element, or the complement of a portion of a gene or a regulatory element for the target gene.
- the targeting sequence has between 14 and 35 consecutive nucleotides.
- the targeting sequence has 14, 15, 16, 18, 18, 19, 20, 21, 22, 23 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34 or 35 consecutive nucleotides. In some embodiments, the targeting sequence consists of 20 consecutive nucleotides. In some embodiments, the targeting sequence consists of 19 consecutive nucleotides. In some embodiments, the targeting sequence consists of 18 consecutive nucleotides. In some embodiments, the targeting sequence consists of 17 consecutive nucleotides. In some embodiments, the targeting sequence consists of 16 nucleotides. In some embodiments, the targeting sequence consists of 15 nucleotides. In some embodiments, the targeting sequence has 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27,
- the targeting sequence can comprise 0 to 5, 0 to 4, 0 to 3, or 0 to 2 mismatches relative to the target nucleic acid sequence and retain sufficient binding specificity such that the RNP comprising the gNA comprising the targeting sequence can form a complementary bond with respect to the target nucleic acid.
- the CasX:gNA of the XDP system comprises a first gNA and further comprises a second (and optionally a third, fourth or fifth) gNA, wherein the second gNA has a targeting sequence complementary a different portion of the target nucleic acid or its complement compared to the targeting sequence of the first gNA.
- the targeting sequences of the gNA By selection of the targeting sequences of the gNA, defined regions of the target nucleic acid can be modified or edited using the CasX:gNA systems described herein. m. gNA scaffolds
- the remaining regions of the gNA are referred to herein as the scaffold.
- the gNA scaffolds are derived from naturally-occurring sequences, described below as reference gNA.
- the gNA scaffolds are variants of reference gNA wherein mutations, insertions, deletions or domain substitutions are introduced to confer desirable properties on the gNA variant.
- a reference gRNA comprises a sequence isolated or derived from Deltaproteobacteria.
- the sequence is a CasX tracrRNA sequence.
- Exemplary CasX reference tracrRNA sequences isolated or derived from Deltaproteobacteria may include:
- Exemplary crRNA sequences isolated or derived from Deltaproteobacter may comprise a sequence of CCGAUAAGUAAAACGCAUCAAAG (SEQ ID NO: 194).
- a CasX reference gNA comprises a sequence at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 81% identical, at least 82% identical, at least 83% identical, at least 84% identical, at least 85% identical, at least 86% identical, at least 86% identical, at least 87% identical, at least 88% identical, at least 89% identical, at least 89% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, at least 99% identical, at least 99.5% identical or 100% identical to a sequence isolated or derived from Deltaproteobacter .
- a reference guide RNA comprises a sequence isolated or derived from Planctomycetes.
- the sequence is a CasX tracrRNA sequence.
- Exemplary reference tracrRNA sequences isolated or derived from Planctomycetes may include: UACUGGCGCUUUUAUCUCAUUACUUUGAGAGCCAUCACCAGCGACUAUGUCGUA UGGGUAAAGCGCUUAUUUAUCGGAGA (SEQ ID NO: 8) and
- exemplary crRNA sequences isolated or derived from Planctomycetes may comprise a sequence of UCUCCGAUAAAUAAGAAGCAUCAAAG (SEQ ID NO: 197).
- a CasX reference gNA comprises a sequence at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 81% identical, at least 82% identical, at least 83% identical, at least 84% identical, at least 85% identical, at least 86% identical, at least 86% identical, at least 87% identical, at least 88% identical, at least 89% identical, at least 89% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, at least 99% identical, at least 99.5% identical or 100% identical to a sequence isolated or derived from Planctomycetes.
- a reference gNA comprises a sequence isolated or derived from Candidatus Sungbacteria.
- the sequence is a CasX tracrRNA sequence.
- Exemplary CasX reference tracrRNA sequences isolated or derived from Candidatus Sungbacteria may comprise sequences of: GUUUACACACUCCCUCUCAUAGGGU (SEQ ID NO: 10), GUUUACACACUCCCUCUCAUGAGGU (SEQ ID M): 11), UUUUACAUACCCCCUCUCAUGGGAU (SEQ ID NO: 12) and
- a CasX reference guide RNA comprises a sequence at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 81% identical, at least 82% identical, at least 83% identical, at least 84% identical, at least 85% identical, at least 86% identical, at least 86% identical, at least 87% identical, at least 88% identical, at least 89% identical, at least 89% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, at least 99% identical, at least 99.5% identical or 100% identical to a sequence isolated or derived from Candidatus Sungbacteria. [00258] Table 2 provides the sequences of reference gRNAs tracr, cr and scaffold sequences.
- the disclosure provides gNA sequences wherein the gNA has a scaffold comprising a sequence having at least one nucleotide modification relative to a reference gNA sequence having a sequence of any one of SEQ ID NOS: 4-16 of Table 2.
- a vector comprises a DNA encoding sequence for a gNA, or where a gNA is a gDNA or a chimera of RNA and DNA, that thymine (T) bases can be substituted for the uracil (U) bases of any of the gNA sequence embodiments described herein, including the sequences of Table 2 and Table 3.
- the disclosure relates to guide nucleic acid variants (referred to herein alternatively as “gNA variant” or “gRNA variant” when the nucleic acid variant comprises RNA), which comprise one or more modifications relative to a reference gRNA scaffold.
- gNA variant guide nucleic acid variants
- gRNA variant when the nucleic acid variant comprises RNA
- scaffold refers to all parts to the gNA necessary for gNA function with the exception of the spacer sequence.
- a gNA variant comprises one or more nucleotide substitutions, insertions, deletions, or swapped or replaced regions relative to a reference gRNA sequence of the disclosure.
- a mutation can occur in any region of a reference gRNA to produce a gNA variant.
- the scaffold of the gNA variant sequence has at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, or at least 70%, at least 80%, at least 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity to the sequence of SEQ ID NO: 4 or SEQ ID NO: 5.
- a gNA variant comprises one or more nucleotide changes within one or more regions of the reference gRNA that improve a characteristic relative to the reference gRNA.
- Exemplary regions include the RNA triplex, the pseudoknot, the scaffold stem loop, and the extended stem loop.
- the variant scaffold stem further comprises a bubble.
- the variant scaffold further comprises a triplex loop region.
- the variant scaffold further comprises a 5' unstructured region.
- the gNA variant scaffold comprises a scaffold stem loop having at least 60% sequence identity to SEQ ID NO:
- the gNA variant comprises a scaffold stem loop having the sequence of CCAGCGACUAUGUCGUAGUGG (SEQ ID NO: 202).
- the disclosure provides a gNA scaffold comprising, relative to SEQ ID NO:5, a C18G substitution, a G55 insertion, a U1 deletion, and a modified extended stem loop in which the original 6 nt loop and 13 most-loop-proximal base pairs (32 nucleotides total) are replaced by a Uvsx hairpin (4 nt loop and 5 loop-proximal base pairs; 14 nucleotides total) and the loop-distal base of the extended stem was converted to a fully base-paired stem contiguous with the new Uvsx hairpin by deletion of the A99 and substitution of G64U.
- the gNA scaffold comprises the sequence
- gNA variants that have one or more improved functions or characteristics, or add one or more new functions when the variant gNA is compared to a reference gRNA described herein, are envisaged as within the scope of the disclosure.
- a representative example of such a gNA variant is guide 174 (SEQ ID NO: 734).
- the gNA variant adds a new function to the RNP comprising the gNA variant.
- the gNA variant has an improved characteristic selected from: improved stability; improved solubility; improved transcription of the gNA; improved resistance to nuclease activity; increased folding rate of the gNA; decreased side product formation during folding; increased productive folding; improved binding affinity to a CasX protein; improved binding affinity to a target DNA when complexed with a CasX protein; improved gene editing when complexed with a CasX protein; improved specificity of editing when complexed with a CasX protein; and improved ability to utilize a greater spectrum of one or more PAM sequences, including ATC, CTC, GTC, or TTC, in the editing of target DNA when complexed with a CasX protein, or any combination thereof.
- the one or more of the improved characteristics of the gNA variant is at least about 1.1 to about 100,000-fold improved relative to the reference gNA of SEQ ID NO: 4 or SEQ ID NO: 5. In other cases, the one or more improved characteristics of the gNA variant is at least about 1.1, at least about 10, at least about 100, at least about 1000, at least about 10,000, at least about 100,000-fold or more improved relative to the reference gNA of SEQ ID NO: 4 or SEQ ID NO: 5.
- the one or more of the improved characteristics of the gNA variant is about 1.1 to 100,00-fold, about 1.1 to 10,00-fold, about 1.1 to 1,000-fold, about 1.1 to 500-fold, about 1.1 to 100-fold, about 1.1 to 50-fold, about 1.1 to 20-fold, about 10 to 100,00-fold, about 10 to 10,00-fold, about 10 to 1,000-fold, about 10 to 500-fold, about 10 to 100-fold, about 10 to 50-fold, about 10 to 20-fold, about 2 to 70-fold, about 2 to 50-fold, about 2 to 30-fold, about 2 to 20-fold, about 2 to 10-fold, about 5 to 50-fold, about 5 to 30-fold, about 5 to 10-fold, about 100 to 100,00-fold, about 100 to 10,00-fold, about 100 to 1,000-fold, about 100 to 500-fold, about 500 to 100,00-fold, about 500 to 10,00-fold, about 500 to 1,000-fold, about 500 to 750-fold, about 1,000 to 100,00-fold, about 10,000 to 100,00-fold, about
- the one or more improved characteristics of the gNA variant is about 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7- fold, 8-fold, 9-fold, 10-fold, 11-fold, 12-fold, 13-fold, 14-fold, 15-fold, 16-fold, 17-fold, 18-fold, 19-fold, 20-fold, 25-fold, 30-fold, 40-fold, 45-fold, 50-fold, 55-fold, 60-fold, 70-fold, 80-fold, 90-fold, 100-fold, 110-fold, 120-fold, 130-fold, 140-fold, 150-fold, 160-fold, 170-fold, 180-fold, 190-fold, 200-fold, 210-fold, 220-fold, 230-fold, 240-fold, 250-fold, 260-fold, 270-fold, 280- fold, 290
- a gNA variant can be created by subjecting a reference gRNA to a one or more mutagenesis methods, such as the mutagenesis methods described herein, below, which may include Deep Mutational Evolution (DME), deep mutational scanning (DMS), error prone PCR, cassette mutagenesis, random mutagenesis, staggered extension PCR, gene shuffling, or domain swapping, in order to generate the gNA variants of the disclosure.
- DME Deep Mutational Evolution
- DMS deep mutational scanning
- error prone PCR cassette mutagenesis
- random mutagenesis random mutagenesis
- staggered extension PCR staggered extension PCR
- gene shuffling gene shuffling
- domain swapping domain swapping
- a reference gRNA may be subjected to one or more deliberate, targeted mutations, substitutions, or domain swaps in order to produce a gNA variant, for example a rationally designed variant.
- exemplary gRNA variants produced by such methods are described in the Examples and representative sequences of gNA scaffolds are presented in Table 3.
- the gNA variant comprises one or more modifications compared to a reference guide nucleic acid scaffold sequence, wherein the one or more modification is selected from: at least one nucleotide substitution in a region of the gNA variant; at least one nucleotide deletion in a region of the gNA variant; at least one nucleotide insertion in a region of the gNA variant; a substitution of all or a portion of a region of the gNA variant; a deletion of all or a portion of a region of the gNA variant; or any combination of the foregoing.
- the modification is a substitution of 1 to 15 consecutive or non-consecutive nucleotides in the gNA variant in one or more regions. In other cases, the modification is a deletion of 1 to 10 consecutive or non-consecutive nucleotides in the gNA variant in one or more regions. In other cases, the modification is an insertion of 1 to 10 consecutive or non-consecutive nucleotides in the gNA variant in one or more regions. In other cases, the modification is a substitution of the scaffold stem loop or the extended stem loop with an RNA stem loop sequence from a heterologous RNA source with proximal 5' and 3' ends. In some cases, a gNA variant of the disclosure comprises two or more modifications in one region. In other cases, a gNA variant of the disclosure comprises modifications in two or more regions. In other cases, a gNA variant comprises any combination of the foregoing modifications described in this paragraph.
- a 5' G is added to a gNA variant sequence for expression in vivo, as transcription from a U6 promoter is more efficient and more consistent with regard to the start site when the +1 nucleotide is a G.
- two 5' Gs are added to a gNA variant sequence for in vitro transcription to increase production efficiency, as T7 polymerase strongly prefers a G in the +1 position and a purine in the +2 position.
- the 5’ G bases are added to the reference scaffolds of Table 2.
- the 5’ G bases are added to the variant scaffolds of Table 3.
- Table 3 provides exemplary gNA variant scaffold sequences of the disclosure.
- (-) indicates a deletion at the specified position(s) relative to the reference sequence of SEQ ID NO: 5
- (+) indicates an insertion of the specified base(s) at the position indicated relative to SEQ ID NO: 5
- (:) indicates the range of bases at the specified starristop coordinates of a deletion or substitution relative to SEQ ID NO: 5 and multiple insertions, deletions or substitutions are separated by commas; e.g., A14C, T17G.
- the gNA variant scaffold comprises any one of the sequences listed in Table 3, or SEQ ID NOS: 597-781, or a sequence having at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% sequence identity thereto.
- a vector comprises a DNA encoding sequence for a gNA, or where a gNA is a gDNA or a chimera of RNA and DNA, that thymine (T) bases can be substituted for the uracil (U) bases of any of the gNA sequence embodiments described herein.
- T thymine
- U uracil
- the gNA variant comprises a tracrRNA stem loop comprising the sequence -UUU-N4-25UUU- (SEQ ID NO: 203).
- the gNA variant comprises a scaffold stem loop or a replacement thereof, flanked by two triplet U motifs that contribute to the triplex region.
- the scaffold stem loop or replacement there of comprises at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 21 nucleotides, at least 22 nucleotides, at least 23 nucleotides, at least 24 nucleotides, or at least 25 nucleotides.
- the gNA variant comprises a crRNA sequence with -AAAG- in a location 5’ to the spacer region. In some embodiments, the -AAAG- sequence is immediately 5’ to the spacer region.
- the at least one nucleotide modification comprises at least one nucleotide deletion in the CasX variant gNA relative to the reference gRNA.
- a gNA variant comprises a deletion of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
- the at least one deletion comprises a deletion of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more consecutive nucleotides relative to a reference gRNA.
- the gNA variant comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more nucleotide deletions relative to the reference gRNA, and the deletions are not in consecutive nucleotides.
- a gNA variant may comprise a first deletion of one nucleotide, and a second deletion of two nucleotides and the two deletions are not consecutive.
- a gNA variant comprises at least two deletions in different regions of the reference gRNA.
- a gNA variant comprises at least two deletions in the same region of the reference gRNA.
- the regions may be the extended stem loop, scaffold stem loop, scaffold stem bubble, triplex loop, pseudoknot, triplex, or a 5’ end of the gNA variant. Any deletion of any nucleotide in a reference gRNA is contemplated as within the scope of the disclosure.
- the at least one nucleotide modification comprises at least one nucleotide insertion.
- a gNA variant comprises an insertion of 1, 2, 3, 4, 5,
- the at least one nucleotide insertion comprises an insertion of 1, 2, 3, 4, 5, 6,
- the gNA variant comprises 2 or more insertions relative to the reference gRNA, and the insertions are not consecutive.
- any length of insertions, and any combination of lengths of insertions, as described herein, are contemplated as within the scope of the disclosure.
- a gNA variant may comprise a first insertion of one nucleotide, and a second insertion of two nucleotides and the two insertions are not consecutive.
- a gNA variant comprises at least two insertions in different regions of the reference gRNA. In some embodiments, a gNA variant comprises at least two insertions in the same region of the reference gRNA.
- the regions may be the extended stem loop, scaffold stem loop, scaffold stem bubble, triplex loop, pseudoknot, triplex, or a 5’ end of the gNA variant. Any insertion of A, G, C, U (or T, in the corresponding DNA) or combinations thereof at any location in the reference gRNA is contemplated as within the scope of the disclosure.
- the at least one nucleotide modification comprises at least one nucleic acid substitution.
- a gNA variant comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more consecutive or non-consecutive substituted nucleotides relative to a reference gRNA.
- a gNA variant comprises 1-4 nucleotide substitutions relative to a reference gRNA.
- the at least one substitution comprises a substitution of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more consecutive nucleotides relative to a reference gRNA.
- the gNA variant comprises 2 or more substitutions relative to the reference gRNA, and the substitutions are not consecutive.
- any length of substituted nucleotides, and any combination of lengths of substituted nucleotides, as described herein, are contemplated as within the scope of the disclosure.
- a gNA variant may comprise a first substitution of one nucleotide, and a second substitution of two nucleotides and the two substitutions are not consecutive.
- a gNA variant comprises at least two substitutions in different regions of the reference gRNA.
- a gNA variant comprises at least two substitutions in the same region of the reference gRNA.
- the regions may be the triplex, the extended stem loop, scaffold stem loop, scaffold stem bubble, triplex loop, pseudoknot, triplex, or a 5’ end of the gNA variant. Any substitution of A, G, C, U (or T, in the corresponding DNA) or combinations thereof at any location in the reference gRNA is contemplated as within the scope of the disclosure.
- a gNA variant can comprise at least one substitution and at least one deletion relative to a reference gRNA, at least one substitution and at least one insertion relative to a reference gRNA, at least one insertion and at least one deletion relative to a reference gRNA, or at least one substitution, one insertion and one deletion relative to a reference gRNA.
- the gNA variant comprises a scaffold region at least 20% identical, at least 30% identical, at least 40% identical, at least 50% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, or at least 99% identical to any one of SEQ ID NOS: 4-16.
- the gNA variant comprises a scaffold region at least 60% homologous (or identical) to any one of SEQ ID NOS: 4-16.
- the gNA variant comprises a tracr stem loop at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, or at least 99% identical to SEQ ID NO:
- the gNA variant comprises a tracr stem loop at least 60% homologous (or identical) to SEQ ID NO: 14.
- the gNA variant comprises an extended stem loop at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, at least 90% identical, at least 91% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical, or at least 99% identical to SEQ ID NO:
- the gNA variant comprises an extended stem loop at least 60% homologous (or identical) to SEQ ID NO: 15.
- the gNA variant comprises an exogenous extended stem loop, with such differences from a reference gNA described as follows.
- an exogenous extended stem loop has little or no identity to the reference stem loop regions disclosed herein (e.g., SEQ ID NO: 15).
- an exogenous stem loop is at least 10 bp, at least 20 bp, at least 30 bp, at least 40 bp, at least 50 bp, at least 60 bp, at least 70 bp, at least 80 bp, at least 90 bp, at least 100 bp, at least 200 bp, at least 300 bp, at least 400 bp, at least 500 bp, at least 600 bp, at least 700 bp, at least 800 bp, at least 900 bp, at least 1,000 bp, at least 2,000 bp, at least 3,000 bp, at least 4,000 bp, at least 5,000 bp, at least 6,000 bp, at least 7,000 bp, at least 8,000 bp, at least 9,000 bp, at least 10,000 bp, at least 12,000 bp, at least 15,000 bp or at least 20,000 bp.
- the gNA variant comprises an extended stem loop region comprising at least 10, at least 100, at least 500, at least 1000, or at least 10,000 nucleotides.
- the heterologous stem loop increases the stability of the gNA.
- the heterologous RNA stem loop is capable of binding a protein, an RNA structure, a DNA sequence, or a small molecule.
- an exogenous stem loop region comprises an RNA stem loop or hairpin, for example a thermostable RNA such as MS2 (ACAUGAGGAUUACCCAUGU (SEQ ID NO: 204)), Qp (UGCAUGUCUAAGACAGCA (SEQ ID NO: 205)), U1 hairpin II
- AAUCCAUUGCACUCCGGAUU (SEQ ID NO: 206)), Uvsx (CCUCUUCGGAGG (SEQ ID NO: 207)), PP7 ( AGG AGUUU CU AU GG A A AC C CU (SEQ ID NO: 208)), Phage replication loop (AGGUGGGACGACCUCUCGGUCGUCCUAUCU (SEQ ID NO: 209)), Kissing loop a (UGCUCGCUCCGUUCGAGCA (SEQ ID NO: 210)), Kissing loop bl (UGCUCGACGCGUCCUCGAGCA (SEQ ID NO: 211)), Kissing loop_b2 (UGCUCGUUUGCGGCUACGAGCA (SEQ ID NO: 212)), G quadriplex M3q (AGGGAGGGAGGGAGAGG (SEQ ID NO: 213)), G quadriplex telomere basket (GGUUAGGGUUAGGGUUAGG (SEQ ID NO: 214)), Sarcin-ricin loop (CUGCUCAGUACGAGAG
- an exogenous stem loop comprises a long non-coding RNA (lncRNA).
- lncRNA refers to a non-coding RNA that is longer than approximately 200 bp in length.
- the 5’ and 3’ ends of the exogenous stem loop are base paired; i.e., interact to form a region of duplex RNA.
- the 5’ and 3’ ends of the exogenous stem loop are base paired, and one or more regions between the 5’ and 3’ ends of the exogenous stem loop are not base paired.
- the at least one nucleotide modification comprises: (a) substitution of 1 to 15 consecutive or non-consecutive nucleotides in the gNA variant in one or more regions; (b) a deletion of 1 to 10 consecutive or non-consecutive nucleotides in the gNA variant in one or more regions; (c) an insertion of 1 to 10 consecutive or non-consecutive nucleotides in the gNA variant in one or more regions; (d) a substitution of the scaffold stem loop or the extended stem loop with an RNA stem loop sequence from a heterologous RNA source with proximal 5' and 3' ends; or any combination of (a)-(d).
- the gNA variant comprises a scaffold stem loop sequence of CCAGCGACUAUGUCGUAGUGG (SEQ ID NO: 202). In some embodiments, the gNA variant comprises a scaffold stem loop sequence of CCAGCGACUAUGUCGUAGUGG (SEQ ID NO: 202) and at least 1, 2, 3, 4, or 5 mismatches thereto.
- the gNA variant comprises an extended stem loop region comprising less than 32 nucleotides, less than 31 nucleotides, less than 30 nucleotides, less than 29 nucleotides, less than 28 nucleotides, less than 27 nucleotides, less than 26 nucleotides, less than 25 nucleotides, less than 24 nucleotides, less than 23 nucleotides, less than 22 nucleotides, less than 21 nucleotides, or less than 20 nucleotides.
- the gNA variant comprises an extended stem loop region comprising less than 32 nucleotides.
- the gNA variant further comprises a thermostable stem loop.
- the gNA comprises an RNA binding domain.
- the RNA binding domain can be a retroviral Psi packaging element inserted into the gNA or is a stem loop with affinity to a protein selected from the group consisting of MS2, PP7, Qbeta, U1A, or phage R- loop, which can facilitate the binding of gNA to CasX.
- Similar RNA components with affinity to protein structures incorporated into the CasX include kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- the incorporation of the Psi packaging element inserted into the guide RNA facilitates the packaging of the XDP particle due, in part, to the high affinity binding of Psi sequences for the Gag NC protein. Further, due to the affinity of the CasX for the gNA, resulting in an RNP, the incorporation of the RNP into the XDP is further facilitated.
- an sgRNA variant comprises a sequence of SEQ ID NOS: 597- 781 or a sequence having having at least about 80%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% identity thereto.
- an sgRNA variant comprises a sequence of SEQ ID NOS: 597-781.
- an sgRNA variant comprises a sequence of SEQ ID NOS: 597-781 and a targeting sequence.
- a sgRNA variant comprises a sequence of SEQ ID NO: 600, SEQ ID NO: 602, SEQ ID NO: 659, SEQ ID NO: 603, SEQ ID NO: 660, SEQ ID NO: 661,
- SEQ ID NO: 662 SEQ ID NO: 599, SEQ ID NO: 663, SEQ ID NO: 601, SEQ ID NO: 604,
- SEQ ID NO: 608 SEQ ID NO: 656, SEQ ID NO: 666, SEQ ID NO: 610, SEQ ID NO: 667,
- SEQ ID NO: 608 SEQ ID NO: 669, SEQ ID NO: 598, SEQ ID NO: 670, SEQ ID NO: 671,
- SEQ ID NO: 605 SEQ ID NO: 672, SEQ ID NO: 734, SEQ ID NO: 735, SEQ ID NO: 736,
- the gNA variant comprises one or more additional changes to a sequence of any one of SEQ ID NOS: 732, 733, 734, 737, 740, 744, 745, or 755-781, or having at least about 80%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% identity thereto.
- the gNA variant comprises one or more additional changes to a sequence of any one of SEQ ID NOs: 597-781.
- the gNA variant comprises the sequence of any one of SEQ ID NOS:732, 733, 734, 737, 740, 744, 745, or 755-781.
- the gNA variant scaffold consists of the sequence of any one of SEQ ID NOS:732, 733, 734, 737, 740, 744, 745, or 755-781, and further comprises a targeting sequence of any of the embodiments described herein.
- a sgRNA variant comprises one or more additional changes to a sequence of SEQ ID NO: 600, SEQ ID NO: 659, SEQ ID NO: 603, SEQ ID NO: 660, SEQ ID NO: 661, SEQ ID NO: 662, SEQ ID NO: 599, SEQ ID NO: 663, SEQ ID NO: 601, SEQ ID NO: 604, SEQ ID NO: 608, SEQ ID NO: 656, SEQ ID NO: 666, SEQ ID NO: 610, SEQ ID NO: 667, SEQ ID NO: 608, SEQ ID NO: 669, SEQ ID NO: 598, SEQ ID NO: 670, SEQ ID NO: 671,
- SEQ ID NO: 605 SEQ ID NO: 672, SEQ ID NO: 734, SEQ ID NO: 735, SEQ ID NO: 736,
- the gNA variant comprises at least one modification, wherein the at least one modification compared to the reference guide scaffold of SEQ ID NO: 5 is selected from one or more of: (a) a C18G substitution in the triplex loop; (b) a G55 insertion in the stem bubble; (c) aUl deletion; (d) a modification of the extended stem loop wherein (i) a 6 nt loop and 13 loop-proximal base pairs are replaced by a Uvsx hairpin; and (ii) a deletion of A99 and a substitution of G65U that results in a loop-distal base that is fully base-paired.
- the gNA variant comprises the sequence of any one of SEQ ID NOS: 732, 733, 734, 737, 740
- the gNA variants utilized in the XDP systems further comprises a spacer (or targeting sequence) region located at the 3’ end of the gNA, described more fully, supra, wherein the spacer is designed with a sequence that is complementary to a target nucleic acid to be edited.
- the gNA variant comprises a targeting sequence of at least 14 to 30 nucleotides, wherein the sequence is complementary to the target nucleic acid to be edited.
- the targeting sequence has 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34 or 35 nucleotides.
- the gNA variant comprises a targeting sequence having 20 nucleotides.
- the targeting sequence has 25 nucleotides.
- the targeting sequence has 24 nucleotides.
- the targeting sequence has 23 nucleotides. In some embodiments, the targeting sequence has 22 nucleotides. In some embodiments, the targeting sequence has 21 nucleotides. In some embodiments, the targeting sequence has 20 nucleotides. In some embodiments, the targeting sequence has 19 nucleotides. In some embodiments, the targeting sequence has 18 nucleotides. In some embodiments, the targeting sequence has 17 nucleotides.
- the targeting sequence has 16 nucleotides. In some embodiments, the targeting sequence has 15 nucleotides. In some embodiments, the targeting sequence has 14 nucleotides. In some embodiments, the target nucleic acid comprises a PAM sequence located 5’ of the targeting sequence with at least a single nucleotide separating the PAM from the first nucleotide of the targeting sequence. In some embodiments, the PAM is located on the non- targeted strand of the target region, i.e. the strand that is complementary to the target nucleic acid. In some embodiments, the PAM sequence is a TC motif. In some embodiments, the PAM sequence is a ATC. In other embodiments, the PAM sequence is a TTC. In other embodiments, the PAM sequence is a GTC. In other embodiments, the PAM sequence is a CTC.
- the scaffold of the gNA variant is a variant comprising one or more additional changes to a sequence of a reference gRNA that comprises SEQ ID NO: 4 or SEQ ID NO: 5.
- the scaffold of the reference gRNA is derived from SEQ ID NO: 4 or SEQ ID NO: 5
- the one or more improved or added characteristics of the gNA variant are improved compared to the same characteristic in SEQ ID NO: 4 or SEQ ID NO: 5.
- the scaffold of the gNA variant is part of an RNP with a CasX variant protein comprising any one of the sequences of SEQ ID NOS: 21- 233, 343-345, 350-353, 355-367 or 388-397, or a sequence having at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the gNA further comprises a targeting sequence. o. Chemically Modified gNA
- the disclosure relates to chemically-modified gNA.
- the present disclosure provides a chemically-modified gNA that has guide RNA functionality and has reduced susceptibility to cleavage by a nuclease.
- a gNA that comprises any nucleotide other than the four canonical ribonucleotides A, C, G, and U, or a deoxynucleotide is a chemically modified gNA.
- a chemically-modified gNA comprises any backbone or internucleotide linkage other than a natural phosphodiester internucleotide linkage.
- the retained functionality includes the ability of the modified gNA to bind to a CasX of any of the embodiments described herein. In certain embodiments, the retained functionality includes the ability of the modified gNA to bind to a target nucleic acid sequence. In certain embodiments, the retained functionality includes targeting a CasX protein or the ability of a pre-complexed CasX protein-gNA to bind to a target nucleic acid sequence. In certain embodiments, the retained functionality includes the ability to nick a target polynucleotide by a CasX-gNA. In certain embodiments, the retained functionality includes the ability to cleave a target nucleic acid sequence by a CasX-gNA. In certain embodiments, the retained functionality is any other known function of a gNA in a CasX system with a CasX protein of the embodiments of the disclosure.
- the disclosure provides a chemically-modified gNA in which a nucleotide sugar modification is incorporated into the gNA selected from the group consisting of 2'-0 — Cl-4alkyl such as 2 '-O-methyl (2'-OMe), 2'-deoxy (2'-H), 2'-0 — Cl -3 alkyl-0 — Cl- 3alkyl such as 2 '-m ethoxy ethyl (“2'-MOE”), 2'-fluoro (“2'-F”), 2'-amino (“2'-NH2”), 2'- arabinosyl (“2'-arabino”) nucleotide, 2'-F-arabinosyl (“2'-F-arabino”) nucleotide, 2'-locked nucleic acid (“LNA”) nucleotide, 2'-unlocked nucleic acid (“ULNA”) nucleotide, a sugar in L form (“L-su)-2'-O-methyl (2'
- an internucleotide linkage modification incorporated into the guide RNA is selected from the group consisting of: phosphorothioate “P(S)” (P(S)), phosphonocarboxylate (P(CH2)nCOOR) such as phosphonoacetate “PACE” (P(CH2COO-)), thiophosphonocarboxylate ((S)P(CH2)nCOOR) such as thiophosphonoacetate “thioPACE” ((S)P(CH2)nCOO-)), alkylphosphonate (P(C1- 3alkyl) such as methylphosphonate — P(CH3), boranophosphonate (P(BH3)), and phosphorodithioate (P(S)2).
- P(S) phosphorothioate
- P(CH2)nCOOR such as phosphonoacetate “PACE” (P(CH2COO-)
- the disclosure provides a chemically-modified gNA in which a nucleobase (“base”) modification is incorporated into the gNA selected from the group consisting of: 2-thiouracil (“2-thioU”), 2-thiocytosine (“2-thioC”), 4-thiouracil (“4-thioU”), 6- thioguanine (“6-thioG”), 2-aminoadenine (“2-aminoA”), 2-aminopurine, pseudouracil, hypoxanthine, 7-deazaguanine, 7-deaza-8-azaguanine, 7-deazaadenine, 7-deaza-8-azaadenine, 5- methylcytosine (“5-methylC”), 5-methyluracil (“5-methylU”), 5-hydroxymethylcytosine, 5- hydroxymethyluracil, 5,6-dehydrouracil, 5-propynylcytosine, 5-propynyluracil, 5- ethynylcytosine,
- the disclosure provides a chemically-modified gNA in which one or more isotopic modifications are introduced on the nucleotide sugar, the nucleobase, the phosphodiester linkage and/or the nucleotide phosphates, including nucleotides comprising one or more 15N, 13C, 14C, deuterium, 3H, 32P, 1251, 1311 atoms or other atoms or elements used as tracers.
- an “end” modification incorporated into the gNA is selected from the group consisting of: PEG (polyethyleneglycol), hydrocarbon linkers (including: heteroatom (0,S,N)-substituted hydrocarbon spacers; halo- substituted hydrocarbon spacers; keto-, carboxyl-, amido-, thionyl-, carbamoyl-, thionocarbamaoyl-containing hydrocarbon spacers), spermine linkers, dyes including fluorescent dyes (for example fluoresceins, rhodamines, cyanines) attached to linkers such as for example 6-fluorescein-hexyl, quenchers (for example dabcyl, BHQ) and other labels (for example biotin, digoxigenin, acridine, streptavidin, avidin, peptides and/or proteins).
- PEG polyethyleneglycol
- hydrocarbon linkers including: heteroatom (0,S,N)-substituted hydrocarbon spacer
- an “end” modification comprises a conjugation (or ligation) of the gNA to another molecule comprising an oligonucleotide of deoxynucleotides and/or ribonucleotides, a peptide, a protein, a sugar, an oligosaccharide, a steroid, a lipid, a folic acid, a vitamin and/or other molecule.
- the disclosure provides a chemically-modified gNA in which an “end” modification (described above) is located internally in the gNA sequence via a linker such as, for example, a 2-(4-butylamidofluorescein)propane-l,3-diol bis(phosphodiester) linker, which is incorporated as a phosphodiester linkage and can be incorporated anywhere between two nucleotides in the gNA.
- a linker such as, for example, a 2-(4-butylamidofluorescein)propane-l,3-diol bis(phosphodiester) linker, which is incorporated as a phosphodiester linkage and can be incorporated anywhere between two nucleotides in the gNA.
- the disclosure provides a chemically-modified gNA having an end modification comprising a terminal functional group such as an amine, a thiol (or sulfhydryl), a hydroxyl, a carboxyl, carbonyl, thionyl, thiocarbonyl, a carbamoyl, a thiocarbamoyl, a phoshoryl, an alkene, an alkyne, an halogen or a functional group-terminated linker that can be subsequently conjugated to a desired moiety selected from the group consisting of a fluorescent dye, a non-fluore scent label, a tag (for 14C, example biotin, avidin, streptavidin, or moiety containing an isotopic label such as 15 N, 13 C, deuterium, 3 H, 32 P, 125 I and the like), an oligonucleotide (comprising deoxynucleotides and/or rib
- the conjugation employs standard chemistry well-known in the art, including but not limited to coupling via N-hydroxysuccinimide, isothiocyanate, DCC (or DCI), and/or any other standard method as described in “Bioconjugate Techniques” by Greg T. Hermanson, Publisher Elsevier Science, 3 rd ed. (2013), the contents of which are incorporated herein by reference in its entirety.
- DCC or DCI
- the disclosure relates to the incorporation of tropism factors in the XDP to increase tropism and selectivity for target cells or tissues intended for gene editing.
- Tropism factors of the XDP embodiments include, but are not limited to, envelope glycoproteins derived from viruses, antibody fragments, and receptors or ligands that have binding affinity to target cell markers.
- the inclusion of such tropism factors on the surface of XDP particles enhances the ability of the XDP to selectively bind to and fuse with the cell membrane of a target cell bearing such target cell markers, increasing the therapeutic index and reducing unintended side effects of the therapeutic payload incorporated into the XDP.
- the XDP comprises one or more glycoproteins (GP) on the surface of the particle wherein the GP provides for enhanced or selective binding and fusion of the XDP to a target cell.
- the XDP comprises one or more antibody fragments on the surface of the particle wherein the antibody fragments provides for enhanced or selective binding and fusion of the XDP to a target cell.
- the XDP comprises one or more cell surface receptors, including G-protein-linked receptors, and enzyme- linked receptors, on the surface of the particle wherein the receptor provides for enhanced or selective binding and fusion of the XDP to a target cell.
- the XDP comprises one or more ligands on the surface of the particle wherein the ligand provides for enhanced or selective binding and fusion of the XDP to a target cell bearing a receptor to the ligand on the cell surface.
- the XDP comprises a combinations of one or more glycoproteins, antibody fragments, cell receptors, or ligands on the surface of the particle to provide for enhanced or selective binding and fusion of the XDP to a target cell.
- membrane fusion for viral entry is mediated by membrane glycoprotein complexes. Two basic mechanistic principles of membrane fusion have emerged as conserved among enveloped viruses; target membrane engagement and refolding into hairpin like structures (Plemper, RK.
- the envelope glycoproteins are typically observed as characteristic protein “spikes” on the surface of purified virions in electron microscopic images.
- the underlying mechanism of viral entry by enveloped viruses can be utilized to preferentially direct XDP to target particular cells or organs in a process known as pseudotyping.
- the XDP of the disclosure are pseudotyped by incorporation of a glycoprotein derived from an enveloped virus that has a demonstrated tropism for a particular organ or cell. Representative glycoproteins within the scope of the instant disclosure are listed in Table 4 and in the Examples.
- the viruses used to provide the glycoprotein include, but are not limited to Argentine hemorrhagic fever virus, Australian bat virus, Autographa californica multiple nucleopolyhedrovirus, Avian leukosis virus, baboon endogenous virus, Venezuelan hemorrhagic fever virus, Borna disease virus, Breda virus, Bunyamwera virus, Chandipura virus, Chikungunya virus, Crimean-Congo hemorrhagic fever virus, Dengue fever virus, Duvenhage virus, Eastern equine encephalitis virus, Ebola hemorrhagic fever virus, Ebola Zaire virus, enteric adenovirus, Ephemerovirus, Epstein-Bar virus (EBV), European bat virus 1, European bat virus 2, Fug Synthetic gP Fusion, Gibbon ape leukemia virus, Hantavirus, Hendra virus, hepatitis A virus, hepatitis B virus, hepatitis C virus, hepatit
- the XDP comprises one or more glycoprotein sequences of Table 4, or a sequence having at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% sequence identity thereto, wherein the glycoproteins are incorporated into the particle and exposed on the surface, providing tropism and enhanced selectivity for the XDP to the target cell to be edited.
- the glycoprotein has a sequence selected from the group consisting of SEQ ID NOS: 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462,
- the glycoprotein has a sequence selected from the group consisting of SEQ ID NOS: 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494, 496, 498,
- the glycoprotein is incorporated into the XDP system by inclusion of a nucleic acid encoding the glycoprotein in a plasmid vector of the XDP system, described below.
- a XDP comprising a glycoprotein derived from an enveloped virus in a capsid of a XDP of the embodiments exhibits at least a 2-fold, or at least a 3 -fold, or at least a 4-fold, or at least a 5-fold, or at least a 10-fold increase in binding of the XDP to a target cell compared to a XDP that does not have the glycoprotein.
- Representative examples demonstrating enhanced binding and uptake of XDP bearing glycoproteins to target cells leading to, in this case, enhance gene editing of target nucleic acid, are provided in the Examples, below.
- the present disclosure provides XDP comprising an antibody fragment linked to the exterior of the particle wherein the antibody fragment has specific binding affinity to a target cell marker or receptor on a target cell, tissue or organ, providing tropism for the XDP for the target cell.
- the antibody fragment is selected from the group consisting of an Fv, Fab, Fab', Fab'-SH, F(ab')2, diabody, single chain diabody, linear antibody, a single domain antibody, a single domain camelid antibody, and a single-chain variable fragment (scFv) antibody.
- Exemplary target cells include T cells, B cells, macrophages, liquid cancer cells (such as leukemia or myeloma cells), solid tumor cells, muscle cells, epithelial cells, endothelial cells, stem cells, dendritic cells, retinal cells, hepatic cells, cardiac cells, thyroid cells, neurons, glial cells, oligodendrocytes, Schwann cells, and pancreatic cells.
- Exemplary target organs include the brain, heart, liver, pancreas, lung, eye, stomach, small intestine, colon, and kidney.
- Exemplary tissues include skin, muscle, bone, epithelial, and connective tissue.
- the target cell marker or ligand can include cell receptors or surface proteins known to be expressed preferentially on a target cell for which nucleic acid editing is desired.
- a XDP comprising an antibody fragment in a capsid of a XDP of the embodiments exhibits at least a 2-fold, or at least a 3-fold, or at least a 4-fold, or at least a 5-fold, or at least a 10-fold increase in binding to a target cell bearing the target cell marker or receptor compared to a XDP that does not have the antibody fragment.
- the cancer cell markers or receptors can include, but not be limited to cluster of differentiation 19 (CD19), cluster of differentiation 3 (CD3), CD3d molecule (CD3D), CD3g molecule (CD3G), CD3e molecule (CD3E), CD247 molecule (CD247, or CD3Z), CD8a molecule (CD8), CD7 molecule (CD7), membrane metalloendopeptidase (CD 10), membrane spanning 4-domains A1 (CD20), CD22 molecule (CD22), TNF receptor superfamily member 8 (CD30), C-type lectin domain family 12 member A (CLL1), CD33 molecule (CD33), CD34 molecule (CD34), CD38 molecule (CD38), integrin subunit alpha 2b (CD41), CD44 molecule (Indian blood group) (CD44), CD47 molecule (CD47), integrin alpha 6 (CD49f), neural cell adhesion molecule 1 (CD56), CD70
- the cell markers or receptors can include, but not be limited to Adrenergic (e.g., alA, alb, ale, aid, a2a, a2b, a2c, a2d, b ⁇ , b2, b3), Dopaminergic (e.g., Dl, D2, D3, D4, D5), GABAergic (e.g., GABAA, GABABla, GABABlb, GABAB2, GAB AC), Glutaminergic (e.g., NMD A, AMP A, kainate, mGluRl, mGluR2, mGluR3, mGluR4, mGluR5, mGluR6, mGluR7), Histaminergic (e.g., HI, H2, H3), Cholinergic (e.g., Muscarinic (e.g., Ml, M2, M3, M4, M5; Nicotinic (e.g., muscle,
- Adrenergic e.g., alA
- the antibody fragment is conjugated to the XDP after its production and isolation from the producing host cell.
- the antibody fragment is produced as a part of the XDP capsid expressed by the producing host cell of the XDP system.
- the present disclosure provides a nucleic acid comprising a sequence encoding the antibody fragment operably linked to the nucleic acid encoding the XDP capsid or other XDP components.
- the present disclosure relates to nucleic acids encoding components of the XDP system and the incorporated therapeutic payloads, and the vectors that comprise the nucleic acids, as well as methods to make the nucleic acids and vectors.
- the present disclosure provides one or more nucleic acids encoding components including retroviral-derived XDP structural and processing components, therapeutic payloads, and tropism factors.
- the nucleic acids and vectors utilized for the key structural components and for processing and the assembly of XDP particles of the embodiments can be derived from a variety of viruses, such as retroviruses, including but not limited to Retroviridae family members Alpharetroviruses, Betaretroviruses, Gammaretroviruses, Deltaretroviruses, Epsilonretroviruses, Spumaretrovirinae, or lentiviruses such as human immunodeficiency- 1 (HIV-1), human immunodeficiency-2 (HIV-2), simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), bovine immunodeficiency virus (BIV), Jembrana Disease Virus (JDV), equine infectious anemia virus (EIAV), caprine arthritis
- retroviruses including
- the nucleic acids encoding the XDP retroviral components are derived from Alpharetrovirus , including but not limited to avian leukosis virus (ALV) and Rous sarcoma virus (RSV).
- ABV avian leukosis virus
- RSV Rous sarcoma virus
- the present disclosure provides nucleic acids encoding components selected from the group consisting of: a matrix polypeptide (MA); a p2A spacer peptide; ap2B spacer peptide; a plO spacer peptide; a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), p2A, p2B, plO, a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-transframe region-Pol protease polyprotein; a protease cleavage site(s); and a protease capable of cleaving the protease cleavage sites.
- a matrix polypeptide MA
- a p2A spacer peptide ap2B spacer peptide
- Gag components e.g., MA, CA, p2A, p2B, plO, andNC
- protease cleavage site and protease are derived from an Alpharetrovirus , including but not limited to Avian leukosis virus and Rous sarcoma virus.
- the encoding sequences for the Alpharetrovirus- derived components are selected from the group consisting of SEQ ID NOS: 192, 193, 195, 196, 198-201, 782, and 234 as set forth in Table 5, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the nucleic acids encode a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- encoding nucleotides for protease cleavage sites are located between each of the individual components. In other cases, the protease cleavage sites are omitted. In a particular embodiment, an encoding sequence for a single protease cleavage site is located between the sequence encoding the nuclease and the linked retroviral component, which may be a retroviral sequence or a non-viral sequence, such as one that can be cleaved by TEV, PreScission Protease, or any of the other proteases disclosed herein. Representative configurations and sequences are presented in the Examples. In a particular embodiment, the encoded therapeutic payload is a CasX and gNA embodiment described herein, while the encoded tropism factor is a viral glycoprotein embodiment described herein.
- the nucleic acids encoding the XDP viral components are derived from Betaretrovirus, including but not limited to mouse mammary tumor virus (MMTV), Mason-Pfizer monkey virus (MPMV), and enzootic nasal tumor virus (ENTV).
- MMTV mouse mammary tumor virus
- MPMV Mason-Pfizer monkey virus
- ENTV enzootic nasal tumor virus
- the present disclosure provides nucleic acids encoding the XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a pp21/24 spacer peptide; a p3-P8/pl2 spacer peptide; a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), pp21/24, p3-8/pl2, a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-transframe region-Pol protease polyprotein; a protease cleavage site(s); and a protease capable of cleaving the protease cleavage sites.
- a matrix polypeptide MA
- a pp21/24 spacer peptide a p3-P8/
- Gag components e.g., MA, CA, pp21/24 spacer, p3-p8/pl2 spacer, andNC
- the protease cleavage site and protease are derived from an Betaretrovirus , including but not limited to mouse mammary tumor virus, Mason-Pfizer monkey virus, and enzootic nasal tumor virus.
- the encoding sequences for the Betaretrovirus- derived components are selected from the group consisting of SEQ ID NOS: 235-257 as set forth in Table 5, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the nucleic acids encode a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- encoding nucleotides for protease cleavage sites are located between each of the individual components. In other cases, the protease cleavage sites are omitted. In a particular embodiment, an encoding sequence for a single protease cleavage site is located between the sequence encoding the nuclease and the linked retroviral component, which may be a retroviral sequence or a non-viral sequence, such as one that can be cleaved by TEV, PreScission Protease, or any of the other proteases disclosed herein. Representative configurations and sequences are presented in the Examples. In a particular embodiment, the encoded therapeutic payload is a CasX and gNA embodiment described herein, while the encoded tropism factor is a viral glycoprotein embodiment described herein.
- the nucleic acids encoding the XDP viral components are derived from Deltaretrovirus, including but not limited to bovine leukemia virus (BLV) and the human T-lymphotropic viruses (HTLV1).
- BLV bovine leukemia virus
- HTLV1 human T-lymphotropic viruses
- the present disclosure provides nucleic acids encoding the XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA) repeat a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-transframe region-Pol protease polyprotein; a protease cleavage site(s); and a protease capable of cleaving the protease cleavage
- Gag components e.g., MA, CA, and NC
- the protease cleavage site and protease are derived from an Deltaretrovirus , including but not limited to bovine leukemia virus and the human T- lymphotropic viruses.
- the encoding sequences for the Deltaretrovirus- derived components are selected from the group consisting of the sequences SEQ ID NOS: 258- 272 as set forth in Table 5, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the nucleic acids encode a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- encoding nucleotides for protease cleavage sites are located between each of the individual components. In other cases, the protease cleavage sites are omitted. In a particular embodiment, an encoding sequence for a single protease cleavage site is located between the sequence encoding the nuclease and the linked retroviral component, which may be a retroviral sequence or a non-viral sequence, such as one that can be cleaved by TEV, PreScission Protease, or any of the other proteases disclosed herein. Representative configurations and sequences are presented in the Examples. In a particular embodiment, the encoded therapeutic payload is a CasX and gNA embodiment described herein, while the encoded tropism factor is a viral glycoprotein embodiment described herein.
- the nucleic acids encoding the XDP viral components are derived from Epsilonretrovirus , including but not limited to Walleye dermal sarcoma virus (WDSV), and Walleye epidermal hyperplasia virus 1 and 2.
- Epsilonretrovirus including but not limited to Walleye dermal sarcoma virus (WDSV), and Walleye epidermal hyperplasia virus 1 and 2.
- the present disclosure provides nucleic acids encoding the XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a p20 spacer peptide; a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), p20, a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-transframe region-Pol protease polyprotein; a protease cleavage site(s); and a protease capable of cleaving the protease cleavage sites.
- a matrix polypeptide MA
- CA capsid polypeptide
- NC nucleocapsid polypeptide
- Gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), p20
- Gag components e.g., MA, CA, p20, andNC
- the protease cleavage site and protease are derived from an Epsilonretrovirus , including but not limited to Walleye dermal sarcoma virus, and Walleye epidermal hyperplasia virus 1 and 2.
- the encoding sequences for the Epsilonretrovirus-denvcd components are selected from the group consisting of the sequences of SEQ ID NOS: 273-277 as set forth in Table 5, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the nucleic acids encode a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- encoding nucleotides for protease cleavage sites are located between each of the individual components. In other cases, the protease cleavage sites are omitted. In a particular embodiment, an encoding sequence for a single protease cleavage site is located between the sequence encoding the nuclease and the linked retroviral component, which may be a retroviral sequence or a non-viral sequence, such as one that can be cleaved by TEV, PreScission Protease, or any of the other proteases disclosed herein. Representative configurations and sequences are presented in the Examples. In a particular embodiment, the encoded therapeutic payload is a CasX and gNA embodiment described herein, while the encoded tropism factor is a viral glycoprotein embodiment described herein.
- the nucleic acids encoding the XDP viral components are derived from Gammaretrovirus, including but not limited to murine leukemia virus (MLV), Maloney murine leukemia virus (MMLV), and feline leukemia virus (FLV).
- MLV murine leukemia virus
- MMLV Maloney murine leukemia virus
- FLV feline leukemia virus
- the nucleic acids encoding the present disclosure provides XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a ppl2 spacer peptide; a capsid polypeptide (CA); a nucleocapsid polypeptide (NC); a Gag polyprotein comprising a matrix polypeptide (MA), a ppl2 spacer, a capsid polypeptide (CA), a nucleocapsid polypeptide (NC); a therapeutic payload; a tropism factor; a Gag-transframe region-Pol protease polyprotein; a protease cleavage site(s); and a protease capable of cleaving the protease cleavage sites.
- a matrix polypeptide MA
- a ppl2 spacer peptide a capsid polypeptide
- NC nucleocapsid polypeptide
- Gag polyprotein comprising a matrix polypeptid
- Gag components e.g., MA, ppl2, CA, and NC
- the protease cleavage site and protease are derived from an Gammaretrovirus , including but not limited to Walleye dermal sarcoma virus, and Walleye epidermal hyperplasia virus 1 and 2.
- the encoding sequences for the Gammaretrovirus-denved components are selected from the group consisting of the sequences of SEQ ID NOS: 278-287 as set forth in Table 5, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the nucleic acids encode a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- encoding nucleotides for protease cleavage sites are located between each of the individual components. In other cases, the protease cleavage sites are omitted. In a particular embodiment, an encoding sequence for a single protease cleavage site is located between the sequence encoding the nuclease and the linked retroviral component, which may be a retroviral sequence or a non-viral sequence, such as one that can be cleaved by TEV, PreScission Protease, or any of the other proteases disclosed herein. Representative configurations and sequences are presented in the Examples. In a particular embodiment, the encoded therapeutic payload is a CasX and gNA embodiment described herein, while the encoded tropism factor is a viral glycoprotein embodiment described herein.
- the nucleic acids encoding the XDP viral components are derived from Lentivirus , including but not limited to HIV-1 and HIV-2, and Simian immunodeficiency virus (SIV).
- the present disclosure provides nucleic acids encoding the XDP wherein the XDP comprises components selected from the group consisting of: a matrix polypeptide (MA); a capsid (CA), a p2 spacer peptide, a nucleocapsid (NC), a pl/p6 spacer peptide; ); a Gag polyprotein comprising a matrix polypeptide (MA), CA, P2, NC, and pl/p6; a therapeutic payload; a tropism factor; a Gag-transframe region-Pol protease polyprotein; a protease cleavage site(s); and a protease capable of cleaving the protease cleavage sites.
- MA matrix polypeptide
- CA capsid
- NC nucleocaps
- Gag components e.g., MA, CA, NC, and pl/p6
- protease cleavage site and protease are derived from an Lentivirus , including but not limited to HIV-1, HIV-2, and Simian immunodeficiency virus (SIV).
- Lentivirus including but not limited to HIV-1, HIV-2, and Simian immunodeficiency virus (SIV).
- the encoding sequences for the Lentivirus- derived components are selected from the group consisting of the sequences of SEQ ID NOS: 288-312 and 334-339 as set forth in Table 5, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the nucleic acids encode a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- encoding nucleotides for protease cleavage sites are located between each of the individual components. In other cases, the protease cleavage sites are omitted. In a particular embodiment, an encoding sequence for a single protease cleavage site is located between the sequence encoding the nuclease and the linked retroviral component, which may be a retroviral sequence or a non-viral sequence, such as one that can be cleaved by TEV, PreScission Protease, or any of the other proteases disclosed herein. Representative configurations and sequences are presented in the Examples. In a particular embodiment, the encoded therapeutic payload is a CasX and gNA embodiment described herein, while the encoded tropism factor is a viral glycoprotein embodiment described herein.
- the nucleic acids encoding the XDP viral components are derived from Spumaretrovirinae, including but not limited to Bovispumavirus, Equispumavirus, Felispumavirus, Prosimiispumavirus, Simiispumavirus, and Spumavirus.
- the present disclosure provides nucleic acids encoding the XDP wherein the XDP comprises components selected from the group consisting of: P68 Gag; a p3 Gag; a Gag polyprotein comprising of P68 Gag and p3 gag; a therapeutic payload; a tropism factor; a Gag-transframe region-Pol protease polyprotein; a protease cleavage site(s); and a protease capable of cleaving the protease cleavage sites.
- the XDP comprises components selected from the group consisting of: P68 Gag; a p3 Gag; a Gag polyprotein comprising of P68 Gag and p3 gag; a therapeutic payload; a tropism factor; a Gag-transframe region-Pol protease polyprotein; a protease cleavage site(s); and a protease capable of cleaving the protease cleavage sites.
- Gag components e.g., MA, CA, p20, and NC
- the protease cleavage site and protease are derived from an Spumaretrovirinae including but not limited to Bovispumavirus, Equispumavirus, Felispumavirus, Prosimiispumavirus, Simiispumavirus, and Spumavirus.
- the encoding sequences for the Sumaretrovirinae- derived components are selected from the group consisting of the sequences of SEQ ID NOS: 313-333 as set forth in Table 5, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the nucleic acids encode a subset of the components listed in the paragraph, such as depicted in FIGS. 36-68, which depict CasX and gNA as the therapeutic payloads.
- encoding nucleotides for protease cleavage sites are located between each of the individual components. In other cases, the protease cleavage sites are omitted. In a particular embodiment, an encoding sequence for a single protease cleavage site is located between the sequence encoding the nuclease and the linked retroviral component, which may be a retroviral sequence or a non-viral sequence, such as one that can be cleaved by TEV, PreScission Protease, or any of the other proteases disclosed herein. Representative configurations and sequences are presented in the Examples. In a particular embodiment, the encoded therapeutic payload is a CasX and gNA embodiment described herein, while the encoded tropism factor is a viral glycoprotein embodiment described herein.
- the present disclosure provides nucleic acids encoding the XDP wherein the retroviral components of the XDP are selected from different genera of the Retroviridae.
- the nucleic acids encoding the XDP can comprise two or more components selected from a matrix polypeptide (MA), a p2A spacer peptide, a p2B spacer peptide; a plO spacer peptide, a capsid polypeptide (CA), a nucleocapsid polypeptide (NC), a pp21/24 spacer peptide, a p3-p8 spacer peptide, a ppl2 spacer peptide, a p20 spacer peptide, a pl/p6 spacer peptide, a p68 Gag, a p3 Gag, a cleave site(s), and a protease capable of cleaving the protease cleavage sites wherein the components
- MA matrix polypeptide
- the accessory protein integrase (or its encoding nucleic acid) can be omitted from the XDP systems, as well as the HIV functional accessory genes vpr, vpx (HIV-2), which are dispensable for viral replication in vitro. Additionally, the nucleic acids of the XDP system do not require reverse transcriptase for the creation of the XDP compositions of the embodiments.
- the HIV-1 Gag-Pol component of the XDP can be truncated to Gag linked to the transframe region (TFR) composed of the transframe octapeptide (TFP) and 48 amino acids of the p6pol, separated by a protease cleavage site, hereinafter referred to as Gag-TFR-PR, described more fully, below.
- TFR transframe region
- TFP transframe octapeptide
- Gag-TFR-PR protease cleavage site
- the present disclosure provides nucleic acids encoding sequences for the tropism factors that are incorporated in, and displayed on the surface of the XDP, wherein the tropism factor confers an increased ability of the XDP to bind and fuse with the membrane of a target cell or tissue.
- the tropism factor is a glycoprotein
- the encoding nucleic acid is selected from the group consisting of the sequences of Table 4, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- the disclosure provides a nucleic acids encoding an antibody fragment, wherein the antibody fragment has specific binding affinity for a target cell marker or receptor on a target cell or tissue.
- the disclosure provides nucleic acids encoding a cell receptor, wherein the cell receptor has specific binding affinity for a target cell marker on a target cell or tissue.
- the disclosure provides nucleic acids encoding a ligand, wherein the ligand has specific binding affinity for a target cell marker or receptor on a target cell or tissue.
- the present disclosure further provides nucleic acids encoding or comprising the therapeutic payloads incorporated into the XDP.
- Exemplary therapeutic payloads have been described herein, supra.
- the therapeutic payload of the XDP is a CRISPR nuclease and one or more guide RNAs.
- the disclosure provides nucleic acids encoding the CasX nucleases of Table 1, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto.
- nucleic acids are presented in Tables 6-8, 11 and 16 of the Examples, which disclose nucleic acids of SEQ ID NOS: 354, 340-342, 346-349, 378-387 and 426-431.
- the disclosure provides nucleic acids encoding the gNA variants of SEQ ID NO: 597- 781 set forth in Table 3, or a sequence having at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity thereto, and wherein the gNA further comprises a targeting sequence complementary to a target nucleic acid.
- the components of the XDP systems are encoded by one, two, three, four, five or more nucleic acids (see FIGS. 36-68, which are schematics of the representative plasmids and XDP configurations), which can encode single components or multiple components that are operably linked to (under the control of) regulatory elements operable in a eukaryotic cell and appropriate for the component to be expressed.
- the absolute order of the components encoded within a nucleic acid may be varied in order to take advantage of the placement of the regulatory elements, cleavage sequences, etc., such that each component can be expressed and/or utilized in the assembly of the XDP in an optimal fashion, as would be understood by one of ordinary skill in the art.
- the order (5’ to 3’) may be Gag-cleavage site-therapeutic payload or it may be therapeutic payload-cleavage site-gag, and it is intended that the same would apply for any combination of components encoded in a single nucleic acid.
- Representative regulatory elements are described herein.
- the disclosure provides nucleic acids comprising sequences encoding components of the XDP system selected from two or more of a retroviral Gag polyprotein (all or portions thereof), a protease cleavage site, a therapeutic payload, a Gag-Pol polyprotein, and a tropism factor, wherein the components are encoded on one, two, three, or four individual nucleic acids. In some embodiments of the foregoing, the components are encoded on a single nucleic acid.
- a first nucleic acid encodes the Gag polyprotein (or portions thereof) and the CasX protein as the therapeutic payload with, optionally, an intervening protease cleavage site between the two components, and a second nucleic acid encodes the Gag-Pol polyprotein (or portions thereof), the tropism factor and the gNA.
- a first nucleic acid encodes the Gag polyprotein (or portions thereof) and the CasX protein as the therapeutic payload with, optionally, and intervening protease cleavage site separating the two components
- a second nucleic acid encodes the Gag-Pol polyprotein
- a third nucleic acid encodes the tropism factor and the gNA.
- a first nucleic acid encodes the Gag polyprotein (or portions thereof) and the CasX protein as the therapeutic payload with, optionally, an intervening protease cleavage site separating the two components, a second nucleic acid encodes the tropism factor, a third nucleic acid encodes the Gag-Pol polyprotein (or portions thereof), and a fourth nucleic acid encodes the gNA.
- the protease cleavage sites are omitted. In other cases, protease cleavage sites are located between each component of the Gag polyprotein and, optionally, the therapeutic payload. Representative examples of the encoding nucleic acids of the foregoing embodiments are presented in the Examples.
- the disclosure provides nucleic acids comprising sequences encoding components of the XDP system comprising the Gag-TFR-PR polyprotein (or portions thereof), the protease cleavage site, the CasX protein as the therapeutic payload, the gNA, and the tropism factor, wherein the components are encoded on one, two, or three individual nucleic acids.
- the components are encoded on a single nucleic acid.
- a first nucleic acid encodes the Gag-TFR-PR polyprotein and the CasX protein as the therapeutic payload with an intervening protease cleavage site separating the two components
- a second nucleic acid encodes the tropism factor and the gNA.
- a first nucleic acid encodes the Gag-TFR-PR polyprotein and the CasX protein as the therapeutic payload with an intervening protease cleavage site separating the two components
- a second nucleic acid encodes the tropism factor
- a third nucleic acid encodes the gNA.
- protease cleavage sites are located between each component of the Gag polyprotein and, optionally, the CasX protein.
- Representative examples of the encoding nucleic acids of the foregoing embodiments are presented in the Examples (see Tables 16, 17, 19, 20, 22, 23, 24, 27, 30, 33 and 36 and the sequences contained therein).
- the disclosure provides nucleic acids comprising sequences encoding components of the XDP system comprising the Gag polyprotein (or portions thereof), the protease cleavage site, the protease, the CasX protein, the gNA and the tropism factor wherein the components are encoded on one, two, or three individual nucleic acids.
- the components are encoded on a single nucleic acid.
- a first nucleic acid encodes the Gag polyprotein, the protease, the CasX protein, and intervening protease cleavage sites located between the components, and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA.
- a first nucleic acid encodes the Gag polyprotein, the protease, the CasX protein and intervening protease cleavage sites between the components
- a second nucleic acid encodes the tropism factor
- a third nucleic acid encodes one or more gNA.
- the disclosure provides nucleic acids comprising sequences encoding components of the XDP system comprising the Gag-Pol polyprotein, the CasX protein, the protease cleavage site, the gNA, the RNA binding domain, and the tropism factor, wherein the components are encoded on one, two, or three individual nucleic acids.
- the components are encoded on a single nucleic acid.
- a first nucleic acid encodes the Gag-Pol polyprotein and the CasX with an intervening protease cleavage site between the two components
- a second nucleic acid encodes the tropism factor, the gNA and the RNA binding domain
- a first nucleic acid encodes the Gag-Pol polyprotein and the CasX with an intervening protease cleavage site between the two components
- a second nucleic acid encodes the tropism factor
- a third nucleic acid encodes the gNA and the RNA binding domain.
- the disclosure provides nucleic acids comprising sequences encoding components of the XDP system comprising the Gag-Pol polyprotein, the CasX protein, the protease cleavage site, the tropism factor, and the gNA, wherein the components are encoded on one, two, or three individual nucleic acids.
- the components are encoded on a single nucleic acid.
- a first nucleic acid encodes the first nucleic acid encodes the Gag-Pol polyprotein and the CasX with an intervening protease cleavage site between the two components
- a second nucleic acid encodes the tropism factor and the gNA.
- a first nucleic acid encodes the Gag-Pol polyprotein and the CasX with an intervening protease cleavage site between the two components
- a second nucleic acid encodes the tropism factor
- a third nucleic acid encodes the gNA.
- the disclosure provides nucleic acids comprising sequences encoding components of the XDP system comprising the MA, the CasX protein, the protease, the protease cleavage site, the gNA, and the tropism factor, wherein the components are encoded on one, two, three, or four individual nucleic acids.
- the components are encoded on a single nucleic acid.
- a first nucleic acid encodes the first nucleic acid encodes the MA, the CasX protein, the protease, and intervening protease cleavage sites between the three components
- a second nucleic acid encodes the tropism factor and the gNA.
- a first nucleic acid encodes the MA, the CasX protein the protease, and intervening protease cleavage sites between the three components
- a second nucleic acid encodes the tropism factor
- a third nucleic acid encodes the gNA.
- a first nucleic acid encodes the MA and the CasX protein with an intervening protease cleavage site between the two components
- a second nucleic acid encodes the tropism factor
- a third nucleic acid encodes the gNA
- a fourth nucleic acid encodes the protease.
- the first nucleic acid can further encode a CA component linked to the MA by an additional intervening protease cleavage site.
- the protease and protease cleavage sites are omitted.
- the disclosure provides nucleic acids comprising sequences encoding components of the XDP system comprising the Gag polyprotein (all or portions thereof), the CasX protein, the protease, the protease cleavage site, the gNA, the tropism factor, and the Gag-Pol polyprotein (all or portions thereof), wherein the components are encoded on two, three, or four individual nucleic acids.
- a first nucleic acid encodes the Gag polyprotein, the CasX protein, the protease, and intervening protease cleavage sites between the three components
- a second nucleic acid encodes the Gag-Pol polyprotein, the tropism factor, and the gNA.
- a first nucleic acid encodes the Gag polyprotein and the CasX protein with an intervening protease cleavage site between the two components
- a second nucleic acid encodes the protease
- a third nucleic acid encodes the tropism factor, the gNA, and the Gag-Pol polyprotein.
- a first nucleic acid encodes the Gag polyprotein, and the CasX protein with an intervening protease cleavage site between the two components, a second nucleic acid encodes the protease, a third nucleic acid encodes the tropism factor, and a fourth nucleic acid encodes the gNA and the Gag-Pol polyprotein.
- the protease and protease cleavage sites are omitted.
- the XDP system is encoded by a portion or all of a sequence selected from the group consisting of the nucleic acid sequences of SEQ ID NOs: 426-436, 784- 823, 828-873, 880-933, 947-1009 as set forth in Tables 16, 17, 19, 20, 22, 23, 24, 27, 30, 33, or 36, or a sequence having at least about 80%, at least about 90%, at least about 95%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% sequence identity thereto.
- the nucleic acids encoding the XDP system of any of the embodiments described herein further comprises a donor template nucleic acid wherein the donor template comprises a sequence to be inserted into a target nucleic acid to either correct a mutation or to knock-down or knock-out a gene.
- the donor template sequence comprises a non-homologous sequence flanked by two regions of homology 5’ and 3’ to the break sites of the target nucleic acid (i.e., homologous arms), facilitating insertion of the non-homologous sequence at the target region which can be mediated by HDR or HIT!
- the exogenous donor template inserted by HITI can be any length, for example, a relatively short sequence of between 1 and 50 nucleotides in length, or a longer sequence of about 50-1000 nucleotides in length.
- the lack of homology can be, for example, having no more than 20-50% sequence identity and/or lacking in specific hybridization at low stringency. In other cases, the lack of homology can further include a criterion of having no more than 5, 6, 7, 8, or 9 bp identity. In such cases, the use of homologous arms facilitates the insertion of the non- homologous sequence at the break site(s) introduced by the nuclease.
- the donor template polynucleotide comprises at least about 10, at least about 50, at least about 100, or at least about 200, or at least about 300, or at least about 400, or at least about 500, or at least about 600, or at least about 700, or at least about 800, or at least about 900, or at least about 1000, or at least about 10,000, or at least about 15,000 nucleotides.
- the donor template comprises at least about 10 to about 15,000 nucleotides, or at least about 100 to about 10,000 nucleotides, or at least about 400 to about 8,000 nucleotides, or at least about 600 to about 5000 nucleotides, or at least about 1000 to about 2000 nucleotides.
- the donor template sequence may comprise certain sequence differences as compared to the genomic sequence; e.g., restriction sites, nucleotide polymorphisms, selectable markers (e.g., drug resistance genes, fluorescent proteins, enzymes etc.), etc., which may be used to assess for successful insertion of the donor nucleic acid at the cleavage site or in some cases may be used for other purposes (e.g., to signify expression at the targeted genomic locus).
- sequence differences may include flanking recombination sequences such as FLPs, loxP sequences, or the like, that can be activated at a later time for removal of the marker sequence.
- the donor template comprises a nucleic acid encoding at least a portion of a target gene wherein the donor template nucleic acid comprises all or a portion of the wild-type sequence compared to the target gene comprising a mutation, wherein the donor template is inserted into the target nucleic acid of the cell by HDR during the gene editing process.
- the target gene upon insertion into the target nucleic acid, the target gene is corrected such that the functional gene product can be expressed.
- the donor template ranges in size from 10-10,000 nucleotides. In other embodiments, the donor template ranges in size from 100-1,000 nucleotides.
- the donor template is a single-stranded DNA template or a single stranded RNA template.
- the donor template is a double-stranded DNA template.
- the donor template nucleic acid is incorporated in the first nucleic acid of the XDP system.
- the donor template nucleic acid is incorporated in the second nucleic acid.
- the donor template nucleic acid is incorporated in the third nucleic acid.
- the donor template nucleic acid is incorporated in the fourth or a fifth nucleic acid.
- each of the individual nucleic acids are incorporated into plasmid vectors appropriate for transfection into a eukaryotic packaging cell, examples of which are detailed more fully, below, such that the XDP system will involve one, two, three, four, or five plasmids, as depicted in FIGS. 36-68.
- the nucleotide sequence encoding the components of the XDP system are operably linked to (under the control of) regulatory elements operable in a eukaryotic cell and appropriate for the component to be expressed.
- Exemplary regulatory elements include a transcription promoter (e.g., CMV, CMV+intron A, SV40, RSV, HIV-Ltr, MMLV-ltr, and metallothionein), a transcription enhancer element, a transcription termination signal, internal ribosome entry site (IRES) or p2A peptide to permit translation of multiple genes from a single transcript, polyadenylation sequences to promote downstream transcriptional termination, sequences for optimization of initiation of translation, and translation termination sequences.
- a transcription promoter e.g., CMV, CMV+intron A, SV40, RSV, HIV-Ltr, MMLV-ltr, and metallothionein
- a transcription enhancer element e.g., CMV, CMV+intron A, SV40, RSV, HIV-Ltr, MMLV-ltr, and metallothionein
- a transcription enhancer element e.g., CMV, CMV+intron
- the promoter is a constitutive promoter, such as a CMV promoter, CAGG, PGK, U6 (for RNA pol III, which synthesizes shRNAs), elongation factor 1 alpha (EF1 -alpha), or HI.
- a constitutive promoter such as the human cytomegalovirus immediate early (HCMV-IE) enhancer/promoter is used to compensate for the regulation of transcription normally provided by tat.
- HCMV-IE human cytomegalovirus immediate early
- the promoter can be an inducible promoter such as, but are not limited to, T7 RNA polymerase promoter, T3 RNA polymerase promoter, isopropyl-beta-D-thiogalactopyranoside (IPTG)-regulated promoter, heat shock promoter, or tetracycline-regulated promoter (TRE), or a negative inducible pLac promoter. Any strong promoter known to those skilled in the art can be used for driving the expression of the nucleic acid.
- the vector in the case of the nucleic acid encoding the lentiviral packaging components, can be a psPax2 (detailed in the Examples, SEQ ID NO: 430) or pMDLg/pRRE plasmid. In the case of the nucleic acid encoding the VSV-G pseudotyping viral envelope glycoprotein, the vector can be a pMD2.G plasmid.
- the vectors of the embodiments may also comprise a polyadenylation signal, which may be downstream, for example, of the therapeutic payload, such as the CasX sequence.
- the polyadenylation signal may be a SV40 polyadenylation signal, LTR polyadenylation signal, bovine growth hormone (bGH) polyadenylation signal, human growth hormone (hGH) polyadenylation signal, or human u- globin polyadenylation signal.
- the SV40 polyadenylation signal may be a polyadenylation signal from a pCEP4 vector (Invitrogen, San Diego, CA).
- the vectors of the embodiments may also comprise an enhancer upstream of the therapeutic payload, such as the CasX sequence or gNA sequence.
- the enhancer may be necessary for DNA expression.
- the enhancer may be human actin, human myosin, human hemoglobin, human muscle creatine or a viral enhancer such as one from CMV, HA, RSV, or EBV.
- Polynucleotide function enhancers are described in U.S. Patent Nos. 5,593,972, 5,962,428, and WO94/016737, the contents of each are fully incorporated by reference.
- the vector may also comprise a mammalian origin of replication in order to maintain the vector extrachromosomally and produce multiple copies of the vector in a cell.
- the vector may also comprise a regulatory element, which may be well suited for gene expression in a mammalian or human cell into which the vector is administered.
- the vector may also comprise a reporter gene, such as green fluorescent protein (“GFP”) and/or a selectable marker, such as hygromycin (“Hygro”).
- the vectors can include additional sequences encoding factors or accessory proteins that assist in the replication of viral proteins.
- the HIV-based vector comprises a sequence encoding tat, a protein involved in the activation of RNA Polymerase II, and that stimulates transcription and translation (Das, A., et al. The HIV-1 Tat Protein Has a Versatile Role in Activating Viral Transcription. J Virol. 85(18): 9506 (2011)).
- the HIV-based vector comprises a sequence encoding Rev, an RNA binding protein that is critical in the nuclear export of intron-containing HIV-1 RNA (Pollard, V., et al. The HIV-1 Rev protein.
- the HIV-based vector comprises a sequence encoding viral infectivity factor (Vif), an accessory proteins essential for viral replication that disrupts the antiviral activity of the mammalian enzyme APOBEC by targeting it for ubiquitination and cellular degradation (Yang, G., et al. Viral infectivity factor: a novel therapeutic strategy to block HIV-1 replication. Minireviw Med Chem 13(7): 1047 (2013)).
- the HIV-based vector comprises a sequence encoding Viral protein U (Vpu), an accessory protein essential for suppressing the antiviral activity of host cell restriction factors as well as the efficient release of viral particles from infected cells (Gonzalez, M.
- the HIV-based vector comprises a sequence encoding Negative Factor (Net), an accessory protein essential for both evading host adaptive cell-mediated immunity as well as enhancing infectivity in the target cell (Basmaciogullari, S., et al. The activity of Nef on HIV-1 infectivity. Frontiers Microbiol 5:232 (2014).
- Net Negative Factor
- the HIV-based vector comprises a sequence encoding Viral protein R (VpR), an accessory protein important for its interactions with a number of cellular proteins that impact viral replication in addition to a potential role in restricting host anti-viral pathways (Zhao, Richard Y, and Michael I Bukrinsky. HIV-1 accessory proteins: VpR. Methods Mol Biol 1087:125 (2014).
- the HIV-based vector comprises a sequence encoding any combination of tat, Vif, Rev, Vpu, Nef, and VpR.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding a matrix polypeptide (MA), a capsid polypeptide (CA), a nucleocapsid polypeptide (NC), a pl/p6 polypeptide and a CasX polypeptide.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, pl/p6 operably linked, for example by a ribosomal frameshift, to a protease (PRO), a reverse transcriptase (RT) and an integrase (INT).
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, a NC, pl/p6 and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, pl/p6, CasX and PRO.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding a matrix polypeptide (MA), a capsid polypeptide (CA), a nucleocapsid polypeptide (NC), a pl/p6 polypeptide and a CasX polypeptide.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, pl/p6 operably linked, for example by a ribosomal frameshift, to a CasX polypeptide, and PRO.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6 operably linked, for example by a ribosomal frameshift, to PRO, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, CasX and PRO.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, and CasX.
- the second nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, CasX and PRO.
- the third nucleic acid comprises, from 5’ to 3’, sequence encoding MA, CA, NC and pl/p6.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, and CasX.
- the second nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pi and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, CasX, and pl/p6 operably linked, for example by a ribosomal frameshift, to PRO.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, CasX, and pl/p6 operably linked, for example by a ribosomal frameshift, to PRO.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CasX, and pl/p6 operably linked, for example by a ribosomal frameshift, to PRO.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CasX, and PRO.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, CasX, and PRO.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, tev cleavage sequence (TCS), and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, pl/p6, TCS and a TEV protease (TEV).
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, TCS, and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, pl/p6, PreScission cleavage sequence (PCS) and a PreScission protease (PSP).
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, TCS, and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, pl/p6, PCS and a PreScission protease (PSP).
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, PCS, and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, pl/p6, PCS and PSP.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, pl/p6, PCS, and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, pl/p6, PCS and TEV.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, PI and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, CasX and Pl/p6 operably linked, for example by a ribosomal frameshift, to PRO.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, CasX and Pl/p6 operably linked, for example by a ribosomal frameshift, to PRO.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CasX, NC, and Pl/p6 operably linked, for example by a ribosomal frameshift, to PRO.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CasX and Pl/p6 operably linked, for example by a ribosomal frameshift, to PRO.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, NC, CasX and PRO.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, CasX and PRO.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, CA, NC, and pl/p6.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, the Alpharetrovirus gag polyprotein components P2A, P2B, and P10, as well as CA, NC, PRO and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, pp21/24, P12/P3/P8, CA, NC operably linked, for example by a ribosomal frameshift, to PRO, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, NC operably linked, for example by a ribosomal frameshift, to PRO, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, p20, CA, NC, PRO, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, ppl2, CA, NC, PRO, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, P6 operably linked, for example by a ribosomal frameshift, to PRO, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding p68- Gag operably linked, for example by a ribosomal frameshift, to PRO, and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, P2A, P2B, P10, CA and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, P2A, P2B, P10, CA and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, P2A, P2B, P10, CA, NC, PRO and CasX.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, pp21/24, P12/P3/P8, CA and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, pp21/24, P12/P3/P8, CA and CasX.
- the second nucleic acid comprises, from 5’ to 3’, MA, pp21/24, P12/P3/P8, CA, NC operably linked, for example by a ribosomal frameshift, to PRO and CasX.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA and CasX.
- the second nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC operably linked, for example by a ribosomal frameshift, to PRO and CasX.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, p20, CA and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, p20, CA and CasX.
- the second nucleic acid comprises, from 5’ to 3’, sequences encoding MA, p20, CA, NC operably linked, for example by a ribosomal frameshift, to PRO and CasX.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, ppl2, CA and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, ppl2, CA and CasX.
- the second nucleic acid comprises, from 5’ to 3’, sequences encoding MA, ppl2, CA, NC, PRO and CasX.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA and CasX.
- the second nucleic acid comprises, from 5’ to 3’, sequences encoding MA, CA, NC, P6 operably linked, for example by a ribosomal frameshift, to PRO and CasX.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, p68-Gag, p3-Gag and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises four nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, sequences encoding p68- Gag, p3-Gag and CasX.
- the second nucleic acid comprises, from 5’ to 3’, sequences encoding p68-Gag, p3-Gag operably linked, for example by a ribosomal frameshift, to PRO and CasX.
- the third nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the fourth nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, P2A, P2B, P10, CA, NC and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, CA, NC and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, CA, NC, p6 and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, pp21/24, P12/P3/P8, CA, NC and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, ppl2, CA, NC and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, p20, CA, NC and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, CA, pl/p6 and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, CA, NC, pl/p6, pl/p6 and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, CA, NC, CasX and pl/p6.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- the XDP system of the disclosure comprises three nucleic acids.
- the first nucleic acid comprises, from 5’ to 3’, MA, CA, NC, P2, pl/p6 and CasX.
- the second nucleic acid comprises a sequence encoding a glycoprotein, for example VSV-G.
- the third nucleic acid comprises a sequence encoding a gNA.
- any of the components may be separated by sequences encoding protease cleavage sites, self-cleaving polypeptides, or internal ribosome entry sites, or any combination thereof.
- the present disclosure relates to packaging cells utilized in the production of XDP.
- packaging cell is used in reference to cell lines that do not contain a packaging signal, but do stably or transiently express viral structural proteins and replication enzymes (e.g., Gag, pol, etc.) which are necessary or useful for the correct packaging of XDP particles.
- the cell line can be any cell line suitable for the production of XDP, including primary ex vivo cultured cells (from an individual organism) as well as established cell lines.
- Cell types may include bacterial cells, yeast cells, and mammalian cells. Exemplary bacterial cell types may include E. coli.
- Exemplary yeast cell types may include Saccharomyces cerevisiae. Also suitable for use as packaging cells are insect cell lines, such as Spodoptera frugiperda sf9 cells.
- Exemplary mammalian cell types may include mouse, hamster, and human primary cells, as we as cell lines such as human embryonic kidney 293 (HEK293) cells, Lenti-X 293T cells, baby hamster kidney (BHK) cells, HepG2 cells, Saos-2 cells, HuH7 cells, NS0 cells, SP2/0 cells, YO myeloma cells, A549 cells, P3X63 mouse myeloma cells, PER cells, PER.C6 cells, hybridoma cells, VERO cells, NIH3T3 cells, COS cells, WI38 cells, MRC5 cells, A549 cells, HeLa cells, Chinese hamster ovary (CHO) cells, or HT1080 cells.
- HEK293 human embryonic kidney 293
- the eukaryotic cell is modified by one or more mutations one or more mutations to reduce expression of a cell surface marker that could be incorporated into the XDP.
- markers can include receptors or proteins capable of being bound by MHC receptors or that would otherwise trigger an immune response in a subject.
- vectors are introduced into the packaging cell that encode the particular therapeutic payload (e.g., a CasX:gNA designed for editing target nucleic acid), as well as the other viral-derived structural components, detailed above, (e.g., the Gag polyprotein, the pol polyprotein, the tropism factor, and, optionally, the donor template nucleic acid sequence).
- the vectors can remain as extra-chromosomal elements or some or all can be integrated into the host cell chromosomal DNA to create a stably-transformed packaging cell.
- the vectors comprising the nucleic acids of the XDP system are introduced into the cell via transfection, transduction, lipofection or electroporation to generate a packaging cell line.
- the introduction of the vectors can use one or more of the commercially available TransMessenger reagents from Qiagen, Stemfect RNA Transfection Kit from Stemgent, and TransIT-mRNA Transfection Kit from Mirus Bio LLC, Lonza nucleofection, Maxagen electroporation and the like. Methods for transfection, transduction or infection are well known to those of skill in the art.
- the packaging vectors are introduced into the cells together with a dominant selectable marker, such as neo, DHFR, Gin synthetase or ADA, followed by selection in the presence of the appropriate drug and isolation of clones.
- a selectable marker gene can be linked physically to genes encoding by the packaging vector.
- XDP with the encapsidated therapeutic payload from the transfected host cell can be mediated by the viral structural protein, Gag.
- Human immunodeficiency virus type 1 (HIV-1) Gag is synthesized as a precursor polyprotein, Pr55 gag . This polyprotein is comprised of four major structural domains, which are cleaved by the viral protease into pl7 matrix (MA), p24 capsid (CA), p7 nucleocapsid (NC), and p6, during or immediately after the budding process (Adamson CS., and Freed EO. Human immunodeficiency virus type 1 assembly, release, and maturation. Adv. Pharmacol. 55:347 (2007)).
- p55 Gag protein Utilizing an HIV-1 system, it is sufficient to express the p55 Gag protein to allow the efficient production of XDPs from cells (Gheysen et ah, Assembly and release of HIV-1 precursor Pr55Gag virus-like particles from recombinant baculovirus-infected insect cells. Cell. 59(1): 103 (1989)).
- MA constitutes the N-terminal domain of the Gag protein and is essential for membrane binding and localization of the Gag precursor to the plasma membrane.
- CA and NC domains promote Gag multimerization through direct protein-protein interactions and indirect RNA-mediated interactions, respectively. Inclusion of the late domain motif within p6 can promote release of XDP particles from the cell surface.
- the Gag polypeptide Upon expression, the Gag polypeptide is targeted to the cell membrane and incorporated in the XDP during membrane budding.
- the HIV-1 protease cleaves Pr55 gag into the mature Gag proteins pl7 matrix (MA), p24 capsid (CA), p7 nucleocapsid (NC), and p6.
- MA pl7 matrix
- CA p24 capsid
- NC p7 nucleocapsid
- the proteolytic processing of Gag results in a major transformation in XDP structure: MA remains associated with the inner face of the viral membrane, whereas CA condenses to form a shell around the NC complex (if incorporated). This rearrangement produces a morphological transition to a particle with a conical core characteristic similar to an infectious virion.
- components derived, in part, from retroviruses can be utilized to create XDP within packaging cells for delivery of the therapeutic payload to the target cells.
- the packaging cell transformed with the XDP system plasmids produce XDP that facilitate delivery of the encapsidated RNP of a CasX:gNA system to cells to effect editing of target nucleic acid.
- the present disclosure provides a recombinant expression system for use in the production of XDP in a selected host packaging cell, comprising an expression cassette comprising the nucleic acids of the XDP system described herein operably linked to regulatory elements compatible with expression in the selected host cell.
- the expression cassettes may be included on one or more vectors as described herein and in the Examples, and may use the same or different promoters.
- Exemplary regulatory elements include a transcription promoter such as, but not limited to, CMV, CMV+intron A, SV40, RSV, HIV-Ltr, elongation factor 1 alpha (EFla), MMLV-ltr, internal ribosome entry site (IRES) or p2A peptide to permit translation of multiple genes from a single transcript, metallothionein, a transcription enhancer element, a transcription termination signal, polyadenylation sequences, sequences for optimization of initiation of translation, and translation termination sequences.
- a transcription promoter such as, but not limited to, CMV, CMV+intron A, SV40, RSV, HIV-Ltr, elongation factor 1 alpha (EFla), MMLV-ltr, internal ribosome entry site (IRES) or p2A peptide to permit translation of multiple genes from a single transcript, metallothionein, a transcription enhancer element, a transcription termination signal, polyadenylation sequences, sequence
- control element will depend on the encoded component to be expressed (e.g., protein or RNA) or whether the nucleic acid comprises multiple components that require different polymerases or are not intended to be expressed as a fusion protein.
- the present disclosure provides methods of making an XDP comprising a therapeutic payload (e.g., an RNP of a CasX protein and a gNA), the method comprising propagating the packaging cell of the embodiments described herein comprising the expression cassettes or the integrated nucleic acids encoding the XDP systems of any one of the embodiments described herein under conditions such that XDPs are produced with the encapsidated therapeutic payload, followed by harvesting the XDPs produced by the packaging cell, as described below or in the Examples.
- the packaging cell produces XDP comprising RNP of a CasX and gNA and, optionally, a donor template for the editing of the target nucleic acid by HDR.
- the packaging cell can be, for example, a mammalian cell (e.g., HEK293 cells, Lenti- X 293T cells, BHK cells, HepG2 cells, Saos-2 cells, HuH7 cells, NSO cells, SP2/0 cells, YO myeloma cells, A549 cells, P3X63 mouse myeloma cells, PER cells, PER.C6 cells, hybridoma cells, VERO cells, NIH3T3 cells, COS cells, WI38 cells, MRC5 cells, A549 cells, HeLa cells, CHO cells, and HT1080 cells), an insect cell (e.g., Trichoplusia ni (Tn5) or Sf9), a bacterial cell, a plant cell, a yeast cell, an antigen presenting cell (e.g., primary, immortalized or tumor-derived lymphoid cells such as macrophages, monocytes, dendritic cells, B-cells,
- Packaging cells can be transfected by conventional methods, including electroporation, use of cationic polymers, calcium phosphate, virus-mediated transfection, transduction, or lipofection.
- the packaging cell can be modified to reduce or eliminate cell surface markers or receptors that would otherwise be incorporated into the XDP, thereby reducing an immune response to the cell surface markers or receptors by the subject receiving an administration of the XDP.
- the introduction of the vectors into the packaging cell can use one or more of the commercially available TransMessenger reagents from Qiagen, Stemfect RNA Transfection Kit from Stemgent, and TransIT-mRNA Transfection Kit from Mirus Bio LLC, Lonza nucleofection, Maxagen electroporation and the like. Methods for transfection, transduction or infection are well known to those of skill in the art.
- XDP are produced by the incubation of the transfected packaging cells in appropriate growth medium for 48 to 96 hours and are collected by filtration of the growth medium through a 0.45 micron filter.
- the XDP can be further concentrated by centrifugation in a 10% or a 10-30% density gradient sucrose buffer.
- the XDP can be concentrated by column chromatography, such as by use of an ion- exchange resin or a size exclusion resin.
- the XDP systems comprising CasX proteins and guides provided herein are useful in methods for modifying target nucleic acids in cells.
- the method utilizes any of the embodiments of the CasX:gNA systems described herein, and optionally includes a donor template embodiment described herein.
- the method knocks-down the expression of a mutant protein in cells comprising the target nucleic.
- the method knocks-out the expression of the mutant protein.
- the method results in the correction of the mutation in the target nucleic acid, resulting in the expression of functional protein.
- the method comprises contacting the cells comprising the target nucleic acid with an effective dose of XDPs comprising RNPs of CasX protein and a guide nucleic acid (gNA) comprising a targeting sequence complementary to the target nucleic acid, wherein said contacting results in modification of the target nucleic acid by the CasX protein.
- the XDP further comprises a donor template wherein the contacting of the cell with the XDP results in insertion of the donor template into the target nucleic acid sequence.
- the donor template is used in conjunction with the RNP to correct a mutation in the target nucleic acid gene, while in other cases the donor template is used to insert a mutation to knock-down or knock-out expression of the expression product of the target nucleic acid gene.
- the method of modifying a target nucleic acid in a cell comprises contacting the cells comprising the target nucleic acid with an effective dose of XDPs wherein the cell is modified in vitro or ex vivo.
- the cells are modified in vivo , wherein a therapeutically-effective dose of the XDP is administered to a subject.
- the method has the advantage over viral delivery systems in that the RNP are comparatively short-lived relative to the nucleic acids delivered in viral systems such as AAV.
- a further advantage of the XDP system is the ability to match the system to specific cell types by manipulating the tropism of the XDP.
- the half-life of the delivered RNP is about 24h, or about 48h, or about 72h, or about 96h, or about 120h, or about 1 week.
- the administration of the XDP results in the improvement of one, two, or more symptoms, clinical parameters or endpoints associated with the disease in the subject.
- the subject administered the XDP is selected from the group consisting of mouse, rat, pig, non-human primate, and human. In a particular embodiment, the subject is a human.
- the XDP is administered to the subject at a dose of at least about 1 x 10 5 XDP particles/kg, or at least about 1 x 10 6 particles/kg, or at least about 1 x 10 7 particles/kg, or at least about 1 x 10 8 particles/kg, or at least about 1 x 10 9 particles/kg, or at least about 1 x 10 10 particles/kg, or at least about 1 x 10 11 particles/kg, or at least about 1 x 10 12 particles/kg, or at least about 1 x 10 13 particles/kg, or at least about 1 x 10 14 particles/kg, or at least about 1 x 10 15 parti cles/kg, or at least about 1 x 10 16 particles/kg.
- the VLP is administered to the subject at a dose of at least about 1 x 10 5 particles/kg to at least about 1 x 10 16 particles/kg. In another embodiment, the VLP is administered to the subject at a dose of at least about 1 x 10 5 particles/kg to about 1 x 10 16 particles/kg, or at least about 1 x 10 6 particles/kg to about 1 x 10 15 particles/kg, or at least about 1 x 10 7 particles/kg to about 1 x 10 14 particles/kg. In other embodiments, the VLP is administered to the subject at a dose of at least about 1 x 10 5 particles/kg to at least about 1 x 10 16 particles/kg.
- the XDP is administered by a route of administration selected from the group consisting of subcutaneous, intradermal, intraneural, intranodal, intramedullary, intramuscular, intravenous, intra-arterial, intracerebroventricular, intracisternal, intrathecal, intracranial, intralumbar, intratracheal, intraosseous, inhalatory, intracontralateral striatum, intraocular, intravitreal, intralymphatical, intraperitoneal routes and sub-retinal routes.
- a route of administration selected from the group consisting of subcutaneous, intradermal, intraneural, intranodal, intramedullary, intramuscular, intravenous, intra-arterial, intracerebroventricular, intracisternal, intrathecal, intracranial, intralumbar, intratracheal, intraosseous, inhalatory, intracontralateral striatum, intraocular, intravitreal, intralymphatical, intraperitoneal routes and sub-retinal routes
- the disclosure provides a method of treatment of a subject having a disease according to a treatment regimen comprising one or more consecutive doses using a therapeutically effective dose of an XDP of any of the embodiments described herein.
- the therapeutically effective dose is administered as a single dose.
- the therapeutically effective dose is administered to the subject as two or more doses over a period of at least two weeks, or at least one month, or at least two months, or at least three months, or at least four months, or at least five months, or at least six months, or once a year, or every 2 or 3 years.
- kits comprising the compositions of the embodiments described herein.
- the kit comprises an XDP comprising a therapeutic payload of any of the embodiment described herein, an excipient and a suitable container (for example a tube, vial or plate).
- the therapeutic payload is an RNP of a CasX and a gNA.
- the kit further comprises a buffer, a nuclease inhibitor, a protease inhibitor, a liposome, a therapeutic agent, a label, a label visualization reagent, or any combination of the foregoing.
- the kit further comprises a pharmaceutically acceptable carrier, diluent or excipient.
- the kit further comprises instructions for use. IX. Exemplary Embodiments
- the XDP system comprises an editing efficiency of at least 75%, at least 80%, at least 85%, at least 87%, at least 90% or at least 91% as per the editing assay dilution in Table 25, or at least 70%, at least 75%, at least 80% or at least 85% as per the editing assay dilution of Table 26.
- the XDP system comprises version 44, encoded by plasmid pXDP40 (SEQ ID NO: 882) as described in Table 24.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least 25%, at least 30%, at least 35% or at least 37% as per the editing assay dilution in Table 25 or at least 5%, at least 10% or at least 13% as per the editing assay dilution of Table 26.
- the XDP system comprises version 63, encoded by plasmid pXDP62 (SEQ ID NO: 904) as described in Table 24.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least 60%, at least 65%, at least 70%, at least 75% or at least 77% as per the editing assay dilution in Table 28, or at least 20%, at least 25%, at least 30% or at least 32% as per the editing assay dilution of Table 29.
- the XDP system comprises version 74a, encoded by plasmid pXDP72 (SEQ ID NO:917) as described in Table 27.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least at least 50%, at least 55%, at least 60%, at least 65% or at least 67% as per the editing assay dilution in Table 28, or at least 25%, at least 30%, at least 35% or at least 38% as per the editing assay dilution of Table 29.
- the XDP system comprises version 75a, encoded by plasmid pXDP73 (SEQ ID NO:918) as described in Table 27.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least 75%, at least 80%, at least 85%, at least 87%, at least 90% or at least 91% as per the editing assay dilution in Table 31, or at least 70%, at least 75%, at least 80% or at least 85% as per the editing assay dilution of Table 32.
- the XDP system comprises version 44, encoded by plasmid pXDP40 (SEQ ID NO: 949) as described in Table 30.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least 25%, at least 30%, at least 35% or at least 37% as per the editing assay dilution in Table 31 or at least 5%, at least 10% or at least 13% as per the editing assay dilution of Table 32.
- the XDP system comprises version 63, encoded by plasmid pXDP62 (SEQ ID NO: 971) as described in Table 30.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least 75%, at least 80%, at least 85%, at least 87%, at least 90% or at least 94% as per the editing assay dilution in Table 34 or at least 75%, at least 80%, at least 85%, at least 87%, at least 90% or at least 95% as per the editing assay dilution of Table 35.
- the XDP system comprises version 102, encoded by plasmid pXDP127 (SEQ ID NO: 976) as described in Table 33.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least 70%, at least 75%, at least 80% or at least 84% as per the editing assay dilution in Table 34 or at least 70%, at least 75%, or at least 80% as per the editing assay dilution of Table 35.
- the XDP system comprises version 7, encoded by plasmid pXDP0017.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least at least 25%, at least 25%, at least 30% or at least 33% as per the editing assay dilution in Table 37 or at least 1.8 % as per the editing assay dilution of Table 38.
- the XDP system comprises version 66B, encoded by plasmid pXDP78 + pXDP54.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- the XDP system comprises an editing efficiency of at least 10%, at least 15%, at least 20% or at least 21% as per the editing assay dilution in Table 37 or at least 5%, at least 7% or at least 9% as per the editing assay dilution of Table 38.
- the XDP system comprises version 87B, encoded by plasmids pXDP83 + pXDP59.
- the XDP system comprises a VSV glycoprotein as encoded by pGP2, and an sgRNA.
- Editing efficiency may be measured by any known method or assay in the art. A person of skill in the art would know how to identify and use such assays. In some embodiments, the editing efficiency may be measured as %TDT positive cells, for example as shown in FIG. 69- 70.
- an XDP system comprises one or more plasmids or elements in an arrangement resulting in an increased editing efficiency compared an XDP system not comprising said arrangement.
- the XDP system may have an increased editing efficiency of at least 1%, 3%, 5%, 7%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99% or 100% compared to an XDP system not comprising the same elements and/or arrangement.
- an XDP system may be derived from Alpharetroviruses (avian leukosis virus (ALV) and rous sarcoma virus (RSV)), and encoded by the three plasmids encoding the Gag-protease-CasX, the glycoprotein (VSV-G), and the guide RNA (sgRNA).
- the elements of the structural plasmid may be arranged as: MA, P2A, P2B, P10, CA, NC, Pro and CasX (FIG. 52A).
- the XDP system version 44 comprises elements of a structural plasmid arranged as: MA, P2A, P2B, P10, CA, NC, Pro and CasX (FIG 52A), wherein version 44 has an increased editing efficiency of at least 1%, 3%, 5%, 7%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%,
- an XDP system may be encoded by the three plasmids as shown in FIG. 53 A.
- the elements of the structural plasmid may be arranged as: MA, CA, NC, Pro and CasX.
- the XDP system version 63 comprises elements of a structural plasmid arranged as: MA, CA, NC, Pro and CasX, wherein version 63 has an increased editing efficiency of at least 1%, 3%, 5%, 7%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99% or 100% compared to an XDP not comprising the same elements and/or arrangement.
- an XDP system may be derived from Gammaretroviruses (FLV and MMLV), and encoded by the three plasmids as shown in FIG. 59B.
- the elements of the structural plasmid may be arranged as: MA, ppl2, CA, and CasX.
- the XDP system version 74a comprises elements of a structural plasmid arranged as: MA, ppl2, CA, and CasX, wherein version 74a has an increased editing efficiency of at least 1%, 3%, 5%, 7%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%,
- an XDP system may be derived from Alpharetroviruses (avian leukosis virus (ALV) and rous sarcoma virus (RSV) and encoded by the three plasmids as shown in FIG. 62B.
- the elements of the structural plasmid may be arranged as: MA, P2A, P2B, P10, CA, NC, and CasX.
- the XDP system version 102 comprises elements of a structural plasmid arranged as: MA, P2A, P2B, P10, CA, NC, and CasX, wherein version 102 has an increased editing efficiency of at least 1%, 3%, 5%, 7%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99% or 100% compared to an XDP not comprising the same elements and/or arrangement.
- an XDP system may be encoded by three plasmids as shown in FIG. 39A.
- the elements of the structural plasmid may be arranged as: MA, CA, NC, pl/p6, and CasX.
- the XDP system version 7 comprises elements of a structural plasmid arranged as: MA, CA, NC, pl/p6, and CasX, wherein version 7 has an increased editing efficiency of at least 1%, 3%, 5%, 7%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99% or 100% compared to an XDP not comprising the same elements and/or arrangement.
- an XDP system may be encoded by the four plasmids as shown in FIG. 56A.
- the elements of structural plasmid 1 may be arranged as: MA, P2A, P2B, P10, CA, and CasX
- elements of structural plasmid 2 may be arranged as: MA, P2A, P2B, P10, CA, NC, Pro, and CasX.
- the XDP system version 66B comprises elements of a structural plasmid 1 arranged as: MA, P2A, P2B, P10, CA, and CasX, and elements of structural plasmid 2 arranged as: MA, P2A, P2B, P10, CA, NC, Pro, and CasX, wherein version 66B has an increased editing efficiency of at least 1%, 3%, 5%, 7%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%,
- an XDP system may be encoded by the four plasmids as shown in FIG. 57A.
- the elements of structural plasmid 1 may be arranged as: MA, pp21/24,
- the XDP system version 87B comprises elements of a structural plasmid larranged as: MA, pp21/24, P12/P3/P8, CA, and CasX, and elements of structural plasmid 2 arranged as: MA, pp21/24, P12/P3/P8, CA, NC, Pro, and CasX, wherein version 87B has an increased editing efficiency of at least 1%, 3%, 5%, 7%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99% or 100% compared to an XDP not comprising the same elements and/or arrangement.
- the XDP systems disclosed herein may be derived from the Retroviridae virus family, including Othoretrovirinae (Lentivirus, Alpharetrovirus, Betaretrovirus, Deltaretrovirus, Epsilonretrovirus, Gammaretrovirus), and Spumaretrovirinae .
- Othoretrovirinae Livirus, Alpharetrovirus, Betaretrovirus, Deltaretrovirus, Epsilonretrovirus, Gammaretrovirus
- Spumaretrovirinae Exemplary XDP system versions and their corresponding virus are shown in Tables 25, 26, 28, 29, 31, 32, 34, 35, 37 and 38.
- the invention may be defined by reference to the following sets of enumerated, illustrative embodiments:
- a CasX delivery particle (CasX XDP) system comprising: a. a first nucleic acid comprising a sequence encoding a fusion polypeptide that comprises: i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); ii) a CasX protein; and iii) a protease cleavage site between the gag polyprotein and the CasX protein; b. a second nucleic acid comprising a sequence encoding a guide RNA; c.
- a first nucleic acid comprising a sequence encoding a fusion polypeptide that comprises: i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); ii) a CasX protein; and iii) a protea
- a third nucleic acid comprising a sequence encoding a fusion polypeptide that comprises: i) a gag polyprotein; and ii) a pol polyprotein comprising at least a protease capable of cleaving the protease cleavage site between the CasX protein and the gag polyprotein; and d. a fourth nucleic acid, comprising a sequence encoding a pseudotyping viral envelope glycoprotein or an antibody fragment that provides for binding and fusion of the XDP to a target cell.
- a CasX delivery particle (CasX XDP) system comprising: a. a first nucleic acid comprising a sequence encoding a fusion polypeptide that comprises: i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); ii) a CasX protein; iii) a protease cleavage site between the gag polyprotein and the CasX protein; and iv) a protease capable of cleaving the protease cleavage site between the CasX protein and the gag polyprotein; b.
- a first nucleic acid comprising a sequence encoding a fusion polypeptide that comprises: i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); ii) a Ca
- a second nucleic acid comprising a sequence encoding a guide RNA; and c. a third nucleic acid, comprising a sequence encoding a pseudotyping viral envelope glycoprotein or antibody fragment that provides for binding and fusion of the XDP to a target cell.
- a CasX delivery particle (CasX XDP) system comprising: a. a first nucleic acid comprising a sequence encoding a fusion polypeptide that comprises: i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); ii) a CasX protein; and iii) a protease cleavage site between the gag polyprotein and the CasX protein; b. a second nucleic acid comprising a sequence encoding a guide RNA; c.
- a first nucleic acid comprising a sequence encoding a fusion polypeptide that comprises: i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); ii) a CasX protein; and iii) a protea
- a third nucleic acid comprising a sequence encoding a protease capable of cleaving the protease cleavage site between the CasX protein and the gag polyprotein; and d. a fourth nucleic acid, comprising a sequence encoding a pseudotyping viral envelope glycoprotein or antibody fragment that provides for binding and fusion of the XDP to a target cell.
- a CasX delivery particle (CasX XDP) system comprising: a. a first nucleic acid comprising a sequence encoding i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); and ii) a chimeric RNA comprising a guide RNA and a retroviral Psi packaging element inserted into the guide RNA; b. a second nucleic acid comprising a sequence encoding a Cas X protein; and c.
- a first nucleic acid comprising a sequence encoding i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); and ii) a chimeric RNA comprising a guide RNA and a retroviral Psi packaging element inserted into the guide RNA; b
- a CasX delivery particle (CasX XDP) system comprising: a. a first nucleic acid comprising a sequence encoding: i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); ii) an RNA binding domain protein; and iii) an optional protease cleavage site between the gag polyprotein and the RNA binding domain protein; b.
- a first nucleic acid comprising a sequence encoding: i) a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); ii) an RNA binding domain protein; and iii) an optional protease cleavage site between the gag polyprotein and the RNA binding domain protein; b.
- a second nucleic acid comprising a sequence encoding a guide RNA and a CasX protein
- a third nucleic acid comprising a sequence encoding a protease capable of cleaving the protease cleavage site between the gag polyprotein and the RNA binding domain protein
- a fourth nucleic acid comprising a sequence encoding a pseudotyping viral envelope glycoprotein or antibody fragment that provides for binding and fusion of the XDP to a target cell.
- Embodiment 1-6 The XDP system of embodiment 5, wherein the RNA binding domain protein is selected from the group consisting of MS2, PP7 or Qbeta, U1A, phage replication loop, kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- the RNA binding domain protein is selected from the group consisting of MS2, PP7 or Qbeta, U1A, phage replication loop, kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- Embodiment 1-7 The XDP system of any one of embodiments 1-3, comprising all or a portion of any one of the nucleic acid sequences of Table 8 or Table 9.
- Embodiment 1-8 The XDP system of any one of the preceding embodiments of Set I, wherein the gag polypeptide comprises one or more protease cleavage sites between the matrix polypeptide (MA) and the capsid polypeptide (CA) and/or between the capsid polypeptide (CA) and the nucleocapsid polypeptide (NC), wherein the one or more protease cleave sites are capable of being cleaved by the protease.
- the gag polypeptide comprises one or more protease cleavage sites between the matrix polypeptide (MA) and the capsid polypeptide (CA) and/or between the capsid polypeptide (CA) and the nucleocapsid polypeptide (NC), wherein the one or more protease cleave sites are capable of being cleaved by the protease.
- Embodiment 1-9 The XDP system of any one of the preceding embodiments of Set I, wherein the protease is selected from the group of proteases consisting of HIV- 1 protease, tobacco etch virus protease (TEV), potyvirus HC protease, potyvirus PI protease, PreScission, b virus NIa protease, B virus RNA-2-encoded protease, aphthovirus L protease, enterovirus 2A protease, rhinovirus 2 A protease, picoma 3C protease, comovirus 24K protease, nepovirus 24K protease, RTSV (rice tungro spherical virus) 3C-like protease, PYVF (parsnip yellow fleck virus) 3C-like protease, cathepsin, thrombin, factor Xa, metalloproteinases MMP-2, -3,
- Embodiment 1-11 The XDP system of embodiment 10, wherein the retrovirus is an alpharetrovirus, a betaretrovirus, a gammaretrovirus, a deltaretrovirus, a epsilonretrovirus, or a lentivirus.
- the retrovirus is an alpharetrovirus, a betaretrovirus, a gammaretrovirus, a deltaretrovirus, a epsilonretrovirus, or a lentivirus.
- Embodiment 1-12 The XDP system of embodiment 11, wherein the lentivirus is a human immunodeficiency virus (HIV).
- HIV human immunodeficiency virus
- Embodiment 1-13 The XDP system of any one of the preceding embodiments of Set I, wherein the gag polyprotein is a retroviral polyprotein.
- Embodiment 1-14 The XDP system of embodiment 13, wherein the gag polyprotein is derived from a alpharetrovirus, a betaretrovirus, a gammaretrovirus, a deltaretrovirus, a epsilonretrovirus, or a lentivirus.
- Embodiment 1-15 The XDP system of embodiment 14, wherein the gag polyprotein is a lentiviral polyprotein.
- Embodiment 1-16 The XDP system of embodiment 15, wherein the lentiviral gag polypeptide is an HIV-1 gag polyprotein.
- Embodiment 1-17 The XDP system of any one of embodiments 13-16, wherein the gag polypeptide further comprises a p6 polypeptide.
- Embodiment 1-18 The XDP system of embodiment 16 or embodiment 17, wherein the HIV-1 gag polypeptide comprises a MA polypeptide, a CA polypeptide, a p2 polypeptide, an NC polypeptide, a pi polypeptide, and a p6 polypeptide, and wherein the HIV gag polyprotein comprises one or more protease cleavage sites located between one or more of: a. the MA polypeptide and the CA polypeptide; b. the CA polypeptide and the p2 polypeptide; c. the p2 polypeptide and the NC polypeptide; d. the NC polypeptide and the pi polypeptide; and e. the pi polypeptide and the p6 polypeptide.
- the HIV gag polyprotein comprises one or more protease cleavage sites located between one or more of: a. the MA polypeptide and the CA polypeptide; b. the CA polypeptide and the p2 polypeptide; c. the p2 polypeptide and the
- Embodiment 1-19 The XDP system of embodiment 18, wherein the protease capable of cleaving the protease cleavage site is selected from the group of proteases consisting of HIV-1 protease, tobacco etch virus protease (TEV), poty virus HC protease, poty virus PI protease, PreScission, b virus NIa protease, B virus RNA-2-encoded protease, aphthovirus L protease, enterovirus 2 A protease, rhinovirus 2 A protease, picorna 3C protease, comovirus 24K protease, nepovirus 24K protease, RTSV (rice tungro spherical virus) 3C-like protease, PYVF (parsnip yellow fleck virus) 3C-like protease, cathepsin, thrombin, factor Xa, metalloproteinases MMP-2
- Embodiment 1-20 The XDP system of embodiment 19, wherein the protease capable of cleaving the protease cleavage site is HIV-1 protease.
- Embodiment 1-21 The XDP system of any one of the preceding embodiments of Set I, further comprising a nucleic acid encoding a retroviral packaging signal and further comprising a donor template nucleic acid complementary to a target nucleic acid.
- Embodiment 1-22 The XDP system of embodiment 21, wherein the donor template nucleic acid sequence comprises at least a portion of a target nucleic acid gene or a regulatory element of the target nucleic acid gene.
- Embodiment 1-2 The XDP system of embodiment 21 or embodiment 22, wherein the donor template nucleic acid sequence comprises a corrective sequence for a mutation in the target nucleic acid gene or regulatory element of the target nucleic acid gene.
- Embodiment 1-24 The XDP system of embodiment 21 or embodiment 22, wherein the donor template nucleic acid sequence comprises a mutation compared to the target nucleic acid gene or regulatory element of the target nucleic acid gene.
- Embodiment 1-25 The XDP system of embodiment 24, where the mutation is an insertion, a deletion, or a substitution of one or more nucleotides in the donor template nucleic acid sequence.
- Embodiment 1-26 The XDP system of any one of the preceding embodiments of Set I, wherein the guide RNA is a single-molecule guide RNA comprising a scaffold sequence and a targeting sequence, wherein the targeting sequence is complementary to a target nucleic acid sequence.
- the guide RNA is a single-molecule guide RNA comprising a scaffold sequence and a targeting sequence, wherein the targeting sequence is complementary to a target nucleic acid sequence.
- Embodiment 1-27 The XDP system of embodiment 26, wherein the guide RNA scaffold sequence has at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to a sequence selected from the group of sequences consisting of SEQ ID NOS: 4, 5, and 597-781.
- Embodiment 1-28 The XDP system of embodiment 26 or embodiment 27, wherein the targeting sequence of the guide RNA consists of 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 consecutive nucleotides.
- Embodiment 1-29. The XDP system of embodiment 28, wherein the targeting sequence of the guide RNA consists of 20 nucleotides.
- Embodiment 1-30 The XDP system of embodiment 28, wherein the targeting sequence of the guide RNA consists of 19 nucleotides.
- Embodiment 1-3 The XDP system of embodiment 28, wherein the targeting sequence of the guide RNA consists of 18 nucleotides.
- Embodiment 1-32 The XDP system of embodiment 28, wherein the targeting sequence of the guide RNA consists of 17 nucleotides.
- Embodiment 1-33 The XDP system of embodiment 28, wherein the targeting sequence of the guide RNA consists of 16 nucleotides.
- Embodiment 1-34 The XDP system of embodiment 28, wherein the targeting sequence of the guide RNA consists of 15 nucleotides.
- Embodiment 1-35 The XDP system of any one of the preceding embodiments of Set I, wherein the guide RNA further comprises one or more ribozymes.
- Embodiment 1-36 The XDP system of embodiment 35, wherein the one or more ribozymes are independently fused to a terminus of the guide RNA.
- Embodiment 1-37 The XDP system of embodiment 35 or embodiment 36, wherein at least one of the one or more ribozymes are a hepatitis delta virus (HDV) ribozyme, hammerhead ribozyme, pistol ribozyme, hatchet ribozyme, or tobacco ringspot virus (TRSV) ribozyme.
- HDV hepatitis delta virus
- TRSV tobacco ringspot virus
- Embodiment 1-39 The XDP system of any one of the preceding embodiments of Set I, wherein the CasX protein comprises a sequence having at least about 90%, or at least about 91%, or at least about 92%, or at least about 93%, or at least about 94%, or at least about 95%, or at least about 96%, or at least about 97%, or at least about 98%, or at least about 99%, or at least 100% sequence identity to a sequence selected from the group consisting of the sequences set forth in Table 1.
- Embodiment 1-40 The XDP system of any one of the preceding embodiments of Set I, wherein the CasX protein has binding affinity for a protospacer adjacent motif (PAM) sequence selected from the group consisting of TTC, ATC, GTC, and CTC.
- PAM protospacer adjacent motif
- Embodiment 1-4 The XDP system of any one of the preceding embodiments of Set I, wherein the CasX protein further comprises one or more nuclear localization signals (NLS).
- Embodiment 1-42. The XDP system of embodiment 41, wherein the one or more NLS are selected from the group of sequences consisting of SEQ ID NOS: 130-166.
- Embodiment 1-4 The CasX variant of embodiment 41 or embodiment 42, wherein the one or more NLS are expressed at the C-terminus of the CasX protein.
- Embodiment 1-44 The CasX variant of embodiment 41 or embodiment 42, wherein the one or more NLS are expressed at the N-terminus of the CasX protein.
- Embodiment 1-45 The CasX variant of embodiment 41 or embodiment 42, wherein the one or more NLS are expressed at the N-terminus and C-terminus of the CasX protein.
- Embodiment 1-46 The XDP system of any one of the preceding embodiments of Set I, wherein the CasX protein comprises a nuclease domain having nickase activity.
- Embodiment 1-47 The XDP system of any one of embodiments 1-45, wherein the CasX protein comprises a nuclease domain having double-stranded cleavage activity.
- Embodiment 1-48 The XDP system of any one of embodiments 1-45, wherein the CasX protein is a catalytically inactive CasX (dCasX) protein, and wherein the dCasX and the guide RNA retain the ability to bind to the target nucleic acid.
- the CasX protein is a catalytically inactive CasX (dCasX) protein
- the dCasX and the guide RNA retain the ability to bind to the target nucleic acid.
- Embodiment 1-49 The XDP system of embodiment 48, wherein the dCasX comprises a mutation at residues: a. D672, E769, and/or D935 corresponding to the CasX protein of SEQ ID NO: 1; or b. D659, E756 and/or D922 corresponding to the CasX protein of SEQ ID NO: 2.
- Embodiment 1-50 The XDP system of embodiment 49, wherein the mutation is a substitution of alanine for the residue.
- Embodiment 1-51 The XDP system of any one of the preceding embodiments of Set I, wherein the envelope glycoprotein is derived from an enveloped virus selected from the group consisting of influenza A, influenza B, influenza C virus, hepatitis A virus, hepatitis B virus, hepatitis C virus, hepatitis D virus, hepatitis E virus, rotavirus, Norwalk virus, enteric adenovirus, parvovirus, Dengue fever virus, monkey pox, Mononegavirales, rabies virus, Lagos bat virus, Mokola virus, Duvenhage virus, European bat virus 1, European bat virus 2,
- Embodiment 1-52 The XDP system of embodiment 51, wherein the envelope glycoprotein is derived from vesicular stomatitis virus (VSV).
- VSV vesicular stomatitis virus
- Embodiment 1-53 The XDP system of any one of embodiments 1-50, wherein the antibody fragment has binding affinity for a cell surface marker or receptor of a target cell.
- Embodiment 1-54 The XDP system of embodiment 53, wherein the antibody fragment is a scFv.
- Embodiment 1-55 A eukaryotic cell comprising the XDP system of any one of the preceding embodiments of Set I.
- Embodiment 1-56 The eukaryotic cell of embodiment 54, wherein the cell is a packaging cell.
- Embodiment 1-57 The eukaryotic cell of embodiment 55 or embodiment 56, wherein the eukaryotic cell is selected from the group consisting of HEK293 cells, Lenti-X 293T cells, BHK cells, HepG2, Saos-2, HuH7, NS0 cells, SP2/0 cells, YO myeloma cells, A549 cells, P3X63 mouse myeloma cells, PER cells, PER.C6 cells, hybridoma cells, VERO, NIH3T3 cells, COS, WI38, MRC5, A549, HeLa cells, CHO cells, or HT1080 cells.
- Embodiment 1-58 The eukaryotic cell of embodiment 56 or embodiment 57, wherein the packaging cell comprises one or more mutations to reduce expression of a cell surface marker.
- Embodiment 1-59 A method of making an XDP comprising a CasX protein, the method comprising: a. introducing the XDP system of any one of embodiments 1-54 into the packaging cell of any one of embodiments 56-58; b. propagating the packaging cell under conditions such that XDPs are produced; and c. harvesting the XDPs produced by the packaging cell.
- Embodiment 1-60 An XDP produced by the method of embodiment 59.
- Embodiment 1-61 An XDP comprising: a. a retroviral capsid (CA), matrix, (MA), and nucleocapsid (NC) polypeptides b. a pseudotyping viral envelope glycoprotein or an antibody fragment that provides for binding and fusion to a target cell; and c. a CasX protein and a guide RNA associated together in a ribonuclear protein complex (RNP) within the XDP.
- CA retroviral capsid
- MA matrix,
- NC nucleocapsid
- RNP ribonuclear protein complex
- Embodiment 1-62 The XDP of embodiment 61, comprising the CasX of any one of embodiments 39-50 and the guide RNA of any one of embodiments 26-38.
- Embodiment 1-63 The XDP of embodiment 61, wherein the pseudotyping viral envelope glycoprotein is derived from the packaging cell of embodiment 57 or embodiment 58 or a nucleic acid encoding the glycoprotein introduced into the packaging cell.
- Embodiment 1-64 The XDP of embodiment 60-63, further comprising a donor template nucleic acid sequence of any one of embodiments 21-25.
- Embodiment 1-65 A method of method of modifying a target nucleic acid sequence in a cell, the method comprising contacting the cell with the XDP of any one of embodiments 60-64, wherein said contacting comprises introducing into the cell the CasX, the guide RNA, and, optionally, the donor template nucleic acid sequence, resulting in modification of the target nucleic acid sequence.
- Embodiment 1-66 The method of embodiment 65, wherein the modification comprises introducing one or more single-stranded breaks in the target nucleic acid sequence.
- Embodiment 1-67 The method of embodiment 65, wherein the modification comprises introducing a double-stranded break in the target nucleic acid sequence.
- Embodiment 1-68 The method of any one of embodiments 65-67, wherein the modification comprises insertion of the donor template into the target nucleic acid sequence.
- Embodiment 1-69 The method of any one of embodiments 65-68, wherein the cell is modified in vitro.
- Embodiment 1-70 The method of any one of embodiments 65-68, wherein the cell is modified in vivo.
- Embodiment 1-71 The method of embodiment 70, wherein the XDP is administered to a subject.
- Embodiment 1-72 The method of embodiment 71, wherein the subject is the subject is selected from the group consisting of mouse, rat, pig, non-human primate, and human.
- Embodiment 1-73 The method of embodiment 71 or embodiment 72, wherein the XDP is administered by a route of administration selected from the group consisting of intravenous, intracerebroventricular, intracistemal, intrathecal, intracranial, lumbar, intratracheal, intraosseous, inhalatory, intracontralateral striatum, intraocular, intravitreal, and sub-retinal routes.
- Embodiment 1-74 The method of any one of embodiments 71-73, wherein the XDP is administered to the subject using a therapeutically effective dose.
- Embodiment 1-75 The method of embodiment 74, wherein the XDP is administered at a dose of at least about 1 x 10 5 particles, or at least about 1 x 10 6 particles, or at least about 1 x 10 7 particles, or at least about 1 x 10 8 particles, or at least about 1 x 10 9 particles, or at least about 1 x 10 10 particles, or at least about 1 x 10 11 particles, or at least about 1 x 10 12 particles, or at least about 1 x 10 13 particles, or at least about 1 x 10 14 particles, or at least about 1 x 10 15 particles, or at least about 1 x 10 16 particles.
- a CasX delivery particle (XDP) system comprising one or more nucleic acids comprising sequences encoding components selected from: a. a matrix polypeptide (MA); b. a capsid polypeptide (CA); c. a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); d. a CasX protein; e. a guide nucleic acid (gNA); f. a pseudotyping viral envelope glycoprotein or antibody fragment that provides for binding and fusion of the XDP to a target cell; g. an RNA binding domain; h.
- XDP CasX delivery particle
- a protease cleavage site i. a gag-transframe region-pol protease polyprotein (gag-TFR-PR); j . a gag-pol polyprotein; and k. a protease capable of cleaving the protease cleavage sites.
- gag-TFR-PR gag-transframe region-pol protease polyprotein
- j a gag-pol polyprotein
- k a protease capable of cleaving the protease cleavage sites.
- Embodiment II-2 The XDP system of Embodiment II- 1, wherein the encoded components comprise the gag polyprotein, the protease cleavage site, the CasX protein, the gag- pol polyprotein, the gNA, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on two, three, or four individual nucleic acids.
- Embodiment II-3 The XDP system of Embodiment II-2, wherein a.
- a first nucleic acid encodes the gag polyprotein, the CasX protein, and an intervening protease cleavage site between the components; and a second nucleic acid encodes the gag-pol polyprotein, the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA; b. a first nucleic acid encodes the gag polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the gag-pol polyprotein; and a third nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA; or c.
- a first nucleic acid encodes the gag polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; a third nucleic acid encodes the gag-pol polyprotein; and a fourth nucleic acid encodes the gNA.
- Embodiment II-4 The XDP system of Embodiment II- 1, wherein the encoded components are selected from the gag-TFR-PR polyprotein, the protease cleavage site, the CasX protein, the gNA, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment II-5 The XDP system of Embodiment II-4, wherein a. the components are encoded on a single nucleic acid; b. a first nucleic acid encodes the gag-TFR-PR polyprotein, the CasX protein, and an intervening protease cleavage site between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA; c.
- a first nucleic acid encodes the gag-TFR-PR polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA.
- Embodiment II-6 The XDP system of Embodiment II- 1, wherein the encoded components are selected from the gag polyprotein, the protease cleavage site, the protease, the CasX protein, the gNA and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment II-7 The XDP system of Embodiment II-6, wherein a. the components are encoded on a single nucleic acid; b. a first nucleic acid encodes the gag polyprotein, the protease, the CasX protein, and intervening protease cleavage sites between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA; c.
- a first nucleic acid encodes the gag polyprotein, the protease, the CasX protein and intervening protease cleavage sites between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA.
- Embodiment II-8 The XDP system of Embodiment II- 1, wherein the encoded components are selected from the gag-pol polyprotein, the CasX protein, the protease cleavage site, the gNA, the RNA binding domain, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment II-9 The XDP system of Embodiment II-8, wherein a. the components are encoded on a single nucleic acid; b. a first nucleic acid encodes the gag-pol polyprotein,, the CasX protein, and intervening protease cleavage sites between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment, the gNA and the RNA binding domain; or c.
- a first nucleic acid encodes the gag-pol polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA and the RNA binding domain.
- Embodiment II- 10 The XDP system of Embodiment II- 1, wherein the encoded components are selected from the gag-TFR-PR polyprotein, the CasX protein, the protease cleavage site, the gNA, the RNA binding domain, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment II- 11 The XDP system of Embodiment II- 10, wherein a. the components are encoded on a single nucleic acid; b. a first nucleic acid encodes the gag-TFR-PR polyprotein, the CasX protein, and an intervening protease cleavage site between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment, the gNA and the RNA binding domain; or c.
- a first nucleic acid encodes the gag-TFR-PR polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA and the RNA binding domain.
- Embodiment 11-12 The XDP system of any one of Embodiments II-8-11, wherein the RNA binding domain is a retroviral Psi packaging element inserted into the gNA or is a protein selected from the group consisting of MS2, PP7 or Qbeta, U1A, phage replication loop, kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- MS2, PP7 or Qbeta a retroviral Psi packaging element inserted into the gNA or is a protein selected from the group consisting of MS2, PP7 or Qbeta, U1A, phage replication loop, kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- Embodiment 11-13 The XDP system of Embodiment II- 1, wherein the encoded components are selected from the gag-pol polyprotein, the CasX protein, the protease cleavage site, the pseudotyping viral envelope glycoprotein or antibody fragment, and the gNA, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment 11-14 The XDP system of Embodiment 11-13, wherein a. the components are encoded on a single nucleic acid; b. a first nucleic acid encodes the gag-pol polyprotein, an intervening protease cleavage site, the CasX protein; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA; or c. a first nucleic acid encodes the gag-pol polyprotein, an intervening protease cleavage site, and the CasX protein; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA.
- Embodiment 11-15 The XDP system of Embodiment II- 1, wherein the encoded components are selected from the MA, the CasX protein, the protease, the protease cleavage site, the gNA, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, three, or four individual nucleic acids.
- Embodiment 11-16 The XDP system of Embodiment 11-15, wherein a. the components are encoded on a single nucleic acid; b. a first nucleic acid encodes the MA, the CasX protein, the protease, and intervening protease cleavage sites between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA; c.
- a first nucleic acid encodes the MA, the CasX protein the protease, and intervening protease cleavage sites between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA; or d. a first nucleic acid encodes the MA, an intervening protease cleavage site, and the CasX protein; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; a third nucleic acid encodes the gNA; and a fourth nucleic acid encodes the protease. [00525] Embodiment 11-17.
- Embodiment 11-18 The XDP system of Embodiment II- 1, wherein the encoded components are selected from the gag polyprotein, the CasX protein, the protease, the protease cleavage site, the gNA, the pseudotyping viral envelope glycoprotein or antibody fragment, and the gag-pol polyprotein, wherein the components are encoded on two, three, or four individual nucleic acids.
- Embodiment 11-19 The XDP system of Embodiment 11-18, wherein a. a first nucleic acid encodes the gag polyprotein, the CasX protein, the protease, and intervening protease cleavage sites between the components; and a second nucleic acid encodes the gag-pol polyprotein, the pseudotyping viral envelope glycoprotein or antibody fragment, and the gNA; or b.
- a first nucleic acid encodes the gag polyprotein, the intervening protease cleavage site, and the CasX protein
- a second nucleic acid encodes the protease
- a third nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment, the gNA and the gag-pol polyprotein
- a first nucleic acid encodes the gag polyprotein, the intervening protease cleavage site, and the CasX protein
- a second nucleic acid encodes the protease
- a third nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment
- a fourth nucleic acid encodes the gNA and the gag-pol polyprotein.
- Embodiment 11-20 The XDP system of Embodiment II-2 or Embodiment II-3, comprising all or a portion of any one of the nucleic acid sequences of Table 6.
- Embodiment 11-21 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the MA, the CA, the gag-TFR-PR polyprotein, the gag polyprotein, and the gag-pol polyprotein are derived from a retrovirus.
- Embodiment 11-22 The XDP system of Embodiment 11-21, wherein the retrovirus is selected from the group consisting of an alpharetrovirus, a betaretrovirus, a gammaretrovirus, a deltaretrovirus, an epsilonretrovirus, and a lentivirus.
- Embodiment 11-23 The XDP system of Embodiment 11-22, wherein the lentivirus is selected from the group consisting of human immunodeficiency- 1 (HIV-1), human immunodeficiency-2 (HIV-2), simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), and bovine immunodeficiency virus (BIV).
- HV-1 human immunodeficiency- 1
- HAV-2 human immunodeficiency-2
- SIV simian immunodeficiency virus
- FIV feline immunodeficiency virus
- BIV bovine immunodeficiency virus
- Embodiment 11-24 The XDP system of Embodiment 11-23, wherein the lentivirus is HIV-1 or SIV.
- Embodiment 11-25 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the gag polypeptide further comprises a p6 polypeptide.
- Embodiment 11-26 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the gag polypeptide comprises a MA polypeptide, a CA polypeptide, a p2 polypeptide, an NC polypeptide, a pi polypeptide, and a p6 polypeptide, and wherein the gag polyprotein comprises one or more protease cleavage sites located between one or more of: a. the MA polypeptide and the CA polypeptide; b. the CA polypeptide and the p2 polypeptide; c. the p2 polypeptide and the NC polypeptide; d. the NC polypeptide and the pi polypeptide; and e. the pi polypeptide and the p6 polypeptide.
- the gag polyprotein comprises one or more protease cleavage sites located between one or more of: a. the MA polypeptide and the CA polypeptide; b. the CA polypeptide and the p2 polypeptide; c. the
- Embodiment 11-27 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the protease capable of cleaving the protease cleavage site is selected from the group of proteases consisting of HIV-1 protease, tobacco etch virus protease (TEV), potyvirus HC protease, potyvirus PI protease, PreScission, b virus NIa protease, B virus RNA-2- encoded protease, aphthovirus L protease, enterovirus 2A protease, rhinovirus 2A protease, picorna 3C protease, comovirus 24K protease, nepovirus 24K protease, RTSV (rice tungro spherical virus) 3C-like protease, PYVF (parsnip yellow fleck virus) 3C-like protease, cathepsin, thrombin, factor
- Embodiment 11-28 The XDP system of Embodiment 11-27, wherein the protease capable of cleaving the protease cleavage site is HIV-1 protease.
- Embodiment 11-29 The XDP system of Embodiment 11-27, wherein the protease capable of cleaving the protease cleavage site is HIV-1 protease.
- EBV Epstein-Bar virus
- Embodiment 11-30 The XDP system of Embodiment 11-29, wherein the pseudotyping viral envelope glycoprotein is derived from vesicular stomatitis virus (VSV).
- Embodiment II-31 The XDP system of any one of Embodiments II- 1-29, wherein the pseudotyping viral envelope glycoprotein comprises a sequence having at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to a sequence selected from the group consisting of the sequences set forth in Table 4.
- Embodiment 11-32 The XDP system of any one of Embodiments II- 1-28, wherein the antibody fragment has binding affinity for a cell surface marker or receptor of a target cell.
- Embodiment 11-33 The XDP system of Embodiment 11-32, wherein the antibody fragment is a scFv.
- Embodiment 11-34 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the gNA is a single-molecule guide RNA comprising a scaffold sequence and a targeting sequence, wherein the targeting sequence is complementary to a target nucleic acid sequence.
- the gNA is a single-molecule guide RNA comprising a scaffold sequence and a targeting sequence, wherein the targeting sequence is complementary to a target nucleic acid sequence.
- Embodiment 11-35 The XDP system of Embodiment 11-29, wherein the guide RNA scaffold sequence has at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to a sequence selected from the group of sequences consisting of SEQ ID NOS: 4, 5, and 2101-2241.
- Embodiment 11-36 The XDP system of Embodiment 11-29 or Embodiment II- Embodiment 11-35, wherein the targeting sequence of the guide RNA consists of 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 consecutive nucleotides.
- Embodiment 11-37 The XDP system of Embodiment 11-36, wherein the targeting sequence of the guide RNA consists of 20 nucleotides.
- Embodiment 11-38 The XDP system of Embodiment 11-36, wherein the targeting sequence of the guide RNA consists of 19 nucleotides.
- Embodiment 11-39 The XDP system of Embodiment 11-36, wherein the targeting sequence of the guide RNA consists of 18 nucleotides.
- Embodiment 11-40 The XDP system of Embodiment 11-36, wherein the targeting sequence of the guide RNA consists of 17 nucleotides.
- Embodiment 11-41 The XDP system of Embodiment 11-36, wherein the targeting sequence of the guide RNA consists of 16 nucleotides.
- Embodiment 11-42 The XDP system of Embodiment 11-36, wherein the targeting sequence of the guide RNA consists of 15 nucleotides.
- Embodiment 11-43 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the guide RNA further comprises one or more ribozymes.
- Embodiment 11-44 The XDP system of Embodiment 11-43, wherein the one or more ribozymes are independently fused to a terminus of the guide RNA.
- Embodiment 11-45 The XDP system of Embodiment 11-43 or Embodiment 11-44, wherein at least one of the one or more ribozymes is a hepatitis delta virus (HDV) ribozyme, hammerhead ribozyme, pistol ribozyme, hatchet ribozyme, or tobacco ringspot virus (TRSV) ribozyme.
- HDV hepatitis delta virus
- TRSV tobacco ringspot virus
- Embodiment 11-46 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the guide RNA is chemically modified.
- Embodiment 11-47 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the CasX protein comprises a sequence having at least about 90%, or at least about 91%, or at least about 92%, or at least about 93%, or at least about 94%, or at least about 95%, or at least about 96%, or at least about 97%, or at least about 98%, or at least about 99%, or at least 100% sequence identity to a sequence selected from the group consisting of the sequences set forth in Table 1.
- Embodiment 11-48 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the CasX protein has binding affinity for a protospacer adjacent motif (PAM) sequence selected from the group consisting of TTC, ATC, GTC, and CTC.
- PAM protospacer adjacent motif
- Embodiment 11-49 The XDP system of Embodiment 11-48, wherein the binding affinity of the CasX protein for the PAM sequence is at least 1.5-fold greater compared to the binding affinity of any one of the CasX proteins of SEQ ID NOS: 1-3 for the PAM sequences.
- Embodiment 11-50 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the CasX protein further comprises one or more nuclear localization signals (NLS).
- NLS nuclear localization signals
- Embodiment II-51 The XDP system of Embodiment 11-50, wherein the one or more NLS are selected from the group of sequences consisting of PKKKRKV, KRPAATKKAGQAKKKK, PAAKRVKLD, RQRRNELKRSP,
- Embodiment 11-52 The CasX variant of Embodiment 11-50 or Embodiment 11-51, wherein the one or more NLS are fused to the C-terminus of the CasX protein.
- Embodiment 11-53 The CasX variant of Embodiment 11-50 or Embodiment 11-51, wherein the one or more NLS are fused to the N-terminus of the CasX protein.
- Embodiment 11-54 The CasX variant of Embodiment 11-50 or Embodiment 11-51, wherein the one or more NLS are fused to the N-terminus and C-terminus of the CasX protein.
- Embodiment 11-55 The XDP system of any one of the preceding embodiments of Set I of Set II, wherein the CasX protein comprises a nuclease domain having nickase activity.
- Embodiment 11-56 The XDP system of any one of Embodiments II- 1-54, wherein the CasX protein comprises a nuclease domain having double-stranded cleavage activity.
- Embodiment 11-57 The XDP system of any one of the preceding embodiments of Set I of Set II, further comprising a nucleic acid encoding a retroviral packaging signal.
- Embodiment 11-58 The XDP system of any one of the preceding embodiments of Set I of Set II, further comprising a donor template nucleic acid complementary to a target nucleic acid.
- Embodiment 11-59 The XDP system of Embodiment 11-58, wherein the donor template comprises two homologous arms complementary to sequences flanking a cleavage site in the target nucleic acid.
- Embodiment 11-60 The XDP system of Embodiment 11-58 or Embodiment 11-59, wherein the donor template nucleic acid sequence comprises a corrective sequence for a mutation in the target nucleic acid.
- Embodiment 11-61 The XDP system of Embodiment 11-58 or Embodiment 11-59, wherein the donor template nucleic acid sequence comprises a mutation compared to the target nucleic acid.
- Embodiment 11-62 The XDP system of Embodiment 11-61, where the mutation is an insertion, a deletion, or a substitution of one or more nucleotides in the donor template nucleic acid sequence.
- Embodiment 11-63 The XDP system of any one of Embodiments II- 1-54, wherein the CasX protein is a catalytically inactive CasX (dCasX) protein, and wherein the dCasX and the guide RNA retain the ability to bind to the target nucleic acid.
- the CasX protein is a catalytically inactive CasX (dCasX) protein
- the dCasX and the guide RNA retain the ability to bind to the target nucleic acid.
- Embodiment 11-64 The XDP system of Embodiment 11-63, wherein the dCasX comprises a mutation at residues: a. D672, E769, and/or D935 corresponding to the CasX protein of SEQ ID NO: 1; or b. D659, E756 and/or D922 corresponding to the CasX protein of SEQ ID NO: 2.
- Embodiment 11-65 The XDP system of Embodiment 11-64, wherein the mutation is a substitution of alanine for the residue.
- Embodiment 11-66 A eukaryotic cell comprising the XDP system of any one of the preceding embodiments of Set I of Set II.
- Embodiment 11-67 The eukaryotic cell of Embodiment 11-66, wherein the cell is a packaging cell.
- Embodiment 11-68 The eukaryotic cell of any one of Embodiments 11-66 or Embodiment 11-67, wherein the eukaryotic cell is selected from the group consisting of HEK293 cells, Lenti-X 293T cells, BHK cells, HepG2, Saos-2, HuH7, NS0 cells, SP2/0 cells, YO myeloma cells, A549 cells, P3X63 mouse myeloma cells, PER cells, PER.C6 cells, hybridoma cells, VERO, NIH3T3 cells, COS, WI38, MRC5, A549, HeLa cells, CHO cells, and HT1080 cells.
- Embodiment 11-69 The eukaryotic cell of Embodiment 11-67 or Embodiment 11-68, wherein the packaging cell comprises one or more mutations to reduce expression of a cell surface marker.
- Embodiment 11-70 The eukaryotic cell of any one of Embodiments 11-66-69, wherein all or a portion of the nucleic acids encoding the XDP system of any one of Embodiments II- 1- 56 are integrated into the genome of the eukaryotic cell.
- Embodiment 11-71 A method of making an XDP comprising a CasX protein and a gNA, the method comprising: a. propagating the packaging cell of any one of Embodiments 11-67-70 under conditions such that XDPs are produced; and b. harvesting the XDPs produced by the packaging cell.
- Embodiment 11-72 An XDP produced by the method of Embodiment 11-71.
- An XDP comprising one or more components selected from: a. a matrix polypeptide (MA); b. a capsid polypeptide (CA); c. a gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC); d. a CasX protein; e. a guide nucleic acid (gNA); f. a pseudotyping viral envelope glycoprotein or antibody fragment that provides for binding and fusion of the XDP to a target cell; and g. an RNA binding domain;
- Embodiment 11-74 The XDP of Embodiment 11-73, wherein the XDP comprises a. the matrix polypeptide (MA); b. the pseudotyping viral envelope glycoprotein or antibody fragment; and c. the CasX and the gNA contained within the XDP.
- Embodiment 11-75 The XDP of Embodiment 11-74, further comprising the capsid polypeptide (CA).
- Embodiment 11-76 The XDP of Embodiment 11-74 or Embodiment 11-75, further comprising the nucleocapsid polypeptide (NC).
- NC nucleocapsid polypeptide
- Embodiment 11-77 The XDP of any one of Embodiments 11-74-76, further comprising an RNA binding domain.
- Embodiment 11-78 The XDP of Embodiment 11-77, wherein the RNA binding domain is a retroviral Psi packaging element inserted into the gNA or is a protein selected from the group consisting of MS2, PP7 or Qbeta, U1A, phage replication loop, kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- MS2, PP7 or Qbeta a retroviral Psi packaging element inserted into the gNA or is a protein selected from the group consisting of MS2, PP7 or Qbeta, U1A, phage replication loop, kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- Embodiment 11-79 The XDP of any one of Embodiments 11-74-78, wherein the CasX and the gNA are associated together in a ribonuclear protein complex (RNP) within the XDP.
- Embodiment 11-80 The XDP of any one of Embodiments 11-74-79, comprising the CasX of any one of Embodiments 11-47-65 and the guide RNA of any one of Embodiments II- 34-46.
- Embodiment 11-81 Embodiment 11-81.
- Embodiment 11-82 The XDP of any one of Embodiments 11-73-80, wherein the pseudotyping viral envelope glycoprotein is derived from an enveloped virus selected from the group consisting of influenza A, influenza B, influenza C virus, hepatitis A virus, hepatitis B virus, hepatitis C virus, hepatitis D virus, hepatitis E virus, rotavirus, Norwalk virus, enteric adenovirus, parvovirus, Dengue fever virus, monkey pox, Mononegavirales, rabies virus, Lagos bat virus, Mokola virus, Duvenhage virus, European bat virus 1, European bat virus 2,
- Embodiment 11-83 The XDP of any one of Embodiments 11-73-82, further comprising the donor template nucleic acid sequence of any one of Embodiments 11-58-62.
- Embodiment 11-84 A method of method of modifying a target nucleic acid sequence in a cell, the method comprising contacting the cell with the XDP of any one of Embodiments II- 73-83, wherein said contacting comprises introducing into the cell the CasX protein, the guide RNA, and, optionally, the donor template nucleic acid sequence, resulting in modification of the target nucleic acid sequence.
- Embodiment 11-85 The method of Embodiment 11-84, wherein the modification comprises introducing one or more single-stranded breaks in the target nucleic acid sequence.
- Embodiment 11-86 The method of Embodiment 11-84, wherein the modification comprises introducing one or more double-stranded breaks in the target nucleic acid sequence.
- Embodiment 11-87 The method of any one of Embodiments 11-84-86, wherein the modification comprises insertion of the donor template into the target nucleic acid sequence.
- Embodiment 11-88 The method of any one of Embodiments 11-84-87, wherein the cell is modified in vitro.
- Embodiment 11-89 The method of any one of Embodiments 11-84-87, wherein the cell is modified in vivo.
- Embodiment 11-90 The method of Embodiment 11-89, wherein the XDP is administered to a subject.
- Embodiment 11-91 The method of Embodiment 11-90, wherein the subject is the subject is selected from the group consisting of mouse, rat, pig, non-human primate, and human. [00600] Embodiment 11-92.
- Embodiment 11-90 or Embodiment 11-91 wherein the XDP is administered by a route of administration selected from the group consisting of subcutaneous, intradermal, intraneural, intranodal, intramedullary, intramuscular, intravenous, intracerebroventricular, intracisternal, intrathecal, intracranial, intralumbar, intratracheal, intraosseous, inhalatory, intracontralateral striatum, intraocular, intravitreal, intralymphatical, intraperitoneal routes and sub-retinal routes.
- a route of administration selected from the group consisting of subcutaneous, intradermal, intraneural, intranodal, intramedullary, intramuscular, intravenous, intracerebroventricular, intracisternal, intrathecal, intracranial, intralumbar, intratracheal, intraosseous, inhalatory, intracontralateral striatum, intraocular, intravitreal, intralymphatical, intraperitoneal routes and sub
- Embodiment 11-93 The method of any one of Embodiments 11-90-92, wherein the XDP is administered to the subject using a therapeutically effective dose.
- Embodiment 11-94 The method of Embodiment 11-93, wherein the XDP is administered at a dose of at least about 1 x 10 5 particles, or at least about 1 x 10 6 particles, or at least about 1 x 10 7 particles, or at least about 1 x 10 8 particles, or at least about 1 x 10 9 particles, or at least about 1 x 10 10 particles, or at least about 1 x 10 11 particles, or at least about 1 x 10 12 particles, or at least about 1 x 10 13 particles, or at least about 1 x 10 14 particles, or at least about 1 x 10 15 particles, or at least about 1 x 10 16 particles.
- Embodiment 11-95 A method for introducing a CasX and gNA RNP into a cell having a target nucleic acid, comprising contacting the cell with the XDP of any one of Embodiments 11-79-83, such that the RNP enters the cell.
- Embodiment 11-96 The method of Embodiment 11-95, wherein the RNP binds to the target nucleic acid.
- Embodiment 11-97 The method of Embodiment 11-96, wherein the target nucleic acid is cleaved by the CasX.
- Embodiment 11-98 The method of any one of Embodiments 11-95-97, wherein the cell is modified in vitro.
- Embodiment 11-99 The method of any one of Embodiments 11-95-97, wherein the cell is modified in vivo.
- Embodiment II- 100 The method of Embodiment 11-99, wherein the XDP is administered to a subject.
- Embodiment II- 101 The method of Embodiment II- 100, wherein the subj ect is the subject is selected from the group consisting of mouse, rat, pig, non-human primate, and human.
- Embodiment 11-102 The method of any one of Embodiments II-99-101, wherein the XDP is administered to the subject using a therapeutically effective dose.
- Embodiment II- 103 The method of Embodiment 11-102, wherein the XDP is administered at a dose of at least about 1 x 10 5 particles, or at least about 1 x 10 6 particles, or at least about 1 x 10 7 particles, or at least about 1 x 10 8 particles, or at least about 1 x 10 9 particles, or at least about 1 x 10 10 particles, or at least about 1 x 10 11 particles, or at least about 1 x 10 12 particles, or at least about 1 x 10 13 particles, or at least about 1 x 10 14 particles, or at least about 1 x 10 15 particles, or at least about 1 x 10 16 particles.
- Embodiment III- 1 A CasX delivery particle (XDP) system comprising one or more nucleic acids comprising sequences encoding components selected from:
- gag polyprotein comprising a matrix polypeptide (MA), a capsid polypeptide (CA), and a nucleocapsid polypeptide (NC);
- gNA guide nucleic acid
- a protease cleavage site (h) a protease cleavage site; (i) a gag-transframe region-pol protease polyprotein (gag-TFR-PR);
- Embodiment III-2 The XDP system of Embodiment III- 1 , wherein the encoded components comprise the gag polyprotein, the protease cleavage site, the CasX protein, the gag- pol polyprotein, the gNA, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on two, three, or four individual nucleic acids.
- Embodiment III-3 The XDP system of Embodiment III-2, wherein
- a first nucleic acid encodes the gag polyprotein, the CasX protein, and an intervening protease cleavage site between the components; and a second nucleic acid encodes the gag-pol polyprotein, the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA;
- a first nucleic acid encodes the gag polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the gag-pol polyprotein; and a third nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA; or
- a first nucleic acid encodes the gag polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; a third nucleic acid encodes the gag-pol polyprotein; and a fourth nucleic acid encodes the gNA.
- Embodiment III-4 The XDP system of Embodiment III- 1 , wherein the encoded components are selected from the gag-TFR-PR polyprotein, the protease cleavage site, the CasX protein, the gNA, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment III-5 The XDP system of Embodiment III-4, wherein
- a first nucleic acid encodes the gag-TFR-PR polyprotein, the CasX protein, and an intervening protease cleavage site between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA;
- a first nucleic acid encodes the gag-TFR-PR polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA.
- Embodiment III-6 The XDP system of Embodiment III- 1 , wherein the encoded components are selected from the gag polyprotein, the protease cleavage site, the protease, the CasX protein, the gNA and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment III-7 The XDP system of Embodiment III-6, wherein
- a first nucleic acid encodes the gag polyprotein, the protease, the CasX protein, and intervening protease cleavage sites between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA;
- a first nucleic acid encodes the gag polyprotein, the protease, the CasX protein and intervening protease cleavage sites between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA.
- Embodiment III-8 The XDP system of Embodiment III- 1 , wherein the encoded components are selected from the gag-pol polyprotein, the CasX protein, the protease cleavage site, the gNA, the RNA binding domain, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment III-9 The XDP system of Embodiment III-8, wherein
- a first nucleic acid encodes the gag-pol polyprotein, the CasX protein, and intervening protease cleavage sites between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment, the gNA and the RNA binding domain;
- a first nucleic acid encodes the gag-pol polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA and the RNA binding domain.
- Embodiment III- 10 The XDP system of Embodiment III- 1 , wherein the encoded components are selected from the gag-TFR-PR polyprotein, the CasX protein, the protease cleavage site, the gNA, the RNA binding domain, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment III- 11 The XDP system of Embodiment III- 10, wherein
- a first nucleic acid encodes the gag-TFR-PR polyprotein, the CasX protein, and an intervening protease cleavage site between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment, the gNA and the RNA binding domain;
- a first nucleic acid encodes the gag-TFR-PR polyprotein, the CasX protein, and an intervening protease cleavage site between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA and the RNA binding domain.
- Embodiment III- 12 The XDP system of any one of Embodiments III- 8-11, wherein the RNA binding domain is a retroviral Psi packaging element inserted into the gNA or is a protein selected from the group consisting of MS2, PP7 or Qbeta, U1A, phage replication loop, kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- MS2, PP7 or Qbeta a retroviral Psi packaging element inserted into the gNA or is a protein selected from the group consisting of MS2, PP7 or Qbeta, U1A, phage replication loop, kissing loop a, kissing loop bl, kissing loop_b2, G quadriplex M3q, G quadriplex telomere basket, sarcin-ricin loop, and pseudoknots.
- Embodiment III- 13 The XDP system of Embodiment III- 1 , wherein the encoded components are selected from the gag-pol polyprotein, the CasX protein, the protease cleavage site, the pseudotyping viral envelope glycoprotein or antibody fragment, and the gNA, wherein the components are encoded on one, two, or three individual nucleic acids.
- Embodiment III- 14 The XDP system of Embodiment III- 13 , wherein
- a first nucleic acid encodes the gag-pol polyprotein, an intervening protease cleavage site, the CasX protein; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA;
- a first nucleic acid encodes the gag-pol polyprotein, an intervening protease cleavage site, and the CasX protein; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA.
- Embodiment III- 15 The XDP system of Embodiment III- 1 , wherein the encoded components are selected from the MA, the CasX protein, the protease, the protease cleavage site, the gNA, and the pseudotyping viral envelope glycoprotein or antibody fragment, wherein the components are encoded on one, two, three, or four individual nucleic acids.
- Embodiment III- 16 The XDP system of Embodiment III- 15, wherein
- a first nucleic acid encodes the MA, the CasX protein, the protease, and intervening protease cleavage sites between the components; and a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment and the gNA;
- a first nucleic acid encodes the MA, the CasX protein the protease, and intervening protease cleavage sites between the components; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; and a third nucleic acid encodes the gNA; or
- a first nucleic acid encodes the MA, an intervening protease cleavage site, and the CasX protein; a second nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment; a third nucleic acid encodes the gNA; and a fourth nucleic acid encodes the protease.
- Embodiment III- 17 The XDP system of Embodiment III- 15 or Embodiment III- 16, further comprising the CA component linked between the MA and the CasX protein components with intervening protease cleavage sites.
- Embodiment III- 18 The XDP system of Embodiment III- 1 , wherein the encoded components are selected from the gag polyprotein, the CasX protein, the protease, the protease cleavage site, the gNA, the pseudotyping viral envelope glycoprotein or antibody fragment, and the gag-pol polyprotein, wherein the components are encoded on two, three, or four individual nucleic acids.
- Embodiment III- 19 The XDP system of Embodiment III- 18, wherein
- a first nucleic acid encodes the gag polyprotein, the CasX protein, the protease, and intervening protease cleavage sites between the components; and a second nucleic acid encodes the gag-pol polyprotein, the pseudotyping viral envelope glycoprotein or antibody fragment, and the gNA; or
- a first nucleic acid encodes the gag polyprotein, the intervening protease cleavage site, and the CasX protein
- a second nucleic acid encodes the protease
- a third nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment, the gNA and the gag-pol polyprotein
- a first nucleic acid encodes the gag polyprotein, the intervening protease cleavage site, and the CasX protein
- a second nucleic acid encodes the protease
- a third nucleic acid encodes the pseudotyping viral envelope glycoprotein or antibody fragment
- a fourth nucleic acid encodes the gNA and the gag-pol polyprotein.
- Embodiment III-20 The XDP system of Embodiment III-2 or Embodiment III-3, comprising all or a portion of any one of the nucleic acid sequences of Table 6.
- Embodiment III-21 The XDP system of any one of the preceding embodiments of Set I of Set III, wherein the MA, the CA, the gag-TFR-PR polyprotein, the gag polyprotein, and the gag-pol polyprotein are derived from a retrovirus.
- Embodiment III-22 The XDP system of Embodiment III-21, wherein the retrovirus is selected from the group consisting of an alpharetrovirus, a betaretrovirus, a gammaretrovirus, a deltaretrovirus, an epsilonretrovirus, and a lentivirus.
- the retrovirus is selected from the group consisting of an alpharetrovirus, a betaretrovirus, a gammaretrovirus, a deltaretrovirus, an epsilonretrovirus, and a lentivirus.
- Embodiment III-23 The XDP system of Embodiment III-22, wherein the lentivirus is selected from the group consisting of human immunodeficiency- 1 (HIV-1), human immunodeficiency-2 (HIV-2), simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), and bovine immunodeficiency virus (BIV).
- HV-1 human immunodeficiency- 1
- HV-2 human immunodeficiency-2
- SIV simian immunodeficiency virus
- FIV feline immunodeficiency virus
- BIV bovine immunodeficiency virus
- Embodiment III-24 The XDP system of Embodiment III-23, wherein the lentivirus is HIV-1 or SIV.
- Embodiment III-25 The XDP system of any one of the preceding embodiments of Set I of Set III, wherein the gag polypeptide further comprises a p6 polypeptide.
- Embodiment III-26 The XDP system of any one of the preceding embodiments of Set I of Set III, wherein the gag polypeptide comprises a MA polypeptide, a CA polypeptide, a p2 polypeptide, an NC polypeptide, a pi polypeptide, and a p6 polypeptide, and wherein the gag polyprotein comprises one or more protease cleavage sites located between one or more of:
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Gastroenterology & Hepatology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Seeds, Soups, And Other Foods (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Medicinal Preparation (AREA)
Abstract
Description
Claims
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962944982P | 2019-12-06 | 2019-12-06 | |
US202062968915P | 2020-01-31 | 2020-01-31 | |
US202062983460P | 2020-02-28 | 2020-02-28 | |
US202063035576P | 2020-06-05 | 2020-06-05 | |
US202063120864P | 2020-12-03 | 2020-12-03 | |
PCT/US2020/063488 WO2021113772A1 (en) | 2019-12-06 | 2020-12-04 | Particle delivery systems |
Publications (1)
Publication Number | Publication Date |
---|---|
EP4069716A1 true EP4069716A1 (en) | 2022-10-12 |
Family
ID=74046189
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP20829466.0A Pending EP4069716A1 (en) | 2019-12-06 | 2020-12-04 | Particle delivery systems |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP4069716A1 (en) |
JP (1) | JP2023504536A (en) |
KR (1) | KR20220113442A (en) |
CN (1) | CN115175921A (en) |
AU (1) | AU2020398658A1 (en) |
CA (1) | CA3159320A1 (en) |
IL (1) | IL293591A (en) |
WO (1) | WO2021113772A1 (en) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020247882A1 (en) | 2019-06-07 | 2020-12-10 | Scribe Therapeutics Inc. | Engineered casx systems |
GB2616584A (en) * | 2020-12-03 | 2023-09-13 | Scribe Therapeutics Inc | Engineered class 2 type V CRISPR systems |
WO2022120089A1 (en) | 2020-12-03 | 2022-06-09 | Scribe Therapeutics Inc. | Compositions and methods for the targeting of ptbp1 |
WO2022261149A2 (en) | 2021-06-09 | 2022-12-15 | Scribe Therapeutics Inc. | Particle delivery systems |
IL311610A (en) | 2021-09-21 | 2024-05-01 | Scribe Therapeutics Inc | Engineered casx repressor systems |
WO2023212396A1 (en) * | 2022-04-29 | 2023-11-02 | The Board Of Trustees Of The Leland Stanford Junior University | High capacity lentiviral vectors |
WO2023235818A2 (en) | 2022-06-02 | 2023-12-07 | Scribe Therapeutics Inc. | Engineered class 2 type v crispr systems |
WO2023235888A2 (en) | 2022-06-03 | 2023-12-07 | Scribe Therapeutics Inc. | COMPOSITIONS AND METHODS FOR CpG DEPLETION |
EP4314267A1 (en) | 2022-06-07 | 2024-02-07 | Scribe Therapeutics Inc. | Compositions and methods for the targeting of pcsk9 |
WO2023240027A1 (en) | 2022-06-07 | 2023-12-14 | Scribe Therapeutics Inc. | Particle delivery systems |
WO2023240074A1 (en) | 2022-06-07 | 2023-12-14 | Scribe Therapeutics Inc. | Compositions and methods for the targeting of pcsk9 |
WO2023240157A2 (en) | 2022-06-08 | 2023-12-14 | Scribe Therapeutics Inc. | Compositions and methods for the targeting of dmd |
WO2023240162A1 (en) | 2022-06-08 | 2023-12-14 | Scribe Therapeutics Inc. | Aav vectors for gene editing |
CN117344061B (en) * | 2023-12-05 | 2024-02-27 | 云南省细胞质量检测评价中心有限公司 | Method, kit, primer and probe for simultaneously detecting five human viruses EBV, HBV, HCV, HIV, HPV and application of method |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE302854T1 (en) | 1993-01-26 | 2005-09-15 | Univ Pennsylvania | COMPOSITIONS AND METHODS FOR ADMINISTRATION OF GENETIC MATERIAL |
US5593972A (en) | 1993-01-26 | 1997-01-14 | The Wistar Institute | Genetic immunization |
US5962428A (en) | 1995-03-30 | 1999-10-05 | Apollon, Inc. | Compositions and methods for delivery of genetic material |
WO2010075303A1 (en) | 2008-12-23 | 2010-07-01 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Splicing factors with a puf protein rna-binding domain and a splicing effector domain and uses of same |
US9580714B2 (en) | 2010-11-24 | 2017-02-28 | The University Of Western Australia | Peptides for the specific binding of RNA targets |
JP7059179B2 (en) * | 2015-10-20 | 2022-04-25 | アンスティチュ ナショナル ドゥ ラ サンテ エ ドゥ ラ ルシェルシュ メディカル | Methods and products for genetic engineering |
EP3523426A4 (en) | 2016-09-30 | 2020-01-22 | The Regents of The University of California | Rna-guided nucleic acid modifying enzymes and methods of use thereof |
JP2020518275A (en) * | 2017-05-03 | 2020-06-25 | バイオマリン ファーマシューティカル インコーポレイテッド | Improved lentivirus for transduction of hematopoietic stem cells |
-
2020
- 2020-12-04 KR KR1020227022874A patent/KR20220113442A/en unknown
- 2020-12-04 EP EP20829466.0A patent/EP4069716A1/en active Pending
- 2020-12-04 JP JP2022533213A patent/JP2023504536A/en active Pending
- 2020-12-04 AU AU2020398658A patent/AU2020398658A1/en active Pending
- 2020-12-04 WO PCT/US2020/063488 patent/WO2021113772A1/en unknown
- 2020-12-04 CA CA3159320A patent/CA3159320A1/en active Pending
- 2020-12-04 CN CN202080094851.0A patent/CN115175921A/en active Pending
- 2020-12-04 IL IL293591A patent/IL293591A/en unknown
Also Published As
Publication number | Publication date |
---|---|
JP2023504536A (en) | 2023-02-03 |
CA3159320A1 (en) | 2021-06-10 |
AU2020398658A1 (en) | 2022-07-07 |
IL293591A (en) | 2022-08-01 |
CN115175921A (en) | 2022-10-11 |
WO2021113772A1 (en) | 2021-06-10 |
KR20220113442A (en) | 2022-08-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP4069716A1 (en) | Particle delivery systems | |
US11976277B2 (en) | Particle delivery systems | |
US20240026385A1 (en) | Engineered class 2 type v crispr systems | |
WO2021050601A1 (en) | Compositions and methods for use in immunotherapy | |
CA3120282A1 (en) | Fusosome compositions for hematopoietic stem cell delivery | |
KR20200083550A (en) | How to rescue a stop codon through gene redirection by ACE-tRNA | |
CA3159316A1 (en) | Compositions and methods for the targeting of rhodopsin | |
AU2021237633A1 (en) | Compositions and methods for the targeting of C9orf72 | |
JP2023510352A (en) | Compositions and methods for targeting PCSK9 | |
US20240026386A1 (en) | Compositions and methods for the targeting of bcl11a | |
KR20230044420A (en) | Methods and compositions for producing viral fusosomes | |
JP2003519476A (en) | Fusion protein delivery system and uses thereof | |
JP2019514414A (en) | Particles for capsid formation in genome engineering systems | |
EP4286401A1 (en) | Use of an artificial protein or of a nucleic acid encoding the artificial protein for providing a functional ribonucleoprotein complex | |
WO2012118092A1 (en) | Fusion protein | |
Rousseau | Engineering Virus-Like Particles for the Delivery of Genome Editing Enzymes | |
WO2023240027A1 (en) | Particle delivery systems | |
IL303360A (en) | Engineered class 2 type v crispr systems | |
CN117120607A (en) | Engineered class 2V-type CRISPR system | |
Bowden | Development of a viral and a non-viral based gene transfer systems using the yeast Saccharomyces cerevisiae | |
Beitzel | The role of HMGA proteins in retroviral integration |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: UNKNOWN |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20220630 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40073741 Country of ref document: HK |
|
DAV | Request for validation of the european patent (deleted) | ||
DAX | Request for extension of the european patent (deleted) |