WO2023241669A1 - Crispr-cas effector protein, gene editing system therefor, and application - Google Patents

Crispr-cas effector protein, gene editing system therefor, and application Download PDF

Info

Publication number
WO2023241669A1
WO2023241669A1 PCT/CN2023/100529 CN2023100529W WO2023241669A1 WO 2023241669 A1 WO2023241669 A1 WO 2023241669A1 CN 2023100529 W CN2023100529 W CN 2023100529W WO 2023241669 A1 WO2023241669 A1 WO 2023241669A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
gene editing
crispr
protein
cancer
Prior art date
Application number
PCT/CN2023/100529
Other languages
French (fr)
Chinese (zh)
Inventor
张红玲
任文丹
Original Assignee
尧唐(上海)生物科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 尧唐(上海)生物科技有限公司 filed Critical 尧唐(上海)生物科技有限公司
Publication of WO2023241669A1 publication Critical patent/WO2023241669A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/43Enzymes; Proenzymes; Derivatives thereof
    • A61K38/46Hydrolases (3)
    • A61K38/465Hydrolases (3) acting on ester bonds (3.1), e.g. lipases, ribonucleases
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/0008Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/005Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
    • A61K48/0058Nucleic acids adapted for tissue specific expression, e.g. having tissue specific promoters as part of a contruct
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P13/00Drugs for disorders of the urinary system
    • A61P13/12Drugs for disorders of the urinary system of the kidneys
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P21/00Drugs for disorders of the muscular or neuromuscular system
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • A61P25/14Drugs for disorders of the nervous system for treating abnormal movements, e.g. chorea, dyskinesia
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • A61P25/28Drugs for disorders of the nervous system for treating neurodegenerative disorders of the central nervous system, e.g. nootropic agents, cognition enhancers, drugs for treating Alzheimer's disease or other forms of dementia
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • A61P3/06Antihyperlipidemics
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • A61P35/02Antineoplastic agents specific for leukemia
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P43/00Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P7/00Drugs for disorders of the blood or the extracellular fluid
    • A61P7/06Antianaemics
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0684Cells of the urinary tract or kidneys
    • C12N5/0686Kidney cells
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian

Definitions

  • the present invention relates to the field of gene editing, specifically to a CRISPR-Cas effector protein, its gene editing system and applications.
  • the CRISPR locus is transcribed into a CRISPR RNA (crRNA) precursor (pre-crRNA), which is further processed into a small crRNA at the repetitive sequence in the presence of Cas protein and tracrRNA. Mature crRNA and Cas protein form Cas/crRNA complex.
  • crRNA guides the Cas/crRNA complex to find the target through its region complementary to the target sequence, and causes a double-stranded DNA break at the target position through the nuclease activity of the Cas protein, thereby causing the target DNA to lose its original shape.
  • Cas9, C2c1 and CasX all require two RNAs for guide RNA.
  • Common Cas9, C2c1, CasY and Cpf1 are usually around 1300 amino acids in size.
  • the PAM sequences of Cas9, Cpf1, CasX, and CasY are complex and diverse.
  • the existing CRISPR/Cas systems have problems such as serious off-target effects and low cleavage efficiency. Therefore, it is of great significance to develop new CRISPR/Cas systems with low off-target effects and high cleavage efficiency.
  • the main purpose of the present invention is to provide a CRISPR-Cas effector protein, its gene editing system and application, so as to solve the problem of low cutting efficiency of the CRISPR/Cas system in the existing technology.
  • a CRISPR-Cas effector protein includes any one of SEQ ID NO: 2, 1, 3-5 Proteins with at least 70% amino acid sequence identity.
  • the CRISPR-Cas effector protein includes more than 80%, preferably more than 90%, more preferably more than 95%, further preferably more than 99% of the amino acid sequence of any one of SEQ ID NO: 2, 1, 3-5. Identity protein; preferably, the CRISPR-Cas effector protein includes a RuvC domain.
  • the DNA molecule is a DNA molecule that is codon-optimized according to the codon preference of the host cell; preferably, the host cell includes a prokaryotic cell or a eukaryotic cell; preferably, the DNA molecule includes those in SEQ ID NO: 6 to 10 Any nucleotide sequence has 70% or more, preferably 90% or more, more preferably 95% or more, still more preferably 99%, even more preferably 100% nucleotide identity.
  • a gene editing system includes: a) an RNA guide or a nucleic acid encoding an RNA guide.
  • the RNA guide includes a direct repeat sequence and a spacer. sub-sequence, spacer sequence for hybridization with the target nucleic acid; b) the above-mentioned CRISPR-Cas effector protein, or CRISPR-Cas effector fusion protein, or DNA molecule, or recombinant vector, or host cell; DNA molecule, recombinant vector or The host cell can express CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein.
  • the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein combines with the RNA guide to perform the targeting function.
  • the hybridization sequence formed by hybridization between the spacer sequence and the target nucleic acid.
  • RNA guide includes one or more types.
  • the target nucleic acid includes DNA; preferably, DNA includes DNA derived from eukaryotes or DNA derived from prokaryotes; preferably, eukaryotes include animals or plants; preferably, DNA includes non-human mammalian DNA, Human DNA, insect DNA, avian DNA, reptile DNA, amphibian DNA, rodent DNA, fish DNA, worm DNA, nematode DNA or yeast DNA; preferably, non-human mammalian DNA includes non-human primate DNA .
  • the direct repeat sequence includes a nucleotide sequence having at least 80% identity with the nucleotide sequence of SEQ ID NO: 24, 27, 30, 33 or 36; preferably, the direct repeat sequence includes a nucleotide sequence with SEQ ID NO:
  • the nucleotide sequence of 24, 27, 30, 33 or 36 has at least 85% or more, more preferably more than 90%, further preferably more than 95% identity; preferably, the direct repeat sequence includes SEQ ID NO: 24, 27, 30, 33 or 36 nucleotide sequence.
  • the direct repeat sequence includes a first direct repeat sequence and a second direct repeat sequence; preferably, the RNA guide includes a first direct repeat sequence, a spacer sequence and a second direct repeat sequence connected in sequence; preferably, The first direct repeat sequence is the same as the second direct repeat sequence.
  • the target nucleic acid includes a pre-spacer adjacent motif
  • the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein can recognize the pre-spacer adjacent motif
  • the pre-spacer adjacent motif includes the nucleic acid sequence 5' -TTN-3', where N is any nucleotide; preferably, N is A, C or T.
  • the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is combined with the RNA guide to form a protein-nucleic acid complex; preferably, the protein-nucleic acid complex is non-naturally occurring or modified; preferably , at least one component of the protein-nucleic acid complex is non-naturally occurring or modified.
  • a gene editing vector which contains the above-mentioned nucleic acid encoding an RNA guide.
  • the gene editing vector also contains the above-mentioned DNA molecule; preferably, the DNA molecule and the nucleic acid encoding the RNA guide are located on the same or different vectors; preferably, the DNA molecule is connected to the first regulatory element; preferably, the nucleic acid encoding the RNA guide
  • the nucleic acid of the object is connected to the second regulatory element; preferably, the first regulatory element and the second regulatory element are independently selected from one or more of inducible promoters, constitutive promoters or tissue-specific promoters; preferably Ground, the first regulatory element and the second regulatory element are independently selected from T7, SP6, T3, CMV, EF1a, SV40, PGK1, human ⁇ -actin, CAG, U6, H1, T7, T7lac, araBAD, trp, lac or Ptac one or more of them.
  • a method for combining the above gene editing system with a target nucleic acid in a cell includes: delivering the gene editing system to the cell, and the cell includes the target nucleic acid; CRISPR-Cas effector proteins or CRISPR-Cas effector fusion proteins combine with the RNA guide to allow the spacer sequence to bind to the target nucleic acid.
  • the target nucleic acid is double-stranded DNA or single-stranded DNA; preferably, the combination of the gene editing system and the target nucleic acid in the cell results in a change in the expression state of the target nucleic acid; preferably, the combination of the gene editing system and the target nucleic acid in the cell, Cause the target nucleic acid to be cleaved; preferably, the target nucleic acid is cleaved to cause the destruction of the target nucleic acid, or the replacement of a specific site of the target nucleic acid, or the removal of the target nucleic acid site, or the change of the function of the target nucleic acid region, or two positions on the target nucleic acid.
  • the sequence between points is inverted.
  • a cell containing a gene editing system includes the above gene editing system or a gene editing vector.
  • a method for targeting and editing a target nucleic acid includes contacting the target nucleic acid with the above-mentioned gene editing system.
  • a method for non-specific degradation of single-stranded DNA after identifying a target nucleic acid includes contacting the target nucleic acid with the above-mentioned gene editing system.
  • a method of targeting and nicking the non-spacer complementary strand of the double-stranded target DNA after identifying the spacer complementary strand of the double-stranded target DNA includes contacting double-stranded target DNA with the above-mentioned gene editing system.
  • a method for targeting and cutting double-stranded target DNA includes contacting the double-stranded target DNA with the above-mentioned gene editing system.
  • the non-spacer complementary strand of the double-stranded target DNA is nicked.
  • a method for inducing cell state changes includes contacting the above gene editing system with a target nucleic acid in the cell.
  • the cell state includes apoptosis or dormancy; preferably, the cells include eukaryotic cells or prokaryotic cells; preferably, the cells include mammalian cells or plant disease cells; preferably, the cells include cancer cells; preferably, the cells include infection Sexual cells or cells infected by an infectious agent; preferably, the cells include virus-infected cells, prion-infected cells; preferably, the cells include fungal cells, protozoa or parasite cells.
  • the application includes administering the gene editing system to a subject or ex vivo cells of a subject; preferably, the spacer sequence is complementary to at least 15 nucleotides of the target nucleic acid associated with the disorder or disease, the CRISPR-Cas effect
  • the daughter protein or CRISPR-Cas effector fusion protein cleaves the target nucleic acid
  • the disorder or disease includes cancer, infectious disease, metabolic disease or genetic disease
  • the cancer includes Wilms tumor, Ewing sarcoma, neuroendocrine disease tumor, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, kidney cancer, pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrium Cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myelom
  • a eukaryotic cell line which eukaryotic cell line contains the above-mentioned cells containing the gene editing system, or is the progeny of the cells containing the gene editing system.
  • a multicellular organism which includes the above-mentioned cell containing the gene editing system.
  • multicellular organisms include model animals or model plants.
  • a method for obtaining plants with target traits is provided, using the above gene editing system to contact plant cells, modifying the genes of the plant cells or introducing the target genes,
  • the modified or target gene can express the target traits, obtain modified plant cells, and use the modified plant cells for regeneration to obtain plants with the target traits.
  • a container which container contains the above-mentioned kit.
  • the container includes a sterile container; preferably, the container includes a syringe.
  • an implantable device which includes the above gene editing system.
  • the gene editing system is in the matrix; preferably, the gene editing system is in the reservoir.
  • the above-mentioned CRISPR-Cas effector protein or a derivative of the above-mentioned CRISPR-Cas effector protein or a functional fragment of the above-mentioned CRISPR-Cas effector protein, as well as a heterologous functional domain, and an RNA guide or an RNA guide encoding the RNA guide nucleic acid;
  • the cancer includes Wilms tumor, Ewing sarcoma, neuroendocrine tumor, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, Kidney cancer, pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphocyte One or more of leukemia, acute myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
  • metabolic diseases include familial hypercholesterolemia (FH);
  • conditions or diseases include cancer, infectious diseases, metabolic diseases or genetic diseases;
  • the genetic diseases include transthyretin amyloidosis (ATTR), primary hyperoxaluric acidosis (PH1), and hereditary angioedema (HAE);
  • Figure 1 shows a schematic diagram of the protein domain of CasY1-CasY5 according to Embodiment 1 of the present invention, in which D, E, and D represent the catalytic residues of the three conserved motifs I, II, and III of the RuvC domain, and h represents the bridge. spiral structure.
  • Figure 4 shows a schematic diagram of the negative selection screening workflow according to Embodiment 2 of the present invention.
  • Figure 6 shows a schematic diagram of CasY1, CasY2, CasY3, CasY4 and CasY5 targeted cutting plasmids according to Embodiment 3 of the present invention.
  • Figure 7 shows the bacterial cleavage results of CasY1, CasY2, CasY3, CasY4 and CasY5 according to Example 3 of the present invention.
  • CRISPR RNA refers to RNA molecules containing guide sequences used by CRISPR effectors to target specific nucleic acid sequences.
  • crRNA contains spacer sequences that mediate target recognition and direct repeat sequences (also known as direct repeat or "DR" sequences) that form complexes with CRISPR-Cas effector proteins.
  • donor template nucleic acid refers to a nucleic acid molecule that can be used by one or more cellular proteins to alter the structure of the target nucleic acid after the CRISPR enzyme described herein has altered the target nucleic acid.
  • the donor template nucleic acid is linear.
  • CRISPR-Cas effector refers to a protein that performs an enzymatic activity or binds to a target site on a nucleic acid specified by an RNA guide.
  • CasY1, CasY2, CasY3, CasY4 and CasY5 proteins in the present invention can recognize PAMs containing or consisting of TTN, where N represents any nucleotide.
  • target nucleic acid refers to a specific nucleic acid substrate that contains a nucleic acid sequence complementary to all or part of the spacer in the RNA guide.
  • the target nucleic acid comprises a gene or a sequence within a gene.
  • the target nucleic acid includes a non-coding region (eg, a promoter).
  • the target nucleic acid is single-stranded or double-stranded.
  • the numerical value represented by "at least numerical value A” means a range including greater than or equal to numerical value A.
  • the use of “substantially” or “substantially” means that the standard deviation from the theoretical model or theoretical data is within the range of 5%, preferably 3%, and more preferably 1%.
  • the meaning expressed by "can” includes both the meaning of performing a certain process and not performing a certain process.
  • references to “some specific/preferred embodiments”, “other specific/preferred embodiments”, “implementations”, etc. refer to the specific elements described related to the embodiment (for example, Features, structures, properties and/or characteristics) are included in at least one embodiment described herein and may or may not be present in other embodiments. Additionally, it is to be understood that the described elements may be combined in various embodiments in any suitable manner.
  • nucleic acid and “nucleic acid molecule” refer to a compound containing a nucleobase and an acidic moiety, such as a nucleoside, a nucleotide, or a polymer of nucleotides.
  • polymeric nucleic acids such as nucleic acid molecules containing three or more nucleotides, are linear molecules in which adjacent nucleotides are linked to each other by phosphodiester bonds.
  • nucleic acid refers to a single nucleic acid residue (eg, a nucleotide and/or nucleoside).
  • nucleic acid refers to an oligonucleotide chain comprising three or more individual nucleotide residues.
  • oligonucleotide and “polynucleotide” are used interchangeably to refer to a polymer of nucleotides (eg, a string of at least three nucleotides).
  • nucleic acid includes RNA as well as single- and/or double-stranded DNA.
  • Nucleic acids may be naturally occurring, for example, in the context of genomes, transcripts, mRNA, tRNA, rRNA, siRNA, snRNA, plasmids, cosmids, chromosomes, chromatids or other naturally occurring nucleic acid molecules.
  • a nucleic acid molecule may be a non-naturally occurring molecule, such as a recombinant DNA or RNA, an artificial chromosome, an engineered genome or a fragment thereof, or a synthetic DNA, RNA, DNA/RNA hybrid, or include non-naturally occurring nucleotides or nucleosides.
  • nucleic acid examples include nucleic acid analogs, for example, analogs having backbones other than a phosphodiester backbone.
  • Nucleic acids can be purified from natural sources, produced and optionally purified using recombinant expression systems, chemically synthesized, and the like. Where appropriate, for example in the case of chemically synthesized molecules, the nucleic acid may comprise nucleoside analogs, for example analogs with chemically modified bases or sugars and backbone modifications.
  • polypeptide As used herein, the terms "polypeptide,” “peptide,” and “protein” are used interchangeably herein and refer to a polymer of amino acids of any length.
  • the polymer can be linear or branched, it can contain modified amino acids, and it can be interrupted by non-amino acids.
  • the term also includes amino acid polymers that have been modified (eg, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component).
  • CRISPR-Cas effector fusion protein refers to a hybrid polypeptide comprising protein domains from at least two different proteins.
  • a protein can be located at the amino-terminal (N-terminal, N-terminal) portion or the carboxyl-terminal (C-terminal, C-terminal) portion of a CRISPR-Cas effector fusion protein, thus forming an "amino-terminal CRISPR-Cas” respectively Effector fusion protein” or "carboxy-terminal CRISPR-Cas effector fusion protein".
  • Any protein provided herein can be produced by any method known in the art.
  • the proteins provided herein can be produced via recombinant protein expression and purification, which is particularly suitable for CRISPR-Cas effector fusion proteins containing peptide linkers.
  • Methods used for recombinant protein expression and purification are Well known and include those described below, see, for example, Green and Sambrook, Molecular Cloning: A Laboratory Manual (4th ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (2012)).
  • the structure of a protein can be changed without adversely affecting its activity and functionality.
  • one or more conservative amino acid substitutions can be introduced in the amino acid sequence of the protein without affecting the activity and/or functionality of the protein molecule. Or adversely affect the three-dimensional structure. Examples and implementations of conservative amino acid substitutions will be apparent to those skilled in the art.
  • the amino acid residue can be replaced with another amino acid residue belonging to the same group as the site to be replaced, that is, a non-polar amino acid residue can be substituted for another non-polar amino acid residue, and a polar uncharged amino acid residue can be substituted.
  • Non-essential amino acid residues are amino acid residues that can be altered (deletion, substitution or replacement) without altering biological activity, whereas "essential” amino acid residues are required for biological activity.
  • a “conservative amino acid substitution” is a substitution in which an amino acid residue is replaced by an amino acid residue with a similar side chain. Amino acid substitutions can be made in non-conserved regions of the above-mentioned Cas proteins. Generally, such substitutions are not made to conserved amino acid residues, or to amino acid residues located within conserved motifs where such residues are required for protein activity. However, those skilled in the art will appreciate that functional variants may have fewer conservative or non-conservative changes in conserved regions.
  • CRISPR refers to clustered regularly interspaced short palindromic repeats (Clustered regularly interspaced short palindromic repeats), which are derived from the immune system of microorganisms.
  • target sequence refers to the nucleotide sequence in the target nucleic acid that is complementary or at least partially complementary to crRNA. After the Cas protein, crRNA and the target sequence form a ternary complex, the Cas protein exerts its influence on the target nucleic acid. Specific cleavage activity on target nucleic acid strands and/or non-nucleotide strands.
  • target sequence is used interchangeably with “target nucleic acid,” “target polynucleotide,” “target sequence,” and “target nucleic acid sequence.”
  • target strand refers to the nucleotide strand in the target nucleic acid that hybridizes to crRNA; the term “non-target strand” refers to the nucleotide strand in the target nucleic acid that does not hybridize with crRNA.
  • the term "deaminase” or “deaminase domain” refers to a protein or enzyme that catalyzes a deamination reaction.
  • the deaminase is an adenosine deaminase, which catalyzes the hydrolytic deamination of adenine to hypoxanthine.
  • the deaminase is an adenosine deaminase, which catalyzes the hydrolytic deamination of adenosine or adenine (A) to inosine (I).
  • Base Editor refers to a reagent that binds to a polynucleotide and has nucleobase modifying activity.
  • a base editor comprises a nucleobase modifying polypeptide (e.g., a deaminase) and a nucleic acid programmable nucleotide binding domain (e.g., a guide RNA) that binds to a guide polynucleotide (e.g., a guide RNA)
  • a guide polynucleotide e.g., a guide RNA
  • nucleic acid-programmable DNA-binding proteins e.g., a guide RNA
  • the agent is a biomolecular complex comprising a protein domain having base editing activity, i.e., capable of modifying bases (e.g., A, T, C, G or U).
  • the polynucleotide programmable DNA binding domain is fused or linked to a deaminase domain.
  • the agent is a CRISPR-Cas effector fusion protein comprising a domain with base editing activity.
  • a domain with base editing activity is capable of deaminating bases within a nucleic acid molecule.
  • the base editor is capable of deaminating one or more bases within a DNA molecule.
  • the base editor is an adenosine base editor (ABE).
  • coding sequence or "protein coding sequence” as used interchangeably herein refer to a polynucleotide fragment that encodes a protein. This region or sequence has a start codon near the 5' end and a stop codon near the 3' end.
  • the coding sequence may also be called an open reading frame.
  • nuclear localization sequence and “nuclear localization signal (NuclearLocalization Signal, NLS)” refer to the amino acid sequence that promotes protein import into the nucleus.
  • Nuclear localization sequences are known in the art and are described, for example, in Plank et al., International PCT application, PCT/EP2000/011690, filed on November 23, 2000, published as WO/2001/038547 on May 31, 2001, which The contents are incorporated herein by reference for their disclosure of exemplary nuclear localization sequences.
  • the NLS is an optimized NLS, such as described by Koblan et al., Nature Biotech. 2018 doi: 10.1038/nbt.4172.
  • the terms “complementary” or “hybrid” are used to refer to “polynucleotide” and “oligonucleotide” (which are interchangeable terms referring to is the nucleotide sequence).
  • the sequence “CAGT” is complementary to the sequence “GTCA”.
  • Complementarity can be “partial” or “total.”
  • Partial complementarity means that one or more nucleic acid bases are mismatched according to the base pairing rules.
  • “Total” or “complete” complementarity between nucleic acids means that each nucleic acid base is matched with another base under base pairing. Base matching rules.
  • the degree of complementarity between nucleic acid strands has an important impact on the efficiency and strength of hybridization between nucleic acid strands. This is particularly important in amplification reactions and detection methods that depend on binding between nucleic acids.
  • hybridization refers to any process that uses a nucleic acid strand to combine with a complementary strand through base pairing to form a hybridization complex to pair complementary nucleic acids.
  • sequence identity and “percent identity” refer to the percentage of nucleotides or amino acids that are identical (i.e., identical) between two or more polynucleotides or polypeptides. Sequence identity between two or more polynucleotides or polypeptides can be determined by aligning the nucleotide or amino acid sequences of the polynucleotides or polypeptides and the aligned polynucleotides or polypeptides containing The number of positions with identical nucleotides or amino acid residues is scored and compared to the number of positions in the aligned polynucleotide or polypeptide that contain different nucleotides or amino acid residues.
  • two or more sequences or subsequences have at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleotides "sequence identity" or "percent identity”.
  • sequences are substantially identical throughout the entire length of either or both compared biopolymers (eg, polynucleotides).
  • vector refers to a means of introducing a nucleic acid sequence into a cell thereby producing a transformed cell.
  • Vectors include plasmids, transposons, phages, viruses, liposomes, and episomes.
  • a "recombinant vector” or “expression vector” is a nucleic acid sequence containing a nucleotide sequence to be expressed in a recipient cell.
  • Expression vectors may include additional nucleic acid sequences to facilitate and/or facilitate expression of introduced sequences, such as initiation, termination, enhancers, promoters, and secretion sequences.
  • mammals include, but are not limited to, domesticated animals (e.g., cows, sheep, cats, dogs, and horses), primates (e.g., humans and non-human primates such as monkeys), rabbits, and rodents (e.g., mice and rats). mouse). In particular, individuals are people.
  • the methods disclosed herein may be performed in vitro, ex vivo, or in vivo, or the products may be present in in vitro, ex vivo, or in vivo forms.
  • in vitro refers to experiments using materials, biological substances, cells and/or tissues in laboratory conditions or culture media; whereas the term “in vivo” refers to experiments and procedures using intact multicellular organisms.
  • methods performed in vivo can be performed on non-human animals.
  • Ex vivo refers to an event that exists or occurs outside an organism, such as outside a human or animal body, such as an event that may exist or occur on tissue (eg, a whole organ) or cells taken from an organism.
  • the term "pharmaceutically acceptable carrier” refers to a pharmaceutically acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, manufacturing aid (e.g. , lubricants, talc, magnesium stearate, calcium or zinc or stearic acid) or solvent encapsulated materials involving the delivery or transport of compounds from one site of the body (e.g., delivery site) to another site (e.g., organ, tissue or body part).
  • a pharmaceutically acceptable carrier is "acceptable” in the sense of being compatible with the other ingredients of the formulation and not deleterious to the tissues of the subject (eg, physiologically compatible, sterile, physiological pH, etc.).
  • materials that can serve as pharmaceutically acceptable carriers include: (1) sugars, such as lactose, glucose, and sucrose; (2) starches, such as corn starch and potato starch; (3) cellulose and its derivatives, such as Sodium carboxymethylcellulose, methylcellulose, ethylcellulose, microcrystalline cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) lubricant, Such as magnesium stearate, sodium lauryl sulfate and talc; (8) Excipients, such as cocoa butter and suppository wax; (9) Oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, Corn oil and soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerol, sorbitol, mannitol, and polyethylene glycol (PEG); (12) esters, such as ethyl oleate
  • wetting agents, colorants, release agents, coating agents, sweeteners, flavorings, aroma Agents, preservatives and antioxidants may also be present in the formulation.
  • Terms such as “excipient,””pharmaceutically acceptable carrier,” and the like are used interchangeably herein.
  • an effective amount of a base editor may refer to an amount of base editor sufficient to induce mutation of a target site that is specifically bound by the mutation of the base editor.
  • effective amounts of reagents such as base editor CRISPR-Cas effector fusion proteins, deaminases, polynucleotides, etc., can vary depending on various factors, such as the desired biological response. , vary, for example, with the specific allele, genome or target site to be edited, with the cells or tissues targeted and the reagents used.
  • treatment means, as described herein, intended to reverse, alleviate a disease or condition or one or more symptoms thereof, delay the onset of a disease or condition or one or more symptoms thereof, or inhibit a disease or condition or clinical intervention for the progression of one or more of its symptoms.
  • treatment means, as described herein, intended to reverse, alleviate a disease or condition or one or more symptoms thereof, delay the onset of a disease or condition or one or more symptoms thereof, or Clinical intervention that inhibits the progression of a disease or condition or one or more of its symptoms.
  • treatment may be administered after one or more symptoms have developed and/or the disease has been diagnosed.
  • treatment may be administered in the absence of symptoms, for example, to prevent or delay the onset of symptoms or to inhibit the onset or progression of a disease.
  • the treatment may be administered to a susceptible individual prior to the onset of symptoms (eg, in view of a history of symptoms and/or in view of genetic or other susceptibility factors). Treatment can also be continued after symptoms have subsided, for example to prevent or delay their recurrence.
  • subject includes humans and non-human animals.
  • Non-human animals include vertebrates, such as mammals, and non-mammals, such as non-human primates, sheep, cats, horses, cattle, chickens, dogs, mice, rats, goats, rabbits, and pigs.
  • the subject is human.
  • patient or subject are used interchangeably herein.
  • a CRISPR-Cas effector protein which CRISPR-Cas effector protein includes at least 70% of the amino acid sequence of any one of SEQ ID NO: 1 to 5.
  • Identity proteins e.g., 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84 %, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%.
  • the CRISPR-Cas effector protein includes more than 80%, preferably more than 90%, more preferably more than 95%, further preferably more than 99% of the amino acid sequence of any one of SEQ ID NO: 1 to 5. % or above identity; preferably, the CRISPR-Cas effector protein includes a RuvC domain.
  • the CRISPR-Cas effector protein includes:
  • a protein with one or more point mutations as follows: K8X, F15X, N17X, K20X, K471X, W483X, H502X, R505X, K557X, K556X, R560X, Y673X, L676X, Y723X, N822X, K823X, E826X, K827X, K830X, K880X, L887X; or
  • a protein with one or more point mutations as follows: K317X, W330X, Y351X, K354X, D392X, F395X, N399X, Y509X, V512X, Y568X, N662X, K663X, E666X, R667X, K670X, K719X, L726X; or
  • a protein with one or more point mutations as follows: M9X, V16X, D18X, K21X, K518X, W531X, F550X, K553X, R609X, Y612X, R616X, Y730X, L733X, Y781X, N879X, K880X, E883X, K884X, K887X, K936X, F943X; where X is any amino acid.
  • the above-mentioned CRISPR-Cas effector protein has cutting activity and can specifically or non-specifically cut nucleotide chains to achieve the activity of the CRISPR-Cas system.
  • amino acid changes can occur at the active site or inactive site of the protein. , including within or outside the RuvC domain.
  • the protein obtained by changing the amino acid still has the cleavage activity of CRISPR-Cas effector protein.
  • a CRISPR-Cas effector fusion protein includes the above-mentioned CRISPR-Cas effector protein, or a derivative of the CRISPR-Cas effector protein. or functional fragments of CRISPR-Cas effector proteins, as well as heterologous functional domains.
  • the sequence of the functional fragment is less than the full-length sequence but retains the cleavage function of the above-mentioned CRISPR-Cas effector protein.
  • the missing residues in the functional fragment can be at the N-terminal, C-terminal and/or internal.
  • Derivatives refer to at least about 80% sequence identity with the above-mentioned CRISPR-Cas effector protein, and possess at least one of the same functions, such as the ability to bind to and form a complex with a crRNA containing at least one DR sequence.
  • Reasons for derivative formation include, but are not limited to, conservative amino acid residue substitutions.
  • CRISPR-Cas effectors are obtained by fusing heterologous functional domains to CRISPR-Cas effector proteins with cleavage activity, or derivatives of CRISPR-Cas effector proteins, or functional fragments of CRISPR-Cas effector proteins. Fusion proteins can have the activity of heterologous functional domains on the basis of normal cleavage activity. In specific use, heterologous functional domains in the existing technology can be flexibly selected to increase the function of the fusion protein.
  • the heterologous functional domain is located at the N-terminal, C-terminal or internal part of the CRISPR-Cas effector fusion protein; preferably, the heterologous functional domain includes but is not limited to positioning signal, reporter protein, CRISPR -Cas effector protein targeting portion, DNA binding domain, epitope tag, transcription activation domain, transcription repression domain, nuclease, deamination domain, methylase, demethylase, transcription release factor, HDAC (group One or more of protein deacetylase), cleavage active polypeptide, ligase;
  • the localization signal includes but is not limited to nuclear localization signal and/or nuclear export signal;
  • the nuclear export signal includes but is not limited to Human protein tyrosine kinase 2;
  • the reporter protein includes but is not limited to glutathione-S-transferase, horseradish peroxidase, chloramphenicol acetyltransferas
  • the above-mentioned deamination domain includes a deaminase or a functional fragment of a deaminase.
  • the effect of a base editor can be achieved by fusing the above-mentioned deamination domain with the above-mentioned heterologous functional domain and CRISPR-Cas effector protein.
  • Fusion proteins containing nuclear localization signals can interact with nuclear import carriers so that proteins can be transported into the nucleus.
  • the DNA binding domain DBD can recognize specific DNA sequences, thereby improving the targeting of the fusion protein.
  • a DNA molecule encoding the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is provided.
  • the DNA molecule is a DNA molecule that is codon-optimized according to the codon preference of the host cell; preferably, the host cell includes a prokaryotic cell or a eukaryotic cell; preferably, the DNA molecule includes SEQ ID
  • the nucleotide sequence of any one of NO: 6 to 10 has more than 70% (for example, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79% , 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96 %, 97%, 98%, 99% or 100%), preferably 90% or more, more preferably 95% or more, further preferably 99%, and still more preferably 100% identical nucleotides.
  • the above-mentioned DNA molecules include isolated DNA molecules. It can encode the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein through transcription and translation.
  • the above-mentioned DNA molecules include single-stranded or double-stranded DNA, which can carry genetic information and thus achieve the function of encoding proteins. According to the host cell in which the DNA molecule is located or where the encoding occurs Codon preference and flexible codon optimization can achieve efficient expression of DNA molecules.
  • the above-mentioned nucleotides shown in SEQ ID NO: 6 to 10 can respectively encode the proteins shown in SEQ ID NO: 1 to 5.
  • a recombinant vector which includes the above-mentioned DNA molecule.
  • the DNA molecule is connected to a promoter; preferably, the promoter includes but is not limited to one or more of an inducible promoter, a constitutive promoter or a tissue-specific promoter; preferably , promoters include but are not limited to one or more of T7, SP6, T3, CMV, EF1a, SV40, PGK1, human ⁇ -actin, CAG, U6, H1, T7, T7lac, araBAD, trp, lac or Ptac;
  • the recombinant vector includes, but is not limited to, a retroviral vector, a lentiviral vector, an adenoviral vector, an adeno-associated virus vector, a herpes simplex vector or a phagemid vector; preferably, the recombinant vector includes a plasmid vector.
  • a host cell is provided, and the host cell is transformed with the above-mentioned recombinant vector.
  • a gene editing system includes: a) an RNA guide or a nucleic acid encoding an RNA guide.
  • the RNA guide includes a direct repeat sequence and a spacer sequence. , the spacer sequence is used to hybridize with the target nucleic acid; b) the above-mentioned CRISPR-Cas effector protein, or CRISPR-Cas effector fusion protein, or DNA molecule, or recombinant vector, or host cell; DNA molecule, recombinant vector or host cell Able to express CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein.
  • the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is combined with the RNA guide to target the spacer.
  • the above-mentioned gene editing system includes RNA guides or nucleic acids encoding RNA guides, and CRISPR-Cas effector proteins, or CRISPR-Cas effector fusion proteins, or DNA molecules, recombinant vectors or host cells capable of expressing the above-mentioned proteins.
  • This gene editing system hybridizes and combines the RNA guide with the target nucleic acid to form a hybrid sequence; the RNA guide on the hybrid sequence can combine with the above-mentioned protein to form a complex, thereby positioning the protein close to the target nucleic acid, and the protein can exert activity and target.
  • the above-mentioned target nucleic acid may be modified in a targeted or non-targeted manner by cutting, nicking or the like.
  • the gene editing system does not contain tracrRNA.
  • TracrRNA is the abbreviation of trans-activating crRNA. It is transcribed separately in the gene editing system involving Cas9 protein and combines with crRNA to form gRNA (guide RNA), which combines with Cas9 protein to guide protein positioning. If the above gene editing system does not contain tracrRNA, the length and molecular weight of the gRNA that guides protein positioning will be smaller, and the binding domain between gRNA and Cas protein will be smaller, thereby reducing the molecular weight and size of the Cas protein and enriching the applications of the CRISPR-Cas system. Scenes.
  • the RNA guide includes one or more types.
  • RNA guides include 1 type, 2 types, 3 types, 4 types, 5 types or even more types.
  • the gene editing system includes a variety of RNA guides and can hybridize with multiple target nucleic acids at the same time, thereby exerting the activity of the gene editing system at multiple target nucleic acid sites, which can greatly improve the efficiency of gene editing and reduce the cost of gene editing and cell
  • the time required for experiments such as passaging and verification of editing results.
  • the target nucleic acid includes DNA; preferably, DNA includes DNA derived from eukaryotic organisms or DNA derived from prokaryotes; preferably, eukaryotic organisms include but are not limited to animals or plants; preferably, eukaryotic organisms include but are not limited to animals or plants; , DNA includes but is not limited to non-human mammal DNA, human DNA, insect DNA, avian DNA, reptile DNA, amphibian DNA, rodent DNA, fish DNA, worm DNA, nematode DNA or yeast DNA; Preferably, Non-human mammalian DNA includes, but is not limited to, non-human primate DNA.
  • RNA guides can be hybridized with DNA from different sources to achieve gene editing of multiple species.
  • the 3' end of the direct repeat sequence includes a stem-loop structure
  • the stem-loop structure includes a first stem nucleotide chain, a cyclic nucleotide chain, and a second stem nucleotide chain connected in sequence.
  • the first stem nucleotide chain and the second stem nucleotide chain hybridize with each other to form a stem of a stem-loop structure, and the cyclic nucleotide chain forms a ring of a stem-loop structure; preferably, the length of the first stem nucleotide chain is 5 or 6 nucleotides; preferably, the length of the second stem nucleotide chain is 5 nucleotides; preferably, the length of the cyclic nucleotide chain is 6, 7 or 8 nucleotides.
  • the stem-loop structure includes the nucleotide sequence of SEQ ID NO: 25, 28, 31, 34 or 37.
  • the above-mentioned RNA guide includes direct repeat sequences and spacer sequences, wherein the 3' end of the direct repeat sequence is a stem-loop structure with secondary structure.
  • the stem-loop structure includes a first stem nucleotide chain and a second stem nucleotide chain that can hybridize to each other to form a double strand, and the cyclic nucleotide chain forms a ring structure.
  • This stem-loop structure can bind to CRISPR-Cas effector proteins or CRISPR-Cas effector fusion proteins to guide protein positioning.
  • the N in the sequence indicates all bases.
  • the stem-loop structure corresponding to the nucleotide described in SEQ ID NO: 25 can bind to CasY1 protein; the stem-loop structure corresponding to the nucleotide described in SEQ ID NO: 28 can bind to CasY2 Protein binding; the stem-loop structure corresponding to the nucleotide described in SEQ ID NO: 31 can bind to the CasY3 protein; the stem-loop structure corresponding to the nucleotide described in SEQ ID NO: 34 can bind to the CasY4 protein; SEQ The stem-loop structure corresponding to the nucleotide described in ID NO: 37 can bind to the CasY5 protein.
  • the protein can be freely combined with the above-mentioned stem-loop structure, or can be flexibly selected according to the above-mentioned identity without affecting the binding of the protein to the RNA guide.
  • the direct repeat sequence includes at least 80% (e.g., 80%, 81%, 82%, 83%) with the nucleotide sequence of SEQ ID NO: 24, 27, 30, 33 or 36 , 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100 %) nucleotide sequence identity; preferably, the direct repeat sequence includes at least 85% or more, more preferably 90% or more, and further preferably more than 90% identity with the nucleotide sequence of SEQ ID NO: 24, 27, 30, 33 or 36 A nucleotide sequence with more than 95% identity; preferably, the direct repeat sequence includes the nucleotide sequence of SEQ ID NO: 24, 27, 30, 33 or 36.
  • the spacer sequence includes 80% (e.g., 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90% , 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%) or more complementary to the target nucleic acid; preferably, the spacer sequence includes more than 90%, more preferably More than 95%, more preferably more than 99%, even more preferably 100% complementary to the target nucleic acid; preferably, the length of the spacer sequence is 18-41nt; preferably, the length of the spacer sequence is 18-37nt; preferably, the spacer sequence The length of the subsequence is 18-26nt or 34-36nt; preferably, the length of the spacer subsequence is 20nt.
  • 80% e.g., 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90% , 9
  • the above-mentioned spacer sequence includes more than 80% fragments that are complementary to the target nucleic acid.
  • the RNA guide with the spacer sequence can bind firmly to the target nucleic acid, thereby realizing the gene editing system's targeting of the target nucleic acid. modification.
  • the length of the spacer sequence can be flexibly selected within a certain range. If the length is too short, the complementary chain formed will be shorter, have poor binding force, and have weak binding specificity, affecting gene editing efficiency and off-target rate; if the length is too long, The increased length of the RNA guide and the difficulty of complementary pairing will also affect the gene editing efficiency.
  • the direct repeat sequence includes a first direct repeat sequence and a second direct repeat sequence; preferably, the RNA guide includes a first direct repeat sequence, a spacer sequence and a second direct repeat sequence connected in sequence. Repeating sequence; preferably, the first direct repeating sequence and the second direct repeating sequence are identical.
  • RNA guide in this application refers to any RNA molecule that facilitates targeting of a protein of the invention to a target nucleic acid, including but not limited to crRNA, pre-crRNA (e.g., DR-spacer-DR), and mature crRNA (e.g., mature DR-spacer). sub, mature DR-spacer-mature DR).
  • the RNA guide used in the examples of this application is pre-crRNA.
  • the direct repeat sequence includes a first direct repeat sequence and a second direct repeat sequence located at both ends of the spacer sequence.
  • the stem-loop structure is between the first and second direct repeat sequence. all exist.
  • active RNA guides can be further obtained. For example, for the pre-crRNA composed of DR-spacer-DR, one of the direct repeat sequences is deleted after processing.
  • the target nucleic acid contains a pre-spacer adjacent motif
  • the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is capable of recognizing the pre-spacer adjacent motif (PAM).
  • the adjacent motif comprises the nucleic acid sequence 5'-TTN-3', where N is any nucleotide; preferably, N is A, C or T.
  • the Cas enzyme can recognize short motifs associated with the target nucleic acid, thereby completing the cutting and modification of specific sites.
  • the PAM that can be specifically recognized by the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein in the above gene editing system is 5’-TTN-3’, where N is any nucleotide. And it has the highest recognition efficiency for 5’-TTA-3’, 5’-TTC-3’ and 5’-TTT-3’.
  • the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is combined with the RNA guide to form a protein-nucleic acid complex; preferably, the protein-nucleic acid complex is non-naturally occurring or Modified; preferably, at least one component of the protein-nucleic acid complex is non-naturally occurring or modified.
  • RNA guides and Cas proteins can form protein-nucleic acid complexes.
  • the RNA guide can be modified or the Cas protein can be modified, or an unmodified protein-nucleic acid complex can be directly used (that is, the RNA guide and Cas protein are both natural), both of which can play a role in gene editing.
  • the RNA guide targets the target nucleic acid and modifies the target nucleic acid; preferably, the modification includes but is not limited to cleavage or nicking; preferably, the modification results in: (1) a change in the expression of the cell containing at least one gene product; or (2) the cell comprises an alteration in the expression of at least one gene product, wherein the expression of at least one gene product is increased; or (3) the cell comprises an alteration in the expression of at least one gene product, wherein the expression of at least one gene product Reduced; or (4) the cell contains an edited genome; preferably, the modification results in cytotoxicity; preferably, the modification results in inhibition of gene expression, reduction in gene expression, or enhancement of gene expression.
  • the target nucleic acid can be modified, including cutting or nicking.
  • Cleavage is the break of single-stranded DNA or double-stranded DNA.
  • a nick is a break in one of the DNA strands of double-stranded DNA.
  • the above-mentioned modifications to cellular genes can lead to changes in the expression of gene products in cells, including increases or decreases, and can also lead to editing of the cell genome.
  • the above modifications can act on the cell's own genome, or on foreign genes such as plasmids in the cell.
  • the above modifications can cause cytotoxicity, inhibit gene expression, reduce gene expression, or enhance gene expression.
  • the gene editing system includes a target nucleic acid or a nucleic acid encoding the target nucleic acid, and the target nucleic acid includes a homology arm fragment and a donor template nucleic acid; preferably, the target nucleic acid includes a sequence capable of hybridizing with the spacer sequence;
  • the homology arm fragment includes a 5' homology arm and a 3' homology arm, and the target nucleic acid consists of a 5' homology arm, a donor template nucleic acid and a 3' homology arm sequentially connected.
  • the above gene editing system may also include a target nucleic acid or a nucleic acid encoding the target nucleic acid.
  • the target nucleic acid includes a sequence that can hybridize with the spacer sequence, that is, the target nucleic acid can be positioned at a specific nucleic acid site (such as a target nucleic acid).
  • the target nucleic acid is the nucleic acid modified by the gene editing system; the donor template nucleic acid is the template for modifying the target nucleic acid.
  • the target nucleic acid includes a homology arm fragment and a donor template nucleic acid. The homology arm fragment can specifically bind to the nucleic acid surrounding a specific nucleic acid site.
  • the donor template nucleic acid carries specific genetic information and can be synthesized through homology.
  • Source recombination or other mechanisms after the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein cleaves the target nucleic acid, specific genetic information is integrated into the position of the target nucleic acid, thereby completing the modification of the target nucleic acid, including Insertion, deletion or replacement of bases to complete gene editing.
  • Donor template nucleic acid means that one or more cellular proteins (such as homologous recombinase) can use it to change the target nucleic acid after the Cas enzyme has changed (modified, including the insertion, deletion or replacement of bases) the target nucleic acid.
  • the structure of a nucleic acid molecule For example, after using Cas protein to modify the target nucleic acid, homologous recombinase is used to modify and change the target nucleic acid by inserting, deleting, or replacing the donor template nucleic acid as a template.
  • the donor template nucleic acid can be double-stranded nucleic acid or single-stranded nucleic acid; the donor template nucleic acid can be linear or circular (for example, a plasmid can be used); the donor template nucleic acid can be an exogenous nucleic acid molecule.
  • Donor template nucleic acids include DNA or RNA.
  • the gene editing system exists in a deliverable form, and the delivery system is used to bring the gene editing system into contact with the target nucleic acid; preferably, the delivery system delivers the gene editing system into cells containing the target nucleic acid;
  • deliverable forms include, but are not limited to, nanoparticles, liposomes, exosomes, microvesicles, protein capsids, or particles for gene guns.
  • the gene editing system can be delivered into the cells through the above delivery system.
  • the contact between the gene editing system and the target nucleic acid is completed.
  • the delivery system can also be further positioned at specific cell types, internal cell structures, etc., to achieve the purpose of precise delivery of the gene editing system and improve the accuracy of gene editing.
  • a gene editing vector which contains the above-mentioned nucleic acid encoding an RNA guide.
  • the gene editing vector also contains the above-mentioned DNA molecule; preferably, the DNA molecule and the nucleic acid encoding the RNA guide are located on the same or different vectors; preferably, the DNA molecule is connected to the first regulatory element; Preferably, the nucleic acid encoding the RNA guide is connected to a second regulatory element; preferably, the first regulatory element and the second regulatory element are each independently selected from one of an inducible promoter, a constitutive promoter or a tissue-specific promoter.
  • the first regulatory element and the second regulatory element are independently selected from the group including but not limited to T7, SP6, T3, CMV, EF1a, SV40, PGK1, human ⁇ -actin, CAG, U6, H1, T7 , T7lac, araBAD, trp, lac or one or more of Ptac.
  • the above-mentioned gene editing vector contains a nucleic acid capable of encoding an RNA guide, capable of replicating the nucleic acid encoding the RNA guide, and encoding the RNA guide in the cell.
  • the above-mentioned recombinant vector containing DNA molecules and the gene editing vector can be used to express CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein and RNA guide respectively, that is, the above-mentioned gene editing system can be expressed.
  • the gene editing vector may also contain the above-mentioned DNA molecule, and the DNA molecule and the nucleic acid encoding the RNA guide are located on the same or different vectors.
  • the gene editing vector includes one vector; if they are on different vectors, multiple vectors make up the gene editing vector.
  • the above gene editing system can be expressed in cells.
  • a variety of regulatory elements such as promoters can be flexibly set independently to help RNA guides and/or proteins complete transcription, translation, purification and other tasks.
  • a method for binding the above-mentioned gene editing system to a target nucleic acid in a cell includes: delivering the gene editing system to the cell, and the cell includes the target nucleic acid; allowing CRISPR- Cas effector proteins or CRISPR-Cas effector fusion proteins combine with the RNA guide to bind the spacer sequence to the target nucleic acid.
  • the target nucleic acid is double-stranded DNA or single-stranded DNA; preferably, the combination of the gene editing system and the target nucleic acid in the cell causes a change in the expression state of the target nucleic acid; preferably, the gene editing system and the cell
  • the binding of the target nucleic acid results in the target nucleic acid being cleaved; preferably, the target nucleic acid is cleaved leading to the destruction of the target nucleic acid, or the replacement of a specific site of the target nucleic acid, or the removal of the target nucleic acid site, or the change of the function of the target nucleic acid region, or A sequence inversion between two sites on a target nucleic acid.
  • Target nucleic acid disruption e.g., target mutation, e.g., resulting in gene knockout
  • Target nucleic acid replacement e.g., resulting in target correction
  • Removal of target sites e.g., resulting in target deletion
  • Changes in function e.g., target nucleic acid activity or accessibility, resulting For example (transcriptional and/or epigenetic) gene or genomic region activation or gene or genomic region silencing.
  • Sequence inversion between two sites on the target nucleic acid can be achieved, for example, by cutting the target nucleic acid into two sites and then using the Donor sequence or Cre-loxP recombinase system.
  • Target nucleic acid destruction includes target nucleic acid mutation or gene knockout.
  • Target nucleic acid site-specific replacement includes correction of incorrect bases in the target nucleic acid. Removal of the target nucleic acid site can lead to deletion of the target nucleic acid site.
  • Changes in regional function include but not Limited to reactivation of target nucleic acid expression activity, or inactivation of target nucleic acid expression activity, or increase in expression level, or expression The amount is reduced, or the expression product is changed. Changes in the expression state of the target nucleic acid, including but not limited to gene silencing or gene expression activation.
  • a cell containing a gene editing system includes the above-mentioned gene editing system or gene editing vector.
  • the cell containing the gene editing system contains a modified target target locus, and the target target locus is a locus modified using the gene editing system; preferably, the modification of the target target locus results in: ( 1) The cells containing the gene editing system contain changes in the expression of at least one gene product; or (2) The cells containing the gene editing system contain changes in the expression of at least one gene product, wherein the expression of at least one gene product is increased; or (3) the cell containing the gene editing system contains an alteration in the expression of at least one gene product, wherein the expression of at least one gene product is reduced; or (4) the cell containing the gene editing system contains an edited genome; preferably, Cells containing gene editing systems include eukaryotic cells or prokaryotic cells; preferably, eukaryotic cells include but are not limited to animal cells, plant cells or human cells; preferably, animal cells include but are not limited to mammalian cells.
  • the above-mentioned cells containing the gene editing system contain the above-mentioned gene editing system or gene editing vector. Cells in various states such as gene editing has not occurred, gene editing is occurring, gene editing has been completed, etc., all belong to the above-mentioned cells containing the gene editing system.
  • the above-mentioned cells where gene editing is occurring and where gene editing has been completed contain modified target loci. It can lead to changes in the expression of gene products in cells, including increases or decreases, and can also lead to editing of the cell genome.
  • the above modifications can act on the cell's own genome, or on foreign genes such as plasmids in the cell. The above modifications can cause cytotoxicity, inhibit gene expression, reduce gene expression, or enhance gene expression.
  • target locus encompasses any DNA segment or region of a polynucleotide that is desired to be edited.
  • the target locus is a genomic locus.
  • the target locus may be native to the cell, or alternatively may comprise heterologous or exogenous DNA segments.
  • Heterologous or exogenous DNA segments may include transgenes, expression cassettes, polynucleotides encoding selectable markers, or heterologous or exogenous DNA regions.
  • the targeted loci may comprise genes from prokaryotes, eukaryotes, animals or plants, including non-human mammals, non-human cells, rodents, humans, mice, primates or any other The native, heterologous or exogenous genomic nucleic acid sequence of the target organism, or a combination thereof.
  • a method for targeting and editing a target nucleic acid includes contacting the target nucleic acid with the above-mentioned gene editing system.
  • a method for non-specific degradation of single-stranded DNA after identifying a target nucleic acid includes contacting the target nucleic acid with the above-mentioned gene editing system.
  • the above method can be realized by utilizing the activity similar to the side-cleaving effect of Cas12i.
  • Cas12i proteins can have incidental activity, that is, under certain circumstances, the activated Cas12i protein remains active after binding to the target sequence and continues to non-specifically cleave non-target oligonucleotides.
  • the Cas12i system can be used to detect the presence of specific target oligonucleotides.
  • the Cas12i system is engineered to nonspecifically cleave ssDNA or transcripts.
  • Cas12i is provided or expressed transiently or stably in in vitro systems or cells, and targets or triggers non-specific cleavage of cellular nucleic acids, such as ssDNA, such as viral ssDNA.
  • reporter nucleic acid refers to a molecule that can be cleaved or otherwise inactivated by the activated CRISPR system protein. Reporter nucleic acids contain nucleic acid elements that can be cleaved by CRISPR proteins. Cleavage of the nucleic acid element releases the agent or produces a conformational change that allows a detectable signal to be generated.
  • the reporter nucleic acid Prior to cleavage, or when the reporter nucleic acid is in an "active" state, the reporter nucleic acid prevents the generation or detection of a positive detectable signal.
  • a positive detectable signal can be any signal detectable using optical, fluorescent, chemiluminescent, electrochemical, or other detection methods known in the art.
  • a first signal i.e., a negative detectable signal
  • Second signal e. positive detectable signal.
  • the above application can be realized using the above method or the above CRISPR-Cas effector protein.
  • a method of targeting and nicking the non-spacer complementary strand of a double-stranded target DNA after identifying the spacer complementary strand of the double-stranded target DNA is provided.
  • Methods include contacting double-stranded target DNA with the gene editing system described above.
  • a method for targeting and cutting double-stranded target DNA includes contacting the double-stranded target DNA with the above-mentioned gene editing system.
  • the non-spacer complementary strand of the double-stranded target DNA is nicked.
  • a method for specifically editing double-stranded nucleic acids includes contacting the following under sufficient conditions for a sufficient amount of time, (1) the above-mentioned CRISPR-Cas an effector protein, or a CRISPR-Cas effector fusion protein, another enzyme having sequence-specific nicking activity, and an RNA guide, the RNA guide directs the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein, relative to The activity of another sequence-specific nicking enzyme nicks the opposite strand; and (2) double-stranded nucleic acids; the above method results in the formation of double-stranded breaks.
  • two of the above-mentioned CRISPR-Cas effector proteins, or a CRISPR-Cas effector fusion protein, or one of the above-mentioned proteins and another Cas protein with nicking activity can create double-stranded breaks with overhangs.
  • This approach reduces the potential for off-target modifications because double-strand breaks are expected to occur only at loci where both enzymes nick, thereby increasing genome editing specificity.
  • This approach is also known as the "double-nicking" or "paired-nickase” strategy.
  • a method for editing double-stranded nucleic acids includes contacting the following under sufficient conditions for a sufficient amount of time: (1) the above-mentioned CRISPR-Cas effector Protein, or CRISPR-Cas effector fusion protein, and a fusion protein of a protein domain with DNA modification activity, and an RNA guide targeting double-stranded nucleic acid; and (2) double-stranded nucleic acid; CRISPR-Cas effect of the fusion protein
  • the molecule is modified to nick the non-target strand of the double-stranded nucleic acid.
  • the above method can be realized by utilizing the activity similar to the side-cleaving effect of Cas12i.
  • the above-mentioned double-stranded nucleic acid includes, but is not limited to, viral DNA (eg, Pasteur virus, hepatitis virus, herpes virus, adenovirus, poxvirus, parvovirus, etc.).
  • the two strands of the double-stranded nucleic acid are cleaved at different sites, resulting in staggered cleavage; preferably, the two strands of the double-stranded nucleic acid are cleaved at the same site, resulting in flat double-stranded breaks (DSB).
  • DSB flat double-stranded breaks
  • a method for targeting and cutting single-stranded target DNA includes contacting the target nucleic acid with the above-mentioned gene editing system.
  • the above gene editing system is in contact with the target nucleic acid, and can use the gene editing system to edit the target nucleic acid.
  • the gene editing system can target nucleic acids for editing.
  • the above-mentioned contacts can be achieved both inside and outside the cell. Sufficient conditions and a sufficient amount of contact time indicate that the above method can proceed or complete the reaction conditions and reaction time, which can be flexibly adjusted according to the specific implementation of the above method.
  • the above gene editing system is in contact with the target nucleic acid and can use the gene editing system to edit the target nucleic acid to achieve different gene editing effects.
  • the gene editing system can target nucleic acids for editing.
  • the above-mentioned contact can be achieved inside or outside the cell, and the target nucleic acid includes but is not limited to the genome and isolated single-stranded or double-stranded DNA.
  • a method for inducing changes in cell state is provided. The method conservatively brings the above gene editing system into contact with the target nucleic acid in the cell.
  • the cell state includes but is not limited to apoptosis or dormancy; preferably, the cells include eukaryotic cells or prokaryotic cells; preferably, the cells include but are not limited to mammalian cells or plant disease cells; preferably, the cells include but are not limited to mammalian cells or plant disease cells; , cells include but are not limited to cancer cells; preferably, cells include but are not limited to infectious cells or cells infected by infectious agents; preferably, cells include but are not limited to cells infected by viruses and cells infected by prions; preferably, cells include but are not limited to cells infected by viruses and prions.
  • cells include, but are not limited to, fungal cells, protozoan or parasite cells.
  • the above-mentioned method of inducing cell state changes uses a gene editing system to contact target nucleic acids in cells that regulate growth, metabolism and other functions, thereby modifying the target nucleic acids over time, thereby inducing changes in cell state.
  • the characteristic target nucleic acids in cancer cells, parasite cells, etc. can be modified to change the state of the cells, causing them to undergo apoptosis or dormancy, thereby achieving the purpose of eliminating such cells.
  • an eighteenth typical embodiment of the present application there is provided an application of using the above gene editing system in preparing a drug for treating a subject's illness or disease.
  • the application includes administering the gene editing system to a subject or ex vivo cells of a subject; preferably, the spacer sequence is at least 15 nucleotides of a target nucleic acid associated with the disorder or disease.
  • Complementary, CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein cleaves the target nucleic acid;
  • the disorder or disease includes but is not limited to cancer, infectious disease, metabolic disease or genetic disease;
  • cancer includes but Not limited to Wilms tumor, Ewing sarcoma, neuroendocrine tumors, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, kidney cancer, pancreatic cancer , lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma,
  • the above gene editing system can be applied to the preparation of drugs for treating conditions or diseases in subjects.
  • Such applications include the application of gene editing systems to subjects or their isolated cells.
  • the application to subjects includes local application, systemic application or targeted application, etc., so as to achieve the effect of the drug.
  • the spacer sequence is complementary to at least 15 nucleotides of the target nucleic acid related to the disease or disease, which can ensure stable and specific binding between the spacer sequence and the target nucleic acid and prevent off-target situations.
  • the above conditions or diseases include but are not limited to cancer or infectious diseases.
  • target nucleic acids such as cancer cell genomes or subject defective genes can be modified to achieve the effect of drugs.
  • a eukaryotic cell line which includes the above-mentioned cells containing the gene editing system, or is the progeny of the cells containing the gene editing system.
  • a multicellular organism which includes the above-mentioned cell containing the gene editing system.
  • multicellular organisms include but are not limited to model animals or model plants.
  • the above-mentioned multicellular organisms are multicellular organisms modified using the above-mentioned gene editing system, and heritable or non-heritable genetic modifications are obtained under the action of the above-mentioned gene editing system.
  • the above-mentioned genetic modification includes gene insertion, deletion or replacement using a gene editing system, the result of genetic modification, and the genetic modification of the multicellular organism are controllable and predictable.
  • a method for obtaining plants with target traits is provided.
  • the above-mentioned gene editing system is used to contact plant cells to modify the genes of plant cells or introduce target genes. Modify or The target gene can express the target traits, obtain modified plant cells, and use the modified plant cells for regeneration to obtain plants with the target traits.
  • a method for identifying target traits in plants is provided.
  • the target genes in plant cells can express the target traits, and the above gene editing system is used to contact the plant cells, thereby identifying the target traits.
  • a kit which includes one or more components selected from the following: the above-mentioned CRISPR-Cas effector protein, CRISPR-Cas effector Fusion proteins, DNA molecules, recombinant vectors, host cells, gene editing systems, gene editing vectors, cells containing gene editing systems, eukaryotic cell lines, multicellular organisms; the components of the kit are distributed in the same or different containers middle.
  • a container which container contains the above-mentioned kit.
  • the container includes a sterile container; preferably, the container includes a syringe.
  • an implantable device is provided, and the implantable device includes the above-mentioned gene editing system.
  • the gene editing system is within the matrix; preferably, the gene editing system is within the reservoir.
  • target cells e.g., in vivo Target cells, where target cells are target cells in circulation, target cells in tissues, target cells in organs, etc.
  • Suitable for delivering a CRISPR-Cas effector polypeptide of the disclosure, a CRISPR-Cas effector fusion polypeptide of the disclosure, an RNP of the disclosure, a nucleic acid of the disclosure, or a CRISPR-Cas effector system of the disclosure to a target cell may include a container (e.g., reservoir, matrix, etc.) that contains CRISPR -Cas effector protein, CRISPR-Cas effector CRISPR-Cas effector fusion protein, RNP or CRISPR-Cas effector system (or components thereof, such as nucleic acids of the present disclosure).
  • Suitable implantable devices may include, for example, a polymeric substrate (such as a matrix) serving as the body of the device, and in some cases additional scaffolding materials (such as metals or additional polymers), as well as materials that enhance visibility and imaging .
  • Implantable delivery devices may be advantageous in providing localized and prolonged release, where the polypeptide and/or nucleic acid to be delivered is released directly to the target site, such as the extracellular matrix (ECM), vasculature surrounding tumors, diseased tissue wait.
  • Suitable implantable delivery devices include devices suitable for delivery to a cavity (such as the peritoneal cavity) and/or any other type of administration where the drug delivery system is not anchored or attached, including devices that are biostable and/or implantable.
  • a degradable and/or bioabsorbable polymeric substrate which may optionally be the matrix, for example.
  • Such matrices include materials that are efficiently processed and/or modified upon contact with a biological environment without forming biologically active, toxic and/or harmful by-products.
  • materials that can be used for bioabsorbable matrices include, for example, biopolymers (e.g., proteins, peptides, carbohydrates, polynucleotides, etc.), synthetic polymers, proteins, polysaccharides, silk, polydecane Glyceryl acid esters (PGS), polydioxanone, polylactic acid-co-glycolic acid (PLGA), polylactic acid (PLA), collagen, chitosan, silk protein and combinations thereof.
  • biopolymers e.g., proteins, peptides, carbohydrates, polynucleotides, etc.
  • synthetic polymers proteins, polysaccharides, silk, polydecane Glyceryl acid esters (PGS), polydioxanone, polylactic acid-co-glycolic acid (PLGA), polylactic acid (PLA
  • Silk materials that can be used in the bioabsorbable matrix include, for example, silk fibroin, modified silk fibroin, spider silk, insect silk, recombinant silk, and any combination thereof.
  • the above-mentioned reservoir reservoir composition includes a matrix of bioerodible, biocompatible polymers, including but not limited to polymer gel reservoirs or membrane-controlled reservoirs.
  • suitable implantable drug delivery devices include degradable polymers where the primary release mechanism is bulkerosion.
  • suitable implantable drug delivery devices contain non-degradable or slowly degrading polymers in which the primary release mechanism is diffusion rather than bulk erosion, such that the outer portion acts as a membrane and its inner portion serves as a drug reservoir, In fact, the drug reservoir is not affected by the surrounding environment for long periods of time (eg, from about one week to about several months). Combinations of different polymers with different release mechanisms may also optionally be used. Over the lifetime of the total release period, the concentration gradient can remain effectively constant, and therefore the diffusion rate is effectively constant (referred to as "zero-mode" diffusion).
  • the term "constant" means that the diffusion rate remains above a lower threshold for therapeutic effectiveness, but that it is still optionally characterized by an initial burst and/or may fluctuate, such as increase and decrease to a certain extent.
  • the diffusion rate can be maintained like this for long periods of time, and the diffusion rate can be considered constant to a certain level to optimize the duration of treatment, such as an effective silent period.
  • implantable delivery systems are designed to protect nucleotide-based therapeutics from degradation, whether chemical in nature or due to attack by enzymes and other factors within the subject's body.
  • the implantation site or target site of the device can be selected for maximum therapeutic efficacy.
  • a method of treating a disease or condition in a subject in need comprising administering to the subject any one or more of the following:
  • the above-mentioned CRISPR-Cas effector protein or a derivative of the above-mentioned CRISPR-Cas effector protein or a functional fragment of the above-mentioned CRISPR-Cas effector protein, as well as a heterologous functional domain, and an RNA guide or an RNA guide encoding the RNA guide nucleic acid;
  • the subject includes a subject suffering from a disorder or disease, including cancer, infectious disease, metabolic disease or genetic disease;
  • the cancer includes Wilms tumor, Ewing sarcoma, neuroendocrine tumor, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, Prostate cancer, liver cancer, kidney cancer, pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, bone marrow One or more of neoplasms, acute lymphoblastic leukemia, acute myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
  • the metabolic disease includes familial hypercholesterolemia (FH);
  • the genetic diseases include transthyretin amyloidosis (ATTR), primary hyperoxaluric acidosis (PH1), and hereditary angioedema (HAE);
  • TRR transthyretin amyloidosis
  • PH1 primary hyperoxaluric acidosis
  • HAE hereditary angioedema
  • the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B (HEPATITIS B) or herpes simplex virus-2;
  • the disorder or disease includes cystic fibrosis, progressive Duchenne muscular dystrophy, Becker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy Dysplasia, Huntington's disease, Fragile X syndrome syndrome, Friedreich's ataxia, amyotrophic lateral sclerosis, frontotemporal dementia, hereditary chronic kidney disease, hyperlipidemia, hypercholesterolemia, Leber's congenital amaurosis, sickle cell disease or one or more of beta thalassemias.
  • cystic fibrosis includes cystic fibrosis, progressive Duchenne muscular dystrophy, Becker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy Dysplasia, Huntington's disease, Fragile X syndrome syndrome, Friedreich's ataxia, amyotrophic lateral sclerosis, frontotemporal dementia, hereditary chronic kidney disease, hyperlipidemia
  • condition or disease includes cancer, infectious disease, metabolic disease or genetic disease;
  • the cancers include Wilms tumor, Ewing sarcoma, neuroendocrine tumors, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, kidney cancer, Pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphoblastic leukemia, acute One or more of myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
  • the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B (HEPATITIS B) or herpes simplex virus-2;
  • the metabolic disease includes familial hypercholesterolemia (FH);
  • the genetic diseases include transthyretin amyloidosis (ATTR), primary hyperoxaluric acidosis (PH1), and hereditary angioedema (HAE);
  • TRR transthyretin amyloidosis
  • PH1 primary hyperoxaluric acidosis
  • HAE hereditary angioedema
  • the disorder or disease includes cystic fibrosis, progressive Duchenne muscular dystrophy, Becker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic Malnutrition, Huntington's disease, fragile X syndrome, Friedreich's ataxia, amyotrophic lateral sclerosis, frontotemporal dementia, hereditary chronic kidney disease, hyperlipidemia, hypercholesterolemia, Leber's disease One or more of congenital amaurosis, sickle cell disease, or beta thalassemia.
  • CasY1, CasY2, CasY3, CasY4 and CasY5 i.e. CasY1-CasY5 genes and RNA guides.
  • TBLASTN https://blast.ncbi.nlm.nih.gov/
  • TBLASTN https://blast.ncbi.nlm.nih.gov/
  • CasY1 SEQ ID NO: 1
  • CasY2 SEQ ID NO: 2
  • CasY1 SEQ ID NO: 3
  • CasY4 SEQ ID NO: 4
  • CasY5 SEQ ID NO: 5
  • Figure 1 shows a schematic diagram of the protein domain of CasY1-CasY5, in which D, E, and D represent the catalytic residues of the three conserved motifs I, II, and III of the RuvC domain, and h represents the bridge helix structure.
  • D-E-D represents Asp-Glu-Asp amino acids, which are conserved amino acid residues of the RuvC domain.
  • the coding DNA sequence of CasY1 is shown in SEQ ID NO: 6, the coding DNA sequence of CasY2 is shown in SEQ ID NO: 7, the coding DNA sequence of CasY3 is shown in SEQ ID NO: 8, and the coding DNA sequence of CasY4 is shown in SEQ ID NO. NO:9 is shown, and the coding DNA sequence of CasY5 is shown as SEQ ID NO:10.
  • the direct repeat sequences corresponding to CasY1-CasY5 are shown in SEQ ID NO: 24, 27, 30, 33, and 36 respectively.
  • the sequences of the stem-loop structure are SEQ ID NO: 25, 28, 31, 34, and 37 respectively.
  • RNA guide Pre-crRNA sequences are shown in SEQ ID NO: 26, 29, 32, 35, and 38 respectively.
  • nucleic acid sequence of the CasY2 protein (SEQ ID NO: 7) and the CasY2 crRNA-TTR sequence (SEQ ID NO: 12) were cloned into the expression vector pACYCDuet-1 to construct the recombinant plasmid pACYCDuet1-CasY2-crRNA, as described
  • the recombinant plasmid sequence is shown in SEQ ID NO: 18.
  • the nucleic acid sequence of the CasY3 protein (SEQ ID NO: 8) and the CasY3 crRNA-TTR sequence (SEQ ID NO: 13) were cloned into the expression vector pACYCDuet-1 to construct the recombinant plasmid pACYCDuet1-CasY3-crRNA.
  • the recombinant plasmid sequence As shown in SEQ ID NO: 19.
  • the nucleic acid sequence of the CasY4 protein (SEQ ID NO: 9) and the CasY4 crRNA-TTR sequence (SEQ ID NO: 14) were cloned into the expression vector pACYCDuet-1 to construct the recombinant plasmid pACYCDuet1-CasY4-crRNA.
  • the recombinant plasmid sequence As shown in SEQ ID NO:20.
  • the nucleic acid sequence of the CasY5 protein (SEQ ID NO: 10) and the CasY5 crRNA-TTR sequence (SEQ ID NO: 15) were cloned into the expression vector pACYCDuet-1 to construct the recombinant plasmid pACYCDuet1-CasY5-crRNA.
  • the recombinant plasmid sequence As shown in SEQ ID NO: 21.
  • the random sequence in SEQ ID NO: 22 of this application includes 6 random bases (n), that is, the types of random base sequences finally constructed reach 4 6 , with a total of 4096 permutations and combinations.
  • pACYCDuet1-CasY2-crRNA and pUC19-PAM library plasmids were also co-transfected into DH5 ⁇ competent cells; pACYCDuet1-CasY3-crRNA and pUC19-PAM library plasmids were co-transfected into DH5 ⁇ competent cells; pACYCDuet1-CasY4 -crRNA and pUC19-PAM library plasmid were co-transfected into DH5 ⁇ competent cells; pACYCDuet1-CasY5-crRNA and pUC19-PAM library plasmid were co-transfected into DH5 ⁇ competent cells.
  • DH5 ⁇ cells were transformed with two plasmids and treated at 37°C for 1 hour.
  • the plasmids were extracted and the PAM region sequence was subjected to PCR amplification and high-throughput sequencing.
  • Figure 3 is a schematic diagram of in vivo screening of effectors and library plasmid design.
  • the pACYC-Effector-crRNA plasmid is the plasmid (effector plasmid) shown in SEQ ID NO: 17-21, and a pUC19-PAM library targeting the PAM library was designed. plasmid.
  • Figure 4 is a schematic diagram of the negative selection screening workflow.
  • CasY1-CasY5 effectively edits target sequences with 5’-TTA, 5’-TTT, and 5’-TTC PAMs.
  • the PAM sequence of the CasY1-CasY5 protein is a 5’-TTN structure.
  • Statistics show that the editing activity of CRISPR/CasY1, CRISPR/CasY2, CRISPR/CasY3, CRISPR/CasY4, and CRISPR/CasY5 systems for 5’-TTA, 5’-TTT, and 5’-TTC is much higher than that of 5’-TTG.
  • the PAM domain analysis results of CasY1, CasY2, CasY3, CasY4, and CasY5 are shown in Figure 5.
  • "Bits" in Figure 5 refers to "bits”
  • Position refers to "position”.
  • the ccdb virus plasmid is shown in SEQ ID: 23.
  • the ccdb toxic plasmid (SEQ ID NO: 23) was transferred into Top10 competent cells. Through gradient experiments, it was found that when 64mM L-arabinose (L-ara) was added to the plate, the bacteria were lethal, indicating that this dose was a lethal dose.
  • pACYCDuet1-CasY2-crRNA recombinant plasmid SEQ ID NO: 18
  • pACYCDuet1-CasY3-crRNA recombinant plasmid SEQ ID NO: 19
  • pACYCDuet1-CasY4-crRNA recombinant plasmid SEQ ID NO: 20
  • pACYCDuet1 -CasY5-crRNA recombinant plasmid SEQ ID NO: 21
  • ccdb toxic plasmid were co-transfected into Top10 competent cells, and when 64mM L-arabinose (L-ara) was added, some bacteria survived, indicating that in the surviving part In bacteria, CasY2-CasY5 proteins cut the ccdb virulence plasmid respectively.
  • Rosetta (DE3) pLyseS (EMD Millipore) cells expressing CasY1-CasY5 respectively were inoculated into 10 ml LB medium and cultured at 37°C overnight. When the OD 600 of the bacteria reaches 0.2, lower the culture temperature to 21°C and continue culturing until the OD 600 reaches 0.6. Then add IPTG at a final concentration of 500 ⁇ M to induce Cas protein expression.
  • the lysate was filtered through a 0.22 ⁇ m filter (Millipore, Stericup) before being transferred to a nickel column (HisTrap FF, 5 ml) and eluted with an imidazole gradient. Proteins containing the expected sizes were pooled together, TEV protease (Sigma, T4455-10KU) was added, and the samples were dialyzed overnight in TEV buffer (500mM NaCl, 50mM HEPES [pH 7], 5mM MgCl, 2mM DTT). After dialysis, the sample was concentrated to 500 ⁇ l and stored frozen at -80°C.
  • TEV buffer 500mM NaCl, 50mM HEPES [pH 7], 5mM MgCl, 2mM DTT.
  • the purified protein (25nM) was reacted in cleavage buffer (NEBuffer 3, 5mM DTT) at 37°C for 20 minutes.
  • cleavage buffer NBuffer 3, 5mM DTT
  • dsDNA target DNA
  • the protein concentrations were 0nM, 50nM, 100nM, 200nM, 500nM and 1 ⁇ M respectively.
  • reaction product The materials were recovered using a purification and recovery kit (QIAGEN). Cleavage efficiency was tested on TBE-urea 6% polyacrylamide gel (Invitrogen).
  • FIG. 8 shows the results of in vitro cleavage activity, in which the in vitro cleavage activities of CasY1, CasY2, CasY3, CasY4, and CasY5 on target dsDNA are shown in Figure 8, Figure a, Figure b, Figure c, Figure d, and Figure e, respectively.
  • HEK293T cells purchased from ATCC
  • DMEM medium Gibco, 11965092
  • FBS v/v
  • Penicillin Streptomycin v/v
  • Culture was performed in a 37 °C cell culture incubator containing 5% CO2 .
  • LONZA transfection reagent (Lonza, Cat #V4XP-3032) was used for transfection according to the instructions, and the cells were counted to 2 ⁇ 10 6 .
  • CasY1 protein was mixed at a mass of 3 ⁇ g: 1.5 ⁇ g.
  • CasY2-CasY5 protein and Lbcpf1 protein were mixed with a variety of crRNA (chemically synthesized by Genscript Biotechnology Co., Ltd.) in the above proportions according to the above method, As shown in Table 1.
  • PCR amplification was performed on sequences near the target site, and the amplified PCR products were used for high-throughput deep sequencing (Jinweizhi Biotechnology Co., Ltd.).
  • the system used for target site sequence amplification is as follows: 2 ⁇ Taq Master Mix (Vazyme, P112-03) 25 ⁇ L; Primer-F (10 pmol/ ⁇ L) 1 ⁇ L; Primer-R (10 pmol/ ⁇ L) 1 ⁇ L; template 1 ⁇ L; ddH 2 O. Make up to 50 ⁇ L.
  • the above-mentioned embodiments of the present invention achieve the following technical effects: the above-mentioned CasY1, CasY2, CasY3, CasY4, and CasY5 proteins can be used in systems, methods, and compositions for manipulating nucleic acids in a targeted manner.
  • Non-naturally occurring engineered CRISPR-Cas systems for targeted modification of nucleic acids e.g., DNA
  • One or more nucleic acid components can exert the function and activity of the CRISPR-Cas system both inside and outside the cell.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Biotechnology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Animal Behavior & Ethology (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Neurology (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Epidemiology (AREA)
  • Urology & Nephrology (AREA)
  • Neurosurgery (AREA)
  • Hematology (AREA)
  • Diabetes (AREA)
  • Cell Biology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Psychology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Immunology (AREA)
  • Hospice & Palliative Care (AREA)
  • Psychiatry (AREA)

Abstract

Provided are a CRISPR-Cas effector protein, a gene editing system therefor, and an application. The CRISPR-Cas effector protein comprises a protein which is at least 70% identical to the amino acid sequence of any one of SEQ ID NOs: 1 to 5. The present application can solve the problem in the prior art of low CRISPR/Cas system cutting efficiency, and is suitable for the field of gene editing.

Description

CRISPR-Cas效应子蛋白、其基因编辑系统及应用CRISPR-Cas effector proteins, their gene editing systems and applications
本申请是以CN申请号为202210681597.5,申请日为2022年6月16日的中国申请为基础,并主张其优先权,该CN申请的公开内容作为整体引入本申请中。This application is based on the Chinese application with CN application number 202210681597.5 and the filing date is June 16, 2022, and claims its priority. The disclosure content of the CN application is incorporated into this application as a whole.
技术领域Technical field
本发明涉及基因编辑领域,具体而言,涉及一种CRISPR-Cas效应子蛋白、其基因编辑系统及应用。The present invention relates to the field of gene editing, specifically to a CRISPR-Cas effector protein, its gene editing system and applications.
背景技术Background technique
Clustered regularly interspaced short palindromic repeats(CRISPR)系统,是细菌和古细菌为了防御入侵噬菌体的DNA而形成的。CRISPR系统的免疫干扰过程主要包括3个阶段:适应、表达和干扰。适应阶段,CRISPR系统会将来自噬菌体或质粒的DNA短片段整合到前导序列和第一段重复序列之间,每一次整合都伴随着重复序列的复制,进而形成一个新的重复-间隔序列单元。表达阶段,CRISPR基因座会被转录成一段CRISPR RNA(crRNA)前体(pre-crRNA),该前体在Cas蛋白和tracrRNA的存在下会在重复序列处被进一步加工成小的crRNA。成熟的crRNA与Cas蛋白形成Cas/crRNA复合体。干扰阶段,crRNA通过其与靶序列互补的区域引导Cas/crRNA复合体寻找靶点,并在靶点位置通过Cas蛋白的核酸酶活性造成靶点位置的双链DNA断裂,从而使靶标DNA失去原有功能。The Clustered regularly interspaced short palindromic repeats (CRISPR) system was formed by bacteria and archaea to defend against the DNA of invading phages. The immune interference process of the CRISPR system mainly includes three stages: adaptation, expression and interference. In the adaptation phase, the CRISPR system integrates short DNA fragments from phages or plasmids between the leader sequence and the first repeat sequence. Each integration is accompanied by the replication of the repeat sequence, thereby forming a new repeat-spacer sequence unit. During the expression stage, the CRISPR locus is transcribed into a CRISPR RNA (crRNA) precursor (pre-crRNA), which is further processed into a small crRNA at the repetitive sequence in the presence of Cas protein and tracrRNA. Mature crRNA and Cas protein form Cas/crRNA complex. In the interference stage, crRNA guides the Cas/crRNA complex to find the target through its region complementary to the target sequence, and causes a double-stranded DNA break at the target position through the nuclease activity of the Cas protein, thereby causing the target DNA to lose its original shape. There are functions.
CRISPR系统分为I,II,III型三个家族,其中II型系统最常见的为CRISPR/Cas9系统,Cas9蛋白可在反式编码小RNA(trans-encoded small RNA,tracrRNA)的协助下将pre-crRNA加工成与tracrRNA结合的成熟crRNA。之后,人们发现通过人工构建模拟crRNA:tracrRNA复合体的单链嵌合体引导RNA(guide RNA),即可有效的介导Cas9蛋白对靶点的识别和切割。其中与靶点3′端紧邻的3个碱基必须是5′-NGG-3′的形式,从而构成Cas/crRNA复合体识别靶点所需的PAM(protospacer adjacent motif)结构。然而目前存在的不同的CRISPR/Cas各有不同的优点和缺陷。例如Cas9,C2c1和CasX均需要两条RNA进行向导RNA。常见的Cas9,C2c1,CasY和Cpf1通常大小在1300个氨基酸左右。此外,Cas9,Cpf1,CasX,CasY的PAM序列都复杂多样。且现有的CRISPR/Cas系统均存在脱靶效应严重、切割效率低等问题,因此开发脱靶效应低、切割效率高的新型CRISPR/Cas系统具有重要意义。CRISPR systems are divided into three families: I, II, and III. Among them, the most common type II system is the CRISPR/Cas9 system. The Cas9 protein can convert pre -crRNA is processed into mature crRNA that binds to tracrRNA. Later, it was discovered that by artificially constructing a single-stranded chimera guide RNA (guide RNA) that simulates the crRNA:tracrRNA complex, it can effectively mediate the recognition and cleavage of the target by the Cas9 protein. The three bases immediately adjacent to the 3' end of the target must be in the form of 5'-NGG-3', thus forming the PAM (protospacer adjacent motif) structure required for the Cas/crRNA complex to recognize the target. However, the different CRISPR/Cas currently existing have different advantages and disadvantages. For example, Cas9, C2c1 and CasX all require two RNAs for guide RNA. Common Cas9, C2c1, CasY and Cpf1 are usually around 1300 amino acids in size. In addition, the PAM sequences of Cas9, Cpf1, CasX, and CasY are complex and diverse. And the existing CRISPR/Cas systems have problems such as serious off-target effects and low cleavage efficiency. Therefore, it is of great significance to develop new CRISPR/Cas systems with low off-target effects and high cleavage efficiency.
发明内容Contents of the invention
本发明的主要目的在于提供一种CRISPR-Cas效应子蛋白、其基因编辑系统及应用,以解决现有技术中的CRISPR/Cas系统切割效率低的问题。 The main purpose of the present invention is to provide a CRISPR-Cas effector protein, its gene editing system and application, so as to solve the problem of low cutting efficiency of the CRISPR/Cas system in the existing technology.
为了实现上述目的,根据本发明的第一个方面,提供了一种CRISPR-Cas效应子蛋白,该CRISPR-Cas效应子蛋白包括与SEQ ID NO:2、1、3-5中任一项的氨基酸序列具有至少70%同一性的蛋白。In order to achieve the above object, according to the first aspect of the present invention, a CRISPR-Cas effector protein is provided, the CRISPR-Cas effector protein includes any one of SEQ ID NO: 2, 1, 3-5 Proteins with at least 70% amino acid sequence identity.
进一步地,CRISPR-Cas效应子蛋白包括与SEQ ID NO:2、1、3-5中任一项的氨基酸序列具有80%以上,优选90%以上,更优选95%以上,进一步优选99%以上同一性的蛋白;优选地,CRISPR-Cas效应子蛋白包括RuvC结构域。Further, the CRISPR-Cas effector protein includes more than 80%, preferably more than 90%, more preferably more than 95%, further preferably more than 99% of the amino acid sequence of any one of SEQ ID NO: 2, 1, 3-5. Identity protein; preferably, the CRISPR-Cas effector protein includes a RuvC domain.
进一步地,CRISPR-Cas效应子蛋白包括:a)SEQ ID NO:2、1、3-5中任一项所示的蛋白;或b)以SEQ ID NO:1所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:N21X,N23X,R25X,K26X,Q482X,S484X,R486X,S489X,R493X,H511X,C513X,H515X,N516X,R518X,R540X,K558X,Y560X,K562X,K565X,T600X,T672X,D676X,Q680X,Y683X,L686X,D693X,Y731X,G767X,R772X,K832X,K833X,Q836X,M896X;或c)以SEQ ID NO:2所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:R19X,R28X,R32X,K512X,N527X,W531X,R553X,K581X,K589X,I590X,R605X,K611X,R612X,R615X,Y777X,E877X,R931X;或d)以SEQ ID NO:3所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:K8X,F15X,N17X,K20X,K471X,W483X,H502X,R505X,K557X,K556X,R560X,Y673X,L676X,Y723X,N822X,K823X,E826X,K827X,K830X,K880X,L887X;或e)以SEQ ID NO:4所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:K317X,W330X,Y351X,K354X,D392X,F395X,N399X,Y509X,V512X,Y568X,N662X,K663X,E666X,R667X,K670X,K719X,L726X;或f)以SEQ ID NO:5所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:M9X,V16X,D18X,K21X,K518X,W531X,F550X,K553X,R609X,Y612X,R616X,Y730X,L733X,Y781X,N879X,K880X,E883X,K884X,K887X,K936X,F943X;其中X为任意氨基酸。Further, the CRISPR-Cas effector protein includes: a) the protein shown in any one of SEQ ID NO: 2, 1, 3-5; or b) based on the amino acid sequence shown in SEQ ID NO: 1, Proteins with one or more point mutations as follows: N21X, N23X, R25X, K26X, Q482X, S484X, R486X, S489X, R493X, H511X, C513X, H515X, N516X, R518X, R540X, K558X, Y560X, K562X, K565X, T600X, T672X, D676X, Q680X, Y683X, L686X, D693X, Y731X, G767X, R772X, K832X, K833X, Q836X, M896X; or c) Based on the amino acid sequence shown in SEQ ID NO: 2, perform one of the following or Proteins with multiple point mutations: R19X, R28X, R32X, K512X, N527X, W531X, R553X, K581X, K589X, I590X, R605X, K611X, R612X, R615X, Y777X, E877X, R931X; or d) SEQ ID NO: 3 Based on the amino acid sequence shown, the protein has one or more point mutations as follows: K8X, F15X, N17X, K20X, K471X, W483X, H502X, R505X, K557X, K556X, R560X, Y673X, L676X, Y723X, N822X, K823X, E826X, K827X, K830X, K880X, L887X; or e) A protein with one or more point mutations based on the amino acid sequence shown in SEQ ID NO: 4: K317X, W330X, Y351X, K354X, D392X , F395X, N399X, Y509X, V512X, Y568X, N662X, K663X, E666X, R667X, K670X, K719X, L726X; or f) Based on the amino acid sequence shown in SEQ ID NO: 5, perform one or more of the following points Mutated proteins: M9X, V16X, D18X, K21X, K518X, W531X, F550X, K553X, R609X, Y612X, R616X, Y730X, L733X, Y781X, N879X, K880X, E883X, K884X, K887X, K936X, F943 X; where X is any Amino acids.
为了实现上述目的,根据本发明的第二个方面,提供了一种CRISPR-Cas效应子融合蛋白,该CRISPR-Cas效应子融合蛋白包括上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子蛋白的衍生物或CRISPR-Cas效应子蛋白的功能片段,以及异源功能结构域。In order to achieve the above object, according to the second aspect of the present invention, a CRISPR-Cas effector fusion protein is provided, which includes the above-mentioned CRISPR-Cas effector protein, or CRISPR-Cas effector protein Derivatives or functional fragments of CRISPR-Cas effector proteins, as well as heterologous functional domains.
进一步地,异源功能结构域位于CRISPR-Cas效应子融合蛋白的N端、C端或内部;优选地,异源功能结构域包括定位信号、报告蛋白、CRISPR-Cas效应子蛋白靶向部分、DNA结合域、表位标签、转录激活域、转录抑制域、核酸酶、脱氨结构域、甲基化酶、脱甲基酶、转录释放因子、HDAC、裂解活性多肽、连接酶中的一种或多种;优选地,定位信号包括核定位信号和/或核输出信号;优选地,核输出信号包括人类蛋白酪氨酸激酶2;优选地,报告蛋白包括谷胱甘肽-S-转移酶、辣根过氧化物酶、氯霉素乙酰转移酶、β-半乳糖苷酶、β-葡糖醛酸糖苷酶或自发荧光蛋白中的一种或多种;优选地,自发荧光蛋白包括绿色荧光蛋白、HcRed、DsRed、青荧光蛋白、黄色荧光蛋白或蓝色荧光蛋白中的一种或多种;优选地,DNA结合域包括甲基化结合蛋白、LexADBD或Gal4DBD中的一种或多种;优选地,表位标签包括组氨酸标签、V5标签、FLAG标签、流感病毒血凝素标签、Myc标签、VSV-G标签或硫氧还蛋白标签中的一种或多种;优选地,转录激活域包括VP64和/或VPR;优选地,转录抑制域包括 KRAB和/或SID;优选地,核酸酶包括FokI;优选地,脱氨结构域包括ADAR1、ADAR2、APOBEC、AID或TAD中的一种或多种;优选地,裂解活性多肽包括具有单链RNA裂解活性的多肽、具有双链RNA裂解活性的多肽、具有单链DNA裂解活性的多肽或具有双链DNA裂解活性的多肽;优选地,连接酶包括DNA连接酶和/或RNA连接酶。Further, the heterologous functional domain is located at the N-terminal, C-terminal or internal part of the CRISPR-Cas effector fusion protein; preferably, the heterologous functional domain includes a positioning signal, a reporter protein, a CRISPR-Cas effector protein targeting part, One of DNA binding domain, epitope tag, transcription activation domain, transcription repression domain, nuclease, deamination domain, methylase, demethylase, transcription release factor, HDAC, cleavage active polypeptide, ligase or more; preferably, the localization signal includes a nuclear localization signal and/or a nuclear export signal; preferably, the nuclear export signal includes human protein tyrosine kinase 2; preferably, the reporter protein includes glutathione-S-transferase , one or more of horseradish peroxidase, chloramphenicol acetyltransferase, β-galactosidase, β-glucuronidase or autofluorescent protein; preferably, the autofluorescent protein includes green One or more of fluorescent protein, HcRed, DsRed, cyan fluorescent protein, yellow fluorescent protein or blue fluorescent protein; preferably, the DNA binding domain includes one or more of methylation binding protein, LexADBD or Gal4DBD ; Preferably, the epitope tag includes one or more of histidine tag, V5 tag, FLAG tag, influenza virus hemagglutinin tag, Myc tag, VSV-G tag or thioredoxin tag; Preferably, The transcriptional activation domain includes VP64 and/or VPR; preferably, the transcriptional repression domain includes KRAB and/or SID; Preferably, the nuclease includes FokI; Preferably, the deamination domain includes one or more of ADAR1, ADAR2, APOBEC, AID or TAD; Preferably, the cleavage active polypeptide includes single-stranded RNA A polypeptide with cleavage activity, a polypeptide with double-stranded RNA cleavage activity, a polypeptide with single-stranded DNA cleavage activity or a polypeptide with double-stranded DNA cleavage activity; preferably, the ligase includes DNA ligase and/or RNA ligase.
为了实现上述目的,根据本发明的第三个方面,提供了一种DNA分子,该DNA分子编码上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子融合蛋白。In order to achieve the above object, according to the third aspect of the present invention, a DNA molecule encoding the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is provided.
进一步地,DNA分子为根据宿主细胞的密码子偏好性进行密码子优化的DNA分子;优选地,宿主细胞包括原核细胞或真核细胞;优选地,DNA分子包括与SEQ ID NO:6至10中任一项的核苷酸序列具有70%以上,优选90%以上,更优选95%以上,进一步优选99%,更进一步优选为100%同一性的核苷酸。Further, the DNA molecule is a DNA molecule that is codon-optimized according to the codon preference of the host cell; preferably, the host cell includes a prokaryotic cell or a eukaryotic cell; preferably, the DNA molecule includes those in SEQ ID NO: 6 to 10 Any nucleotide sequence has 70% or more, preferably 90% or more, more preferably 95% or more, still more preferably 99%, even more preferably 100% nucleotide identity.
为了实现上述目的,根据本发明的第四个方面,提供了一种重组载体,该重组载体包含上述DNA分子。In order to achieve the above object, according to the fourth aspect of the present invention, a recombinant vector is provided, which contains the above-mentioned DNA molecule.
进一步地,DNA分子与启动子连接;优选地,启动子包括诱导型启动子、组成型启动子或组织特异性启动子中的一种或多种;优选地,启动子包括T7、SP6、T3、CMV、EF1a、SV40、PGK1、humanβ-actin、CAG、U6、H1、T7、T7lac、araBAD、trp、lac或Ptac中的一种或多种;优选地,重组载体包括逆转录病毒载体、慢病毒载体、腺病毒载体、腺相关病毒载体、单纯疱疹载体、质粒载体或噬菌粒载体;优选地,所述重组载体包括质粒载体。Further, the DNA molecule is connected to a promoter; preferably, the promoter includes one or more of an inducible promoter, a constitutive promoter or a tissue-specific promoter; preferably, the promoter includes T7, SP6, T3 , CMV, EF1a, SV40, PGK1, human β-actin, CAG, U6, H1, T7, T7lac, one or more of araBAD, trp, lac or Ptac; preferably, the recombinant vector includes retroviral vector, lentivirus vector, Viral vector, adenovirus vector, adeno-associated virus vector, herpes simplex vector, plasmid vector or phagemid vector; preferably, the recombinant vector includes a plasmid vector.
为了实现上述目的,根据本发明的第五个方面,提供了一种宿主细胞,该宿主细胞转化有上述重组载体。In order to achieve the above object, according to the fifth aspect of the present invention, a host cell is provided, which host cell is transformed with the above recombinant vector.
为了实现上述目的,根据本发明的第六个方面,提供了一种基因编辑系统,该基因编辑系统包括:a)RNA指导物或编码RNA指导物的核酸,RNA指导物包括直接重复序列和间隔子序列,间隔子序列用于与靶核酸杂交;b)上述CRISPR-Cas效应子蛋白,或CRISPR-Cas效应子融合蛋白,或DNA分子,或重组载体,或宿主细胞;DNA分子、重组载体或宿主细胞能够表达CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,在基因编辑系统中,CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,与RNA指导物结合后,靶向作用于间隔子序列与靶核酸杂交形成的杂交序列。In order to achieve the above objects, according to the sixth aspect of the present invention, a gene editing system is provided. The gene editing system includes: a) an RNA guide or a nucleic acid encoding an RNA guide. The RNA guide includes a direct repeat sequence and a spacer. sub-sequence, spacer sequence for hybridization with the target nucleic acid; b) the above-mentioned CRISPR-Cas effector protein, or CRISPR-Cas effector fusion protein, or DNA molecule, or recombinant vector, or host cell; DNA molecule, recombinant vector or The host cell can express CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein. In the gene editing system, the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein combines with the RNA guide to perform the targeting function. The hybridization sequence formed by hybridization between the spacer sequence and the target nucleic acid.
进一步地,基因编辑系统不包含tracrRNA。Further, the gene editing system does not contain tracrRNA.
进一步地,RNA指导物包括1种或多种。Further, the RNA guide includes one or more types.
进一步地,靶核酸包括DNA;优选地,DNA包括来源于真核生物的DNA或来源于原核生物的DNA;优选地,真核生物包括动物或植物;优选地,DNA包括非人类哺乳动物DNA、人类DNA、昆虫DNA、鸟类DNA、爬行动物DNA、两栖动物DNA、啮齿动物DNA、鱼类DNA、蠕虫DNA、线虫DNA或酵母DNA;优选地,非人类哺乳动物DNA包括非人类灵长类DNA。 Further, the target nucleic acid includes DNA; preferably, DNA includes DNA derived from eukaryotes or DNA derived from prokaryotes; preferably, eukaryotes include animals or plants; preferably, DNA includes non-human mammalian DNA, Human DNA, insect DNA, avian DNA, reptile DNA, amphibian DNA, rodent DNA, fish DNA, worm DNA, nematode DNA or yeast DNA; preferably, non-human mammalian DNA includes non-human primate DNA .
进一步地,直接重复序列的3’端包含茎环结构,茎环结构包括依次连接的第一茎核苷酸链、环核苷酸链和第二茎核苷酸链,第一茎核苷酸链和第二茎核苷酸链彼此杂交形成茎环结构的茎,环核苷酸链形成茎环结构的环;优选地,第一茎核苷酸链的长度为5或6个核苷酸;优选地,第二茎核苷酸链的长度为5个核苷酸;优选地,环核苷酸链的长度为6、7或8个核苷酸。Further, the 3' end of the direct repeat sequence includes a stem-loop structure. The stem-loop structure includes a first stem nucleotide chain, a cyclic nucleotide chain and a second stem nucleotide chain connected in sequence. The first stem nucleotide The cyclic nucleotide chain and the second stem nucleotide chain hybridize to each other to form the stem of the stem-loop structure, and the cyclic nucleotide chain forms the ring of the stem-loop structure; preferably, the length of the first stem nucleotide chain is 5 or 6 nucleotides ; Preferably, the length of the second stem nucleotide chain is 5 nucleotides; Preferably, the length of the cyclic nucleotide chain is 6, 7 or 8 nucleotides.
进一步地,茎环结构包括SEQ ID NO:25、28、31、34或37的核苷酸序列。Further, the stem-loop structure includes the nucleotide sequence of SEQ ID NO: 25, 28, 31, 34 or 37.
进一步地,直接重复序列包括与SEQ ID NO:24、27、30、33或36的核苷酸序列具有至少80%同一性的核苷酸序列;优选地,直接重复序列包括与SEQ ID NO:24、27、30、33或36的核苷酸序列具有至少85%以上,更优选90%以上,进一步优选95%以上同一性的核苷酸序列;优选地,直接重复序列包括SEQ ID NO:24、27、30、33或36的核苷酸序列。Further, the direct repeat sequence includes a nucleotide sequence having at least 80% identity with the nucleotide sequence of SEQ ID NO: 24, 27, 30, 33 or 36; preferably, the direct repeat sequence includes a nucleotide sequence with SEQ ID NO: The nucleotide sequence of 24, 27, 30, 33 or 36 has at least 85% or more, more preferably more than 90%, further preferably more than 95% identity; preferably, the direct repeat sequence includes SEQ ID NO: 24, 27, 30, 33 or 36 nucleotide sequence.
进一步地,间隔子序列的80%以上与靶核酸互补;优选地,间隔子序列的90%以上,更优选95%以上,进一步优选99%以上,更进一步优选100%与靶核酸互补;优选地,间隔子序列的长度为18-41nt;优选地,间隔子序列的长度为18-37nt;优选地,间隔子序列长度为18-26或34-36nt;优选地,间隔子序列长度为20nt。Further, more than 80% of the spacer sequence is complementary to the target nucleic acid; preferably, more than 90% of the spacer sequence is more than 95%, more preferably more than 99%, and even more preferably 100% is complementary to the target nucleic acid; preferably , the length of the spacer sequence is 18-41nt; preferably, the length of the spacer sequence is 18-37nt; preferably, the length of the spacer sequence is 18-26 or 34-36nt; preferably, the length of the spacer sequence is 20nt.
进一步地,直接重复序列包括第一直接重复序列和第二直接重复序列;优选地,RNA指导物包括按顺序依次连接的第一直接重复序列、间隔子序列以及第二直接重复序列;优选地,第一直接重复序列与第二直接重复序列相同。Further, the direct repeat sequence includes a first direct repeat sequence and a second direct repeat sequence; preferably, the RNA guide includes a first direct repeat sequence, a spacer sequence and a second direct repeat sequence connected in sequence; preferably, The first direct repeat sequence is the same as the second direct repeat sequence.
进一步地,靶核酸包含前间隔子相邻基序,CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白能够识别前间隔子相邻基序,前间隔子相邻基序包含核酸序列5’-TTN-3’,其中N是任何核苷酸;优选地,N为A、C或T。Further, the target nucleic acid includes a pre-spacer adjacent motif, the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein can recognize the pre-spacer adjacent motif, and the pre-spacer adjacent motif includes the nucleic acid sequence 5' -TTN-3', where N is any nucleotide; preferably, N is A, C or T.
进一步地,CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,与RNA指导物结合,形成蛋白-核酸复合物;优选地,蛋白-核酸复合物是非天然存在的或经修饰的;优选地,蛋白-核酸复合物中的至少一个组分是非天然存在的或经修饰的。Further, the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is combined with the RNA guide to form a protein-nucleic acid complex; preferably, the protein-nucleic acid complex is non-naturally occurring or modified; preferably , at least one component of the protein-nucleic acid complex is non-naturally occurring or modified.
进一步地,通过CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白和RNA指导物对靶核酸的靶向作用,对靶核酸进行修饰;优选地,修饰包括切割或切口;优选地,修饰导致:(1)细胞包含至少一种基因产物表达的改变;或(2)细胞包含至少一种基因产物的表达的改变,其中至少一种基因产物的表达增加;或(3)细胞包含至少一种基因产物的表达的改变,其中至少一种基因产物的表达减少;或(4)细胞包含经编辑的基因组;优选地,修饰导致细胞毒性;优选地,上述修饰导致抑制基因表达、降低基因表达或增强基因表达。Further, the target nucleic acid is modified through the targeting effect of the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein and the RNA guide on the target nucleic acid; preferably, the modification includes cutting or nicking; preferably, the modification results in (1) the cell contains an alteration in the expression of at least one gene product; or (2) the cell contains an alteration in the expression of at least one gene product, wherein the expression of at least one gene product is increased; or (3) the cell contains at least one Alteration in the expression of gene products, wherein the expression of at least one gene product is reduced; or (4) the cell contains an edited genome; preferably, the modification results in cytotoxicity; preferably, the above modification results in inhibition of gene expression, reduction in gene expression, or Enhance gene expression.
进一步地,基因编辑系统包括目标核酸或编码目标核酸的核酸目标核酸包括同源臂片段和供体模板核酸;优选地,目标核酸包含能够与间隔子序列杂交的序列;优选地,同源臂片段包括5’同源臂和3’同源臂,目标核酸由5’同源臂、供体模板核酸和3’同源臂顺序连接组成。Further, the gene editing system includes a target nucleic acid or a nucleic acid encoding the target nucleic acid. The target nucleic acid includes a homology arm fragment and a donor template nucleic acid; preferably, the target nucleic acid includes a sequence capable of hybridizing with the spacer sequence; preferably, the homology arm fragment It includes a 5' homology arm and a 3' homology arm. The target nucleic acid is composed of a 5' homology arm, a donor template nucleic acid and a 3' homology arm sequentially connected.
进一步地,基因编辑系统以可递送的形式存在,利用递送系统使基因编辑系统与靶核酸接触;优选地,递送系统将基因编辑系统递送入含有靶核酸的细胞中;优选地,可递送的形 式包括纳米颗粒、脂质体、外泌体、微泡、蛋白衣壳或基因枪所用的颗粒。Further, the gene editing system exists in a deliverable form, and the delivery system is used to bring the gene editing system into contact with the target nucleic acid; preferably, the delivery system delivers the gene editing system into cells containing the target nucleic acid; preferably, the delivery system is in a deliverable form. Formulas include nanoparticles, liposomes, exosomes, microvesicles, protein capsids or particles used in gene guns.
为了实现上述目的,根据本发明的第七个方面,提供了一种基因编辑载体,该基因编辑载体包含上述编码RNA指导物的核酸。In order to achieve the above object, according to the seventh aspect of the present invention, a gene editing vector is provided, which contains the above-mentioned nucleic acid encoding an RNA guide.
进一步地,基因编辑载体还包含上述DNA分子;优选地,DNA分子与编码RNA指导物的核酸位于相同或不同的载体上;优选地,DNA分子与第一调节元件连接;优选地,编码RNA指导物的核酸与第二调节元件连接;优选地,第一调节元件和第二调节元件分别独立选自诱导型启动子、组成型启动子或组织特异性启动子中的一种或多种;优选地,第一调节元件和第二调节元件分别独立选自T7、SP6、T3、CMV、EF1a、SV40、PGK1、humanβ-actin、CAG、U6、H1、T7、T7lac、araBAD、trp、lac或Ptac中的一种或多种。Further, the gene editing vector also contains the above-mentioned DNA molecule; preferably, the DNA molecule and the nucleic acid encoding the RNA guide are located on the same or different vectors; preferably, the DNA molecule is connected to the first regulatory element; preferably, the nucleic acid encoding the RNA guide The nucleic acid of the object is connected to the second regulatory element; preferably, the first regulatory element and the second regulatory element are independently selected from one or more of inducible promoters, constitutive promoters or tissue-specific promoters; preferably Ground, the first regulatory element and the second regulatory element are independently selected from T7, SP6, T3, CMV, EF1a, SV40, PGK1, human β-actin, CAG, U6, H1, T7, T7lac, araBAD, trp, lac or Ptac one or more of them.
为了实现上述目的,根据本发明的第八个方面,提供了一种上述基因编辑系统与细胞中靶核酸结合的方法,该方法包括:将基因编辑系统递送至细胞中,细胞包括靶核酸;使CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,与RNA指导物结合,使间隔子序列与靶核酸结合。In order to achieve the above object, according to the eighth aspect of the present invention, a method for combining the above gene editing system with a target nucleic acid in a cell is provided. The method includes: delivering the gene editing system to the cell, and the cell includes the target nucleic acid; CRISPR-Cas effector proteins or CRISPR-Cas effector fusion proteins combine with the RNA guide to allow the spacer sequence to bind to the target nucleic acid.
进一步地,靶核酸为双链DNA或单链DNA;优选地,基因编辑系统与细胞中靶核酸的结合,导致靶核酸的表达状态改变;优选地,基因编辑系统与细胞中靶核酸的结合,导致靶核酸被切割;优选地,靶核酸被切割导致靶核酸破坏、或靶核酸特定位点替换、或靶核酸位点的移除、或靶核酸区域功能的改变、或靶核酸上两个位点之间的序列倒置。Further, the target nucleic acid is double-stranded DNA or single-stranded DNA; preferably, the combination of the gene editing system and the target nucleic acid in the cell results in a change in the expression state of the target nucleic acid; preferably, the combination of the gene editing system and the target nucleic acid in the cell, Cause the target nucleic acid to be cleaved; preferably, the target nucleic acid is cleaved to cause the destruction of the target nucleic acid, or the replacement of a specific site of the target nucleic acid, or the removal of the target nucleic acid site, or the change of the function of the target nucleic acid region, or two positions on the target nucleic acid. The sequence between points is inverted.
为了实现上述目的,根据本发明的第九个方面,提供了一种含有基因编辑系统的细胞,该含有基因编辑系统的细胞包括上述基因编辑系统、或基因编辑载体。In order to achieve the above object, according to the ninth aspect of the present invention, a cell containing a gene editing system is provided. The cell containing a gene editing system includes the above gene editing system or a gene editing vector.
进一步地,含有基因编辑系统的细胞包含经修饰的目的靶基因座,目的靶基因座为利用基因编辑系统修饰的基因座;优选地,目的靶基因座的修饰导致:(1)含有基因编辑系统的细胞包含至少一种基因产物的表达的改变;或(2)含有基因编辑系统的细胞包含至少一种基因产物的表达的改变,其中至少一种基因产物的表达增加;或(3)含有基因编辑系统的细胞包含至少一种基因产物的表达的改变,其中至少一种基因产物的表达减少;或(4)含有基因编辑系统的细胞包含经编辑的基因组;优选地,含有基因编辑系统的细胞包括真核细胞或原核细胞;优选地,真核细胞包括动物细胞、植物细胞或人类细胞;优选地,动物细胞包括哺乳动物细胞。Further, the cell containing the gene editing system contains a modified target target locus, and the target target locus is a locus modified by the gene editing system; preferably, the modification of the target target locus results in: (1) containing the gene editing system The cell contains an alteration in the expression of at least one gene product; or (2) the cell containing the gene editing system contains an alteration in the expression of at least one gene product, wherein the expression of at least one gene product is increased; or (3) the cell containing the gene The cell containing the editing system contains an altered expression of at least one gene product, wherein the expression of at least one gene product is reduced; or (4) the cell containing the gene editing system contains an edited genome; preferably, the cell containing the gene editing system It includes eukaryotic cells or prokaryotic cells; preferably, eukaryotic cells include animal cells, plant cells or human cells; preferably, animal cells include mammalian cells.
为了实现上述目的,根据本发明的第十个方面,提供了一种靶向和编辑靶核酸的方法,该方法包括使靶核酸与上述基因编辑系统接触。In order to achieve the above object, according to the tenth aspect of the present invention, a method for targeting and editing a target nucleic acid is provided, which method includes contacting the target nucleic acid with the above-mentioned gene editing system.
为了实现上述目的,根据本发明的第十一个方面,提供了一种在识别靶核酸后非特异性降解单链DNA的方法,该方法包括使靶核酸与上述基因编辑系统接触。In order to achieve the above object, according to an eleventh aspect of the present invention, a method for non-specific degradation of single-stranded DNA after identifying a target nucleic acid is provided, which method includes contacting the target nucleic acid with the above-mentioned gene editing system.
为了实现上述目的,根据本发明的第十二个方面,提供了一种在识别双链靶DNA的间隔子互补链后靶向双链靶DNA的非间隔子互补链并使其产生切口的方法,该方法包括使双链靶DNA与上述基因编辑系统接触。 In order to achieve the above object, according to a twelfth aspect of the present invention, there is provided a method of targeting and nicking the non-spacer complementary strand of the double-stranded target DNA after identifying the spacer complementary strand of the double-stranded target DNA. , the method includes contacting double-stranded target DNA with the above-mentioned gene editing system.
为了实现上述目的,根据本发明的第十三个方面,提供了一种靶向和切割双链靶DNA的方法,该方法包括使双链靶DNA与上述基因编辑系统接触。In order to achieve the above object, according to the thirteenth aspect of the present invention, a method for targeting and cutting double-stranded target DNA is provided, which method includes contacting the double-stranded target DNA with the above-mentioned gene editing system.
进一步地,在使双链DNA的间隔子互补链产生切口之前,使双链靶DNA的非间隔子互补链产生切口。Further, before nicking the spacer complementary strand of the double-stranded DNA, the non-spacer complementary strand of the double-stranded target DNA is nicked.
为了实现上述目的,根据本发明的第十四个方面,提供了一种特异性编辑双链核酸的方法,该方法包括在充分的条件下使以下进行接触充分的时间量,(1)上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子融合蛋白、另一具有序列特异性切口活性的酶,以及RNA指导物,RNA指导物指导CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,相对于另一序列特异性切口酶的活性使相对链产生切口;以及(2)双链核酸;方法导致双链断裂的形成。In order to achieve the above object, according to the fourteenth aspect of the present invention, there is provided a method for specifically editing double-stranded nucleic acids, which method includes contacting the following under sufficient conditions for a sufficient amount of time, (1) the above-mentioned CRISPR - a Cas effector protein, or a CRISPR-Cas effector fusion protein, another enzyme with sequence-specific nicking activity, and an RNA guide that directs the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein, The activity of a sequence-specific nicking enzyme relative to another nicks the opposing strand; and (2) a double-stranded nucleic acid; the method results in the formation of a double-stranded break.
为了实现上述目的,根据本发明的第十五个方面,提供了一种编辑双链核酸的方法,该方法包括在充分的条件下使以下进行接触充分的时间量:(1)上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子融合蛋白,和具有DNA修饰活性的蛋白质结构域的融合蛋白,以及靶向双链核酸的RNA指导物;以及(2)双链核酸;融合蛋白的CRISPR-Cas效应子被修饰以使双链核酸的非靶链产生切口。In order to achieve the above object, according to the fifteenth aspect of the present invention, a method for editing double-stranded nucleic acid is provided, which method includes contacting the following under sufficient conditions for a sufficient amount of time: (1) the above-mentioned CRISPR-Cas An effector protein, or a CRISPR-Cas effector fusion protein, and a fusion protein of a protein domain with DNA modification activity, and an RNA guide targeting double-stranded nucleic acid; and (2) double-stranded nucleic acid; CRISPR-Cas fusion protein Cas effectors are modified to nick the non-target strand of a double-stranded nucleic acid.
进一步地,双链核酸的两条链在不同的位点被切割,导致交错切割;优选地,双链核酸的两条链在同一位点被切割,导致平双链断裂。Further, the two strands of the double-stranded nucleic acid are cleaved at different sites, resulting in staggered cleavage; preferably, the two strands of the double-stranded nucleic acid are cleaved at the same site, resulting in a flat double-stranded break.
为了实现上述目的,根据本发明的第十六个方面,提供了一种靶向并切割单链靶DNA的方法,该方法包括使靶核酸与上述基因编辑系统接触。In order to achieve the above object, according to the sixteenth aspect of the present invention, a method for targeting and cutting single-stranded target DNA is provided, which method includes contacting the target nucleic acid with the above-mentioned gene editing system.
为了实现上述目的,根据本发明的第十七个方面,提供了一种诱导细胞状态改变的方法,该方法包括使上述基因编辑系统与细胞中的靶核酸接触。In order to achieve the above object, according to the seventeenth aspect of the present invention, a method for inducing cell state changes is provided, which method includes contacting the above gene editing system with a target nucleic acid in the cell.
进一步地,细胞状态包括凋亡或休眠;优选地,细胞包括真核细胞或原核细胞;优选地,细胞包括哺乳动物细胞或植物病变细胞;优选地,细胞包括癌细胞;优选地,细胞包括感染性细胞或被感染原感染的细胞;优选地,细胞包括被病毒感染的细胞、被朊病毒感染的细胞;优选地,细胞包括真菌细胞、原生动物或寄生虫细胞。Further, the cell state includes apoptosis or dormancy; preferably, the cells include eukaryotic cells or prokaryotic cells; preferably, the cells include mammalian cells or plant disease cells; preferably, the cells include cancer cells; preferably, the cells include infection Sexual cells or cells infected by an infectious agent; preferably, the cells include virus-infected cells, prion-infected cells; preferably, the cells include fungal cells, protozoa or parasite cells.
为了实现上述目的,根据本发明的第十八个方面,提供了一种上述基因编辑系统在制备治疗受试者病症或疾病的药物中的应用。In order to achieve the above objects, according to the eighteenth aspect of the present invention, there is provided an application of the above-mentioned gene editing system in preparing drugs for treating conditions or diseases in subjects.
进一步地,应用包括向受试者或受试者的离体细胞施用基因编辑系统;优选地,间隔子序列与跟病症或疾病相关的靶核酸的至少15个核苷酸互补,CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白切割靶核酸;优选地,病症或疾病包括癌症、感染性疾病、代谢性疾病或遗传性疾病;优选地,癌症包括维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱 癌中的一种或多种;优选地,代谢性疾病包括家族性高胆固醇血症(FH);优选地,遗传性疾病包括转甲状腺素蛋白淀粉样变(ATTR)、原发性高草尿酸症(PH1)、遗传性血管性水肿(HAE);优选地,病症或疾病包括囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种;优选地,感染性疾病的感染原包括人类免疫缺陷病毒、单纯疱疹病毒-1、乙型肝炎(HEPATITIS B)或单纯疱疹病毒-2中的一种或多种。Further, the application includes administering the gene editing system to a subject or ex vivo cells of a subject; preferably, the spacer sequence is complementary to at least 15 nucleotides of the target nucleic acid associated with the disorder or disease, the CRISPR-Cas effect The daughter protein or CRISPR-Cas effector fusion protein cleaves the target nucleic acid; Preferably, the disorder or disease includes cancer, infectious disease, metabolic disease or genetic disease; Preferably, the cancer includes Wilms tumor, Ewing sarcoma, neuroendocrine disease tumor, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, kidney cancer, pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrium Cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphoblastic leukemia, acute myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia Leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder One or more cancers; preferably, the metabolic disease includes familial hypercholesterolemia (FH); preferably, the genetic disease includes transthyretin amyloidosis (ATTR), primary hyperoxaluric acid syndrome (PH1), hereditary angioedema (HAE); preferably, the disorder or disease includes cystic fibrosis, progressive Duchenne muscular dystrophy, Becker muscular dystrophy, alpha-1-antitrypsin deficiency, Pandora's disease Bayi disease, myotonic dystrophy, Huntington's disease, fragile X syndrome, Friedreich's ataxia, amyotrophic lateral sclerosis, frontotemporal dementia, hereditary chronic kidney disease, hyperlipidemia, high cholesterol One or more of Leber's congenital amaurosis, sickle cell disease or beta thalassemia; preferably, the infectious agent of the infectious disease includes human immunodeficiency virus, herpes simplex virus-1, type B One or more of hepatitis (HEPATITIS B) or herpes simplex virus-2.
为了实现上述目的,根据本发明的第十九个方面,提供了一种真核细胞系,该真核细胞系包含上述含有基因编辑系统的细胞,或为含有基因编辑系统的细胞的后代。In order to achieve the above object, according to the nineteenth aspect of the present invention, a eukaryotic cell line is provided, which eukaryotic cell line contains the above-mentioned cells containing the gene editing system, or is the progeny of the cells containing the gene editing system.
为了实现上述目的,根据本发明的第二十个方面,提供了一种多细胞生物体,该多细胞生物体包含上述含有基因编辑系统的细胞。In order to achieve the above object, according to the twentieth aspect of the present invention, a multicellular organism is provided, which includes the above-mentioned cell containing the gene editing system.
进一步地,多细胞生物体包括模型动物或模型植物。Further, multicellular organisms include model animals or model plants.
为了实现上述目的,根据本发明的第二十一个方面,提供了一种获得目的性状的植物的方法,利用上述基因编辑系统与植物细胞接触,对植物细胞的基因进行修饰或引入目的基因,修饰或目的基因能够表达目的性状,获得修饰后的植物细胞,利用修饰后的植物细胞进行再生,获得目的性状的植物。In order to achieve the above object, according to the twenty-first aspect of the present invention, a method for obtaining plants with target traits is provided, using the above gene editing system to contact plant cells, modifying the genes of the plant cells or introducing the target genes, The modified or target gene can express the target traits, obtain modified plant cells, and use the modified plant cells for regeneration to obtain plants with the target traits.
为了实现上述目的,根据本发明的第二十二个方面,提供了一种鉴定植物中目的性状的方法,植物细胞中的目的基因能够表达目的性状,利用上述基因编辑系统与植物细胞接触,从而鉴定目的基因。In order to achieve the above object, according to the twenty-second aspect of the present invention, a method for identifying target traits in plants is provided. The target genes in plant cells can express the target traits, and the above gene editing system is used to contact the plant cells, thereby Identify the gene of interest.
为了实现上述目的,根据本发明的第二十三个方面,提供了一种试剂盒,该试剂盒包括一种或多种选自下列的组分:上述CRISPR-Cas效应子蛋白、DNA分子、重组载体、宿主细胞、基因编辑系统、基因编辑载体、含有基因编辑系统的细胞、真核细胞系、或多细胞生物体;试剂盒的组分在相同或不同的容器中。In order to achieve the above object, according to the twenty-third aspect of the present invention, a kit is provided, which kit includes one or more components selected from the following: the above-mentioned CRISPR-Cas effector protein, DNA molecules, Recombinant vector, host cell, gene editing system, gene editing vector, cell containing the gene editing system, eukaryotic cell line, or multicellular organism; the components of the kit are in the same or different containers.
为了实现上述目的,根据本发明的第二十四个方面,提供了一种容器,该容器包含上述试剂盒。In order to achieve the above object, according to the twenty-fourth aspect of the present invention, a container is provided, which container contains the above-mentioned kit.
进一步地,容器包括无菌容器;优选地,容器包括注射器。Further, the container includes a sterile container; preferably, the container includes a syringe.
为了实现上述目的,根据本发明的第二十五个方面,提供了一种可植入装置,该可植入装置包括上述基因编辑系统。In order to achieve the above object, according to the twenty-fifth aspect of the present invention, an implantable device is provided, which includes the above gene editing system.
进一步地,基因编辑系统在基质内;优选地,基因编辑系统在储库内。Further, the gene editing system is in the matrix; preferably, the gene editing system is in the reservoir.
根据本发明的第二十六个方面,提供了一种治疗有需要的受试者的疾病或病症的方法,所述方法包括向受试者施用如下任意一种或多种: According to a twenty-sixth aspect of the present invention, there is provided a method of treating a disease or condition in a subject in need thereof, the method comprising administering to the subject any one or more of the following:
上述CRISPR-Cas效应子蛋白、或上述CRISPR-Cas效应子蛋白的衍生物或上述CRISPR-Cas效应子蛋白的功能片段,以及异源功能结构域,以及RNA指导物或编码所述RNA指导物的核酸;The above-mentioned CRISPR-Cas effector protein, or a derivative of the above-mentioned CRISPR-Cas effector protein or a functional fragment of the above-mentioned CRISPR-Cas effector protein, as well as a heterologous functional domain, and an RNA guide or an RNA guide encoding the RNA guide nucleic acid;
上述CRISPR-Cas效应子融合蛋白,以及RNA指导物或编码所述RNA指导物的核酸;The above-mentioned CRISPR-Cas effector fusion protein, and an RNA guide or a nucleic acid encoding the RNA guide;
上述DNA分子;The above-mentioned DNA molecules;
上述重组载体;The above recombinant vector;
上述基因编辑系统;The above gene editing system;
上述基因编辑载体;The above gene editing vector;
上述含有基因编辑系统的细胞;The above-mentioned cells containing gene editing systems;
上述真核细胞系;The above eukaryotic cell lines;
上述多细胞生物体;The above-mentioned multicellular organisms;
上述试剂盒;The above-mentioned kit;
上述容器;或上述可植入装置。The above-mentioned container; or the above-mentioned implantable device.
进一步地,所述受试者包括患有病症或疾病的受试者,所述病症或疾病包括癌症、代谢性疾病或遗传性疾病或感染性疾病;Further, the subject includes a subject suffering from a condition or disease, including cancer, metabolic disease or genetic disease or infectious disease;
进一步地,所述癌症包括维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱癌中的一种或多种;Further, the cancer includes Wilms tumor, Ewing sarcoma, neuroendocrine tumor, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, Kidney cancer, pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphocyte One or more of leukemia, acute myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
进一步地,代谢性疾病包括家族性高胆固醇血症(FH);Further, metabolic diseases include familial hypercholesterolemia (FH);
进一步地,遗传性疾病包括转甲状腺素蛋白淀粉样变(ATTR)、原发性高草尿酸症(PH1)、遗传性血管性水肿(HAE);Further, genetic diseases include transthyretin amyloidosis (ATTR), primary hyperoxaluric acidosis (PH1), and hereditary angioedema (HAE);
进一步地,所述感染性疾病的感染原包括人类免疫缺陷病毒、单纯疱疹病毒-1、乙型肝炎(HEPATITIS B)或单纯疱疹病毒-2中的一种或多种;Further, the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B (HEPATITIS B) or herpes simplex virus-2;
进一步地,所述病症或疾病包括囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种。 Further, the conditions or diseases include cystic fibrosis, progressive Duchenne muscular dystrophy, Baker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy, Huntington's disease, Fragile One or more of syncytosis or beta thalassemia.
根据本发明的第二十七个方面,提供了上述基因编辑系统在治疗病症或疾病中的用途。According to a twenty-seventh aspect of the present invention, there is provided the use of the above gene editing system in treating disorders or diseases.
进一步地,所述病症或疾病包括癌症、感染性疾病、代谢性疾病或遗传性疾病;Further, the conditions or diseases include cancer, infectious diseases, metabolic diseases or genetic diseases;
所述癌症包括维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱癌中的一种或多种;The cancers include Wilms tumor, Ewing sarcoma, neuroendocrine tumors, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, kidney cancer, Pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphoblastic leukemia, acute One or more of myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
进一步地,所述感染性疾病的感染原包括人类免疫缺陷病毒、单纯疱疹病毒-1、乙型肝炎(HEPATITIS B)或单纯疱疹病毒-2中的一种或多种;Further, the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B (HEPATITIS B) or herpes simplex virus-2;
进一步地,所述代谢性疾病包括家族性高胆固醇血症(FH);Further, the metabolic disease includes familial hypercholesterolemia (FH);
进一步地,所述遗传性疾病包括转甲状腺素蛋白淀粉样变(ATTR)、原发性高草尿酸症(PH1)、遗传性血管性水肿(HAE);Further, the genetic diseases include transthyretin amyloidosis (ATTR), primary hyperoxaluric acidosis (PH1), and hereditary angioedema (HAE);
进一步地,所述病症或疾病包括囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种。Further, the conditions or diseases include cystic fibrosis, progressive Duchenne muscular dystrophy, Baker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy, Huntington's disease, Fragile One or more of syncytosis or beta thalassemia.
应用本发明的技术方案,利用上述CasY1、CasY2、CasY3、CasY4和CasY5,均能够发挥Cas蛋白的切割活性,相较于现有的CRISPR/Cas系统切割效率高。Applying the technical solution of the present invention, the above-mentioned CasY1, CasY2, CasY3, CasY4 and CasY5 can all exert the cleavage activity of Cas protein, and the cleavage efficiency is higher than that of the existing CRISPR/Cas system.
附图说明Description of the drawings
构成本申请的一部分的说明书附图用来提供对本发明的进一步理解,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:The description and drawings that constitute a part of this application are used to provide a further understanding of the present invention. The illustrative embodiments of the present invention and their descriptions are used to explain the present invention and do not constitute an improper limitation of the present invention. In the attached picture:
图1示出了根据本发明实施例1的CasY1-CasY5的蛋白结构域示意图,其中D、E、D代表RuvC结构域的三个保守基序I、II、III的催化残基,h表示桥螺旋结构。Figure 1 shows a schematic diagram of the protein domain of CasY1-CasY5 according to Embodiment 1 of the present invention, in which D, E, and D represent the catalytic residues of the three conserved motifs I, II, and III of the RuvC domain, and h represents the bridge. spiral structure.
图2示出了根据本发明实施例1的CasY1、CasY2、CasY3、CasY4和CasY5的直接重复序列的二级结构分析结果示意图。Figure 2 shows a schematic diagram of the secondary structure analysis results of the direct repeat sequences of CasY1, CasY2, CasY3, CasY4 and CasY5 according to Embodiment 1 of the present invention.
图3示出了根据本发明实施例2的体内筛选效应子和文库质粒设计的示意图。Figure 3 shows a schematic diagram of in vivo screening of effectors and library plasmid design according to Example 2 of the present invention.
图4示出了根据本发明实施例2的负选择筛选工作流程的示意图。Figure 4 shows a schematic diagram of the negative selection screening workflow according to Embodiment 2 of the present invention.
图5示出了根据本发明实施例2的CasY1、CasY2、CasY3、CasY4、CasY5的PAM结构域分析结果图。 Figure 5 shows the PAM domain analysis results of CasY1, CasY2, CasY3, CasY4, and CasY5 according to Embodiment 2 of the present invention.
图6示出了根据本发明实施例3的CasY1、CasY2、CasY3、CasY4、CasY5靶向切割质粒示意图。Figure 6 shows a schematic diagram of CasY1, CasY2, CasY3, CasY4 and CasY5 targeted cutting plasmids according to Embodiment 3 of the present invention.
图7示出了根据本发明实施例3的CasY1、CasY2、CasY3、CasY4、CasY5的细菌切割结果图。Figure 7 shows the bacterial cleavage results of CasY1, CasY2, CasY3, CasY4 and CasY5 according to Example 3 of the present invention.
图8示出了根据本发明实施例4的CasY1、CasY2、CasY3、CasY4、CasY5的体外切割结果图,其中,图a示出了CasY1的体外切割结果图,图b示出了CasY2的体外切割结果图,图c示出了CasY3的体外切割结果图,图d示出了CasY4的体外切割结果图,图e示出了CasY5的体外切割结果图。Figure 8 shows the in vitro cleavage results of CasY1, CasY2, CasY3, CasY4, and CasY5 according to Embodiment 4 of the present invention. Figure a shows the in vitro cleavage results of CasY1, and figure b shows the in vitro cleavage of CasY2. Figure c shows the in vitro cleavage results of CasY3, figure d shows the in vitro cleavage results of CasY4, and figure e shows the in vitro cleavage results of CasY5.
图9示出了根据本发明实施例5的CasY1、CasY2、CasY3、CasY4、CasY5和Lbcpf1对293T细胞不同靶基因切割活性统计图。Figure 9 shows a statistical graph of the cleavage activities of CasY1, CasY2, CasY3, CasY4, CasY5 and Lbcpf1 on different target genes in 293T cells according to Example 5 of the present invention.
具体实施方式Detailed ways
需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。下面将结合实施例来详细说明本发明。It should be noted that, as long as there is no conflict, the embodiments and features in the embodiments of this application can be combined with each other. The present invention will be described in detail below with reference to examples.
术语解释:Terminology explanation:
本发明所用的术语“切割”是指由本文所述CRISPR系统的核酸酶产生的靶核酸中的DNA断裂。在一些实施例中,切割事件是双链DNA断裂。在一些实施例中,切割是单链DNA断裂。在一些实施例中,切口是双链DNA的某一条DNA链断裂。The term "cleavage" as used herein refers to a DNA break in a target nucleic acid produced by the nuclease of the CRISPR system described herein. In some embodiments, the cleavage event is a double-stranded DNA break. In some embodiments, the cleavage is a single-stranded DNA break. In some embodiments, the nick is a break in one of the DNA strands of the double-stranded DNA.
本发明中使用的术语“CRISPR-Cas系统”是指V-I型CRISPR-Cas效应子蛋白(即,Cas 12i效应子蛋白)和一种或多种RNA指导物,和/或编码上述CRISPR-Cas效应子蛋白或一种或多种RNA指导物的核酸,以及任选地与CRISPR-Cas效应子的表达或与RNA指导物或两者可操作地连接的启动子。The term "CRISPR-Cas system" as used in the present invention refers to type V-I CRISPR-Cas effector proteins (i.e., Cas 12i effector proteins) and one or more RNA guides, and/or encodes the CRISPR-Cas effectors described above A nucleic acid containing a daughter protein or one or more RNA guides, and optionally a promoter operably linked to expression of a CRISPR-Cas effector or to the RNA guide or both.
本发明中使用的术语“直接重复序列”或“直接重复”是指多个短的直接重复序列,其在CRISPR阵列中显示出非常小的序列变化或没有序列变化。适当地,Cas12i的直接重复序列可以形成茎环结构。The term "direct repeat" or "direct repeat" as used in the present invention refers to multiple short direct repeats that show very little or no sequence change in a CRISPR array. Appropriately, the direct repeats of Cas12i can form a stem-loop structure.
本发明所使用的术语“茎环结构”是指具有二级结构的核酸,所述二级结构包括已知或预测形成双链(茎部分)的核苷酸区域,所述双链(茎部分)在一侧由主要为单链核苷酸的区域(环部分)连接。术语“发夹”和“折回”结构在本发明中也用于指茎环结构。这样的结构在本领域中是公知的,并且这些术语与其在本领域中的公知含义一致地使用。如本领域已知的,茎环结构不需要精确的碱基配对。因此,茎可以包括一个或多个碱基错配。The term "stem-loop structure" used in the present invention refers to a nucleic acid having a secondary structure that includes a nucleotide region known or predicted to form a double-stranded (stem portion). ) are connected on one side by a region (loop portion) of mostly single-stranded nucleotides. The terms "hairpin" and "turn-back" structures are also used herein to refer to stem-loop structures. Such structures are well known in the art, and these terms are used consistent with their commonly known meanings in the art. As is known in the art, stem-loop structures do not require precise base pairing. Thus, a stem may include one or more base mismatches.
可替代地,碱基配对可以是精确的,即不包括任何错配。直接重复序列具有茎环结构。RNA指导物中包含的直接重复的茎由5个互相杂交的互补核碱基组成,并且环长度是6、7或9个核苷酸。 Alternatively, base pairing may be exact, ie, not include any mismatches. Direct repeat sequences have a stem-loop structure. The directly repeated stems included in the RNA guides consist of five complementary nucleobases that hybridize to each other, and the loop length is 6, 7, or 9 nucleotides.
发明中使用的术语“CRISPR RNA”、“crRNA”或是指包含由CRISPR效应子用于靶向特定核酸序列的指导物序列的RNA分子。典型地,crRNA包含介导靶识别的间隔子序列和与CRISPR-Cas效应子蛋白形成复合物的直接重复序列(或称为直接重复或“DR”序列)。The terms "CRISPR RNA", "crRNA" or "crRNA" as used in the invention refer to RNA molecules containing guide sequences used by CRISPR effectors to target specific nucleic acid sequences. Typically, crRNA contains spacer sequences that mediate target recognition and direct repeat sequences (also known as direct repeat or "DR" sequences) that form complexes with CRISPR-Cas effector proteins.
本发明中所用的术语“供体模板核酸”是指在本文所述的CRISPR酶改变了靶核酸之后,一种或多种细胞蛋白质可以使用其来改变靶核酸的结构的核酸分子。The term "donor template nucleic acid" as used herein refers to a nucleic acid molecule that can be used by one or more cellular proteins to alter the structure of the target nucleic acid after the CRISPR enzyme described herein has altered the target nucleic acid.
在一些实施例中,供体模板核酸是双链核酸。In some embodiments, the donor template nucleic acid is a double-stranded nucleic acid.
在一些实施例中,供体模板核酸是单链核酸。In some embodiments, the donor template nucleic acid is a single-stranded nucleic acid.
在一些实施例中,供体模板核酸是线性的。In some embodiments, the donor template nucleic acid is linear.
在一些实施例中,供体模板核酸是环状的(例如,环状质粒)。在一些实施例中,供体模板核酸是外源核酸分子。在一些实施例中,供体模板核酸是内源核酸分子(例如,染色体)。In some embodiments, the donor template nucleic acid is circular (eg, a circular plasmid). In some embodiments, the donor template nucleic acid is an exogenous nucleic acid molecule. In some embodiments, the donor template nucleic acid is an endogenous nucleic acid molecule (eg, a chromosome).
本发明中使用的术语“CRISPR-Cas效应子”、“CRISPR效应子”、“效应子”、“CRISPR相关蛋白”或“CRISPR酶”、“CRISPR-Cas效应子蛋白”、或“Cas效应子蛋白”是指执行酶活性或结合RNA指导物指定的核酸上的靶位点的蛋白。As used in the present invention, the terms "CRISPR-Cas effector", "CRISPR effector", "effector", "CRISPR-related protein" or "CRISPR enzyme", "CRISPR-Cas effector protein", or "Cas effector" "Protein" refers to a protein that performs an enzymatic activity or binds to a target site on a nucleic acid specified by an RNA guide.
在CRISPR-Cas系统内相关联的CRISPR-Cas效应子蛋白在本发明中也可称为“Cas”或“Cas酶”、“Cas蛋白”。Cas酶可以识别与靶DNA附近相关联的短基序,称为前间隔子相邻基序(PAM)。The CRISPR-Cas effector protein associated within the CRISPR-Cas system may also be referred to as "Cas" or "Cas enzyme" or "Cas protein" in the present invention. Cas enzymes can recognize short motifs associated near target DNA, called prespacer adjacent motifs (PAMs).
本发明中的CasY1、CasY2、CasY3、CasY4和CasY5蛋白可以识别包含TTN或由TTN组成的PAM,其中N表示任何核苷酸。CasY1, CasY2, CasY3, CasY4 and CasY5 proteins in the present invention can recognize PAMs containing or consisting of TTN, where N represents any nucleotide.
例如,PAM可以是TTA、TTC、TTT或TTG。For example, PAM can be TTA, TTC, TTT or TTG.
在一些实施例中,CRISPR-Cas效应子蛋白具有核酸内切酶活性、切口酶活性和/或核酸外切酶活性。In some embodiments, the CRISPR-Cas effector protein has endonuclease activity, nickase activity, and/or exonuclease activity.
本发明所使用的术语“RNA指导物”是指促进本发明中所述蛋白质靶向于靶核酸的任何RNA分子。示例性“RNA指导物”包括但不限于crRNA、前crRNA(例如DR-间隔子-DR)和成熟crRNA(例如成熟DR-间隔子,成熟DR-间隔子-成熟DR)。The term "RNA guide" as used herein refers to any RNA molecule that facilitates the targeting of a protein described in the invention to a target nucleic acid. Exemplary "RNA guides" include, but are not limited to, crRNA, pre-crRNA (eg, DR-spacer-DR), and mature crRNA (eg, mature DR-spacer, mature DR-spacer-mature DR).
如本发明中所用的术语“靶向”是指与不具有与靶核酸相同或相似序列的其他核酸相比,包括CRISPR相关蛋白和RNA指导物(如crRNA)的复合物优先或特异性结合到例如杂交到特定靶核酸的能力。The term "targeting" as used in the present invention refers to the preferential or specific binding of a complex including a CRISPR-associated protein and an RNA guide (such as crRNA) to other nucleic acids that do not have the same or similar sequence as the target nucleic acid. For example, the ability to hybridize to a specific target nucleic acid.
本发明中所用的术语“靶核酸”是指特定的核酸底物,其包含与RNA指导物中间隔子的全部或部分互补的核酸序列。在一些实施例中,靶核酸包含基因或基因内的序列。在一些实施例中,靶核酸包含非编码区(例如,启动子)。在一些实施例中,靶核酸是单链的或双链的。The term "target nucleic acid" as used in the present invention refers to a specific nucleic acid substrate that contains a nucleic acid sequence complementary to all or part of the spacer in the RNA guide. In some embodiments, the target nucleic acid comprises a gene or a sequence within a gene. In some embodiments, the target nucleic acid includes a non-coding region (eg, a promoter). In some embodiments, the target nucleic acid is single-stranded or double-stranded.
本说明书,使用“数值A~数值B”表示的数值范围是指包含端点数值A、B的范围。 In this specification, the numerical range represented by "numeric value A to numerical value B" refers to the range including the endpoint values A and B.
本发明中,使用“至少数值A”表示的数值是指包含大于、等于数值A的范围。In the present invention, the numerical value represented by "at least numerical value A" means a range including greater than or equal to numerical value A.
本发明中,使用“基本上”或“实质上”表示与理论模型或理论数据的标准偏差在5%、优选为3%、更优选为1%范围以内。In the present invention, the use of "substantially" or "substantially" means that the standard deviation from the theoretical model or theoretical data is within the range of 5%, preferably 3%, and more preferably 1%.
本发明中,使用“可以”表示的含义包括了进行某种处理以及不进行某种处理两方面的含义。In the present invention, the meaning expressed by "can" includes both the meaning of performing a certain process and not performing a certain process.
本发明中,“任选的”或“任选地”是指接下来描述的事件或情况可发生或可不发生,并且该描述包括该事件发生的情况和该事件不发生的情况。In the present invention, "optional" or "optionally" means that the next described event or situation may or may not occur, and the description includes situations where the event occurs and situations where the event does not occur.
本说明书中,所提及的“一些具体/优选的实施方案”、“另一些具体/优选的实施方案”、“实施方案”等是指所描述的与该实施方案有关的特定要素(例如,特征、结构、性质和/或特性)包括在此处所述的至少一种实施方案中,并且可存在于其它实施方案中或者可不存在于其它实施方案中。另外,应理解,所述要素可以任何合适的方式组合在各种实施方案中。In this specification, references to "some specific/preferred embodiments", "other specific/preferred embodiments", "implementations", etc. refer to the specific elements described related to the embodiment (for example, Features, structures, properties and/or characteristics) are included in at least one embodiment described herein and may or may not be present in other embodiments. Additionally, it is to be understood that the described elements may be combined in various embodiments in any suitable manner.
如本发明所使用的,术语“核酸”和“核酸分子”是指包含核碱基和酸性部分的化合物,例如核苷、核苷酸或核苷酸的聚合物。通常,聚合核酸,例如包含三个或更多个核苷酸的核酸分子是线性分子,其中相邻核苷酸通过磷酸二酯键相互连接。在一些实施方案中,“核酸”是指单个核酸残基(例如核苷酸和/或核苷)。在一些实施方案中,“核酸”是指包含三个或更多个单独核苷酸残基的寡核苷酸链。如本文所用,术语“寡核苷酸”和“多核苷酸”可互换使用以指核苷酸的聚合物(例如,至少三个核苷酸的串)。在一些实施方案中,“核酸”包括RNA以及单链和/或双链DNA。核酸可以是天然存在的,例如在基因组、转录物、mRNA、tRNA、rRNA、siRNA、snRNA、质粒、粘粒、染色体、染色单体或其他天然存在的核酸分子的上下文中。另一方面,核酸分子可以是非天然存在的分子,例如重组DNA或RNA、人工染色体、工程基因组或其片段,或合成的DNA、RNA、DNA/RNA杂交体、或包括非天然存在的核苷酸或核苷。此外,术语“核酸”、“DNA”、“RNA”和/或类似术语包括核酸类似物,例如具有除磷酸二酯骨架之外的其他骨架的类似物。核酸可以从天然来源纯化、使用重组表达系统产生和任选地纯化、化学合成等。在合适的情况下,例如在化学合成分子的情况下,核酸可以包含核苷类似物,例如具有化学修饰碱基的类似物或糖和骨架修饰。As used herein, the terms "nucleic acid" and "nucleic acid molecule" refer to a compound containing a nucleobase and an acidic moiety, such as a nucleoside, a nucleotide, or a polymer of nucleotides. Typically, polymeric nucleic acids, such as nucleic acid molecules containing three or more nucleotides, are linear molecules in which adjacent nucleotides are linked to each other by phosphodiester bonds. In some embodiments, "nucleic acid" refers to a single nucleic acid residue (eg, a nucleotide and/or nucleoside). In some embodiments, "nucleic acid" refers to an oligonucleotide chain comprising three or more individual nucleotide residues. As used herein, the terms "oligonucleotide" and "polynucleotide" are used interchangeably to refer to a polymer of nucleotides (eg, a string of at least three nucleotides). In some embodiments, "nucleic acid" includes RNA as well as single- and/or double-stranded DNA. Nucleic acids may be naturally occurring, for example, in the context of genomes, transcripts, mRNA, tRNA, rRNA, siRNA, snRNA, plasmids, cosmids, chromosomes, chromatids or other naturally occurring nucleic acid molecules. On the other hand, a nucleic acid molecule may be a non-naturally occurring molecule, such as a recombinant DNA or RNA, an artificial chromosome, an engineered genome or a fragment thereof, or a synthetic DNA, RNA, DNA/RNA hybrid, or include non-naturally occurring nucleotides or nucleosides. Furthermore, the terms "nucleic acid," "DNA," "RNA," and/or similar terms include nucleic acid analogs, for example, analogs having backbones other than a phosphodiester backbone. Nucleic acids can be purified from natural sources, produced and optionally purified using recombinant expression systems, chemically synthesized, and the like. Where appropriate, for example in the case of chemically synthesized molecules, the nucleic acid may comprise nucleoside analogs, for example analogs with chemically modified bases or sugars and backbone modifications.
如发明所使用的,术语“多肽”、“肽”和“蛋白质”在本文中互换地使用并且为任意长度的氨基酸聚合物。该聚合物可以是线形或分支的,它可以包含修饰的氨基酸,并且它可以由非氨基酸隔断。该术语也包括已经被修饰(例如,二硫键形成、糖基化、脂质化、乙酰化、磷酸化或任何其他操作,如以标记组分缀合)的氨基酸聚合物。As used herein, the terms "polypeptide," "peptide," and "protein" are used interchangeably herein and refer to a polymer of amino acids of any length. The polymer can be linear or branched, it can contain modified amino acids, and it can be interrupted by non-amino acids. The term also includes amino acid polymers that have been modified (eg, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component).
如本发明所使用的,“CRISPR-Cas效应子融合蛋白”是指包含来自至少两种不同蛋白质的蛋白质结构域的杂化多肽。一种蛋白质可位于CRISPR-Cas效应子融合蛋白的氨基-末端(N-末端,N端)部分或羧基-末端(C-末端,C端)蛋白处,因此分别形成“氨基-末端CRISPR-Cas效应子融合蛋白”或“羧基-末端CRISPR-Cas效应子融合蛋白”。本文提供的任何蛋白质可通过本领域已知的任何方法产生。例如,本文提供的蛋白质可经由重组蛋白质表达和纯化来产生,这尤其适合于包含肽接头的CRISPR-Cas效应子融合蛋白。用于重组蛋白质表达和纯化的方法是 公知的,并且包括以下中所述的那些,例如,可参见Green and Sambrook,Molecular Cloning:A Laboratory Manual(4th ed.,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,N.Y.(2012))。As used herein, "CRISPR-Cas effector fusion protein" refers to a hybrid polypeptide comprising protein domains from at least two different proteins. A protein can be located at the amino-terminal (N-terminal, N-terminal) portion or the carboxyl-terminal (C-terminal, C-terminal) portion of a CRISPR-Cas effector fusion protein, thus forming an "amino-terminal CRISPR-Cas" respectively Effector fusion protein" or "carboxy-terminal CRISPR-Cas effector fusion protein". Any protein provided herein can be produced by any method known in the art. For example, the proteins provided herein can be produced via recombinant protein expression and purification, which is particularly suitable for CRISPR-Cas effector fusion proteins containing peptide linkers. Methods used for recombinant protein expression and purification are Well known and include those described below, see, for example, Green and Sambrook, Molecular Cloning: A Laboratory Manual (4th ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (2012)).
本领域技术人员清楚,可以改变蛋白质的结构而不对其活性和功能性产生不利影响,例如,可以在蛋白质氨基酸序列中引入一个或多个保守性氨基酸取代,而不会对蛋白质分子的活性和/或三维结构产生不利影响。本领域技术人员清楚保守性氨基酸取代的实例以及实施方式。具体的说,可以用与待取代位点属于相同组的另一氨基酸残基取代该氨基酸残基,即用非极性氨基酸残基取代另一非极性氨基酸残基,用极性不带电荷的氨基酸残基取代另一极性不带电荷的氨基酸残基,用碱性氨基酸残基取代另一碱性氨基酸残基,和用酸性氨基酸残基取代另一酸性氨基酸残基。这样的取代的氨基酸残基可以是也可以不是由遗传密码编码的。只要取代不导致蛋白质生物活性的失活,则一种氨基酸被属于同组的其他氨基酸替换的保守取代落在本发明的范围内。因此,本发明涉及的蛋白可以在氨基酸序列中包含一个或多个保守性取代,只要该非保守取代不显著影响本发明的蛋白质的所需功能和生物活性即可。It is clear to those skilled in the art that the structure of a protein can be changed without adversely affecting its activity and functionality. For example, one or more conservative amino acid substitutions can be introduced in the amino acid sequence of the protein without affecting the activity and/or functionality of the protein molecule. Or adversely affect the three-dimensional structure. Examples and implementations of conservative amino acid substitutions will be apparent to those skilled in the art. Specifically, the amino acid residue can be replaced with another amino acid residue belonging to the same group as the site to be replaced, that is, a non-polar amino acid residue can be substituted for another non-polar amino acid residue, and a polar uncharged amino acid residue can be substituted. An amino acid residue is substituted for another polar uncharged amino acid residue, a basic amino acid residue is substituted for another basic amino acid residue, and an acidic amino acid residue is substituted for another acidic amino acid residue. Such substituted amino acid residues may or may not be encoded by the genetic code. Conservative substitutions in which one amino acid is replaced by another amino acid belonging to the same group fall within the scope of the invention as long as the substitution does not result in inactivation of the biological activity of the protein. Therefore, the protein involved in the present invention may contain one or more conservative substitutions in the amino acid sequence, as long as the non-conservative substitution does not significantly affect the desired function and biological activity of the protein of the invention.
保守氨基酸置换可以在一个或多个预测的非必需氨基酸残基处进行。“非必需”氨基酸残基是可以发生改变(缺失、取代或置换)而不改变生物活性的氨基酸残基,而“必需”氨基酸残基是生物活性所需的。“保守氨基酸置换”是其中氨基酸残基被具有类似侧链的氨基酸残基替代的置换。氨基酸置换可以在上述Cas蛋白的非保守区域中进行。一般而言,此类置换不对保守的氨基酸残基,或者不对位于保守基序内的氨基酸残基进行,其中此类残基是蛋白质活性所需的。然而,本领域技术人员应当理解,功能变体可以具有较少的在保守区域中的保守或非保守改变。Conservative amino acid substitutions can be made at one or more predicted non-essential amino acid residues. "Non-essential" amino acid residues are amino acid residues that can be altered (deletion, substitution or replacement) without altering biological activity, whereas "essential" amino acid residues are required for biological activity. A "conservative amino acid substitution" is a substitution in which an amino acid residue is replaced by an amino acid residue with a similar side chain. Amino acid substitutions can be made in non-conserved regions of the above-mentioned Cas proteins. Generally, such substitutions are not made to conserved amino acid residues, or to amino acid residues located within conserved motifs where such residues are required for protein activity. However, those skilled in the art will appreciate that functional variants may have fewer conservative or non-conservative changes in conserved regions.
如本发明所使用的,术语“CRISPR”是指成簇、规律间隔的短回文重复序列(Clustered regularly interspaced short palindromic repeats),其来自微生物的免疫系统。As used herein, the term "CRISPR" refers to clustered regularly interspaced short palindromic repeats (Clustered regularly interspaced short palindromic repeats), which are derived from the immune system of microorganisms.
如本发明所使用的,术语“靶序列”是指目标核酸中与crRNA互补或至少部分互补的核苷酸序列,Cas蛋白、crRNA与靶序列形成三元复合物后,Cas蛋白发挥对目标核酸中靶核酸链和/或非核苷酸链的特异性切割活性。在本公开中,“靶序列”与“靶核酸”、“靶多核苷酸”、“目标序列”、“目标核酸序列”可以互换地使用。As used in the present invention, the term "target sequence" refers to the nucleotide sequence in the target nucleic acid that is complementary or at least partially complementary to crRNA. After the Cas protein, crRNA and the target sequence form a ternary complex, the Cas protein exerts its influence on the target nucleic acid. Specific cleavage activity on target nucleic acid strands and/or non-nucleotide strands. In this disclosure, "target sequence" is used interchangeably with "target nucleic acid," "target polynucleotide," "target sequence," and "target nucleic acid sequence."
如本发明所使用的,术语“靶标链”(target strand)是指目标核酸中与crRNA杂交的核苷酸链;术语“非靶标链”(non-target strand)是指目标核酸中与crRNA不发生杂交配对的核苷酸链。As used in the present invention, the term "target strand" refers to the nucleotide strand in the target nucleic acid that hybridizes to crRNA; the term "non-target strand" refers to the nucleotide strand in the target nucleic acid that does not hybridize with crRNA. The nucleotide chains in which hybridization pairing occurs.
如本发明所使用的,术语“脱氨酶”或“脱氨酶结构域”是指催化脱氨反应的蛋白质或酶。在一些实施方案中,脱氨酶是腺苷脱氨酶,其催化腺嘌呤水解脱氨为次黄嘌呤。在一些实施方案中,脱氨酶是腺苷脱氨酶,其催化腺苷或腺嘌呤(A)水解脱氨为肌苷(I)。As used herein, the term "deaminase" or "deaminase domain" refers to a protein or enzyme that catalyzes a deamination reaction. In some embodiments, the deaminase is an adenosine deaminase, which catalyzes the hydrolytic deamination of adenine to hypoxanthine. In some embodiments, the deaminase is an adenosine deaminase, which catalyzes the hydrolytic deamination of adenosine or adenine (A) to inosine (I).
如本公开所使用的,“碱基编辑器(Base Editor,BE)”或“核碱基编辑器”是指结合多核苷酸并具有核碱基修饰活性的试剂。在各种实施方案中,碱基编辑器包含核碱基修饰多肽(例如,脱氨酶)和与引导多核苷酸(例如,引导RNA)结合的核酸可编程核苷酸结合结构域(例 如,核酸可编程DNA结合蛋白)。在各种实施方案中,所述试剂是包含具有碱基编辑活性的蛋白质结构域的生物分子复合物,即能够修饰核酸分子(例如,DNA、RNA)内的碱基(例如,A、T、C、G或U)。在一些实施方案中,所述多核苷酸可编程DNA结合结构域与脱氨酶结构域融合或连接。在一个实施方案中,所述试剂是包含具有碱基编辑活性的结构域的CRISPR-Cas效应子融合蛋白。在一些实施方案中,具有碱基编辑活性的结构域能够使核酸分子内的碱基脱氨基。在一些实施方案中,所述碱基编辑器能够使DNA分子内的一个或多个碱基脱氨基。在一些实施方案中,所述碱基编辑器是腺苷碱基编辑器(ABE)。As used in this disclosure, "Base Editor (BE)" or "nucleobase editor" refers to a reagent that binds to a polynucleotide and has nucleobase modifying activity. In various embodiments, a base editor comprises a nucleobase modifying polypeptide (e.g., a deaminase) and a nucleic acid programmable nucleotide binding domain (e.g., a guide RNA) that binds to a guide polynucleotide (e.g., a guide RNA) For example, nucleic acid-programmable DNA-binding proteins). In various embodiments, the agent is a biomolecular complex comprising a protein domain having base editing activity, i.e., capable of modifying bases (e.g., A, T, C, G or U). In some embodiments, the polynucleotide programmable DNA binding domain is fused or linked to a deaminase domain. In one embodiment, the agent is a CRISPR-Cas effector fusion protein comprising a domain with base editing activity. In some embodiments, a domain with base editing activity is capable of deaminating bases within a nucleic acid molecule. In some embodiments, the base editor is capable of deaminating one or more bases within a DNA molecule. In some embodiments, the base editor is an adenosine base editor (ABE).
如本文可互换使用的术语“编码序列”或“蛋白质编码序列”是指编码蛋白质的多核苷酸片段。该区域或序列在靠近5′端的地方有一个起始密码子,在靠近3′端的地方有一个终止密码子。编码序列也可称为开放阅读框。The terms "coding sequence" or "protein coding sequence" as used interchangeably herein refer to a polynucleotide fragment that encodes a protein. This region or sequence has a start codon near the 5' end and a stop codon near the 3' end. The coding sequence may also be called an open reading frame.
术语“核定位序列”、“核定位信号(NuclearLocalization Signal,NLS)”是指促进蛋白质输入细胞核的氨基酸序列。核定位序列是本领域已知的并且描述于例如Plank等人的国际PCT申请,PCT/EP2000/011690,2000年11月23日提交,2001年5月31日作为WO/2001/038547公布,其内容以引用方式并入本文以用于它们对示例性核定位序列的公开。在一些实施方案中,所述NLS是优化的NLS,例如由Koblan等人,Nature Biotech.2018doi:10.1038/nbt.4172所描述。The terms "nuclear localization sequence" and "nuclear localization signal (NuclearLocalization Signal, NLS)" refer to the amino acid sequence that promotes protein import into the nucleus. Nuclear localization sequences are known in the art and are described, for example, in Plank et al., International PCT application, PCT/EP2000/011690, filed on November 23, 2000, published as WO/2001/038547 on May 31, 2001, which The contents are incorporated herein by reference for their disclosure of exemplary nuclear localization sequences. In some embodiments, the NLS is an optimized NLS, such as described by Koblan et al., Nature Biotech. 2018 doi: 10.1038/nbt.4172.
如本公开所使用的,术语“互补的”或“杂交的”用于指与碱基配对规则相关的“多核苷酸”和“寡核苷酸”(它们是可互换的术语,指的是核苷酸序列)。例如,序列“CAGT”与序列“GTCA”互补。互补可以是“部分的”或“全部的”。“部分”互补是指一个或多个核酸碱基根据碱基配对规则错配,核酸之间的“全部”或“完全”互补是指每个核酸碱基在碱基配对下均与另一个碱基匹配规则。核酸链之间的互补程度对核酸链之间的杂交效率和强度具有重要影响。这在扩增反应以及取决于核酸之间结合的检测方法中特别重要。As used in this disclosure, the terms "complementary" or "hybrid" are used to refer to "polynucleotide" and "oligonucleotide" (which are interchangeable terms referring to is the nucleotide sequence). For example, the sequence "CAGT" is complementary to the sequence "GTCA". Complementarity can be "partial" or "total." "Partial" complementarity means that one or more nucleic acid bases are mismatched according to the base pairing rules. "Total" or "complete" complementarity between nucleic acids means that each nucleic acid base is matched with another base under base pairing. Base matching rules. The degree of complementarity between nucleic acid strands has an important impact on the efficiency and strength of hybridization between nucleic acid strands. This is particularly important in amplification reactions and detection methods that depend on binding between nucleic acids.
如本文所用,术语“杂交”是指使用核酸链通过碱基配对与互补链结合以形成杂交复合物的任何过程来配对互补核酸。As used herein, the term "hybridization" refers to any process that uses a nucleic acid strand to combine with a complementary strand through base pairing to form a hybridization complex to pair complementary nucleic acids.
如本公开所使用的,术语“核酸序列”和“核苷酸序列”是指寡核苷酸或多核苷酸及其片段或部分,并且是指可以是单链或双链的基因组或合成来源的DNA或RNA,和代表有义或反义链。As used in this disclosure, the terms "nucleic acid sequence" and "nucleotide sequence" refer to oligonucleotides or polynucleotides, fragments or portions thereof, and refer to genomic or synthetic sources that may be single- or double-stranded of DNA or RNA, and represents the sense or antisense strand.
如本公开所使用的,术语“序列同一性”和“同一性百分比”指两个或更多个多核苷酸或多肽之间相同(即同一)的核苷酸或氨基酸的百分比。两个或更多个多核苷酸或多肽之间的序列同一性可通过以下方法测定:将多核苷酸或多肽的核苷酸或氨基酸序列对准且经对准的多核苷酸或多肽中含有相同核苷酸或氨基酸残基的位置数目进行评分,将其与经对准的多核苷酸或多肽中含有不同核苷酸或氨基酸残基的位置数目进行比较。多核苷酸可例如通过含有不同核苷酸(即取代或突变)或缺失核苷酸(即一个或两个多核苷酸中的核苷酸插入或核苷酸缺失)而在一个位置处不同。多肽可例如通过含有不同氨基酸(即取代或突变)或缺失氨基酸(即一个或两个多肽中的氨基酸插入或氨基酸缺失)而在一个位置处不同。序列同一性可通 过用含有相同核苷酸或氨基酸残基的位置数目除以多核苷酸或多肽中氨基酸残基的总数来计算。举例而言,可通过用含有相同核苷酸或氨基酸残基的位置数目除以多核苷酸或多肽中核苷酸或氨基酸残基的总数且乘以100来计算同一性百分比。As used in this disclosure, the terms "sequence identity" and "percent identity" refer to the percentage of nucleotides or amino acids that are identical (i.e., identical) between two or more polynucleotides or polypeptides. Sequence identity between two or more polynucleotides or polypeptides can be determined by aligning the nucleotide or amino acid sequences of the polynucleotides or polypeptides and the aligned polynucleotides or polypeptides containing The number of positions with identical nucleotides or amino acid residues is scored and compared to the number of positions in the aligned polynucleotide or polypeptide that contain different nucleotides or amino acid residues. Polynucleotides may differ at one position, for example, by containing different nucleotides (ie, substitutions or mutations) or missing nucleotides (ie, nucleotide insertions or nucleotide deletions in one or both polynucleotides). Polypeptides may differ at one position, for example, by containing a different amino acid (ie, a substitution or mutation) or a missing amino acid (ie, an amino acid insertion or amino acid deletion in one or both polypeptides). Passable sequence identity Calculated by dividing the number of positions containing the same nucleotide or amino acid residue by the total number of amino acid residues in the polynucleotide or polypeptide. For example, percent identity can be calculated by dividing the number of positions containing the same nucleotide or amino acid residue by the total number of nucleotides or amino acid residues in the polynucleotide or polypeptide and multiplying by 100.
示例性的,当使用序列比较算法或通过目视检查测量以最大的对应性进行比较和比对时,两个或多个序列或子序列具有至少80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%核苷酸的“序列同一性”或“同一性百分比”。在某些实施方案中,所述序列在任一或两个相比较的生物聚合物(例如,多核苷酸)的整个长度上基本相同。Illustratively, two or more sequences or subsequences have at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleotides "sequence identity" or "percent identity". In certain embodiments, the sequences are substantially identical throughout the entire length of either or both compared biopolymers (eg, polynucleotides).
术语“载体”是指将核酸序列引入细胞中从而产生转化细胞的手段。载体包括质粒、转座子、噬菌体、病毒、脂质体和附加体。“重组载体”、“表达载体”是包含待在受体细胞中表达的核苷酸序列的核酸序列。表达载体可以包括额外的核酸序列以促进和/或促进引入序列的表达,例如起始、终止、增强子、启动子和分泌序列。The term "vector" refers to a means of introducing a nucleic acid sequence into a cell thereby producing a transformed cell. Vectors include plasmids, transposons, phages, viruses, liposomes, and episomes. A "recombinant vector" or "expression vector" is a nucleic acid sequence containing a nucleotide sequence to be expressed in a recipient cell. Expression vectors may include additional nucleic acid sequences to facilitate and/or facilitate expression of introduced sequences, such as initiation, termination, enhancers, promoters, and secretion sequences.
如本公开所使用的,术语“个体”和“受试者”可互换地使用,是指哺乳动物。哺乳动物包括但不限于驯化动物(例如,奶牛、绵羊、猫、犬和马)、灵长类(例如,人和非人灵长类如猴)、兔和啮齿类(例如,小鼠和大鼠)。特别地,个体是人。As used in this disclosure, the terms "individual" and "subject" are used interchangeably and refer to mammals. Mammals include, but are not limited to, domesticated animals (e.g., cows, sheep, cats, dogs, and horses), primates (e.g., humans and non-human primates such as monkeys), rabbits, and rodents (e.g., mice and rats). mouse). In particular, individuals are people.
本文公开的方法可以在体外、离体、或体内进行,或者产品可以以体外、离体、或体内形式存在。术语“体外”是指在实验室条件或培养液中使用材料、生物物质、细胞和/或组织的实验;而术语“体内”是指使用完整多细胞有机体的实验和工序。在一些实施方案中,体内进行的方法可以在非人动物上进行。“离体”是指存在于有机体外或发生在有机体外,例如在人或动物体外的事件,例如可以在取自有机体的组织(例如整个器官)或细胞上存在或发生的事件。The methods disclosed herein may be performed in vitro, ex vivo, or in vivo, or the products may be present in in vitro, ex vivo, or in vivo forms. The term "in vitro" refers to experiments using materials, biological substances, cells and/or tissues in laboratory conditions or culture media; whereas the term "in vivo" refers to experiments and procedures using intact multicellular organisms. In some embodiments, methods performed in vivo can be performed on non-human animals. "Ex vivo" refers to an event that exists or occurs outside an organism, such as outside a human or animal body, such as an event that may exist or occur on tissue (eg, a whole organ) or cells taken from an organism.
如本公开所使用的,术语“药学上可接受的载体”是指药学上可接受的材料、组合物或媒介物,例如液体或固体填充剂、稀释剂、赋形剂、制造助剂(例如,润滑剂、滑石粉、硬脂酸镁、钙或锌或硬脂酸)或溶剂包封材料,涉及将化合物从身体的一个部位(例如,递送部位)运送或运输到另一个部位(例如,器官、组织或身体的一部分)。药学上可接受的载体是“可接受的”,意思是与制剂的其他成分相容并且对受试者的组织无害(例如,生理学相容的、无菌的、生理学的pH等)。可以充当药学上可接受的载体的材料的一些实例包括:(1)糖,例如乳糖、葡萄糖和蔗糖;(2)淀粉,如玉米淀粉和马铃薯淀粉;(3)纤维素及其衍生物,如羧甲基纤维素钠、甲基纤维素、乙基纤维素、微晶纤维素和醋酸纤维素;(4)粉末黄蓍胶;(5)麦芽;(6)明胶;(7)润滑剂,如硬脂酸镁、十二烷基硫酸钠和滑石粉;(8)赋形剂,如可可脂和栓剂蜡;(9)油,如花生油、棉籽油、红花油、芝麻油、橄榄油、玉米油和豆油;(10)二醇,如丙二醇;(11)多元醇,如甘油、山梨糖醇、甘露醇和聚乙二醇(PEG);(12)酯类,如油酸乙酯和月桂酸乙酯;(13)琼脂;(14)缓冲剂,如氢氧化镁和氢氧化铝;(15)海藻酸;(16)无热原水;(17)等渗盐水;(18)林格氏液;(19)乙醇;(20)pH缓冲溶液;(21)聚酯,聚碳酸酯和/或聚酸酐;(22)增量剂(bulking agent),如多肽和氨基酸(23)血清成分,如血清白蛋白、高密度脂蛋白(high density lipoprotein,HDL)和低密度脂蛋白(low density lipoprotein,LDL);(22)C2-C12醇,如乙醇;和(23)药物制剂中采用的其他无毒相容物质。润湿剂、着色剂、脱模剂、包衣剂、甜味剂、调味剂、芳香 剂、防腐剂和抗氧化剂也可以存在于制剂中。诸如“赋形剂”、“药学上可接受的载体”等术语在本文中可互换使用。As used in this disclosure, the term "pharmaceutically acceptable carrier" refers to a pharmaceutically acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, manufacturing aid (e.g. , lubricants, talc, magnesium stearate, calcium or zinc or stearic acid) or solvent encapsulated materials involving the delivery or transport of compounds from one site of the body (e.g., delivery site) to another site (e.g., organ, tissue or body part). A pharmaceutically acceptable carrier is "acceptable" in the sense of being compatible with the other ingredients of the formulation and not deleterious to the tissues of the subject (eg, physiologically compatible, sterile, physiological pH, etc.). Some examples of materials that can serve as pharmaceutically acceptable carriers include: (1) sugars, such as lactose, glucose, and sucrose; (2) starches, such as corn starch and potato starch; (3) cellulose and its derivatives, such as Sodium carboxymethylcellulose, methylcellulose, ethylcellulose, microcrystalline cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) lubricant, Such as magnesium stearate, sodium lauryl sulfate and talc; (8) Excipients, such as cocoa butter and suppository wax; (9) Oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, Corn oil and soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerol, sorbitol, mannitol, and polyethylene glycol (PEG); (12) esters, such as ethyl oleate and laurel Ethyl acid; (13) Agar; (14) Buffers, such as magnesium hydroxide and aluminum hydroxide; (15) Alginic acid; (16) Pyrogen-free water; (17) Isotonic saline; (18) Ringer's solution; (19) ethanol; (20) pH buffer solution; (21) polyester, polycarbonate and/or polyanhydride; (22) bulking agent, such as peptides and amino acids (23) serum components, Such as serum albumin, high density lipoprotein (HDL) and low density lipoprotein (LDL); (22) C2-C12 alcohols, such as ethanol; and (23) others used in pharmaceutical preparations Non-toxic and compatible substances. Wetting agents, colorants, release agents, coating agents, sweeteners, flavorings, aroma Agents, preservatives and antioxidants may also be present in the formulation. Terms such as "excipient,""pharmaceutically acceptable carrier," and the like are used interchangeably herein.
如本文所用,术语“有效量”是指足以引起期望的生物学反应的生物活性剂的量。例如,在一些实施方案中,碱基编辑器的有效量可以指足以诱导由碱基编辑器突变的特异性结合的靶位点的突变的碱基编辑器的量。如本领域技术人员将理解的,试剂,例如碱基编辑器CRISPR-Cas效应子融合蛋白、脱氨酶、多核苷酸等的有效量可以随各种因素而变化,例如随期望的生物学反应,例如随待编辑的特定等位基因、基因组或靶位点,随靶定的细胞或组织和使用的试剂而变化。As used herein, the term "effective amount" refers to an amount of biologically active agent sufficient to elicit the desired biological response. For example, in some embodiments, an effective amount of a base editor may refer to an amount of base editor sufficient to induce mutation of a target site that is specifically bound by the mutation of the base editor. As those skilled in the art will appreciate, effective amounts of reagents, such as base editor CRISPR-Cas effector fusion proteins, deaminases, polynucleotides, etc., can vary depending on various factors, such as the desired biological response. , vary, for example, with the specific allele, genome or target site to be edited, with the cells or tissues targeted and the reagents used.
术语“治疗”、“处理”是指如本文所述旨在逆转、缓解疾病或病症或其一种或多种症状、延迟疾病或病症或其一种或多种症状的发作或抑制疾病或病症或其一种或多种症状进展的临床干预。如本文所用,术语“治疗”、“处理”是指如本文所述旨在逆转、缓解疾病或病症或其一种或多种症状、延迟疾病或病症或其一种或多种症状的发作或抑制疾病或病症或其一种或多种症状进展的临床干预。在一些实施方案中,可以在一种或多种症状已经得以形成之后和/或疾病已经得到诊断之后施用治疗。在其他实施方案中,可以在没有症状的情况下施用治疗,例如用于预防或延迟症状的发作或抑制疾病的发作或进展。例如,可以在症状发作之前(例如,鉴于症状的历史和/或鉴于遗传或其他易感性因素)施用治疗于易感个体。治疗也可以在症状消退后继续进行,例如以预防或延迟其复发。The terms "treatment" and "treatment" mean, as described herein, intended to reverse, alleviate a disease or condition or one or more symptoms thereof, delay the onset of a disease or condition or one or more symptoms thereof, or inhibit a disease or condition or clinical intervention for the progression of one or more of its symptoms. As used herein, the terms "treatment" and "treatment" mean, as described herein, intended to reverse, alleviate a disease or condition or one or more symptoms thereof, delay the onset of a disease or condition or one or more symptoms thereof, or Clinical intervention that inhibits the progression of a disease or condition or one or more of its symptoms. In some embodiments, treatment may be administered after one or more symptoms have developed and/or the disease has been diagnosed. In other embodiments, treatment may be administered in the absence of symptoms, for example, to prevent or delay the onset of symptoms or to inhibit the onset or progression of a disease. For example, the treatment may be administered to a susceptible individual prior to the onset of symptoms (eg, in view of a history of symptoms and/or in view of genetic or other susceptibility factors). Treatment can also be continued after symptoms have subsided, for example to prevent or delay their recurrence.
术语“受试者”包括人和非人动物。非人动物包括脊椎动物,例如哺乳动物和非哺乳动物,例如非人灵长类、绵羊、猫、马、牛、鸡、狗、小鼠、大鼠、山羊、兔和猪。优选地,受试者是人。除非指出时,否则术语“患者”或“受试者”在本文中可互换地使用。The term "subject" includes humans and non-human animals. Non-human animals include vertebrates, such as mammals, and non-mammals, such as non-human primates, sheep, cats, horses, cattle, chickens, dogs, mice, rats, goats, rabbits, and pigs. Preferably, the subject is human. Unless otherwise indicated, the terms "patient" or "subject" are used interchangeably herein.
如背景技术所提到的,现有技术中的CRISPR/Cas系统切割效率较低。因而,在本申请中发明人尝试开发新型Cas蛋白和CRISPR/Cas系统,从而丰富CRISPR/Cas系统,提高切割效率和靶向性,以适应在实际使用中的需求。因而提出了本申请的一系列保护方案。As mentioned in the background, the cutting efficiency of CRISPR/Cas systems in the prior art is low. Therefore, in this application, the inventor attempts to develop new Cas proteins and CRISPR/Cas systems to enrich the CRISPR/Cas system and improve cutting efficiency and targeting to meet the needs of actual use. Therefore, a series of protection schemes of this application are proposed.
在本申请第一种典型的实施方式中,提供了一种CRISPR-Cas效应子蛋白,该CRISPR-Cas效应子蛋白包括与SEQ ID NO:1至5中任一项的氨基酸序列具有至少70%同一性的蛋白(例如,70%,71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%)。In a first typical embodiment of the present application, a CRISPR-Cas effector protein is provided, which CRISPR-Cas effector protein includes at least 70% of the amino acid sequence of any one of SEQ ID NO: 1 to 5. Identity proteins (e.g., 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84 %, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%) .
在一种优选的实施例中,CRISPR-Cas效应子蛋白包括与SEQ ID NO:1至5中任一项的氨基酸序列具有80%以上,优选90%以上,更优选95%以上,进一步优选99%以上同一性的蛋白;优选地,CRISPR-Cas效应子蛋白包括RuvC结构域。In a preferred embodiment, the CRISPR-Cas effector protein includes more than 80%, preferably more than 90%, more preferably more than 95%, further preferably more than 99% of the amino acid sequence of any one of SEQ ID NO: 1 to 5. % or above identity; preferably, the CRISPR-Cas effector protein includes a RuvC domain.
在一种优选的实施例中,CRISPR-Cas效应子蛋白包括:In a preferred embodiment, the CRISPR-Cas effector protein includes:
a)SEQ ID NO:1至5中任一项所示的蛋白;或a) The protein shown in any one of SEQ ID NO: 1 to 5; or
b)以SEQ ID NO:1所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:N21X, N23X,R25X,K26X,Q482X,S484X,R486X,S489X,R493X,H511X,C513X,H515X,N516X,R518X,R540X,K558X,Y560X,K562X,K565X,T600X,T672X,D676X,Q680X,Y683X,L686X,D693X,Y731X,G767X,R772X,K832X,K833X,Q836X,M896X;或b) Based on the amino acid sequence shown in SEQ ID NO: 1, a protein with one or more point mutations as follows: N21X, N23X, R25X, K26X, Q482X, S484X, R486X, S489X, R493X, H511X, C513X, H515X, N516X, R518X, R540X, K558X, Y560X, K562X, K565X, T600X, T672X, D676X, Q 680X, Y683X, L686X, D693X, Y731X, G767X, R772X, K832X, K833X, Q836X, M896X; or
c)以SEQ ID NO:2所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:R19X,R28X,R32X,K512X,N527X,W531X,R553X,K581X,K589X,I590X,R605X,K611X,R612X,R615X,Y777X,E877X,R931X;或c) A protein based on the amino acid sequence shown in SEQ ID NO: 2 with one or more of the following point mutations: R19X, R28X, R32X, K512X, N527X, W531X, R553X, K581X, K589X, I590X, R605X, K611X, R612X, R615X, Y777X, E877X, R931X; or
d)以SEQ ID NO:3所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:K8X,F15X,N17X,K20X,K471X,W483X,H502X,R505X,K557X,K556X,R560X,Y673X,L676X,Y723X,N822X,K823X,E826X,K827X,K830X,K880X,L887X;或d) Based on the amino acid sequence shown in SEQ ID NO: 3, a protein with one or more point mutations as follows: K8X, F15X, N17X, K20X, K471X, W483X, H502X, R505X, K557X, K556X, R560X, Y673X, L676X, Y723X, N822X, K823X, E826X, K827X, K830X, K880X, L887X; or
e)以SEQ ID NO:4所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:K317X,W330X,Y351X,K354X,D392X,F395X,N399X,Y509X,V512X,Y568X,N662X,K663X,E666X,R667X,K670X,K719X,L726X;或e) Based on the amino acid sequence shown in SEQ ID NO: 4, a protein with one or more point mutations as follows: K317X, W330X, Y351X, K354X, D392X, F395X, N399X, Y509X, V512X, Y568X, N662X, K663X, E666X, R667X, K670X, K719X, L726X; or
f)以SEQ ID NO:5所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:M9X,V16X,D18X,K21X,K518X,W531X,F550X,K553X,R609X,Y612X,R616X,Y730X,L733X,Y781X,N879X,K880X,E883X,K884X,K887X,K936X,F943X;其中X为任意氨基酸。f) Based on the amino acid sequence shown in SEQ ID NO: 5, a protein with one or more point mutations as follows: M9X, V16X, D18X, K21X, K518X, W531X, F550X, K553X, R609X, Y612X, R616X, Y730X, L733X, Y781X, N879X, K880X, E883X, K884X, K887X, K936X, F943X; where X is any amino acid.
上述CRISPR-Cas效应子蛋白具有剪切活性,能够特异性或非特异性剪切核苷酸链,实现CRISPR-Cas系统的活性。上述同一性限定范围内的同源蛋白和点突变蛋白,相较于SEQ ID NO:1至5中任一项所示的蛋白,氨基酸的改变可以发生在蛋白活性位点或非活性位点处,包括RuvC结构域内或结构域外。氨基酸改变得到的蛋白,仍具有CRISPR-Cas效应子蛋白的剪切活性。The above-mentioned CRISPR-Cas effector protein has cutting activity and can specifically or non-specifically cut nucleotide chains to achieve the activity of the CRISPR-Cas system. For homologous proteins and point mutant proteins within the above-mentioned identity limits, compared to the protein shown in any one of SEQ ID NO: 1 to 5, amino acid changes can occur at the active site or inactive site of the protein. , including within or outside the RuvC domain. The protein obtained by changing the amino acid still has the cleavage activity of CRISPR-Cas effector protein.
在本申请第二种典型的实施方式中,提供了一种CRISPR-Cas效应子融合蛋白,该CRISPR-Cas效应子融合蛋白包括上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子蛋白的衍生物或CRISPR-Cas效应子蛋白的功能片段,以及异源功能结构域。In the second typical embodiment of the present application, a CRISPR-Cas effector fusion protein is provided. The CRISPR-Cas effector fusion protein includes the above-mentioned CRISPR-Cas effector protein, or a derivative of the CRISPR-Cas effector protein. or functional fragments of CRISPR-Cas effector proteins, as well as heterologous functional domains.
功能片段的序列少于全长序列但保留了上述CRISPR-Cas效应子蛋白的切割功能,功能片段中的缺失残基可以在N末端、C末端和/或内部。衍生物是指与上述CRISPR-Cas效应子蛋白至少约80%的序列同一性,且拥有至少一种相同功能,例如与一个包含至少一个DR序列的crRNA结合并形成复合物的能力。衍生物形成的原因包括但不限于保守氨基酸残基取代。The sequence of the functional fragment is less than the full-length sequence but retains the cleavage function of the above-mentioned CRISPR-Cas effector protein. The missing residues in the functional fragment can be at the N-terminal, C-terminal and/or internal. Derivatives refer to at least about 80% sequence identity with the above-mentioned CRISPR-Cas effector protein, and possess at least one of the same functions, such as the ability to bind to and form a complex with a crRNA containing at least one DR sequence. Reasons for derivative formation include, but are not limited to, conservative amino acid residue substitutions.
通过向具有剪切活性的CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子蛋白的衍生物、或CRISPR-Cas效应子蛋白的功能片段上,融合异源功能结构域,获得CRISPR-Cas效应子融合蛋白,能够在正常发挥剪切活性的基础上,具有异源功能结构域的活性。在具体使用中,可以灵活选择现有技术中的异源功能结构域,增加该融合蛋白的功能。异源功能结构域与CRISPR-Cas效应子蛋白(包括衍生物和功能片段)的连接,可灵活选用现有技术中不同长度 的融合蛋白接头(Linker),也可不利用Linker而直接相连,均不影响CRISPR-Cas效应子蛋白(包括衍生物和功能片段)和异源功能结构域的活性发挥。CRISPR-Cas effectors are obtained by fusing heterologous functional domains to CRISPR-Cas effector proteins with cleavage activity, or derivatives of CRISPR-Cas effector proteins, or functional fragments of CRISPR-Cas effector proteins. Fusion proteins can have the activity of heterologous functional domains on the basis of normal cleavage activity. In specific use, heterologous functional domains in the existing technology can be flexibly selected to increase the function of the fusion protein. The connection of heterologous functional domains to CRISPR-Cas effector proteins (including derivatives and functional fragments) can be flexibly selected using different lengths in existing technologies The fusion protein linker (Linker) can also be directly connected without using a Linker, which will not affect the activity of CRISPR-Cas effector proteins (including derivatives and functional fragments) and heterologous functional domains.
在一种优选的实施例中,异源功能结构域位于CRISPR-Cas效应子融合蛋白的N端、C端或内部;优选地,异源功能结构域包括但不限于定位信号、报告蛋白、CRISPR-Cas效应子蛋白靶向部分、DNA结合域、表位标签、转录激活域、转录抑制域、核酸酶、脱氨结构域、甲基化酶、脱甲基酶、转录释放因子、HDAC(组蛋白去乙酰化酶)、裂解活性多肽、连接酶中的一种或多种;优选地,定位信号包括但不限于核定位信号和/或核输出信号;优选地,核输出信号包括但不限于人类蛋白酪氨酸激酶2;优选地,报告蛋白包括但不限于谷胱甘肽-S-转移酶、辣根过氧化物酶、氯霉素乙酰转移酶、β-半乳糖苷酶、β-葡糖醛酸糖苷酶、或荧光蛋白中的一种或多种;优选地,荧光蛋白包括但不限于但不限于绿色荧光蛋白、HcRed、DsRed、青荧光蛋白、黄色荧光蛋白或蓝色荧光蛋白中的一种或多种;优选地,DNA结合域包括但不限于甲基化结合蛋白、LexA DBD(LexA蛋白的DNA结合结构域)或Gal4DBD(GAL4蛋白的DNA结合结构域)中的一种或多种;优选地,表位标签包括但不限于组氨酸标签、V5标签、FLAG标签、流感病毒血凝素标签、Myc标签、VSV-G标签或硫氧还蛋白标签中的一种或多种;优选地,转录激活域包括但不限于VP64和/或VPR;优选地,转录抑制域包括但不限于KRAB和/或SID;优选地,核酸酶包括但不限于FokI;优选地,脱氨结构域包括但不限于ADAR1、ADAR2、APOBEC、AID或TAD中的一种或多种;优选地,裂解活性多肽包括但不限于具有单链RNA裂解活性的多肽、具有双链RNA裂解活性的多肽、具有单链DNA裂解活性的多肽或具有双链DNA裂解活性的多肽;优选地,连接酶包括但不限于DNA连接酶和/或RNA连接酶。In a preferred embodiment, the heterologous functional domain is located at the N-terminal, C-terminal or internal part of the CRISPR-Cas effector fusion protein; preferably, the heterologous functional domain includes but is not limited to positioning signal, reporter protein, CRISPR -Cas effector protein targeting portion, DNA binding domain, epitope tag, transcription activation domain, transcription repression domain, nuclease, deamination domain, methylase, demethylase, transcription release factor, HDAC (group One or more of protein deacetylase), cleavage active polypeptide, ligase; Preferably, the localization signal includes but is not limited to nuclear localization signal and/or nuclear export signal; Preferably, the nuclear export signal includes but is not limited to Human protein tyrosine kinase 2; preferably, the reporter protein includes but is not limited to glutathione-S-transferase, horseradish peroxidase, chloramphenicol acetyltransferase, β-galactosidase, β- Glucuronidase, or one or more of fluorescent proteins; preferably, the fluorescent proteins include but are not limited to, but are not limited to, green fluorescent protein, HcRed, DsRed, cyan fluorescent protein, yellow fluorescent protein or blue fluorescent protein One or more of them; preferably, the DNA binding domain includes but is not limited to one of methylation binding proteins, LexA DBD (DNA binding domain of LexA protein) or Gal4DBD (DNA binding domain of GAL4 protein) Or more; preferably, the epitope tag includes but is not limited to one of histidine tag, V5 tag, FLAG tag, influenza virus hemagglutinin tag, Myc tag, VSV-G tag or thioredoxin tag or A variety of; Preferably, the transcription activation domain includes but is not limited to VP64 and/or VPR; Preferably, the transcription repression domain includes but is not limited to KRAB and/or SID; Preferably, the nuclease includes but is not limited to FokI; Preferably, De The amino domain includes but is not limited to one or more of ADAR1, ADAR2, APOBEC, AID or TAD; preferably, the cleavage active polypeptide includes but is not limited to polypeptides with single-stranded RNA cleavage activity, polypeptides with double-stranded RNA cleavage activity polypeptide, a polypeptide with single-stranded DNA cleavage activity or a polypeptide with double-stranded DNA cleavage activity; preferably, the ligase includes but is not limited to DNA ligase and/or RNA ligase.
灵活选用上述异源功能结构域,能够实现融合蛋白的多种功能与活性。上述脱氨结构域包括脱氨酶或脱氨酶的功能片段,将上述脱氨结构域与利用上述异源功能结构域与CRISPR-Cas效应子蛋白融合,能够实现碱基编辑器的效果。包含核定位信号的融合蛋白,能够能与入核载体相互作用,使蛋白能被运进细胞核。DNA结合域(DBD)能够识别特定的DNA序列,从而提高融合蛋白的靶向性。Flexible selection of the above heterologous functional domains can achieve multiple functions and activities of the fusion protein. The above-mentioned deamination domain includes a deaminase or a functional fragment of a deaminase. The effect of a base editor can be achieved by fusing the above-mentioned deamination domain with the above-mentioned heterologous functional domain and CRISPR-Cas effector protein. Fusion proteins containing nuclear localization signals can interact with nuclear import carriers so that proteins can be transported into the nucleus. The DNA binding domain (DBD) can recognize specific DNA sequences, thereby improving the targeting of the fusion protein.
在本申请第三种典型的实施方式中,提供了一种DNA分子,该DNA分子编码上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子融合蛋白。In the third typical embodiment of the present application, a DNA molecule encoding the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is provided.
在一种优选的实施例中,DNA分子为根据宿主细胞的密码子偏好性进行密码子优化的DNA分子;优选地,宿主细胞包括原核细胞或真核细胞;优选地,DNA分子包括与SEQ ID NO:6至10中任一项的核苷酸序列具有70%以上(例如,70%,71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%),优选90%以上,更优选95%以上,进一步优选99%,更进一步优选为100%同一性的核苷酸。In a preferred embodiment, the DNA molecule is a DNA molecule that is codon-optimized according to the codon preference of the host cell; preferably, the host cell includes a prokaryotic cell or a eukaryotic cell; preferably, the DNA molecule includes SEQ ID The nucleotide sequence of any one of NO: 6 to 10 has more than 70% (for example, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79% , 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96 %, 97%, 98%, 99% or 100%), preferably 90% or more, more preferably 95% or more, further preferably 99%, and still more preferably 100% identical nucleotides.
上述DNA分子,包括分离的DNA分子。能够通过转录、翻译,编码上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子融合蛋白。上述DNA分子包括单链或双链DNA,均能够携带遗传信息,从而实现编码蛋白的作用。根据DNA分子所处的、或编码发生的宿主细胞的 密码子偏好性,灵活进行密码子优化,能够实现DNA分子的高效表达。上述SEQ ID NO:6至10所示的核苷酸,分别能够编码SEQ ID NO:1至5所示的蛋白。The above-mentioned DNA molecules include isolated DNA molecules. It can encode the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein through transcription and translation. The above-mentioned DNA molecules include single-stranded or double-stranded DNA, which can carry genetic information and thus achieve the function of encoding proteins. According to the host cell in which the DNA molecule is located or where the encoding occurs Codon preference and flexible codon optimization can achieve efficient expression of DNA molecules. The above-mentioned nucleotides shown in SEQ ID NO: 6 to 10 can respectively encode the proteins shown in SEQ ID NO: 1 to 5.
在本申请第四种典型的实施方式中,提供了一种重组载体,该重组载体包含上述DNA分子。In the fourth typical embodiment of the present application, a recombinant vector is provided, which includes the above-mentioned DNA molecule.
在一种优选的实施例中,DNA分子与启动子连接;优选地,启动子包括但不限于诱导型启动子、组成型启动子或组织特异性启动子中的一种或多种;优选地,启动子包括但不限于T7、SP6、T3、CMV、EF1a、SV40、PGK1、humanβ-actin、CAG、U6、H1、T7、T7lac、araBAD、trp、lac或Ptac中的一种或多种;优选地,重组载体包括但不限于逆转录病毒载体、慢病毒载体、腺病毒载体、腺相关病毒载体、单纯疱疹载体或噬菌粒载体;优选地,所述重组载体包括质粒载体。In a preferred embodiment, the DNA molecule is connected to a promoter; preferably, the promoter includes but is not limited to one or more of an inducible promoter, a constitutive promoter or a tissue-specific promoter; preferably , promoters include but are not limited to one or more of T7, SP6, T3, CMV, EF1a, SV40, PGK1, humanβ-actin, CAG, U6, H1, T7, T7lac, araBAD, trp, lac or Ptac; Preferably, the recombinant vector includes, but is not limited to, a retroviral vector, a lentiviral vector, an adenoviral vector, an adeno-associated virus vector, a herpes simplex vector or a phagemid vector; preferably, the recombinant vector includes a plasmid vector.
在本申请第五种典型的实施方式中,提供了一种宿主细胞,该宿主细胞转化有上述重组载体。In the fifth typical embodiment of the present application, a host cell is provided, and the host cell is transformed with the above-mentioned recombinant vector.
将上述DNA分子整合到上述重组载体中,获得能够在宿主细胞中复制、表达的重组载体,从而实现DNA分子的大规模复制和表达,进而实现上述CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白的表达、纯化,或在宿主细胞中发挥蛋白活性。Integrate the above-mentioned DNA molecules into the above-mentioned recombinant vector to obtain a recombinant vector capable of replication and expression in host cells, thereby achieving large-scale replication and expression of DNA molecules, thereby realizing the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector Expression, purification, or protein activity of fusion proteins in host cells.
在本申请第六种典型的实施方式中,提供了一种基因编辑系统,该基因编辑系统包括:a)RNA指导物或编码RNA指导物的核酸,RNA指导物包括直接重复序列和间隔子序列,间隔子序列用于与靶核酸杂交;b)上述CRISPR-Cas效应子蛋白,或CRISPR-Cas效应子融合蛋白,或DNA分子,或重组载体,或宿主细胞;DNA分子、重组载体或宿主细胞能够表达CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,在基因编辑系统中,CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,与RNA指导物结合后,靶向作用于间隔子序列与靶核酸杂交形成的杂交序列。In a sixth typical embodiment of the present application, a gene editing system is provided. The gene editing system includes: a) an RNA guide or a nucleic acid encoding an RNA guide. The RNA guide includes a direct repeat sequence and a spacer sequence. , the spacer sequence is used to hybridize with the target nucleic acid; b) the above-mentioned CRISPR-Cas effector protein, or CRISPR-Cas effector fusion protein, or DNA molecule, or recombinant vector, or host cell; DNA molecule, recombinant vector or host cell Able to express CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein. In the gene editing system, the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is combined with the RNA guide to target the spacer. The hybridization sequence formed by hybridization of the subsequence with the target nucleic acid.
上述基因编辑系统,包括RNA指导物或编码RNA指导物的核酸,以及CRISPR-Cas效应子蛋白,或CRISPR-Cas效应子融合蛋白,或能够表达上述蛋白的DNA分子、重组载体或宿主细胞。该基因编辑系统,通过RNA指导物与靶核酸杂交结合,形成杂交序列;杂交序列上的RNA指导物能够与上述蛋白结合形成复合物,从而使得蛋白靠近靶核酸完成定位,蛋白从而发挥活性,靶向或非靶向地对上述靶核酸进行切割、切口等修饰。The above-mentioned gene editing system includes RNA guides or nucleic acids encoding RNA guides, and CRISPR-Cas effector proteins, or CRISPR-Cas effector fusion proteins, or DNA molecules, recombinant vectors or host cells capable of expressing the above-mentioned proteins. This gene editing system hybridizes and combines the RNA guide with the target nucleic acid to form a hybrid sequence; the RNA guide on the hybrid sequence can combine with the above-mentioned protein to form a complex, thereby positioning the protein close to the target nucleic acid, and the protein can exert activity and target. The above-mentioned target nucleic acid may be modified in a targeted or non-targeted manner by cutting, nicking or the like.
在一种优选的实施例中,基因编辑系统不包含tracrRNA。In a preferred embodiment, the gene editing system does not contain tracrRNA.
tracrRNA是trans-activating crRNA的简写,在涉及Cas9蛋白的基因编辑系统中单独转录,并与crRNA结合形成gRNA(guide RNA),从而与Cas9蛋白结合而引导蛋白定位。上述基因编辑系统中不包含tracrRNA,则引导蛋白定位的gRNA长度和分子量更小,gRNA与Cas蛋白的结合域更小,从而实现Cas蛋白分子量和尺寸的减小,丰富了CRISPR-Cas系统的应用场景。TracrRNA is the abbreviation of trans-activating crRNA. It is transcribed separately in the gene editing system involving Cas9 protein and combines with crRNA to form gRNA (guide RNA), which combines with Cas9 protein to guide protein positioning. If the above gene editing system does not contain tracrRNA, the length and molecular weight of the gRNA that guides protein positioning will be smaller, and the binding domain between gRNA and Cas protein will be smaller, thereby reducing the molecular weight and size of the Cas protein and enriching the applications of the CRISPR-Cas system. Scenes.
在一种优选的实施例中,RNA指导物包括1种或多种。 In a preferred embodiment, the RNA guide includes one or more types.
上述RNA指导物包括1种、2种、3种、4种、5种乃至更多种。基因编辑系统中包括多种RNA指导物,及能够同时与多个靶核酸杂交,进而在多个靶核酸位点发挥该基因编辑系统的活性,能够大大提高基因编辑的效率,降低基因编辑、细胞传代、编辑结果验证等实验所需时间。在现有技术中,同时利用多个gRNA的情况在实际实验过程中也较为常见,例如进行基因敲除或检测应用时,均可会采用多个gRNA。The above-mentioned RNA guides include 1 type, 2 types, 3 types, 4 types, 5 types or even more types. The gene editing system includes a variety of RNA guides and can hybridize with multiple target nucleic acids at the same time, thereby exerting the activity of the gene editing system at multiple target nucleic acid sites, which can greatly improve the efficiency of gene editing and reduce the cost of gene editing and cell The time required for experiments such as passaging and verification of editing results. In the existing technology, it is also common to use multiple gRNAs at the same time in actual experimental processes. For example, when performing gene knockout or detection applications, multiple gRNAs may be used.
在一种优选的实施例中,靶核酸包括DNA;优选地,DNA包括来源于真核生物的DNA或来源于原核生物的DNA;优选地,真核生物包括但不限于动物或植物;优选地,DNA包括但不限于非人类哺乳动物DNA、人类DNA、昆虫DNA、鸟类DNA、爬行动物DNA、两栖动物DNA、啮齿动物DNA、鱼类DNA、蠕虫DNA、线虫DNA或酵母DNA;优选地,非人类哺乳动物DNA包括但不限于非人类灵长类DNA。In a preferred embodiment, the target nucleic acid includes DNA; preferably, DNA includes DNA derived from eukaryotic organisms or DNA derived from prokaryotes; preferably, eukaryotic organisms include but are not limited to animals or plants; preferably, eukaryotic organisms include but are not limited to animals or plants; , DNA includes but is not limited to non-human mammal DNA, human DNA, insect DNA, avian DNA, reptile DNA, amphibian DNA, rodent DNA, fish DNA, worm DNA, nematode DNA or yeast DNA; Preferably, Non-human mammalian DNA includes, but is not limited to, non-human primate DNA.
利用上述基因编辑系统,RNA指导物能够与不同来源的DNA进行杂交,从而实现对于多物种的基因编辑。Using the above gene editing system, RNA guides can be hybridized with DNA from different sources to achieve gene editing of multiple species.
在一种优选的实施例中,直接重复序列的3’端包含茎环结构,茎环结构包括依次连接的第一茎核苷酸链、环核苷酸链和第二茎核苷酸链和,第一茎核苷酸链和第二茎核苷酸链彼此杂交形成茎环结构的茎,环核苷酸链形成茎环结构的环;优选地,第一茎核苷酸链的长度为5或6个核苷酸;优选地,第二茎核苷酸链的长度为5个核苷酸;优选地,环核苷酸链的长度为6、7或8个核苷酸。In a preferred embodiment, the 3' end of the direct repeat sequence includes a stem-loop structure, and the stem-loop structure includes a first stem nucleotide chain, a cyclic nucleotide chain, and a second stem nucleotide chain connected in sequence. , the first stem nucleotide chain and the second stem nucleotide chain hybridize with each other to form a stem of a stem-loop structure, and the cyclic nucleotide chain forms a ring of a stem-loop structure; preferably, the length of the first stem nucleotide chain is 5 or 6 nucleotides; preferably, the length of the second stem nucleotide chain is 5 nucleotides; preferably, the length of the cyclic nucleotide chain is 6, 7 or 8 nucleotides.
在一种优选的实施例中,茎环结构包括SEQ ID NO:25、28、31、34或37的核苷酸序列。In a preferred embodiment, the stem-loop structure includes the nucleotide sequence of SEQ ID NO: 25, 28, 31, 34 or 37.
上述RNA指导物中包括直接重复序列和间隔子序列,其中直接重复序列的3’端为具有二级结构的茎环结构。在茎环结构中,包括能够彼此杂交的第一茎核苷酸链、第二茎核苷酸链,杂交形成双链,环核苷酸链形成环状结构。该茎环结构能够与CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白结合,从而发挥引导蛋白定位的作用。序列上的N表示所有碱基。The above-mentioned RNA guide includes direct repeat sequences and spacer sequences, wherein the 3' end of the direct repeat sequence is a stem-loop structure with secondary structure. The stem-loop structure includes a first stem nucleotide chain and a second stem nucleotide chain that can hybridize to each other to form a double strand, and the cyclic nucleotide chain forms a ring structure. This stem-loop structure can bind to CRISPR-Cas effector proteins or CRISPR-Cas effector fusion proteins to guide protein positioning. The N in the sequence indicates all bases.
在本申请实施例中,SEQ ID NO:25所述的核苷酸对应的茎环结构,能够与CasY1蛋白结合;SEQ ID NO:28所述的核苷酸对应的茎环结构,能够与CasY2蛋白结合;SEQ ID NO:31所述的核苷酸对应的茎环结构,能够与CasY3蛋白结合;SEQ ID NO:34所述的核苷酸对应的茎环结构,能够与CasY4蛋白结合;SEQ ID NO:37所述的核苷酸对应的茎环结构,能够与CasY5蛋白结合。对于茎环结构序列的选择,蛋白可以与上述茎环结构自由组合,也可以如上述的同一性进行灵活选用,而不影响蛋白与RNA指导物的结合。In the embodiments of this application, the stem-loop structure corresponding to the nucleotide described in SEQ ID NO: 25 can bind to CasY1 protein; the stem-loop structure corresponding to the nucleotide described in SEQ ID NO: 28 can bind to CasY2 Protein binding; the stem-loop structure corresponding to the nucleotide described in SEQ ID NO: 31 can bind to the CasY3 protein; the stem-loop structure corresponding to the nucleotide described in SEQ ID NO: 34 can bind to the CasY4 protein; SEQ The stem-loop structure corresponding to the nucleotide described in ID NO: 37 can bind to the CasY5 protein. Regarding the selection of the stem-loop structure sequence, the protein can be freely combined with the above-mentioned stem-loop structure, or can be flexibly selected according to the above-mentioned identity without affecting the binding of the protein to the RNA guide.
在一种优选的实施例中,直接重复序列包括与SEQ ID NO:24、27、30、33或36的核苷酸序列具有至少80%(例如,80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%)同一性的核苷酸序列;优选地,直接重复序列包括与SEQ ID NO:24、27、30、33或36的核苷酸序列具有至少85%以上,更优选90%以上,进一步优选95%以上同一性的核苷酸序列;优选地,直接重复序列包括SEQ ID NO:24、27、30、33或36的核苷酸序列。 In a preferred embodiment, the direct repeat sequence includes at least 80% (e.g., 80%, 81%, 82%, 83%) with the nucleotide sequence of SEQ ID NO: 24, 27, 30, 33 or 36 , 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100 %) nucleotide sequence identity; preferably, the direct repeat sequence includes at least 85% or more, more preferably 90% or more, and further preferably more than 90% identity with the nucleotide sequence of SEQ ID NO: 24, 27, 30, 33 or 36 A nucleotide sequence with more than 95% identity; preferably, the direct repeat sequence includes the nucleotide sequence of SEQ ID NO: 24, 27, 30, 33 or 36.
在一种优选的实施例中,间隔子序列包括80%(例如,80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%)以上与靶核酸互补;优选地,间隔子序列包括90%以上,更优选95%以上,进一步优选99%以上,更进一步优选100%与靶核酸互补;优选地,间隔子序列的长度为18-41nt;优选地,间隔子序列的长度为18-37nt;优选地,间隔子序列长度为18-26nt或34-36nt;优选地,间隔子序列长度为20nt。In a preferred embodiment, the spacer sequence includes 80% (e.g., 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90% , 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%) or more complementary to the target nucleic acid; preferably, the spacer sequence includes more than 90%, more preferably More than 95%, more preferably more than 99%, even more preferably 100% complementary to the target nucleic acid; preferably, the length of the spacer sequence is 18-41nt; preferably, the length of the spacer sequence is 18-37nt; preferably, the spacer sequence The length of the subsequence is 18-26nt or 34-36nt; preferably, the length of the spacer subsequence is 20nt.
上述间隔子序列包括80%以上与靶核酸互补的片段,通过至少80%的碱基互补配对,带有间隔子序列的RNA指导物能够与靶核酸牢固结合,从而实现该基因编辑系统对靶核酸的修饰。间隔子序列的长度可在一定范围内灵活选择,若长度过短则形成的互补链长度较短,结合力差,且结合特异性较弱,影响基因编辑效率和脱靶率;若长度过长,则增加RNA引导物的长度,且互补配对难度,也会对基因编辑效率产生影响。The above-mentioned spacer sequence includes more than 80% fragments that are complementary to the target nucleic acid. Through at least 80% complementary base pairing, the RNA guide with the spacer sequence can bind firmly to the target nucleic acid, thereby realizing the gene editing system's targeting of the target nucleic acid. modification. The length of the spacer sequence can be flexibly selected within a certain range. If the length is too short, the complementary chain formed will be shorter, have poor binding force, and have weak binding specificity, affecting gene editing efficiency and off-target rate; if the length is too long, The increased length of the RNA guide and the difficulty of complementary pairing will also affect the gene editing efficiency.
在一种优选的实施例中,直接重复序列包括第一直接重复序列和第二直接重复序列;优选地,RNA指导物包括按顺序依次连接的第一直接重复序列、间隔子序列以及第二直接重复序列;优选地,第一直接重复序列与第二直接重复序列相同。In a preferred embodiment, the direct repeat sequence includes a first direct repeat sequence and a second direct repeat sequence; preferably, the RNA guide includes a first direct repeat sequence, a spacer sequence and a second direct repeat sequence connected in sequence. Repeating sequence; preferably, the first direct repeating sequence and the second direct repeating sequence are identical.
本申请中的RNA指导物是指促进本发明中蛋白质靶向于靶核酸的任何RNA分子,包括但不限于crRNA、前crRNA(例如DR-间隔子-DR)和成熟crRNA(例如成熟DR-间隔子,成熟DR-间隔子-成熟DR)。如本申请实施例中所用的RNA指导物即为前crRNA,直接重复序列包括位于间隔子序列两端的第一直接重复序列和第二直接重复序列,茎环结构在第一和第二直接重复序列均存在。在细胞后续对于RNA指导物的加工中,能够进一步获得发挥活性的RNA指导物,如对于DR-间隔子-DR组成的前crRNA,在加工后其中一个直接重复序列被删除。An RNA guide in this application refers to any RNA molecule that facilitates targeting of a protein of the invention to a target nucleic acid, including but not limited to crRNA, pre-crRNA (e.g., DR-spacer-DR), and mature crRNA (e.g., mature DR-spacer). sub, mature DR-spacer-mature DR). The RNA guide used in the examples of this application is pre-crRNA. The direct repeat sequence includes a first direct repeat sequence and a second direct repeat sequence located at both ends of the spacer sequence. The stem-loop structure is between the first and second direct repeat sequence. all exist. In the subsequent processing of RNA guides by cells, active RNA guides can be further obtained. For example, for the pre-crRNA composed of DR-spacer-DR, one of the direct repeat sequences is deleted after processing.
在一种优选的实施例中,靶核酸包含前间隔子相邻基序,CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白能够识别前间隔子相邻基序(PAM),前间隔子相邻基序包含核酸序列5’-TTN-3’,其中N是任何核苷酸;优选地,N为A、C或T。In a preferred embodiment, the target nucleic acid contains a pre-spacer adjacent motif, and the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is capable of recognizing the pre-spacer adjacent motif (PAM). The adjacent motif comprises the nucleic acid sequence 5'-TTN-3', where N is any nucleotide; preferably, N is A, C or T.
在CRISPR-Cas系统中,Cas酶可以识别与靶核酸附近相关联的短基序,从而完成对特异性位点的剪切、修饰。上述基因编辑系统中的CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,能够特异性识别的PAM为5’-TTN-3’,其中N是任何核苷酸。且对5’-TTA-3’、5’-TTC-3’和5’-TTT-3’识别效率最高。In the CRISPR-Cas system, the Cas enzyme can recognize short motifs associated with the target nucleic acid, thereby completing the cutting and modification of specific sites. The PAM that can be specifically recognized by the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein in the above gene editing system is 5’-TTN-3’, where N is any nucleotide. And it has the highest recognition efficiency for 5’-TTA-3’, 5’-TTC-3’ and 5’-TTT-3’.
在一种优选的实施例中,CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,与RNA指导物结合,形成蛋白-核酸复合物;优选地,蛋白-核酸复合物是非天然存在的或经修饰的;优选地,蛋白-核酸复合物中的至少一个组分是非天然存在的或经修饰的。In a preferred embodiment, the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein is combined with the RNA guide to form a protein-nucleic acid complex; preferably, the protein-nucleic acid complex is non-naturally occurring or Modified; preferably, at least one component of the protein-nucleic acid complex is non-naturally occurring or modified.
在基因编辑系统中,RNA指导物与Cas蛋白能够形成蛋白-核酸复合物。可以对RNA指导物进行修饰或对Cas蛋白进行修饰,或者直接采用不修饰的蛋白-核酸复合物(即RNA指导物与Cas蛋白均为天然的),均能够发挥基因编辑的作用。In gene editing systems, RNA guides and Cas proteins can form protein-nucleic acid complexes. The RNA guide can be modified or the Cas protein can be modified, or an unmodified protein-nucleic acid complex can be directly used (that is, the RNA guide and Cas protein are both natural), both of which can play a role in gene editing.
在一种优选的实施例中,通过CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白和 RNA指导物对靶核酸的靶向作用,对靶核酸进行修饰;优选地,修饰包括但不限于切割或切口;优选地,修饰导致:(1)细胞包含至少一种基因产物的表达的改变;或(2)细胞包含至少一种基因产物的表达的改变,其中至少一种基因产物的表达增加;或(3)细胞包含至少一种基因产物的表达的改变,其中至少一种基因产物的表达减少;或(4)细胞包含经编辑的基因组;优选地,修饰导致细胞毒性;优选地,修饰导致抑制基因表达,降低基因表达,或增强基因表达。In a preferred embodiment, by CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein and The RNA guide targets the target nucleic acid and modifies the target nucleic acid; preferably, the modification includes but is not limited to cleavage or nicking; preferably, the modification results in: (1) a change in the expression of the cell containing at least one gene product; or (2) the cell comprises an alteration in the expression of at least one gene product, wherein the expression of at least one gene product is increased; or (3) the cell comprises an alteration in the expression of at least one gene product, wherein the expression of at least one gene product Reduced; or (4) the cell contains an edited genome; preferably, the modification results in cytotoxicity; preferably, the modification results in inhibition of gene expression, reduction in gene expression, or enhancement of gene expression.
利用上述CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白和RNA指导物对靶核酸的靶向作用,能够对靶核酸进行修饰,包括切割或切口。切割是单链DNA或双链DNA的断裂。切口是双链DNA的某一条DNA链断裂。上述作用于细胞基因的修饰,能够导致细胞中基因产物表达的改变,这种改变包括增加或减少,也能够导致细胞基因组被编辑。上述修饰可以作用于细胞自身的基因组,也可以作用于细胞中的如质粒等外源基因。上述修饰能够导致细胞毒性,抑制基因表达,降低基因表达,或增强基因表达等情况的出现。Using the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein and RNA guide to target the target nucleic acid, the target nucleic acid can be modified, including cutting or nicking. Cleavage is the break of single-stranded DNA or double-stranded DNA. A nick is a break in one of the DNA strands of double-stranded DNA. The above-mentioned modifications to cellular genes can lead to changes in the expression of gene products in cells, including increases or decreases, and can also lead to editing of the cell genome. The above modifications can act on the cell's own genome, or on foreign genes such as plasmids in the cell. The above modifications can cause cytotoxicity, inhibit gene expression, reduce gene expression, or enhance gene expression.
在一种优选的实施例中,基因编辑系统包括目标核酸或编码目标核酸的核酸,目标核酸包括同源臂片段和供体模板核酸;优选地,目标核酸包含能够与间隔子序列杂交的序列;优选地,同源臂片段包括5’同源臂和3’同源臂,目标核酸由5’同源臂、供体模板核酸和3’同源臂顺序连接组成。In a preferred embodiment, the gene editing system includes a target nucleic acid or a nucleic acid encoding the target nucleic acid, and the target nucleic acid includes a homology arm fragment and a donor template nucleic acid; preferably, the target nucleic acid includes a sequence capable of hybridizing with the spacer sequence; Preferably, the homology arm fragment includes a 5' homology arm and a 3' homology arm, and the target nucleic acid consists of a 5' homology arm, a donor template nucleic acid and a 3' homology arm sequentially connected.
上述基因编辑系统中,还可以包括目标核酸或编码目标核酸的核酸,目标核酸包含能够与间隔子序列杂交的序列,即能够将目标核酸定位在特定的核酸位点(如靶核酸)上。在本申请中,靶核酸为基因编辑系统所修饰的核酸;供体模板核酸为对靶核酸进行修饰、修改时的模版。进一步地,目标核酸包括同源臂片段和供体模板核酸,同源臂片段能够与特定的核酸位点周围的核酸进行特异性结合,供体模板核酸则携带有特定的遗传信息,能够通过同源重组或其他机制,在上述CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白对靶核酸进行切割后,将特定的遗传信息整合到靶核酸的位置,从而完成对于靶核酸的修饰,包括碱基的插入、删除或替换,从而完成基因编辑。The above gene editing system may also include a target nucleic acid or a nucleic acid encoding the target nucleic acid. The target nucleic acid includes a sequence that can hybridize with the spacer sequence, that is, the target nucleic acid can be positioned at a specific nucleic acid site (such as a target nucleic acid). In this application, the target nucleic acid is the nucleic acid modified by the gene editing system; the donor template nucleic acid is the template for modifying the target nucleic acid. Further, the target nucleic acid includes a homology arm fragment and a donor template nucleic acid. The homology arm fragment can specifically bind to the nucleic acid surrounding a specific nucleic acid site. The donor template nucleic acid carries specific genetic information and can be synthesized through homology. Source recombination or other mechanisms, after the above-mentioned CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein cleaves the target nucleic acid, specific genetic information is integrated into the position of the target nucleic acid, thereby completing the modification of the target nucleic acid, including Insertion, deletion or replacement of bases to complete gene editing.
“供体模板核酸”是指在Cas酶改变(修饰,包括碱基的插入、删除或替换)了靶核酸之后,一种或多种细胞蛋白质(如同源重组酶)可以使用其来改变靶核酸的结构的核酸分子。举例说明,利用Cas蛋白对靶核酸进行修饰后,利用同源重组酶,以供体模板核酸为模版,对于靶核酸进行插入、删除或替换等修饰、改变。供体模板核酸可以是双链核酸、单链核酸;供体模板核酸可以是线性的、环状的(例如可以采用质粒);供体模板核酸可以是外源核酸分子。供体模板核酸包括DNA或RNA。"Donor template nucleic acid" means that one or more cellular proteins (such as homologous recombinase) can use it to change the target nucleic acid after the Cas enzyme has changed (modified, including the insertion, deletion or replacement of bases) the target nucleic acid. The structure of a nucleic acid molecule. For example, after using Cas protein to modify the target nucleic acid, homologous recombinase is used to modify and change the target nucleic acid by inserting, deleting, or replacing the donor template nucleic acid as a template. The donor template nucleic acid can be double-stranded nucleic acid or single-stranded nucleic acid; the donor template nucleic acid can be linear or circular (for example, a plasmid can be used); the donor template nucleic acid can be an exogenous nucleic acid molecule. Donor template nucleic acids include DNA or RNA.
在一种优选的实施例中,基因编辑系统以可递送的形式存在,利用递送系统,使基因编辑系统与靶核酸接触;优选地,递送系统将基因编辑系统递送入含有靶核酸的细胞中;优选地,可递送的形式包括但不限于纳米颗粒、脂质体、外泌体、微泡、蛋白衣壳或基因枪所用的颗粒。In a preferred embodiment, the gene editing system exists in a deliverable form, and the delivery system is used to bring the gene editing system into contact with the target nucleic acid; preferably, the delivery system delivers the gene editing system into cells containing the target nucleic acid; Preferably, deliverable forms include, but are not limited to, nanoparticles, liposomes, exosomes, microvesicles, protein capsids, or particles for gene guns.
对于存在于细胞内的靶核酸,通过上述递送系统能够将基因编辑系统递送入细胞中,从 而完成基因编辑系统与靶核酸的接触。递送系统也可以进一步地定位于特定的细胞种类、细胞内部结构等位置,达到基因编辑系统精确递送的目的,提高基因编辑的精确性。For target nucleic acids present in cells, the gene editing system can be delivered into the cells through the above delivery system. The contact between the gene editing system and the target nucleic acid is completed. The delivery system can also be further positioned at specific cell types, internal cell structures, etc., to achieve the purpose of precise delivery of the gene editing system and improve the accuracy of gene editing.
在本申请第七种典型的实施方式中,提供了一种基因编辑载体,该基因编辑载体包含上述编码RNA指导物的核酸。In a seventh typical embodiment of the present application, a gene editing vector is provided, which contains the above-mentioned nucleic acid encoding an RNA guide.
在一种优选的实施例中,基因编辑载体还包含上述DNA分子;优选地,DNA分子与编码RNA指导物的核酸位于相同或不同的载体上;优选地,DNA分子与第一调节元件连接;优选地,编码RNA指导物的核酸与第二调节元件连接;优选地,第一调节元件和第二调节元件分别独立选自诱导型启动子、组成型启动子或组织特异性启动子中的一种或多种;优选地,第一调节元件和第二调节元件分别独立选自包括但不限于T7、SP6、T3、CMV、EF1a、SV40、PGK1、humanβ-actin、CAG、U6、H1、T7、T7lac、araBAD、trp、lac或Ptac中的一种或多种。In a preferred embodiment, the gene editing vector also contains the above-mentioned DNA molecule; preferably, the DNA molecule and the nucleic acid encoding the RNA guide are located on the same or different vectors; preferably, the DNA molecule is connected to the first regulatory element; Preferably, the nucleic acid encoding the RNA guide is connected to a second regulatory element; preferably, the first regulatory element and the second regulatory element are each independently selected from one of an inducible promoter, a constitutive promoter or a tissue-specific promoter. One or more; preferably, the first regulatory element and the second regulatory element are independently selected from the group including but not limited to T7, SP6, T3, CMV, EF1a, SV40, PGK1, human β-actin, CAG, U6, H1, T7 , T7lac, araBAD, trp, lac or one or more of Ptac.
上述基因编辑载体包含能够编码RNA指导物的核酸,能够复制编码RNA指导物的核酸,并在细胞中编码RNA指导物。利用上述包含DNA分子的重组载体与该基因编辑载体,能够分别表达CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白与RNA指导物,即能表达上述基因编辑系统。在该基因编辑载体中,也可以包含上述DNA分子,DNA分子与编码RNA指导物的核酸位于相同或不同的载体上。若位于相同载体上,则基因编辑载体包括一个载体;若位于不同载体上,则多个载体组成基因编辑载体。利用上述基因编辑载体,能够在细胞中表达上述基因编辑系统。在基因编辑载体上,可以各自独立地灵活设置多种如启动子等调节元件,帮助RNA指导物和/或蛋白完成转录、翻译、纯化等工作。The above-mentioned gene editing vector contains a nucleic acid capable of encoding an RNA guide, capable of replicating the nucleic acid encoding the RNA guide, and encoding the RNA guide in the cell. The above-mentioned recombinant vector containing DNA molecules and the gene editing vector can be used to express CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein and RNA guide respectively, that is, the above-mentioned gene editing system can be expressed. The gene editing vector may also contain the above-mentioned DNA molecule, and the DNA molecule and the nucleic acid encoding the RNA guide are located on the same or different vectors. If they are on the same vector, the gene editing vector includes one vector; if they are on different vectors, multiple vectors make up the gene editing vector. Using the above gene editing vector, the above gene editing system can be expressed in cells. On the gene editing vector, a variety of regulatory elements such as promoters can be flexibly set independently to help RNA guides and/or proteins complete transcription, translation, purification and other tasks.
在本申请第八种典型的实施方式中,提供了一种上述基因编辑系统与细胞中靶核酸结合的方法,该方法包括:将基因编辑系统递送至细胞中,细胞包括靶核酸;使CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,与RNA指导物结合,使间隔子序列与靶核酸结合。In an eighth typical embodiment of the present application, a method for binding the above-mentioned gene editing system to a target nucleic acid in a cell is provided. The method includes: delivering the gene editing system to the cell, and the cell includes the target nucleic acid; allowing CRISPR- Cas effector proteins or CRISPR-Cas effector fusion proteins combine with the RNA guide to bind the spacer sequence to the target nucleic acid.
在一种优选的实施例中,靶核酸为双链DNA或单链DNA;优选地,基因编辑系统与细胞中靶核酸的结合,导致靶核酸的表达状态改变;优选地,基因编辑系统与细胞中靶核酸的结合,导致靶核酸被切割;优选地,靶核酸被切割导致靶核酸破坏、或靶核酸特定位点替换、或靶核酸位点的移除、或靶核酸区域功能的改变、或靶核酸上两个位点之间的序列倒置。In a preferred embodiment, the target nucleic acid is double-stranded DNA or single-stranded DNA; preferably, the combination of the gene editing system and the target nucleic acid in the cell causes a change in the expression state of the target nucleic acid; preferably, the gene editing system and the cell The binding of the target nucleic acid results in the target nucleic acid being cleaved; preferably, the target nucleic acid is cleaved leading to the destruction of the target nucleic acid, or the replacement of a specific site of the target nucleic acid, or the removal of the target nucleic acid site, or the change of the function of the target nucleic acid region, or A sequence inversion between two sites on a target nucleic acid.
靶核酸破坏,例如靶标突变,例如导致基因敲除;靶核酸替换,例如导致靶标校正;靶位点的移除,例如导致靶标缺失;功能的改变,例如,靶核酸活性或可及性,导致例如(转录的和/或表观遗传的)基因或基因组区域激活或者基因或基因组区域沉默。靶核酸上两个位点之间的序列倒置,例如通过切割靶核酸包括在两个位点,再利用Donor序列或Cre-loxP重组酶系统即可实现。Target nucleic acid disruption, e.g., target mutation, e.g., resulting in gene knockout; Target nucleic acid replacement, e.g., resulting in target correction; Removal of target sites, e.g., resulting in target deletion; Changes in function, e.g., target nucleic acid activity or accessibility, resulting For example (transcriptional and/or epigenetic) gene or genomic region activation or gene or genomic region silencing. Sequence inversion between two sites on the target nucleic acid can be achieved, for example, by cutting the target nucleic acid into two sites and then using the Donor sequence or Cre-loxP recombinase system.
上述递送包括但不限于质粒转化、显微注射、纳米颗粒、脂质体、外泌体、微泡、蛋白衣壳或基因枪等现有技术。靶核酸破坏包括靶核酸突变或基因敲除,靶核酸特定位点替换包括靶核酸中错误碱基被纠正,靶核酸位点的移除能够导致靶核酸位点缺失,区域功能的改变包括但不限于靶核酸表达活性重新激活,或靶核酸表达活性失活,或表达量的调高,或表达 量的调低,或表达产物改变。靶核酸的表达状态改变,包括但不限于基因沉默或基因表达激活。The above-mentioned delivery includes, but is not limited to, existing technologies such as plasmid transformation, microinjection, nanoparticles, liposomes, exosomes, microvesicles, protein capsids or gene guns. Target nucleic acid destruction includes target nucleic acid mutation or gene knockout. Target nucleic acid site-specific replacement includes correction of incorrect bases in the target nucleic acid. Removal of the target nucleic acid site can lead to deletion of the target nucleic acid site. Changes in regional function include but not Limited to reactivation of target nucleic acid expression activity, or inactivation of target nucleic acid expression activity, or increase in expression level, or expression The amount is reduced, or the expression product is changed. Changes in the expression state of the target nucleic acid, including but not limited to gene silencing or gene expression activation.
在本申请第九种典型的实施方式中,提供了一种含有基因编辑系统的细胞,含有基因编辑系统的细胞包括上述基因编辑系统、或基因编辑载体。In a ninth typical embodiment of the present application, a cell containing a gene editing system is provided. The cell containing a gene editing system includes the above-mentioned gene editing system or gene editing vector.
在一种优选的实施例中,含有基因编辑系统的细胞包含经修饰的目的靶基因座,目的靶基因座为利用基因编辑系统修饰的基因座;优选地,目的靶基因座的修饰导致:(1)含有基因编辑系统的细胞包含至少一种基因产物的表达的改变;或(2)含有基因编辑系统的细胞包含至少一种基因产物的表达的改变,其中至少一种基因产物的表达增加;或(3)含有基因编辑系统的细胞包含至少一种基因产物的表达的改变,其中至少一种基因产物的表达减少;或(4)含有基因编辑系统的细胞包含经编辑的基因组;优选地,含有基因编辑系统的细胞包括真核细胞或原核细胞;优选地,真核细胞包括但不限于动物细胞、植物细胞或人类细胞;优选地,动物细胞包括但不限于哺乳动物细胞。In a preferred embodiment, the cell containing the gene editing system contains a modified target target locus, and the target target locus is a locus modified using the gene editing system; preferably, the modification of the target target locus results in: ( 1) The cells containing the gene editing system contain changes in the expression of at least one gene product; or (2) The cells containing the gene editing system contain changes in the expression of at least one gene product, wherein the expression of at least one gene product is increased; or (3) the cell containing the gene editing system contains an alteration in the expression of at least one gene product, wherein the expression of at least one gene product is reduced; or (4) the cell containing the gene editing system contains an edited genome; preferably, Cells containing gene editing systems include eukaryotic cells or prokaryotic cells; preferably, eukaryotic cells include but are not limited to animal cells, plant cells or human cells; preferably, animal cells include but are not limited to mammalian cells.
上述含有基因编辑系统的细胞中含有上述基因编辑系统或基因编辑载体,处于基因编辑未发生、基因编辑正在发生、基因编辑已完成等多种状态的细胞均属于上述含有基因编辑系统的细胞。处于基因编辑正在发生和基因编辑已完成的上述细胞中,含有经修饰的目的靶基因座。能够导致细胞中基因产物表达的改变,这种改变包括增加或减少,也能够导致细胞基因组被编辑。上述修饰可以作用于细胞自身的基因组,也可以作用于细胞中的如质粒等外源基因。上述修饰能够导致细胞毒性,抑制基因表达,降低基因表达,或增强基因表达等情况的出现。The above-mentioned cells containing the gene editing system contain the above-mentioned gene editing system or gene editing vector. Cells in various states such as gene editing has not occurred, gene editing is occurring, gene editing has been completed, etc., all belong to the above-mentioned cells containing the gene editing system. The above-mentioned cells where gene editing is occurring and where gene editing has been completed contain modified target loci. It can lead to changes in the expression of gene products in cells, including increases or decreases, and can also lead to editing of the cell genome. The above modifications can act on the cell's own genome, or on foreign genes such as plasmids in the cell. The above modifications can cause cytotoxicity, inhibit gene expression, reduce gene expression, or enhance gene expression.
术语“靶基因座”包含期望编辑的多核苷酸的任何DNA区段或区域。在一些实施例中,靶基因座为基因组座位。靶基因座对于细胞可为天然的,或另选地可包含异源或外源DNA区段。异源或外源DNA区段可包括转基因、表达盒、编码选择标记的多核苷酸,或者异源或外源DNA区域。在具体的实施例中,所靶向基因座可包含来自原核生物、真核生物、动物或植物,包括非人哺乳动物、非人细胞、啮齿动物、人、鼠、灵长类动物或任何其他目标生物体,或者它们的组合的天然、异源或外源基因组核酸序列。The term "target locus" encompasses any DNA segment or region of a polynucleotide that is desired to be edited. In some embodiments, the target locus is a genomic locus. The target locus may be native to the cell, or alternatively may comprise heterologous or exogenous DNA segments. Heterologous or exogenous DNA segments may include transgenes, expression cassettes, polynucleotides encoding selectable markers, or heterologous or exogenous DNA regions. In specific embodiments, the targeted loci may comprise genes from prokaryotes, eukaryotes, animals or plants, including non-human mammals, non-human cells, rodents, humans, mice, primates or any other The native, heterologous or exogenous genomic nucleic acid sequence of the target organism, or a combination thereof.
在本申请第十种典型的实施方式中,提供了一种靶向和编辑靶核酸的方法,该方法包括使靶核酸与上述基因编辑系统接触。In a tenth typical embodiment of the present application, a method for targeting and editing a target nucleic acid is provided, which method includes contacting the target nucleic acid with the above-mentioned gene editing system.
在本申请第十一种典型的实施方式中,提供了一种在识别靶核酸后非特异性降解单链DNA的方法,该方法包括使靶核酸与上述基因编辑系统接触。利用近似于Cas12i的旁切效应的活性,能够实现上述方法。Cas12i蛋白可具有附带活性,即在某些环境中,激活的Cas12i蛋白在结合靶序列后仍然保持活性,并继续非特异性地切割非靶寡核苷酸。该附带活性被称为“旁切活性”或“乱切活性”,利用该活性,能够使用Cas12i系统检测特定靶寡核苷酸的存在。比如,将Cas12i系统工程化以非特异性切割ssDNA或转录物。比如,Cas12i在体外系统或细胞中瞬时或稳定地提供或表达,并靶向或触发以非特异性地切割细胞核酸,例如ssDNA,例如病毒ssDNA。 In an eleventh typical embodiment of the present application, a method for non-specific degradation of single-stranded DNA after identifying a target nucleic acid is provided, which method includes contacting the target nucleic acid with the above-mentioned gene editing system. The above method can be realized by utilizing the activity similar to the side-cleaving effect of Cas12i. Cas12i proteins can have incidental activity, that is, under certain circumstances, the activated Cas12i protein remains active after binding to the target sequence and continues to non-specifically cleave non-target oligonucleotides. This incidental activity is called "side-cleaving activity" or "random cleavage activity", and using this activity, the Cas12i system can be used to detect the presence of specific target oligonucleotides. For example, the Cas12i system is engineered to nonspecifically cleave ssDNA or transcripts. For example, Cas12i is provided or expressed transiently or stably in in vitro systems or cells, and targets or triggers non-specific cleavage of cellular nucleic acids, such as ssDNA, such as viral ssDNA.
在现有技术中,上述旁切活性能够应用于称为SHERLOCK的高灵敏度和特异性核酸检测平台,该平台可用于许多临床诊断。利用“旁切活性”进行检测时,会用到报告核酸,“报告核酸”是指可被激活的CRISPR系统蛋白切割或以其他方式减活的分子。报告核酸包含可被CRISPR蛋白切割的核酸元件。核酸元件的切割释放出剂或产生构象变化,从而允许产生可检测的信号。在切割之前,或者当报告核酸处于“活性”状态时,报告核酸阻止阳性可检测信号的产生或检测。比如,在存在活性报告核酸的情况下可产生最小的背景信号。阳性可检测信号可以是可使用光学、荧光、化学发光、电化学或本领域已知的其他检测方法检测的任何信号。例如,在某些实施方式中,当存在报告核酸时,可检测到第一信号(即阴性可检测信号),然后在检测到靶分子以及通过激活的CRISPR蛋白切割或减活后将其转换为第二信号(例如阳性可检测信号)。利用上述方法或上述CRISPR-Cas效应子蛋白,能够实现上述应用。In the existing technology, the above-mentioned side-cleaving activity can be applied to a highly sensitive and specific nucleic acid detection platform called SHERLOCK, which can be used in many clinical diagnoses. When using "by-cleaving activity" for detection, a reporter nucleic acid is used. "Reporter nucleic acid" refers to a molecule that can be cleaved or otherwise inactivated by the activated CRISPR system protein. Reporter nucleic acids contain nucleic acid elements that can be cleaved by CRISPR proteins. Cleavage of the nucleic acid element releases the agent or produces a conformational change that allows a detectable signal to be generated. Prior to cleavage, or when the reporter nucleic acid is in an "active" state, the reporter nucleic acid prevents the generation or detection of a positive detectable signal. For example, minimal background signal can be generated in the presence of an active reporter nucleic acid. A positive detectable signal can be any signal detectable using optical, fluorescent, chemiluminescent, electrochemical, or other detection methods known in the art. For example, in certain embodiments, a first signal (i.e., a negative detectable signal) can be detected when a reporter nucleic acid is present, which is then converted to a target molecule upon detection and cleavage or inactivation by an activated CRISPR protein. Second signal (eg, positive detectable signal). The above application can be realized using the above method or the above CRISPR-Cas effector protein.
在本申请第十二种典型的实施方式中,提供了一种在识别双链靶DNA的间隔子互补链后靶向双链靶DNA的非间隔子互补链并使其产生切口的方法,该方法包括使双链靶DNA与上述基因编辑系统接触。In a twelfth typical embodiment of the present application, a method of targeting and nicking the non-spacer complementary strand of a double-stranded target DNA after identifying the spacer complementary strand of the double-stranded target DNA is provided. Methods include contacting double-stranded target DNA with the gene editing system described above.
在本申请第十三种典型的实施方式中,提供了一种靶向和切割双链靶DNA的方法,该方法包括使双链靶DNA与上述基因编辑系统接触。In a thirteenth typical embodiment of the present application, a method for targeting and cutting double-stranded target DNA is provided, which method includes contacting the double-stranded target DNA with the above-mentioned gene editing system.
在一种优选的实施例中,在使双链DNA的间隔子互补链产生切口之前,使双链靶DNA的非间隔子互补链产生切口。In a preferred embodiment, before nicking the spacer complementary strand of the double-stranded DNA, the non-spacer complementary strand of the double-stranded target DNA is nicked.
在本申请第十四种典型的实施方式中,提供了一种特异性编辑双链核酸的方法,该方法包括在充分的条件下使以下进行接触充分的时间量,(1)上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子融合蛋白、另一具有序列特异性切口活性的酶,以及RNA指导物,RNA指导物指导CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白,相对于另一序列特异性切口酶的活性使相对链产生切口;以及(2)双链核酸;上述方法导致双链断裂的形成。In a fourteenth typical embodiment of the present application, a method for specifically editing double-stranded nucleic acids is provided, which method includes contacting the following under sufficient conditions for a sufficient amount of time, (1) the above-mentioned CRISPR-Cas an effector protein, or a CRISPR-Cas effector fusion protein, another enzyme having sequence-specific nicking activity, and an RNA guide, the RNA guide directs the CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein, relative to The activity of another sequence-specific nicking enzyme nicks the opposite strand; and (2) double-stranded nucleic acids; the above method results in the formation of double-stranded breaks.
在上述方法中,可以将两种上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子融合蛋白、或者一种上述蛋白和另一种具有切口活性的Cas蛋白(两种Cas蛋白被一对RNA指导物靶向到靶基因座的相对链)可以产生具有突出端的双链断裂。这种方法可降低脱靶修饰的可能性,因为双链断裂预计只发生在两种酶都产生切口的基因座处,从而增加基因组编辑特异性。该方法也称为“双切口”或“配对切口酶”策略。In the above method, two of the above-mentioned CRISPR-Cas effector proteins, or a CRISPR-Cas effector fusion protein, or one of the above-mentioned proteins and another Cas protein with nicking activity (two Cas proteins are combined with a pair of RNA Guides targeted to the opposite strand of the target locus) can create double-stranded breaks with overhangs. This approach reduces the potential for off-target modifications because double-strand breaks are expected to occur only at loci where both enzymes nick, thereby increasing genome editing specificity. This approach is also known as the "double-nicking" or "paired-nickase" strategy.
在本申请第十五种典型的实施方式中,提供了一种编辑双链核酸的方法,该方法包括在充分的条件下使以下进行接触充分的时间量:(1)上述CRISPR-Cas效应子蛋白、或CRISPR-Cas效应子融合蛋白,和具有DNA修饰活性的蛋白质结构域的融合蛋白,以及靶向双链核酸的RNA指导物;以及(2)双链核酸;融合蛋白的CRISPR-Cas效应子被修饰以使双链核酸的非靶链产生切口。利用近似于Cas12i的旁切效应的活性,能够实现上述方法。上述双链核酸包括但不限于病毒DNA(例如,巴氏病毒、肝炎病毒、疱疹病毒、腺病毒、痘病毒、细小病毒等)。In a fifteenth typical embodiment of the present application, a method for editing double-stranded nucleic acids is provided, which method includes contacting the following under sufficient conditions for a sufficient amount of time: (1) the above-mentioned CRISPR-Cas effector Protein, or CRISPR-Cas effector fusion protein, and a fusion protein of a protein domain with DNA modification activity, and an RNA guide targeting double-stranded nucleic acid; and (2) double-stranded nucleic acid; CRISPR-Cas effect of the fusion protein The molecule is modified to nick the non-target strand of the double-stranded nucleic acid. The above method can be realized by utilizing the activity similar to the side-cleaving effect of Cas12i. The above-mentioned double-stranded nucleic acid includes, but is not limited to, viral DNA (eg, Pasteur virus, hepatitis virus, herpes virus, adenovirus, poxvirus, parvovirus, etc.).
在一种优选的实施例中,双链核酸的两条链在不同的位点被切割,导致交错切割;优选地,双链核酸的两条链在同一位点被切割,导致平双链断裂(DSB)。 In a preferred embodiment, the two strands of the double-stranded nucleic acid are cleaved at different sites, resulting in staggered cleavage; preferably, the two strands of the double-stranded nucleic acid are cleaved at the same site, resulting in flat double-stranded breaks (DSB).
在本申请第十六种典型的实施方式中,提供了一种靶向并切割单链靶DNA的方法,该方法包括使靶核酸与上述基因编辑系统接触。In a sixteenth typical embodiment of the present application, a method for targeting and cutting single-stranded target DNA is provided, which method includes contacting the target nucleic acid with the above-mentioned gene editing system.
上述基因编辑系统与靶核酸接触,能够利用基因编辑系统对靶核酸进行编辑。在充分的接触时间内,基因编辑系统能够靶核酸进行编辑。上述接触在细胞内、外均能实现。充分的条件和接触充分的时间量,表示上述方法能够进行或进行完全的反应条件和反应时间,根据上述方法的具体实施可进行灵活调整。上述基因编辑系统与靶核酸接触,能够利用基因编辑系统对靶核酸进行编辑,实现不同的基因编辑效果。在充分的接触时间内,基因编辑系统能够靶核酸进行编辑。上述接触在细胞内、外均能实现,靶核酸包括但不限于基因组、分离的单链或双链DNA。The above gene editing system is in contact with the target nucleic acid, and can use the gene editing system to edit the target nucleic acid. Within sufficient contact time, the gene editing system can target nucleic acids for editing. The above-mentioned contacts can be achieved both inside and outside the cell. Sufficient conditions and a sufficient amount of contact time indicate that the above method can proceed or complete the reaction conditions and reaction time, which can be flexibly adjusted according to the specific implementation of the above method. The above gene editing system is in contact with the target nucleic acid and can use the gene editing system to edit the target nucleic acid to achieve different gene editing effects. Within sufficient contact time, the gene editing system can target nucleic acids for editing. The above-mentioned contact can be achieved inside or outside the cell, and the target nucleic acid includes but is not limited to the genome and isolated single-stranded or double-stranded DNA.
在本申请第十七种典型的实施方式中,提供了一种诱导细胞状态改变的方法,上述方法保守使上述基因编辑系统与细胞中的靶核酸接触。In a seventeenth typical embodiment of the present application, a method for inducing changes in cell state is provided. The method conservatively brings the above gene editing system into contact with the target nucleic acid in the cell.
在一种优选的实施例中,细胞状态包括但不限于凋亡或休眠;优选地,细胞包括真核细胞或原核细胞;优选地,细胞包括但不限于哺乳动物细胞或植物病变细胞;优选地,细胞包括但不限于癌细胞;优选地,细胞包括但不限于感染性细胞或被感染原感染的细胞;优选地,细胞包括但不限于被病毒感染的细胞、被朊病毒感染的细胞;优选地,细胞包括但不限于真菌细胞、原生动物或寄生虫细胞。In a preferred embodiment, the cell state includes but is not limited to apoptosis or dormancy; preferably, the cells include eukaryotic cells or prokaryotic cells; preferably, the cells include but are not limited to mammalian cells or plant disease cells; preferably, the cells include but are not limited to mammalian cells or plant disease cells; , cells include but are not limited to cancer cells; preferably, cells include but are not limited to infectious cells or cells infected by infectious agents; preferably, cells include but are not limited to cells infected by viruses and cells infected by prions; preferably, cells include but are not limited to cells infected by viruses and prions. In particular, cells include, but are not limited to, fungal cells, protozoan or parasite cells.
上述诱导细胞状态改变的方法,通过基因编辑系统接触细胞中调控生长、代谢等功能的靶核酸,从而时间对靶核酸的修饰,进而诱导细胞状态改变。如对癌细胞、寄生虫细胞等中的特征靶核酸进行修饰,从而改变细胞的状态,使之凋亡或休眠,达到清除此类细胞的目的。The above-mentioned method of inducing cell state changes uses a gene editing system to contact target nucleic acids in cells that regulate growth, metabolism and other functions, thereby modifying the target nucleic acids over time, thereby inducing changes in cell state. For example, the characteristic target nucleic acids in cancer cells, parasite cells, etc. can be modified to change the state of the cells, causing them to undergo apoptosis or dormancy, thereby achieving the purpose of eliminating such cells.
在本申请第十八种典型的实施方式中,提供了一种利用上述基因编辑系统在制备治疗受试者病症或疾病的药物中的应用。In an eighteenth typical embodiment of the present application, there is provided an application of using the above gene editing system in preparing a drug for treating a subject's illness or disease.
在一种优选的实施例中,应用包括向受试者或受试者的离体细胞施用基因编辑系统;优选地,间隔子序列与跟病症或疾病相关的靶核酸的至少15个核苷酸互补,CRISPR-Cas效应子蛋白或CRISPR-Cas效应子融合蛋白切割靶核酸;优选地,病症或疾病包括但不限于癌症、感染性疾病、代谢性疾病或遗传性疾病;优选地,癌症包括但不限于维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱癌中的一种或多种;优选地,代谢性疾病包括家族性高胆固醇血症(FH);优选地,遗传性疾病包括转甲状腺素蛋白淀粉样变(ATTR)、原发性高草尿酸症(PH1)、遗传性血管性水肿(HAE);优选地,病症或疾病包括但不限于囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种;优选 地,感染性疾病的感染原包括但不限于人类免疫缺陷病毒、单纯疱疹病毒-1、乙型肝炎(HEPATITIS B)或单纯疱疹病毒-2中的一种或多种。In a preferred embodiment, the application includes administering the gene editing system to a subject or ex vivo cells of a subject; preferably, the spacer sequence is at least 15 nucleotides of a target nucleic acid associated with the disorder or disease. Complementary, CRISPR-Cas effector protein or CRISPR-Cas effector fusion protein cleaves the target nucleic acid; Preferably, the disorder or disease includes but is not limited to cancer, infectious disease, metabolic disease or genetic disease; Preferably, cancer includes but Not limited to Wilms tumor, Ewing sarcoma, neuroendocrine tumors, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, kidney cancer, pancreatic cancer , lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphoblastic leukemia, acute myeloid cell One or more of chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma or urinary bladder cancer; preferably, the metabolic disease includes familial hypertension cholesterolemia (FH); preferably, the genetic disease includes transthyretin amyloidosis (ATTR), primary hyperoxaluric acidosis (PH1), hereditary angioedema (HAE); preferably, the disorder or Diseases include, but are not limited to, cystic fibrosis, progressive Duchenne muscular dystrophy, Baker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy, Huntington's disease, fragile X syndrome , Friedreich's ataxia, amyotrophic lateral sclerosis, frontotemporal dementia, hereditary chronic kidney disease, hyperlipidemia, hypercholesterolemia, Leber's congenital amaurosis, sickle cell disease, or One or more beta thalassemias; preferably Specifically, the infectious agents of infectious diseases include but are not limited to one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B (HEPATITIS B) or herpes simplex virus-2.
利用上述基因编辑系统,能够应用于制备治疗受试者病症或疾病的药物。此种应用包括对受试者或其离体细胞施用基因编辑系统,对于受试者的施用包括局部施用、全身施用或靶向施用等,进而达到药物的作用。间隔子序列与跟病症或疾病相关的靶核酸的至少15个核苷酸互补,能够保证间隔子序列与靶核酸实现稳定的特异性结合,防止脱靶情况的发生。上述病症或疾病包括但不限于癌症或感染性疾病,通过该应用能够对癌细胞基因组或受试者缺陷基因等靶核酸进行修饰,进而达到药物的作用。Utilizing the above gene editing system, it can be applied to the preparation of drugs for treating conditions or diseases in subjects. Such applications include the application of gene editing systems to subjects or their isolated cells. The application to subjects includes local application, systemic application or targeted application, etc., so as to achieve the effect of the drug. The spacer sequence is complementary to at least 15 nucleotides of the target nucleic acid related to the disease or disease, which can ensure stable and specific binding between the spacer sequence and the target nucleic acid and prevent off-target situations. The above conditions or diseases include but are not limited to cancer or infectious diseases. Through this application, target nucleic acids such as cancer cell genomes or subject defective genes can be modified to achieve the effect of drugs.
在本申请第十九种典型的实施方式中,提供了一种真核细胞系,该真核细胞系包含上述含有基因编辑系统的细胞,或为含有基因编辑系统的细胞的后代。In a nineteenth typical embodiment of the present application, a eukaryotic cell line is provided, which includes the above-mentioned cells containing the gene editing system, or is the progeny of the cells containing the gene editing system.
在本申请第二十种典型的实施方式中,提供了一种多细胞生物体,该多细胞生物体包含上述含有基因编辑系统的细胞。In a twentieth typical embodiment of the present application, a multicellular organism is provided, which includes the above-mentioned cell containing the gene editing system.
在一种优选的实施例中,多细胞生物体包括但不限于模型动物或模型植物。In a preferred embodiment, multicellular organisms include but are not limited to model animals or model plants.
上述多细胞生物体为利用上述基因编辑系统修饰过的多细胞生物体,在上述基因编辑系统的作用下获得可遗传或不可遗传的基因修饰。上述基因修饰包括利用基因编辑系统进行的基因插入、删除或替换,基因修饰的结果,及该多细胞生物体的基因修饰为可控制和可预期的。The above-mentioned multicellular organisms are multicellular organisms modified using the above-mentioned gene editing system, and heritable or non-heritable genetic modifications are obtained under the action of the above-mentioned gene editing system. The above-mentioned genetic modification includes gene insertion, deletion or replacement using a gene editing system, the result of genetic modification, and the genetic modification of the multicellular organism are controllable and predictable.
在本申请第二十一种典型的实施方式中,提供了一种获得目的性状的植物的方法,利用上述基因编辑系统与植物细胞接触,对植物细胞的基因进行修饰或引入目的基因,修饰或目的基因能够表达目的性状,获得修饰后的植物细胞,利用修饰后的植物细胞进行再生,获得目的性状的植物。In the twenty-first typical embodiment of the present application, a method for obtaining plants with target traits is provided. The above-mentioned gene editing system is used to contact plant cells to modify the genes of plant cells or introduce target genes. Modify or The target gene can express the target traits, obtain modified plant cells, and use the modified plant cells for regeneration to obtain plants with the target traits.
在本申请第二十二种典型的实施方式中,提供了一种鉴定植物中目的性状的方法,植物细胞中的目的基因能够表达目的性状,利用上述基因编辑系统与植物细胞接触,从而鉴定目的基因。In the twenty-second typical embodiment of the present application, a method for identifying target traits in plants is provided. The target genes in plant cells can express the target traits, and the above gene editing system is used to contact the plant cells, thereby identifying the target traits. Gene.
在本申请第二十三种典型的实施方式中,提供了一种试剂盒,该试剂盒包括一种或多种选自下列的组分:上述CRISPR-Cas效应子蛋白、CRISPR-Cas效应子融合蛋白、DNA分子、重组载体、宿主细胞、基因编辑系统、基因编辑载体、含有基因编辑系统的细胞、真核细胞系、多细胞生物体;试剂盒的组分分布在在相同或不同的容器中。In the twenty-third typical embodiment of the present application, a kit is provided, which includes one or more components selected from the following: the above-mentioned CRISPR-Cas effector protein, CRISPR-Cas effector Fusion proteins, DNA molecules, recombinant vectors, host cells, gene editing systems, gene editing vectors, cells containing gene editing systems, eukaryotic cell lines, multicellular organisms; the components of the kit are distributed in the same or different containers middle.
在本申请第二十四种典型的实施方式中,提供了一种容器,该容器包含上述试剂盒。In the twenty-fourth typical embodiment of the present application, a container is provided, which container contains the above-mentioned kit.
在一种优选的实施例中,容器包括无菌容器;优选地,容器包括注射器。In a preferred embodiment, the container includes a sterile container; preferably, the container includes a syringe.
在本申请第二十五种典型的实施方式中,提供了一种可植入装置,该可植入装置包括上述基因编辑系统。In the twenty-fifth typical embodiment of the present application, an implantable device is provided, and the implantable device includes the above-mentioned gene editing system.
在一种优选的实施例中,基因编辑系统在基质内;优选地,基因编辑系统在储库内。 In a preferred embodiment, the gene editing system is within the matrix; preferably, the gene editing system is within the reservoir.
可使用可植入装置将本发明的CRISPR-Cas效应子蛋白、CRISPR-Cas效应子融合蛋白、本公开的RNP、本公开的核酸(例如,上述DNA分子、重组载体、基因编辑载体、CRISPR-Cas效应子指导RNA、编码CRISPR-Cas效应子指导RNA的核酸、编码CRISPR-Cas效应子蛋白的核酸、供体模板等)或本公开的CRISPR-Cas效应子系统递送至靶细胞(例如,体内靶细胞,其中靶细胞是循环中的靶细胞、组织中的靶细胞、器官中的靶细胞等)。适用于将本公开的CRISPR-Cas效应子多肽、本公开的CRISPR-Cas效应子融合多肽、本公开的RNP、本公开的核酸或本公开的CRISPR-Cas效应子系统递送至靶细胞(例如,体内靶细胞,其中靶细胞是循环中的靶细胞、组织中的靶细胞、器官中的靶细胞等)的可植入装置可包括容器(例如,储库、基质等),所述容器包含CRISPR-Cas效应子蛋白、CRISPR-Cas效应子CRISPR-Cas效应子融合蛋白、RNP或CRISPR-Cas效应子系统(或其组分,例如本公开的核酸)。The CRISPR-Cas effector protein, CRISPR-Cas effector fusion protein of the present invention, the RNP of the present disclosure, the nucleic acid of the present disclosure (for example, the above-mentioned DNA molecules, recombinant vectors, gene editing vectors, CRISPR- Cas effector guide RNA, nucleic acid encoding CRISPR-Cas effector guide RNA, nucleic acid encoding CRISPR-Cas effector protein, donor template, etc.) or the CRISPR-Cas effector system of the present disclosure is delivered to target cells (e.g., in vivo Target cells, where target cells are target cells in circulation, target cells in tissues, target cells in organs, etc.). Suitable for delivering a CRISPR-Cas effector polypeptide of the disclosure, a CRISPR-Cas effector fusion polypeptide of the disclosure, an RNP of the disclosure, a nucleic acid of the disclosure, or a CRISPR-Cas effector system of the disclosure to a target cell (e.g., Implantable devices that target cells in vivo, wherein the target cells are target cells in circulation, target cells in tissue, target cells in organs, etc.) may include a container (e.g., reservoir, matrix, etc.) that contains CRISPR -Cas effector protein, CRISPR-Cas effector CRISPR-Cas effector fusion protein, RNP or CRISPR-Cas effector system (or components thereof, such as nucleic acids of the present disclosure).
合适的可植入装置可包括例如用作装置主体的聚合物基底(诸如基质),并且在一些情况下包括另外的支架材料(诸如金属或另外的聚合物),以及增强可见性和成像的材料。可植入递送装置可有利于在局部和长时间内提供释放,其中待递送的多肽和/或核酸直接释放至靶位点,例如细胞外基质(ECM)、肿瘤周围的脉管系统、病变组织等。合适的可植入递送装置包括适用于递送至腔(诸如腹腔)和/或其中药物递送系统未锚定或附接的任何其他类型的施用的装置,所述装置包括生物稳定的和/或可降解的和/或生物可吸收的聚合物基底,其可以例如任选地是基质。Suitable implantable devices may include, for example, a polymeric substrate (such as a matrix) serving as the body of the device, and in some cases additional scaffolding materials (such as metals or additional polymers), as well as materials that enhance visibility and imaging . Implantable delivery devices may be advantageous in providing localized and prolonged release, where the polypeptide and/or nucleic acid to be delivered is released directly to the target site, such as the extracellular matrix (ECM), vasculature surrounding tumors, diseased tissue wait. Suitable implantable delivery devices include devices suitable for delivery to a cavity (such as the peritoneal cavity) and/or any other type of administration where the drug delivery system is not anchored or attached, including devices that are biostable and/or implantable. A degradable and/or bioabsorbable polymeric substrate, which may optionally be the matrix, for example.
上述基质包括在与生物环境接触后被有效加工和/或改型而不形成生物学活性、毒性和/或有害的副产物的材料。包括但不限于生物可吸收基质,可用于生物可吸收基质的材料包括例如生物聚合物(例如蛋白质、肽、碳水化合物、多核苷酸等)、合成聚合物、蛋白质、多糖、丝、聚癸二酸甘油酯(PGS)、聚二恶烷酮、聚乳酸-羟基乙酸共聚物(PLGA)、聚乳酸(PLA)、胶原、壳聚糖、丝蛋白及其组合。可用于生物可吸收基质的丝材料包括例如蚕丝蛋白、改性蚕丝蛋白、蜘蛛丝、昆虫丝、重组丝及其任意组合。上述储库储库组合物包括可生物侵蚀、可生物相容的聚合物的基质,包括但不限于高分子凝胶储库或膜控储库。Such matrices include materials that are efficiently processed and/or modified upon contact with a biological environment without forming biologically active, toxic and/or harmful by-products. Including but not limited to bioabsorbable matrices, materials that can be used for bioabsorbable matrices include, for example, biopolymers (e.g., proteins, peptides, carbohydrates, polynucleotides, etc.), synthetic polymers, proteins, polysaccharides, silk, polydecane Glyceryl acid esters (PGS), polydioxanone, polylactic acid-co-glycolic acid (PLGA), polylactic acid (PLA), collagen, chitosan, silk protein and combinations thereof. Silk materials that can be used in the bioabsorbable matrix include, for example, silk fibroin, modified silk fibroin, spider silk, insect silk, recombinant silk, and any combination thereof. The above-mentioned reservoir reservoir composition includes a matrix of bioerodible, biocompatible polymers, including but not limited to polymer gel reservoirs or membrane-controlled reservoirs.
在一些情况下,合适的可植入药物递送装置包含可降解聚合物,其中主要释放机制是整体侵蚀(bulkerosion)。在一些情况下,合适的可植入药物递送装置包含不可降解或缓慢降解的聚合物,其中主要释放机制是扩散而不是整体侵蚀,使得外部部分用作膜并且其内部部分用作药物储库,实际上,所述药物储库长时间内(例如约一周至约几个月)不会受到周围环境的影响。也可任选地使用具有不同释放机制的不同聚合物的组合。在总释放期的有效期内,浓度梯度可保持有效恒定,并且因此扩散速率是有效恒定的(称为“零模式”扩散)。术语“恒定”意指扩散速率维持高于治疗有效性的下阈值,但其仍然任选地以初始突发为特征并且/或者可波动,例如增加和降低到某一程度。扩散速率可长时间这样维持,并且可认为扩散速率恒定到某一水平以优化治疗有效期,例如有效的沉默期。In some cases, suitable implantable drug delivery devices include degradable polymers where the primary release mechanism is bulkerosion. In some cases, suitable implantable drug delivery devices contain non-degradable or slowly degrading polymers in which the primary release mechanism is diffusion rather than bulk erosion, such that the outer portion acts as a membrane and its inner portion serves as a drug reservoir, In fact, the drug reservoir is not affected by the surrounding environment for long periods of time (eg, from about one week to about several months). Combinations of different polymers with different release mechanisms may also optionally be used. Over the lifetime of the total release period, the concentration gradient can remain effectively constant, and therefore the diffusion rate is effectively constant (referred to as "zero-mode" diffusion). The term "constant" means that the diffusion rate remains above a lower threshold for therapeutic effectiveness, but that it is still optionally characterized by an initial burst and/or may fluctuate, such as increase and decrease to a certain extent. The diffusion rate can be maintained like this for long periods of time, and the diffusion rate can be considered constant to a certain level to optimize the duration of treatment, such as an effective silent period.
在一些情况下,可植入递送系统被设计成保护基于核苷酸的治疗剂免于降解,无论是化学性质还是由于受试者体内酶和其他因素的攻击而引起的降解。可选择装置的植入位点或靶位点,用于获得最大的治疗功效。 In some cases, implantable delivery systems are designed to protect nucleotide-based therapeutics from degradation, whether chemical in nature or due to attack by enzymes and other factors within the subject's body. The implantation site or target site of the device can be selected for maximum therapeutic efficacy.
在本申请第二十六种典型的实施方式中,提供了一种治疗有需要的受试者的疾病或病症的方法,所述方法包括向受试者施用如下任意一种或多种:In the twenty-sixth exemplary embodiment of the present application, a method of treating a disease or condition in a subject in need is provided, the method comprising administering to the subject any one or more of the following:
上述CRISPR-Cas效应子蛋白、或上述CRISPR-Cas效应子蛋白的衍生物或上述CRISPR-Cas效应子蛋白的功能片段,以及异源功能结构域,以及RNA指导物或编码所述RNA指导物的核酸;The above-mentioned CRISPR-Cas effector protein, or a derivative of the above-mentioned CRISPR-Cas effector protein or a functional fragment of the above-mentioned CRISPR-Cas effector protein, as well as a heterologous functional domain, and an RNA guide or an RNA guide encoding the RNA guide nucleic acid;
上述CRISPR-Cas效应子融合蛋白,以及RNA指导物或编码所述RNA指导物的核酸;The above-mentioned CRISPR-Cas effector fusion protein, and an RNA guide or a nucleic acid encoding the RNA guide;
上述DNA分子;The above-mentioned DNA molecules;
上述重组载体;The above recombinant vector;
上述基因编辑系统;The above gene editing system;
上述基因编辑载体;The above gene editing vector;
上述含有基因编辑系统的细胞;The above-mentioned cells containing gene editing systems;
上述真核细胞系;The above-mentioned eukaryotic cell lines;
上述多细胞生物体;The above-mentioned multicellular organisms;
上述试剂盒;The above-mentioned kit;
上述容器;或上述可植入装置。The above-mentioned container; or the above-mentioned implantable device.
在一个优选的实施例中,所述受试者包括患有病症或疾病的受试者,所述病症或疾病包括癌症、感染性疾病、代谢性疾病或遗传性疾病;In a preferred embodiment, the subject includes a subject suffering from a disorder or disease, including cancer, infectious disease, metabolic disease or genetic disease;
在一个优选的实施例中,所述癌症包括维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱癌中的一种或多种;In a preferred embodiment, the cancer includes Wilms tumor, Ewing sarcoma, neuroendocrine tumor, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, Prostate cancer, liver cancer, kidney cancer, pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, bone marrow One or more of neoplasms, acute lymphoblastic leukemia, acute myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
在一个优选的实施例中,所述代谢性疾病包括家族性高胆固醇血症(FH);In a preferred embodiment, the metabolic disease includes familial hypercholesterolemia (FH);
在一个优选的实施例中,所述遗传性疾病包括转甲状腺素蛋白淀粉样变(ATTR)、原发性高草尿酸症(PH1)、遗传性血管性水肿(HAE);In a preferred embodiment, the genetic diseases include transthyretin amyloidosis (ATTR), primary hyperoxaluric acidosis (PH1), and hereditary angioedema (HAE);
在一个优选的实施例中,所述感染性疾病的感染原包括人类免疫缺陷病毒、单纯疱疹病毒-1、乙型肝炎(HEPATITIS B)或单纯疱疹病毒-2中的一种或多种;In a preferred embodiment, the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B (HEPATITIS B) or herpes simplex virus-2;
在一个优选的实施例中,所述病症或疾病包括囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合 征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种。In a preferred embodiment, the disorder or disease includes cystic fibrosis, progressive Duchenne muscular dystrophy, Becker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy Dysplasia, Huntington's disease, Fragile X syndrome syndrome, Friedreich's ataxia, amyotrophic lateral sclerosis, frontotemporal dementia, hereditary chronic kidney disease, hyperlipidemia, hypercholesterolemia, Leber's congenital amaurosis, sickle cell disease or one or more of beta thalassemias.
在本申请第二十七种典型的实施方式中,提供了上述基因编辑系统在治疗病症或疾病中的用途。In the twenty-seventh typical embodiment of the present application, the use of the above gene editing system in treating disorders or diseases is provided.
在一种优选的实施例中,所述病症或疾病包括癌症、感染性疾病、代谢性疾病或遗传性疾病;In a preferred embodiment, the condition or disease includes cancer, infectious disease, metabolic disease or genetic disease;
所述癌症包括维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱癌中的一种或多种;The cancers include Wilms tumor, Ewing sarcoma, neuroendocrine tumors, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, kidney cancer, Pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphoblastic leukemia, acute One or more of myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
在一种优选的实施例中,所述感染性疾病的感染原包括人类免疫缺陷病毒、单纯疱疹病毒-1、乙型肝炎(HEPATITIS B)或单纯疱疹病毒-2中的一种或多种;In a preferred embodiment, the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B (HEPATITIS B) or herpes simplex virus-2;
在一种优选的实施例中,所述代谢性疾病包括家族性高胆固醇血症(FH);In a preferred embodiment, the metabolic disease includes familial hypercholesterolemia (FH);
在一种优选的实施例中,所述遗传性疾病包括转甲状腺素蛋白淀粉样变(ATTR)、原发性高草尿酸症(PH1)、遗传性血管性水肿(HAE);In a preferred embodiment, the genetic diseases include transthyretin amyloidosis (ATTR), primary hyperoxaluric acidosis (PH1), and hereditary angioedema (HAE);
在一种优选的实施例中,所述病症或疾病包括囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种。In a preferred embodiment, the disorder or disease includes cystic fibrosis, progressive Duchenne muscular dystrophy, Becker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic Malnutrition, Huntington's disease, fragile X syndrome, Friedreich's ataxia, amyotrophic lateral sclerosis, frontotemporal dementia, hereditary chronic kidney disease, hyperlipidemia, hypercholesterolemia, Leber's disease One or more of congenital amaurosis, sickle cell disease, or beta thalassemia.
下面将结合具体的实施例来进一步详细解释本申请的有益效果。The beneficial effects of the present application will be further explained in detail below in conjunction with specific embodiments.
实施例1Example 1
CasY1、CasY2、CasY3、CasY4和CasY5(即CasY1-CasY5)基因和RNA指导物的获取。Acquisition of CasY1, CasY2, CasY3, CasY4 and CasY5 (i.e. CasY1-CasY5) genes and RNA guides.
下载NCBI和JGI数据库的微生物基因组和宏基因组数据,总共得到了20TB的高质量数据。Downloading microbial genome and metagenomic data from NCBI and JGI databases, a total of 20TB of high-quality data was obtained.
用TBLASTN(https://blast.ncbi.nlm.nih.gov/)对宏基因组数据进行本地比对,获得了5种全新的CRISPR-Cas效应子蛋白,即CasY1(SEQ ID NO:1)和CasY2(SEQ ID NO:2)、CasY1(SEQ ID NO:3)、CasY4(SEQ ID NO:4)、CasY5(SEQ ID NO:5)。



TBLASTN (https://blast.ncbi.nlm.nih.gov/) was used to perform local alignment of metagenomic data, and five new CRISPR-Cas effector proteins were obtained, namely CasY1 (SEQ ID NO: 1) and CasY2 (SEQ ID NO: 2), CasY1 (SEQ ID NO: 3), CasY4 (SEQ ID NO: 4), CasY5 (SEQ ID NO: 5).



图1示出了CasY1-CasY5的蛋白结构域示意图,其中D、E、D代表RuvC结构域的三个保守基序I、II、III的催化残基,h表示桥螺旋结构。D-E-D表示Asp-Glu-Asp氨基酸,是RuvC结构域的保守氨基酸残基。Figure 1 shows a schematic diagram of the protein domain of CasY1-CasY5, in which D, E, and D represent the catalytic residues of the three conserved motifs I, II, and III of the RuvC domain, and h represents the bridge helix structure. D-E-D represents Asp-Glu-Asp amino acids, which are conserved amino acid residues of the RuvC domain.
CasY1的编码DNA序列如SEQ ID NO:6所示,CasY2的编码DNA序列如SEQ ID NO:7所示,CasY3的编码DNA序列如SEQ ID NO:8所示,CasY4的编码DNA序列如SEQ ID NO:9所示,CasY5的编码DNA序列如SEQ ID NO:10所示。The coding DNA sequence of CasY1 is shown in SEQ ID NO: 6, the coding DNA sequence of CasY2 is shown in SEQ ID NO: 7, the coding DNA sequence of CasY3 is shown in SEQ ID NO: 8, and the coding DNA sequence of CasY4 is shown in SEQ ID NO. NO:9 is shown, and the coding DNA sequence of CasY5 is shown as SEQ ID NO:10.
CasY1-CasY5所对应的直接重复序列分别如SEQ ID NO:24、27、30、33、36所示,茎环结构的序列分别SEQ ID NO:25、28、31、34、37,RNA指导物(前crRNA序列)分别如SEQ ID NO:26、29、32、35、38所示。The direct repeat sequences corresponding to CasY1-CasY5 are shown in SEQ ID NO: 24, 27, 30, 33, and 36 respectively. The sequences of the stem-loop structure are SEQ ID NO: 25, 28, 31, 34, and 37 respectively. RNA guide (Pre-crRNA sequences) are shown in SEQ ID NO: 26, 29, 32, 35, and 38 respectively.
对于CasY1、CasY2、CasY3、CasY4和CasY5的直接重复序列的二级结构进行分析,结果如图2所示。The secondary structure of the direct repeat sequences of CasY1, CasY2, CasY3, CasY4 and CasY5 was analyzed, and the results are shown in Figure 2.
实施例2Example 2
CasY1-CasY5基因的PAM序列的确定。Determination of PAM sequences of CasY1-CasY5 genes.
1.将CasY1蛋白的核酸序列(SEQ ID NO:6)和CasY1 crRNA-TTR序列(SEQ ID NO:11)克隆到表达载体pACYCDuet-1(SEQ ID NO:16)中,构建出重组质粒pACYCDuet1-CasY1-crRNA,所述重组质粒序列如SEQ ID NO:17所示。1. Clone the nucleic acid sequence of CasY1 protein (SEQ ID NO: 6) and CasY1 crRNA-TTR sequence (SEQ ID NO: 11) into the expression vector pACYCDuet-1 (SEQ ID NO: 16) to construct the recombinant plasmid pACYCDuet1- CasY1-crRNA, the recombinant plasmid sequence is shown in SEQ ID NO: 17.
同样地,将CasY2蛋白的核酸序列(SEQ ID NO:7)和CasY2 crRNA-TTR序列(SEQ ID NO:12)克隆到表达载体pACYCDuet-1中,构建出重组质粒pACYCDuet1-CasY2-crRNA,所述重组质粒序列如SEQ ID NO:18所示。Similarly, the nucleic acid sequence of the CasY2 protein (SEQ ID NO: 7) and the CasY2 crRNA-TTR sequence (SEQ ID NO: 12) were cloned into the expression vector pACYCDuet-1 to construct the recombinant plasmid pACYCDuet1-CasY2-crRNA, as described The recombinant plasmid sequence is shown in SEQ ID NO: 18.
将CasY3蛋白的核酸序列(SEQ ID NO:8)和CasY3 crRNA-TTR序列(SEQ ID NO:13)克隆到表达载体pACYCDuet-1中,构建出重组质粒pACYCDuet1-CasY3-crRNA,所述重组质粒序列如SEQ ID NO:19所示。The nucleic acid sequence of the CasY3 protein (SEQ ID NO: 8) and the CasY3 crRNA-TTR sequence (SEQ ID NO: 13) were cloned into the expression vector pACYCDuet-1 to construct the recombinant plasmid pACYCDuet1-CasY3-crRNA. The recombinant plasmid sequence As shown in SEQ ID NO: 19.
将CasY4蛋白的核酸序列(SEQ ID NO:9)和CasY4 crRNA-TTR序列(SEQ ID NO:14)克隆到表达载体pACYCDuet-1中,构建出重组质粒pACYCDuet1-CasY4-crRNA,所述重组质粒序列如SEQ ID NO:20所示。 The nucleic acid sequence of the CasY4 protein (SEQ ID NO: 9) and the CasY4 crRNA-TTR sequence (SEQ ID NO: 14) were cloned into the expression vector pACYCDuet-1 to construct the recombinant plasmid pACYCDuet1-CasY4-crRNA. The recombinant plasmid sequence As shown in SEQ ID NO:20.
将CasY5蛋白的核酸序列(SEQ ID NO:10)和CasY5 crRNA-TTR序列(SEQ ID NO:15)克隆到表达载体pACYCDuet-1中,构建出重组质粒pACYCDuet1-CasY5-crRNA,所述重组质粒序列如SEQ ID NO:21所示。The nucleic acid sequence of the CasY5 protein (SEQ ID NO: 10) and the CasY5 crRNA-TTR sequence (SEQ ID NO: 15) were cloned into the expression vector pACYCDuet-1 to construct the recombinant plasmid pACYCDuet1-CasY5-crRNA. The recombinant plasmid sequence As shown in SEQ ID NO: 21.
2.将合成的PAM文库序列克隆到pUC19载体上,重组质粒pUC19-PAM如SEQ ID NO:22所示。2. Clone the synthesized PAM library sequence into the pUC19 vector. The recombinant plasmid pUC19-PAM is shown in SEQ ID NO: 22.
本申请在SEQ ID NO:22中的随机序列包括6个随机碱基(n),即最终所构建的随机碱基序列种类达到46,共4096种排列组合。The random sequence in SEQ ID NO: 22 of this application includes 6 random bases (n), that is, the types of random base sequences finally constructed reach 4 6 , with a total of 4096 permutations and combinations.
3.将pACYCDuet1-CasY1-crRNA和pUC19-PAM文库质粒共转到DH5α感受态细胞中,对照组为pACYCDuet-1和pUC19-PAM文库质粒共转。3. Co-transfect pACYCDuet1-CasY1-crRNA and pUC19-PAM library plasmid into DH5α competent cells. The control group is co-transfection of pACYCDuet-1 and pUC19-PAM library plasmid.
同样地,也将pACYCDuet1-CasY2-crRNA和pUC19-PAM文库质粒共转到DH5α感受态细胞中;将pACYCDuet1-CasY3-crRNA和pUC19-PAM文库质粒共转到DH5α感受态细胞中;将pACYCDuet1-CasY4-crRNA和pUC19-PAM文库质粒共转到DH5α感受态细胞中;将pACYCDuet1-CasY5-crRNA和pUC19-PAM文库质粒共转到DH5α感受态细胞中。Similarly, pACYCDuet1-CasY2-crRNA and pUC19-PAM library plasmids were also co-transfected into DH5α competent cells; pACYCDuet1-CasY3-crRNA and pUC19-PAM library plasmids were co-transfected into DH5α competent cells; pACYCDuet1-CasY4 -crRNA and pUC19-PAM library plasmid were co-transfected into DH5α competent cells; pACYCDuet1-CasY5-crRNA and pUC19-PAM library plasmid were co-transfected into DH5α competent cells.
转化有2种质粒的DH5α细胞,37℃条件下处理1小时后,抽提质粒并对PAM区域序列进行PCR扩增和高通量测序。DH5α cells were transformed with two plasmids and treated at 37°C for 1 hour. The plasmids were extracted and the PAM region sequence was subjected to PCR amplification and high-throughput sequencing.
图3是体内筛选效应子和文库质粒设计的示意图,pACYC-Effector-crRNA质粒即为SEQ ID NO:17-21所示质粒(效应子质粒),并设计了靶向PAM文库的pUC19-PAM文库质粒。Figure 3 is a schematic diagram of in vivo screening of effectors and library plasmid design. The pACYC-Effector-crRNA plasmid is the plasmid (effector plasmid) shown in SEQ ID NO: 17-21, and a pUC19-PAM library targeting the PAM library was designed. plasmid.
对于负选择筛选实验流程如下:The experimental process for negative selection screening is as follows:
1)构建效应子质粒;1) Construct effector plasmid;
2)将效应子质粒和文库质粒转化到大肠杆菌中,然后生长,进行抗生素筛选;2) Transform the effector plasmid and library plasmid into E. coli, then grow and conduct antibiotic screening;
3)利用靶向测序鉴定耗减的文库质粒,并利用小RNA测序鉴定成熟的crRNA。3) Use targeted sequencing to identify depleted library plasmids, and use small RNA sequencing to identify mature crRNA.
图4是负选择筛选工作流的示意图。Figure 4 is a schematic diagram of the negative selection screening workflow.
4.分别统计实验组和对照组中4096种组合的PAM序列出现次数,并用PAM序列数目进行标准化,对于一条PAM序列,当log2(对照组标准化值/实验组标准化值)>3.5时,即认为该条PAM序列被显著消耗,通过显著被消耗的PAM序列预测得到PAM结构域。4. Count the occurrence times of 4096 combinations of PAM sequences in the experimental group and the control group respectively, and standardize them with the number of PAM sequences. For a PAM sequence, when log2 (standardized value of the control group/standardized value of the experimental group) > 3.5, it is considered This PAM sequence is significantly consumed, and the PAM domain is predicted from the significantly consumed PAM sequence.
通过实验结果,观察到CasY1-CasY5对带有5’-TTA、5’-TTT、5’-TTC PAM的靶标序列进行有效编辑,CasY1-CasY5蛋白的PAM序列为5’-TTN结构。通过统计发现,CRISPR/CasY1、CRISPR/CasY2、CRISPR/CasY3、CRISPR/CasY4、CRISPR/CasY5系统对于5’-TTA、5’-TTT、5’-TTC的编辑活性远高于5’-TTG。CasY1、CasY2、CasY3、CasY4、CasY5的PAM结构域分析结果如图5所示。图5中的“Bits”即为“比特”,“Position”为“位置”。Through experimental results, it was observed that CasY1-CasY5 effectively edits target sequences with 5’-TTA, 5’-TTT, and 5’-TTC PAMs. The PAM sequence of the CasY1-CasY5 protein is a 5’-TTN structure. Statistics show that the editing activity of CRISPR/CasY1, CRISPR/CasY2, CRISPR/CasY3, CRISPR/CasY4, and CRISPR/CasY5 systems for 5’-TTA, 5’-TTT, and 5’-TTC is much higher than that of 5’-TTG. The PAM domain analysis results of CasY1, CasY2, CasY3, CasY4, and CasY5 are shown in Figure 5. "Bits" in Figure 5 refers to "bits", and "Position" refers to "position".
实施例3 Example 3
细菌体内切割活性确定Determination of bacterial cleavage activity in vivo
1.构建ccdb毒性质粒,ccdb毒性质粒如SEQ ID:23所示。将ccdb毒性质粒(SEQ ID NO:23)转入Top10感受态细胞中,通过梯度实验,发现当平板中加入64mM L-阿拉伯糖(L-ara)时,细菌致死,说明该剂量是致死剂量。1. Construct the ccdb virus plasmid. The ccdb virus plasmid is shown in SEQ ID: 23. The ccdb toxic plasmid (SEQ ID NO: 23) was transferred into Top10 competent cells. Through gradient experiments, it was found that when 64mM L-arabinose (L-ara) was added to the plate, the bacteria were lethal, indicating that this dose was a lethal dose.
2.将pACYCDuet1-CasY1-crRNA重组质粒(SEQ ID NO:17)和ccdb毒性质粒共转到Top10感受态细胞中,再加入64mM L-阿拉伯糖(L-ara)时,部分细菌存活,说明在存活部分的细菌中CasY1蛋白对ccdb毒性质粒进行了切割。2. Co-transfer pACYCDuet1-CasY1-crRNA recombinant plasmid (SEQ ID NO: 17) and ccdb toxic plasmid into Top10 competent cells, and then add 64mM L-arabinose (L-ara), some bacteria survive, indicating that in The CasY1 protein in the surviving bacteria cut the ccdb virulence plasmid.
同样地,将pACYCDuet1-CasY2-crRNA重组质粒(SEQ ID NO:18)、pACYCDuet1-CasY3-crRNA重组质粒(SEQ ID NO:19)、pACYCDuet1-CasY4-crRNA重组质粒(SEQ ID NO:20)、pACYCDuet1-CasY5-crRNA重组质粒(SEQ ID NO:21)分别和ccdb毒性质粒共转到Top10感受态细胞中,再加入64mM L-阿拉伯糖(L-ara)时,部分细菌存活,说明在存活部分的细菌中,CasY2-CasY5蛋白分别对ccdb毒性质粒进行了切割。Similarly, pACYCDuet1-CasY2-crRNA recombinant plasmid (SEQ ID NO: 18), pACYCDuet1-CasY3-crRNA recombinant plasmid (SEQ ID NO: 19), pACYCDuet1-CasY4-crRNA recombinant plasmid (SEQ ID NO: 20), pACYCDuet1 -CasY5-crRNA recombinant plasmid (SEQ ID NO: 21) and ccdb toxic plasmid were co-transfected into Top10 competent cells, and when 64mM L-arabinose (L-ara) was added, some bacteria survived, indicating that in the surviving part In bacteria, CasY2-CasY5 proteins cut the ccdb virulence plasmid respectively.
CasY1-CasY5蛋白在细菌体内的切割示意图如图6所示,切割活性鉴定结果如图7所示。The schematic diagram of CasY1-CasY5 protein cleavage in bacteria is shown in Figure 6, and the cleavage activity identification results are shown in Figure 7.
实施例4Example 4
CasY1-CasY5蛋白的体外切割活性In vitro cleavage activity of CasY1-CasY5 proteins
1.CasY1-CasY5蛋白的表达纯化1. Expression and purification of CasY1-CasY5 proteins
将分别表达CasY1-CasY5的Rosetta(DE3)pLyseS(EMD Millipore)细胞分别接种于10ml LB培养基中37℃过夜培养。待细菌的OD600达到0.2时,将培养温度降至21℃继续培养,直到OD600达到0.6,然后加入终浓度是500μM的IPTG以诱导Cas蛋白表达。诱导培养14-18小时后收集细胞,将细胞重悬于200ml裂解缓冲液(50mM HEPES[pH7],2M NaCl,5mM MgCl,20mM imidazole),其中含有蛋白酶抑制剂(Roche complete,EDTA-free,COEDTAF-RO)和溶菌酶(Sigma,10837059001),进行匀浆。用超声处理(Branson Sonifier 450)裂解细胞。以10000×g离心1小时以清除裂解物。裂解物通过0.22μm过滤器(Millipore,Stericup)过滤,之后转移到镍柱(HisTrap FF,5ml)上,用咪唑梯度洗脱。将含有预期大小的蛋白质合并在一起,加入TEV蛋白酶(Sigma,T4455-10KU),并将样品在TEV缓冲液中透析过夜(500mM NaCl,50mM HEPES[pH为7],5mM MgCl,2mM DTT)。透析后将样品浓缩至500μl,在-80℃下冷冻储存。Rosetta (DE3) pLyseS (EMD Millipore) cells expressing CasY1-CasY5 respectively were inoculated into 10 ml LB medium and cultured at 37°C overnight. When the OD 600 of the bacteria reaches 0.2, lower the culture temperature to 21°C and continue culturing until the OD 600 reaches 0.6. Then add IPTG at a final concentration of 500 μM to induce Cas protein expression. Collect the cells after 14-18 hours of induction culture, and resuspend the cells in 200ml lysis buffer (50mM HEPES [pH7], 2M NaCl, 5mM MgCl, 20mM imidazole), which contains protease inhibitors (Roche complete, EDTA-free, COEDTAF -RO) and lysozyme (Sigma, 10837059001) for homogenization. Cells were lysed using sonication (Branson Sonifier 450). Centrifuge at 10,000 × g for 1 hour to clear lysate. The lysate was filtered through a 0.22 μm filter (Millipore, Stericup) before being transferred to a nickel column (HisTrap FF, 5 ml) and eluted with an imidazole gradient. Proteins containing the expected sizes were pooled together, TEV protease (Sigma, T4455-10KU) was added, and the samples were dialyzed overnight in TEV buffer (500mM NaCl, 50mM HEPES [pH 7], 5mM MgCl, 2mM DTT). After dialysis, the sample was concentrated to 500 μl and stored frozen at -80°C.
2.体外切割测定2. In vitro cleavage assay
将纯化的蛋白质(25nM)在切割缓冲液(NEBuffer 3,5mM DTT)中37℃反应20分钟。分别使用500ng合成的TTR-2 crRNA(pre-crRNA,SEQ ID NO:40、43、46、49、52)(南京金斯瑞生物科技有限公司合成)和200ng的靶DNA(dsDNA,如SEQ ID NO:57所示)进行裂解反应,蛋白质浓度分别为0nM,50nM,100nM,200nM,500nM和1μM。反应产 物用纯化回收试剂盒回收(QIAGEN)。在TBE-尿素6%聚丙烯酰胺凝胶(Invitrogen)上检测切割效率。
The purified protein (25nM) was reacted in cleavage buffer (NEBuffer 3, 5mM DTT) at 37°C for 20 minutes. Use 500ng of synthesized TTR-2 crRNA (pre-crRNA, SEQ ID NO: 40, 43, 46, 49, 52) (synthesized by Nanjing Genscript Biotechnology Co., Ltd.) and 200ng of target DNA (dsDNA, such as SEQ ID NO: 57) for cleavage reaction, the protein concentrations were 0nM, 50nM, 100nM, 200nM, 500nM and 1μM respectively. reaction product The materials were recovered using a purification and recovery kit (QIAGEN). Cleavage efficiency was tested on TBE-urea 6% polyacrylamide gel (Invitrogen).
37℃条件下CasY1-CasY5蛋白具有明显切割活性。图8示出了体外切割活性结果,其中CasY1、CasY2、CasY3、CasY4、CasY5对靶dsDNA的体外切割活性分别如图8中的图a、图b、图c、图d、图e所示。CasY1-CasY5 proteins have obvious cleavage activity under 37°C. Figure 8 shows the results of in vitro cleavage activity, in which the in vitro cleavage activities of CasY1, CasY2, CasY3, CasY4, and CasY5 on target dsDNA are shown in Figure 8, Figure a, Figure b, Figure c, Figure d, and Figure e, respectively.
实施例5Example 5
293T细胞切割活性293T cell cleavage activity
1.将HEK293T细胞(购自ATCC)接种于添加了10%FBS(v/v)的DMEM培养基中(Gibco,11965092),其中含1%Penicillin Streptomycin(v/v)(Gibco,15140122),在含有5%CO2的37℃细胞培养箱中进行培养。利用LONZA转染试剂(Lonza,Cat #V4XP-3032)按照说明书转染,细胞通过计数得2×106个。1. Inoculate HEK293T cells (purchased from ATCC) into DMEM medium (Gibco, 11965092) supplemented with 10% FBS (v/v), containing 1% Penicillin Streptomycin (v/v) (Gibco, 15140122), Culture was performed in a 37 °C cell culture incubator containing 5% CO2 . LONZA transfection reagent (Lonza, Cat #V4XP-3032) was used for transfection according to the instructions, and the cells were counted to 2×10 6 .
将CasY1蛋白:crRNA(金斯瑞生物科技有限公司化学合成)按照3μg∶1.5μg的质量混合。同样地,将CasY2-CasY5蛋白和Lbcpf1蛋白(QRU95066.1,利用现有技术表达纯化后获得)均按照上述方式分别与多种crRNA(金斯瑞生物科技有限公司化学合成)按照上述比例混合,如表1所示。CasY1 protein: crRNA (chemically synthesized by Genscript Biotechnology Co., Ltd.) was mixed at a mass of 3 μg: 1.5 μg. Similarly, CasY2-CasY5 protein and Lbcpf1 protein (QRU95066.1, obtained after expression and purification using existing technology) were mixed with a variety of crRNA (chemically synthesized by Genscript Biotechnology Co., Ltd.) in the above proportions according to the above method, As shown in Table 1.
表1

Table 1

电转48h后分别收集细胞检测切割效率。After 48 hours of electroporation, cells were collected to detect cutting efficiency.
细胞进行基因组抽提(TIANGEN,DP304-03)。以基因组为模板,对靶点附近序列进行PCR扩增,扩增的PCR产物用于高通量深度测序(金唯智生物科技有限公司)。用于目标位点序列扩增的体系如下:2×Taq Master Mix(Vazyme,P112-03)25μL;Primer-F(10pmol/μL)1μL;Primer-R(10pmol/μL)1μL;模板1μL;ddH2O补齐到50μL。Cells were subjected to genome extraction (TIANGEN, DP304-03). Using the genome as a template, PCR amplification was performed on sequences near the target site, and the amplified PCR products were used for high-throughput deep sequencing (Jinweizhi Biotechnology Co., Ltd.). The system used for target site sequence amplification is as follows: 2×Taq Master Mix (Vazyme, P112-03) 25 μL; Primer-F (10 pmol/μL) 1 μL; Primer-R (10 pmol/μL) 1 μL; template 1 μL; ddH 2 O. Make up to 50 μL.
2.在293T细胞中CasY1、CasY2、CasY3、CasY4、CasY5蛋白和LbCpf1的切割活性,参见图9所示,indel表示插入缺失率。显示了在瞬时转染效应子蛋白和RNA指导物48小时后,由靶向293T细胞系中TTR基因座和PCSK9的CasY1-CasY5和LbCpf1CRISPR效应子诱导的插入缺失活性。对不同的RNA指导物设计进行了测定,并显示出不同程度的功效。误差线表示3个重复情况下的S.E.M.。2. The cleavage activities of CasY1, CasY2, CasY3, CasY4, CasY5 proteins and LbCpf1 in 293T cells are shown in Figure 9, indel represents the insertion and deletion rate. Shown is the indel activity induced by CasY1-CasY5 and LbCpf1 CRISPR effectors targeting the TTR locus and PCSK9 in the 293T cell line 48 hours after transient transfection of effector proteins and RNA guides. Different RNA guide designs were tested and showed varying degrees of efficacy. Error bars represent S.E.M. of 3 replicates.
说明在293T细胞中CasY1-CasY5蛋白可以实现不同位点的高效切割,并且切割效率优于LbCpf1。This shows that CasY1-CasY5 proteins can achieve efficient cleavage at different sites in 293T cells, and the cleavage efficiency is better than LbCpf1.
从以上的描述中,可以看出,本发明上述的实施例实现了如下技术效果:上述CasY1、CasY2、CasY3、CasY4、CasY5蛋白,能够用于以靶向方式操纵核酸的系统、方法和组合物。由上述蛋白和RNA引导物组成的用于靶向修饰核酸(例如DNA)的非天然存在的工程改造的CRISPR-Cas系统,每个系统包括一起靶向核酸的一种或多种蛋白质组分和一种或多种核酸组分,能够在细胞内和外均发挥CRISPR-Cas系统的功能与活性。From the above description, it can be seen that the above-mentioned embodiments of the present invention achieve the following technical effects: the above-mentioned CasY1, CasY2, CasY3, CasY4, and CasY5 proteins can be used in systems, methods, and compositions for manipulating nucleic acids in a targeted manner. . Non-naturally occurring engineered CRISPR-Cas systems for targeted modification of nucleic acids (e.g., DNA) consisting of the above-described proteins and RNA guides, each system including one or more protein components that together target the nucleic acid and One or more nucleic acid components can exert the function and activity of the CRISPR-Cas system both inside and outside the cell.
以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。 The above descriptions are only preferred embodiments of the present invention and are not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and principles of the present invention shall be included in the protection scope of the present invention.

Claims (58)

  1. 一种CRISPR-Cas效应子蛋白,其特征在于,所述CRISPR-Cas效应子蛋白包括与SEQ ID NO:2、1、3-5中任一项所述的氨基酸序列具有至少70%同一性的蛋白。A CRISPR-Cas effector protein, characterized in that the CRISPR-Cas effector protein includes at least 70% identity with the amino acid sequence described in any one of SEQ ID NO: 2, 1, 3-5 protein.
  2. 根据权利要求1所述的CRISPR-Cas效应子蛋白,其特征在于,所述CRISPR-Cas效应子蛋白包括与SEQ ID NO:2、1、3-5中任一项所述的氨基酸序列具有80%以上,优选90%以上,更优选95%以上,进一步优选99%以上同一性的蛋白;The CRISPR-Cas effector protein according to claim 1, characterized in that the CRISPR-Cas effector protein includes an amino acid sequence having 80 % or more, preferably 90% or more, more preferably 95% or more, even more preferably 99% or more identical proteins;
    优选地,所述CRISPR-Cas效应子蛋白包括RuvC结构域。Preferably, the CRISPR-Cas effector protein includes a RuvC domain.
  3. 根据权利要求2所述的CRISPR-Cas效应子蛋白,其特征在于,所述CRISPR-Cas效应子蛋白包括:The CRISPR-Cas effector protein according to claim 2, wherein the CRISPR-Cas effector protein includes:
    a)SEQ ID NO:1至5中任一项所示的蛋白;或a) The protein shown in any one of SEQ ID NO: 1 to 5; or
    b)以SEQ ID NO:1所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:b) Based on the amino acid sequence shown in SEQ ID NO: 1, a protein with one or more point mutations as follows:
    N21X,N23X,R25X,K26X,Q482X,S484X,R486X,S489X,R493X,H511X,C513X,H515X,N516X,R518X,R540X,K558X,Y560X,K562X,K565X,T600X,T672X,D676X,Q680X,Y683X,L686X,D693X,Y731X,G767X,R772X,K832X,K833X,Q836X,M896X;或N21X, N23X, R25X, K26X, Q482X, S484X, R486X, S489X, R493X, H511X, C513X, H515X, N516X, R518X, R540X, K558X, Y560X, K562X, K565X, T600X, T672X, D6 76X, Q680X, Y683X, L686X, D693X, Y731X, G767X, R772X, K832X, K833X, Q836X, M896X; or
    c)以SEQ ID NO:2所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:c) Based on the amino acid sequence shown in SEQ ID NO: 2, a protein with one or more point mutations as follows:
    R19X,R28X,R32X,K512X,N527X,W531X,R553X,K581X,K589X,I590X,R605X,K611X,R612X,R615X,Y777X,E877X,R931X;或or
    d)以SEQ ID NO:3所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:d) Based on the amino acid sequence shown in SEQ ID NO: 3, a protein with one or more point mutations as follows:
    K8X,F15X,N17X,K20X,K471X,W483X,H502X,R505X,K557X,K556X,R560X,Y673X,L676X,Y723X,N822X,K823X,E826X,K827X,K830X,K880X,L887X;或OR
    e)以SEQ ID NO:4所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:e) Based on the amino acid sequence shown in SEQ ID NO: 4, a protein with one or more point mutations as follows:
    K317X,W330X,Y351X,K354X,D392X,F395X,N399X,Y509X,V512X,Y568X,N662X,K663X,E666X,R667X,K670X,K719X,L726X;或OR
    f)以SEQ ID NO:5所示的氨基酸序列为基础,进行如下一种或多种点突变的蛋白:f) Based on the amino acid sequence shown in SEQ ID NO: 5, a protein with one or more point mutations as follows:
    M9X,V16X,D18X,K21X,K518X,W531X,F550X,K553X,R609X,Y612X,R616X,Y730X,L733X,Y781X,N879X,K880X,E883X,K884X,K887X,K936X,F943X;M9X, V16X, D18X, K21X, K518X, W531X, F550X, K553X, R609X, Y612X, R616X, Y730X, L733X, Y781X, N879X, K880X, E883X, K884X, K887X, K936X, F943X;
    其中X为任意氨基酸。 where X is any amino acid.
  4. 一种CRISPR-Cas效应子融合蛋白,其特征在于,所述CRISPR-Cas效应子融合蛋白包括权利要求1至3中任一项所述的CRISPR-Cas效应子蛋白、或所述CRISPR-Cas效应子蛋白的衍生物或所述CRISPR-Cas效应子蛋白的功能片段,以及异源功能结构域。A CRISPR-Cas effector fusion protein, characterized in that the CRISPR-Cas effector fusion protein includes the CRISPR-Cas effector protein according to any one of claims 1 to 3, or the CRISPR-Cas effector Derivatives of the daughter protein or functional fragments of the CRISPR-Cas effector protein, as well as heterologous functional domains.
  5. 根据权利要求4所述的CRISPR-Cas效应子融合蛋白,其特征在于,所述异源功能结构域位于所述CRISPR-Cas效应子融合蛋白的N端、C端或内部;The CRISPR-Cas effector fusion protein according to claim 4, wherein the heterologous functional domain is located at the N-terminal, C-terminal or inside the CRISPR-Cas effector fusion protein;
    优选地,所述异源功能结构域包括定位信号、报告蛋白、CRISPR-Cas效应子蛋白靶向部分、DNA结合域、表位标签、转录激活域、转录抑制域、核酸酶、脱氨结构域、甲基化酶、脱甲基酶、转录释放因子、HDAC、裂解活性多肽、连接酶中的一种或多种;Preferably, the heterologous functional domain includes a positioning signal, a reporter protein, a CRISPR-Cas effector protein targeting portion, a DNA binding domain, an epitope tag, a transcription activation domain, a transcription inhibition domain, a nuclease, and a deamination domain. , one or more of methylase, demethylase, transcription release factor, HDAC, cleavage active polypeptide, and ligase;
    优选地,所述定位信号包括核定位信号和/或核输出信号;Preferably, the localization signal includes a nuclear localization signal and/or a nuclear export signal;
    优选地,所述核输出信号包括人类蛋白酪氨酸激酶2;Preferably, the nuclear export signal includes human protein tyrosine kinase 2;
    优选地,所述报告蛋白包括谷胱甘肽-S-转移酶、辣根过氧化物酶、氯霉素乙酰转移酶、β-半乳糖苷酶、β-葡糖醛酸糖苷酶或自发荧光蛋白中的一种或多种;Preferably, the reporter protein includes glutathione-S-transferase, horseradish peroxidase, chloramphenicol acetyltransferase, β-galactosidase, β-glucuronidase or autofluorescence one or more proteins;
    优选地,所述自发荧光蛋白包括绿色荧光蛋白、HcRed、DsRed、青荧光蛋白、黄色荧光蛋白或蓝色荧光蛋白中的一种或多种;Preferably, the autofluorescent protein includes one or more of green fluorescent protein, HcRed, DsRed, cyan fluorescent protein, yellow fluorescent protein or blue fluorescent protein;
    优选地,所述DNA结合域包括甲基化结合蛋白、Lex A DBD或Gal4 DBD中的一种或多种;Preferably, the DNA binding domain includes one or more of methylation binding protein, Lex A DBD or Gal4 DBD;
    优选地,所述表位标签包括组氨酸标签、V5标签、FLAG标签、流感病毒血凝素标签、Myc标签、VSV-G标签或硫氧还蛋白标签中的一种或多种;Preferably, the epitope tag includes one or more of histidine tag, V5 tag, FLAG tag, influenza virus hemagglutinin tag, Myc tag, VSV-G tag or thioredoxin tag;
    优选地,所述转录激活域包括VP64和/或VPR;Preferably, the transcription activation domain includes VP64 and/or VPR;
    优选地,所述转录抑制域包括KRAB和/或SID;Preferably, the transcription repression domain includes KRAB and/or SID;
    优选地,所述核酸酶包括FokI;Preferably, the nuclease includes FokI;
    优选地,所述脱氨结构域包括ADAR1、ADAR2、APOBEC、AID或TAD中的一种或多种;Preferably, the deamination domain includes one or more of ADAR1, ADAR2, APOBEC, AID or TAD;
    优选地,所述裂解活性多肽包括具有单链RNA裂解活性的多肽、具有双链RNA裂解活性的多肽、具有单链DNA裂解活性的多肽或具有双链DNA裂解活性的多肽;Preferably, the cleavage active polypeptide includes a polypeptide with single-stranded RNA cleavage activity, a polypeptide with double-stranded RNA cleavage activity, a polypeptide with single-stranded DNA cleavage activity or a polypeptide with double-stranded DNA cleavage activity;
    优选地,所述连接酶包括DNA连接酶和/或RNA连接酶。Preferably, the ligase includes DNA ligase and/or RNA ligase.
  6. 一种DNA分子,其特征在于,所述DNA分子编码权利要求1至3中任一项所述的CRISPR-Cas效应子蛋白、或权利要求4或5所述的CRISPR-Cas效应子融合蛋白。A DNA molecule, characterized in that the DNA molecule encodes the CRISPR-Cas effector protein described in any one of claims 1 to 3, or the CRISPR-Cas effector fusion protein described in claim 4 or 5.
  7. 根据权利要求6所述的DNA分子,其特征在于,所述DNA分子为根据宿主细胞的密码子偏好性进行密码子优化的DNA分子; The DNA molecule according to claim 6, characterized in that the DNA molecule is a DNA molecule that is codon-optimized according to the codon preference of the host cell;
    优选地,宿主细胞包括原核细胞或真核细胞;Preferably, the host cell includes a prokaryotic cell or a eukaryotic cell;
    优选地,所述DNA分子包括与SEQ ID NO:6至10中任一项所述的核苷酸序列具有70%以上,优选90%以上,更优选95%以上,进一步优选99%,更进一步优选为100%同一性的核苷酸。Preferably, the DNA molecule includes more than 70% of the nucleotide sequence described in any one of SEQ ID NO: 6 to 10, preferably more than 90%, more preferably more than 95%, further preferably 99%, and further Nucleotides that are 100% identical are preferred.
  8. 一种重组载体,其特征在于,所述重组载体包含权利要求6或7所述的DNA分子。A recombinant vector, characterized in that the recombinant vector contains the DNA molecule described in claim 6 or 7.
  9. 根据权利要求8所述的重组载体,其特征在于,所述DNA分子与启动子连接;The recombinant vector according to claim 8, wherein the DNA molecule is connected to a promoter;
    优选地,所述启动子包括诱导型启动子、组成型启动子或组织特异性启动子中的一种或多种;Preferably, the promoter includes one or more of an inducible promoter, a constitutive promoter or a tissue-specific promoter;
    优选地,所述启动子包括T7、SP6、T3、CMV、EF1a、SV40、PGK1、humanβ-actin、CAG、U6、H1、T7、T7lac、araBAD、trp、lac或Ptac中的一种或多种;Preferably, the promoter includes one or more of T7, SP6, T3, CMV, EF1a, SV40, PGK1, human β-actin, CAG, U6, H1, T7, T7lac, araBAD, trp, lac or Ptac ;
    优选地,所述重组载体包括逆转录病毒载体、慢病毒载体、腺病毒载体、腺相关病毒载体、单纯疱疹载体或噬菌粒载体;Preferably, the recombinant vector includes a retroviral vector, a lentiviral vector, an adenoviral vector, an adeno-associated virus vector, a herpes simplex vector or a phagemid vector;
    优选地,所述重组载体包括质粒载体。Preferably, the recombinant vector includes a plasmid vector.
  10. 一种宿主细胞,其特征在于,所述宿主细胞转化有权利要求8或9所述的重组载体。A host cell, characterized in that the host cell is transformed with the recombinant vector according to claim 8 or 9.
  11. 一种基因编辑系统,其特征在于,所述基因编辑系统包括:A gene editing system, characterized in that the gene editing system includes:
    a)RNA指导物或编码所述RNA指导物的核酸,所述RNA指导物包括直接重复序列和间隔子序列,所述间隔子序列用于与靶核酸杂交;a) an RNA guide or a nucleic acid encoding the RNA guide, which includes a direct repeat sequence and a spacer sequence for hybridizing to a target nucleic acid;
    b)权利要求1至3中任一项所述的CRISPR-Cas效应子蛋白,或权利要求4或5的CRISPR-Cas效应子融合蛋白,或权利要求6或7所述的DNA分子,或权利要求8或9所述的重组载体,或权利要求10所述的宿主细胞;b) The CRISPR-Cas effector protein of any one of claims 1 to 3, or the CRISPR-Cas effector fusion protein of claim 4 or 5, or the DNA molecule of claim 6 or 7, or the The recombinant vector described in claim 8 or 9, or the host cell described in claim 10;
    所述DNA分子、所述重组载体或所述宿主细胞能够表达所述CRISPR-Cas效应子蛋白或所述CRISPR-Cas效应子融合蛋白,The DNA molecule, the recombinant vector or the host cell is capable of expressing the CRISPR-Cas effector protein or the CRISPR-Cas effector fusion protein,
    在所述基因编辑系统中,所述CRISPR-Cas效应子蛋白或所述CRISPR-Cas效应子融合蛋白,与所述RNA指导物结合后,靶向作用于所述间隔子序列与所述靶核酸杂交形成的杂交序列。In the gene editing system, the CRISPR-Cas effector protein or the CRISPR-Cas effector fusion protein, after being combined with the RNA guide, targets the spacer sequence and the target nucleic acid. The hybridization sequence formed by hybridization.
  12. 根据权利要求11所述的基因编辑系统,其特征在于,所述基因编辑系统不包含tracrRNA。The gene editing system according to claim 11, characterized in that the gene editing system does not contain tracrRNA.
  13. 根据权利要求11所述的基因编辑系统,其特征在于,所述RNA指导物包括1种或多种。The gene editing system according to claim 11, wherein the RNA guide includes one or more.
  14. 根据权利要求11所述的基因编辑系统,其特征在于,所述靶核酸包括DNA;The gene editing system according to claim 11, wherein the target nucleic acid includes DNA;
    优选地,所述DNA包括来源于真核生物的DNA或来源于原核生物的DNA;Preferably, the DNA includes DNA derived from eukaryotic organisms or DNA derived from prokaryotes;
    优选地,所述真核生物包括动物或植物; Preferably, the eukaryotes include animals or plants;
    优选地,所述DNA包括非人类哺乳动物DNA、人类DNA、昆虫DNA、鸟类DNA、爬行动物DNA、两栖动物DNA、啮齿动物DNA、鱼类DNA、蠕虫DNA、线虫DNA或酵母DNA;Preferably, the DNA includes non-human mammalian DNA, human DNA, insect DNA, avian DNA, reptile DNA, amphibian DNA, rodent DNA, fish DNA, worm DNA, nematode DNA or yeast DNA;
    优选地,所述非人类哺乳动物DNA包括非人类灵长类DNA。Preferably, the non-human mammalian DNA includes non-human primate DNA.
  15. 根据权利要求11所述的基因编辑系统,其特征在于,所述直接重复序列的3’端包含茎环结构,所述茎环结构包括依次连接的第一茎核苷酸链、环核苷酸链和第二茎核苷酸链,所述第一茎核苷酸链和所述第二茎核苷酸链彼此杂交形成所述茎环结构的茎,所述环核苷酸链形成所述茎环结构的环;The gene editing system according to claim 11, wherein the 3' end of the direct repeat sequence includes a stem-loop structure, and the stem-loop structure includes a first stem nucleotide chain and a cyclic nucleotide connected in sequence. chain and a second stem nucleotide chain, the first stem nucleotide chain and the second stem nucleotide chain hybridize to each other to form the stem of the stem-loop structure, and the cyclic nucleotide chain forms the Rings of stem-loop structure;
    优选地,所述第一茎核苷酸链的长度为5或6个核苷酸;Preferably, the length of the first stem nucleotide chain is 5 or 6 nucleotides;
    优选地,所述第二茎核苷酸链的长度为5个核苷酸;Preferably, the length of the second stem nucleotide chain is 5 nucleotides;
    优选地,所述环核苷酸链的长度为6、7或8个核苷酸。Preferably, the cyclic nucleotide chain is 6, 7 or 8 nucleotides in length.
  16. 根据权利要求15所述的基因编辑系统,其特征在于,所述茎环结构包括SEQ ID NO:25、28、31、34或37所述的核苷酸序列。The gene editing system according to claim 15, wherein the stem-loop structure includes the nucleotide sequence described in SEQ ID NO: 25, 28, 31, 34 or 37.
  17. 根据权利要求11所述的基因编辑系统,其特征在于,所述直接重复序列包括与SEQ ID NO:24、27、30、33或36所述的核苷酸序列具有至少80%同一性的核苷酸序列;The gene editing system according to claim 11, wherein the direct repeat sequence includes a core with at least 80% identity to the nucleotide sequence described in SEQ ID NO: 24, 27, 30, 33 or 36. nucleotide sequence;
    优选地,所述直接重复序列包括与SEQ ID NO:24、27、30、33或36所述的核苷酸序列具有至少85%以上,更优选90%以上,进一步优选95%以上同一性的核苷酸序列;Preferably, the direct repeat sequence includes at least 85% or more, more preferably more than 90%, and further preferably more than 95% identity with the nucleotide sequence described in SEQ ID NO: 24, 27, 30, 33 or 36. Nucleotide sequence;
    优选地,所述直接重复序列包括SEQ ID NO:24、27、30、33或36所述的核苷酸序列。Preferably, the direct repeat sequence includes the nucleotide sequence described in SEQ ID NO: 24, 27, 30, 33 or 36.
  18. 根据权利要求11所述的基因编辑系统,其特征在于,所述间隔子序列的80%以上与所述靶核酸互补;The gene editing system according to claim 11, wherein more than 80% of the spacer sequence is complementary to the target nucleic acid;
    优选地,所述间隔子序列的90%以上,更优选95%以上,进一步优选99%以上,更进一步优选100%与所述靶核酸互补;Preferably, more than 90% of the spacer sequence, more preferably more than 95%, further preferably more than 99%, even more preferably 100% is complementary to the target nucleic acid;
    优选地,所述间隔子序列的长度为18-41nt;Preferably, the length of the spacer sequence is 18-41 nt;
    优选地,所述间隔子序列的长度为18-37nt;Preferably, the length of the spacer sequence is 18-37nt;
    优选地,所述间隔子序列长度为18-26或34-36nt;Preferably, the length of the spacer sequence is 18-26 or 34-36nt;
    优选地,所述间隔子序列长度为20nt。Preferably, the length of the spacer sequence is 20nt.
  19. 根据权利要求11所述的基因编辑系统,其特征在于,所述直接重复序列包括第一直接重复序列和第二直接重复序列;The gene editing system according to claim 11, wherein the direct repeat sequence includes a first direct repeat sequence and a second direct repeat sequence;
    优选地,所述RNA指导物包括按顺序依次连接的所述第一直接重复序列、所述间隔 子序列以及所述第二直接重复序列;Preferably, the RNA guide includes the first direct repeat sequence, the spacer sequence connected in sequence. subsequence and said second direct repeat sequence;
    优选地,所述第一直接重复序列与所述第二直接重复序列相同。Preferably, said first direct repeat sequence and said second direct repeat sequence are identical.
  20. 根据权利要求11所述的基因编辑系统,其特征在于,所述靶核酸包含前间隔子相邻基序,所述CRISPR-Cas效应子蛋白或所述CRISPR-Cas效应子融合蛋白能够识别所述前间隔子相邻基序,所述前间隔子相邻基序包含核酸序列5’-TTN-3’,其中N是任何核苷酸;The gene editing system according to claim 11, wherein the target nucleic acid includes a pre-spacer adjacent motif, and the CRISPR-Cas effector protein or the CRISPR-Cas effector fusion protein can recognize the a pre-spacer adjacent motif, the pre-spacer adjacent motif comprising the nucleic acid sequence 5'-TTN-3', wherein N is any nucleotide;
    优选地,所述N为A、C或T。Preferably, the N is A, C or T.
  21. 根据权利要求11所述的基因编辑系统,其特征在于,CRISPR-Cas效应子蛋白或所述CRISPR-Cas效应子融合蛋白,与所述RNA指导物结合,形成蛋白-核酸复合物;The gene editing system of claim 11, wherein the CRISPR-Cas effector protein or the CRISPR-Cas effector fusion protein combines with the RNA guide to form a protein-nucleic acid complex;
    优选地,所述蛋白-核酸复合物是非天然存在的或经修饰的;Preferably, the protein-nucleic acid complex is non-naturally occurring or modified;
    优选地,所述蛋白-核酸复合物中的至少一个组分是非天然存在的或经修饰的。Preferably, at least one component of the protein-nucleic acid complex is non-naturally occurring or modified.
  22. 根据权利要求11所述的基因编辑系统,其特征在于,通过所述CRISPR-Cas效应子蛋白或所述CRISPR-Cas效应子融合蛋白和所述RNA指导物对所述靶核酸的所述靶向作用,对所述靶核酸进行修饰;The gene editing system according to claim 11, characterized in that the targeting of the target nucleic acid by the CRISPR-Cas effector protein or the CRISPR-Cas effector fusion protein and the RNA guide Function to modify the target nucleic acid;
    优选地,所述修饰包括切割或切口;Preferably, said modification includes cutting or incision;
    优选地,所述修饰导致:Preferably, said modification results in:
    (1)细胞包含至少一种基因产物表达的改变;或(1) The cell contains an alteration in the expression of at least one gene product; or
    (2)所述细胞包含至少一种基因产物的表达的改变,其中所述至少一种基因产物的表达增加;或(2) the cell comprises an alteration in the expression of at least one gene product, wherein the expression of the at least one gene product is increased; or
    (3)所述细胞包含至少一种基因产物的表达的改变,其中所述至少一种基因产物的表达减少;或(3) the cell comprises an alteration in the expression of at least one gene product, wherein the expression of the at least one gene product is reduced; or
    (4)所述细胞包含经编辑的基因组;(4) The cell contains an edited genome;
    优选地,所述修饰导致细胞毒性;Preferably, said modification results in cytotoxicity;
    优选地,上述修饰导致抑制基因表达、降低基因表达或增强基因表达。Preferably, the above modification results in inhibition of gene expression, reduction in gene expression or enhancement of gene expression.
  23. 根据权利要求11所述的基因编辑系统,其特征在于,所述基因编辑系统包括目标核酸或编码所述目标核酸的核酸,所述目标核酸包括同源臂片段和供体模板核酸;The gene editing system according to claim 11, characterized in that the gene editing system includes a target nucleic acid or a nucleic acid encoding the target nucleic acid, and the target nucleic acid includes a homology arm fragment and a donor template nucleic acid;
    优选地,所述目标核酸包含能够与所述间隔子序列杂交的序列;Preferably, the target nucleic acid comprises a sequence capable of hybridizing to the spacer sequence;
    优选地,所述同源臂片段包括5’同源臂和3’同源臂,所述目标核酸由所述5’同源臂、所述供体模板核酸和所述3’同源臂顺序连接组成。 Preferably, the homology arm fragment includes a 5' homology arm and a 3' homology arm, and the target nucleic acid is sequenced by the 5' homology arm, the donor template nucleic acid and the 3' homology arm. Connection composition.
  24. 根据权利要求11所述的基因编辑系统,其特征在于,所述基因编辑系统以可递送的形式存在,利用递送系统使所述基因编辑系统与所述靶核酸接触;The gene editing system according to claim 11, characterized in that the gene editing system exists in a deliverable form, and the delivery system is used to bring the gene editing system into contact with the target nucleic acid;
    优选地,所述递送系统将所述基因编辑系统递送入含有所述靶核酸的细胞中;Preferably, the delivery system delivers the gene editing system into cells containing the target nucleic acid;
    优选地,所述可递送的形式包括纳米颗粒、脂质体、外泌体、微泡、蛋白衣壳或基因枪所用的颗粒。Preferably, the deliverable form includes nanoparticles, liposomes, exosomes, microvesicles, protein capsids or particles for gene guns.
  25. 一种基因编辑载体,其特征在于,所述基因编辑载体包含权利要求11至24中任一项所述的基因编辑系统中的编码所述RNA指导物的核酸。A gene editing vector, characterized in that the gene editing vector contains the nucleic acid encoding the RNA guide in the gene editing system according to any one of claims 11 to 24.
  26. 根据权利要求25所述的基因编辑载体,其特征在于,所述基因编辑载体还包含权利要求6或7所述的DNA分子;The gene editing vector according to claim 25, characterized in that the gene editing vector further comprises the DNA molecule according to claim 6 or 7;
    优选地,所述DNA分子与编码所述RNA指导物的核酸位于相同或不同的载体上;Preferably, the DNA molecule and the nucleic acid encoding the RNA guide are located on the same or different vectors;
    优选地,所述DNA分子与第一调节元件连接;Preferably, the DNA molecule is connected to a first regulatory element;
    优选地,编码所述RNA指导物的核酸与第二调节元件连接;Preferably, the nucleic acid encoding said RNA guide is linked to a second regulatory element;
    优选地,所述第一调节元件和所述第二调节元件分别独立选自诱导型启动子、组成型启动子或组织特异性启动子中的一种或多种;Preferably, the first regulatory element and the second regulatory element are independently selected from one or more of inducible promoters, constitutive promoters or tissue-specific promoters;
    优选地,所述第一调节元件和所述第二调节元件分别独立选自T7、SP6、T3、CMV、EF1a、SV40、PGK1、humanβ-actin、CAG、U6、H1、T7、T7lac、araBAD、trp、lac或Ptac中的一种或多种。Preferably, the first regulatory element and the second regulatory element are independently selected from T7, SP6, T3, CMV, EF1a, SV40, PGK1, human β-actin, CAG, U6, H1, T7, T7lac, araBAD, One or more of trp, lac or Ptac.
  27. 一种权利要求11至24中任一项所述的基因编辑系统与细胞中靶核酸结合的方法,其特征在于,所述方法包括:A method for binding the gene editing system to a target nucleic acid in a cell according to any one of claims 11 to 24, characterized in that the method includes:
    将所述基因编辑系统递送至所述细胞中,所述细胞包括所述靶核酸;delivering the gene editing system to the cell, the cell including the target nucleic acid;
    使所述CRISPR-Cas效应子蛋白或所述CRISPR-Cas效应子融合蛋白,与所述RNA指导物结合,causing the CRISPR-Cas effector protein or the CRISPR-Cas effector fusion protein to bind to the RNA guide,
    使所述间隔子序列与所述靶核酸结合。The spacer sequence is allowed to bind to the target nucleic acid.
  28. 根据权利要求27所述的方法,其特征在于,所述靶核酸为双链DNA或单链DNA;The method of claim 27, wherein the target nucleic acid is double-stranded DNA or single-stranded DNA;
    优选地,所述基因编辑系统与所述细胞中靶核酸的结合,导致所述靶核酸的表达状态改变;Preferably, the combination of the gene editing system with the target nucleic acid in the cell results in a change in the expression state of the target nucleic acid;
    优选地,所述基因编辑系统与所述细胞中靶核酸的结合,导致所述靶核酸被切割;Preferably, the binding of the gene editing system to the target nucleic acid in the cell causes the target nucleic acid to be cleaved;
    优选地,所述靶核酸被切割导致靶核酸破坏、或靶核酸特定位点替换、或靶核酸位点的移除、或靶核酸区域功能的改变、或靶核酸上两个位点之间的序列倒置。 Preferably, the target nucleic acid is cleaved resulting in the destruction of the target nucleic acid, or the replacement of a specific site on the target nucleic acid, or the removal of the target nucleic acid site, or the change in the function of the target nucleic acid region, or the change between two sites on the target nucleic acid. Sequence inversion.
  29. 一种含有基因编辑系统的细胞,其特征在于,所述含有基因编辑系统的细胞包括权利要求11至24中任一项所述的基因编辑系统、或权利要求25或26所述的基因编辑载体。A cell containing a gene editing system, characterized in that the cell containing a gene editing system includes the gene editing system of any one of claims 11 to 24, or the gene editing vector of claims 25 or 26 .
  30. 根据权利要求29所述的含有基因编辑系统的细胞,其特征在于,所述含有基因编辑系统的细胞包含经修饰的目的靶基因座,所述目的靶基因座为利用所述基因编辑系统修饰的基因座;The cell containing a gene editing system according to claim 29, wherein the cell containing a gene editing system contains a modified target target locus, and the target target locus is modified using the gene editing system. loci;
    优选地,所述目的靶基因座的所述修饰导致:Preferably, said modification of said target locus of interest results in:
    (1)所述含有基因编辑系统的细胞包含至少一种基因产物的表达的改变;或(1) The cells containing the gene editing system contain changes in the expression of at least one gene product; or
    (2)所述含有基因编辑系统的细胞包含至少一种基因产物的表达的改变,其中所述至少一种基因产物的表达增加;或(2) the cell containing the gene editing system comprises an alteration in the expression of at least one gene product, wherein the expression of the at least one gene product is increased; or
    (3)所述含有基因编辑系统的细胞包含至少一种基因产物的表达的改变,其中所述至少一种基因产物的表达减少;或(3) The cells containing the gene editing system comprise changes in the expression of at least one gene product, wherein the expression of the at least one gene product is reduced; or
    (4)所述含有基因编辑系统的细胞包含经编辑的基因组;(4) The cells containing the gene editing system contain the edited genome;
    优选地,所述含有基因编辑系统的细胞包括真核细胞或原核细胞;Preferably, the cells containing the gene editing system include eukaryotic cells or prokaryotic cells;
    优选地,所述真核细胞包括动物细胞、植物细胞或人类细胞;Preferably, the eukaryotic cells include animal cells, plant cells or human cells;
    优选地,所述动物细胞包括哺乳动物细胞。Preferably, the animal cells comprise mammalian cells.
  31. 一种靶向和编辑靶核酸的方法,其特征在于,所述方法包括使所述靶核酸与权利要求11至24中任一项所述的基因编辑系统接触。A method of targeting and editing a target nucleic acid, characterized in that the method includes contacting the target nucleic acid with the gene editing system of any one of claims 11 to 24.
  32. 一种在识别靶核酸后非特异性降解单链DNA的方法,其特征在于,所述方法包括使所述靶核酸与权利要求11至24中任一项所述的基因编辑系统接触。A method for non-specific degradation of single-stranded DNA after identifying a target nucleic acid, characterized in that the method includes contacting the target nucleic acid with the gene editing system of any one of claims 11 to 24.
  33. 一种在识别双链靶DNA的间隔子互补链后靶向所述双链靶DNA的非间隔子互补链并使其产生切口的方法,其特征在于,所述方法包括使所述双链靶DNA与权利要求11至24中任一项所述的基因编辑系统接触。A method for targeting and nicking the non-spacer complementary strand of a double-stranded target DNA after identifying the spacer complementary strand of the double-stranded target DNA, characterized in that the method includes making the double-stranded target DNA The DNA is contacted with the gene editing system of any one of claims 11 to 24.
  34. 一种靶向和切割双链靶DNA的方法,其特征在于,所述方法包括使所述双链靶DNA与权利要求11至24中任一项所述的基因编辑系统接触。A method of targeting and cutting double-stranded target DNA, characterized in that the method includes contacting the double-stranded target DNA with the gene editing system of any one of claims 11 to 24.
  35. 根据权利要求34所述的方法,其特征在于,在使所述双链DNA的间隔子互补链产生切口之前,使所述双链靶DNA的非间隔子互补链产生切口。The method according to claim 34, characterized in that, before nicking the spacer complementary strand of the double-stranded DNA, the non-spacer complementary strand of the double-stranded target DNA is nicked.
  36. 一种特异性编辑双链核酸的方法,其特征在于,A method for specifically editing double-stranded nucleic acids, characterized by:
    所述方法包括在充分的条件下使以下进行接触充分的时间量,The method includes contacting under sufficient conditions for a sufficient amount of time,
    (1)权利要求11至24中任一项所述的CRISPR-Cas效应子蛋白、或权利要求4或5所述的CRISPR-Cas效应子融合蛋白、另一具有序列特异性切口活性的酶,以及所述 RNA指导物,所述RNA指导物指导所述CRISPR-Cas效应子蛋白或所述CRISPR-Cas效应子融合蛋白,相对于所述另一序列特异性切口酶的活性使相对链产生切口;以及(1) The CRISPR-Cas effector protein of any one of claims 11 to 24, or the CRISPR-Cas effector fusion protein of claim 4 or 5, or another enzyme with sequence-specific nicking activity, as well as the an RNA guide that directs the CRISPR-Cas effector protein or the CRISPR-Cas effector fusion protein to nick an opposing strand relative to the activity of the other sequence-specific nickase; and
    (2)所述双链核酸;(2) the double-stranded nucleic acid;
    所述方法导致双链断裂的形成。The method results in the formation of double-strand breaks.
  37. 一种编辑双链核酸的方法,其特征在于,所述方法包括在充分的条件下使以下进行接触充分的时间量:A method of editing double-stranded nucleic acids, characterized in that the method includes contacting the following under sufficient conditions for a sufficient amount of time:
    (1)权利要求11至24中任一项所述的CRISPR-Cas效应子蛋白、或权利要求4或5所述的CRISPR-Cas效应子融合蛋白,和具有DNA修饰活性的蛋白质结构域的融合蛋白,以及靶向所述双链核酸的所述RNA指导物;以及(1) Fusion of the CRISPR-Cas effector protein according to any one of claims 11 to 24, or the CRISPR-Cas effector fusion protein according to claim 4 or 5, and a protein domain having DNA modification activity a protein, and the RNA guide targeting the double-stranded nucleic acid; and
    (2)所述双链核酸;(2) the double-stranded nucleic acid;
    所述融合蛋白的CRISPR-Cas效应子被修饰以使所述双链核酸的非靶链产生切口。The CRISPR-Cas effector of the fusion protein is modified to nick the non-target strand of the double-stranded nucleic acid.
  38. 根据权利要求37所述的方法,其特征在于,所述双链核酸的两条链在不同的位点被切割,导致交错切割。The method of claim 37, wherein the two strands of the double-stranded nucleic acid are cleaved at different sites, resulting in staggered cleavage.
  39. 根据权利要求37所述的方法,其特征在于,所述双链核酸的两条链在同一位点被切割,导致平双链断裂。The method of claim 37, wherein the two strands of the double-stranded nucleic acid are cleaved at the same site, resulting in a flat double-stranded break.
  40. 一种靶向并切割单链靶DNA的方法,其特征在于,所述方法包括使靶核酸与权利要求11至24中任一项所述的基因编辑系统接触。A method for targeting and cutting single-stranded target DNA, characterized in that the method includes contacting the target nucleic acid with the gene editing system of any one of claims 11 to 24.
  41. 一种诱导细胞状态改变的方法,其特征在于,所述方法包括使权利要求11至24中任一项所述的基因编辑系统与细胞中的所述靶核酸接触。A method for inducing changes in cell state, characterized in that the method includes contacting the gene editing system of any one of claims 11 to 24 with the target nucleic acid in the cell.
  42. 根据权利要求41所述的方法,其特征在于,所述细胞状态包括凋亡或休眠;The method of claim 41, wherein the cell state includes apoptosis or dormancy;
    优选地,所述细胞包括真核细胞或原核细胞;Preferably, the cells include eukaryotic cells or prokaryotic cells;
    优选地,所述细胞包括哺乳动物细胞或植物病变细胞;Preferably, the cells include mammalian cells or plant disease cells;
    优选地,所述细胞包括癌细胞;Preferably, the cells include cancer cells;
    优选地,所述细胞包括感染性细胞或被感染原感染的细胞;Preferably, the cells include infectious cells or cells infected by an infectious agent;
    优选地,所述细胞包括被病毒感染的细胞、被朊病毒感染的细胞;Preferably, the cells include virus-infected cells and prion-infected cells;
    优选地,所述细胞包括真菌细胞、原生动物或寄生虫细胞。Preferably, the cells comprise fungal, protozoan or parasitic cells.
  43. 权利要求11至24中任一项所述的基因编辑系统在制备治疗受试者病症或疾病的药物中的应用。 Use of the gene editing system according to any one of claims 11 to 24 in the preparation of medicines for treating conditions or diseases in a subject.
  44. 根据权利要求43所述的应用,其特征在于,所述应用包括向所述受试者或所述受试者的离体细胞施用所述基因编辑系统;The application according to claim 43, characterized in that the application comprises administering the gene editing system to the subject or the ex vivo cells of the subject;
    优选地,所述间隔子序列与跟所述病症或疾病相关的所述靶核酸的至少15个核苷酸互补,所述CRISPR-Cas效应子蛋白或所述CRISPR-Cas效应子融合蛋白切割所述靶核酸;Preferably, the spacer sequence is complementary to at least 15 nucleotides of the target nucleic acid associated with the disorder or disease, which is cleaved by the CRISPR-Cas effector protein or the CRISPR-Cas effector fusion protein. The target nucleic acid;
    优选地,所述病症或疾病包括癌症、感染性疾病、代谢性疾病或遗传性疾病;Preferably, the condition or disease includes cancer, infectious disease, metabolic disease or genetic disease;
    优选地,所述癌症包括维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱癌中的一种或多种;Preferably, the cancer includes Wilms tumor, Ewing sarcoma, neuroendocrine tumor, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, Kidney cancer, pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphocyte One or more of leukemia, acute myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
    优选地,所述感染性疾病的感染原包括人类免疫缺陷病毒、单纯疱疹病毒-1、乙型肝炎或单纯疱疹病毒-2中的一种或多种;Preferably, the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B or herpes simplex virus-2;
    优选地,所述代谢性疾病包括家族性高胆固醇血症;Preferably, the metabolic disease includes familial hypercholesterolemia;
    优选地,所述遗传性疾病包括转甲状腺素蛋白淀粉样变、原发性高草尿酸症、遗传性血管性水肿;Preferably, the genetic disease includes transthyretin amyloidosis, primary hyperoxaluric acidosis, and hereditary angioedema;
    优选地,所述病症或疾病包括囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种。Preferably, the disorder or disease includes cystic fibrosis, progressive Duchenne muscular dystrophy, Baker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy, Huntington's disease, Fragile One or more of syncytosis or beta thalassemia.
  45. 一种真核细胞系,其特征在于,所述真核细胞系包含权利要求29或30所述的含有基因编辑系统的细胞,或为所述含有基因编辑系统的细胞的后代。A eukaryotic cell line, characterized in that the eukaryotic cell line contains the cell containing the gene editing system of claim 29 or 30, or is a progeny of the cell containing the gene editing system.
  46. 一种多细胞生物体,其特征在于,所述多细胞生物体包含权利要求29或30所述的含有基因编辑系统的细胞。A multicellular organism, characterized in that the multicellular organism contains the cell containing the gene editing system of claim 29 or 30.
  47. 根据权利要求46所述的多细胞生物体,其特征在于,所述多细胞生物体包括模型动物或模型植物。The multicellular organism according to claim 46, characterized in that the multicellular organism includes a model animal or a model plant.
  48. 一种获得目的性状的植物的方法,其特征在于,利用权利要求11至24中任一项所述的基因编辑系统与植物细胞接触,对所述植物细胞的基因进行修饰或引入目的基因,所述修饰或目的基因能够表达所述目的性状,获得修饰后的植物细胞,A method for obtaining plants with target traits, characterized by using the gene editing system of any one of claims 11 to 24 to contact plant cells, modifying the genes of the plant cells or introducing target genes, so The modification or target gene can express the target trait, and a modified plant cell is obtained,
    利用所述修饰后的植物细胞进行再生,获得所述目的性状的植物。 The modified plant cells are used for regeneration to obtain plants with the desired traits.
  49. 一种鉴定植物中目的性状的方法,其特征在于,植物细胞中的目的基因能够表达所述目的性状,利用权利要求11至24中任一项所述的基因编辑系统与所述植物细胞接触,从而鉴定所述目的基因。A method for identifying target traits in plants, characterized in that the target genes in plant cells are capable of expressing the target traits, and the gene editing system of any one of claims 11 to 24 is used to contact the plant cells, Thus, the target gene is identified.
  50. 一种试剂盒,其特征在于,所述试剂盒包括一种或多种选自下列的组分:权利要求1至3中任一项所述的CRISPR-Cas效应子蛋白、权利要求4或5所述的CRISPR-Cas效应子融合蛋白、权利要求6或7所述的DNA分子、权利要求8或9所述的重组载体、权利要求10所述的宿主细胞、权利要求11至24所述的基因编辑系统、权利要求25或26所述的基因编辑载体、权利要求29或30所述的含有基因编辑系统的细胞、权利要求45所述的真核细胞系、权利要求46或47所述的多细胞生物体;A kit, characterized in that the kit includes one or more components selected from the following: the CRISPR-Cas effector protein according to any one of claims 1 to 3, claim 4 or 5 Described CRISPR-Cas effector fusion protein, the DNA molecule described in claim 6 or 7, the recombinant vector described in claim 8 or 9, the host cell described in claim 10, the method described in claims 11 to 24 Gene editing system, the gene editing vector according to claim 25 or 26, the cell containing the gene editing system according to claim 29 or 30, the eukaryotic cell line according to claim 45, the gene editing vector according to claim 46 or 47 multicellular organisms;
    所述试剂盒的组分在相同或不同的容器中。The components of the kit are in the same or different containers.
  51. 一种容器,其特征在于,所述容器包含权利要求50所述的试剂盒。A container, characterized in that the container contains the kit according to claim 50.
  52. 根据权利要求51所述的容器,其特征在于,所述容器包括无菌容器;The container of claim 51, wherein said container comprises a sterile container;
    优选地,所述容器包括注射器。Preferably, the container includes a syringe.
  53. 一种可植入装置,其特征在于,所述可植入装置包括权利要求11至24中任一项所述的基因编辑系统。An implantable device, characterized in that the implantable device includes the gene editing system according to any one of claims 11 to 24.
  54. 根据权利要求53所述的可植入装置,其特征在于,所述基因编辑系统在基质内;The implantable device of claim 53, wherein the gene editing system is within a matrix;
    优选地,所述基因编辑系统在储库内。Preferably, the gene editing system is within a reservoir.
  55. 一种治疗有需要的受试者的疾病或病症的方法,其特征在于,所述方法包括向受试者施用如下任意一种或多种:A method of treating a disease or condition in a subject in need thereof, characterized in that the method includes administering to the subject any one or more of the following:
    权利要求1至3中任一项所述的CRISPR-Cas效应子蛋白、或所述CRISPR-Cas效应子蛋白的衍生物或所述CRISPR-Cas效应子蛋白的功能片段,以及异源功能结构域,以及RNA指导物或编码所述RNA指导物的核酸;The CRISPR-Cas effector protein of any one of claims 1 to 3, or a derivative of the CRISPR-Cas effector protein or a functional fragment of the CRISPR-Cas effector protein, and a heterologous functional domain , and an RNA guide or a nucleic acid encoding said RNA guide;
    权利要求4或5所述的CRISPR-Cas效应子融合蛋白,以及RNA指导物或编码所述RNA指导物的核酸;The CRISPR-Cas effector fusion protein of claim 4 or 5, and an RNA guide or a nucleic acid encoding the RNA guide;
    权利要求6或7所述的DNA分子;The DNA molecule of claim 6 or 7;
    权利要求8或9所述的重组载体;The recombinant vector according to claim 8 or 9;
    权利要求11至24中任一项所述的基因编辑系统;The gene editing system according to any one of claims 11 to 24;
    权利要求25或26所述的基因编辑载体;The gene editing vector according to claim 25 or 26;
    权利要求29或30所述的含有基因编辑系统的细胞;The cell containing the gene editing system according to claim 29 or 30;
    权利要求45所述的真核细胞系; The eukaryotic cell line of claim 45;
    权利要求46或47所述的多细胞生物体;The multicellular organism of claim 46 or 47;
    权利要求50所述的试剂盒;The kit according to claim 50;
    权利要求51或52所述的容器;或A container as claimed in claim 51 or 52; or
    权利要求53或54所述的可植入装置。The implantable device of claim 53 or 54.
  56. 根据权利要求55所述的方法,其特征在于,所述受试者包括患有病症或疾病的受试者,所述病症或疾病包括癌症、感染性疾病代谢性疾病或遗传性疾病;The method of claim 55, wherein the subject includes a subject suffering from a disorder or disease, including cancer, infectious disease, metabolic disease, or genetic disease;
    优选地,所述癌症包括维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱癌中的一种或多种;Preferably, the cancer includes Wilms tumor, Ewing sarcoma, neuroendocrine tumor, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, Kidney cancer, pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphocyte One or more of leukemia, acute myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
    优选地,所述感染性疾病的感染原包括人类免疫缺陷病毒、单纯疱疹病毒-1或单纯疱疹病毒-2中的一种或多种;Preferably, the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1 or herpes simplex virus-2;
    优选地,所述代谢性疾病包括家族性高胆固醇血症;Preferably, the metabolic disease includes familial hypercholesterolemia;
    优选地,所述遗传性疾病包括转甲状腺素蛋白淀粉样变、原发性高草尿酸症、遗传性血管性水肿;Preferably, the genetic disease includes transthyretin amyloidosis, primary hyperoxaluric acidosis, and hereditary angioedema;
    优选地,所述病症或疾病包括囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种。Preferably, the disorder or disease includes cystic fibrosis, progressive Duchenne muscular dystrophy, Baker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy, Huntington's disease, Fragile One or more of syncytosis or beta thalassemia.
  57. 权利要求11至24中任一项所述的基因编辑系统在治疗病症或疾病中的用途。Use of the gene editing system of any one of claims 11 to 24 in the treatment of a condition or disease.
  58. 根据权利要求57的用途,其特征在于,所述病症或疾病包括癌症、感染性疾病、代谢性疾病或遗传性疾病;Use according to claim 57, characterized in that the condition or disease includes cancer, infectious diseases, metabolic diseases or genetic diseases;
    所述癌症包括维尔姆斯瘤、尤文肉瘤、神经内分泌瘤、胶质母细胞瘤、神经母细胞瘤、黑色素瘤、皮肤癌、乳腺癌、结肠癌、直肠癌、前列腺癌、肝癌、肾癌、胰腺癌、肺癌、胆道癌、宫颈癌、子宫内膜癌、食管癌、胃癌、头颈癌、甲状腺髓样癌、卵巢癌、胶质瘤、淋巴瘤、白血病、骨髓瘤、急性淋巴细胞白血病、急性髓细胞性白血病、慢性淋巴细胞白血病、慢性髓细胞性白血病、何杰金氏淋巴瘤、非何杰金氏淋巴瘤或尿膀胱癌中的一种或多种;The cancers include Wilms tumor, Ewing sarcoma, neuroendocrine tumors, glioblastoma, neuroblastoma, melanoma, skin cancer, breast cancer, colon cancer, rectal cancer, prostate cancer, liver cancer, kidney cancer, Pancreatic cancer, lung cancer, biliary tract cancer, cervical cancer, endometrial cancer, esophageal cancer, gastric cancer, head and neck cancer, medullary thyroid cancer, ovarian cancer, glioma, lymphoma, leukemia, myeloma, acute lymphoblastic leukemia, acute One or more of myeloid leukemia, chronic lymphocytic leukemia, chronic myelogenous leukemia, Hodgkin's lymphoma, non-Hodgkin's lymphoma, or urinary bladder cancer;
    优选地,所述感染性疾病的感染原包括人类免疫缺陷病毒、单纯疱疹病毒-1、乙型肝炎或单纯疱疹病毒-2中的一种或多种; Preferably, the infectious agent of the infectious disease includes one or more of human immunodeficiency virus, herpes simplex virus-1, hepatitis B or herpes simplex virus-2;
    优选地,所述代谢性疾病包括家族性高胆固醇血症;Preferably, the metabolic disease includes familial hypercholesterolemia;
    优选地,所述遗传性疾病包括转甲状腺素蛋白淀粉样变、原发性高草尿酸症、遗传性血管性水肿;Preferably, the genetic disease includes transthyretin amyloidosis, primary hyperoxaluric acidosis, and hereditary angioedema;
    优选地,所述病症或疾病包括囊性纤维化、进行性假肥大性肌营养不良、贝克肌营养不良、α-1-抗胰蛋白酶缺乏、庞贝病、强直性肌营养不良、亨廷顿病、脆性X综合征、弗里德赖希共济失调、肌萎缩侧索硬化、额颞叶痴呆、遗传性慢性肾脏病、高脂血症、高胆固醇血症、莱伯氏先天性黑蒙、镰状细胞病或β地中海贫血中的一种或多种。 Preferably, the disorder or disease includes cystic fibrosis, progressive Duchenne muscular dystrophy, Baker muscular dystrophy, alpha-1-antitrypsin deficiency, Pompe disease, myotonic dystrophy, Huntington's disease, Fragile One or more of syncytosis or beta thalassemia.
PCT/CN2023/100529 2022-06-16 2023-06-15 Crispr-cas effector protein, gene editing system therefor, and application WO2023241669A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210681597.5A CN114921439B (en) 2022-06-16 2022-06-16 CRISPR-Cas effector protein, gene editing system and application thereof
CN202210681597.5 2022-06-16

Publications (1)

Publication Number Publication Date
WO2023241669A1 true WO2023241669A1 (en) 2023-12-21

Family

ID=82814195

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/100529 WO2023241669A1 (en) 2022-06-16 2023-06-15 Crispr-cas effector protein, gene editing system therefor, and application

Country Status (2)

Country Link
CN (1) CN114921439B (en)
WO (1) WO2023241669A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114921439B (en) * 2022-06-16 2024-04-26 尧唐(上海)生物科技有限公司 CRISPR-Cas effector protein, gene editing system and application thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110418647A (en) * 2016-09-30 2019-11-05 加利福尼亚大学董事会 The nucleic acid modifying enzyme and its application method of RNA guidance
US20200063126A1 (en) * 2018-03-14 2020-02-27 Arbor Biotechnologies, Inc. Novel crispr dna targeting enzymes and systems
CN113106081A (en) * 2018-10-29 2021-07-13 中国农业大学 Novel CRISPR/Cas12f enzymes and systems
CN113308451A (en) * 2020-12-07 2021-08-27 中国科学院动物研究所 Engineered Cas effector proteins and methods of use thereof
CN114921439A (en) * 2022-06-16 2022-08-19 尧唐(上海)生物科技有限公司 CRISPR-Cas effector protein, and gene editing system and application thereof

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116694603A (en) * 2019-05-14 2023-09-05 深圳华大生命科学研究院 Novel Cas protein, crispr-Cas system and use thereof in the field of gene editing
CN114015674A (en) * 2021-11-02 2022-02-08 辉二(上海)生物科技有限公司 Novel CRISPR-Cas12i system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110418647A (en) * 2016-09-30 2019-11-05 加利福尼亚大学董事会 The nucleic acid modifying enzyme and its application method of RNA guidance
US20200063126A1 (en) * 2018-03-14 2020-02-27 Arbor Biotechnologies, Inc. Novel crispr dna targeting enzymes and systems
CN113106081A (en) * 2018-10-29 2021-07-13 中国农业大学 Novel CRISPR/Cas12f enzymes and systems
CN113308451A (en) * 2020-12-07 2021-08-27 中国科学院动物研究所 Engineered Cas effector proteins and methods of use thereof
CN114921439A (en) * 2022-06-16 2022-08-19 尧唐(上海)生物科技有限公司 CRISPR-Cas effector protein, and gene editing system and application thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DATABASE Protein 29 December 2020 (2020-12-29), ANONYMOUS : "Chain A, Cas12i2", XP093117584, retrieved from NCBI Database accession no. PDB: 6LTP_A *
LIU KE, LIN GUIHONG, LIU KUN, ZHOU WEI, WANG FENGQING, DONGZHI WEI: "Mining, engineering and functional expansion of CRISPR/Cas systems", SYNTHETIC BIOLOGY JOURNAL., vol. 4, no. 1, 13 May 2022 (2022-05-13), pages 47 - 66, XP093117588, DOI: 10.12211/2096-8280.2021-022 *
ZHANG HENG; LI ZHUANG; XIAO RENJIAN; CHANG LEIFU: "Mechanisms for target recognition and cleavage by the Cas12i RNA-guided endonuclease", NATURE STRUCTURAL & MOLECULAR BIOLOGY, NATURE PUBLISHING GROUP US, NEW YORK, vol. 27, no. 11, 7 September 2020 (2020-09-07), New York , pages 1069 - 1076, XP037295140, ISSN: 1545-9993, DOI: 10.1038/s41594-020-0499-0 *

Also Published As

Publication number Publication date
CN114921439A (en) 2022-08-19
CN114921439B (en) 2024-04-26

Similar Documents

Publication Publication Date Title
US11149259B2 (en) CRISPR-Cas systems and methods for altering expression of gene products, structural information and inducible modular Cas enzymes
JP7094323B2 (en) Optimization Function Systems, Methods and Compositions for Sequence Manipulation with CRISPR-Cas Systems
EP3237615B1 (en) Crispr having or associated with destabilization domains
EP3234192B1 (en) Unbiased identification of double-strand breaks and genomic rearrangement by genome-wide insert capture sequencing
EP3080259B1 (en) Engineering of systems, methods and optimized guide compositions with new architectures for sequence manipulation
JP2024023194A (en) Delivery and use of crispr-cas systems, vectors and compositions for hepatic targeting and therapy
JP2023027277A (en) Rna-guided nucleic acid modifying enzymes and methods of use thereof
WO2016094872A9 (en) Dead guides for crispr transcription factors
AU2014362248A1 (en) Compositions and methods of use of CRISPR-Cas systems in nucleotide repeat disorders
WO2016049258A2 (en) Functional screening with optimized functional crispr-cas systems
WO2023241669A1 (en) Crispr-cas effector protein, gene editing system therefor, and application
JP2024504630A (en) Site-specific genetic modification
US20220364122A1 (en) Bacterial platform for delivery of gene-editing systems to eukaryotic cells
CN116783295A (en) Novel design of guide RNA and use thereof
US9816077B2 (en) Use of integrase for targeted gene expression
KR20230016751A (en) Nucleobase editor and its use

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23823235

Country of ref document: EP

Kind code of ref document: A1