WO2024036600A1 - Autonomous parthenogenetic haploid reproduction method for maternal cells and use thereof in breeding - Google Patents

Autonomous parthenogenetic haploid reproduction method for maternal cells and use thereof in breeding Download PDF

Info

Publication number
WO2024036600A1
WO2024036600A1 PCT/CN2022/113576 CN2022113576W WO2024036600A1 WO 2024036600 A1 WO2024036600 A1 WO 2024036600A1 CN 2022113576 W CN2022113576 W CN 2022113576W WO 2024036600 A1 WO2024036600 A1 WO 2024036600A1
Authority
WO
WIPO (PCT)
Prior art keywords
gene
positions
seq
expression cassette
promoter
Prior art date
Application number
PCT/CN2022/113576
Other languages
French (fr)
Chinese (zh)
Inventor
谢传晓
祁显涛
高惠敏
刘昌林
朱金洁
黄晶
徐孝洁
赵延明
毛文博
吕仁瑶
Original Assignee
中国农业科学院作物科学研究所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国农业科学院作物科学研究所 filed Critical 中国农业科学院作物科学研究所
Priority to PCT/CN2022/113576 priority Critical patent/WO2024036600A1/en
Publication of WO2024036600A1 publication Critical patent/WO2024036600A1/en

Links

Images

Classifications

    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01HNEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
    • A01H5/00Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01HNEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
    • A01H6/00Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
    • A01H6/46Gramineae or Poaceae, e.g. ryegrass, rice, wheat or maize
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses

Definitions

  • the invention relates to the field of biotechnology, and in particular to a maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application.
  • BBM transcription factors belong to the APETALA2 (AP2) family and play an important regulatory role in regulating the development of floral meristems and promoting cell proliferation and morphogenesis (Kim S, Soltis P S, Wall K, et al. Phylogeny and domain evolution in the APETALA2-like gene family[J].Molecular Biology and Evolution, 2006,23(1):107-120.).
  • AP2 APETALA2
  • Previous studies have shown that ectopic expression of BBM transcription factors in egg cells can significantly increase the frequency of plant parthenogenetic haploids. In 2015, Conner et al.
  • Transcription factors are key regulators of target gene transcription by binding to specific DNA sequences in the enhancer and promoter regions of target genes (Spitz, F. and Furlong, E.E.M. (2012) Transcription factors: from enhancer binding to developmental control. Nat .Rev.Genet.,13,613–626.).
  • ATFs artificial transcription factors fuse sequence-specific DNA-binding domains to activation domains to regulate gene expression
  • zinc-finger ATFs Wang, K.A., Chateau, M.L., and Porteus
  • TAL effectors for targeted enhancement of transgene expression in plants.Plant Biotechnol.J.,12,436 –446.
  • CRISPR-mediated gene activation system Gilbert, L.A., Larson, M.H., Morsu
  • CRISPR-mediated modular RNA -guided regulation of transcription in eukaryotes. Cell,154,442–451. CRISPR-mediated gene activation
  • CRISPRa CRISPR-mediated gene activation
  • VPR VP64-P65-Rta
  • CRISPR-SAM CRISPR-SAM
  • Cas9-SunTag CRISPR-mediated modular RNA -guided regulation of transcription in eukaryotes. Cell,154,442–451.
  • a series of transcription activation tools such as dCas9-VP64, dCas9-6TAL-VP128, CRISPR-Act2.0 and CRISPR-Act3.0 have shown strong gene transcription activation levels in plants.
  • no gene activation system suitable for maize egg cells or plants has been developed so far. Therefore, the establishment of a maize gene activation system will provide a powerful technical means for studying the ectopic expression of ZmBBM2 gene in maize egg cells and exploring the relationship between the BBM gene dosage effect and the frequency of parthenogenetic haploid induction.
  • the purpose of the present invention is to provide a maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application.
  • the present invention claims a method for obtaining maize parthenogenetic haploid.
  • the purpose of the present invention is to provide a maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application.
  • the present invention claims a method for obtaining maize parthenogenetic haploid.
  • the method for obtaining maize parthenogenetic haploids claimed by the present invention may include the following steps: introducing the system vector of the CRISPRa5 gene activation system used to mediate the ectopic expression of the ZmBBM2 gene in maize egg cells into the recipient maize to obtain positive transgenic maize. ; The egg cells in some embryo sacs of the transgenic positive corn can directly develop into haploid embryos and form haploid seeds.
  • the system vector can express dCas9-VP64 fusion protein, can express MS2-P65-HSF1 fusion protein, and can express one or several sgRNA targeting the -200 to -1 bp position upstream of the ZmBBM2 gene transcription start point, and the sgRNA is sgRNA2.0.
  • system vector contains the following 2 expression cassettes:
  • Expression cassette 1 used to express two sgRNAs; in the expression cassette 1, it contains the coding sequences of two sgRNAs, and the coding sequences of the two sgRNAs are connected with the Arabidopsis thaliana At-tRNAGly coding sequence and are composed of the same A promoter initiates transcription;
  • Expression cassette 2 used to express functional proteins, which are formed by fusing the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein through the self-cleaving polypeptide T2A; in the expression cassette 2, The functional protein is transcribed by a plant egg cell-specific promoter.
  • two sgRNAs respectively target the -36 and -166 bp regions upstream of the TSS of the ZmBBM2 gene.
  • the promoter is the ZmU6-2 promoter.
  • the plant egg cell-specific promoter is the Arabidopsis thaliana DD45 promoter.
  • both the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein contain nuclear localization signals.
  • the expression cassette 1 consists of the ZmU6-2 promoter and the sgRNA spacer sequence targeting the -36 bp region upstream of the ZmBBM2 gene transcription start point (TSS) from the 5' end to the 3' end.
  • the coding nucleic acid, the sgRNA2.0 scaffold coding sequence, the Arabidopsis thaliana At-tRNAGly coding sequence, the coding nucleic acid of the sgRNA spacer sequence targeting the -166bp region upstream of the ZmBBM2 gene transcription start site (TSS), the sgRNA2.0 scaffold Composed of coding sequences.
  • the expression cassette 2 consists of the Arabidopsis DD45 promoter, 3 ⁇ Flag tag encoding nucleic acid, SV40 nuclear localization signal encoding sequence, and dCas9 encoding gene in sequence from the 5' end to the 3' end. , NLS nuclear localization signal coding sequence, VP64 coding gene, the coding gene of the self-cleaving polypeptide T2A, the MS2 coding gene, the SV40 nuclear localization signal coding sequence, the P65 coding gene, the HSF1 coding gene and the transcription terminator NOS.
  • the ZmU6-2 promoter is as shown in positions 1-397 of SEQ ID No. 16; the sgRNA spacer sequence targeting the -36bp region upstream of the ZmBBM2 gene transcription start point (TSS)
  • the coding nucleic acid is shown in positions 398-417 of SEQ ID No. 16; the sgRNA2.0 scaffold coding sequence is shown in positions 418-553 or 651-793 of SEQ ID No. 16; the proposed The Arabidopsis At-tRNAGly coding sequence is shown in positions 554-630 of SEQ ID No. 16; the coding nucleic acid of the sgRNA spacer sequence targeting the -166 bp region upstream of the ZmBBM2 gene transcription start point (TSS) is shown in SEQ ID No. Positions 631-650 of 16 are shown.
  • the Arabidopsis DD45 promoter is as shown in positions 1008-2017 of SEQ ID No. 16; the 3 ⁇ Flag tag encoding nucleic acid is as shown in positions 2018-2086 of SEQ ID No. 16 As shown; the SV40 nuclear localization signal coding sequence is as shown in positions 2093-2113 or 6992-7012 of SEQ ID No. 16; the dCas9 encoding gene is as shown in positions 2114-6238 of SEQ ID No. 16 shown; the NLS nuclear localization signal coding sequence is shown in positions 6239-6286 of SEQ ID No. 16; the VP64 encoding gene is shown in positions 6305-6454 of SEQ ID No.
  • the coding gene of T2A is shown in positions 6482-6535 of SEQ ID No. 16; the MS2 coding gene is shown in positions 6551-6937 of SEQ ID No. 16; the P65 coding gene is shown in SEQ ID No. 16
  • the HSF1 coding gene is shown in positions 7595-7966 of SEQ ID No. 16; the transcription terminator NOS is shown in positions 7967-8238 of SEQ ID No. 16.
  • the nucleotide sequence of the expression cassette 1 is shown in positions 1-1007 of SEQ ID No. 16.
  • the nucleotide sequence of the expression cassette 2 is shown in positions 1008-8238 of SEQ ID No. 16.
  • the system vector of the CRISPRa5 gene activation system is a plasmid containing the DNA fragment shown in SEQ ID No. 16.
  • the DNA fragment shown in SEQ ID No. 16 is replaced with the CPB vector (saved by this laboratory, article Li C, Liu C, Qi X, et al. RNA-guided Cas9 as an in The recombinant vector obtained by digesting the small fragment between the HindIII and EcoRI enzyme sites of vivo desired-target mutator in maize[J].Plant biotechnology journal, 2017,15(12):1566-1576.).
  • the method may also include the following steps: using the maize haploid auxiliary screening line as a pollen donor to pollinate the positive transgenic maize, thereby screening out maize parthenogenetic haploids.
  • the recipient corn is the corn inbred line KN5585.
  • the corn haploid-assisted screening system is DFP haploid-assisted screening of corn transgenic lines (recorded in the document "Dong L, Li L, Liu C, et al.
  • Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant, 2018,11(9):1214-1217.”As shown in Supplemental Figure 1, the eGFP gene was found to be expressed in embryos(green)and DsRED was expressed in endosperms(red). The DFP cassette was then stably transformed into the maize variety ZC01, the same inbred line as the created haploid inducer line ( Figure 1C)", that is, the DFP cassette shown in Figure 1C was stably transformed into corn in this document The transgenic strain obtained after the selection of variety ZC01 can be obtained by the public from the applicant and can only be used to repeat the experiments of the present invention and cannot be used for other purposes).
  • This transgenic line contains two fluorescent reporter expression cassettes, one of which marks the aleurone layer and uses the aleurone layer-specific promoter LTP2 to activate the DsRed2 gene.
  • This expression cassette can mark the aleurone layer of corn kernels with red fluorescence.
  • the other expression cassette marks embryos and uses an embryo-specific promoter to drive the eGFP gene. Therefore, DFP haploid-assisted screening of corn transgenic lines is used as a pollen donor to pollinate the recipient corn. When the material is not induced to become haploid, the two sperm cells provided by the DFP line can achieve double fertilization and obtain embryonic hair. A kernel that glows green and the aleurone layer of the endosperm glows red.
  • the present invention claims to protect the system vector of the CRISPRa5 gene activation system for mediating the ectopic expression of ZmBBM2 gene in maize egg cells as described in the first aspect.
  • the present invention claims the use of the system vector described in the second aspect in inducing parthenogenetic haploidy in maize.
  • the present invention claims a CRISPR-based gene activation system.
  • the system vector of the CRISPR-based gene activation system claimed by the present invention can express dCas9-VP64 fusion protein, can express MS2-P65-HSF1 fusion protein, and can express one or several sgRNA targeting the target region of the target gene, so The sgRNA is sgRNA2.0.
  • system vector contains the following 2 expression cassettes:
  • Expression cassette 1' used to express the sgRNA
  • Expression cassette 2' used to express functional proteins, which are formed by fusing the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein through the self-cleaving polypeptide T2A.
  • the promoter used to initiate the transcription of the sgRNA is the U6 promoter.
  • the promoter used to initiate the transcription of the functional protein is the UBI promoter.
  • Both the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein contain nuclear localization signals.
  • the expression cassette 1' consists of the U6 promoter, the coding nucleic acid of the sgRNA spacer sequence targeting the target gene target region, and sgRNA2.0 in sequence from the 5' end to the 3' end. Scaffold coding sequence composition.
  • the 2' expression cassette consists of the UBI promoter, the NLS nuclear localization signal coding sequence, the dCas9 coding gene, the VP64 coding gene, and the self-cleavage sequence from the 5' end to the 3' end. It consists of the coding gene for polypeptide T2A, the MS2 coding gene, the NLS nuclear localization signal coding sequence, the P65 coding gene, the HSF1 coding gene and the NOS transcription terminator.
  • the nucleotide sequence of the U6 promoter is as shown in positions 1-397 of SEQ ID No.7-15; the sgRNA2.0 scaffold coding sequence is as SEQ ID No.8 -15 is shown in bits 418-560.
  • the nucleotide sequence of the UBI promoter is as shown in positions 18-2013 of SEQ ID No. 5; the NLS nuclear localization signal coding sequence is as shown in position 18 of SEQ ID No. 5.
  • the dCas9 encoding gene is shown in positions 2107-2127 or 7006-7026; the dCas9 encoding gene is shown in positions 2152-6252 of SEQ ID No. 5; the VP64 encoding gene is shown in positions 6319-6495 of SEQ ID No. 5 As shown; the coding gene of the self-cleaving polypeptide T2A is shown in positions 6496-6549 of SEQ ID No.
  • the MS2 coding gene is shown in positions 6565-6951 of SEQ ID No. 5; the P65 The coding gene is as shown in positions 7042-7584 of SEQ ID No.5; the HSF1 encoding gene is as shown in positions 7609-7983 of SEQ ID No.5; the NOS transcription terminator is as shown in SEQ ID No.5 Shown at bits 8018-8270.
  • the sequence of 2' of the expression cassette is shown in SEQ ID No. 5.
  • the present invention claims the application of the CRISPR-based gene activation system described in the fourth aspect in gene activation of target genes.
  • FIG. 1 shows the evaluation of the transcriptional activation ability of CRISPRa(1-6) tools in corn mesophyll protoplasts.
  • a CRISPRa vector.
  • dCas9 Streptococcus pyogenes Cas9 lacks endonuclease activity (mutations D10A, H840A);
  • ERF2m Arabidopsis ethylene response modified transcriptional activator 2 (ERF2); HSF1, human heat shock factor 1; MS2, phage coat protein; NLS , nuclear localization signal; P65, NF-kB transactivating subunit P65; pU6, ZmU6-2 promoter; pUBI, maize ubiquitin promoter;
  • sgRNA1.0 Cas9 conventional sgRNA; sgRNA2.0, contains two MS2 stems Circular sgRNA backbone; T, terminator; T2A, self-cleaving polypeptide; VP64, 4 copies of the transcriptional activator VP16.
  • b Schematic diagram of the three components of the a4-a6 system.
  • cd RT-qPCR detection of ZmDPS1 expression.
  • c Schematic diagram of ZmDPS1 site.
  • the first transcription start site (TSS) is named +1.
  • CRISPRa(1-6) mediates the activated expression of ZmDPS1.
  • the maize ubiquitin gene (NCBIGenBank: U29159.1) was used as a reference. Relative expression levels were calculated using the 2 - ⁇ Ct method.
  • e Using the ZmDPS1 gene as the sgRNA target to evaluate the window activity of the CRISPRa5 tool.
  • FIG. 2 shows the isolation of maize female gametocytes and validation of the CRISPRa5 system in egg cell (EC) protoplasts.
  • a–h Single-cell level micromanipulation and isolation of maize female gametocytes.
  • a Schematic diagram of corn ovary structure. AC antipodal cells, CC central cells, EC egg cells, SC synergid cells.
  • b Isolated maize ovary and embryo sac ES (red arrow).
  • c The embryo sac is separated by manual micromanipulation under a stereomicroscope.
  • d Single cells obtained from ES by partial enzymatic dissociation.
  • e-h separated AC(e), SC(f), CC(g) and EC(h).
  • i and j Egg cell-specific CRISPRa5 egg cell targeting activates the LTP2pro-DsRed2 expression cassette of the stably transformed DFP line.
  • i Schematic diagram of CRISPRa5 vector targeting LTP2pro.
  • Figure 3 shows the targeted activation of the maize endogenous ZmBBM2 gene based on egg cell-specific CRISPRa5 BBM .
  • a Schematic diagram of CRISPRa5 BBM vector. 3 ⁇ FLAG, 3 tandem FLAG epitope tags; Bar, BlpR gene. Other components are shown in the CRISPRa5 vector shown in Figure 1, a.
  • b Identification of transformed copy number based on ddPCR technology. The nine independent transformants of the T3 generation were EA13, EA14, EA15, EA19, EA21, EA22, EA28, EA34 and EA36.
  • the a5 BBM probe was labeled with FAM, while the ZmADH1 internal reference probe was labeled with HEX.
  • the upper left panel shows the 1D reference data of target genes and internal reference genes; the upper right panel shows the calculated a5 BBM copy number; the lower panel shows the 2D reference data of a typical EA19 line.
  • cf Quantitative analysis of gene expression levels in unpollinated female gametophytes.
  • c RT-ddPCR identifies a5 BBM expression level in embryo sac.
  • d RT-ddPCR identification of ZmBBM2 gene expression levels in embryo sac shown in c. Quantitative results were converted into target gene expression copy number per 1 ⁇ g of total RNA.
  • e Single-cell RT-ddPCR technology identifies the expression levels of ECs, ANs, SCs and CCs isolated from EA19 and ZC01.
  • f Use anti-FLAG antibody and BBM2 mRNA probe to determine the dCas9 protein expression site and ZmBBM2 gene expression pattern, and use Feulgen staining to observe the development of the embryo sac.
  • g and h Transcription levels of EA19 and ZC01 embryo sacs or ZmBBM2 and a5 BBM genes in young embryos before and after 0-9DAP pollination.
  • Figure 4 shows the apomictic phenotype caused by egg cell-specific CRISPRa5 ectopic activation of endogenous ZmBBM2 gene expression in maize.
  • b Fluorescence photos of haploid and diploid grain sections.
  • c Plant type and ear phenotype of diploid and haploid plants in ZC01 and EA19.
  • d DNA content of generated haploid plants (right) and control diploid plants (left) verified by flow cytometry.
  • e Haploid generation efficiency of 9 CRISPRa5 BBM lines.
  • the following examples facilitate a better understanding of the present invention, but do not limit the present invention.
  • the experimental methods in the following examples are all conventional methods unless otherwise specified.
  • the test materials used in the following examples were all purchased from conventional biochemical reagent stores unless otherwise specified.
  • the quantitative experiments in the following examples were repeated three times, and the results were averaged.
  • Example 1 Maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application
  • CRISPRa(a1-a6) construct the sequences of the ZmUbi promoter, nuclear localization signal (NLS), dCas9 (D10A, H830A), transcriptional activation domain, and NOS terminator were amplified and cloned into the pUC19 vector backbone.
  • the egg cell-specific expression CRISPRa5 vector was designed to replace the ZmUbi promoter with the Arabidopsis egg cell-specific promoter DD45, and the vector was constructed in the pUC19 vector skeleton.
  • the maize ZmU6-2 promoter is used for the expression of all sgRNAs.
  • the present invention selects the ZmDPS1 gene as the main target gene for evaluating the CRISPRa (a1-a6) tool.
  • the selected target points are the target points within -1 to -100bp upstream of TSS.
  • -100 to -200bp Design corresponding targets in the -200 to -300bp and 5'UTR regions to determine the optimal activation window of the CRISPRa5 tool.
  • the present invention also selected -1 to -100bp of the ZmTrxh, ZmES2 and ZmRCP1 genes to design target sites and construct a CRISPRa5 activation vector. Relevant gene targets and detailed information are shown in Table 1.
  • IMPLENP330 UV spectrophotometer In order to check the integrity of the RNA, take 500ng of total RNA and evaluate the RNA quality using 1% (w/v) agarose gel. The RNA bands are three clear, complete and undegraded samples for subsequent experiments. RNA samples were cleaned of residual genomic DNA contamination in the RNA using a gDNA wiper mix kit (Novizan Biotechnology, China).
  • the DFP haploid-assisted screening of corn transgenic lines created in the laboratory was used as experimental material. Before the material is spun, the ears are bagged. When the filament is 5-15cm long, the ears are removed. The surface of the ear shell is sterilized with 70% (v/v) ethanol, and the corn ovary structure is removed through micromanipulation. Use a scalpel to make incisions along both sides of the filaments of the ovule to remove the ovary wall structures on both sides of the ovary. Take 20 processed ovary slices containing embryo sac structures and collect them in a 3.5cm diameter sterile cell culture dish.
  • enzymatic solution contains 150mg mannitol, 19.65U Pectinase, 22U hemicellulase, 120U cellulase RS, use KOH to adjust pH to 5.0.
  • 1U is defined as: under optimal conditions, it is required to catalyze the conversion of 1 micromole of substrate into product per minute. The amount of enzyme is determined as one unit of activity).
  • the mixture was shaken on a shaking shaker at a speed of 140 rpm, incubated at 24°C for 60 min, and then manually separated embryonic sacs (ESs), synergid cells (SCs), antipodal cells (ANs), and central cells using a glass needle under a stereomicroscope. (CCs) and egg cells (ECs).
  • ESs embryonic sacs
  • SCs synergid cells
  • ANs antipodal cells
  • CCs central cells
  • CCs central cells
  • CCs central cells
  • ECs egg cells
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the full sequence is as follows As shown in SEQ ID No. 15. The complete sequence of the expression cassette 2 is as shown in SEQ ID No. 5).
  • the electroporation conditions were 5ms, 400V and 200 ⁇ F respectively. Electroporated samples were kept on ice for 10 min, transferred to a 6-well plate, and incubated in the dark at 28°C for 24 h. The DsRed2 fluorescence signal was detected using the DM1000LED system (LEICA, Germany), with an excitation wavelength of 554 nm and an amplification of 10 ⁇ 10 times.
  • the two sgRNA2.0 targets were transcribed using the same ZmU6-2 promoter and linked to the pBBM2TG1sgRNA2 and pBBMTG2sgRNA sequences with Arabidopsis At-tRNAGly. All primers used in the present invention are listed in Table 3.
  • Genomic DNA was isolated and purified from transgenic lines using the FastPure Plant DNA Isolation Mini Kit (Novozan Biotechnology, China). ZC01, EA13, EA14, EA15, EA19, EA21, EA22, EA28, EA34 and EA36 CRISPRa5 BBM genomic DNA were obtained respectively.
  • the multiplex quantitative droplet digital PCR (ddPCR) method was used to determine transgene copy number.
  • Primer3 Plus http://primer3plus.com/; Rozen and Skaletsky, 2000
  • the single-copy gene ZmADH1 was used as an internal quantitative control of diploid genes. Transgene copy number was measured using Bio-Rad QuantaSoft TM software (v1.6.6.0320), with the default setting being a threshold determination to distinguish positive and negative droplets. All primers and probes used in this study are listed in Table 3.
  • Wall-free ovaries were obtained by stripping the ovary walls under RNase-free conditions, and 3 biological replicates of 20 ovaries per group were collected. The isolation, quality control, and cDNA synthesis of total ovary RNA were performed according to the previously mentioned protocol. Identification of gene expression levels in ovary tissue was measured using RT-ddPCR technology, and candidate primers and probes for amplifying dCas9, ZmBBM2 and ZmeIF4 ⁇ genes were designed using Primer3Plus. All primers and probes are listed in Table 3.
  • Each ddPCR sample contained 11 ⁇ L of 2 ⁇ ddPCR Supermix for Probes (No dUTP) (Bio-Rad, USA), with a primer concentration of 900 nM and a probe concentration of 227 nM. At the same time, 2 ⁇ L cDNA was added; the final volume was adjusted to 22 ⁇ L with sterile ddH 2 O. Droplet counts were analyzed and absolute gene expression measurements were performed using Bio-Rad QuantaSoft TM software (v1.6.6.0320), with default settings for threshold determination to differentiate between positive and negative droplets.
  • the synergid cells, antipodal cells, central cells, and egg cells were isolated under a microscope. Take 10 cells of the same type for lysis, and take 1/10 of the original solution in the cell pool as RNA template. Single-cell reverse transcription droplet digital PCR technology was used to measure gene expression levels in cells, and the kit used One-Step RT ddPCR Advanced Kit for Probes (Bio-Rad, USA). Primer3Plus was used to design candidate gene primers and amplification probes for dCas9 and ZmBBM2 genes.
  • Each ddPCR sample contains 5 ⁇ L Supermix, 2 ⁇ L reverse transcriptase, 1 ⁇ L DDT, primer concentration 900 nM, probe concentration 250 nM, add 1 ⁇ L RNA lysis buffer; adjust the final volume to 20 ⁇ L with sterile ddH 2 O. Reactions were run on a QX200AutoDG ddPCR system. Analyze droplet counts and perform absolute gene expression measurements using Bio-Rad QuantaSoft TM software (v1.6.6.0320), with default settings for threshold determination to distinguish positive and negative droplets.
  • the ovary tissue was isolated and obtained, fixed with FAA solution (recipe: 10% formaldehyde, 5% acetic acid, 50% ethanol), and preserved in 70% (v/v) ethanol. Before sample preparation, it is necessary to rehydrate with different gradient concentrations of ethanol, then treat the DNA with 6M hydrochloric acid for hydrolysis, treat with 0.5% (w/v) periodic acid solution for 20 minutes, and incubate in 1% (w/v) acriflavine for 20 minutes. The samples were then dehydrated using an ethanol series, and the tissues were cleared with methyl salicylate-ethanol and pure methyl salicylate solutions. Finally, the samples were mounted on methyl salicylate slides and observed under a microscope.
  • Immunohistochemistry was used to locate the dCas9 protein in embryo sac sections. The collected corn kernels were fixed in FAA solution, and the samples were tissue embedded and sectioned using paraffin sectioning technique. Immunohistochemistry was performed using standard protocols. Immunohistochemistry was performed using rabbit anti-FLAG antibody ab205606 (Abcam Inc., USA) as the primary antibody and anti-goat anti-rabbit alkaline phosphatase conjugate A9919 (Sigma, USA) as the secondary antibody.
  • ZmBBM2 mRNA in situ hybridization was performed in embryo sac tissue using a digoxigenin-labeled antisense RNA probe to detect the accumulation of ZmBBM2 transcripts.
  • ZmBBM2 cDNA was cloned into a pUC19 vector containing a T7 promoter. NcoI restriction endonuclease ends are used to linearize the template and create a 5 "overhang" structure prior to transcription.
  • the probe was synthesized using in vitro transcription with T7 RNA polymerase kit combined with digoxigenin-11-UTP (Sigma, USA) for in vitro transcription, and the labeled ssRNA probe was synthesized. Degradation of mRNA must be avoided before and after embryo sac tissue fixation to avoid changes in the position of the mRNA.
  • DFP haploid-assisted screening of maize transgenic lines is maintained in our laboratory (document "Dong L, Li L, Liu C, et al. Genome editing and double-fluorescence proteins "As shown in Supplemental Figure 1, the eGFP gene was found to be expressed in embryos(green )and DsRED was expressed in endosperms(red).
  • the DFP cassette was then stably transformed into the maize variety ZC01.
  • the DFP cassette was then stably transformed into the maize variety ZC01, the same inbred line as the created haploid inducer line(Figure 1C )", that is, the transgenic line obtained by stably transforming the DFP cassette shown in Figure 1C into the corn variety ZC01 in this document).
  • This transgenic line contains two fluorescent reporter expression cassettes, one of which uses paste to mark the aleurone layer.
  • the flour layer-specific promoter LTP2 drives the DsRed2 gene, and this expression cassette can mark the aleurone layer of corn kernels with red fluorescence.
  • the other expression cassette marks embryos and uses an embryo-specific promoter to drive the eGFP gene. Therefore, DFP haploid-assisted screening of corn transgenic lines was used as a pollen donor to pollinate the KN5585 corn inbred line and 9 CRISPRa5 BBM lines. When the material is not induced to become haploid, the two sperm cells provided by the DFP line can Double fertilization is achieved to obtain a seed with a green embryo and a red endosperm aleurone layer.
  • DFP When the material egg cell develops into a haploid, one of the two sperm cells provided by DFP combines with the central cell to form a red endosperm aleurone layer. light, while the embryo has not received DFP sperm and therefore does not fluoresce.
  • the wild-type inbred line ZC01, 9 CRISPRa5BBM lines and DFP plants were grown in the greenhouse respectively. During flowering, DFP plants were selected as pollen donors for pollination of all other genotypes.
  • Haploid corn kernels were identified using a LUYOR-3415RG light source (LUYOR, California, USA) after maturity. Haploid frequency was calculated using the number of haploid grains for each line.
  • Flow cytometry was used to detect the ploidy level of cells.
  • the haploid seeds screened through DFP technology and the diploid control seeds were germinated together.
  • the seedlings grew to the three-leaf stage, about 1 g of young leaves were collected and cut into shreds with a razor blade, and filtered through an 80 ⁇ m nylon mesh.
  • Ploidy levels were analyzed on the FACSCalibur system (BD, USA), data were extracted by CellQuest software (BD, USA), and analyzed using ModFit software (Yerity, USA).
  • the present invention uses CRISPR-dCas9, which lacks endonuclease activity, as a DNA guidance tool and fuses different transcription activation response elements to construct six groups of CRISPRa vectors (a in Figure 1).
  • Vector design is divided into two strategies: In the first strategy, the dCas9 protein is fused to transcriptional activation response elements including ethylene response FACTOR2 (ERF2m, CRISPRa1), VP64 (4 ⁇ VP16, CRISPRa2) and NF-kB transgene from Arabidopsis thaliana.
  • CRISPRa vector formula activating subunit p65 (p65, CRISPRa3).
  • sgRNA1.0 guides dCas9 to the target DNA, and the transcriptional activation effector element recruits RNA polymerase for transcription and translation to achieve gene activation;
  • a modified sgRNA scaffold sgRNA2.0 which contains the basic skeleton of sgRNA1.0 and introduces two MS2-binding RNA aptamers at the 2nd and 4th stem-loop structures to recruit VP64 in addition to Additional transcriptional activators such as: ERF2m-HSF1 (CRISPRa4), p65-HSF1 (CRISPRa5) and VP64-HSF1 (CRISPRa6).
  • HSF1 human heat shock factor 1
  • dCas9-VP64 and other additional transcriptional activators are designed into a reading frame.
  • the fusion protein releases NLS-dCas9-VP64 through the T2A peptide, which binds to sgRNA2.0.
  • the MS2-bound RNA aptamer recruits the NLS-MS2-affetor-HSF1 structure, thereby forming an enhanced gene activation tool (a and b in Figure 1).
  • the expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 7), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA1.0 scaffold coding sequence (shown at positions 418-500 of SEQ ID No. 7).
  • the expression cassette 2 consists of the UBI promoter (No.
  • the expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 7), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA1.0 scaffold coding sequence (shown at positions 418-500 of SEQ ID No. 7).
  • the expression cassette 2 consists of the UBI promoter (No.
  • the expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 7), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA1.0 scaffold coding sequence (shown at positions 418-500 of SEQ ID No. 7).
  • the expression cassette 2 consists of the UBI promoter (No.
  • CRISPRa4 vector structure description A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site.
  • the expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 8), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA2.0 scaffold coding sequence (shown at positions 418-560 of SEQ ID No. 8).
  • the expression cassette 2 consists of the UBI promoter (positions 18-2013 of SEQ ID No. 4) and the NLS nuclear localization signal coding sequence (positions 2107-2127 of SEQ ID No. 4) from the 5' end to the 3' end. ), dCas9 coding gene (No. 2152-6252 of SEQ ID No. 4), VP64 coding gene (No. 6319-6495 of SEQ ID No. 4), self-cleaving polypeptide T2A coding gene (SEQ ID No. 4) No. 6496-6549), MS2 coding gene (No. 6565-6951 of SEQ ID No. 4), NLS nuclear localization signal coding sequence (No. 7006-7026 of SEQ ID No.
  • ERF2m coding gene (SEQ ID No. 4) No. 4 No. 7042-7770), HSF1 coding gene (SEQ ID No. 4 No. 7795-8169) and NOS transcription terminator (SEQ ID No. 4 No. 8204-8456).
  • the complete sequence of the expression cassette 2 is shown in SEQ ID No. 4.
  • CRISPRa5 vector structure description A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site.
  • the expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 8), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA2.0 scaffold coding sequence (shown at positions 418-560 of SEQ ID No. 8).
  • the expression cassette 2 consists of the UBI promoter (positions 18-2013 of SEQ ID No. 5) and the NLS nuclear localization signal coding sequence (positions 2107-2127 of SEQ ID No. 5) from the 5' end to the 3' end. ), dCas9 coding gene (No. 2152-6252 of SEQ ID No. 5), VP64 coding gene (No. 6319-6495 of SEQ ID No. 5), self-cleaving polypeptide T2A coding gene (SEQ ID No. 5 No. 6496-6549), MS2 coding gene (No. 6565-6951 of SEQ ID No. 5), NLS nuclear localization signal coding sequence (No. 7006-7026 of SEQ ID No.
  • CRISPRa6 vector structure description A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site.
  • the expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 8), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA2.0 scaffold coding sequence (shown at positions 418-560 of SEQ ID No. 8).
  • the expression cassette 2 consists of the UBI promoter (positions 18-2013 of SEQ ID No. 6) and the NLS nuclear localization signal coding sequence (positions 2107-2127 of SEQ ID No. 6) from the 5' end to the 3' end. ), dCas9 coding gene (No. 2152-6252 of SEQ ID No. 6), VP64 coding gene (No. 6319-6495 of SEQ ID No. 6), self-cleaving polypeptide T2A coding gene (SEQ ID No. 6) No. 6496-6549), MS2 coding gene (No. 6565-6951 of SEQ ID No. 6), NLS nuclear localization signal coding sequence (No. 7006-7026 of SEQ ID No.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 7. Positions 1-397 of SEQ ID No. 7 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-500 are the sgRNA1.0 scaffold coding sequence.
  • SEQ ID No. 1 The complete sequence of the expression cassette 2 is shown in SEQ ID No. 1.
  • No. 18-2013 of SEQ ID No. 1 is the UBI promoter
  • No. 2107-2127 is the NLS nuclear localization signal coding sequence
  • No. 2152-6252 is the dCas9 coding gene
  • No. 6319-7047 is the ERF2m coding gene
  • No. 7082 Position -7334 is the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 7. Positions 1-397 of SEQ ID No. 7 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-500 are the sgRNA1.0 scaffold coding sequence.
  • SEQ ID No. 2 The complete sequence of the expression cassette 2 is shown in SEQ ID No. 2.
  • No. 18-2013 of SEQ ID No. 2 is the UBI promoter
  • No. 2107-2127 is the NLS nuclear localization signal coding sequence
  • No. 2152-6252 is the dCas9 coding gene
  • No. 6319-6468 is the VP64 coding gene
  • No. 6503 Position -6755 is the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 7. Positions 1-397 of SEQ ID No. 7 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-500 are the sgRNA1.0 scaffold coding sequence.
  • the complete sequence of the expression cassette 2 is shown in SEQ ID No. 3.
  • No. 18-2013 of SEQ ID No. 3 is the UBI promoter
  • No. 2107-2127 is the NLS nuclear localization signal coding sequence
  • No. 2152-6252 is the dCas9 coding gene
  • No. 6319-6861 is the P65 coding gene
  • No. 6896 Position -7148 is the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 8. Positions 1-397 of SEQ ID No. 8 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • Positions 18-2013 of SEQ ID No. 4 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7770 are the genes encoding ERF2m, and positions 7795-8169 Positions 8204-8456 are the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 8. Positions 1-397 of SEQ ID No. 8 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 8. Positions 1-397 of SEQ ID No. 8 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • Positions 18-2013 of SEQ ID No. 6 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A
  • positions 6565-6951 are the genes encoding MS2
  • positions 7006-7026 are the NLS nuclear localization signal encoding sequences
  • positions 7042-7218 are the genes encoding VP64
  • positions 7243-7617 Positions are the HSF1 encoding gene
  • positions 7652-7904 are the NOS transcription terminator.
  • the above-mentioned CRISPRa(1-6) DPS1 was transferred into the mesophyll protoplasts of the B73 corn inbred line to activate the ZmDPS1 gene respectively, and high-activity CRISPRa gene activation tools were screened. From the RT-qPCR experimental data, it can be seen that compared with the protoplast control group that was not transformed into the vector, the transcription levels of all CRISPRa systems increased the ZmDPS1 expression level by at least 1.6 times, and the CRISPRa5 system produced the most powerful transcription activation (relative to 11.25 times compared with the control group), which shows that the designed CRISPRa gene activation tool can achieve highly active transcription activation in corn mesophyll protoplast cells (d in Figure 1).
  • the next step is to evaluate the activation window of the CRISPRa5 transcriptional activation tool. Therefore, we selected four regions of the 5'UTR of the ZmDPS1 gene, -1 to -100bp, -100 to -200bp, and -200 to -300bp upstream of the TSS to design sgRNA targets and a CRIPRa5 gene activation vector. The resulting four vector structures are described as follows:
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 11. Positions 1-397 of SEQ ID No. 11 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting the 5'UTR of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • the complete sequence of the expression cassette 2 is shown in SEQ ID No. 5.
  • Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 8. Positions 1-397 of SEQ ID No. 8 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 9. Positions 1-397 of SEQ ID No.
  • positions 398-417 are the sgRNA spacer sequence targeting -100 to -200 bp upstream of the TSS of the ZmDPS1 gene
  • positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • the complete sequence of the expression cassette 2 is shown in SEQ ID No. 5.
  • Positions 18-2013 of SEQ ID No. 5 are the UBI promoter
  • positions 2107-2127 are the NLS nuclear localization signal coding sequence
  • positions 2152-6252 are the dCas9 coding gene
  • positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A
  • positions 6565-6951 are the genes encoding MS2
  • positions 7006-7026 are the NLS nuclear localization signal encoding sequences
  • positions 7042-7584 are the genes encoding P65
  • positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 10. Positions 1-397 of SEQ ID No.
  • positions 398-417 are the sgRNA spacer sequence targeting -200 to -300 bp upstream of the TSS of the ZmDPS1 gene
  • positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • the complete sequence of the expression cassette 2 is shown in SEQ ID No. 5.
  • Positions 18-2013 of SEQ ID No. 5 are the UBI promoter
  • positions 2107-2127 are the NLS nuclear localization signal coding sequence
  • positions 2152-6252 are the dCas9 coding gene
  • positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A
  • positions 6565-6951 are the genes encoding MS2
  • positions 7006-7026 are the NLS nuclear localization signal encoding sequences
  • positions 7042-7584 are the genes encoding P65
  • positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
  • the expression level of ZmDPS1 was measured by transforming B73 corn inbred line mesophyll protoplasts and RT-qPCR. Through experiments, we found that the sgRNA target targeting the -1 to -100bp region showed the highest fold activation of ZmDPS1 expression level (e in Figure 1).
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 12. Positions 1-397 of SEQ ID No.
  • positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmTrxh gene
  • positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • the complete sequence of the expression cassette 2 is shown in SEQ ID No. 5.
  • Positions 18-2013 of SEQ ID No. 5 are the UBI promoter
  • positions 2107-2127 are the NLS nuclear localization signal coding sequence
  • positions 2152-6252 are the dCas9 coding gene
  • positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A
  • positions 6565-6951 are the genes encoding MS2
  • positions 7006-7026 are the NLS nuclear localization signal encoding sequences
  • positions 7042-7584 are the genes encoding P65
  • positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 13. Positions 1-397 of SEQ ID No. 13 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmES2 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. .
  • Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 14. Positions 1-397 of SEQ ID No. 14 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmRCP1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence.
  • Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
  • the CRISPRa5 gene activation tool increased the ZmTrxh gene expression by 375.38 times compared with the control group of protoplasts not transformed into the vector (f in Figure 1 ), the expression level of ZmES2 gene was increased by 5.33 times (g in Figure 1), and the expression level of ZmRCP1 gene was increased by 4.45 times (h in Figure 1).
  • the results show that the CRISPRa5 gene activation tools we designed (sgRNA2.0, dCas9-VP64 and MS2-p65-HSF1 effector) show the strongest targeted transcription activation ability in corn cells.
  • the DFP haploid established in the laboratory to assist in screening of corn transgenic lines (kept in our laboratory, literature: Dong L, Li L, Liu C, et al. Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant, 2018,11(9):1214-1217.)
  • the DFP transgenic line contains aleurone layer The red fluorescent protein DsRed2 gene driven by the specific promoter LTP2 promoter (Dong, L., Li, L., Liu, Changlin, Liu, Chenxu, Geng, S., Li, X., et al.
  • the LTP2 promoter is specifically expressed in the aleurone layer but not in egg cells.
  • the Arabidopsis egg cell-specific promoter DD45 promoter to express the CRISPRa5 system, and designed a gene activation target at -1 to -100bp upstream of the TSS of the LTP2 promoter and designed to target LTP2 Promoter of the oocyte-specific CRISPRa5 pLTP2::DsRed2 vector.
  • the resulting vector structure is described as follows: the recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site.
  • the expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 15. Positions 1-397 of SEQ ID No. 15 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the LTP2 promoter, and positions 418-560 are the sgRNA2.0 scaffold encoding sequence.
  • the complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No.
  • positions 2107-2127 are the NLS nuclear localization signal coding sequence
  • positions 2152-6252 are the dCas9 coding gene
  • positions 6319-6495 are the VP64 coding gene.
  • Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A
  • positions 6565-6951 are the genes encoding MS2
  • positions 7006-7026 are the NLS nuclear localization signal encoding sequences
  • positions 7042-7584 are the genes encoding P65
  • Positions 8018-8270 are the NOS transcription terminator.
  • ZmBBM2 has a close evolutionary relationship with PsASGR-BBML and OsBBM1 protein sequences.
  • the ZmBBM2 gene has similar functions to PsASGR-BBML and OsBBM1, and its egg cells are ectopic. Activity can induce parthenogenesis. Therefore, we constructed a CRISPRa5 BBM gene activator vector targeting the endogenous ZmBBM2 gene promoter for egg cell-specific expression (a in Figure 3).
  • the resulting vector structure is described as follows: the recombinant vector obtained by replacing the small fragment between the enzyme cutting sites HindIII and EcoRI of the CPB vector with the DNA fragment shown in SEQ ID No. 16.
  • Positions 1-1007 of SEQ ID No. 16 are expression cassettes for expressing sgRNA. Positions 1-397 of SEQ ID No. 16 are the ZmU6-2 promoter, positions 398-417 are the coding nucleic acids of the sgRNA spacer sequence targeting the -36 bp region upstream of the transcription start site (TSS) of the ZmBBM2 gene, and positions 418-553 Positions 554-630 are the sgRNA2.0 scaffold coding sequence, positions 554-630 are the Arabidopsis At-tRNAGly coding sequence, and positions 631-650 are the coding for the sgRNA spacer sequence targeting the -166 bp region upstream of the ZmBBM2 gene transcription start site (TSS).
  • TSS transcription start site
  • Nucleic acid, positions 651-793 are the sgRNA2.0 scaffold coding sequence. Positions 1008-8238 of SEQ ID No. 16 are expression cassettes for expressing functional proteins. Positions 1008-2017 of SEQ ID No. 16 are the Arabidopsis DD45 promoter, positions 2018-2086 are the 3 ⁇ Flag tag encoding nucleic acid, positions 2093-2113 are the SV40 nuclear localization signal encoding sequence, and positions 2114-6238 It is the gene encoding dCas9, positions 6239-6286 are the NLS nuclear localization signal encoding sequence, positions 6305-6454 are the VP64 encoding gene, positions 6482-6535 are the encoding genes for the self-cleaving polypeptide T2A, and positions 6551-6937 are the MS2 encoding Gene, positions 6992-7012 are the SV40 nuclear localization signal coding sequence, positions 7028-7570 are the P65 encoding gene, positions 7595-7966 are the HSF1
  • RNA of the above transgenic materials was extracted from the isolated embryo sacs.
  • RT-ddPCR reverse transcription ddPCR
  • EA19 showed the highest ZmBBM2 activation level among the five single-copy transgenic lines, we next selected the transgenic line EA19 for further verification.
  • micromanipulation technology to isolate ANs, SCs, CCs and ECs from EA19 transgenic line plants.
  • Single-cell RT-ddPCR technology to detect CRISPRa5 BBM and ZmBBM2 transcripts in EC (e in Figure 3)
  • the data showed that CRISPRa5 BBM can be expressed in egg cells, and CRISPRa5 BBM can activate the ectopic expression of the endogenous ZmBBM2 gene in maize. .
  • CRISPRa5 BBM and ZmBBM2 were found in unpollinated embryo sacs in EA19 material, and the expression of CRISPRa5 BBM and ZmBBM2 increased rapidly as pollination time went by (g and h in Figure 3).
  • the present invention used the corn mesophyll protoplast transient expression system to screen and identify 6 sets of CRISPR-based gene activation systems.
  • the results showed that the CRISPRa5 system composed of dCas9-VP64 and p65-HSF1 activation domains showed the strongest performance in corn mesophyll protoplasts.
  • the transcriptional activation activity of The transcriptional activation activity of the maize endogenous gene ZmBBM2 was then used as a target to design a CRISPRa5 egg cell ectopic activation endogenous ZmBBM2 gene vector, combined with Agrobacterium-mediated transformation of maize immature embryos, to obtain 9 transgenic events (EA13, EA15 , EA19, EA21, EA22, EA28, EA34, EA36), using in situ hybridization technology, immunohistochemistry, single-cell RT-ddPCR technology and other biochemical and molecular biology methods to detect transformation events, it was determined that CRISPRa5 is present in egg cells Expresses and can ectopically activate the endogenous ZmBBM2 gene to achieve high-level gene activation; finally, by combining flow cytometry, DFP haploid screening system and phenotypic observation, the present invention found that the CRISPRa5 gene activation system was used to ectopically activate maize eggs.
  • the present invention uses the CRISPRa5 gene activation system to mediate the ectopic expression of ZmBBM2 gene in maize egg cells, thereby inducing the occurrence of maize parthenogenetic haploidy.
  • This technology provides a method for the creation of engineered maize parthenogenetic haploid reproduction. new ways and methods.
  • the present invention provides an effective way to obtain crop haploid. At the same time, it provides new genetic resources for maize parthenogenesis and provides new ideas for the synthesis of apomixis in other crops.

Abstract

Disclosed in the present invention are an autonomous parthenogenetic haploid reproduction method for maternal cells and use thereof in breeding. The present invention provides a method for obtaining a parthenogenetic haploid of corn, comprising: introducing a system vector of a CRISPRa5 gene activation system that mediates ectopic expression of the ZmBBM2 gene in corn oocytes into a receptor corn to obtain a positive transgenic corn. The realization of ectopic activation of the ZmBBM2 gene in plant oocytes results in the direct development of the oocytes into haploid embryos. The present invention provides an effective way to obtain crop haploids. Meanwhile, the present invention provides a new gene resource for corn parthenogenesis and provides new insights into apomixis synthesis of other crops.

Description

一种母体细胞自主的孤雌单倍体生殖方法及其育种应用A maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application 技术领域Technical field
本发明涉及生物技术领域,具体涉及一种母体细胞自主的孤雌单倍体生殖方法及其育种应用。The invention relates to the field of biotechnology, and in particular to a maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application.
背景技术Background technique
植物无融合生殖过程不经精卵两性细胞受精作用而形成种子,通过该生殖方式能产生基因型与母本完全一致的种子,所以在固定杂种优势、提高育种效率、降低制种成本方面有巨大潜力,在杂交育种界被誉为实现新一次绿色革命的“无性生殖革命”(Spillane,C.,Steimer,A.,and Grossniklaus,U.(2001)Apomixis in agriculture:the quest for clonal seeds.Sex.Plant Reprod.,14,179–187.)。然而,玉米、水稻和小麦等主要粮食作物由于缺乏无融合生殖种质资源,限制了该技术在重要经济粮食作物上的推广与应用。2019年,王克剑研究员团队与Sundaresan教授团队分别独立突破性运用基因编辑技术,创制了有丝分裂型减数分裂(MiMe)结合单性生殖的克隆生殖技术,首次概念性验证了水稻无融合生殖技术的可行性。然而,孤雌单倍体结实率低仍是限制了该技术应用的瓶颈(Wang,C.,Liu,Q.,Shen,Y.,Hua,Y.,Wang,J.,Lin,J.,et al.(2019)Clonal seeds from hybrid rice by simultaneous genome engineering of meiosis and fertilization genes.Nat.Biotechnol.,37,283–286.Khanday,I.,Skinner,D.,Yang,B.,Mercier,R.,and Sundaresan,V.(2019)A male-expressed rice embryogenic trigger redirected for asexual propagation through seeds.Nature,565,91–95.)。因此,探索作物孤雌单倍体发生途径并解析其生物学机制,具有重要的理论意义与实践价值。The process of plant apomixis forms seeds without fertilization of sperm and egg cells. This reproductive method can produce seeds whose genotype is exactly the same as that of the female parent. Therefore, it has great potential in fixing hybrid vigor, improving breeding efficiency, and reducing seed production costs. Potential, known as the "apomixis revolution" in the hybrid breeding community to achieve a new green revolution (Spillane, C., Steimer, A., and Grossniklaus, U. (2001) Apomixis in agriculture: the quest for clonal seeds.Sex .Plant Reprod.,14,179–187.). However, the lack of apomictic germplasm resources in major food crops such as corn, rice and wheat limits the promotion and application of this technology in important economic food crops. In 2019, researcher Wang Kejian's team and Professor Sundaresan's team independently and independently used gene editing technology to create a clonal reproduction technology that combines mitotic meiosis (MiMe) with parthenogenesis, and for the first time conceptually verified the feasibility of rice apomixis technology. sex. However, the low parthenogenetic haploid seed setting rate is still a bottleneck that limits the application of this technology (Wang, C., Liu, Q., Shen, Y., Hua, Y., Wang, J., Lin, J., et al. (2019) Clonal seeds from hybrid rice by simultaneous genome engineering of meiosis and fertilization genes. Nat. Biotechnol., 37, 283–286. Khanday, I., Skinner, D., Yang, B., Mercier, R., and Sundaresan, V. (2019) A male-expressed rice embryogenic trigger redirected for asexual propagation through seeds. Nature, 565, 91–95.). Therefore, exploring the pathogenesis of crop parthenogenetic haploidy and analyzing its biological mechanism have important theoretical significance and practical value.
BBM转录因子属于APETALA2(AP2)家族,在调控花分生组织的发育、促进细胞增殖和形态发生中发挥了重要的调节作用(Kim S,Soltis P S,Wall K,et al.Phylogeny and domain evolution in the APETALA2-like gene family[J].Molecular Biology and Evolution,2006,23(1):107-120.)。前期研究表明,BBM转录因子卵细胞异位表达可显著提升植物孤雌单倍体频率。2015年Conner等人发现,狼尾草BBM基因PsASGR-BBML卵细胞异位表达可诱导后代形成36%的孤雌单倍体(Conner J A,Mookkan M,Huo H,et al.A parthenogenesis gene of apomict origin elicits embryo formation from unfertilized eggs in a sexual plant[J].Proceedings of the National Academy of Sciences,2015,112(36):11205-11210.)。2017年,Conner等人利用卵细胞特异启动子pDD45启动PsASGR-BBML基因,实现PsASGR-BBML基因在水稻及玉米卵细胞中异位表达并分别获得89%以及80%的孤雌单倍体后代,同时该研究在玉米中利用内源PsASGR-BBML基因启动子启动PsASGR-BBML仍可诱导单倍体形成但频率仅为47% (Conner J A,Podio M,Ozias A P.Haploid embryo production in rice and maize induced by PsASGR-BBML transgenes[J].Plant Reproduction,2017,30(1):41-52.)。2018年Khanday等人研究发现水稻卵细胞异位表达OsBBM1基因可显著提升水稻孤雌单倍体诱导频率(Khanday,I.,Skinner,D.,Yang,B.,Mercier,R.,and Sundaresan,V.(2019)A male-expressed rice embryogenic trigger redirected for asexual propagation through seeds.Nature,565,91–95.)。BBM transcription factors belong to the APETALA2 (AP2) family and play an important regulatory role in regulating the development of floral meristems and promoting cell proliferation and morphogenesis (Kim S, Soltis P S, Wall K, et al. Phylogeny and domain evolution in the APETALA2-like gene family[J].Molecular Biology and Evolution, 2006,23(1):107-120.). Previous studies have shown that ectopic expression of BBM transcription factors in egg cells can significantly increase the frequency of plant parthenogenetic haploids. In 2015, Conner et al. found that ectopic expression of the Pennisetum BBM gene PsASGR-BBML in egg cells could induce the formation of 36% parthenogenetic haploids in offspring (Conner J A, Mookkan M, Huo H, et al. A parthenogenesis gene of apomict origin elicits embryo formation from unfertilized eggs in a sexual plant[J].Proceedings of the National Academy of Sciences,2015,112(36):11205-11210.). In 2017, Conner et al. used the egg cell-specific promoter pDD45 to activate the PsASGR-BBML gene, achieving ectopic expression of the PsASGR-BBML gene in rice and corn egg cells and obtaining 89% and 80% of parthenogenetic haploid offspring respectively. Studies using the endogenous PsASGR-BBML gene promoter to activate PsASGR-BBML in maize can still induce haploid formation but the frequency is only 47% (Conner J A, Podio M, Ozias A P. Haploid embryo production in rice and maize induced by PsASGR-BBML transgenes[J].Plant Reproduction,2017,30(1):41-52.). In 2018, Khanday et al. found that ectopic expression of OsBBM1 gene in rice egg cells can significantly increase the frequency of rice parthenogenetic haploid induction (Khanday, I., Skinner, D., Yang, B., Mercier, R., and Sundaresan, V .(2019)A male-expressed rice embryogenic trigger redirected for asexual propagation through seeds.Nature,565,91–95.).
在真核细胞中,基因表达在发育和分化等过程中受到严格的控制。转录因子通过与靶基因增强子和启动子区域的特定DNA序列结合,是调控靶基因转录的关键调控因子(Spitz,F.and Furlong,E.E.M.(2012)Transcription factors:from enhancer binding to developmental control.Nat.Rev.Genet.,13,613–626.)。基于同样的原理,人工转录因子(Artificial Transcription Factors,ATFs)是将具有序列特异性的DNA结合域融合到激活结构域以调节基因表达,如锌指ATF(Wilson,K.A.,Chateau,M.L.,and Porteus,M.H.(2013)Design and development of artificial zinc finger transcription factors and zinc finger nucleases to the hTERT locus.Mol.Ther.Nucleic Acids,2,e87.)、转录激活因子样效应器ATF(Liu,W.,Rudis,M.R.,Peng,Y.,Mazarei,M.,Millwood,R.J.,Yang,J.-P.,et al.(2014)Synthetic TAL effectors for targeted enhancement of transgene expression in plants.Plant Biotechnol.J.,12,436–446.)以及CRISPR介导的基因激活系统(Gilbert,L.A.,Larson,M.H.,Morsut,L.,Liu,Z.,Brar,G.A.,Torres,S.E.,et al.(2013)CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes.Cell,154,442–451.)。与以前的ATFs工具不同,CRISPR介导的基因激活,也被称为CRISPRa,因其具备设计简单灵活且成本较低等优点,因此,近年来利用基因激活系统调控植物内源基因,研究目标基因表达网络一直是植物研究领域热点。基于基因编辑系统的基因表达调控工具的开发丰富了植物转录激活系列工具,如动物中已建立的几种高效CRISPRa工具:VP64-P65-Rta(VPR)、CRISPR-SAM以及Cas9-SunTag等。在植物中dCas9-VP64、dCas9-6TAL-VP128、CRISPR-Act2.0以及CRISPR-Act3.0等转录激活系列工具在植物中表现出较强的基因转录激活水平。然而目前为止尚未开发出适用于玉米卵细胞特异的或植株的基因激活系统。因此,建立玉米基因激活系统将为研究玉米卵细胞异位表达ZmBBM2基因以及探索BBM基因剂量效应与孤雌单倍体诱导频率之间的关系提供了强有力的技术手段。In eukaryotic cells, gene expression is tightly controlled during processes such as development and differentiation. Transcription factors are key regulators of target gene transcription by binding to specific DNA sequences in the enhancer and promoter regions of target genes (Spitz, F. and Furlong, E.E.M. (2012) Transcription factors: from enhancer binding to developmental control. Nat .Rev.Genet.,13,613–626.). Based on the same principle, artificial transcription factors (ATFs) fuse sequence-specific DNA-binding domains to activation domains to regulate gene expression, such as zinc-finger ATFs (Wilson, K.A., Chateau, M.L., and Porteus) ,M.H.(2013)Design and development of artificial zinc finger transcription factors and zinc finger nucleases to the hTERT locus.Mol.Ther.Nucleic Acids,2,e87.), transcription activator-like effector ATF (Liu, W., Rudis ,M.R.,Peng,Y.,Mazarei,M.,Millwood,R.J.,Yang,J.-P.,et al.(2014)Synthetic TAL effectors for targeted enhancement of transgene expression in plants.Plant Biotechnol.J.,12,436 –446.) and CRISPR-mediated gene activation system (Gilbert, L.A., Larson, M.H., Morsut, L., Liu, Z., Brar, G.A., Torres, S.E., et al. (2013) CRISPR-mediated modular RNA -guided regulation of transcription in eukaryotes. Cell,154,442–451.). Different from previous ATFs tools, CRISPR-mediated gene activation, also known as CRISPRa, has the advantages of simple, flexible design and low cost. Therefore, in recent years, gene activation systems have been used to regulate endogenous genes in plants and study target genes. Expression networks have always been a hot topic in the field of plant research. The development of gene expression regulation tools based on gene editing systems has enriched the series of plant transcription activation tools, such as several efficient CRISPRa tools that have been established in animals: VP64-P65-Rta (VPR), CRISPR-SAM, and Cas9-SunTag. In plants, a series of transcription activation tools such as dCas9-VP64, dCas9-6TAL-VP128, CRISPR-Act2.0 and CRISPR-Act3.0 have shown strong gene transcription activation levels in plants. However, no gene activation system suitable for maize egg cells or plants has been developed so far. Therefore, the establishment of a maize gene activation system will provide a powerful technical means for studying the ectopic expression of ZmBBM2 gene in maize egg cells and exploring the relationship between the BBM gene dosage effect and the frequency of parthenogenetic haploid induction.
发明公开invention disclosure
本发明的目的是提供一种母体细胞自主的孤雌单倍体生殖方法及其育种应用。The purpose of the present invention is to provide a maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application.
第一方面,本发明要求保护一种获得玉米孤雌单倍体的方法。In a first aspect, the present invention claims a method for obtaining maize parthenogenetic haploid.
本发明的目的是提供一种母体细胞自主的孤雌单倍体生殖方法及其育种应用。The purpose of the present invention is to provide a maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application.
第一方面,本发明要求保护一种获得玉米孤雌单倍体的方法。In a first aspect, the present invention claims a method for obtaining maize parthenogenetic haploid.
本发明所要求保护的获得玉米孤雌单倍体的方法,可包括如下步骤:将用于介导玉米卵细胞异位表达ZmBBM2基因的CRISPRa5基因激活系统的系统载体导入受体玉米,得到阳性转基因玉米;所述转基因阳性玉米的部分胚囊中的卵细胞能直接发育成为单倍体胚,形成单倍体种子。The method for obtaining maize parthenogenetic haploids claimed by the present invention may include the following steps: introducing the system vector of the CRISPRa5 gene activation system used to mediate the ectopic expression of the ZmBBM2 gene in maize egg cells into the recipient maize to obtain positive transgenic maize. ; The egg cells in some embryo sacs of the transgenic positive corn can directly develop into haploid embryos and form haploid seeds.
所述系统载体能够表达dCas9-VP64融合蛋白,能够表达MS2-P65-HSF1融合蛋白,并且能够表达靶向ZmBBM2基因转录起始点上游-200至-1bp位置的一个或若干个sgRNA,所述sgRNA为sgRNA2.0。The system vector can express dCas9-VP64 fusion protein, can express MS2-P65-HSF1 fusion protein, and can express one or several sgRNA targeting the -200 to -1 bp position upstream of the ZmBBM2 gene transcription start point, and the sgRNA is sgRNA2.0.
进一步地,所述系统载体上含有如下2个表达盒:Furthermore, the system vector contains the following 2 expression cassettes:
表达盒1:用于表达2个所述sgRNA;在所述表达盒1中,含有2个sgRNA的编码序列,2个所述sgRNA的编码序列以拟南芥At-tRNAGly编码序列连接,由同一个启动子启动转录;Expression cassette 1: used to express two sgRNAs; in the expression cassette 1, it contains the coding sequences of two sgRNAs, and the coding sequences of the two sgRNAs are connected with the Arabidopsis thaliana At-tRNAGly coding sequence and are composed of the same A promoter initiates transcription;
表达盒2:用于表达功能蛋白,所述功能蛋白为将所述dCas9-VP64融合蛋白和所述MS2-P65-HSF1融合蛋白通过自裂解多肽T2A融合而成;在所述表达盒2中,所述功能蛋白由植物卵细胞特异性启动子启动转录。Expression cassette 2: used to express functional proteins, which are formed by fusing the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein through the self-cleaving polypeptide T2A; in the expression cassette 2, The functional protein is transcribed by a plant egg cell-specific promoter.
在本发明的具体实施方式中,2个sgRNA分别靶向ZmBBM2基因TSS上游的-36及-166bp区域。In a specific embodiment of the present invention, two sgRNAs respectively target the -36 and -166 bp regions upstream of the TSS of the ZmBBM2 gene.
在本发明的一些案例中,在所述表达盒1中,所述启动子为ZmU6-2启动子。In some cases of the present invention, in the expression cassette 1, the promoter is the ZmU6-2 promoter.
在本发明的一些案例中,在所述表达盒2中,所述植物卵细胞特异性启动子为拟南芥DD45启动子。In some cases of the present invention, in the expression cassette 2, the plant egg cell-specific promoter is the Arabidopsis thaliana DD45 promoter.
在本发明的一些案例中,所述dCas9-VP64融合蛋白和所述MS2-P65-HSF1融合蛋白均含有核定位信号。In some cases of the present invention, both the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein contain nuclear localization signals.
在本发明的具体实施方式中,所述表达盒1自5’端到3’端依次由所述ZmU6-2启动子、靶向ZmBBM2基因转录起始点(TSS)上游-36bp区域的sgRNA间隔序列的编码核酸、sgRNA2.0支架编码序列、所述拟南芥At-tRNAGly编码序列、靶向ZmBBM2基因转录起始点(TSS)上游-166bp区域的sgRNA间隔序列的编码核酸、所述sgRNA2.0支架编码序列组成。In a specific embodiment of the invention, the expression cassette 1 consists of the ZmU6-2 promoter and the sgRNA spacer sequence targeting the -36 bp region upstream of the ZmBBM2 gene transcription start point (TSS) from the 5' end to the 3' end. The coding nucleic acid, the sgRNA2.0 scaffold coding sequence, the Arabidopsis thaliana At-tRNAGly coding sequence, the coding nucleic acid of the sgRNA spacer sequence targeting the -166bp region upstream of the ZmBBM2 gene transcription start site (TSS), the sgRNA2.0 scaffold Composed of coding sequences.
在本发明的具体实施方式中,所述表达盒2自5’端到3’端依次由所述拟南芥DD45启动子、3×Flag标签编码核酸、SV40核定位信号编码序列、dCas9编码基因、NLS核定位信号编码序列、VP64编码基因、所述自裂解多肽T2A的编码基因、MS2编码基因、所述SV40核定位信号编码序列、P65编码基因、HSF1编码基因和转录终止子NOS组成。In a specific embodiment of the present invention, the expression cassette 2 consists of the Arabidopsis DD45 promoter, 3×Flag tag encoding nucleic acid, SV40 nuclear localization signal encoding sequence, and dCas9 encoding gene in sequence from the 5' end to the 3' end. , NLS nuclear localization signal coding sequence, VP64 coding gene, the coding gene of the self-cleaving polypeptide T2A, the MS2 coding gene, the SV40 nuclear localization signal coding sequence, the P65 coding gene, the HSF1 coding gene and the transcription terminator NOS.
在所述表达盒1中,所述ZmU6-2启动子如SEQ ID No.16的第1-397位所示;所述靶向ZmBBM2基因转录起始点(TSS)上游-36bp区域的sgRNA间隔序列的编码核酸如SEQ ID No.16的第398-417位所示;所述sgRNA2.0支架编码序列如SEQ ID  No.16的第418-553位或第651-793位所示;所述拟南芥At-tRNAGly编码序列如SEQ ID No.16的第554-630位所示;所述靶向ZmBBM2基因转录起始点(TSS)上游-166bp区域的sgRNA间隔序列的编码核酸如SEQ ID No.16的第631-650位所示。In the expression cassette 1, the ZmU6-2 promoter is as shown in positions 1-397 of SEQ ID No. 16; the sgRNA spacer sequence targeting the -36bp region upstream of the ZmBBM2 gene transcription start point (TSS) The coding nucleic acid is shown in positions 398-417 of SEQ ID No. 16; the sgRNA2.0 scaffold coding sequence is shown in positions 418-553 or 651-793 of SEQ ID No. 16; the proposed The Arabidopsis At-tRNAGly coding sequence is shown in positions 554-630 of SEQ ID No. 16; the coding nucleic acid of the sgRNA spacer sequence targeting the -166 bp region upstream of the ZmBBM2 gene transcription start point (TSS) is shown in SEQ ID No. Positions 631-650 of 16 are shown.
在所述表达盒2,所述拟南芥DD45启动子如SEQ ID No.16的第1008-2017位所示;所述3×Flag标签编码核酸如SEQ ID No.16的第2018-2086位所示;所述SV40核定位信号编码序列如SEQ ID No.16的第2093-2113位或第6992-7012位所示;所述dCas9编码基因如SEQ ID No.16的第2114-6238位所示;所述NLS核定位信号编码序列如SEQ ID No.16的第6239-6286位所示;所述VP64编码基因如SEQ ID No.16的第6305-6454位所示;所述自裂解多肽T2A的编码基因如SEQ ID No.16的第6482-6535位所示;所述MS2编码基因如SEQ ID No.16的第6551-6937位所示;所述P65编码基因如SEQ ID No.16的第7028-7570位所示;所述HSF1编码基因如SEQ ID No.16的第7595-7966位所示;所述转录终止子NOS如SEQ ID No.16的第7967-8238位所示。In the expression cassette 2, the Arabidopsis DD45 promoter is as shown in positions 1008-2017 of SEQ ID No. 16; the 3×Flag tag encoding nucleic acid is as shown in positions 2018-2086 of SEQ ID No. 16 As shown; the SV40 nuclear localization signal coding sequence is as shown in positions 2093-2113 or 6992-7012 of SEQ ID No. 16; the dCas9 encoding gene is as shown in positions 2114-6238 of SEQ ID No. 16 shown; the NLS nuclear localization signal coding sequence is shown in positions 6239-6286 of SEQ ID No. 16; the VP64 encoding gene is shown in positions 6305-6454 of SEQ ID No. 16; the self-cleaving polypeptide The coding gene of T2A is shown in positions 6482-6535 of SEQ ID No. 16; the MS2 coding gene is shown in positions 6551-6937 of SEQ ID No. 16; the P65 coding gene is shown in SEQ ID No. 16 The HSF1 coding gene is shown in positions 7595-7966 of SEQ ID No. 16; the transcription terminator NOS is shown in positions 7967-8238 of SEQ ID No. 16.
所述表达盒1的核苷酸序列如SEQ ID No.16的第1-1007位所示。The nucleotide sequence of the expression cassette 1 is shown in positions 1-1007 of SEQ ID No. 16.
所述表达盒2的核苷酸序列如SEQ ID No.16的第1008-8238位所示。The nucleotide sequence of the expression cassette 2 is shown in positions 1008-8238 of SEQ ID No. 16.
所述CRISPRa5基因激活系统的系统载体为含有SEQ ID No.16所示DNA片段的质粒。在本发明的具体实施方式中,具体为将SEQ ID No.16所示DNA片段替换CPB载体(本实验室保存,文章Li C,Liu C,Qi X,et al.RNA‐guided Cas9 as an in vivo desired‐target mutator in maize[J].Plant biotechnology journal,2017,15(12):1566-1576.)的酶切位点HindIII和EcoRI之间的小片段后得到的重组载体。The system vector of the CRISPRa5 gene activation system is a plasmid containing the DNA fragment shown in SEQ ID No. 16. In the specific embodiment of the present invention, specifically, the DNA fragment shown in SEQ ID No. 16 is replaced with the CPB vector (saved by this laboratory, article Li C, Liu C, Qi X, et al. RNA-guided Cas9 as an in The recombinant vector obtained by digesting the small fragment between the HindIII and EcoRI enzyme sites of vivo desired-target mutator in maize[J].Plant biotechnology journal, 2017,15(12):1566-1576.).
所述方法还可包括如下步骤:以玉米单倍体辅助筛选系作为花粉供体,对所述阳性转基因玉米进行授粉,从而筛选出玉米孤雌单倍体。The method may also include the following steps: using the maize haploid auxiliary screening line as a pollen donor to pollinate the positive transgenic maize, thereby screening out maize parthenogenetic haploids.
在本发明的具体实施方式中,所述受体玉米为玉米自交系KN5585。所述玉米单倍体辅助筛选系为DFP单倍体辅助筛选玉米转基因株系(记载于文献“Dong L,Li L,Liu C,et al.Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant,2018,11(9):1214-1217.”中的“As shown in Supplemental Figure 1,the eGFP gene was found to be expressed in embryos(green)and DsRED was expressed in endosperms(red).The DFP cassette was then stably transformed into the maize variety ZC01,the same inbred line as the created haploid inducer line(Figure 1C)”,即该文献中将图1C所示的DFP盒稳定转化到玉米品种ZC01中后得到的转基因株系,公众可从申请人处获得,仅可用于重复本发明实验使用,不得他用)。该转基因系含有两个荧光报告表达盒,一个是标记糊粉层的利用糊粉层特异性启动子LTP2启动DsRed2基因,该表达盒可将玉米籽粒的糊粉层标记成红色荧光。另一个表达盒为标记胚的利用胚特异性启动子启动eGFP基因。因此将DFP单倍体辅助筛选玉米转基因株系作为 花粉供体对所述受体玉米进行授粉,当材料未诱导成为单倍体时DFP系提供的两个精细胞可以实现双受精进而得到胚发绿光而胚乳糊粉层发红光的籽粒,当材料卵细胞发育成单倍体时,DFP提供的两个精细胞其中一个与中央细胞结合形成胚乳糊粉层发红光,而胚未接受DFP精子因此不发荧光。In a specific embodiment of the present invention, the recipient corn is the corn inbred line KN5585. The corn haploid-assisted screening system is DFP haploid-assisted screening of corn transgenic lines (recorded in the document "Dong L, Li L, Liu C, et al. Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant, 2018,11(9):1214-1217."As shown in Supplemental Figure 1, the eGFP gene was found to be expressed in embryos(green)and DsRED was expressed in endosperms(red).The DFP cassette was then stably transformed into the maize variety ZC01, the same inbred line as the created haploid inducer line (Figure 1C)", that is, the DFP cassette shown in Figure 1C was stably transformed into corn in this document The transgenic strain obtained after the selection of variety ZC01 can be obtained by the public from the applicant and can only be used to repeat the experiments of the present invention and cannot be used for other purposes). This transgenic line contains two fluorescent reporter expression cassettes, one of which marks the aleurone layer and uses the aleurone layer-specific promoter LTP2 to activate the DsRed2 gene. This expression cassette can mark the aleurone layer of corn kernels with red fluorescence. The other expression cassette marks embryos and uses an embryo-specific promoter to drive the eGFP gene. Therefore, DFP haploid-assisted screening of corn transgenic lines is used as a pollen donor to pollinate the recipient corn. When the material is not induced to become haploid, the two sperm cells provided by the DFP line can achieve double fertilization and obtain embryonic hair. A kernel that glows green and the aleurone layer of the endosperm glows red. When the egg cell develops into a haploid, one of the two sperm cells provided by DFP combines with the central cell to form an aleurone layer of the endosperm that glows red, but the embryo does not receive DFP. Sperm therefore do not fluoresce.
第二方面,本发明要求保护前文第一方面中所述的用于介导玉米卵细胞异位表达ZmBBM2基因的所述CRISPRa5基因激活系统的系统载体。In a second aspect, the present invention claims to protect the system vector of the CRISPRa5 gene activation system for mediating the ectopic expression of ZmBBM2 gene in maize egg cells as described in the first aspect.
第三方面,本发明要求保护前文第二方面中所述的系统载体在诱发玉米孤雌单倍体发生中的应用。In a third aspect, the present invention claims the use of the system vector described in the second aspect in inducing parthenogenetic haploidy in maize.
第四方面,本发明要求保护一种基于CRISPR的基因激活系统。In a fourth aspect, the present invention claims a CRISPR-based gene activation system.
本发明所要求保护的基于CRISPR的基因激活系统的系统载体能够表达dCas9-VP64融合蛋白,能够表达MS2-P65-HSF1融合蛋白,并且能够表达靶向目标基因靶标区域的一个或若干个sgRNA,所述sgRNA为sgRNA2.0。The system vector of the CRISPR-based gene activation system claimed by the present invention can express dCas9-VP64 fusion protein, can express MS2-P65-HSF1 fusion protein, and can express one or several sgRNA targeting the target region of the target gene, so The sgRNA is sgRNA2.0.
进一步地,所述系统载体上含有如下2个表达盒:Furthermore, the system vector contains the following 2 expression cassettes:
表达盒1’:用于表达所述sgRNA;Expression cassette 1': used to express the sgRNA;
表达盒2’:用于表达功能蛋白,所述功能蛋白为将所述dCas9-VP64融合蛋白和所述MS2-P65-HSF1融合蛋白通过自裂解多肽T2A融合而成。Expression cassette 2': used to express functional proteins, which are formed by fusing the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein through the self-cleaving polypeptide T2A.
更进一步地,在所述表达盒1’中,用于启动所述sgRNA转录的启动子为U6启动子。在所述表达盒2’中,用于启动所述功能蛋白转录的启动子为UBI启动子。所述dCas9-VP64融合蛋白和所述MS2-P65-HSF1融合蛋白均含有核定位信号。Furthermore, in the expression cassette 1', the promoter used to initiate the transcription of the sgRNA is the U6 promoter. In the expression cassette 2', the promoter used to initiate the transcription of the functional protein is the UBI promoter. Both the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein contain nuclear localization signals.
在本发明的具体实施方式中,所述表达盒1’自5’端到3’端依次由所述U6启动子、靶向靶向目标基因靶标区域的sgRNA间隔序列的编码核酸、sgRNA2.0支架编码序列组成。In a specific embodiment of the present invention, the expression cassette 1' consists of the U6 promoter, the coding nucleic acid of the sgRNA spacer sequence targeting the target gene target region, and sgRNA2.0 in sequence from the 5' end to the 3' end. Scaffold coding sequence composition.
在本发明的具体实施方式中,所述表达盒2’自5’端到3’端依次由所述UBI启动子、NLS核定位信号编码序列、dCas9编码基因、VP64编码基因、所述自裂解多肽T2A的编码基因、MS2编码基因、所述NLS核定位信号编码序列、P65编码基因、HSF1编码基因和NOS转录终止子组成。In a specific embodiment of the invention, the 2' expression cassette consists of the UBI promoter, the NLS nuclear localization signal coding sequence, the dCas9 coding gene, the VP64 coding gene, and the self-cleavage sequence from the 5' end to the 3' end. It consists of the coding gene for polypeptide T2A, the MS2 coding gene, the NLS nuclear localization signal coding sequence, the P65 coding gene, the HSF1 coding gene and the NOS transcription terminator.
在所述表达盒1’中,所述U6启动子的核苷酸序列如SEQ ID No.7-15的第1-397位所示;所述sgRNA2.0支架编码序列如SEQ ID No.8-15的第418-560位所示。In the expression cassette 1', the nucleotide sequence of the U6 promoter is as shown in positions 1-397 of SEQ ID No.7-15; the sgRNA2.0 scaffold coding sequence is as SEQ ID No.8 -15 is shown in bits 418-560.
在所述表达盒2’中,所述UBI启动子的核苷酸序列如SEQ ID No.5的第18-2013位所示;所述NLS核定位信号编码序列如SEQ ID No.5的第2107-2127位或第7006-7026位所示;所述dCas9编码基因如SEQ ID No.5的第2152-6252位所示;所述VP64编码基因如SEQ ID No.5的第6319-6495位所示;所述自裂解多肽T2A的编码基因如SEQ ID No.5的第6496-6549位所示;所述MS2编码基因如SEQ ID No.5的第6565-6951位所示;所述P65编码基因如SEQ ID No.5的第7042-7584位所示;所述HSF1编码基因如SEQ ID No.5的第7609-7983位所示;所述NOS转录终止子如SEQ ID No.5的第8018-8270位所示。所述表达盒2’的序列如SEQ ID No.5 所示。In the expression cassette 2', the nucleotide sequence of the UBI promoter is as shown in positions 18-2013 of SEQ ID No. 5; the NLS nuclear localization signal coding sequence is as shown in position 18 of SEQ ID No. 5. The dCas9 encoding gene is shown in positions 2107-2127 or 7006-7026; the dCas9 encoding gene is shown in positions 2152-6252 of SEQ ID No. 5; the VP64 encoding gene is shown in positions 6319-6495 of SEQ ID No. 5 As shown; the coding gene of the self-cleaving polypeptide T2A is shown in positions 6496-6549 of SEQ ID No. 5; the MS2 coding gene is shown in positions 6565-6951 of SEQ ID No. 5; the P65 The coding gene is as shown in positions 7042-7584 of SEQ ID No.5; the HSF1 encoding gene is as shown in positions 7609-7983 of SEQ ID No.5; the NOS transcription terminator is as shown in SEQ ID No.5 Shown at bits 8018-8270. The sequence of 2' of the expression cassette is shown in SEQ ID No. 5.
第五方面,本发明要求保护前文第四方面所述基于CRISPR的基因激活系统在对目的基因进行基因激活中的应用。In the fifth aspect, the present invention claims the application of the CRISPR-based gene activation system described in the fourth aspect in gene activation of target genes.
附图说明Description of the drawings
图1为评估玉米叶肉原生质体中CRISPRa(1-6)工具转录激活能力。a:CRISPRa载体。dCas9,化脓性链球菌Cas9缺乏内切酶活性(突变D10A,H840A);ERF2m,拟南芥乙烯反应修饰转录激活因子2(ERF2);HSF1,人热休克因子1;MS2,噬菌体外壳蛋白;NLS,核定位信号;P65,NF-kB反式激活亚基P65;pU6,ZmU6-2启动子;pUBI,玉米泛素启动子;sgRNA1.0,Cas9常规sgRNA;sgRNA2.0,包含两个MS2茎环的sgRNA骨架;T,终止子;T2A,自裂解多肽;VP64,转录激活因子VP16的4个拷贝。b:a4-a6系统的三个组成部分示意图。c-d:RT-qPCR检测ZmDPS1表达。c:ZmDPS1位点示意图。第一个转录起始位点(TSS)被命名为+1。d:CRISPRa(1-6)介导ZmDPS1的激活表达。以玉米泛素基因(NCBIGenBank:U29159.1)为参考。使用2 -△△Ct方法计算相对表达水平。e:以ZmDPS1基因作为sgRNA靶点,评估CRISPRa5工具的窗口活性。f-h:以ZmTrxh(f)、ZmES2(g)和ZmRCP1(h)基因为靶点验证CRISPRa5系统。所有值均为n=3个生物重复的±sem。 Figure 1 shows the evaluation of the transcriptional activation ability of CRISPRa(1-6) tools in corn mesophyll protoplasts. a: CRISPRa vector. dCas9, Streptococcus pyogenes Cas9 lacks endonuclease activity (mutations D10A, H840A); ERF2m, Arabidopsis ethylene response modified transcriptional activator 2 (ERF2); HSF1, human heat shock factor 1; MS2, phage coat protein; NLS , nuclear localization signal; P65, NF-kB transactivating subunit P65; pU6, ZmU6-2 promoter; pUBI, maize ubiquitin promoter; sgRNA1.0, Cas9 conventional sgRNA; sgRNA2.0, contains two MS2 stems Circular sgRNA backbone; T, terminator; T2A, self-cleaving polypeptide; VP64, 4 copies of the transcriptional activator VP16. b: Schematic diagram of the three components of the a4-a6 system. cd: RT-qPCR detection of ZmDPS1 expression. c: Schematic diagram of ZmDPS1 site. The first transcription start site (TSS) is named +1. d: CRISPRa(1-6) mediates the activated expression of ZmDPS1. The maize ubiquitin gene (NCBIGenBank: U29159.1) was used as a reference. Relative expression levels were calculated using the 2 -ΔΔCt method. e: Using the ZmDPS1 gene as the sgRNA target to evaluate the window activity of the CRISPRa5 tool. fh: Validation of the CRISPRa5 system using ZmTrxh(f), ZmES2(g) and ZmRCP1(h) genes as targets. All values are ±s.e.m. from n=3 biological replicates.
图2为玉米雌配子细胞的分离和卵细胞(EC)原生质体中CRISPRa5系统的验证。a-h:玉米雌性配子细胞的单细胞水平的显微操作和分离。a:玉米子房结构示意图。AC反足细胞、CC中央细胞、EC卵细胞、SC助细胞。b:分离的玉米子房及胚囊ES(红色箭头)。c:在体式显微镜下人工微操作分离胚囊。d:通过部分酶解从ES中获得单个细胞。e-h:分离的AC(e)、SC(f)、CC(g)和EC(h)。i和j:卵细胞特异CRISPRa5卵细胞靶向激活稳定转化DFP系的LTP2pro-DsRed2表达盒。i:靶向LTP2pro的CRISPRa5载体示意图。j:通过RsRed2荧光显示,在一个分离的EC中,DsRed2蛋白在卵细胞中实现异位表达。BAR=50μm。Figure 2 shows the isolation of maize female gametocytes and validation of the CRISPRa5 system in egg cell (EC) protoplasts. a–h: Single-cell level micromanipulation and isolation of maize female gametocytes. a: Schematic diagram of corn ovary structure. AC antipodal cells, CC central cells, EC egg cells, SC synergid cells. b: Isolated maize ovary and embryo sac ES (red arrow). c: The embryo sac is separated by manual micromanipulation under a stereomicroscope. d: Single cells obtained from ES by partial enzymatic dissociation. e-h: separated AC(e), SC(f), CC(g) and EC(h). i and j: Egg cell-specific CRISPRa5 egg cell targeting activates the LTP2pro-DsRed2 expression cassette of the stably transformed DFP line. i: Schematic diagram of CRISPRa5 vector targeting LTP2pro. j: RsRed2 fluorescence shows that in an isolated EC, DsRed2 protein is ectopically expressed in egg cells. BAR=50μm.
图3为基于卵细胞特异的CRISPRa5 BBM靶向激活玉米内源ZmBBM2基因。a:CRISPRa5 BBM载体示意图。3×FLAG,3个串联的FLAG表位标签;Bar,BlpR基因。其他组件如图1中a所示的CRISPRa5载体所示。b:基于ddPCR技术的转化拷贝数的鉴定。T3代的9个独立转化子分别为EA13、EA14、EA15、EA19、EA21、EA22、EA28、EA34和EA36。a5 BBM的探针采用FAM标记,而ZmADH1内参探针选择HEX进行标记。左上面板,靶标基因与内参基因1D参考数据;右上面板,计算出a5 BBM拷贝数;下面板,是典型的EA19系的2D参考数据。c-f:未授粉的雌性配子体中进行基因表达水平的定量分析。c:RT-ddPCR鉴定胚囊中a5 BBM表达水平。d:c所示RT-ddPCR鉴定胚囊中ZmBBM2基因表达水平。定量结果转化为每1μg总RNA的目标基因表达拷贝数。e:单细胞RT-ddPCR技术鉴定EA19及ZC01中分离的ECs、ANs、SCs和CCs的表达水平。f:通过抗FLAG抗体和BBM2 mRNA探针确定dCas9蛋白表达部位及ZmBBM2基因表达模及用Feulgen染色法观察胚囊的发育情况。g和h:0-9DAP授粉前后EA19及ZC01胚囊或幼胚ZmBBM2和a5 BBM 基因在胚囊中的转录水平。 Figure 3 shows the targeted activation of the maize endogenous ZmBBM2 gene based on egg cell-specific CRISPRa5 BBM . a: Schematic diagram of CRISPRa5 BBM vector. 3×FLAG, 3 tandem FLAG epitope tags; Bar, BlpR gene. Other components are shown in the CRISPRa5 vector shown in Figure 1, a. b: Identification of transformed copy number based on ddPCR technology. The nine independent transformants of the T3 generation were EA13, EA14, EA15, EA19, EA21, EA22, EA28, EA34 and EA36. The a5 BBM probe was labeled with FAM, while the ZmADH1 internal reference probe was labeled with HEX. The upper left panel shows the 1D reference data of target genes and internal reference genes; the upper right panel shows the calculated a5 BBM copy number; the lower panel shows the 2D reference data of a typical EA19 line. cf: Quantitative analysis of gene expression levels in unpollinated female gametophytes. c: RT-ddPCR identifies a5 BBM expression level in embryo sac. d: RT-ddPCR identification of ZmBBM2 gene expression levels in embryo sac shown in c. Quantitative results were converted into target gene expression copy number per 1 μg of total RNA. e: Single-cell RT-ddPCR technology identifies the expression levels of ECs, ANs, SCs and CCs isolated from EA19 and ZC01. f: Use anti-FLAG antibody and BBM2 mRNA probe to determine the dCas9 protein expression site and ZmBBM2 gene expression pattern, and use Feulgen staining to observe the development of the embryo sac. g and h: Transcription levels of EA19 and ZC01 embryo sacs or ZmBBM2 and a5 BBM genes in young embryos before and after 0-9DAP pollination.
图4为基于卵细胞特异的CRISPRa5异位激活玉米内源ZmBBM2基因表达引起的无融合生殖表型。a:EA19卵巢中双胚囊的发生率为3.9%。Bar=50μm。b-e:CRISPRa5 BBM母体单倍体。b:单倍体与二倍体籽粒切片荧光照片。c:ZC01、EA19中二倍体和单倍体植株株型及果穗表型。d:通过流式细胞术验证产生的单倍体植株(右)和对照二倍体植株(左)的DNA含量。e:9个CRISPRa5 BBM株系单倍体生成效率。 Figure 4 shows the apomictic phenotype caused by egg cell-specific CRISPRa5 ectopic activation of endogenous ZmBBM2 gene expression in maize. a: The incidence of double embryonic sacs in EA19 ovaries is 3.9%. Bar=50μm. be: CRISPRa5 BBM maternal haploid. b: Fluorescence photos of haploid and diploid grain sections. c: Plant type and ear phenotype of diploid and haploid plants in ZC01 and EA19. d: DNA content of generated haploid plants (right) and control diploid plants (left) verified by flow cytometry. e: Haploid generation efficiency of 9 CRISPRa5 BBM lines.
实施发明的最佳方式Best way to implement your invention
以下的实施例便于更好地理解本发明,但并不限定本发明。下述实施例中的实验方法,如无特殊说明,均为常规方法。下述实施例中所用的试验材料,如无特殊说明,均为自常规生化试剂商店购买得到的。以下实施例中的定量试验,均设置三次重复实验,结果取平均值。The following examples facilitate a better understanding of the present invention, but do not limit the present invention. The experimental methods in the following examples are all conventional methods unless otherwise specified. The test materials used in the following examples were all purchased from conventional biochemical reagent stores unless otherwise specified. The quantitative experiments in the following examples were repeated three times, and the results were averaged.
实施例1、母体细胞自主的孤雌单倍体生殖方法及其育种应用Example 1. Maternal cell-autonomous parthenogenetic haploid reproduction method and its breeding application
一、实验材料与方法1. Experimental materials and methods
(一)实验材料(1) Experimental materials
B73玉米自交系(未米生物科技(江苏)有限公司提供)、KN5585玉米自交系(未米生物科技(江苏)有限公司提供)、pUC19载体(北京全式金生物技术有限公司)、CPB载体(本实验室保存,记载于文献“Li C,Liu C,Qi X,et al.RNA‐guided Cas9as an in vivo desired‐target mutator in maize[J].Plant biotechnology journal,2017,15(12):1566-1576.”中,公众可从申请人处获得,仅可用于重复本发明实验使用,不得他用)、DFP单倍体辅助筛选玉米转基因株系(本实验室保存,记载于文献“Dong L,Li L,Liu C,et al.Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant,2018,11(9):1214-1217.”中的“As shown in Supplemental Figure 1,the eGFP gene was found to be expressed in embryos(green)and DsRED was expressed in endosperms(red).The DFP cassette was then stably transformed into the maize variety ZC01,the same inbred line as the created haploid inducer line(Figure 1C)”,即该文献中将图1C所示的DFP盒稳定转化到玉米品种ZC01中后得到的转基因株系,公众可从申请人处获得,仅可用于重复本发明实验使用,不得他用)、农杆菌感受态(EHA105)细胞(北京庄盟国际生物基因科技有限公司)。B73 corn inbred line (provided by Weimi Biotechnology (Jiangsu) Co., Ltd.), KN5585 corn inbred line (provided by Weimi Biotechnology (Jiangsu) Co., Ltd.), pUC19 vector (Beijing Quanshijin Biotechnology Co., Ltd.), CPB Vector (preserved in this laboratory, recorded in the document "Li C, Liu C, Qi :1566-1576.", which can be obtained by the public from the applicant and can only be used to repeat the experiments of the present invention and cannot be used for other purposes), DFP haploid-assisted screening of corn transgenic lines (preserved by this laboratory and recorded in the literature " Dong L, Li L, Liu C, et al.Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant, 2018,11(9):1214-1217." As shown in Supplemental Figure 1, the eGFP gene was found to be expressed in embryos(green)and DsRED was expressed in endosperms(red).The DFP cassette was then stably transformed into the maize variety ZC01, the same inbred line as the created haploid inducer line (Figure 1C)", that is, the transgenic line obtained by stably transforming the DFP cassette shown in Figure 1C into the corn variety ZC01 in this document. The public can obtain it from the applicant and can only be used to repeat the experiment of the present invention. used, no other use), Agrobacterium competent (EHA105) cells (Beijing Zhuangmeng International Biogene Technology Co., Ltd.).
(二)实验方法(2) Experimental methods
1、CRISPRa载体的构建1. Construction of CRISPRa vector
对于CRISPRa(a1-a6)构建,以ZmUbi启动子、核定位信号(NLS)、dCas9(D10A,H830A)、转录激活域以及NOS终止子的序列被扩增并克隆在pUC19载体骨架中。For the CRISPRa(a1-a6) construct, the sequences of the ZmUbi promoter, nuclear localization signal (NLS), dCas9 (D10A, H830A), transcriptional activation domain, and NOS terminator were amplified and cloned into the pUC19 vector backbone.
卵细胞特异表达CRISPRa5载体设计是利用拟南芥卵细胞特异型启动子DD45替换ZmUbi启动子,并将载体构建在pUC19载体骨架中。其中玉米ZmU6-2启动子用于所有sgRNA的表达。The egg cell-specific expression CRISPRa5 vector was designed to replace the ZmUbi promoter with the Arabidopsis egg cell-specific promoter DD45, and the vector was constructed in the pUC19 vector skeleton. The maize ZmU6-2 promoter is used for the expression of all sgRNAs.
对于靶点设计,本发明选择ZmDPS1基因作为评估CRISPRa(a1-a6)工具的主要目标基因,选择靶点为TSS上游-1至-100bp内的靶点,同时还选择了-100至-200bp、-200至-300bp以及5’UTR区域设计相应靶点来判断CRISPRa5工具的最佳激活窗口。为了进一步验证CRISPRa5工具在玉米中具有较高且稳定的基因激活能力,本发明还选择了ZmTrxh、ZmES2以及ZmRCP1基因的-1至-100bp处分别设计了靶点构建了CRISPRa5激活载体。相关基因靶点及详细信息见表1。For target design, the present invention selects the ZmDPS1 gene as the main target gene for evaluating the CRISPRa (a1-a6) tool. The selected target points are the target points within -1 to -100bp upstream of TSS. At the same time, -100 to -200bp, Design corresponding targets in the -200 to -300bp and 5'UTR regions to determine the optimal activation window of the CRISPRa5 tool. In order to further verify that the CRISPRa5 tool has high and stable gene activation ability in maize, the present invention also selected -1 to -100bp of the ZmTrxh, ZmES2 and ZmRCP1 genes to design target sites and construct a CRISPRa5 activation vector. Relevant gene targets and detailed information are shown in Table 1.
表1、CRISPRa基因激活系统及其靶基因序列位点等信息Table 1. CRISPRa gene activation system and its target gene sequence and other information
Figure PCTCN2022113576-appb-000001
Figure PCTCN2022113576-appb-000001
Figure PCTCN2022113576-appb-000002
Figure PCTCN2022113576-appb-000002
2、玉米叶肉原生质体中CRISPRa(a1-a6)的离体评价2. In vitro evaluation of CRISPRa(a1-a6) in corn mesophyll protoplasts
收集约5×10 5个原生质体用于RNA分析。总RNA的提取使用FastPure Plant Total RNA Isolation Kit试剂盒,根据制造商的说明(诺唯赞生物技术,中国)使用IMPLENP330紫外分光光度计对总RNA进行分光光度法定量并舍弃A260/A280<1.8样品。为了检查RNA的完整性,取500ng总RNA,利用1%(w/v)琼脂糖凝胶RNA质量进行评估,RNA条带为三条清晰完整且未降解的样品用作后续实验。RNA样品使用gDNA wiper mix试剂盒(诺唯赞生物技术,中国)清除RNA中残留的基因组DNA污染。取1μg总RNA利用HiScript III 1st Strand cDNA Synthesis Kit试剂盒及寡聚物(dT)引物,按照制造商的说明(诺唯赞生物技术,中国)合成。实验在BIO-RAD CFX96实时荧光定量系统(BIO-RAD,USA)上进行。使用10μL Taq Pro Universal SYBR qPCR Master Mix(诺唯赞生物技术,中国),每个引物0.5μL(终浓度200nM),2μL的cDNA,最终体积为20μL。 所有的样品设置三个生物学重复。以玉米泛素基因(NCBIGenBank:U29159.1)为内参。采用自动阈值用于定量循环(Cq)的测定。采用2 -△△Ct方法计算表达水平。实验所用引物及PCR扩增效率列于表2。 Approximately 5 × 10 protoplasts were collected for RNA analysis. Total RNA was extracted using the FastPure Plant Total RNA Isolation Kit, and total RNA was quantified spectrophotometrically using an IMPLENP330 UV spectrophotometer according to the manufacturer's instructions (Norwegian Biotechnology, China) and samples with A260/A280 <1.8 were discarded. . In order to check the integrity of the RNA, take 500ng of total RNA and evaluate the RNA quality using 1% (w/v) agarose gel. The RNA bands are three clear, complete and undegraded samples for subsequent experiments. RNA samples were cleaned of residual genomic DNA contamination in the RNA using a gDNA wiper mix kit (Novizan Biotechnology, China). 1 μg of total RNA was synthesized using HiScript III 1st Strand cDNA Synthesis Kit and oligo (dT) primers according to the manufacturer's instructions (Novozan Biotechnology, China). The experiment was performed on the BIO-RAD CFX96 real-time fluorescence quantification system (BIO-RAD, USA). Use 10 μL Taq Pro Universal SYBR qPCR Master Mix (Norwegian Biotechnology, China), 0.5 μL of each primer (final concentration 200 nM), 2 μL of cDNA, and a final volume of 20 μL. All samples were set up in triplicate biological replicates. The maize ubiquitin gene (NCBIGenBank: U29159.1) was used as the internal reference. An automatic threshold was used for determination of cycle quantitation (Cq). Expression levels were calculated using the 2 -ΔΔCt method. The primers used in the experiment and the PCR amplification efficiency are listed in Table 2.
表2、Real-time RT-PCR所用引物详细信息Table 2. Details of primers used in Real-time RT-PCR
Figure PCTCN2022113576-appb-000003
Figure PCTCN2022113576-appb-000003
3、玉米卵细胞分离技术的建立3. Establishment of corn egg cell isolation technology
以实验室前期创建的DFP单倍体辅助筛选玉米转基因株系为实验材料。材料吐丝前对雌穗进行套袋,待花丝5-15cm时取下雌穗,并用70%(v/v)乙醇对雌穗外壳表面进行消毒,显微操作取出玉米子房结构。用手术刀沿胚珠的花丝两侧切口,去除子房两侧的子房壁结构。取20片处理好的含有胚囊结构的子房切片,并将其收集在3.5cm直径无菌细胞培养皿中,加入1.5mL酶解液(1.5mL酶解液中含有150mg甘露醇,19.65U果胶酶,22U半纤维素酶,120U纤维素酶RS,利用KOH调节pH到5.0。其中,1U规定为:在最适条件下,每分钟内催化1微摩尔底物转化为产物所需的酶量定为一个活力单位)。将混合物在震荡摇床上以140rpm的速度震荡,在24℃下孵育60min,然后在体式显微镜下用玻璃针管手动分离胚胎囊(ESs)、助细胞(SCs)、对足细胞(ANs)、中央细胞(CCs)和卵细胞(ECs)。将分离的EC原生质体分别转移到100μL电穿孔缓冲液(配方:0.5M甘露醇,4mM氯化钾,4mM MES,pH5.0)中,加入10μg CRISPRa5 pLTP2::DsRed2载体质粒DNA(结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.15所示。所述表达盒2全序列如SEQ ID No.5所示)。利用Bio-Rad基因脉冲器,电穿孔条件分别为5ms、400V和200μF。电穿孔样品在冰上保存10min,转移到6孔板中,在28℃的黑暗中孵育24h。DsRed2荧光信号采用DM1000LED系统(LEICA,德国)进行检测,激发波长为554nm,放大倍数为10×10倍。 The DFP haploid-assisted screening of corn transgenic lines created in the laboratory was used as experimental material. Before the material is spun, the ears are bagged. When the filament is 5-15cm long, the ears are removed. The surface of the ear shell is sterilized with 70% (v/v) ethanol, and the corn ovary structure is removed through micromanipulation. Use a scalpel to make incisions along both sides of the filaments of the ovule to remove the ovary wall structures on both sides of the ovary. Take 20 processed ovary slices containing embryo sac structures and collect them in a 3.5cm diameter sterile cell culture dish. Add 1.5mL enzymatic solution (1.5mL enzymatic solution contains 150mg mannitol, 19.65U Pectinase, 22U hemicellulase, 120U cellulase RS, use KOH to adjust pH to 5.0. Among them, 1U is defined as: under optimal conditions, it is required to catalyze the conversion of 1 micromole of substrate into product per minute. The amount of enzyme is determined as one unit of activity). The mixture was shaken on a shaking shaker at a speed of 140 rpm, incubated at 24°C for 60 min, and then manually separated embryonic sacs (ESs), synergid cells (SCs), antipodal cells (ANs), and central cells using a glass needle under a stereomicroscope. (CCs) and egg cells (ECs). Transfer the isolated EC protoplasts to 100 μL electroporation buffer (recipe: 0.5M mannitol, 4mM potassium chloride, 4mM MES, pH 5.0), and add 10 μg of CRISPRa5 pLTP2::DsRed2 vector plasmid DNA (structural description: The recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site at the same time. The expression cassette 1 is an expression cassette for expressing sgRNA, and the full sequence is as follows As shown in SEQ ID No. 15. The complete sequence of the expression cassette 2 is as shown in SEQ ID No. 5). Using Bio-Rad gene pulser, the electroporation conditions were 5ms, 400V and 200μF respectively. Electroporated samples were kept on ice for 10 min, transferred to a 6-well plate, and incubated in the dark at 28°C for 24 h. The DsRed2 fluorescence signal was detected using the DM1000LED system (LEICA, Germany), with an excitation wavelength of 554 nm and an amplification of 10 × 10 times.
4、CRISPRa5 BBM载体构建 4. CRISPRa5 BBM vector construction
为了获得卵细胞特异的CRISPRa5 BBM载体,我们提取拟南芥基因组并克隆出拟南芥DD45启动子,通过无缝克隆生物技术将DD45启动子及CRISPRa5元件扩增并组装到CPB载体中。设计靶向ZmBBM2基因启动子的两个靶点sgRNA2.0s:pBBM2TG1sgRNA2.0和pBBM2TG2sgRNA2.0,设计靶点位于ZmBBM2基因TSS的上游区域-36及-166bp区域。两个sgRNA2.0靶点用同一个ZmU6-2启动子进行转录并以拟南芥At-tRNAGly连接pBBM2TG1sgRNA2和pBBMTG2sgRNA序列。本发明中使用的所有引物均见表3。 In order to obtain the egg cell-specific CRISPRa5 BBM vector, we extracted the Arabidopsis genome and cloned the Arabidopsis DD45 promoter. We amplified and assembled the DD45 promoter and CRISPRa5 elements into the CPB vector through seamless cloning biotechnology. Two target sgRNA2.0s targeting the ZmBBM2 gene promoter were designed: pBBM2TG1sgRNA2.0 and pBBM2TG2sgRNA2.0. The designed targets were located in the -36 and -166bp regions of the TSS upstream region of the ZmBBM2 gene. The two sgRNA2.0 targets were transcribed using the same ZmU6-2 promoter and linked to the pBBM2TG1sgRNA2 and pBBMTG2sgRNA sequences with Arabidopsis At-tRNAGly. All primers used in the present invention are listed in Table 3.
表3、CRISPRa5 BBM载体构建及表达量分析等引物和探针序列 Table 3. Primer and probe sequences for CRISPRa5 BBM vector construction and expression analysis
Figure PCTCN2022113576-appb-000004
Figure PCTCN2022113576-appb-000004
5、农杆菌介导的玉米稳定转化5. Agrobacterium-mediated stable transformation of corn
将CRISPRa5 BBM质粒转化农杆菌感受态(EHA105)细胞,取阳性单克隆并进行培养,当培养基OD值600为0.3时在KN5585玉米自交系(未米生物科技(江苏)有限公司提供)中进行了农杆菌介导的幼胚转化。 Transform the CRISPRa5 BBM plasmid into Agrobacterium competent cells (EHA105), take positive single clones and culture them. When the OD value 600 of the medium is 0.3, in the KN5585 corn inbred line (provided by Weimi Biotechnology (Jiangsu) Co., Ltd.) Agrobacterium-mediated transformation of immature embryos was performed.
6、CRISPRa5 BBM株系转基因元件拷贝数的确定 6. Determination of copy number of transgenic elements in CRISPRa5 BBM lines
使用FastPure Plant DNA Isolation Mini Kit试剂盒(诺唯赞生物技术,中国)从转基因株系中分离和纯化基因组DNA。分别获得ZC01、EA13、EA14、EA15、EA19、EA21、EA22、EA28、EA34和EA36CRISPRa5 BBM基因组DNA。采用多重定量液滴数字PCR(ddPCR)方法确定转基因拷贝数。利用Primer3 Plus(http://primer3plus.com/;Rozen and Skaletsky,2000)设计了CRISPRa5 BBM元件和内源内参基因的候选引物和探针。以单拷贝基因ZmADH1作为二倍体基因的内定量对照。转基因拷贝数的测量采用Bio-Rad QuantaSoft TM软件(v1.6.6.0320),默认设置为阈值测定以区分阳性和阴性液滴。本研究中使用的所有引物和探针均列于表3。 Genomic DNA was isolated and purified from transgenic lines using the FastPure Plant DNA Isolation Mini Kit (Novozan Biotechnology, China). ZC01, EA13, EA14, EA15, EA19, EA21, EA22, EA28, EA34 and EA36 CRISPRa5 BBM genomic DNA were obtained respectively. The multiplex quantitative droplet digital PCR (ddPCR) method was used to determine transgene copy number. Primer3 Plus (http://primer3plus.com/; Rozen and Skaletsky, 2000) was used to design candidate primers and probes for the CRISPRa5 BBM element and endogenous reference gene. The single-copy gene ZmADH1 was used as an internal quantitative control of diploid genes. Transgene copy number was measured using Bio-Rad QuantaSoft TM software (v1.6.6.0320), with the default setting being a threshold determination to distinguish positive and negative droplets. All primers and probes used in this study are listed in Table 3.
7、胚囊CRISPRa5 BBM和ZmBBM2的表达鉴定 7. Expression identification of CRISPRa5 BBM and ZmBBM2 in embryo sac
在无RNA酶的条件下,通过剥离子房壁,获取无壁子房,收集3个生物重复,每个每组20个子房。子房总RNA的分离、质量控制和cDNA的合成按照前面提到的操作方案进行。子房组织基因表达水平鉴定利用RT-ddPCR技术进行测量,采用Primer3Plus设计了扩增dCas9、ZmBBM2和ZmeIF4α基因的候选引物和探针。所有引物和探针列于表3。每个ddPCR样品包含11μL 2×ddPCR Supermix for Probes(No dUTP)(Bio-Rad,美国),引物浓度900nM,探针浓度227nM,同时加入2μL cDNA;用无菌ddH 2O调整最终体积至22μL。分析液滴计数,并使用Bio-Rad QuantaSoft TM软件(v1.6.6.0320)进行绝对基因表达测量,默认设置为阈值测定以区分阳性和阴性液滴。 Wall-free ovaries were obtained by stripping the ovary walls under RNase-free conditions, and 3 biological replicates of 20 ovaries per group were collected. The isolation, quality control, and cDNA synthesis of total ovary RNA were performed according to the previously mentioned protocol. Identification of gene expression levels in ovary tissue was measured using RT-ddPCR technology, and candidate primers and probes for amplifying dCas9, ZmBBM2 and ZmeIF4α genes were designed using Primer3Plus. All primers and probes are listed in Table 3. Each ddPCR sample contained 11 μL of 2×ddPCR Supermix for Probes (No dUTP) (Bio-Rad, USA), with a primer concentration of 900 nM and a probe concentration of 227 nM. At the same time, 2 μL cDNA was added; the final volume was adjusted to 22 μL with sterile ddH 2 O. Droplet counts were analyzed and absolute gene expression measurements were performed using Bio-Rad QuantaSoft software (v1.6.6.0320), with default settings for threshold determination to differentiate between positive and negative droplets.
8、CRISPRa5 BBM和ZmBBM2在单细胞水平上的表达 8. Expression of CRISPRa5 BBM and ZmBBM2 at the single cell level
在显微镜下分离出助细胞、反足细胞、中央细胞、卵细胞。取10个同一类型细胞进行裂解,取细胞池中的1/10原液作为RNA模板。采用单细胞反转录微滴数字PCR技术测量细胞中基因表达水平,试剂盒采用One-Step RT ddPCR Advanced Kit for Probes(Bio-Rad,USA)。利用Primer3Plus设计了dCas9和ZmBBM2基因的候选基因引物和扩增探针。每个ddPCR样品含有5μL Supermix,2μL逆转录酶,1μL DDT,引物浓度900nM,探针浓度250nM,加入1μL RNA裂解液;用无菌ddH 2O调整最终体积至20μL。反应在QX200AutoDG ddPCR系统上运行。分析液滴计数并使用Bio-Rad QuantaSoft TM软件(v1.6.6.0320)进行绝对基因表达测量,默认设置为阈值测定以区分阳性和阴性液滴。 The synergid cells, antipodal cells, central cells, and egg cells were isolated under a microscope. Take 10 cells of the same type for lysis, and take 1/10 of the original solution in the cell pool as RNA template. Single-cell reverse transcription droplet digital PCR technology was used to measure gene expression levels in cells, and the kit used One-Step RT ddPCR Advanced Kit for Probes (Bio-Rad, USA). Primer3Plus was used to design candidate gene primers and amplification probes for dCas9 and ZmBBM2 genes. Each ddPCR sample contains 5 μL Supermix, 2 μL reverse transcriptase, 1 μL DDT, primer concentration 900 nM, probe concentration 250 nM, add 1 μL RNA lysis buffer; adjust the final volume to 20 μL with sterile ddH 2 O. Reactions were run on a QX200AutoDG ddPCR system. Analyze droplet counts and perform absolute gene expression measurements using Bio-Rad QuantaSoft software (v1.6.6.0320), with default settings for threshold determination to distinguish positive and negative droplets.
9、Feulgen染色方法9. Feulgen staining method
分离并获得子房组织,利用FAA溶液(配方:10%甲醛,5%醋酸,50%乙醇)将子房固定,并在70%(v/v)乙醇中保存。样品制备前需用不同梯度浓度乙醇进 行再水合,然后用6M盐酸处理DNA水解,用0.5%(w/v)periodic acid溶液处理20min,并在1%(w/v)吖啶黄素中孵育20min。然后样品用乙醇系列脱水,用水杨酸甲酯-乙醇和纯水杨酸甲酯溶液将组织透明化。最后,将样品安装在水杨酸甲酯的载玻片上,并在显微镜下观察。The ovary tissue was isolated and obtained, fixed with FAA solution (recipe: 10% formaldehyde, 5% acetic acid, 50% ethanol), and preserved in 70% (v/v) ethanol. Before sample preparation, it is necessary to rehydrate with different gradient concentrations of ethanol, then treat the DNA with 6M hydrochloric acid for hydrolysis, treat with 0.5% (w/v) periodic acid solution for 20 minutes, and incubate in 1% (w/v) acriflavine for 20 minutes. The samples were then dehydrated using an ethanol series, and the tissues were cleared with methyl salicylate-ethanol and pure methyl salicylate solutions. Finally, the samples were mounted on methyl salicylate slides and observed under a microscope.
10、免疫组化10. Immunohistochemistry
采用免疫组化方法定位胚囊切片中的dCas9蛋白。收集到的玉米粒在FAA溶液中被固定,利用石蜡切片技术对样品进行了组织包埋和切片。免疫组化采用标准方案进行实施。以兔抗FLAG抗体ab205606(AbcamInc.,USA)为一抗,以抗山羊抗兔碱性磷酸酶偶联物A9919(Sigma,USA)为二抗,进行免疫组化。Immunohistochemistry was used to locate the dCas9 protein in embryo sac sections. The collected corn kernels were fixed in FAA solution, and the samples were tissue embedded and sectioned using paraffin sectioning technique. Immunohistochemistry was performed using standard protocols. Immunohistochemistry was performed using rabbit anti-FLAG antibody ab205606 (Abcam Inc., USA) as the primary antibody and anti-goat anti-rabbit alkaline phosphatase conjugate A9919 (Sigma, USA) as the secondary antibody.
11、mRNA原位杂交11. mRNA in situ hybridization
用地高辛标记的反义RNA探针在胚囊组织中进行ZmBBM2 mRNA原位杂交,检测ZmBBM2转录本的积累。将ZmBBM2cDNA克隆到一个含有T7启动子的pUC19载体中。NcoI限制性内切酶末端用于线性化模板,并在转录前创建一个5“overhang”结构。探针合成采用in vitro transcription with T7 RNA polymerase试剂盒结合digoxigenin-11-UTP(Sigma,USA)进行体外转录,合成了标记的ssRNA探针。胚囊组织固定前后必须避免mRNA降解,以免造成mRNA位置的变化。ZmBBM2 mRNA in situ hybridization was performed in embryo sac tissue using a digoxigenin-labeled antisense RNA probe to detect the accumulation of ZmBBM2 transcripts. ZmBBM2 cDNA was cloned into a pUC19 vector containing a T7 promoter. NcoI restriction endonuclease ends are used to linearize the template and create a 5 "overhang" structure prior to transcription. The probe was synthesized using in vitro transcription with T7 RNA polymerase kit combined with digoxigenin-11-UTP (Sigma, USA) for in vitro transcription, and the labeled ssRNA probe was synthesized. Degradation of mRNA must be avoided before and after embryo sac tissue fixation to avoid changes in the position of the mRNA.
12、孤雌单倍体种子发生效率评估12. Evaluation of parthenogenetic haploid seed generation efficiency
对于孤雌单倍体筛选采用双荧光蛋白DFP方法,DFP单倍体辅助筛选玉米转基因株系本实验室保存(文献“Dong L,Li L,Liu C,et al.Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant,2018,11(9):1214-1217.”中的“As shown in Supplemental Figure 1,the eGFP gene was found to be expressed in embryos(green)and DsRED was expressed in endosperms(red).The DFP cassette was then stably transformed into the maize variety ZC01.The DFP cassette was then stably transformed into the maize variety ZC01,the same inbred line as the created haploid inducer line(Figure 1C)”,即该文献中将图1C所示的DFP盒稳定转化到玉米品种ZC01中后得到的转基因株系),该转基因系含有两个荧光报告表达盒,一个是标记糊粉层的利用糊粉层特异性启动子LTP2启动DsRed2基因,该表达盒可将玉米籽粒的糊粉层标记成红色荧光。另一个表达盒为标记胚的利用胚特异性启动子启动eGFP基因。因此将DFP单倍体辅助筛选玉米转基因株系作为花粉供体对KN5585玉米自交系及9个CRISPRa5 BBM株系进行授粉,当材料未诱导成为单倍体时DFP系提供的两个精细胞可以实现双受精进而得到胚发绿光而胚乳糊粉层发红光的籽粒,当材料卵细胞发育成单倍体时,DFP提供的两个精细胞其中一个与中央细胞结合形成胚乳糊粉层发红光,而胚未接受DFP精子因此不发荧光。材料种植过程中野生型自交系ZC01、9株CRISPRa5BBM株系和DFP植株分别在温 室中生长。在开花期间,选择DFP植株作为花粉供体,用于所有其他基因型的授粉。成熟后的穗和籽粒利用LUYOR-3415RG光源(LUYOR,加利福尼亚州,美国)对单倍体玉米粒进行鉴定。用每个品系的单倍体粒数来计算单倍体频率。 For parthenogenetic haploid screening, the double fluorescent protein DFP method is used. DFP haploid-assisted screening of maize transgenic lines is maintained in our laboratory (document "Dong L, Li L, Liu C, et al. Genome editing and double-fluorescence proteins "As shown in Supplemental Figure 1, the eGFP gene was found to be expressed in embryos(green )and DsRED was expressed in endosperms(red).The DFP cassette was then stably transformed into the maize variety ZC01.The DFP cassette was then stably transformed into the maize variety ZC01, the same inbred line as the created haploid inducer line(Figure 1C )", that is, the transgenic line obtained by stably transforming the DFP cassette shown in Figure 1C into the corn variety ZC01 in this document). This transgenic line contains two fluorescent reporter expression cassettes, one of which uses paste to mark the aleurone layer. The flour layer-specific promoter LTP2 drives the DsRed2 gene, and this expression cassette can mark the aleurone layer of corn kernels with red fluorescence. The other expression cassette marks embryos and uses an embryo-specific promoter to drive the eGFP gene. Therefore, DFP haploid-assisted screening of corn transgenic lines was used as a pollen donor to pollinate the KN5585 corn inbred line and 9 CRISPRa5 BBM lines. When the material is not induced to become haploid, the two sperm cells provided by the DFP line can Double fertilization is achieved to obtain a seed with a green embryo and a red endosperm aleurone layer. When the material egg cell develops into a haploid, one of the two sperm cells provided by DFP combines with the central cell to form a red endosperm aleurone layer. light, while the embryo has not received DFP sperm and therefore does not fluoresce. During the material planting process, the wild-type inbred line ZC01, 9 CRISPRa5BBM lines and DFP plants were grown in the greenhouse respectively. During flowering, DFP plants were selected as pollen donors for pollination of all other genotypes. Haploid corn kernels were identified using a LUYOR-3415RG light source (LUYOR, California, USA) after maturity. Haploid frequency was calculated using the number of haploid grains for each line.
13、倍性鉴定13. Ploidy identification
采用流式细胞术检测细胞的倍性水平。将通过DFP技术筛选得到单倍体种子和二倍体对照种子一起发芽,待幼苗生长到三叶阶段收集约1g幼叶并用刀片切丝,用80μm尼龙网过滤。1000r/min离心5min,沉淀用1mL OTTOI缓冲(配方:100mmol柠檬酸,0.5%(V/V)吐温20,pH=2.0)和2mL OTTOII缓冲(配方:400mmol磷酸氢二钠,pH=8)和50μL 1.5mg/L碘化丙啶在黑暗中染色20min。倍性水平在FACSCalibur系统(BD,USA)上进行分析,数据由CellQuest公司软件(BD,USA)提取,并用ModFit软件(Yerity,美国)进行分析。Flow cytometry was used to detect the ploidy level of cells. The haploid seeds screened through DFP technology and the diploid control seeds were germinated together. When the seedlings grew to the three-leaf stage, about 1 g of young leaves were collected and cut into shreds with a razor blade, and filtered through an 80 μm nylon mesh. Centrifuge at 1000r/min for 5 minutes, and precipitate with 1mL OTTOI buffer (formula: 100mmol citric acid, 0.5% (V/V) Tween 20, pH=2.0) and 2mL OTTOII buffer (formula: 400mmol disodium hydrogen phosphate, pH=8) Stain with 50 μL 1.5 mg/L propidium iodide for 20 min in the dark. Ploidy levels were analyzed on the FACSCalibur system (BD, USA), data were extracted by CellQuest software (BD, USA), and analyzed using ModFit software (Yerity, USA).
二、结果与分析2. Results and analysis
1、基于CRISPR-dCas9介导的玉米CRISPRa系列载体建立1. Establishment of maize CRISPRa series vectors based on CRISPR-dCas9-mediated
本发明利用缺乏内切酶活性的CRISPR-dCas9作为DNA引导工具,并融合不同的转录激活效应元件构建了6组CRISPRa载体(图1中a)。载体设计分为两个策略:在第一种策略中,将dCas9蛋白融合转录激活效应元件包括来自拟南芥乙烯反应FACTOR2(ERF2m,CRISPRa1)、VP64(4×VP16,CRISPRa2)以及NF-kB反式激活亚基p65(p65,CRISPRa3)的CRISPRa载体。在该策略中dCas9融合转录激活效应元件后,由sgRNA1.0引导dCas9到靶点DNA处,由转录激活效应原件招募RNA聚合酶进行转录及翻译,达到基因激活的目的;在第二种策略种,我们采用了修饰的sgRNA支架sgRNA2.0,该结构包含sgRNA1.0基本骨架并在第2和第4茎环结构处引入两个MS2结合的RNA适配体,该适配体以招募除VP64之外的额外转录激活因子如:ERF2m-HSF1(CRISPRa4)、p65-HSF1(CRISPRa5)和VP64-HSF1(CRISPRa6)。同时,我们将额外的转录激活因子与人类热休克因子1(HSF1)的激活结构域进行融合表达以实现结构域协同激活作用。在第二种策略中,dCas9-VP64和其他额外转录激活因子被设计到一个阅读框内,当该蛋白表达,融合蛋白通过T2A肽释放NLS-dCas9-VP64,该结构与sgRNA2.0结合后其中MS2结合的RNA适配体招募NLS-MS2-affetor-HSF1结构,进而形成增强型的基因激活工具(图1中a和b)。The present invention uses CRISPR-dCas9, which lacks endonuclease activity, as a DNA guidance tool and fuses different transcription activation response elements to construct six groups of CRISPRa vectors (a in Figure 1). Vector design is divided into two strategies: In the first strategy, the dCas9 protein is fused to transcriptional activation response elements including ethylene response FACTOR2 (ERF2m, CRISPRa1), VP64 (4×VP16, CRISPRa2) and NF-kB transgene from Arabidopsis thaliana. CRISPRa vector formula activating subunit p65 (p65, CRISPRa3). In this strategy, after dCas9 is fused with a transcriptional activation effector element, sgRNA1.0 guides dCas9 to the target DNA, and the transcriptional activation effector element recruits RNA polymerase for transcription and translation to achieve gene activation; in the second strategy , we adopted a modified sgRNA scaffold sgRNA2.0, which contains the basic skeleton of sgRNA1.0 and introduces two MS2-binding RNA aptamers at the 2nd and 4th stem-loop structures to recruit VP64 in addition to Additional transcriptional activators such as: ERF2m-HSF1 (CRISPRa4), p65-HSF1 (CRISPRa5) and VP64-HSF1 (CRISPRa6). At the same time, we fused and expressed additional transcriptional activators with the activation domain of human heat shock factor 1 (HSF1) to achieve synergistic activation of the domains. In the second strategy, dCas9-VP64 and other additional transcriptional activators are designed into a reading frame. When the protein is expressed, the fusion protein releases NLS-dCas9-VP64 through the T2A peptide, which binds to sgRNA2.0. The MS2-bound RNA aptamer recruits the NLS-MS2-affetor-HSF1 structure, thereby forming an enhanced gene activation tool (a and b in Figure 1).
最终获得的6个CRISPRa载体结构描述如下:The six CRISPRa vector structures finally obtained are described as follows:
CRISPRa1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒;所述表达盒1中自5’端到3’端依次由U6启动子(SEQ ID No.7的第1-397位)、靶向目标基因靶标区域的sgRNA间隔序列插入位点(gtaacgtcaccgtagggcct)、sgRNA1.0支架编码序列(SEQ ID No.7的第418-500位所示)组成。所述表达盒2自5’端到3’端依次由UBI启动子(SEQ ID No.1的第18-2013)、NLS核定位信号编码序列(SEQ ID No.1的第2107-2127 位)、dCas9编码基因(SEQ ID No.1的第2152-6252位)、ERF2m编码基因(SEQ ID No.1的第6319-7047位)和NOS转录终止子(SEQ ID No.1的第7082-7334位)组成。所述表达盒2的全序列如SEQ ID No.1所示。CRISPRa1 vector structure description: A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site. The expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 7), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA1.0 scaffold coding sequence (shown at positions 418-500 of SEQ ID No. 7). The expression cassette 2 consists of the UBI promoter (No. 18-2013 of SEQ ID No. 1) and the NLS nuclear localization signal coding sequence (No. 2107-2127 of SEQ ID No. 1) from the 5' end to the 3' end. , dCas9 encoding gene (No. 2152-6252 of SEQ ID No. 1), ERF2m encoding gene (No. 6319-7047 of SEQ ID No. 1) and NOS transcription terminator (No. 7082-7334 of SEQ ID No. 1) bits). The complete sequence of the expression cassette 2 is shown in SEQ ID No. 1.
CRISPRa2载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒;所述表达盒1中自5’端到3’端依次由U6启动子(SEQ ID No.7的第1-397位)、靶向目标基因靶标区域的sgRNA间隔序列插入位点(gtaacgtcaccgtagggcct)、sgRNA1.0支架编码序列(SEQ ID No.7的第418-500位所示)组成。所述表达盒2自5’端到3’端依次由UBI启动子(SEQ ID No.2的第18-2013)、NLS核定位信号编码序列(SEQ ID No.2的第2107-2127位)、dCas9编码基因(SEQ ID No.2的第2152-6252位)、VP64编码基因(SEQ ID No.2的第6319-6468位)和NOS转录终止子(SEQ ID No.2的第6503-6755位)组成。所述表达盒2的全序列如SEQ ID No.2所示。CRISPRa2 vector structure description: A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site. The expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 7), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA1.0 scaffold coding sequence (shown at positions 418-500 of SEQ ID No. 7). The expression cassette 2 consists of the UBI promoter (No. 18-2013 of SEQ ID No. 2) and the NLS nuclear localization signal coding sequence (No. 2107-2127 of SEQ ID No. 2) from the 5' end to the 3' end. , dCas9 encoding gene (No. 2152-6252 of SEQ ID No. 2), VP64 encoding gene (No. 6319-6468 of SEQ ID No. 2) and NOS transcription terminator (No. 6503-6755 of SEQ ID No. 2) bits). The complete sequence of the expression cassette 2 is shown in SEQ ID No. 2.
CRISPRa3载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒;所述表达盒1中自5’端到3’端依次由U6启动子(SEQ ID No.7的第1-397位)、靶向目标基因靶标区域的sgRNA间隔序列插入位点(gtaacgtcaccgtagggcct)、sgRNA1.0支架编码序列(SEQ ID No.7的第418-500位所示)组成。所述表达盒2自5’端到3’端依次由UBI启动子(SEQ ID No.3的第18-2013)、NLS核定位信号编码序列(SEQ ID No.3的第2107-2127位)、dCas9编码基因(SEQ ID No.3的第2152-6252位)、P65编码基因(SEQ ID No.3的第6319-6861位)和NOS转录终止子(SEQ ID No.3的第6896-7148位)组成。所述表达盒2的全序列如SEQ ID No.3所示。CRISPRa3 vector structure description: A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site. The expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 7), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA1.0 scaffold coding sequence (shown at positions 418-500 of SEQ ID No. 7). The expression cassette 2 consists of the UBI promoter (No. 18-2013 of SEQ ID No. 3) and the NLS nuclear localization signal coding sequence (No. 2107-2127 of SEQ ID No. 3) from the 5' end to the 3' end. , dCas9 encoding gene (No. 2152-6252 of SEQ ID No. 3), P65 encoding gene (No. 6319-6861 of SEQ ID No. 3) and NOS transcription terminator (No. 6896-7148 of SEQ ID No. 3) bits). The complete sequence of the expression cassette 2 is shown in SEQ ID No. 3.
CRISPRa4载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒;所述表达盒1中自5’端到3’端依次由U6启动子(SEQ ID No.8的第1-397位)、靶向目标基因靶标区域的sgRNA间隔序列插入位点(gtaacgtcaccgtagggcct)、sgRNA2.0支架编码序列(SEQ ID No.8的第418-560位所示)组成。所述表达盒2自5’端到3’端依次由UBI启动子(SEQ ID No.4的第18-2013位)、NLS核定位信号编码序列(SEQ ID No.4的第2107-2127位)、dCas9编码基因(SEQ ID No.4的第2152-6252位)、VP64编码基因(SEQ ID No.4的第6319-6495位)、自裂解多肽T2A的编码基因(SEQ ID No.4的第6496-6549位)、MS2编码基因(SEQ ID No.4的第6565-6951位)、NLS核定位信号编码序列(SEQ ID No.4的第7006-7026位)、ERF2m编码基因(SEQ ID No.4的第7042-7770位)、HSF1编码基因(SEQ ID No.4的第7795-8169位)和NOS转录终止子(SEQ ID No.4的第8204-8456位)组成。所述表达盒2的全序列如SEQ ID No.4所示。CRISPRa4 vector structure description: A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site. The expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 8), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA2.0 scaffold coding sequence (shown at positions 418-560 of SEQ ID No. 8). The expression cassette 2 consists of the UBI promoter (positions 18-2013 of SEQ ID No. 4) and the NLS nuclear localization signal coding sequence (positions 2107-2127 of SEQ ID No. 4) from the 5' end to the 3' end. ), dCas9 coding gene (No. 2152-6252 of SEQ ID No. 4), VP64 coding gene (No. 6319-6495 of SEQ ID No. 4), self-cleaving polypeptide T2A coding gene (SEQ ID No. 4) No. 6496-6549), MS2 coding gene (No. 6565-6951 of SEQ ID No. 4), NLS nuclear localization signal coding sequence (No. 7006-7026 of SEQ ID No. 4), ERF2m coding gene (SEQ ID No. 4) No. 4 No. 7042-7770), HSF1 coding gene (SEQ ID No. 4 No. 7795-8169) and NOS transcription terminator (SEQ ID No. 4 No. 8204-8456). The complete sequence of the expression cassette 2 is shown in SEQ ID No. 4.
CRISPRa5载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒;所述表达盒1中自5’端到3’端依次由U6启动子(SEQ ID No.8的第1-397位)、靶向目标基因靶标区域的sgRNA间隔序列插入位点(gtaacgtcaccgtagggcct)、sgRNA2.0支架编码序列(SEQ ID No.8的第418-560位所示)组成。所述表达盒2自5’端到3’端依次由UBI启动子(SEQ ID No.5的第18-2013位)、NLS核定位信号编码序列(SEQ ID No.5的第2107-2127位)、dCas9编码基因(SEQ ID No.5的第2152-6252位)、VP64编码基因(SEQ ID No.5的第6319-6495位)、自裂解多肽T2A的编码基因(SEQ ID No.5的第6496-6549位)、MS2编码基因(SEQ ID No.5的第6565-6951位)、NLS核定位信号编码序列(SEQ ID No.5的第7006-7026位)、P65编码基因(SEQ ID No.5的第7042-7584位)、HSF1编码基因(SEQ ID No.5的第7609-7983位)、NOS转录终止子(SEQ ID No.5的第8018-8270位)。所述表达盒2’的序列如SEQ ID No.5所示。CRISPRa5 vector structure description: A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site. The expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 8), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA2.0 scaffold coding sequence (shown at positions 418-560 of SEQ ID No. 8). The expression cassette 2 consists of the UBI promoter (positions 18-2013 of SEQ ID No. 5) and the NLS nuclear localization signal coding sequence (positions 2107-2127 of SEQ ID No. 5) from the 5' end to the 3' end. ), dCas9 coding gene (No. 2152-6252 of SEQ ID No. 5), VP64 coding gene (No. 6319-6495 of SEQ ID No. 5), self-cleaving polypeptide T2A coding gene (SEQ ID No. 5 No. 6496-6549), MS2 coding gene (No. 6565-6951 of SEQ ID No. 5), NLS nuclear localization signal coding sequence (No. 7006-7026 of SEQ ID No. 5), P65 coding gene (SEQ ID No. 5) No. 5 No. 7042-7584), HSF1 coding gene (SEQ ID No. 5 No. 7609-7983), NOS transcription terminator (SEQ ID No. 5 No. 8018-8270). The sequence of 2' of the expression cassette is shown in SEQ ID No. 5.
CRISPRa6载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒;所述表达盒1中自5’端到3’端依次由U6启动子(SEQ ID No.8的第1-397位)、靶向目标基因靶标区域的sgRNA间隔序列插入位点(gtaacgtcaccgtagggcct)、sgRNA2.0支架编码序列(SEQ ID No.8的第418-560位所示)组成。所述表达盒2自5’端到3’端依次由UBI启动子(SEQ ID No.6的第18-2013位)、NLS核定位信号编码序列(SEQ ID No.6的第2107-2127位)、dCas9编码基因(SEQ ID No.6的第2152-6252位)、VP64编码基因(SEQ ID No.6的第6319-6495位)、自裂解多肽T2A的编码基因(SEQ ID No.6的第6496-6549位)、MS2编码基因(SEQ ID No.6的第6565-6951位)、NLS核定位信号编码序列(SEQ ID No.6的第7006-7026位)、VP64编码基因(SEQ ID No.6的第7042-7218位)、HSF1编码基因(SEQ ID No.6的第7243-7617位)、NOS转录终止子(SEQ ID No.6的第7652-7904位)。所述表达盒2’的序列如SEQ ID No.6所示。CRISPRa6 vector structure description: A recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site. The expression cassette 1 is an expression cassette for expressing sgRNA; from the 5' end to the 3' end, the expression cassette 1 consists of the U6 promoter (positions 1-397 of SEQ ID No. 8), the target It consists of the sgRNA spacer sequence insertion site (gtaacgtcaccgtagggcct) in the gene target region and the sgRNA2.0 scaffold coding sequence (shown at positions 418-560 of SEQ ID No. 8). The expression cassette 2 consists of the UBI promoter (positions 18-2013 of SEQ ID No. 6) and the NLS nuclear localization signal coding sequence (positions 2107-2127 of SEQ ID No. 6) from the 5' end to the 3' end. ), dCas9 coding gene (No. 2152-6252 of SEQ ID No. 6), VP64 coding gene (No. 6319-6495 of SEQ ID No. 6), self-cleaving polypeptide T2A coding gene (SEQ ID No. 6) No. 6496-6549), MS2 coding gene (No. 6565-6951 of SEQ ID No. 6), NLS nuclear localization signal coding sequence (No. 7006-7026 of SEQ ID No. 6), VP64 coding gene (SEQ ID No. 6) No. 6 No. 7042-7218), HSF1 coding gene (SEQ ID No. 6 No. 7243-7617), NOS transcription terminator (SEQ ID No. 6 No. 7652-7904). The sequence of 2' of the expression cassette is shown in SEQ ID No. 6.
为了评估CRISPRa(1-6)6个基因激活系统在玉米细胞中基因激活活性,我们选择了玉米内源的种子赖氨酸生物合成相关基因ZmDPS1作为CRISPRa系统的靶点。针对ZmDPS1转录起始位点(TSS)上游100bp区域设计基因激活靶点(图1中b和c);表1),并分别构建基因激活载体。所得6个载体结构描述如下:In order to evaluate the gene activation activity of the CRISPRa(1-6) 6 gene activation system in maize cells, we selected the endogenous seed lysine biosynthesis-related gene ZmDPS1 in maize as the target of the CRISPRa system. Gene activation targets were designed for the 100 bp region upstream of the ZmDPS1 transcription start site (TSS) (b and c in Figure 1; Table 1), and gene activation vectors were constructed respectively. The resulting 6 vector structures are described as follows:
靶向ZmDPS1基因TSS上游-1至-100bp的CRISPRa1 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.7所示。SEQ ID No.7的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因TSS上游-1至-100bp的sgRNA间隔序列、第418-500位 为sgRNA1.0支架编码序列。所述表达盒2全序列如SEQ ID No.1所示。SEQ ID No.1的第18-2013为UBI启动子,第2107-2127位为NLS核定位信号编码序列、第2152-6252位为dCas9编码基因,第6319-7047位为ERF2m编码基因,第7082-7334位为NOS转录终止子。 Structural description of the CRISPRa1 DPS1 vector targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 7. Positions 1-397 of SEQ ID No. 7 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-500 are the sgRNA1.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 1. No. 18-2013 of SEQ ID No. 1 is the UBI promoter, No. 2107-2127 is the NLS nuclear localization signal coding sequence, No. 2152-6252 is the dCas9 coding gene, No. 6319-7047 is the ERF2m coding gene, No. 7082 Position -7334 is the NOS transcription terminator.
靶向ZmDPS1基因TSS上游-1至-100bp的CRISPRa2 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.7所示。SEQ ID No.7的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因TSS上游-1至-100bp的sgRNA间隔序列、第418-500位为sgRNA1.0支架编码序列。所述表达盒2全序列如SEQ ID No.2所示。SEQ ID No.2的第18-2013为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6468位为VP64编码基因,第6503-6755位为NOS转录终止子。 Structural description of the CRISPRa2 DPS1 vector targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 7. Positions 1-397 of SEQ ID No. 7 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-500 are the sgRNA1.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 2. No. 18-2013 of SEQ ID No. 2 is the UBI promoter, No. 2107-2127 is the NLS nuclear localization signal coding sequence, No. 2152-6252 is the dCas9 coding gene, No. 6319-6468 is the VP64 coding gene, No. 6503 Position -6755 is the NOS transcription terminator.
靶向ZmDPS1基因TSS上游-1至-100bp的CRISPRa3 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.7所示。SEQ ID No.7的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因TSS上游-1至-100bp的sgRNA间隔序列,第418-500位为sgRNA1.0支架编码序列。所述表达盒2全序列如SEQ ID No.3所示。SEQ ID No.3的第18-2013为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6861位为P65编码基因,第6896-7148位为NOS转录终止子。 Structural description of the CRISPRa3 DPS1 vector targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 7. Positions 1-397 of SEQ ID No. 7 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-500 are the sgRNA1.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 3. No. 18-2013 of SEQ ID No. 3 is the UBI promoter, No. 2107-2127 is the NLS nuclear localization signal coding sequence, No. 2152-6252 is the dCas9 coding gene, No. 6319-6861 is the P65 coding gene, No. 6896 Position -7148 is the NOS transcription terminator.
靶向ZmDPS1基因TSS上游-1至-100bp的CRISPRa4 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.8所示。SEQ ID No.8的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因TSS上游-1至-100bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.4所示。SEQ ID No.4的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7770位为ERF2m编码基因,第7795-8169位为HSF1编码基因,第8204-8456位为NOS转录终止子。 Structural description of the CRISPRa4 DPS1 vector targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 8. Positions 1-397 of SEQ ID No. 8 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 4. Positions 18-2013 of SEQ ID No. 4 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7770 are the genes encoding ERF2m, and positions 7795-8169 Positions 8204-8456 are the NOS transcription terminator.
靶向ZmDPS1基因TSS上游-1至-100bp的CRISPRa5 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.8所示。SEQ ID No.8的第1-397位为U6启动子,第398-417 位为靶向ZmDPS1基因TSS上游-1至-100bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码基因,第8018-8270位为NOS转录终止子。 Structural description of the CRISPRa5 DPS1 vector targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 8. Positions 1-397 of SEQ ID No. 8 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
靶向ZmDPS1基因TSS上游-1至-100bp的CRISPRa6 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.8所示。SEQ ID No.8的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因TSS上游-1至-100bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.6所示。SEQ ID No.6的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7218位为VP64编码基因,第7243-7617位为HSF1编码基因,第7652-7904位为NOS转录终止子。 Structural description of the CRISPRa6 DPS1 vector targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 8. Positions 1-397 of SEQ ID No. 8 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 6. Positions 18-2013 of SEQ ID No. 6 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7218 are the genes encoding VP64, and positions 7243-7617 Positions are the HSF1 encoding gene, and positions 7652-7904 are the NOS transcription terminator.
分别将上述CRISPRa(1-6) DPS1转入B73玉米自交系叶肉原生质体中,使其分别对ZmDPS1基因进行激活,筛选高活性CRISPRa基因激活工具。通过RT-qPCR实验数据可知,与未转入载体的原生质体对照组相比所有CRISPRa系统的转录水平至少增加了1.6倍的ZmDPS1表达水平,且CRISPRa5系统产生了最强大的转录激活(相对于对照组11.25倍),这表明了设计的CRISPRa基因激活工具可在玉米叶肉原生质体细胞中实现高活性的转录激活(图1中d)。下一步我们要对CRISPRa5转录激活工具的激活窗口进行评估。因此我们选择了ZmDPS1基因5’UTR、TSS上游-1至-100bp、-100至-200bp、-200至-300bp四个区域设计了sgRNA靶点并设计CRIPRa5基因激活载体。所得4个载体结构描述如下: The above-mentioned CRISPRa(1-6) DPS1 was transferred into the mesophyll protoplasts of the B73 corn inbred line to activate the ZmDPS1 gene respectively, and high-activity CRISPRa gene activation tools were screened. From the RT-qPCR experimental data, it can be seen that compared with the protoplast control group that was not transformed into the vector, the transcription levels of all CRISPRa systems increased the ZmDPS1 expression level by at least 1.6 times, and the CRISPRa5 system produced the most powerful transcription activation (relative to 11.25 times compared with the control group), which shows that the designed CRISPRa gene activation tool can achieve highly active transcription activation in corn mesophyll protoplast cells (d in Figure 1). The next step is to evaluate the activation window of the CRISPRa5 transcriptional activation tool. Therefore, we selected four regions of the 5'UTR of the ZmDPS1 gene, -1 to -100bp, -100 to -200bp, and -200 to -300bp upstream of the TSS to design sgRNA targets and a CRIPRa5 gene activation vector. The resulting four vector structures are described as follows:
靶向ZmDPS1基因5’UTR的CRISPRa5 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.11所示。SEQ ID No.11的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因5’UTR的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码 基因,第8018-8270位为NOS转录终止子。 Structural description of the CRISPRa5 DPS1 vector targeting the 5'UTR of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site. The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 11. Positions 1-397 of SEQ ID No. 11 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting the 5'UTR of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
靶向ZmDPS1基因TSS上游-1至-100bp的CRISPRa5 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.8所示。SEQ ID No.8的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因TSS上游-1至-100bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码基因,第8018-8270位为NOS转录终止子。 Structural description of the CRISPRa5 DPS1 vector targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 8. Positions 1-397 of SEQ ID No. 8 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
靶向ZmDPS1基因TSS上游-100至-200bp的CRISPRa5 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.9所示。SEQ ID No.9的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因TSS上游-100至-200bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码基因,第8018-8270位为NOS转录终止子。 Structural description of the CRISPRa5 DPS1 vector targeting -100 to -200 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 9. Positions 1-397 of SEQ ID No. 9 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -100 to -200 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
靶向ZmDPS1基因TSS上游-200至-300bp的CRISPRa5 DPS1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.10所示。SEQ ID No.10的第1-397位为U6启动子,第398-417位为靶向ZmDPS1基因TSS上游-200至-300bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码基因,第8018-8270位为NOS转录终止子。 Structural description of the CRISPRa5 DPS1 vector targeting -200 to -300 bp upstream of the TSS of the ZmDPS1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 10. Positions 1-397 of SEQ ID No. 10 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -200 to -300 bp upstream of the TSS of the ZmDPS1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
通过转化B73玉米自交系叶肉原生质体及RT-qPCR对ZmDPS1表达水平进行测量。通过实验我们发现,针对-1至-100bp区域的sgRNA靶点对ZmDPS1表达水平显示出最高的倍数激活(图1中e)。为了进一步验证了CRISPRa5系统的 基因激活能力,我们分别针对ZmTrxh、ZmES2、ZmRCP1三个基因的TSS上游-1至-100bp区域设计sgRNA靶点,并分别构建CRISPRa5载体。所得3个载体结构描述如下:The expression level of ZmDPS1 was measured by transforming B73 corn inbred line mesophyll protoplasts and RT-qPCR. Through experiments, we found that the sgRNA target targeting the -1 to -100bp region showed the highest fold activation of ZmDPS1 expression level (e in Figure 1). In order to further verify the gene activation ability of the CRISPRa5 system, we designed sgRNA targets for the -1 to -100bp region upstream of the TSS of the three genes ZmTrxh, ZmES2, and ZmRCP1, and constructed CRISPRa5 vectors respectively. The resulting three vector structures are described as follows:
靶向ZmTrxh基因TSS上游-1至-100bp的CRISPRa5 Trxh载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.12所示。SEQ ID No.12的第1-397位为U6启动子,第398-417位为靶向ZmTrxh基因TSS上游-1至-100bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码基因,第8018-8270位为NOS转录终止子。 Structural description of the CRISPRa5 Trxh vector targeting -1 to -100 bp upstream of the TSS of the ZmTrxh gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 12. Positions 1-397 of SEQ ID No. 12 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmTrxh gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
靶向ZmES2基因TSS上游-1至-100bp的CRISPRa5 ES2载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.13所示。SEQ ID No.13的第1-397位为U6启动子,第398-417位为靶向ZmES2基因TSS上游-1至-100bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码基因,第8018-8270位为NOS转录终止子。 Description of the structure of the CRISPRa5 ES2 vector targeting -1 to -100 bp upstream of the TSS of the ZmES2 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 13. Positions 1-397 of SEQ ID No. 13 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmES2 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
靶向ZmRCP1基因TSS上游-1至-100bp的CRISPRa5 RCP1载体结构描述:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.14所示。SEQ ID No.14的第1-397位为U6启动子,第398-417位为靶向ZmRCP1基因TSS上游-1至-100bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码基因,第8018-8270位为NOS转录终止子。 Structural description of the CRISPRa5 RCP1 vector targeting -1 to -100 bp upstream of the TSS of the ZmRCP1 gene: a recombinant vector obtained by inserting expression cassette 1 at the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 at the BamHI restriction site. . The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 14. Positions 1-397 of SEQ ID No. 14 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the ZmRCP1 gene, and positions 418-560 are the sgRNA2.0 scaffold coding sequence. . The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
通过B73玉米自交系叶肉原生质体及RT-qPCR技术评估,我们观察到与未 转入载体的原生质体的对照组相比较CRISPRa5基因激活工具对ZmTrxh基因表达量提高了375.38倍(图1中f),对ZmES2基因表达水平提高了5.33倍(图1中g),对ZmRCP1基因表达量提高了4.45倍(图1中h)。综上结果表明,我们所设计的CRISPRa5基因激活工具(sgRNA2.0,dCas9-VP64和MS2-p65-HSF1效应体),在玉米细胞中显示出最强的靶向转录激活能力。Through evaluation of B73 corn inbred line mesophyll protoplasts and RT-qPCR technology, we observed that the CRISPRa5 gene activation tool increased the ZmTrxh gene expression by 375.38 times compared with the control group of protoplasts not transformed into the vector (f in Figure 1 ), the expression level of ZmES2 gene was increased by 5.33 times (g in Figure 1), and the expression level of ZmRCP1 gene was increased by 4.45 times (h in Figure 1). In summary, the results show that the CRISPRa5 gene activation tools we designed (sgRNA2.0, dCas9-VP64 and MS2-p65-HSF1 effector) show the strongest targeted transcription activation ability in corn cells.
2、基于卵细胞分离与转化技术评估CRISPRa5在卵细胞中激活活性2. Evaluate the activation activity of CRISPRa5 in egg cells based on egg cell isolation and transformation technology
为了确定CRISPRa5在卵细胞中基因激活能力,通过显微操作结合胚囊酶解方法我们建立了玉米卵细胞分离与转化系统。首先在体式显微镜下,我们利用显微操作技术从胚珠组织中分离出胚囊组织(图2中a和b)。随后用酶解的方法处理这些胚囊组织样本1小时(图2中c),通过显微操作技术分离出整个胚囊(图2中d)。接下来,我们在体式显微镜下手动分离反足细胞(ACs)(图2中e)、助细胞(SCs)(图2中f)、中央细胞(CC)(图2中g)和卵细胞(EC)(图2中h)。我们从酶解后的胚囊中分离出的SCs、CCs和ECs的产量约为25%。In order to determine the gene activation ability of CRISPRa5 in egg cells, we established a corn egg cell isolation and transformation system through micromanipulation combined with embryo sac enzymatic hydrolysis. First, we used micromanipulation techniques to separate embryo sac tissue from the ovule tissue under a stereomicroscope (a and b in Figure 2). These embryo sac tissue samples were then treated with enzymatic hydrolysis for 1 hour (c in Figure 2), and the entire embryo sac was isolated through micromanipulation techniques (d in Figure 2). Next, we manually separated antipodal cells (ACs) (e in Figure 2 ), synergid cells (SCs) (f in Figure 2 ), central cells (CC) (g in Figure 2 ) and egg cells (EC) under a stereomicroscope. ) (h in Figure 2). The yield of SCs, CCs, and ECs we isolated from enzymatically digested embryo sacs was approximately 25%.
为了验证CRISPRa5在玉米卵细胞中的激活能力,我们以实验室前期建立的DFP单倍体辅助筛选玉米转基因株系(本实验室保存,文献:Dong L,Li L,Liu C,et al.Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant,2018,11(9):1214-1217.)为材料制备卵细胞及其原生质体,DFP转基因系中包含糊粉层特异性启动子LTP2启动子驱动的红色荧光蛋白DsRed2基因(Dong,L.,Li,L.,Liu,Changlin,Liu,Chenxu,Geng,S.,Li,X.,et al.(2018)Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize.Mol.Plant,11,1214–1217.)。LTP2启动子在糊粉层特异表达,而在卵细胞中不表达。为了验证CRISPRa5在卵细胞中的活性,我们利用拟南芥卵细胞特异性启动子DD45启动子表达CRISPRa5系统,并在LTP2启动子TSS上游-1至-100bp处设计了基因激活靶点并设计靶向LTP2启动子的卵细胞特异的CRISPRa5 pLTP2::DsRed2载体。所得载体结构描述如下:在pUC19载体的HindIII酶切位点处插入表达盒1,同时在BamHI酶切位点处插入表达盒2后得到的重组载体。所述表达盒1为用于表达sgRNA的表达盒,全序列如SEQ ID No.15所示。SEQ ID No.15的第1-397位为U6启动子,第398-417位为靶向LTP2启动子TSS上游-1至-100bp的sgRNA间隔序列,第418-560位为sgRNA2.0支架编码序列。所述表达盒2全序列如SEQ ID No.5所示。SEQ ID No.5的第18-2013位为UBI启动子,第2107-2127位为NLS核定位信号编码序列,第2152-6252位为dCas9编码基因,第6319-6495位为VP64编码基因,第6496-6549位为自裂解多肽T2A的编码基因,第6565-6951位为MS2编码基因,第7006-7026位为NLS核定位信号编码序列,第7042-7584位为P65编码基因,第7609-7983位为HSF1编码基因,第8018-8270位为NOS转录终止子。 In order to verify the activation ability of CRISPRa5 in corn egg cells, we used the DFP haploid established in the laboratory to assist in screening of corn transgenic lines (kept in our laboratory, literature: Dong L, Li L, Liu C, et al. Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant, 2018,11(9):1214-1217.) To prepare egg cells and their protoplasts as materials, the DFP transgenic line contains aleurone layer The red fluorescent protein DsRed2 gene driven by the specific promoter LTP2 promoter (Dong, L., Li, L., Liu, Changlin, Liu, Chenxu, Geng, S., Li, X., et al. (2018) Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize. Mol. Plant, 11, 1214–1217.). The LTP2 promoter is specifically expressed in the aleurone layer but not in egg cells. In order to verify the activity of CRISPRa5 in egg cells, we used the Arabidopsis egg cell-specific promoter DD45 promoter to express the CRISPRa5 system, and designed a gene activation target at -1 to -100bp upstream of the TSS of the LTP2 promoter and designed to target LTP2 Promoter of the oocyte-specific CRISPRa5 pLTP2::DsRed2 vector. The resulting vector structure is described as follows: the recombinant vector obtained by inserting expression cassette 1 into the HindIII restriction site of the pUC19 vector and inserting expression cassette 2 into the BamHI restriction site. The expression cassette 1 is an expression cassette for expressing sgRNA, and the complete sequence is shown in SEQ ID No. 15. Positions 1-397 of SEQ ID No. 15 are the U6 promoter, positions 398-417 are the sgRNA spacer sequence targeting -1 to -100 bp upstream of the TSS of the LTP2 promoter, and positions 418-560 are the sgRNA2.0 scaffold encoding sequence. The complete sequence of the expression cassette 2 is shown in SEQ ID No. 5. Positions 18-2013 of SEQ ID No. 5 are the UBI promoter, positions 2107-2127 are the NLS nuclear localization signal coding sequence, positions 2152-6252 are the dCas9 coding gene, and positions 6319-6495 are the VP64 coding gene. Positions 6496-6549 are the genes encoding the self-cleaving polypeptide T2A, positions 6565-6951 are the genes encoding MS2, positions 7006-7026 are the NLS nuclear localization signal encoding sequences, positions 7042-7584 are the genes encoding P65, and positions 7609-7983 Positions 8018-8270 are the NOS transcription terminator.
通过卵细胞转化技术,我们将载体转入DFP单倍体辅助筛选玉米转基因株系(本实验室保存,文献:Dong L,Li L,Liu C,et al.Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant,2018,11(9):1214-1217.)的卵细胞中,以诱导DFP卵细胞中LTP2pro:DsRed2的表达(图2中i)。转化后的卵细胞经过24小时培养后检测DsRed2荧光,经过荧光显微镜检测我们发现了DsRed2荧光蛋白的表达,表明LTP2pro:DsRed2表达盒被转录激活导致卵细胞中DsRed2蛋白的表达(图2中j)。综合以上数据表明,CRISPRa5基因激活工具可在玉米卵细胞中实现高活性的基因激活。Through egg cell transformation technology, we transferred the vector into DFP haploid-assisted screening of corn transgenic lines (kept in this laboratory, literature: Dong L, Li L, Liu C, et al. Genome editing and double-fluorescence proteins enable robust maternal haploid induction and identification in maize[J].Molecular plant, 2018,11(9):1214-1217.) in egg cells to induce the expression of LTP2pro:DsRed2 in DFP egg cells (i in Figure 2). The transformed egg cells were cultured for 24 hours and DsRed2 fluorescence was detected. Through fluorescence microscopy, we found the expression of DsRed2 fluorescent protein, indicating that the LTP2pro:DsRed2 expression cassette was transcriptionally activated leading to the expression of DsRed2 protein in the egg cells (j in Figure 2). Taken together, the above data show that the CRISPRa5 gene activation tool can achieve highly active gene activation in corn egg cells.
3、通过CRISPRa在玉米卵细胞中异位激活ZmBBM23. Ectopic activation of ZmBBM2 in corn egg cells through CRISPRa
基于8个物种的14个BBM蛋白的系统发育树分析可知ZmBBM2与PsASGR-BBML及OsBBM1蛋白序列具有较近的亲缘进化关系,我们推断ZmBBM2基因具有PsASGR-BBML及OsBBM1类似的功能,其卵细胞异位活性可诱发孤雌生殖现象。因此,我们构建了以内源ZmBBM2基因启动子为目标的卵细胞特异表达的CRISPRa5 BBM基因激活体载体(图3中a)。所得载体结构描述如下:将CPB载体的酶切位点HindIII和EcoRI之间的小片段替换为SEQ ID No.16所示DNA片段后得到的重组载体。SEQ ID No.16的第1-1007位为用于表达sgRNA的表达盒。SEQ ID No.16的第1-397位为ZmU6-2启动子,第398-417位为靶向ZmBBM2基因转录起始点(TSS)上游-36bp区域的sgRNA间隔序列的编码核酸,第418-553位为sgRNA2.0支架编码序列,第554-630位为拟南芥At-tRNAGly编码序列,第631-650位为靶向ZmBBM2基因转录起始点(TSS)上游-166bp区域的sgRNA间隔序列的编码核酸,第651-793位为sgRNA2.0支架编码序列。SEQ ID No.16的第1008-8238位为用于表达功能蛋白的表达盒。SEQ ID No.16的第1008-2017位为拟南芥DD45启动子,第2018-2086位为3×Flag标签编码核酸,第2093-2113位为SV40核定位信号编码序列,第2114-6238位为dCas9编码基因,第6239-6286位为NLS核定位信号编码序列,第6305-6454位为VP64编码基因,第6482-6535位为自裂解多肽T2A的编码基因,第6551-6937位为MS2编码基因,第6992-7012位为SV40核定位信号编码序列,第7028-7570位为P65编码基因,第7595-7966位为HSF1编码基因,第7967-8238位为NOS转录终止子。 Based on the phylogenetic tree analysis of 14 BBM proteins from 8 species, it can be seen that ZmBBM2 has a close evolutionary relationship with PsASGR-BBML and OsBBM1 protein sequences. We infer that the ZmBBM2 gene has similar functions to PsASGR-BBML and OsBBM1, and its egg cells are ectopic. Activity can induce parthenogenesis. Therefore, we constructed a CRISPRa5 BBM gene activator vector targeting the endogenous ZmBBM2 gene promoter for egg cell-specific expression (a in Figure 3). The resulting vector structure is described as follows: the recombinant vector obtained by replacing the small fragment between the enzyme cutting sites HindIII and EcoRI of the CPB vector with the DNA fragment shown in SEQ ID No. 16. Positions 1-1007 of SEQ ID No. 16 are expression cassettes for expressing sgRNA. Positions 1-397 of SEQ ID No. 16 are the ZmU6-2 promoter, positions 398-417 are the coding nucleic acids of the sgRNA spacer sequence targeting the -36 bp region upstream of the transcription start site (TSS) of the ZmBBM2 gene, and positions 418-553 Positions 554-630 are the sgRNA2.0 scaffold coding sequence, positions 554-630 are the Arabidopsis At-tRNAGly coding sequence, and positions 631-650 are the coding for the sgRNA spacer sequence targeting the -166 bp region upstream of the ZmBBM2 gene transcription start site (TSS). Nucleic acid, positions 651-793 are the sgRNA2.0 scaffold coding sequence. Positions 1008-8238 of SEQ ID No. 16 are expression cassettes for expressing functional proteins. Positions 1008-2017 of SEQ ID No. 16 are the Arabidopsis DD45 promoter, positions 2018-2086 are the 3×Flag tag encoding nucleic acid, positions 2093-2113 are the SV40 nuclear localization signal encoding sequence, and positions 2114-6238 It is the gene encoding dCas9, positions 6239-6286 are the NLS nuclear localization signal encoding sequence, positions 6305-6454 are the VP64 encoding gene, positions 6482-6535 are the encoding genes for the self-cleaving polypeptide T2A, and positions 6551-6937 are the MS2 encoding Gene, positions 6992-7012 are the SV40 nuclear localization signal coding sequence, positions 7028-7570 are the P65 encoding gene, positions 7595-7966 are the HSF1 encoding gene, and positions 7967-8238 are the NOS transcription terminator.
我们通过农杆菌(根癌农杆菌)介导将上述所得载体对KN5585玉米自交系进行转化,得到9个独立的转基因株系EA13、EA14、EA15、EA19、EA21、EA222、EA28、EA34和EA36,并将其繁殖到T3代。我们利用微滴数字PCR(ddPCR)对上述材料转基因元件拷贝数进行检测,结果表明EA15、EA19、EA22、EA34和EA36为单拷贝,而EA13、EA21和EA28是双拷贝且EA14为4个拷贝(图3中b)。利用分离的胚囊提取上述转基因材料总RNA,我们以ZmeIF4α基因为内参,采用逆转录ddPCR(RT-ddPCR)分析了CRISPRa5 BBM材料胚囊dCas9和ZmBBM2转录水平。我们在所有9个CRISPRa5 BBM转基因株系的胚囊中都检测到CRISPRa5 BBM(图3 中c)和ZmBBM2(图3中d)RNA的高转录水平,且每1μg总RNA高达10 3拷贝,而在对照组野生型KN5585玉米自交系中没有检测到任何表达痕迹。由于在5个单拷贝转基因株系中EA19表现出最高的ZmBBM2激活水平,因此接下来我们选择了转基因株系EA19进行进一步的验证。为了进一步分析单个卵细胞CRISPRa5 BBM和ZmBBM2RNA的转录水平,我们利用显微操作技术将EA19转基因株系植物中分离出ANs、SCs、CCs和ECs。结合单细胞RT-ddPCR技术对EC中CRISPRa5 BBM和ZmBBM2转录本进行检测(图3中e),数据表明,CRISPRa5 BBM可在卵细胞中表达,且CRISPRa5 BBM可激活玉米内源ZmBBM2基因的异位表达。为了进一步验证这以上结论,我们对EA19材料胚囊组织进行了免疫组化分析以及原位杂交分析,实验结果表明,在EA19卵细胞的细胞核中检测到了dCas9蛋白的表达,同时利用ZmBBM2 mRNA探针我们发现ZmBBM2RNA在卵细胞中表现出较强的信号。以上结果证实了未授粉EC中CRISPRa5 BBM表达且成功激活了卵细胞ZmBBM2基因的靶向激活(图3中f)。同时,通过RT-ddPCR技术我们还分析了EA19材料授粉前及不同授粉时间胚囊/幼胚中CRISPRa5 BBM及ZmBBM2表达水平。实验数据表明,ZC01野生型未授粉胚囊中未发现CRISPRa5 BBM及ZmBBM2表达,随着授粉时间推移ZmBBM2的表达量迅速增加而未发现CRISPRa5 BBM表达。在EA19材料中未授粉胚囊中发现CRISPRa5 BBM及ZmBBM2表达,随着授粉时间推移CRISPRa5 BBM及ZmBBM2的表达量迅速增加(图3中g和h)。 We transformed the KN5585 corn inbred line with the vector obtained above through Agrobacterium tumefaciens (Agrobacterium tumefaciens), and obtained 9 independent transgenic lines EA13, EA14, EA15, EA19, EA21, EA222, EA28, EA34 and EA36 , and breed it to the T3 generation. We used droplet digital PCR (ddPCR) to detect the copy number of the transgenic elements in the above materials. The results showed that EA15, EA19, EA22, EA34 and EA36 were single copies, while EA13, EA21 and EA28 were double copies and EA14 was 4 copies ( b) in Figure 3. The total RNA of the above transgenic materials was extracted from the isolated embryo sacs. We used the ZmeIF4α gene as the internal reference and used reverse transcription ddPCR (RT-ddPCR) to analyze the transcript levels of dCas9 and ZmBBM2 in the embryo sacs of CRISPRa5 BBM materials. We detected high transcript levels of CRISPRa5 BBM (c in Figure 3) and ZmBBM2 (d in Figure 3) RNA in the embryo sacs of all nine CRISPRa5 BBM transgenic lines, with up to 10 copies per 1 μg of total RNA, while No trace of expression was detected in the control wild-type KN5585 maize inbred line. Since EA19 showed the highest ZmBBM2 activation level among the five single-copy transgenic lines, we next selected the transgenic line EA19 for further verification. In order to further analyze the transcript levels of CRISPRa5 BBM and ZmBBM2 RNA in single egg cells, we used micromanipulation technology to isolate ANs, SCs, CCs and ECs from EA19 transgenic line plants. Combining single-cell RT-ddPCR technology to detect CRISPRa5 BBM and ZmBBM2 transcripts in EC (e in Figure 3), the data showed that CRISPRa5 BBM can be expressed in egg cells, and CRISPRa5 BBM can activate the ectopic expression of the endogenous ZmBBM2 gene in maize. . In order to further verify the above conclusion, we performed immunohistochemical analysis and in situ hybridization analysis on the embryo sac tissue of EA19 material. The experimental results showed that the expression of dCas9 protein was detected in the nucleus of EA19 egg cells. At the same time, using the ZmBBM2 mRNA probe we It was found that ZmBBM2RNA showed a strong signal in egg cells. The above results confirmed that CRISPRa5 BBM was expressed in unpollinated EC and successfully activated the targeted activation of ZmBBM2 gene in egg cells (f in Figure 3). At the same time, through RT-ddPCR technology, we also analyzed the expression levels of CRISPRa5 BBM and ZmBBM2 in the embryo sac/young embryos of EA19 materials before pollination and at different pollination times. Experimental data showed that no expression of CRISPRa5 BBM and ZmBBM2 was found in the unpollinated embryo sac of ZC01 wild type. With the passage of pollination time, the expression of ZmBBM2 increased rapidly but no expression of CRISPRa5 BBM was found. Expression of CRISPRa5 BBM and ZmBBM2 was found in unpollinated embryo sacs in EA19 material, and the expression of CRISPRa5 BBM and ZmBBM2 increased rapidly as pollination time went by (g and h in Figure 3).
4、CRISPRa5 BBM2介导的ZmBBM2基因卵细胞异位活性诱导玉米孤雌单倍体发生 4. CRISPRa5 BBM2- mediated ectopic activity of ZmBBM2 gene egg cells induces parthenogenetic haploidy in maize
为了确定CRISPRa5 BBM介导ZmBBM2基因在卵细胞中异位激活可诱发孤雌生殖表型现象的可能性,我们首先对受精前的胚囊进行观察(图4中a),实验发现3.9%的EA19籽粒在胚珠中发育出双胚囊。此外我们利用DFP单倍体辅助筛选玉米转基因株系评估CRISPRa5 BBM的单倍体发生率,我们使用DFP单倍体辅助筛选玉米转基因株系作为花粉供体对EA19材料进行授粉,当发生孤雌单倍体时则胚不发光,糊粉层发红光,而正常的二倍体则表现为胚发绿光而糊粉层发红光(图4中b)。我们激光共聚焦观察成熟种子横切样品,发现了EA19材料孤雌单倍体的发生。同时我们对筛选出的单倍体籽粒进行播种生长(图4中c)。通过表型发现与二倍体和野生型相比,单倍体植株更瘦弱矮小,且雌花不育。我们利用流式细胞术从叶肉原生质体中分离的细胞核进一步确定了倍性水平,并获得了单倍体的信号强度值,约为二倍体的一半(图4中d)。我们对所有9个CRISPRa5 BBM株系的单倍体粒产生率进行了统计,统计结果表明,CRISPRa5 BBM介导ZmBBM2基因在卵细胞中异位激活可诱发孤雌生殖表型,比率从0.44%到3.55%不等,大多数株系产生的单倍体粒在1.5%到2%之间(图4中e)。 In order to determine the possibility that CRISPRa5 BBM- mediated ectopic activation of the ZmBBM2 gene in egg cells can induce parthenogenesis phenotypes, we first observed the embryo sac before fertilization (a in Figure 4). The experiment found that 3.9% of EA19 seeds A double embryo sac develops in the ovule. In addition, we used DFP haploid-assisted screening of corn transgenic lines to evaluate the haploid incidence of CRISPRa5 BBM . We used DFP haploid-assisted screening of corn transgenic lines as pollen donors to pollinate EA19 materials. When parthenogenesis occurs When the embryo is ploidy, the embryo does not emit light and the aleurone layer emits red light. However, in normal diploid, the embryo emits green light and the aleurone layer emits red light (b in Figure 4). We observed cross-section samples of mature seeds with laser confocal observation and discovered the occurrence of parthenogenetic haploidy in EA19 material. At the same time, we sow and grow the selected haploid seeds (c in Figure 4). Phenotypic results showed that compared with diploid and wild-type plants, haploid plants were thinner and shorter, and the female flowers were sterile. We further determined the ploidy level using flow cytometry of nuclei isolated from mesophyll protoplasts and obtained signal intensity values for haploids, which were approximately half those of diploids (d in Figure 4). We calculated the haploid grain production rates of all nine CRISPRa5 BBM lines. The statistical results showed that CRISPRa5 BBM -mediated ectopic activation of the ZmBBM2 gene in egg cells can induce parthenogenesis phenotypes, with rates ranging from 0.44% to 3.55. % varied, with most strains producing haploid grains ranging from 1.5% to 2% (e in Figure 4).
上文中,所涉及的各序列如下:In the above, the sequences involved are as follows:
SEQ ID No.1:SEQ ID No.1:
Figure PCTCN2022113576-appb-000005
Figure PCTCN2022113576-appb-000005
Figure PCTCN2022113576-appb-000006
Figure PCTCN2022113576-appb-000006
Figure PCTCN2022113576-appb-000007
Figure PCTCN2022113576-appb-000007
SEQ ID No.2:SEQ ID No.2:
Figure PCTCN2022113576-appb-000008
Figure PCTCN2022113576-appb-000008
Figure PCTCN2022113576-appb-000009
Figure PCTCN2022113576-appb-000009
Figure PCTCN2022113576-appb-000010
Figure PCTCN2022113576-appb-000010
SEQ ID No.3:SEQ ID No.3:
Figure PCTCN2022113576-appb-000011
Figure PCTCN2022113576-appb-000011
Figure PCTCN2022113576-appb-000012
Figure PCTCN2022113576-appb-000012
SEQ ID No.4:SEQ ID No.4:
Figure PCTCN2022113576-appb-000013
Figure PCTCN2022113576-appb-000013
Figure PCTCN2022113576-appb-000014
Figure PCTCN2022113576-appb-000014
SEQ ID No.5:SEQ ID No.5:
Figure PCTCN2022113576-appb-000015
Figure PCTCN2022113576-appb-000015
Figure PCTCN2022113576-appb-000016
Figure PCTCN2022113576-appb-000016
SEQ ID No.6:SEQ ID No.6:
Figure PCTCN2022113576-appb-000017
Figure PCTCN2022113576-appb-000017
Figure PCTCN2022113576-appb-000018
Figure PCTCN2022113576-appb-000018
SEQ ID No.7:SEQ ID No.7:
Figure PCTCN2022113576-appb-000019
Figure PCTCN2022113576-appb-000019
SEQ ID No.8:SEQ ID No.8:
Figure PCTCN2022113576-appb-000020
Figure PCTCN2022113576-appb-000020
SEQ ID No.9:SEQ ID No.9:
Figure PCTCN2022113576-appb-000021
Figure PCTCN2022113576-appb-000021
SEQ ID No.10:SEQ ID No.10:
Figure PCTCN2022113576-appb-000022
Figure PCTCN2022113576-appb-000022
SEQ ID No.11:SEQ ID No.11:
Figure PCTCN2022113576-appb-000023
Figure PCTCN2022113576-appb-000023
SEQ ID No.12:SEQ ID No.12:
Figure PCTCN2022113576-appb-000024
Figure PCTCN2022113576-appb-000024
SEQ ID No.13:SEQ ID No.13:
Figure PCTCN2022113576-appb-000025
Figure PCTCN2022113576-appb-000025
SEQ ID No.14:SEQ ID No.14:
Figure PCTCN2022113576-appb-000026
Figure PCTCN2022113576-appb-000026
SEQ ID No.15:SEQ ID No.15:
Figure PCTCN2022113576-appb-000027
Figure PCTCN2022113576-appb-000027
SEQ ID No.16:SEQ ID No.16:
Figure PCTCN2022113576-appb-000028
Figure PCTCN2022113576-appb-000028
Figure PCTCN2022113576-appb-000029
Figure PCTCN2022113576-appb-000029
Figure PCTCN2022113576-appb-000030
Figure PCTCN2022113576-appb-000030
工业应用Industrial applications
本发明利用玉米叶肉生质体瞬时表达系统筛选并鉴定了6套基于CRISPR的基因激活系统,结果表明由dCas9-VP64和p65-HSF1激活域组成的CRISPRa5系统在玉米叶肉原生质体中表现出最强的转录激活活性;接着,基于酶解及显微操作技术,本发明建立了玉米卵细胞分离与转化体系,结合卵细胞特异表达的CRISPRa5基因激活系统,验证了CRISPRa5基因激活工具在玉米卵细胞中具有较强的转录激活活性;然后,以玉米内源基因ZmBBM2启动子为靶点,设计CRISPRa5卵细胞异位激活内源ZmBBM2基因载体,结合农杆菌介导玉米幼胚转化技术获得9个转基因事件(EA13,EA15,EA19,EA21,EA22,EA28,EA34,EA36),利用原位杂交技术、免疫组化、单细胞RT-ddPCR技术等生物化学与分子生物学手段对转化事件进行检测,确定了CRISPRa5在卵细胞中表达且可异位激活内源ZmBBM2基因实现高水平的基因激活;最终,结合流式细胞技术、DFP单倍体筛选系统及表型观察本发明发现,利用CRISPRa5基因激活系统于玉米卵细胞中异位激活ZmBBM2基因可以诱发玉米孤雌单倍体的发生,且最高单倍体诱导发生效率高达3.55%。综上所述,本发明利用CRISPRa5基因激活系统介导玉米卵细胞异位表达ZmBBM2基因,进而诱发玉米孤雌单倍体发生,该技术为工程化的玉米孤雌单倍体生殖的创制提供了一种新的途径与方法。本发明为获得作物单倍体提供了一种有效的途径。同时,为玉米孤雌生殖提供了新的基因资源,为其他作物的无融合生殖合成提供了新的思路。The present invention used the corn mesophyll protoplast transient expression system to screen and identify 6 sets of CRISPR-based gene activation systems. The results showed that the CRISPRa5 system composed of dCas9-VP64 and p65-HSF1 activation domains showed the strongest performance in corn mesophyll protoplasts. The transcriptional activation activity of The transcriptional activation activity of the maize endogenous gene ZmBBM2 was then used as a target to design a CRISPRa5 egg cell ectopic activation endogenous ZmBBM2 gene vector, combined with Agrobacterium-mediated transformation of maize immature embryos, to obtain 9 transgenic events (EA13, EA15 , EA19, EA21, EA22, EA28, EA34, EA36), using in situ hybridization technology, immunohistochemistry, single-cell RT-ddPCR technology and other biochemical and molecular biology methods to detect transformation events, it was determined that CRISPRa5 is present in egg cells Expresses and can ectopically activate the endogenous ZmBBM2 gene to achieve high-level gene activation; finally, by combining flow cytometry, DFP haploid screening system and phenotypic observation, the present invention found that the CRISPRa5 gene activation system was used to ectopically activate maize eggs. Activating the ZmBBM2 gene can induce the occurrence of parthenogenetic haploidy in maize, and the highest haploid induction efficiency is as high as 3.55%. In summary, the present invention uses the CRISPRa5 gene activation system to mediate the ectopic expression of ZmBBM2 gene in maize egg cells, thereby inducing the occurrence of maize parthenogenetic haploidy. This technology provides a method for the creation of engineered maize parthenogenetic haploid reproduction. new ways and methods. The present invention provides an effective way to obtain crop haploid. At the same time, it provides new genetic resources for maize parthenogenesis and provides new ideas for the synthesis of apomixis in other crops.

Claims (17)

  1. 一种获得玉米孤雌单倍体的方法,包括如下步骤:将用于介导玉米卵细胞异位表达ZmBBM2基因的CRISPRa5基因激活系统的系统载体导入受体玉米,得到阳性转基因玉米;所述转基因阳性玉米的部分胚囊中的卵细胞能直接发育成为单倍体胚,形成单倍体种子;A method for obtaining maize parthenogenetic haploids, including the following steps: introducing a system vector of the CRISPRa5 gene activation system used to mediate the ectopic expression of ZmBBM2 gene in maize egg cells into recipient maize to obtain positive transgenic maize; the transgene is positive The egg cells in some embryo sacs of corn can directly develop into haploid embryos and form haploid seeds;
    所述系统载体能够表达dCas9-VP64融合蛋白,能够表达MS2-P65-HSF1融合蛋白,并且能够表达靶向ZmBBM2基因转录起始点上游-200至-1bp位置的一个或若干个sgRNA,所述sgRNA为sgRNA2.0。The system vector can express dCas9-VP64 fusion protein, can express MS2-P65-HSF1 fusion protein, and can express one or several sgRNA targeting the -200 to -1 bp position upstream of the ZmBBM2 gene transcription start point, and the sgRNA is sgRNA2.0.
  2. 根据权利要求1所述的方法,其特征在于:所述系统载体上含有如下2个表达盒:The method according to claim 1, characterized in that: the system vector contains the following two expression cassettes:
    表达盒1:用于表达2个所述sgRNA;在所述表达盒1中,含有2个sgRNA的编码序列,2个所述sgRNA的编码序列以拟南芥At-tRNAGly编码序列连接,由同一个启动子启动转录;Expression cassette 1: used to express two sgRNAs; in the expression cassette 1, it contains the coding sequences of two sgRNAs, and the coding sequences of the two sgRNAs are connected with the Arabidopsis thaliana At-tRNAGly coding sequence and are composed of the same A promoter initiates transcription;
    表达盒2:用于表达功能蛋白,所述功能蛋白为将所述dCas9-VP64融合蛋白和所述MS2-P65-HSF1融合蛋白通过自裂解多肽T2A融合而成;在所述表达盒2中,所述功能蛋白由植物卵细胞特异性启动子启动转录。Expression cassette 2: used to express functional proteins, which are formed by fusing the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein through the self-cleaving polypeptide T2A; in the expression cassette 2, The functional protein is transcribed by a plant egg cell-specific promoter.
  3. 根据权利要求2所述的方法,其特征在于:在所述表达盒1中,所述启动子为ZmU6-2启动子。The method according to claim 2, characterized in that: in the expression cassette 1, the promoter is a ZmU6-2 promoter.
  4. 根据权利要求2所述的方法,其特征在于:在所述表达盒2中,所述植物卵细胞特异性启动子为拟南芥DD45启动子。The method according to claim 2, characterized in that: in the expression cassette 2, the plant egg cell-specific promoter is the Arabidopsis thaliana DD45 promoter.
  5. 根据权利要求2所述的方法,其特征在于:所述dCas9-VP64融合蛋白和所述MS2-P65-HSF1融合蛋白均含有核定位信号。The method according to claim 2, characterized in that: both the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein contain nuclear localization signals.
  6. 根据权利要求3-5中任一所述的方法,其特征在于:所述表达盒1自5’端到3’端依次由所述ZmU6-2启动子、靶向ZmBBM2基因转录起始点上游-36bp区域的sgRNA间隔序列的编码核酸、sgRNA2.0支架编码序列、所述拟南芥At-tRNAGly编码序列、靶向ZmBBM2基因转录起始点上游-166bp区域的sgRNA间隔序列的编码核酸、所述sgRNA2.0支架编码序列组成。The method according to any one of claims 3-5, characterized in that: the expression cassette 1 is sequentially composed of the ZmU6-2 promoter and the upstream of the ZmBBM2 gene transcription starting point from the 5' end to the 3' end - The coding nucleic acid of the sgRNA spacer sequence in the 36 bp region, the sgRNA2.0 scaffold coding sequence, the Arabidopsis thaliana At-tRNAGly coding sequence, the coding nucleic acid of the sgRNA spacer sequence targeting the 166 bp region upstream of the ZmBBM2 gene transcription start point, the sgRNA2 .0 scaffold coding sequence composition.
  7. 根据权利要求3-5中任一所述的方法,其特征在于:所述表达盒2自5’端到3’端依次由所述拟南芥DD45启动子、3×Flag标签编码核酸、SV40核定位信号编码序列、dCas9编码基因、NLS核定位信号编码序列、VP64编码基因、所述自裂解多肽T2A的编码基因、MS2编码基因、所述SV40核定位信号编码序列、P65编码基因、HSF1编码基因和转录终止子NOS组成。The method according to any one of claims 3 to 5, characterized in that: the expression cassette 2 consists of the Arabidopsis DD45 promoter, 3×Flag tag encoding nucleic acid, and SV40 from the 5' end to the 3' end. Nuclear localization signal coding sequence, dCas9 coding gene, NLS nuclear localization signal coding sequence, VP64 coding gene, the coding gene of the self-cleaving polypeptide T2A, MS2 coding gene, the SV40 nuclear localization signal coding sequence, P65 coding gene, HSF1 coding Gene and transcription terminator NOS.
  8. 根据权利要求3-7中任一所述的方法,其特征在于:所述ZmU6-2启动子如SEQ ID No.16的第1-397位所示;和/或The method according to any one of claims 3-7, characterized in that: the ZmU6-2 promoter is shown in positions 1-397 of SEQ ID No. 16; and/or
    所述靶向ZmBBM2基因转录起始点上游-36bp区域的sgRNA间隔序列的编码核酸如SEQ ID No.16的第398-417位所示;和/或The coding nucleic acid of the sgRNA spacer sequence targeting the -36 bp region upstream of the ZmBBM2 gene transcription start point is shown in positions 398-417 of SEQ ID No. 16; and/or
    所述sgRNA2.0支架编码序列如SEQ ID No.16的第418-553位或第651-793位所示;和/或The sgRNA2.0 scaffold coding sequence is shown in positions 418-553 or 651-793 of SEQ ID No. 16; and/or
    所述拟南芥At-tRNAGly编码序列如SEQ ID No.16的第554-630位所示;和/或The Arabidopsis thaliana At-tRNAGly coding sequence is shown in positions 554-630 of SEQ ID No. 16; and/or
    所述靶向ZmBBM2基因转录起始点上游-166bp区域的sgRNA间隔序列的编码核酸如SEQ ID No.16的第631-650位所示;The coding nucleic acid of the sgRNA spacer sequence targeting the -166bp region upstream of the ZmBBM2 gene transcription start point is shown in positions 631-650 of SEQ ID No. 16;
    和/或and / or
    所述拟南芥DD45启动子如SEQ ID No.16的第1008-2017位所示;和/或The Arabidopsis thaliana DD45 promoter is shown in positions 1008-2017 of SEQ ID No. 16; and/or
    所述3×Flag标签编码核酸如SEQ ID No.16的第2018-2086位所示;和/或The 3×Flag tag encoding nucleic acid is shown in bits 2018-2086 of SEQ ID No. 16; and/or
    所述SV40核定位信号编码序列如SEQ ID No.16的第2093-2113位或第6992-7012位所示;和/或The SV40 nuclear localization signal coding sequence is shown in bits 2093-2113 or 6992-7012 of SEQ ID No. 16; and/or
    所述dCas9编码基因如SEQ ID No.16的第2114-6238位所示;和/或The dCas9 encoding gene is shown in positions 2114-6238 of SEQ ID No. 16; and/or
    所述NLS核定位信号编码序列如SEQ ID No.16的第6239-6286位所示;和/或The NLS nuclear localization signal coding sequence is shown in bits 6239-6286 of SEQ ID No. 16; and/or
    所述VP64编码基因如SEQ ID No.16的第6305-6454位所示;和/或The VP64 encoding gene is shown in positions 6305-6454 of SEQ ID No. 16; and/or
    所述自裂解多肽T2A的编码基因如SEQ ID No.16的第6482-6535位所示;和/或The coding gene of the self-cleaving polypeptide T2A is shown in positions 6482-6535 of SEQ ID No. 16; and/or
    所述MS2编码基因如SEQ ID No.16的第6551-6937位所示;和/或The MS2 encoding gene is shown in positions 6551-6937 of SEQ ID No. 16; and/or
    所述P65编码基因如SEQ ID No.16的第7028-7570位所示;和/或The P65 encoding gene is shown in positions 7028-7570 of SEQ ID No. 16; and/or
    所述HSF1编码基因如SEQ ID No.16的第7595-7966位所示;和/或The HSF1 encoding gene is shown in positions 7595-7966 of SEQ ID No. 16; and/or
    所述转录终止子NOS如SEQ ID No.16的第7967-8238位所示。The transcription terminator NOS is shown in positions 7967-8238 of SEQ ID No. 16.
  9. 根据权利要求8所述的方法,其特征在于:所述表达盒1的核苷酸序列如SEQ ID No.16的第1-1007位所示;和/或The method according to claim 8, characterized in that: the nucleotide sequence of the expression cassette 1 is shown in positions 1-1007 of SEQ ID No. 16; and/or
    所述表达盒2的核苷酸序列如SEQ ID No.16的第1008-8238位所示。The nucleotide sequence of the expression cassette 2 is shown in positions 1008-8238 of SEQ ID No. 16.
  10. 根据权利要求9所述的方法,其特征在于:所述CRISPRa5基因激活系统的系统载体为含有SEQ ID No.16所示DNA片段的质粒。The method according to claim 9, characterized in that: the system vector of the CRISPRa5 gene activation system is a plasmid containing the DNA fragment shown in SEQ ID No. 16.
  11. 权利要求1-10任一中所述的用于介导玉米卵细胞异位表达ZmBBM2基因的所述CRISPRa5基因激活系统的系统载体。The system vector of the CRISPRa5 gene activation system used to mediate the ectopic expression of ZmBBM2 gene in maize egg cells according to any one of claims 1 to 10.
  12. 权利要求11所述系统载体在诱发玉米孤雌单倍体发生中的应用。The application of the system vector described in claim 11 in inducing the occurrence of parthenogenetic haploidy in maize.
  13. 一种基于CRISPR的基因激活系统,其特征在于:所述基于CRISPR的基因激活系统的系统载体能够表达dCas9-VP64融合蛋白,能够表达MS2-P65-HSF1融合蛋白,并且能够表达靶向目标基因靶标区域的一个或若干个sgRNA,所述sgRNA为sgRNA2.0。A CRISPR-based gene activation system, characterized in that: the system vector of the CRISPR-based gene activation system can express dCas9-VP64 fusion protein, can express MS2-P65-HSF1 fusion protein, and can express targeted gene targets One or several sgRNAs in the region, and the sgRNA is sgRNA2.0.
  14. 根据权利要求13所述的基于CRISPR的基因激活系统,其特征在于:所述系统载体上含有如下2个表达盒:The CRISPR-based gene activation system according to claim 13, characterized in that: the system vector contains the following two expression cassettes:
    表达盒1’:用于表达所述sgRNA;Expression cassette 1': used to express the sgRNA;
    表达盒2’:用于表达功能蛋白,所述功能蛋白为将所述dCas9-VP64融合蛋白和所述MS2-P65-HSF1融合蛋白通过自裂解多肽T2A融合而成。Expression cassette 2': used to express functional proteins, which are formed by fusing the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein through the self-cleaving polypeptide T2A.
  15. 根据权利要求14所述的基于CRISPR的基因激活系统,其特征在于:在所述表达盒1’中,用于启动所述sgRNA转录的启动子为U6启动子;和/或The CRISPR-based gene activation system according to claim 14, characterized in that: in the expression cassette 1', the promoter used to initiate the transcription of the sgRNA is a U6 promoter; and/or
    在所述表达盒2’中,用于启动所述功能蛋白转录的启动子为UBI启动子。In the expression cassette 2', the promoter used to initiate the transcription of the functional protein is the UBI promoter.
  16. 根据权利要求14所述的基于CRISPR的基因激活系统,其特征在于:所述dCas9-VP64融合蛋白和所述MS2-P65-HSF1融合蛋白均含有核定位信号。The CRISPR-based gene activation system according to claim 14, wherein both the dCas9-VP64 fusion protein and the MS2-P65-HSF1 fusion protein contain nuclear localization signals.
  17. 权利要求13-16中任一所述基于CRISPR的基因激活系统在对目的基因进行基因激活中的应用。Application of the CRISPR-based gene activation system in any one of claims 13-16 for gene activation of target genes.
PCT/CN2022/113576 2022-08-19 2022-08-19 Autonomous parthenogenetic haploid reproduction method for maternal cells and use thereof in breeding WO2024036600A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/113576 WO2024036600A1 (en) 2022-08-19 2022-08-19 Autonomous parthenogenetic haploid reproduction method for maternal cells and use thereof in breeding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/113576 WO2024036600A1 (en) 2022-08-19 2022-08-19 Autonomous parthenogenetic haploid reproduction method for maternal cells and use thereof in breeding

Publications (1)

Publication Number Publication Date
WO2024036600A1 true WO2024036600A1 (en) 2024-02-22

Family

ID=89940458

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/113576 WO2024036600A1 (en) 2022-08-19 2022-08-19 Autonomous parthenogenetic haploid reproduction method for maternal cells and use thereof in breeding

Country Status (1)

Country Link
WO (1) WO2024036600A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111885915A (en) * 2018-03-19 2020-11-03 瑞泽恩制药公司 Transcriptional modulation of animals using CRISPR/CAS systems
CN112204147A (en) * 2017-12-22 2021-01-08 科沃施种子欧洲股份两合公司 Cpf 1-based plant transcription regulatory system
CN113637658A (en) * 2021-08-19 2021-11-12 中国农业科学院棉花研究所 dCas 9-oToV-based gene transcription system and application thereof
WO2022087616A1 (en) * 2020-10-21 2022-04-28 Pioneer Hi-Bred International, Inc. Parthenogenesis factors and methods of using same

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112204147A (en) * 2017-12-22 2021-01-08 科沃施种子欧洲股份两合公司 Cpf 1-based plant transcription regulatory system
CN111885915A (en) * 2018-03-19 2020-11-03 瑞泽恩制药公司 Transcriptional modulation of animals using CRISPR/CAS systems
WO2022087616A1 (en) * 2020-10-21 2022-04-28 Pioneer Hi-Bred International, Inc. Parthenogenesis factors and methods of using same
CN113637658A (en) * 2021-08-19 2021-11-12 中国农业科学院棉花研究所 dCas 9-oToV-based gene transcription system and application thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DU XUEMEI, FANG TING, LIU YAN, HUANG LIYING, ZANG MAOSEN, WANG GUOYING, LIU YUNJUN, FU JUNJIE: "Transcriptome Profiling Predicts New Genes to Promote Maize Callus Formation and Transformation", FRONTIERS IN PLANT SCIENCE, FRONTIERS RESEARCH FOUNDATION, CH, vol. 10, 20 December 2019 (2019-12-20), CH , pages 1633, XP093140213, ISSN: 1664-462X, DOI: 10.3389/fpls.2019.01633 *
IMTIYAZ KHANDAY ET AL.: "A Male-Expressed Rice Embryogenic Trigger Redirected for Asexual Propagation Through Seeds", NATURE, vol. 565, 12 December 2018 (2018-12-12), XP036664324, ISSN: 0028-0836, DOI: 10.1038/s41586-018-0785-8 *
黎裕 等 (LI, YU ET AL.): "玉米生物育种: 现状与展望 (Maize Breeding: Advances and Prospects)", 中国基础科学 (CHINA BASIC SCIENCE), vol. 24, no. 4, 15 August 2022 (2022-08-15), ISSN: 1009-2412 *

Similar Documents

Publication Publication Date Title
EP3186381B1 (en) Generation of haploid plants
Althoff et al. Comparison of the MpEF1α and CaMV35 promoters for application in Marchantia polymorpha overexpression studies
EP3292204B1 (en) Polynucleotide responsible of haploid induction in maize plants and related processes
US20200263189A1 (en) Systems and methods for cellular reprogramming of a plant cell
US20200216853A1 (en) Method for rapidly and efficiently creating directed gene mutated non-transgenic plants and its applications
US20230175000A1 (en) Maize gene zmravl1 and functional site and use thereof
JP2013074892A (en) Method of producing haploid plant embryo and doubled haploid plant embryo
CN110093354B (en) Seed sorting method for fixing plant heterosis
CN110892074A (en) Compositions and methods for increasing the shelf life of bananas
US20230183729A1 (en) Methods of increasing seed yield
Jiang et al. ORYZA SATIVA MYOSIN XI B controls pollen development by photoperiod-sensitive protein localizations
Zhao et al. The maternal-to-zygotic transition in higher plants: available approaches, critical limitations, and technical requirements
Qi et al. CRISPR/dCas-mediated gene activation toolkit development and its application for parthenogenesis induction in maize
US20240002877A1 (en) Parthenogenesis factors and methods of using same
US7659448B2 (en) Plant regulatory sequences for selective control of gene expression
Chen et al. Creation of a watermelon haploid inducer line via ClDMP3-mediated single fertilization of the central cell
EP2989889B1 (en) Generation of haploid plants
JP7335383B2 (en) Plant regulatory elements and uses thereof
WO2024036600A1 (en) Autonomous parthenogenetic haploid reproduction method for maternal cells and use thereof in breeding
CN115786390A (en) Maternal cell autonomous parthenogenetic haploid reproduction method and breeding application thereof
US20230407324A1 (en) Doubled haploid inducer
Heliel Functional evaluation of RAMOSA2 cis-regulatory regions from different grass species for shaping inflorescence architecture in barley
Tovar-Aguilar et al. MIR822 modulates monosporic female gametogenesis through an ARGONAUTE9-dependent pathway in Arabidopsis thaliana
WO2023150657A2 (en) Monocot leaf explant preparation
EA043050B1 (en) WAYS TO INCREASE GRAIN YIELD

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22955380

Country of ref document: EP

Kind code of ref document: A1