WO2023050644A1 - Chimeric allosteric transcription factor for mammalian system, regulatory element group, and inducible expression system - Google Patents

Chimeric allosteric transcription factor for mammalian system, regulatory element group, and inducible expression system Download PDF

Info

Publication number
WO2023050644A1
WO2023050644A1 PCT/CN2022/071023 CN2022071023W WO2023050644A1 WO 2023050644 A1 WO2023050644 A1 WO 2023050644A1 CN 2022071023 W CN2022071023 W CN 2022071023W WO 2023050644 A1 WO2023050644 A1 WO 2023050644A1
Authority
WO
WIPO (PCT)
Prior art keywords
chimeric
sequence
domain
transcription factor
promoter
Prior art date
Application number
PCT/CN2022/071023
Other languages
French (fr)
Chinese (zh)
Inventor
娄春波
项延会
王卫军
李婷婷
陈茜
Original Assignee
中国科学院深圳先进技术研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国科学院深圳先进技术研究院 filed Critical 中国科学院深圳先进技术研究院
Publication of WO2023050644A1 publication Critical patent/WO2023050644A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K19/00Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/10Cells modified by introduction of foreign genetic material
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks

Definitions

  • the invention belongs to the field of synthetic biology and bioengineering, and relates to a chimeric allosteric transcription factor capable of regulating gene expression in a mammalian system, a regulatory element group and an induced expression system.
  • the present invention relates to the use of natural amino acid-derived acyl homoserine lactones as inducers to regulate the transcription of a gene of interest in a host cell.
  • the Streptomyces coelicolor quorum sensing receptor ScbR or the Streptomyces pristinaespiralis transcription factor SpbR can be fused to the eukaryotic transcriptional activation domain VP16 of herpes simplex virus as a transcriptional activator. factor, and the corresponding operator sequence was assembled to the 5' end of the CMVmini promoter to construct a mammalian inducible expression system using butyrolactone compounds as inducers [1] .
  • the above expression systems are all positive control repressor systems, in which allosteric transcription factors are detached from DNA in the presence of small molecules.
  • WO2007058527A2 reports the transfer of a prokaryotic-derived Tet regulatory system to a mammalian system.
  • the repressor protein TetR specifically binds to its operator sequence tetO to inhibit downstream gene transcription, while the binding of tetracycline or doxycycline (dox) to TetR triggers a conformational change, releases the promoter region, and turns on gene transcription.
  • dox tetracycline or doxycycline
  • VP16 can be fused with TetR to construct an allosteric transcription factor tTA, which can regulate eukaryotic promoters containing tetO sequences.
  • TetR can be mutated so that it binds to DNA after being combined with an inducer.
  • This negative control system can be activated by a lower concentration of inducer (such as 44ng/ml doxycycline).
  • the quorum sensing system is a system that exists in bacteria and responds to bacterial population density signals. It realizes intercellular communication by synthesizing, secreting, detecting, and responding at the population level to coordinate gene expression within the population. This process is closely related to bioluminescence, antibiotic synthesis and biofilm formation.
  • the signaling molecules secreted by different species of bacteria have different chemical structures, which provide more options for constructing artificial expression systems that are orthogonal to the gene regulatory network of mammalian hosts.
  • the quorum sensing system of Proteus uses acyl homoserine lactone (N-acyl homoserine lactone, AHL) as a signal molecule, and the LuxR family allosteric transcription factor acts as a receptor to transduce the concentration information of the signal molecule. It has been reported in the field that the LuxR family allosteric transcription factor was fused with the eukaryotic transactivation domain NF- ⁇ B p65 or VP16 to construct a chimeric transcription factor; and the operator sequence was added to the CMVmini promoter to construct an inducible promoter responsive to AHL [2]-[3] .
  • the EC50 levels of such mammalian induction systems reported in the art are above 10 ⁇ M, and the inducers are limited to 3-oxo-octanoyl-L-homoserine lactone (3OC8) or 3-oxo- Hexanoyl-L-homoserine lactone (3OC6); the acyl side chains of these two straight-chain fatty acid-type AHLs are highly similar, so the multichannel communication system based on this type of AHL exists at both the signal level and the promoter level Extensive crosstalk.
  • the corresponding allosteric transcription factors and operating sequences are identified by data mining of the quorum sensing system in prokaryotes using acyl homoserine lactone derived from natural amino acids as an inducer, and are compared with the eukaryotic transcriptional regulation system. Integrating, a novel inducible expression system for mammalian cells was constructed.
  • the present invention provides a chimeric allosteric transcription factor for regulating a gene of interest in a mammalian cell, the chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation structure domain; wherein, the regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; the transcriptional activation domain is selected from P65, VP16, VP64, VTR1, VTR2 and VTR3; the chimeric type Constitutive transcription factors use natural amino acid derived acyl homoserine lactones as inducers; said natural amino acid derived acyl homoserine lactones are selected from the group consisting of the following compounds: p-coumaroyl-homoserine lactone (pC -HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL).
  • the regulatory domain is selected from the group consisting of Rp
  • the present invention provides a set of regulatory elements using natural amino acid-derived acyl homoserine lactones as inducers selected from the group consisting of : p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL);
  • the set of regulatory elements comprises:
  • a chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation domain; wherein said regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; said The transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3; and
  • An inducible promoter active in mammalian cells comprising a core sequence and at least one operator site pair interacting with said chimeric allosteric transcription factor.
  • the present invention provides a system for inducing expression of a gene of interest in mammals, the inducible expression system utilizes acyl homoserine lactone derived from a natural amino acid as an inducer, and the acyl homoserine lactone derived from a natural amino acid Acyl homoserine lactones are selected from the group consisting of p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone Lactone (Cinn-HSL); the inducible expression system comprises:
  • a first gene expression cassette comprising a first promoter sequence and a coding sequence of a chimeric allosteric transcription factor, wherein the chimeric allosteric transcription factor comprises a regulatory domain and a transcriptional activation structure domain;
  • the regulatory domain is selected from the group consisting of RpaR, BjaR and BraR;
  • the transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3;
  • a second gene expression cassette comprising an inducible promoter sequence and the coding sequence of the gene of interest, the inducible promoter sequence comprising an allosteric transcription factor interacting with the chimeric type At least one operator site pair for .
  • the present invention provides the chimeric allosteric transcription factor described in the first aspect, the set of regulatory elements described in the second aspect, and the inducible expression system described in the third aspect in the construction of single-cell or multi-cell signaling Communication systems and use in models of human disease.
  • the present invention by identifying and characterizing the operator sequences of the above-mentioned transcription factors, and adding the operator sequences to eukaryotic promoters, constructs capable of responding to pC-HSL, IV-HSL and/or Cinn-HSL in mammalian hosts
  • the inducible promoter of HSL on the other hand, the eukaryotic transactivator is fused as the transcriptional activation domain to the above-mentioned transcriptional activator of the LuxR family to construct a chimeric allosteric transcription factor capable of regulating gene expression in a eukaryotic host factor.
  • the regulatory domain of the chimeric allosteric transcription factor constructed in the present invention is combined with the above-mentioned inducer to change its conformation, so that it can be compatible with the corresponding
  • the operator sequence is combined, the transcriptional activation domain of the chimeric allosteric transcription factor is positioned to a promoter containing the corresponding operator sequence, and the gene controlled by the promoter is induced to express.
  • the chimeric allosteric transcription factor dissociates from the DNA, turning off gene transcription.
  • the induction system can be used in various types of mammalian cells, tissues or organisms in a modular manner to regulate the expression of genes of interest.
  • the chimeric allosteric transcription factor of the present invention is a transcription activator without the above-mentioned systemic defects.
  • the currently reported induction system using straight-chain fatty acid type AHL as an inducer responds to 3OC8 signal by means of VP16-TraR fusion protein, and the half-maximum effective concentration EC50 of the system is 7-14 ⁇ M [3] , while the pC-TraR of the present invention
  • the EC50 of HSL and Cinn-HSL induction system are 0.8 ⁇ M and 1.25nM, respectively.
  • the induction system of the present invention can achieve a hypersensitive response. Since it can be induced at a lower working concentration, the cytotoxicity of the inducer to cells is effectively reduced.
  • the maximum induction multiple of the 3OC8 induction system is about 20 times, while the maximum induction multiple of the pC-HSL induction system of the present invention can reach 100 times, and the signal-to-noise ratio is higher.
  • the induction system of the present invention can be used in combination with the straight-chain fatty acid-type AHL induction system to construct multiple A complex regulatory network or gene circuit of signals.
  • the chimeric allosteric transcription factor further comprises a multimerization domain CarH.
  • a multimerization domain increases steric hindrance near the promoter, increases the steepness of the induction curve, reduces background expression, and shifts the induction threshold. High for better transcription switch performance. For example, in the process of vaccine production, excessive background expression will lead to the loss of inducer dependence, thereby resulting in excessive immunogenicity, so reduced background expression is extremely important.
  • fusion of the CarH multimerization domain can increase the sensitivity of the pC-inducible system by more than 5 times.
  • linearization can reduce signal distortion and improve linearity of the amplifier in the circuit, which is very important for peak frequency adaptation and conversion of non-linear transient trigger signals into stable linear responses.
  • the linear transformation of the reporter level can be achieved by adding negative feedback in the regulatory network, reducing the heterogeneity of expression and changing the shape of the sigmoid response curve.
  • the dynamic range of the inducible expression system is expanded by adding a TetR-based negative feedback system, so that the concentration of the inducer has a linear relationship with the expression level of the target protein within a wide range. Linearization of such regulatory elements is critical for the rational design of regulatory networks.
  • Figure 1 shows the design principle and characterization of the induction system of the present invention.
  • A The design principle of chimeric allosteric transcription factors and the working principle of the induction system comprising said chimeric allosteric transcription factors.
  • B The pC inducible system, the Cinn inducible system and the inducer of the IV inducible system, the regulatory domain of the chimeric allosteric transcription factor, and the EC50 and dynamic range of the induction curve according to an embodiment of the present invention.
  • C Induction curves of the pC inducible system, Cinn inducible system and IV inducible system measured by flow cytometry, the inducers used were pC-HSL, Cinn-HSL and IV-HSL, respectively.
  • FIG. 2 shows the identification and optimization of the operator sequence for the Cinn inducible system.
  • A is the result of comparing the sequences of the putative BraI promoter regions in four Bradyrhizobium oligotrophicum S58 (APO) and ORS278 (CUL and CUS) by BLAST . Box encloses the putative operator site braO.
  • B Induction curves of Cinn inducible systems containing promoters P CUS-TRE3G , P CUL-TRE3 , P ORS-TRE3G , P APO-TRE3G and P NC-TRE3G determined by flow cytometry.
  • the promoters P CUS-TRE3G , P CUL-TRE3 , P ORS-TRE3G , P APO-TRE3G and P NC-TRE3G respectively contain a pair of CUS, CUL, ORS, APO and NC operation sites.
  • C For the operator sequence CUS (wt) of Bradyrhizobium ORS278 strain, it was truncated from upstream or downstream, respectively, to obtain operator sequences F1, F2, F4, F7 (5' end truncated), and R1, R4, R6 , R7, R8 (3' end truncated).
  • Figure 3 shows the induction curves of the Cinn inducible system comprising the transcriptional activation domain VTR3 or VP64 measured by flow cytometry.
  • Figure 4 shows the effect of the multimerization domain on the performance of the induction system.
  • A shows the design principle of the chimeric allosteric transcription factor VTR3-CarH-RpaR containing the multimerization domain CarH and the working principle of the pC induction system containing the allosteric transcription factor.
  • B Induction curves of pC-inducible systems containing allosteric transcription factors VTR3-CarH-RpaR or VTR3-RpaR measured by flow cytometry.
  • Figure 5 shows that adding negative feedback to the induction system can linearize the response.
  • (a) The working principle of the regulation of the dual-input promoter pC-dox by the inducer pC-HSL and the allosteric transcription factor VTR3-RpaR, wherein the pC-dox promoter contains a pair of rpaO and a pair of tetO operation sites.
  • (b) The mechanism of regulation of the dual-input promoter pC-dox by the inducer dox and the allosteric transcription factor TetR.
  • (c) The induction curve of the pC-dox induction system was determined by flow cytometry. The data presented are the mean fluorescence values of triplicate experiments.
  • Figure 6 shows the signal crosstalk between the pC-inducible system and the Cinn-inducible system.
  • A The dose-response curves of the Cinn-inducing system were measured by flow cytometry at each concentration of pC-HSL (ORS-pC) or Cinn-HSL (ORS-Cinn).
  • B The dose-response curves of the pC-inducing system were determined by flow cytometry at various concentrations of pC-HSL (RpaO-pC) or Cinn-HSL (RpaO-Cinn).
  • Figure 7 shows the construction of mammalian cell-to-cell communication using the Cinn induction system of the present invention.
  • A Gene circuit diagram of mammalian cell communication system B4YP-2XR8.
  • B Gene circuit diagram of negative control module 2XR8.
  • C Communication response of B4YP-2XR8 or 2XR8 modules in mammalian cells. From left to right are the FITC channel, ECD channel, bright field and the overlay of the three. The magnification of the fluorescence microscope is 20 ⁇ , and the scale bar is 150 ⁇ m.
  • the present invention proposes to use natural amino acid-derived acyl homoserine lactone as an inducer to regulate gene expression in mammalian cells in a dose-dependent and reversible manner. To regulate.
  • the chimeric allosteric transcription factor of the present invention has a regulatory domain of prokaryotic origin and a transcriptional activation domain of eukaryotic origin, and this heterologous fusion enables the allosteric transcription factor to respond to heterologous signals.
  • Response The response to the inducer is achieved by the regulatory domain, and the transcription of the gene in mammals is controlled by the transcriptional activation domain.
  • the regulatory domain is a quorum sensing response protein RpaR, BjaR or BraR, and the regulatory domain is combined with acyl homoserine lactone pC-HSL, Cinn-HSL or IV-HSL derived from a natural amino acid to produce a conformation Change, bind to the operator sequence on the DNA, thereby positioning the transcriptional activation domain to the promoter containing the operator sequence, so the expression level of the target gene can be regulated by the concentration of the inducer.
  • RpaR quorum sensing response protein
  • BjaR or BraR the regulatory domain is combined with acyl homoserine lactone pC-HSL, Cinn-HSL or IV-HSL derived from a natural amino acid to produce a conformation Change, bind to the operator sequence on the DNA, thereby positioning the transcriptional activation domain to the promoter containing the operator sequence, so the expression level of the target gene can be regulated by the concentration of the inducer.
  • the production and response of signaling molecules are usually mediated by the autoinducer synthase LuxI and the response protein LuxR.
  • LuxI catalyzes the generation of signal molecule AHL with S-adenosylmethionine and acyl carrier protein (ACP) as substrates in vivo, and this kind of signal molecule can freely diffuse through the cell membrane; the allosteric transcription factor LuxR can interact with the AHL binds to regulate the transcription of quorum sensing-dependent genes.
  • the positions of LuxI and LuxR genes on the bacterial genome are usually close to each other [6] , and the promoter of LuxI contains an operator site that can be regulated by LuxR and AHL, so that the synthesis pathway of AHL includes positive feedback, so bacteria can be in a short time Faster synthesis of signaling molecules.
  • the quorum-sensing allosteric transcription factor is fused as a regulatory domain with the transcriptional activation domain derived from eukaryotic sources, and the operator sequence of the allosteric transcription factor is added to the eukaryotic promoter, and constructed in eukaryotic cells Regulatory element sets and induction systems in response to quorum sensing signals.
  • the inducible system in which the chimeric allosteric transcription factor comprises RpaR as the regulatory domain and the inducible promoter comprises rpaO as the operating site is called the pC inducible system.
  • the The inducer is p-coumaroyl-homoserine lactone (pC-HSL).
  • the inducible system in which the chimeric allosteric transcription factor comprises BraR as the regulatory domain and the inducible promoter comprises braO as the operating site is called the Cinn inducible system.
  • the The inducer was cinnamoyl-homoserine lactone (Cinn-HSL).
  • the induction system in which the chimeric allosteric transcription factor includes BjaR as the regulatory domain and the inducible promoter includes bjaO as the operating site is called the IV induction system.
  • the The inducer is isovaleryl-homoserine lactone (IV-HSL).
  • IV-HSL isovaleryl-homoserine lactone
  • inducers used in the present invention are acyl homoserine lactones derived from natural amino acids, which are currently only found in bacteria that are symbiotic with plants, and these small molecules are involved in bacteria-plant interactions.
  • straight-chain fatty acid-type AHL is catalyzed by Lux type I N-acyl homoserine lactone synthase, and the substrates of this reaction are S-adenosylmethionine and acyl-acyl carrier protein (Acyl-ACP) .
  • the substrates of Lux type I synthases RpaI, BraI and BjaI are acyl-coenzyme A (Acyl-CoA) rather than acyl-acyl carrier protein.
  • ACP and coenzyme A have similar chemical properties, both form adducts with carboxylated substrates and present these substrates to the enzyme.
  • ACP usually lies in the biosynthesis and transfer of fatty acid substrates; while CoA is related to carboxylation substrates with more diverse structures: Coenzyme A modified compounds are usually intermediates in biosynthesis and biodegradation pathways [5] . Acyl homoserine lactones derived from natural amino acids can freely diffuse into eukaryotic cells, exist stably in cells, and cannot be synthesized by eukaryotic own metabolic pathways. This heterologous inducer can effectively reduce the background expression.
  • the LuxR homologous protein is used as a regulatory domain to detect and respond to signal molecules. Its structure includes a domain that binds to an inducer and a domain that binds to a promoter operator sequence. Binding of the regulatory domain to the small molecule causes a conformational change that alters the transcriptional level of downstream genes controlled by the promoter.
  • allosteric transcription factors that bind to DNA to trigger or enhance the transcription of target genes are called activators, and vice versa are called repressors.
  • the function of the chimeric allosteric transcription factor of the present invention is an activator. Inducible promoters contain operator sequences capable of interacting with regulatory domains.
  • an inducible promoter may comprise one or more pairs of operator sequences, and the operator site pairs may both be located upstream or downstream of the transcription initiation site, or located respectively at the transcription initiation site. both sides of the starting point.
  • promoter refers to a region of DNA that drives the expression or transcription of a nucleic acid sequence, located synonymously upstream of the transcription initiation site of a gene. RNA polymerase and transcription factors are able to bind DNA at the promoter.
  • An inducible promoter generally includes a transcriptional regulatory region, an RNA polymerase recognition region, and a transcriptional initiation site. It is known in the art that the transcription of the gene can be dynamically turned on or off by operably linking the coding sequence of the gene of interest to the downstream of an inducible promoter, and adding an inducer at a desired time.
  • minimal promoter or "promoter's core sequence” refers to the smallest transcriptional control unit capable of triggering transcription in eukaryotic cells, such as a promoter core region comprising only the RNA polymerase recognition region and the transcription initiation site . At least one operator site can be added upstream and/or downstream of the minimal promoter to construct an inducible promoter with excellent switch performance.
  • the core sequence of the inducible promoter can be selected from CMV1 promoter (SEQ ID NO: 1), CMVmini promoter (SEQ ID NO: 2), TRE3G promoter (SEQ ID NO: 3), EF1a core promoter (SEQ ID NO: 4) or hEF1a promoter (SEQ ID NO: 40); preferably, the core sequence is a CMV1 promoter or a TRE3G promoter.
  • the inducible system is a pC inducible system.
  • the inducer is pC-HSL
  • the regulatory domain of the chimeric allosteric transcription factor is RpaR
  • the operator sequence contained in the inducible promoter is rpaO (Table 1).
  • the natural host of the pC quorum sensing system is Rhodopseudomonas palustris, which is symbiotic with plants, and its quorum sensing system includes RpaI of the LuxI family and RpaR of the LuxR family.
  • the transcription of RpaI is jointly activated by pC-HSL and RpaR.
  • Rhodopseudomonas palustris RCB100 strain the coding sequence of RpaR is the 348672-349403bp antisense strand of its whole genome sequence (NCBI accession number is WP_011155889.1).
  • concentration of pC-HSL is on the order of 1-10 ⁇ M, and the bacterium can appear quorum sensing in the presence of nM pC-HSL [7] .
  • the RpaI promoter containing the rpaO operator sequence could not be activated by straight-chain fatty acid-type AHL, indicating that there was good orthogonality between the pC quorum sensing system and the straight-chain fatty acid-type AHL quorum sensing system at working concentrations.
  • This system can be used in combination with a linear fatty acid-type AHL induction system (for example, 3OC6 or 3OC8 as an inducer) to construct a multiple induction system.
  • the pC induction system can also be regulated using Cinn-HSL as an inducer.
  • the pC-HSL synthesis pathway that exists in Rhodopseudomonas palustris is not complete, and requires The biosynthesis of pC-HSL can only be achieved by adding p-coumarate to the medium.
  • tyrosine ammonia lyase TAL, 4-coumaric acid coenzyme A ligase 4CL and p-coumaroyl-homoserine lactone synthetase RpaI can be used to catalyze the synthesis of p-coumaroyl-homoserine from tyrosine Serine lactone [8] .
  • the induction system is a Cinn induction system.
  • the inducer is Cinn-HSL
  • the regulatory domain of the chimeric allosteric transcription factor is BraR
  • the operator sequence contained in the inducible promoter is braO (Table 1).
  • braO the operator sequences capable of responding to Cinn-HSL and BraR are collectively referred to as braO, for example, braO can be NC (SEQ ID NO: 5), ORS (SEQ ID NO: 6), APO (SEQ ID NO: 7), CUS (SEQ ID NO: 8), CUL (SEQ ID NO: 9), R1 (SEQ ID NO: 10), R4 (SEQ ID NO: 11), R7 (SEQ ID NO: 12) or R8 (SEQ ID NO: 13) ( Figure 2), preferably R8.
  • Cinn-HSL secreted by photoheterotrophic Bradyrhizobium can stimulate the quorum sensing of Rhodopseudomonas palustris.
  • the biosynthesis of Cinn-HSL is catalyzed by LuxI homologous protein BraI (NCBI accession number: CAL74857.1), and the signal molecule is detected by LuxR homologous protein BraR.
  • the coding sequence of BraR is its genome (NCBI accession number: CU234118.1) antisense strand 1004717-1005442bp), and the coding sequence of BraI is its genome antisense strand 1004631-1003951bp.
  • the concentration of Cinn-HSL required to achieve a half-saturated quorum sensing response can be as low as 10 pM, while the concentration of linear fatty acid-type AHL required to achieve a half-saturated quorum sensing response is 10 4 -10 7 pM, indicating that there is a good orthogonality between the Cinn quorum sensing system and the linear fatty acid AHL quorum sensing system at the working concentration.
  • a multiple induction system can be constructed by reducing the concentration of Cinn-HSL and using this system in combination with a linear fatty acid-type AHL induction system (for example, using 3OC6 or 3OC8 as an inducer).
  • Cinn-HSL In Rhodopseudomonas palustris, a higher concentration of Cinn-HSL can also activate the RpaI promoter containing the rpaO operator sequence.
  • concentration of Cinn-HSL By adjusting the concentration of Cinn-HSL, the host cells containing both the pC inducible system and the Cinn inducible system can Spatiotemporal specific regulation.
  • regulatory element group and induction system are used in the construction of the intercellular signal communication system, although Bradyrhizobium biosynthesis of Cinn-HSL does not require the addition of cinnamate, Cinn The yield of -HSL was only 20nM.
  • the production of the intermediate product cinnamate can be increased by introducing the cinnamate biosynthesis pathway, thereby increasing the production of Cinn-HSL .
  • phenylalanine ammonia lyase catalyzes the deamination of phenylalanine to cinnamic acid.
  • the phenylalanine ammonia lyase can be derived from, for example, Streptomyces maritimus or Rhodosporidium toruloides.
  • phenylalanine ammonia-lyase PAL, 4-coumaric acid coenzyme A ligase 4CL and BraI are used to catalyze the synthesis of cinnamoyl-homoserine lactone from phenylalanine [9] .
  • the induction system is an IV induction system.
  • the inducer is IV-HSL
  • the regulatory domain of the chimeric allosteric transcription factor is BjaR
  • the operator sequence contained in the inducible promoter is bjaO (Table 1).
  • IV response elements exist in a variety of Bradyrhizobium japonicum, Bradyrhizobium diazoefficiens, Bradyrhizobium niftali, Bradyrhizobium liaoningense, etc.
  • the first discovered natural host its quorum sensing system includes BjaI of the LuxI family and BjaR of the LuxR family. The transcription of BjaI is jointly activated by IV-HSL and BjaR.
  • the BjaI-lacZ reporter can respond to IV-HSL as low as 10pM, while the concentration of linear fatty acid-type AHL required to produce quorum sensing signals is nM, indicating that the working concentration is low.
  • IV induction system There is good orthogonality between the IV induction system and the linear fatty acid type AHL induction system [5] .
  • the coding sequence of BjaR is the sense strand 1105945-1106673 bp of its genome (the whole genome NCBI accession number of 110spc4 strain: NZ_CP032617.1).
  • the branched chain ⁇ -ketoacid dehydrogenase complex BCDH and isovaleryl-homoserine Lactone synthase BjaI catalyzes the synthesis of IV-HSL from isoleucine [8] .
  • transcriptional regulators of prokaryotic origin can be fused to eukaryotic transcriptional activation domains to regulate gene expression in eukaryotic cells.
  • US8383364B2 discloses that the VP16 activation domain of the herpes simplex virus is fused with the prokaryotic repressor protein TetR to prepare the transcriptional activator tTA.
  • TetR prokaryotic repressor protein
  • tTA regulates the transcriptional activity of the promoter P tet containing tetO.
  • eukaryotic transcriptional activation domains such as VP16 (ie VP64), p65 [10] or SAM [11] with four repeats in tandem can be fused to dCas9 to guide target gene activation by sgRNA.
  • VP16 ie VP64
  • p65 p65 [10]
  • SAM sgRNA
  • Ma et al. fused the known eukaryotic transcription activation domains p65, RTA, and VP64 to construct a series of VTR domains, and their fusion proteins with dCas9 can be used to regulate eukaryotic gene transcription [12] .
  • Transcription activation domains that can be used as chimeric allosteric transcription factors of the present invention include but are not limited to the above-mentioned p65, VP16, VP64, VTR1, VTR2 and VTR3, preferably VP64 or VTR3.
  • nucleic acid molecule such as DNA
  • nucleic acid molecule comprises a nucleotide sequence containing transcriptional and translational regulatory information, and such a sequence is "operably linked" to a nucleotide sequence encoding a polypeptide
  • an operable link is a link between the regulatory DNA sequence and the coding sequence of each structural domain in a manner that "allows the gene to be expressed as a peptide or protein in a recoverable amount”.
  • the regulatory domain is operably linked to the transcriptional activation domain, as long as the functions of the regulatory domain and the transcriptional activation domain are not destroyed, the regulatory domain can be directly linked to the N-terminal or C-terminal of the transcriptional activation domain, Or a linker peptide is used to link the regulatory domain to the N- or C-terminus of the transcriptional activation domain.
  • the connecting peptide is flexible, does not contain any active domains, and preferably consists of glycine and/or serine. In fact the length of the connecting peptide has very limited effect on the chimeric protein and can be of any length. In one embodiment, the connecting peptide is 20-60 bp. In one embodiment, the sequence of the connecting peptide is GGAGGAGGCGGCTCCGGAGGCGGAGGAAGC (SEQ ID NO: 16).
  • the chimeric allosteric transcription factor of the present invention may further comprise a nuclear localization signal peptide (NLS), which localizes the allosteric transcription factor to the nucleus of eukaryotic cells.
  • NLS nuclear localization signal peptide
  • the chimeric allosteric transcription factor of the present invention may comprise a nuclear localization signal peptide, the nuclear localization signal peptide comprises 1-4 nuclear localization sequences, and each nuclear localization sequence may have a flanking sequence of 5-40bp at both ends, said The flanking sequences are flexible. Each flanking sequence may be the same or different in length. In fact, the flanking sequence has very limited influence on the function of the chimeric protein and can be any sequence.
  • the nuclear localization signal peptide can be located at the N-terminal or C-terminal of the chimeric allosteric transcription factor, or between the regulatory domain and the transcriptional activation domain.
  • nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain, and the nuclear localization signal peptide has two nuclear localization sequences, with 5 -20bp flanking sequence.
  • Nuclear localization sequences that can be used in the present invention include: the NLS of the SV40 virus large T antigen with the amino acid sequence of PKKKRKV; the NLS derived from the nucleoplasmic protein (such as the bipartite nucleoplasmic protein NLS with the KRPAATKKAGQAKKKK sequence); with PAAKRVKLD or RQRRNELKRSP NLS of c-myc with amino acid sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY NLS of hRNPA1M9; sequence of importin- ⁇ derived IBB domain RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV; sequence of myoma T protein VSRKRPRP and PPKKARED
  • the chimeric allosteric transcription factor of the present invention may further comprise a multimerization domain, enabling the multimerization of the chimeric allosteric transcription factor.
  • This ability to multimerize does not depend on the structure of the regulatory domain or the transcriptional activation domain, and does not affect the interaction of the regulatory domain with the corresponding operator site. As shown in the examples of the present invention, even if there is only a pair of operator sites on the promoter region, the inducible system switch performance of the allosteric transcription factor containing the multimerization domain is better.
  • the multimerization domain is CarH (SEQ ID NO: 18).
  • CarH is a coenzyme B12-dependent bacterial photoreceptor protein that controls carotenoid synthesis in response to light.
  • Monomer CarH can form a tetramer after combining with coenzyme B12 in the dark, and the tetramer disintegrates under light. It has been reported in the field that the C-terminal adenosylcobalamin binding domain of CarH was used to prepare gels by utilizing the tetramerization property of CarH [13] .
  • the addition of a multimerization domain between the regulatory domain and the transcriptional activation domain of a chimeric allosteric transcription factor increases the steepness of the induction curve and reduces background expression.
  • the multimerization domain is located between the regulatory domain and the transcriptional activation domain.
  • the regulatory domain, the multimerization domain and the transcriptional activation domain are linked using a linker peptide.
  • the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, and the nuclear localization signal peptide may be located at the N-terminal or C-terminal of the chimeric allosteric transcription factor; or in the regulatory domain Between the multimerization domain, or between the transcription activation domain and the multimerization domain.
  • the features of the linker peptide and the nuclear localization signal peptide are as described above.
  • the first promoter of the induction system of the present invention can be selected according to the desired expression level of transcription factors.
  • the first promoter may be a promoter commonly used in mammals, a constitutive promoter, an inducible promoter, or a tissue-specific promoter.
  • the first promoter may be the same as or different from the inducible promoter driving the expression of the gene of interest.
  • the first promoter is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a promoter.
  • negative feedback is added to the set of regulatory elements and the induction system of the present invention, so that the linear transformation of the reporter level can be realized.
  • the dose-response curve exhibits an S-shape, that is, reporter gene expression saturates rapidly as the inducer concentration increases.
  • the dose-response curves were linear in a wide range of inducer concentrations, and the dynamic range increased.
  • the structure of the linearized gene circuit resembles a bacterial operon regulated by a repressor protein.
  • the set of regulatory elements of the present invention further comprises a repressor protein (such as TetR (US8383364B2) or CymR (WO2019175600A1)) and a third promoter regulating the repressor protein, said third promoter comprising the same A second operator site for repressor interaction (eg tetO or cymO).
  • the inducible promoter further includes a second operating site interacting with the repressor protein. There can be one or more second manipulation sites, and they can be located upstream or downstream of the transcription initiation site, or multiple second manipulation sites are distributed upstream or downstream of the transcription initiation site.
  • the inducible expression system of the present invention further comprises a third gene expression cassette comprising a third promoter sequence and a coding sequence of a repressor protein for expressing said gene expression cassette in a host cell.
  • the third promoter sequence includes a core sequence and a second operator site interacting with the repressor protein; the inducible promoter sequence includes an operator site interacting with a chimeric allosteric transcription factor
  • it further contains a second operator site that interacts with the repressor protein.
  • the repressor protein is TetR, and the second operator site is tetO.
  • TetR and its operating sequence tetO see, for example, US8383364B2.
  • tetO is tccctatcagtgatagaga (nucleotides 173-191 of SEQ ID NO: 37).
  • the gene sequence of TetR is as described in literature [17] .
  • the repressor protein is CymR and the second operator site is cymO.
  • CymR and its operating sequence cymO see CN107344962A, for example.
  • the induction system of the present invention can be used as a signal receiving module of the intercellular communication system.
  • the biosynthetic gene expression cassette of the signal molecule is placed in a cell as a signal sending element as a whole, so that the signal sending cell expresses all the enzymes required for the synthesis of the signal molecule in vivo.
  • the induction system of the present invention is placed in another cell as a signal receiving element, so that the signal molecule is used as a "wire" to couple the signal sending module and the signal receiving module to construct a multicellular gene circuit.
  • the prokaryotic quorum sensing system can also be imitated, and the signal sending module and the induction system of the present invention can be introduced into the same mammalian cells to regulate the functions of the mammalian cells at the population level.
  • the signaling modules complete biosynthetic pathways of signaling molecules corresponding to the pC, Cinn and IV induction systems are described above and summarized in Table 2.
  • the chimeric allosteric transcription factor, regulatory element group and induction system of the present invention can be transiently or stably transfected into mammalian cells, and the host gene Expression is regulated. Methods of transient transfection and stable transfection are well known in the art. Based on this, the chimeric allosteric transcription factor, regulatory element group and induction system of the present invention are also useful in the construction of artificial tissues, artificial organs, animal models of human diseases, gene function analysis, biopharmaceuticals, vaccine preparation, gene therapy, etc. advantageous.
  • proteins that are substantially homologous to the above-mentioned enzymes that catalyze the biosynthesis of signal molecules and allosteric transcription factors can also be used.
  • substantially homologous means that the protein sequences are at least 95%, at least 98% or at least 99% identical at the sequence level.
  • a homologous sequence may be a functionally identical sequence in a different species.
  • Homologues of a gene or protein of the present invention can be readily determined by those skilled in the art.
  • codons can be humanized and optimized based on the protein sequence to improve the expression strength of the protein in a heterologous host, which is well known in the art.
  • expression cassette refers to a DNA segment consisting of one or more genes and sequences that control the expression of the genes so that the protein encoded by the genes can be expressed in a desired host cell (i.e., a eukaryotic cell). ) expression. It is known in the art that the expression control sequences included in the expression cassette optionally include promoter sequences, 3' non-coding regions, transcription termination sites, polyadenylation and the like. In the present invention, an expression cassette comprising a promoter and a coding sequence can be integrated into the genome or present in a plasmid vector.
  • a chimeric allosteric transcription factor for regulating a gene of interest in a mammalian cell said chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation domain; wherein said regulation
  • the structural domain is selected from the group consisting of RpaR, BjaR and BraR;
  • the transcription activation domain is selected from P65, VP16, VP64, VTR1, VTR2 and VTR3;
  • the chimeric allosteric transcription factor is derived from natural amino acids
  • the acyl homoserine lactone derived from a natural amino acid is selected from the group consisting of the following compounds: p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl - homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL).
  • a connecting peptide is used to connect the regulatory domain to the transcriptional activation domain; preferably, the connecting peptide The length is 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
  • the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at the The N-terminal or C-terminal of the chimeric allosteric transcription factor or between the regulatory domain and the transcriptional activation domain, the nuclear localization signal peptide comprises 1-4 nuclear localization sequences; preferably, each The 5' end and the 3' end of the nuclear localization sequence independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; preferably, the The nuclear localization signal peptide comprises two nuclear localization sequences, and the 5' end and the 3' end of each nuclear localization sequence independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17 .
  • chimeric allosteric transcription factor according to any one of paragraphs 1-5, wherein the chimeric allosteric transcription factor further comprises a multimerization domain, preferably, the multimerization structure The domain is CarH.
  • the N-terminal or C-terminal of the transcription factor is either located between the regulatory domain and the multimerization domain, or between the transcription activation domain and the multimerization domain, and the nuclear localization
  • the signal peptide contains 1-4 nuclear localization sequences; preferably, the 5' and 3' ends of each nuclear localization sequence independently have flanking sequences of 5-40bp; preferably, the nuclear localization signal peptide contains 2 nuclear localization sequences Sequences, the 5' and 3' ends of each nuclear localization sequence independently have flanking sequences of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization signal peptide Located between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimer
  • a chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation domain; wherein said regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; said The transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3; and
  • An inducible promoter active in mammalian cells comprising a core sequence and at least one operator site pair interacting with said chimeric allosteric transcription factor.
  • the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located in the chimeric allosteric transcription factor
  • the nuclear localization signal peptide comprises 1-4 nuclear localization sequences; preferably, each nuclear localization sequence The 5' end and the 3' end independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; preferably, the nuclear localization signal
  • the peptide comprises two nuclear localization sequences, and the 5' and 3' ends of each nuclear localization sequence independently have flanking sequences of 5-20 bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17.
  • each of the connecting peptides has a length of 20-60bp;
  • the connecting peptide has the sequence of SEQ ID NO: 16.
  • the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at the N of the chimeric allosteric transcription factor terminal or C-terminal, or between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain, and the nuclear localization signal peptide contains 1 - 4 nuclear localization sequences; preferably, the 5' end and 3' end of each nuclear localization sequence independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide comprises 2 nuclear localization sequences, each nuclear localization sequence The 5' end and the 3' end of the localization sequence independently have flanking sequences of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization signal peptide is located in the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain;
  • the core sequence of the inducible promoter is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a Promoter; preferably, the core sequence is a CMV1 promoter or a TRE3G promoter.
  • a system for inducing expression of genes of interest in mammalian cells the inducible expression system utilizing natural amino acid-derived acyl homoserine lactone as an inducer, the natural amino acid-derived acyl homoserine lactone selected from From the group consisting of the following compounds: p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL );
  • the inducible expression system comprises:
  • a first gene expression cassette comprising a first promoter sequence and a coding sequence of a chimeric allosteric transcription factor, wherein the chimeric allosteric transcription factor comprises a regulatory domain and a transcriptional activation structure domain;
  • the regulatory domain is selected from the group consisting of RpaR, BjaR and BraR;
  • the transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3;
  • a second gene expression cassette comprising an inducible promoter sequence and the coding sequence of the gene of interest, the inducible promoter sequence comprising an allosteric transcription factor interacting with the chimeric type At least one operator site pair for .
  • the chimeric allosteric transcription factor of the first gene further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide Located at the N-terminal or C-terminal of the chimeric allosteric transcription factor or between the regulatory domain and the transcriptional activation domain, the nuclear localization signal peptide comprises 1-4 nuclear localization sequences; preferably , the 5' and 3' ends of each nuclear localization sequence independently have flanking sequences of 5-40 bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; preferably , the nuclear localization signal peptide comprises two nuclear localization sequences, the 5' end and the 3' end of each nuclear localization sequence independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has SEQ ID NO: 17 the sequence of.
  • each of the connecting peptides has a length of 20-60bp;
  • the connecting peptide has the sequence of SEQ ID NO: 16.
  • the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at the N of the chimeric allosteric transcription factor terminal or C-terminal, or between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain, and the nuclear localization signal peptide contains 1 - 4 nuclear localization sequences; preferably, the 5' end and 3' end of each nuclear localization sequence independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide comprises 2 nuclear localization sequences, each nuclear localization sequence The 5' end and the 3' end of the localization sequence independently have flanking sequences of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization signal peptide is located in the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain
  • the core sequence of the inducible promoter in the second gene is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a promoter; preferably, the core sequence is CMV1 promoter or TRE3G promoter.
  • the core sequence of the third promoter in the third gene is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a promoter; preferably, the core sequence is CMV1 promoter or TRE3G promoter.
  • the chimeric allosteric transcription factor of any one of paragraphs 1-12, the set of regulatory elements of any one of paragraphs 13-28, the inducible expression system of any one of paragraphs 29-46 Use in the construction of a single-cell or multi-cell signal communication system, wherein the signal communication system includes a signal sending element and a signal receiving element, and the signal receiving element includes a chimeric allosteric transcription factor and a mammalian cell having active inducible promoter.
  • the signal molecule of the signal communication system is pC-HSL; the regulatory domain in the chimeric allosteric transcription factor is RpaR, and the inducible promoter comprises A core sequence and at least one pair of rpaO; the signaling elements include TAL, 4CL and RpaI.
  • the signal molecule of the signal communication system is Cinn-HSL
  • the regulatory domain in the chimeric allosteric transcription factor is BraR
  • the inducible promoter comprises A core sequence and at least one pair of braO
  • the signaling elements include PAL, 4CL and BraI.
  • the signal molecule of described signal communication system is IV-HSL;
  • Said regulatory domain in said chimeric allosteric transcription factor is BjaR, and said inducible promoter comprises A core sequence and at least one pair of bjaO;
  • the signaling element includes a branched-chain ⁇ -ketoacid dehydrogenase complex BCDH and BjaI.
  • the chimeric allosteric transcription factor of any one of paragraphs 1-12, the set of regulatory elements of any one of paragraphs 13-28, the inducible expression system of any one of paragraphs 29-46 Uses in gene function analysis, construction of human disease models, artificial tissues or organs.
  • the chimeric allosteric transcription factor of the present invention has a regulatory domain of prokaryotic origin and a transcriptional activation domain of eukaryotic origin, the response to the inducer is realized by the regulatory domain, and the transcriptional activation domain is triggered. Transcription of genes in mammalian cells.
  • the biosynthesis of Cinn-HSL is catalyzed by the LuxI homolog protein BraI, and the signal is detected by the LuxR homolog protein BraR.
  • the inducer small molecule can usually positively regulate the transcription of the autoinducer synthase LuxI, in other words, the promoter region of LuxI should contain the LuxR operating site, and the LuxI Transcription is co-activated by AHL and LuxR.
  • the promoter region of acyl homoserine lactone-dependent genes usually has an inverted repeat sequence of 18-20 bp, however, the operator site interacting with BraR could not be identified in the BraI promoter region according to the above characteristics [5] .
  • the CUL sequence was further tested, which is the CUS comprising the upstream 45 bp.
  • NC, ORS, APO, CUS and CUL were respectively inserted into the upstream of the TRE3G core promoter in pairs, and the Citrine yellow fluorescent protein coding sequence was connected downstream as a reporter gene, and Cinn-HSL was used to induce and discover the above-mentioned induction systems.
  • Cinn-HSL was used to induce and discover the above-mentioned induction systems.
  • Fig. 2B EC50 was 1.1-1.4nM, dynamic range was 18.5-37.7
  • further optimization is performed based on the CUS sequence.
  • the CUS was truncated from different positions. Among them, F1, F2, F4, F7 are truncated from the 5'end; R1, R4, R6, R7 and R8 are truncated from the 3' end. Insert the above operating sequence into the upstream of the TRE3G core promoter to construct an inducible promoter, and perform inducible expression. Under the induction conditions of 0 and 10 -6 mol/L Cinn-HSL, the fluorescence intensity changes of the reporter genes downstream of the above nine promoters are shown in Figure 2D.
  • the P 2XR8-TRE3G promoter containing two R8 operator sequences is used as an inducible promoter; the eukaryotic transcription activation domain VTR3 is fused with the regulatory domain BraR to construct VTR3- BraR chimeric allosteric transcription factor, the dose-response curve of each Cinn-HSL concentration was detected by flow cytometry.
  • the eukaryotic transcription activation domain VTR3 was fused with the regulatory domain RpaR to construct a VTR3-RpaR chimeric allosteric transcription factor; two rpaO operating sequences were inserted into the upstream of the TRE3G core promoter to construct P rpaO -TRE3G promoter; and the Citrine yellow fluorescent protein coding sequence is connected downstream as a reporter gene.
  • VTR3-RpaR could not bind to rpaO, and the expression of the reporter gene was turned off.
  • the combination of pC-HSL and VTR3-RpaR guides the chimeric allosteric transcription factor to the rpaO site of the P rpaO-TRE3G promoter, and the transcription activation domain VTR3 triggers the transcription of the reporter gene Citrine, which is detected by flow cytometry Dose-response curves of the system at various inducer concentrations.
  • the eukaryotic transcription activation domain VTR3 was fused with the regulatory domain BjaR to construct a VTR3-BjaR chimeric allosteric transcription factor; two bjaO operator sequences were inserted into the upstream of the TRE3G core promoter to construct P bjaO -TRE3G promoter; and the Citrine yellow fluorescent protein coding sequence is connected downstream as a reporter gene.
  • VTR3-BjaR could not bind to bjaO, and the expression of the reporter gene was turned off.
  • IV-HSL and VTR3-BjaR guides the chimeric allosteric transcription factor to the bjaO site of the P bjaO-TRE3G promoter, and the transcription activation domain VTR3 triggers the transcription of the reporter gene Citrine, which is detected by flow cytometry Dose-response curves of the system at various inducer concentrations.
  • the fluorescence intensity of the reporter was measured at a range of inducing concentrations (Fig. 1C).
  • the sensitivity of the induction system was quantified by fitting the induction curve with the Hill equation to calculate the EC50 value, that is, the half-maximal effective concentration.
  • the EC50 of pC, Cinn and IV induction systems were 8 ⁇ 10 -7 mol/L, 1 ⁇ 10 -9 mol/L, 2.5 ⁇ 10 -5 mol/L, respectively.
  • lower EC50 values correspond to higher sensitivity.
  • Such signal hypersensitivity and extremely low working concentrations of inducer are not found in other known AHL-based mammalian induction systems.
  • the reported straight-chain fatty acid-type AHL induction system uses VP16-TraR fusion protein to respond to 3OC8 signal, and the EC50 of this system is 7-14 ⁇ M [3] , while the EC50 of the Cinn-HSL induction system of the present invention is 1.25nM, 3 orders of magnitude lower than the prior art. Since it can be induced at a lower working concentration, the cytotoxicity of the inducer to cells is effectively reduced. In addition, the maximum induction multiple of the 3OC8 induction system is about 20 times, while the maximum induction multiple of the pC-HSL induction system of the present invention can reach 100 times, and the signal-to-noise ratio is higher.
  • the background expression of the IV induction system of the present invention is higher, so the sensitivity is not as good as that of the pC induction system and the Cinn induction system, but the crosstalk between the IV induction system and the pC induction system and the Cinn induction system is all small, and the IV induction system can be combined with the pC induction system. Or use in combination with the Cinn induction system.
  • the induction system of the present invention can be used in combination with the straight-chain fatty acid-type AHL induction system to construct multiple signals in mammalian hosts. Complex regulatory networks or gene circuits.
  • transcriptional regulatory factors VP16, VP64, p65, SAM, VTR, etc. can be fused as transcriptional activation domains and regulatory domains to regulate gene expression in eukaryotic cells.
  • allosteric transcription factors of two Cinn-inducing systems VP64-BraR and VTR3-BraR.
  • VP64 is four repeated tandem VP16
  • said VP16 is the eukaryotic transcriptional activation domain of herpes simplex virus (WO2007058527A2).
  • VTR3 is an optimized VTR series domain.
  • VP64, p65 and RTA were first fused into the eukaryotic transcriptional activation domain VTR; as an improvement, the first step was to delete the DNA binding domain (VTR1) at the N-terminal of p65 to reduce the non-targeting effect on the genome; and then for the C-terminal of p65
  • VTR1 and TA2 Two transactivation domains, TA1 and TA2, retain only the complete TA2 and part of TA1 (VTR2) in order to reduce the size of the domain; on the basis of VTR2, part of the RTA domain is further deleted to obtain only 0.8kb of VTR3 activation area.
  • the activation efficiency of dCas9-VTR3 is 10 times that of dCas9-VP64 [12] .
  • the EC50 of the Cinn-HSL signal on the activation of the P 2XR8-TRE3G promoter is 3.8 ⁇ 10 -9 mol/L and 1 ⁇ 10 -8 mol/L, the dynamic ranges are 32.2 and 9.5 respectively.
  • VTR3 as a transcriptional activation domain can achieve a better transcriptional switching effect.
  • the repression performance of regulatory elements can be enhanced by fusing the multimerization domain with a repressor protein and setting one or more pairs of operating sites at the promoter (CN107344962A).
  • a connecting peptide to connect the multimerization domain CarH to between the transcriptional activation domain and the regulatory domain of the transcription factor VTR3-RpaR, constructing VTR3-CarH-RpaR chimeric transcription factor.
  • the EC50 of the pC-inducible system was 8.7 ⁇ 10 -7 mol/L, and the dynamic range was 52.6; while the transcription factor contained the CarH domain, the pC-inducible system
  • the EC50 is 1 ⁇ 10 -7 mol/L, and the dynamic range is 30.8, indicating that the synergy brought about by multimerization improves the sensitivity of the induction system and reduces the working concentration of the inducer.
  • the competitive inhibition of RNA polymerase is more severe.
  • RNA polymerase binding is reduced, so by increasing the number of manipulation site pairs, the expression leakage can be effectively reduced (CN107344962A).
  • inserting 1 to 7 pairs of rpaO operator sequences upstream and/or downstream of the TRE3G promoter had little difference in the induction performance, indicating that increasing the number of operator sites in the positive control induction system did not significantly improve the induction performance of the regulatory element set .
  • the inventors also tried to use the multimerization domain CI434 to construct a multimerizable transcription factor, but this design could not improve the sensitivity of the pC induction system.
  • the TetR expression cassette containing negative feedback (P tet-CMV D2i promoter containing tetO) in the promoter region is introduced into the host cell, and the transcription start site (Inr) of the inducible promoter P rpaO-CMV1 A tetO operator site was inserted in the upstream and downstream of , respectively, to construct a double-input promoter pC-dox, which can respond to the dual signals of pC-HSL and doxycycline.
  • the combination of the repressor protein TetR and tetO blocked the rpaO operating site of pC-dox and repressed the transcription of the reporter gene; in the presence of dox, TetR detached from tetO, exposing the operating site rpaO, thereby RpaR bound to pC-HSL can bind to rpaO, thereby recruiting RNA polymerase and reporting gene expression; since the P tet-CMV D2i promoter also contains tetO, the negative regulation of TetR expression by dox makes it appear at every dox concentration TetR adaptation (adaptation), so the expression of the reporter gene is linearized with the increase of dox and pC-HSL.
  • the induction system maintains a linear response in a wide range of inducer concentration (0-100ng/mL dox, 10nM-10 ⁇ M pC-HSL), and the dynamic range is as high as 500, which can realize the control of gene expression. Fine regulation.
  • the EC50 of Cinn signaling system was 0.1 ⁇ M and 1.3 nM under the induction of pC-HSL and Cinn-HSL, respectively.
  • the EC50 of the pC signaling system was 0.7 ⁇ M and 6 ⁇ M induced by pC-HSL and Cinn-HSL, respectively. It can be seen that the lower concentration of Cinn-HSL can only successfully induce the Cinn system, so by adjusting the concentration of Cinn-HSL, the cells or tissues with both the pC-inducible system and the Cinn-inducible system can be regulated spatiotemporally.
  • Cinn-HSL can also be used alone to regulate the pC-inducible system.
  • the mammalian cell communication system constructed in the present invention includes a signal sending module for generating signal molecules and a signal receiving module for responding to signals ( FIG. 7A ).
  • the signaling module B4YP is composed of phenylalanine ammonia lyase PAL [9] , 4-coumaric acid-CoA ligase 4CL [9] and LuxI family protein BraI.
  • the signal receiving module 2XR8 is the Cinn induction system of the present invention.
  • the first gene includes the coding sequence of the promoter hEF1a and the chimeric allosteric transcription factor VTR3-BraR
  • the second gene includes the inducible promoter P 2XR8-TRE3G and the reporter gene Citrine.
  • the negative control contained only the signal receiving module 2XR8 (Fig. 7B).
  • the reporter gene Citrine of the communication system was well expressed in the cells transfected with the B4YP-2XR8 module. Control cells containing only the receptor module 2XR8 had no expression of Citrine due to the inability to synthesize signaling molecules.
  • the reporter gene in the above induction system is Citrine [15] .
  • VTR3 (SEQ ID NO: 41) was first connected to the 5' end of the nuclear localization signal peptide (SEQ ID NO: 38) and RpaR (NCBI accession number is WP_011155889.1 antisense chain 348672-349403bp ) is connected to the 3' end of the nuclear localization signal peptide to construct the coding sequence of the allosteric transcription factor VTR3-RpaR.
  • the nuclear localization signal peptide comprises two nuclear localization sequences (SEQ ID NO: 17), with 20 bp random sequences between the two nuclear localization sequences, 5' end and 3' end.
  • the two rpaO operating sequences were inserted into the upstream of the TRE3G core promoter to obtain the P rpaO-TRE3G promoter; and the Citrine yellow fluorescent protein coding sequence was connected downstream as a reporter gene.
  • Plasmids containing VTR3-RpaR and an inducible promoter-reporter gene also constitutively express mCherry red fluorescent protein for internal monitoring of plasmid transfection. When pC-HSL exists, VTR3-RpaR can bind to rpaO to activate the expression of Citrine reporter gene.
  • each element was assembled onto the pcDNA3.1(+) plasmid (Invitrogen, catalog #V790-20) using Gibson assembly in the following order: P rpaO-TRE3G promoter; Citrine reporter gene [15] ; hEF1a promoter (114743-113562bp segment from human clone RP11-505P4 chromosome 6 DNA (NCBI accession number AL603910.6)); VTR3-RpaR; EF1 promoter [8] ; mCherry reporter gene [16] .
  • each protein coding sequence is followed by a poly(A) sequence.
  • VTR3-CarH-RpaR with multimerization domain its 5' end to 3' end are respectively VTR3 (SEQ ID NO: 41), nuclear localization signal peptide (SEQ ID NO: 38) , connecting peptide (SEQ ID NO: 16), CarH (SEQ ID NO: 18), connecting peptide (SEQ ID NO: 16) and RpaR (NCBI accession number is WP_011155889.1 antisense strand No. 348672-349403bp).
  • VTR3-BjaR the 5' end to the 3' end of the chimeric allosteric transcription factor VTR3-BjaR are VTR3 (SEQ ID NO: 41), nuclear localization signal peptide (SEQ ID NO: 38) and BjaR ( SEQ ID NO: 43).
  • BjaR is obtained through human codon optimization according to the BjaR protein sequence (WP_011083882.1).
  • each element was assembled on the pcDNA3.1(+) plasmid (Invitrogen, catalog #V790-20) using Gibson assembly in the following order: P bjaO-TRE3G promoter; Citrine reporter gene; hEF1a promoter sub; VTR3-BjaR; EF1 promoter; mCherry reporter gene, get IV induction system plasmid.
  • VTR3-BraR the 5' end to the 3' end of the chimeric allosteric transcription factor VTR3-BraR are VTR3 (SEQ ID NO: 41), nuclear localization signal peptide (SEQ ID NO: 38) and BraR (SEQ ID NO: 42).
  • BraR is obtained through human codon optimization according to the BraR protein sequence (CAL74858.1).
  • the components were assembled into the pcDNA3.1(+) plasmid (Invitrogen, catalog #V790-20) using Gibson assembly in the following order: Cinn-HSL inducible promoter; Citrine reporter gene; hEF1a promoter; VTR3-BraR or VP64 - BraR (NCBI accession number of VP64: ASW25882.1); EF1 promoter; mCherry reporter gene, resulting in Cinn induction system plasmid.
  • the Cinn-HSL inducible promoter has its operating sequence from the promoters P CUS-TRE3G , P APO-TRE3G , P NC-TRE3G , P ORS-TRE3G , P CUL-TRE3G of different strains; the upstream truncated promoter Promoters P F1-TRE3G , P F2-TRE3G , P F4-TRE3G , P F7-TRE3G ; downstream truncated promoters P R1-TRE3G , P R4-TRE3G , P R6-TRE3G , P R7-TRE3G , P R8- TRE3G and the optimized promoter P 2XR8-TRE3G .
  • the PRpaO-TRE3G promoter of the pC induction system was replaced with the PRpaO-CMV1 promoter (SEQ ID NO: 39) by inverse PCR first, and then the P RpaO-CMV1 A tetO was added to the upstream and downstream of the transcription initiation site (Inr) of the promoter, and a pC-dox inducible promoter capable of simultaneously responding to pC-HSL and dox signals was obtained.
  • the GFP coding region on the plasmid pDN-D2irTNG4kwh was removed by inverse PCR, and then the entire TetR expression element of the plasmid (from the beginning of CMV enhancer to the end of bGH poly A, which contained the promoter P tet- CMV D2i ).
  • the PCR-amplified TetR expression element with NotI sites at both ends was inserted into the pC inducible system containing the pC-dox promoter through NotI single enzyme digestion to construct the final receiver of the pC-dox inducible system.
  • the signal receiving module 2XR8 of the signal communication system is a Cinn inducible system (the inducible promoter is P 2XR8-TRE3G ).
  • the signal sending module B4YP of the signal communication system includes three genes of BraI, 4CL and PAL. Use Gibson assembly to replace the entire gene expression frame sequence from the CMV promoter on the PB vector (PB513B-1) to SV40 with the EF1a core promoter (SEQ ID NO: 40), BraI (SEQ ID NO: 44) and hGH-PA (polyA region) to obtain the EF1a-Bral expression plasmid.
  • ES plasmids were constructed first: RFP expression cassettes were inserted between the 5' insulator of the PB plasmid and the CMV promoter as bacterial selection markers, and the EF1 promoter, mCherry and SV40PA are used as mammalian screening markers, and the restriction sites at both ends of the expression element are I-CeuI and I-SceI; at the same time, the entire sequence from the CMV promoter to SV40PA on the PB vector is deleted.
  • EF1a-BraI, EF1a-4CL and EF1a-PAL, EF1a-BraI-hGH-PA, EF1a-4CL-hGH-PA and EF1a-PAL-hGH-PA and three fragments were obtained by PCR amplification. The three fragments were assembled and integrated on the ES vector through the golden gate according to the sequence of BraI-4CL-PAL, and finally PB-EF1a-BraI-EFa-4cl-EF1a-PAL (B4YP) was obtained.
  • the Citrine/VTR3-BraR/mCherry expression cassette of the 2XR8 module was amplified and I-CeuI and I-SceI restriction sites were added at both ends.
  • the amplified product and plasmid pEF1a-B4YP were digested with I-CeuI and I-SceI and ligated with T4DNA ligase to obtain the single-plasmid cell communication system p2XR8-B4YP.
  • the small molecule inducer pC-HSL was purchased from Sigma (Product No. 7077), and Cinn-HSL and IV-HSL were synthesized by Shenzhen Qianyan Drug Research and Development Technology Co., Ltd.
  • the human embryonic kidney cell line 293T (HEK-293T, ATCC: CRL-11268) was inoculated in the culture medium supplemented with 10% fetal bovine serum (Gibco) and 1% penicillin and streptomycin under the culture condition of 37°C and 5% CO 2 .
  • the flow cytometry data were analyzed using CytExpert software that comes with the Beckman CytoFLEX S flow cytometer, and the gate was set based on the fluorescence intensity of untransfected 293T cells. The average FITC of the ECD+ population was used as the output for each sample. For pC-dox inducible systems, background autofluorescence was subtracted. All flow cytometric data presented herein are the mean of at least three replicates, S.D. is the standard deviation of individual measurements.
  • B4YP-2XR8 and 2XR8 were respectively transfected into 293T cells according to the Lipofectamine transfection method described herein, and the expression of the receiver reporter gene Citrine was detected with a fluorescence microscope EVOS7000 (Thermo) 48 hours after transfection.
  • y is the fluorescence intensity (output) of the reporter Citrine
  • y min is the minimum value of the output
  • y max is the maximum value of the output.
  • the K obtained by fitting is the input signal concentration when the output value is at half maximum, that is, the theoretical value of EC50.
  • the dynamic range of this induction curve is y max /y min .

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Physiology (AREA)
  • Evolutionary Biology (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Medicinal Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Cell Biology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The present invention relates to a chimeric allosteric transcription factor for regulating genes of interest in mammalian cells and a regulatory element group comprising the chimeric allosteric transcription factor, and an inducible expression system. The chimeric allosteric transcription factor comprises a regulatory domain and a transcriptional activation domain, wherein the regulatory domain is selected from the group consisting of RpaR, BjaR, and BraR; the transcriptional activation domain is selected from P65, VP16, VP64, VTR1, VTR2, and VTR3; the chimeric allosteric transcription factor uses a natural amino acid-derived acyl homoserine lactone as an inducer; and the natural amino acid-derived acyl homoserine lactone is selected from the group consisting of the following compounds: p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL), and cinnamoyl-homoserine lactone (Cinn-HSL). The system can be used in a modular manner in a variety of mammalian cells, tissues or organisms to regulate the expression of genes of interest.

Description

用于哺乳动物体系的嵌合型别构转录因子、调控元件组和诱导表达系统Chimeric allosteric transcription factors, sets of regulatory elements and inducible expression systems for use in mammalian systems
相关申请的引用References to related applications
本申请要求于2021年09月29日向中华人民共和国国家知识产权局提交的第202111150967.4号中国专利申请的权益,在此将其全部内容以援引的方式整体并入本文中。This application claims the rights and interests of the Chinese patent application No. 202111150967.4 submitted to the State Intellectual Property Office of the People's Republic of China on September 29, 2021, the entire contents of which are hereby incorporated herein by reference.
技术领域technical field
本发明属于合成生物学和生物工程领域,涉及能够对哺乳动物体系中基因表达进行调控的嵌合型别构转录因子、调控元件组和诱导表达系统。具体而言,本发明涉及使用天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂来调控宿主细胞中感兴趣基因的转录。The invention belongs to the field of synthetic biology and bioengineering, and relates to a chimeric allosteric transcription factor capable of regulating gene expression in a mammalian system, a regulatory element group and an induced expression system. In particular, the present invention relates to the use of natural amino acid-derived acyl homoserine lactones as inducers to regulate the transcription of a gene of interest in a host cell.
背景技术Background technique
对哺乳动物宿主中基因表达的精确调控对于基因功能分析、生物制药、疫苗制备、动物模型设计以及基因治疗等均极为重要。本领域已报道了多种用于哺乳动物的基因表达调控系统,例如用免疫抑制剂或类固醇激素作为诱导剂,来调控目标基因的转录。然而,这些系统普遍存在诱导剂浓度难以控制或者对宿主基因网络造成干扰等问题。为构建与宿主基因调控网络正交的诱导表达系统,已尝试对原核细胞来源的调控元件进行改造并将其转用于真核体系,来诱导表达真核宿主中的基因。例如,可将天蓝色链霉菌(Streptomyces coelicolor)群体感应受体ScbR或始旋链霉菌(Streptomyces pristinaespiralis)转录因子SpbR与单纯疱疹病毒(herpes simplex virus)的真核转录激活结构域VP16融合作为转录激活因子,并将相应的操纵序列组装至CMVmini启动子5’端,构建以丁内酯类化合物为诱导剂的哺乳动物诱导表达系统 [1]。然而上述表达系统均为正控阻遏系统,这些系统中,在小分子存在下别构转录因子与DNA脱离。WO2007058527A2报道了将原核细胞来源的Tet调控系统转用于哺乳动物体系。在大肠杆菌中,阻遏蛋白TetR与其操纵序列tetO特异性结合抑制下游基因转录,而四环素或强力霉素(dox)与TetR的结合触发构象改变,释放启动子区,开启基因转录。对于真核宿主,可将VP16与TetR融合构建别构转录因子tTA,对包含tetO序列的真核启动子进行调控。然而,四环素或dox与tTA的结合使得该转录因子与启动子区脱离,该系统仍为正控阻遏系统。作为改进,可将TetR部分氨基酸突变,使其与诱导剂结合后与DNA结合,这种负控系统能够被较低浓度的诱导剂(例如44ng/ml强力霉素)激活。 Precise regulation of gene expression in mammalian hosts is extremely important for gene function analysis, biopharmaceuticals, vaccine preparation, animal model design, and gene therapy. Various gene expression regulation systems for mammals have been reported in the art, for example, using immunosuppressants or steroid hormones as inducers to regulate the transcription of target genes. However, these systems generally have problems such as difficulty in controlling the concentration of the inducer or causing interference to the host gene network. In order to construct an inducible expression system orthogonal to the host gene regulatory network, attempts have been made to transform the regulatory elements derived from prokaryotic cells and transfer them to eukaryotic systems to induce the expression of genes in eukaryotic hosts. For example, the Streptomyces coelicolor quorum sensing receptor ScbR or the Streptomyces pristinaespiralis transcription factor SpbR can be fused to the eukaryotic transcriptional activation domain VP16 of herpes simplex virus as a transcriptional activator. factor, and the corresponding operator sequence was assembled to the 5' end of the CMVmini promoter to construct a mammalian inducible expression system using butyrolactone compounds as inducers [1] . However, the above expression systems are all positive control repressor systems, in which allosteric transcription factors are detached from DNA in the presence of small molecules. WO2007058527A2 reports the transfer of a prokaryotic-derived Tet regulatory system to a mammalian system. In Escherichia coli, the repressor protein TetR specifically binds to its operator sequence tetO to inhibit downstream gene transcription, while the binding of tetracycline or doxycycline (dox) to TetR triggers a conformational change, releases the promoter region, and turns on gene transcription. For eukaryotic hosts, VP16 can be fused with TetR to construct an allosteric transcription factor tTA, which can regulate eukaryotic promoters containing tetO sequences. However, the binding of tetracycline or dox to tTA disengages the transcription factor from the promoter region, and the system remains a positively controlled repressive system. As an improvement, some amino acids of TetR can be mutated so that it binds to DNA after being combined with an inducer. This negative control system can be activated by a lower concentration of inducer (such as 44ng/ml doxycycline).
群体感应系统是存在于细菌体内并响应菌群密度信号的系统,通过对信号分子进行合成、分泌、检测以及群体水平应答来实现细胞间通讯,从而协调群体内基因表达。该过程与生物发光、抗生素合成和菌膜形成等密切相关。不同种属的细菌分泌的信号分子在化学结构上具有差异,从而为构建与哺乳动物宿主基因调控网络正交的人工表达系统提供了更多选择。例如,变形菌的群体感应系统利用酰基高丝氨酸内酯(N-acyl homoserine lactone,AHL)作为信号分子,由LuxR家族别构转录因子作为受体对信号分子的浓度信息进行转导。本领域已报道将LuxR家族别构转录因子与真核反式激活域NF-κB p65或VP16融合,构建嵌合转录因子;并将操纵序列添加至CMVmini启动子,构建响应AHL的诱导型启动子 [2]-[3]。目前本领域已报道的此类哺乳动物诱导系统的EC50的量级均在10μM以上,且诱导剂仅限于3-氧代-辛酰基-L-高丝氨酸内酯(3OC8)或3-氧代-己酰基-L-高丝氨酸内酯(3OC6);这两种直链脂肪酸型AHL的酰基侧链相似度较高,因此基于此类AHL的多通道通讯系统在信号水平和启动子水平上均存在广泛的串扰。 The quorum sensing system is a system that exists in bacteria and responds to bacterial population density signals. It realizes intercellular communication by synthesizing, secreting, detecting, and responding at the population level to coordinate gene expression within the population. This process is closely related to bioluminescence, antibiotic synthesis and biofilm formation. The signaling molecules secreted by different species of bacteria have different chemical structures, which provide more options for constructing artificial expression systems that are orthogonal to the gene regulatory network of mammalian hosts. For example, the quorum sensing system of Proteus uses acyl homoserine lactone (N-acyl homoserine lactone, AHL) as a signal molecule, and the LuxR family allosteric transcription factor acts as a receptor to transduce the concentration information of the signal molecule. It has been reported in the field that the LuxR family allosteric transcription factor was fused with the eukaryotic transactivation domain NF-κB p65 or VP16 to construct a chimeric transcription factor; and the operator sequence was added to the CMVmini promoter to construct an inducible promoter responsive to AHL [2]-[3] . The EC50 levels of such mammalian induction systems reported in the art are above 10 μM, and the inducers are limited to 3-oxo-octanoyl-L-homoserine lactone (3OC8) or 3-oxo- Hexanoyl-L-homoserine lactone (3OC6); the acyl side chains of these two straight-chain fatty acid-type AHLs are highly similar, so the multichannel communication system based on this type of AHL exists at both the signal level and the promoter level Extensive crosstalk.
在本发明中,通过对原核生物中以天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂的群体感应系统进行数据挖掘,识别相应的别构转录因子和操纵序列,并与真核转录调控系统进行整合,构建了用于哺乳动物细胞的新型诱导表达系统。In the present invention, the corresponding allosteric transcription factors and operating sequences are identified by data mining of the quorum sensing system in prokaryotes using acyl homoserine lactone derived from natural amino acids as an inducer, and are compared with the eukaryotic transcriptional regulation system. Integrating, a novel inducible expression system for mammalian cells was constructed.
发明内容Contents of the invention
在第一方面,本发明提供了一种用于对哺乳动物细胞中感兴趣的基因进行调控的嵌合型别构转录因子,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;其中,所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;所述嵌合型别构转录因子以天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂;所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯(pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL)。In a first aspect, the present invention provides a chimeric allosteric transcription factor for regulating a gene of interest in a mammalian cell, the chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation structure domain; wherein, the regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; the transcriptional activation domain is selected from P65, VP16, VP64, VTR1, VTR2 and VTR3; the chimeric type Constitutive transcription factors use natural amino acid derived acyl homoserine lactones as inducers; said natural amino acid derived acyl homoserine lactones are selected from the group consisting of the following compounds: p-coumaroyl-homoserine lactone (pC -HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL).
在第二方面,本发明提供了一种以天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂的调控元件组,所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯(pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL);所述调控元件组包含:In a second aspect, the present invention provides a set of regulatory elements using natural amino acid-derived acyl homoserine lactones as inducers selected from the group consisting of : p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL); the set of regulatory elements comprises:
嵌合型别构转录因子,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;其中,所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;以及A chimeric allosteric transcription factor, said chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation domain; wherein said regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; said The transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3; and
在哺乳动物细胞中具有活性的诱导型启动子,所述诱导型启动子包含核心序列以及与所述嵌合型别构转录因子相互作用的至少一个操纵位点对。An inducible promoter active in mammalian cells, said inducible promoter comprising a core sequence and at least one operator site pair interacting with said chimeric allosteric transcription factor.
在第三方面,本发明提供了一种对哺乳动物中感兴趣的基因进行诱导表达的系统,所述诱导表达系统利用天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂,所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯(pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL);所述诱导表达系统包含:In a third aspect, the present invention provides a system for inducing expression of a gene of interest in mammals, the inducible expression system utilizes acyl homoserine lactone derived from a natural amino acid as an inducer, and the acyl homoserine lactone derived from a natural amino acid Acyl homoserine lactones are selected from the group consisting of p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone Lactone (Cinn-HSL); the inducible expression system comprises:
第一基因表达盒,所述第一基因表达盒包含第一启动子序列以及嵌合型别构转录因子的编码序列,其中,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;以及A first gene expression cassette, the first gene expression cassette comprising a first promoter sequence and a coding sequence of a chimeric allosteric transcription factor, wherein the chimeric allosteric transcription factor comprises a regulatory domain and a transcriptional activation structure domain; the regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; the transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3; and
第二基因表达盒,所述第二基因表达盒包含诱导型启动子序列以及所述感兴趣的基因的编码序列,所述诱导型启动子序列包含与所述嵌合型别构转录因子相互作用的至少一个操纵位点对。A second gene expression cassette, the second gene expression cassette comprising an inducible promoter sequence and the coding sequence of the gene of interest, the inducible promoter sequence comprising an allosteric transcription factor interacting with the chimeric type At least one operator site pair for .
在第四方面,本发明提供了第一方面所述的嵌合型别构转录因子、第二方面所述的调控元件组、第三方面所述的诱导表达系统在构建单细胞或多细胞信号通讯系统以及人类疾病模型中的用途。In the fourth aspect, the present invention provides the chimeric allosteric transcription factor described in the first aspect, the set of regulatory elements described in the second aspect, and the inducible expression system described in the third aspect in the construction of single-cell or multi-cell signaling Communication systems and use in models of human disease.
有益效果Beneficial effect
在细菌中,天然氨基酸衍生的酰基高丝氨酸内酯与LuxR家族转录激活因子RpaR、BjaR或BraR的相互作用使得此类转录激活因子构象改变,进而能够结合至DNA上相应的操纵序列并激活下游基因转录。在本发明中,通过对上述转录因子的操纵序列进行识别与表征,并将该操纵序列添加至真核启动子,构建能够在哺乳动物宿主中响应pC-HSL、IV-HSL和/或Cinn-HSL的诱导型启动子;另一方面,将真核反式激活因子作为转录激活结构域融合至LuxR家族的上述转录激活因子,构建能够在真核宿主中调控基因表达的嵌合型别构转录因子。由于pC-HSL、IV-HSL以及Cinn-HSL能够自由扩散进出细胞,本发明构建的嵌合型别构转录因子的调控结构域与上述诱导剂结合使得其构象改变,从而能够与DNA上相应的操纵序列结合,将所述嵌合型别构转录因子的转录激活结构域定位至包含相应操纵序列的启动子,对该启动子控制的基因进行诱导表达。而不存在诱导剂时,嵌合型别构转录因子从DNA上脱离,从而关闭基因转录。这是本领域首次报道在哺乳动物细胞中利用天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂对感兴趣的基因进行诱导。该诱导系统能够以模块化的方式用于多种类型的哺乳动物细胞、组织或者生物体,对感兴趣的基因的表达进行调控。In bacteria, the interaction of natural amino acid-derived acyl homoserine lactones with the LuxR family transcriptional activators RpaR, BjaR or BraR changes the conformation of these transcriptional activators, allowing them to bind to the corresponding operator sequence on DNA and activate downstream genes transcription. In the present invention, by identifying and characterizing the operator sequences of the above-mentioned transcription factors, and adding the operator sequences to eukaryotic promoters, constructs capable of responding to pC-HSL, IV-HSL and/or Cinn-HSL in mammalian hosts The inducible promoter of HSL; on the other hand, the eukaryotic transactivator is fused as the transcriptional activation domain to the above-mentioned transcriptional activator of the LuxR family to construct a chimeric allosteric transcription factor capable of regulating gene expression in a eukaryotic host factor. Since pC-HSL, IV-HSL and Cinn-HSL can freely diffuse in and out of cells, the regulatory domain of the chimeric allosteric transcription factor constructed in the present invention is combined with the above-mentioned inducer to change its conformation, so that it can be compatible with the corresponding The operator sequence is combined, the transcriptional activation domain of the chimeric allosteric transcription factor is positioned to a promoter containing the corresponding operator sequence, and the gene controlled by the promoter is induced to express. In the absence of an inducer, the chimeric allosteric transcription factor dissociates from the DNA, turning off gene transcription. This is the first report in the field of using natural amino acid-derived acyl homoserine lactone as an inducer to induce a gene of interest in mammalian cells. The induction system can be used in various types of mammalian cells, tissues or organisms in a modular manner to regulate the expression of genes of interest.
事实上,在哺乳动物宿主中,由于转录调控位点难以被100%占据,以阻遏蛋白作为转录调控因子时转录不能被完全关闭。因此,基于阻遏蛋白的诱导系统不但需要胞内表达较高浓度的转录因子,增大了宿主细胞的代谢负担,而且难以提高信噪比以及实现快速诱导。本发明的嵌合型别构转录因子为转录激活因子,不存在上述系统性缺陷。另一方面,哺乳动物细胞中不存在与原核AHL相关的信号通路,因此在将本发明的嵌合型别构转录因子或诱导表达系统导入哺乳动物宿主时,该嵌合型别构转录因子高度特异地与相应的操纵序列结合,不存在不期望的多效性(pleiotropic)。目前已报道的以直链脂肪酸型AHL作为诱导剂的诱导系统借助VP16-TraR融合蛋白对3OC8信号进行应答,该系统的半最大有效浓度EC50为7-14μM [3],而本发明的pC-HSL以及Cinn-HSL诱导系统的EC50分别为0.8μM和1.25nM。与3OC8诱导系统相比,本发明的诱导系统可实现超灵敏应答。由于可在较低的工作浓度下诱导,有效降低了诱导剂对细胞的细胞毒性。此外,3OC8诱导系统的最大诱导倍数约为20倍,而本发明的pC-HSL诱导系统的最大诱导倍数可达100倍,信噪比更高。此外,由于本发明的诱导系统与直链脂肪酸型AHL诱导系统不存在信号串扰 [5],可将本发明的诱导系统与直链脂肪酸型AHL诱导系统组合使用,在哺乳动物宿主中构建具有多重信号的复杂调控网络或基因线路。 In fact, in mammalian hosts, transcription cannot be completely shut down when repressor proteins are used as transcriptional regulators because the transcriptional regulatory sites are difficult to be 100% occupied. Therefore, the repressor-based induction system not only requires the expression of a higher concentration of transcription factors in the cell, which increases the metabolic burden of the host cell, but also makes it difficult to improve the signal-to-noise ratio and achieve rapid induction. The chimeric allosteric transcription factor of the present invention is a transcription activator without the above-mentioned systemic defects. On the other hand, there is no signaling pathway associated with prokaryotic AHL in mammalian cells, so when the chimeric allosteric transcription factor or inducible expression system of the present invention is introduced into a mammalian host, the chimeric allosteric transcription factor is highly Specifically combined with the corresponding operator sequence, there is no undesired pleiotropic effect. The currently reported induction system using straight-chain fatty acid type AHL as an inducer responds to 3OC8 signal by means of VP16-TraR fusion protein, and the half-maximum effective concentration EC50 of the system is 7-14 μM [3] , while the pC-TraR of the present invention The EC50 of HSL and Cinn-HSL induction system are 0.8μM and 1.25nM, respectively. Compared with the 3OC8 induction system, the induction system of the present invention can achieve a hypersensitive response. Since it can be induced at a lower working concentration, the cytotoxicity of the inducer to cells is effectively reduced. In addition, the maximum induction multiple of the 3OC8 induction system is about 20 times, while the maximum induction multiple of the pC-HSL induction system of the present invention can reach 100 times, and the signal-to-noise ratio is higher. In addition, since there is no signal crosstalk between the induction system of the present invention and the straight-chain fatty acid-type AHL induction system [5] , the induction system of the present invention can be used in combination with the straight-chain fatty acid-type AHL induction system to construct multiple A complex regulatory network or gene circuit of signals.
在优选的实施方式中,所述嵌合型别构转录因子进一步包含多聚化结构域CarH。不希望被理论所限地,在别构转录因子中添加多聚化结构域使得启动子附近空间位阻增大,增加诱导曲线的陡峭度(steepness),降低本底表达,并使得诱导阈值变高,实现更好的转录开关性能。在例如疫苗生产过程中,过高的本底表达会导致诱导剂依赖性丧失,进而产生过量的免疫原性,因此降低的本底表达极为重要。如实施例所展示的,融合CarH多聚化结构域能够将pC诱导系统的灵敏度提高5倍以上。In a preferred embodiment, the chimeric allosteric transcription factor further comprises a multimerization domain CarH. Without wishing to be bound by theory, the addition of a multimerization domain to an allosteric transcription factor increases steric hindrance near the promoter, increases the steepness of the induction curve, reduces background expression, and shifts the induction threshold. High for better transcription switch performance. For example, in the process of vaccine production, excessive background expression will lead to the loss of inducer dependence, thereby resulting in excessive immunogenicity, so reduced background expression is extremely important. As demonstrated in the examples, fusion of the CarH multimerization domain can increase the sensitivity of the pC-inducible system by more than 5 times.
本领域已知的是,线性化能够降低电路中放大器的信号失真并提高线性度,这对于尖峰频率适应以及将非线性瞬时触发信号转换为稳定线性应答非常重要。在基因线路中,可通过在调控网络中添加负反馈,降低表达的异质性并改变S型应答曲线的形状,实现报告子水平的线性变换。在本发明优选的实施方式中,通过添加基于TetR的负反馈系统来扩大诱导表达系统的动态范围,从而在较大范围内诱导物浓度与目的蛋白的表达量呈线性关系。此类调控元件线性化对于调控网络的理性设计而言至关重要。It is known in the art that linearization can reduce signal distortion and improve linearity of the amplifier in the circuit, which is very important for peak frequency adaptation and conversion of non-linear transient trigger signals into stable linear responses. In gene circuits, the linear transformation of the reporter level can be achieved by adding negative feedback in the regulatory network, reducing the heterogeneity of expression and changing the shape of the sigmoid response curve. In a preferred embodiment of the present invention, the dynamic range of the inducible expression system is expanded by adding a TetR-based negative feedback system, so that the concentration of the inducer has a linear relationship with the expression level of the target protein within a wide range. Linearization of such regulatory elements is critical for the rational design of regulatory networks.
附图说明Description of drawings
图1显示本发明诱导系统的设计原理和表征。(A)嵌合型别构转录因子的设计原理以及包含所述嵌合型别构转录因子的诱导系统的工作原理。(B)根据本发明实施例的pC诱导系统、Cinn诱导系统以及IV 诱导系统的诱导剂、嵌合型别构转录因子的调控结构域以及诱导曲线的EC50和动态范围。(C)利用流式细胞术测定的pC诱导系统、Cinn诱导系统以及IV诱导系统的诱导曲线,所使用的诱导剂分别为pC-HSL、Cinn-HSL以及IV-HSL。Figure 1 shows the design principle and characterization of the induction system of the present invention. (A) The design principle of chimeric allosteric transcription factors and the working principle of the induction system comprising said chimeric allosteric transcription factors. (B) The pC inducible system, the Cinn inducible system and the inducer of the IV inducible system, the regulatory domain of the chimeric allosteric transcription factor, and the EC50 and dynamic range of the induction curve according to an embodiment of the present invention. (C) Induction curves of the pC inducible system, Cinn inducible system and IV inducible system measured by flow cytometry, the inducers used were pC-HSL, Cinn-HSL and IV-HSL, respectively.
图2显示Cinn诱导系统操纵序列的识别和优化。(A)为借助BLAST对四种慢生根瘤菌BTAi1(NC)、ORS285(ORS)、Bradyrhizobium oligotrophicum S58(APO)以及ORS278(CUL以及CUS)中推定的BraI启动子区域的序列进行比对的结果。方框框出推定的操纵位点braO。(B)流式细胞术测定的包含启动子P CUS-TRE3G、P CUL-TRE3、P ORS-TRE3G、P APO-TRE3G以及P NC-TRE3G的Cinn诱导系统的诱导曲线。其中,启动子P CUS-TRE3G、P CUL-TRE3、P ORS-TRE3G、P APO-TRE3G以及P NC-TRE3G分别包含一对CUS、CUL、ORS、APO以及NC操纵位点。(C)对于慢生根瘤菌ORS278菌株的操纵序列CUS(wt),分别从上游或下游截短,得到操纵序列F1、F2、F4、F7(5’端截短),以及R1、R4、R6、R7、R8(3’端截短)。(D)借助流式细胞术测定在0和10 -6mol/L的Cinn-HSL下包含(C)中各操纵序列的诱导系统中报告子的荧光强度。(E)借助流式细胞术测定的包含启动子P 2XR8-TRE3G或P CUS-TRE3G的Cinn诱导系统的诱导曲线。(F)示出对应于(A)中CUL、NC、ORS、APO以及CUS的最小操纵序列。 Figure 2 shows the identification and optimization of the operator sequence for the Cinn inducible system. (A) is the result of comparing the sequences of the putative BraI promoter regions in four Bradyrhizobium oligotrophicum S58 (APO) and ORS278 (CUL and CUS) by BLAST . Box encloses the putative operator site braO. (B) Induction curves of Cinn inducible systems containing promoters P CUS-TRE3G , P CUL-TRE3 , P ORS-TRE3G , P APO-TRE3G and P NC-TRE3G determined by flow cytometry. Wherein, the promoters P CUS-TRE3G , P CUL-TRE3 , P ORS-TRE3G , P APO-TRE3G and P NC-TRE3G respectively contain a pair of CUS, CUL, ORS, APO and NC operation sites. (C) For the operator sequence CUS (wt) of Bradyrhizobium ORS278 strain, it was truncated from upstream or downstream, respectively, to obtain operator sequences F1, F2, F4, F7 (5' end truncated), and R1, R4, R6 , R7, R8 (3' end truncated). (D) The fluorescent intensity of the reporter in the induction system containing each operator sequence in (C) was measured by flow cytometry at 0 and 10 -6 mol/L Cinn-HSL. (E) Induction curves of Cinn inducible systems containing the promoter P 2XR8-TRE3G or P CUS-TRE3G determined by flow cytometry. (F) shows the minimal manipulation sequences corresponding to CUL, NC, ORS, APO and CUS in (A).
图3显示流式细胞术测定的包含转录激活结构域VTR3或VP64的Cinn诱导系统的诱导曲线。Figure 3 shows the induction curves of the Cinn inducible system comprising the transcriptional activation domain VTR3 or VP64 measured by flow cytometry.
图4显示多聚化结构域对于诱导系统性能的影响。(A)显示包含多聚化结构域CarH的嵌合型别构转录因子VTR3-CarH-RpaR的设计原理以及包含该别构转录因子的pC诱导系统的工作原理。(B)流式细胞术测定包含别构转录因子VTR3-CarH-RpaR或VTR3-RpaR的pC诱导系统的诱导曲线。Figure 4 shows the effect of the multimerization domain on the performance of the induction system. (A) shows the design principle of the chimeric allosteric transcription factor VTR3-CarH-RpaR containing the multimerization domain CarH and the working principle of the pC induction system containing the allosteric transcription factor. (B) Induction curves of pC-inducible systems containing allosteric transcription factors VTR3-CarH-RpaR or VTR3-RpaR measured by flow cytometry.
图5显示在诱导系统中添加负反馈能够使得应答线性化。(a)诱导剂pC-HSL和别构转录因子VTR3-RpaR对双输入启动子pC-dox调控的工作原理,其中,所述pC-dox启动子包含一对rpaO和一对tetO操纵位点。(b)诱导剂dox和别构转录因子TetR对双输入启动子pC-dox调控的工作原理。(c)流式细胞术测定pC-dox诱导系统的诱导曲线。所给出的数据均为三次重复实验的平均荧光值。Figure 5 shows that adding negative feedback to the induction system can linearize the response. (a) The working principle of the regulation of the dual-input promoter pC-dox by the inducer pC-HSL and the allosteric transcription factor VTR3-RpaR, wherein the pC-dox promoter contains a pair of rpaO and a pair of tetO operation sites. (b) The mechanism of regulation of the dual-input promoter pC-dox by the inducer dox and the allosteric transcription factor TetR. (c) The induction curve of the pC-dox induction system was determined by flow cytometry. The data presented are the mean fluorescence values of triplicate experiments.
图6显示pC诱导系统与Cinn诱导系统间的信号串扰。(A)借助流式细胞术测定各pC-HSL(ORS-pC)或Cinn-HSL(ORS-Cinn)浓度下,Cinn诱导系统的剂量响应曲线。(B)借助流式细胞术测定各pC-HSL(RpaO-pC)或Cinn-HSL(RpaO-Cinn)浓度下,pC诱导系统的剂量响应曲线。Figure 6 shows the signal crosstalk between the pC-inducible system and the Cinn-inducible system. (A) The dose-response curves of the Cinn-inducing system were measured by flow cytometry at each concentration of pC-HSL (ORS-pC) or Cinn-HSL (ORS-Cinn). (B) The dose-response curves of the pC-inducing system were determined by flow cytometry at various concentrations of pC-HSL (RpaO-pC) or Cinn-HSL (RpaO-Cinn).
图7显示利用本发明的Cinn诱导系统构建哺乳动物细胞间通讯。(A)哺乳动物细胞通讯系统B4YP-2XR8的基因线路图。(B)阴性对照模块2XR8的基因线路图。(C)B4YP-2XR8或2XR8模块在哺乳细胞中的通讯响应。从左至右分别为FITC通道、ECD通道、明场以及三者的叠加图。荧光显微镜的放大倍数为20×,比例尺为150μm。Figure 7 shows the construction of mammalian cell-to-cell communication using the Cinn induction system of the present invention. (A) Gene circuit diagram of mammalian cell communication system B4YP-2XR8. (B) Gene circuit diagram of negative control module 2XR8. (C) Communication response of B4YP-2XR8 or 2XR8 modules in mammalian cells. From left to right are the FITC channel, ECD channel, bright field and the overlay of the three. The magnification of the fluorescence microscope is 20×, and the scale bar is 150 μm.
图1-图4以及图6中,所给出的数据均为三次重复实验的平均荧光值,误差线对应于各测量的标准差。In Fig. 1-Fig. 4 and Fig. 6, the data given are the average fluorescence values of three repeated experiments, and the error bars correspond to the standard deviations of each measurement.
具体实施方式Detailed ways
针对现有哺乳动物诱导系统普遍存在的诱导水平低和漏表达等问题,本发明提出使用天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂,以剂量依赖和可逆的方式对哺乳细胞中的基因表达进行调控。Aiming at the common problems of low induction level and missing expression in existing mammalian induction systems, the present invention proposes to use natural amino acid-derived acyl homoserine lactone as an inducer to regulate gene expression in mammalian cells in a dose-dependent and reversible manner. To regulate.
如图1A所示,本发明的嵌合型别构转录因子具有原核来源的调控结构域以及真核来源的转录激活结构域,这种异源融合使得别构转录因子能够对异源信号做出应答:由调控结构域实现对诱导剂的应答,由转录激活结构域控制哺乳动物中基因的转录。具体而言,所述调控结构域为群体感应应答蛋白RpaR、BjaR或BraR,所述调控结构域与天然氨基酸衍生的酰基高丝氨酸内酯pC-HSL、Cinn-HSL或IV-HSL结合后产生构象改变,结合至DNA上的操纵序列,从而将转录激活结构域定位至包含该操纵序列的启动子,因此可通过诱导剂浓度来调控目标基因的表达水平。As shown in Figure 1A, the chimeric allosteric transcription factor of the present invention has a regulatory domain of prokaryotic origin and a transcriptional activation domain of eukaryotic origin, and this heterologous fusion enables the allosteric transcription factor to respond to heterologous signals. Response: The response to the inducer is achieved by the regulatory domain, and the transcription of the gene in mammals is controlled by the transcriptional activation domain. Specifically, the regulatory domain is a quorum sensing response protein RpaR, BjaR or BraR, and the regulatory domain is combined with acyl homoserine lactone pC-HSL, Cinn-HSL or IV-HSL derived from a natural amino acid to produce a conformation Change, bind to the operator sequence on the DNA, thereby positioning the transcriptional activation domain to the promoter containing the operator sequence, so the expression level of the target gene can be regulated by the concentration of the inducer.
在细菌群体感应系统中,通常由自诱导物合成酶LuxI和应答蛋白LuxR介导信号分子的产生和应答。LuxI在体内催化以S-腺苷甲硫氨酸和酰基载体蛋白(ACP)为底物生成信号分子AHL,此类信号分子可自由扩散穿过细胞膜;别构转录因子LuxR能够与扩散进入细菌的AHL结合,来调控群体感应依赖型基因的转录。在细菌的基因组上LuxI和LuxR基因的位置通常临近 [6],且LuxI的启动子包含能够被LuxR和AHL调控的操纵位点,使得AHL的合成通路包含正反馈,因此细菌能够在短时间内较快合成信号分子。在本发明中,将群体感应别构转录因子作为结构域调控与真核来源的转录激活结构域融合,并将该别构转录因子的操纵序列添加至真核启动子,构建在真核细胞中响应于群体感应信号的调控元件组和诱导系统。 In the bacterial quorum sensing system, the production and response of signaling molecules are usually mediated by the autoinducer synthase LuxI and the response protein LuxR. LuxI catalyzes the generation of signal molecule AHL with S-adenosylmethionine and acyl carrier protein (ACP) as substrates in vivo, and this kind of signal molecule can freely diffuse through the cell membrane; the allosteric transcription factor LuxR can interact with the AHL binds to regulate the transcription of quorum sensing-dependent genes. The positions of LuxI and LuxR genes on the bacterial genome are usually close to each other [6] , and the promoter of LuxI contains an operator site that can be regulated by LuxR and AHL, so that the synthesis pathway of AHL includes positive feedback, so bacteria can be in a short time Faster synthesis of signaling molecules. In the present invention, the quorum-sensing allosteric transcription factor is fused as a regulatory domain with the transcriptional activation domain derived from eukaryotic sources, and the operator sequence of the allosteric transcription factor is added to the eukaryotic promoter, and constructed in eukaryotic cells Regulatory element sets and induction systems in response to quorum sensing signals.
在本发明中,将所述嵌合型别构转录因子包含RpaR作为调控结构域、所述诱导型启动子包含rpaO作为操纵位点的诱导系统称为pC诱导系统,一般而言,该系统的诱导剂为对香豆酰基-高丝氨酸内酯(pC-HSL)。在本发明中,将所述嵌合型别构转录因子包含BraR作为调控结构域、所述诱导型启动子包含braO作为操纵位点的诱导系统称为Cinn诱导系统,一般而言,该系统的诱导剂为肉桂酰基-高丝氨酸内酯(Cinn-HSL)。在本发明中,将所述嵌合型别构转录因子包含BjaR作为调控结构域、所述诱导型启动子包含bjaO作为操纵位点的诱导系统称为IV诱导系统,一般而言,该系统的诱导剂为异戊酰基-高丝氨酸内酯(IV-HSL)。在本发明中,术语“诱导系统”以及“诱导表达系统”、“基因表达系统”可互换使用。In the present invention, the inducible system in which the chimeric allosteric transcription factor comprises RpaR as the regulatory domain and the inducible promoter comprises rpaO as the operating site is called the pC inducible system. Generally speaking, the The inducer is p-coumaroyl-homoserine lactone (pC-HSL). In the present invention, the inducible system in which the chimeric allosteric transcription factor comprises BraR as the regulatory domain and the inducible promoter comprises braO as the operating site is called the Cinn inducible system. Generally speaking, the The inducer was cinnamoyl-homoserine lactone (Cinn-HSL). In the present invention, the induction system in which the chimeric allosteric transcription factor includes BjaR as the regulatory domain and the inducible promoter includes bjaO as the operating site is called the IV induction system. Generally speaking, the The inducer is isovaleryl-homoserine lactone (IV-HSL). In the present invention, the terms "inducible system" and "inducible expression system" and "gene expression system" are used interchangeably.
本文中的术语“诱导剂”、“诱导物”、“信号分子”、“信号小分子”可互换使用,是指自身能够调控别 构转录因子与操纵序列结合的化学小分子,可为外源输入或宿主自身合成。已发现的细菌群体感应系统大多以直链脂肪酸型AHL作为信号分子,其酰基侧链决定信号的特异性。本发明使用的诱导剂天然氨基酸衍生的酰基高丝氨酸内酯目前仅在与植物共生的细菌中发现,这些小分子参与细菌-植物相互作用。在天然宿主体内,直链脂肪酸型AHL由LuxI型N-酰基高丝氨酸内酯合酶催化生成,该反应的底物为S-腺苷甲硫氨酸和酰基-酰基载体蛋白(Acyl-ACP)。而在天然氨基酸衍生的AHL的生物合成途径中,LuxI型合酶RpaI、BraI和BjaI的底物为酰基-辅酶A(Acyl-CoA)而非酰基-酰基载体蛋白。ACP和辅酶A具有相似的化学性质,两者都与羧化底物形成加合物并将这些底物呈递给酶。ACP的功能通常在于生物合成和转移脂肪酸底物;而CoA与结构更加多样的羧化底物相关:辅酶A修饰的化合物通常为生物合成途径和生物降解途径的中间体 [5]。天然氨基酸衍生的酰基高丝氨酸内酯能够自由扩散进入真核细胞,在细胞中可稳定存在,且不能由真核生物的自身代谢途径合成,这种异源诱导物能够有效降低本底表达。 The terms "inducer", "inducer", "signaling molecule" and "signaling small molecule" are used interchangeably herein, and refer to small chemical molecules that can regulate the binding of allosteric transcription factors and operator sequences by themselves. Source input or host own synthesis. Most of the discovered bacterial quorum sensing systems use straight-chain fatty acid AHL as the signal molecule, and its acyl side chain determines the specificity of the signal. The inducers used in the present invention are acyl homoserine lactones derived from natural amino acids, which are currently only found in bacteria that are symbiotic with plants, and these small molecules are involved in bacteria-plant interactions. In the natural host, straight-chain fatty acid-type AHL is catalyzed by Lux type I N-acyl homoserine lactone synthase, and the substrates of this reaction are S-adenosylmethionine and acyl-acyl carrier protein (Acyl-ACP) . In the biosynthetic pathway of AHL derived from natural amino acids, the substrates of Lux type I synthases RpaI, BraI and BjaI are acyl-coenzyme A (Acyl-CoA) rather than acyl-acyl carrier protein. ACP and coenzyme A have similar chemical properties, both form adducts with carboxylated substrates and present these substrates to the enzyme. The function of ACP usually lies in the biosynthesis and transfer of fatty acid substrates; while CoA is related to carboxylation substrates with more diverse structures: Coenzyme A modified compounds are usually intermediates in biosynthesis and biodegradation pathways [5] . Acyl homoserine lactones derived from natural amino acids can freely diffuse into eukaryotic cells, exist stably in cells, and cannot be synthesized by eukaryotic own metabolic pathways. This heterologous inducer can effectively reduce the background expression.
在本发明的嵌合型别构转录因子中,利用LuxR同源蛋白作为调控结构域,对信号分子进行检测和应答。其结构包含与诱导物结合的结构域以及与启动子操纵序列结合的结构域。调控结构域与小分子的结合造成构象改变,使得由该启动子控制的下游基因转录水平改变。一般而言,与DNA的结合触发或增强目标基因转录的别构转录因子称为激活因子,反之称为阻遏蛋白。本发明的嵌合型别构转录因子其功能为激活因子。诱导型启动子包含能够与调控结构域相互作用的操纵序列。操纵序列一般位于感兴趣基因的转录起始位点附近,或者空间上与转录起始位点邻近。本文中,术语“操纵位点”、“操纵序列”以及“转录调控位点”可互换使用。不希望被理论所限地,在本发明中,诱导型启动子可包含一对或多对操纵序列,该操纵位点对可均位于转录起始位点的上游或下游,或分别位于转录起始位点的两侧。In the chimeric allosteric transcription factor of the present invention, the LuxR homologous protein is used as a regulatory domain to detect and respond to signal molecules. Its structure includes a domain that binds to an inducer and a domain that binds to a promoter operator sequence. Binding of the regulatory domain to the small molecule causes a conformational change that alters the transcriptional level of downstream genes controlled by the promoter. In general, allosteric transcription factors that bind to DNA to trigger or enhance the transcription of target genes are called activators, and vice versa are called repressors. The function of the chimeric allosteric transcription factor of the present invention is an activator. Inducible promoters contain operator sequences capable of interacting with regulatory domains. The operator sequence is generally located near, or spatially adjacent to, the transcription initiation site of the gene of interest. Herein, the terms "operator site", "operator sequence" and "transcriptional regulatory site" are used interchangeably. Without wishing to be bound by theory, in the present invention, an inducible promoter may comprise one or more pairs of operator sequences, and the operator site pairs may both be located upstream or downstream of the transcription initiation site, or located respectively at the transcription initiation site. both sides of the starting point.
术语“启动子”指的是驱动核酸序列表达或转录的DNA区域,位于基因转录起始位点的同义链上游。在启动子处RNA聚合酶以及转录因子能够与DNA结合。诱导型启动子一般包含转录调控区、RNA聚合酶识别区和转录起始位点。本领域已知的是,将感兴趣基因的编码序列可操作地连接至诱导型启动子的下游,并在所需时机添加诱导物,可动态开启或关闭该基因的转录。术语“最小启动子”或“启动子的核心序列”指的是在真核细胞中能够触发转录的最小转录控制单元,例如仅包含RNA聚合酶识别区和转录起始位点的启动子核心区域。可将至少一个操纵位点添加至最小启动子的上游和/或下游,构造开关性能优异的诱导型启动子。在本发明优选的实施方式中,所述诱导型启动子的核心序列可选自于CMV1启动子(SEQ ID NO:1)、CMVmini启动子(SEQ ID NO:2)、TRE3G启动子(SEQ ID NO:3)、EF1a核心启动子(SEQ ID NO:4)或hEF1a启动子(SEQ ID NO:40);优选地,所述核心序列为CMV1启动子或TRE3G启动子。The term "promoter" refers to a region of DNA that drives the expression or transcription of a nucleic acid sequence, located synonymously upstream of the transcription initiation site of a gene. RNA polymerase and transcription factors are able to bind DNA at the promoter. An inducible promoter generally includes a transcriptional regulatory region, an RNA polymerase recognition region, and a transcriptional initiation site. It is known in the art that the transcription of the gene can be dynamically turned on or off by operably linking the coding sequence of the gene of interest to the downstream of an inducible promoter, and adding an inducer at a desired time. The term "minimal promoter" or "promoter's core sequence" refers to the smallest transcriptional control unit capable of triggering transcription in eukaryotic cells, such as a promoter core region comprising only the RNA polymerase recognition region and the transcription initiation site . At least one operator site can be added upstream and/or downstream of the minimal promoter to construct an inducible promoter with excellent switch performance. In a preferred embodiment of the present invention, the core sequence of the inducible promoter can be selected from CMV1 promoter (SEQ ID NO: 1), CMVmini promoter (SEQ ID NO: 2), TRE3G promoter (SEQ ID NO: 3), EF1a core promoter (SEQ ID NO: 4) or hEF1a promoter (SEQ ID NO: 40); preferably, the core sequence is a CMV1 promoter or a TRE3G promoter.
在一些实施方式中,所述诱导系统为pC诱导系统。其中,所述诱导剂为pC-HSL,所述嵌合型别构转录因子的调控结构域为RpaR,所述诱导型启动子包含的操纵序列为rpaO(表1)。pC群体感应系统的天然宿主为与植物共生的池沼红假单胞菌(Rhodopseudomonas palustris),其群体感应系统包含LuxI家族的RpaI和LuxR家族的RpaR。RpaI的转录由pC-HSL和RpaR共同激活。在池沼红假单胞菌RCB100菌株中,RpaR的编码序列为其全基因组序列(NCBI登录号为WP_011155889.1)反义链第348672-349403bp。在池沼红假单胞菌培养物中,pC-HSL的浓度在1-10μM量级,该细菌可在nM量级的pC-HSL存在下出现群体感应现象 [7]。此外,包含rpaO操纵序列的RpaI启动子不能被直链脂肪酸型AHL激活,表明工作浓度下pC群体感应系统与直链脂肪酸型AHL群体感应系统存在良好的正交性。可将该系统与直链脂肪酸型AHL诱导系统(例如以3OC6或3OC8作为诱导剂)组合使用,构建多重诱导系统。如本发明实施例所展示的,也可使用Cinn-HSL作为诱导剂调控pC诱导系统。就本发明的嵌合型别构转录因子、调控元件组以及诱导系统在构建细胞间信号通讯系统中的用途而言,池沼红假单胞菌中存在的pC-HSL合成途径并不完整,需要在培养基中添加对香豆酸盐才能实现pC-HSL的生物合成。在本发明中,可利用酪氨酸解氨酶TAL、4-香豆酸辅酶A连接酶4CL以及对香豆酰基-高丝氨酸内酯合成酶RpaI催化由酪氨酸合成对香豆酰基-高丝氨酸内酯 [8]In some embodiments, the inducible system is a pC inducible system. Wherein, the inducer is pC-HSL, the regulatory domain of the chimeric allosteric transcription factor is RpaR, and the operator sequence contained in the inducible promoter is rpaO (Table 1). The natural host of the pC quorum sensing system is Rhodopseudomonas palustris, which is symbiotic with plants, and its quorum sensing system includes RpaI of the LuxI family and RpaR of the LuxR family. The transcription of RpaI is jointly activated by pC-HSL and RpaR. In Rhodopseudomonas palustris RCB100 strain, the coding sequence of RpaR is the 348672-349403bp antisense strand of its whole genome sequence (NCBI accession number is WP_011155889.1). In the culture of Rhodopseudomonas palustris, the concentration of pC-HSL is on the order of 1-10 μM, and the bacterium can appear quorum sensing in the presence of nM pC-HSL [7] . In addition, the RpaI promoter containing the rpaO operator sequence could not be activated by straight-chain fatty acid-type AHL, indicating that there was good orthogonality between the pC quorum sensing system and the straight-chain fatty acid-type AHL quorum sensing system at working concentrations. This system can be used in combination with a linear fatty acid-type AHL induction system (for example, 3OC6 or 3OC8 as an inducer) to construct a multiple induction system. As demonstrated in the examples of the present invention, the pC induction system can also be regulated using Cinn-HSL as an inducer. In terms of the use of the chimeric allosteric transcription factor, regulatory element group and induction system of the present invention in the construction of an intercellular signal communication system, the pC-HSL synthesis pathway that exists in Rhodopseudomonas palustris is not complete, and requires The biosynthesis of pC-HSL can only be achieved by adding p-coumarate to the medium. In the present invention, tyrosine ammonia lyase TAL, 4-coumaric acid coenzyme A ligase 4CL and p-coumaroyl-homoserine lactone synthetase RpaI can be used to catalyze the synthesis of p-coumaroyl-homoserine from tyrosine Serine lactone [8] .
在一些实施方式中,所述诱导系统为Cinn诱导系统。其中,所述诱导剂为Cinn-HSL,所述嵌合型别构转录因子的调控结构域为BraR,所述诱导型启动子包含的操纵序列为braO(表1)。在本发明中,将能够响应Cinn-HSL和BraR的操纵序列统称为braO,braO例如可为本发明识别的NC(SEQ ID NO:5)、ORS(SEQ ID NO:6)、APO(SEQ ID NO:7)、CUS(SEQ ID NO:8)、CUL(SEQ ID NO:9)、R1(SEQ ID NO:10)、R4(SEQ ID NO:11)、R7(SEQ ID NO:12)或R8(SEQ ID NO:13)(图2),优选为R8。如上所述,在与植物共生的池沼红假单胞菌中仅存在完整的群体感应信号检测和应答模块,其pC-HSL信号的生成需要外加对香豆酸盐。研究表明,光合异养的慢生根瘤菌(Bradyrhizobium)分泌的小分子Cinn-HSL能够激发池沼红假单胞菌的群体感应。在慢生根瘤菌中,由LuxI的同源蛋白BraI(NCBI登录号:CAL74857.1)催化Cinn-HSL的生物合成,并由LuxR同源蛋白BraR检测该信号分子。例如,在Bradyrhizobium ORS278中,BraR的编码序列为其基因组(NCBI登录号:CU234118.1)反义链第1004717-1005442bp),BraI的编码序列为其基因组反义链第1004631-1003951bp。在慢生根瘤菌报告菌株 中,实现半饱和群体感应应答所需Cinn-HSL的浓度可低至10pM,而实现半饱和群体感应应答所需的直链脂肪酸型AHL的浓度为10 4-10 7pM,表明工作浓度下Cinn群体感应系统与直链脂肪酸型AHL群体感应系统存在良好的正交性。可通过降低Cinn-HSL的浓度,将该系统与直链脂肪酸型AHL诱导系统(例如以3OC6或3OC8作为诱导剂)组合使用,构建多重诱导系统。在池沼红假单胞菌中,较高浓度的Cinn-HSL也能够激活包含rpaO操纵序列的RpaI启动子,可通过调整Cinn-HSL的浓度,对同时包含pC诱导系统和Cinn诱导系统的宿主细胞进行时空特异性调控。就本发明的嵌合型别构转录因子、调控元件组以及诱导系统在构建细胞间信号通讯系统中的用途而言,虽然慢生根瘤菌生物合成Cinn-HSL不需要添加肉桂酸盐,然而Cinn-HSL的产量仅有20nM。考虑到BraI催化由S-腺苷甲硫氨酸和活性形式的肉桂酸盐生成Cinn-HSL,可通过导入肉桂酸盐生物合成通路提高中间产物肉桂酸盐的产量,进而增加Cinn-HSL的产量。本领域已知的是,苯丙氨酸解氨酶催化由苯丙氨酸脱氨生成肉桂酸。苯丙氨酸解氨酶例如可来自于马里蒂姆链霉菌(Streptomyces maritimus)或圆红冬孢酵母菌。在本发明中,利用苯丙氨酸解氨酶PAL、4-香豆酸辅酶A连接酶4CL以及BraI催化由苯丙氨酸合成肉桂酰基-高丝氨酸内酯 [9]In some embodiments, the induction system is a Cinn induction system. Wherein, the inducer is Cinn-HSL, the regulatory domain of the chimeric allosteric transcription factor is BraR, and the operator sequence contained in the inducible promoter is braO (Table 1). In the present invention, the operator sequences capable of responding to Cinn-HSL and BraR are collectively referred to as braO, for example, braO can be NC (SEQ ID NO: 5), ORS (SEQ ID NO: 6), APO (SEQ ID NO: 7), CUS (SEQ ID NO: 8), CUL (SEQ ID NO: 9), R1 (SEQ ID NO: 10), R4 (SEQ ID NO: 11), R7 (SEQ ID NO: 12) or R8 (SEQ ID NO: 13) (Figure 2), preferably R8. As mentioned above, there is only a complete quorum sensing signal detection and response module in Rhodopseudomonas ponderosa symbiotic with plants, and the generation of its pC-HSL signal requires the addition of p-coumarate. Studies have shown that the small molecule Cinn-HSL secreted by photoheterotrophic Bradyrhizobium can stimulate the quorum sensing of Rhodopseudomonas palustris. In Bradyrhizobium, the biosynthesis of Cinn-HSL is catalyzed by LuxI homologous protein BraI (NCBI accession number: CAL74857.1), and the signal molecule is detected by LuxR homologous protein BraR. For example, in Bradyrhizobium ORS278, the coding sequence of BraR is its genome (NCBI accession number: CU234118.1) antisense strand 1004717-1005442bp), and the coding sequence of BraI is its genome antisense strand 1004631-1003951bp. In Bradyrhizobium reporter strains, the concentration of Cinn-HSL required to achieve a half-saturated quorum sensing response can be as low as 10 pM, while the concentration of linear fatty acid-type AHL required to achieve a half-saturated quorum sensing response is 10 4 -10 7 pM, indicating that there is a good orthogonality between the Cinn quorum sensing system and the linear fatty acid AHL quorum sensing system at the working concentration. A multiple induction system can be constructed by reducing the concentration of Cinn-HSL and using this system in combination with a linear fatty acid-type AHL induction system (for example, using 3OC6 or 3OC8 as an inducer). In Rhodopseudomonas palustris, a higher concentration of Cinn-HSL can also activate the RpaI promoter containing the rpaO operator sequence. By adjusting the concentration of Cinn-HSL, the host cells containing both the pC inducible system and the Cinn inducible system can Spatiotemporal specific regulation. As far as the chimeric allosteric transcription factor of the present invention, regulatory element group and induction system are used in the construction of the intercellular signal communication system, although Bradyrhizobium biosynthesis of Cinn-HSL does not require the addition of cinnamate, Cinn The yield of -HSL was only 20nM. Considering that BraI catalyzes the production of Cinn-HSL from S-adenosylmethionine and the active form of cinnamate, the production of the intermediate product cinnamate can be increased by introducing the cinnamate biosynthesis pathway, thereby increasing the production of Cinn-HSL . It is known in the art that phenylalanine ammonia lyase catalyzes the deamination of phenylalanine to cinnamic acid. The phenylalanine ammonia lyase can be derived from, for example, Streptomyces maritimus or Rhodosporidium toruloides. In the present invention, phenylalanine ammonia-lyase PAL, 4-coumaric acid coenzyme A ligase 4CL and BraI are used to catalyze the synthesis of cinnamoyl-homoserine lactone from phenylalanine [9] .
在一些实施方式中,所述诱导系统为IV诱导系统。其中,所述诱导剂为IV-HSL,所述嵌合型别构转录因子的调控结构域为BjaR,所述诱导型启动子包含的操纵序列为bjaO(表1)。IV应答元件存在于多种慢生根瘤菌中,例如日本慢生根瘤菌(Bradyrhizobium japonicum)、扩大重氮根瘤菌(Bradyrhizobium diazoefficiens)、Bradyrhizobium niftali、辽宁慢生根瘤菌(Bradyrhizobium liaoningense)等。在最早发现的天然宿主日本慢生根瘤菌中,其群体感应系统包含LuxI家族的BjaI和LuxR家族的BjaR。BjaI的转录由IV-HSL和BjaR共同激活。在日本慢生根瘤菌中BjaI-lacZ报告子能够对低至10pM量级的IV-HSL产生应答,而产生群体感应信号所需的直链脂肪酸型AHL的浓度为nM量级,表明工作浓度下IV诱导系统与直链脂肪酸型AHL诱导系统存在较好的正交性 [5]。在扩大重氮根瘤菌中,BjaR的编码序列为其基因组(110spc4菌株的全基因组NCBI登录号:NZ_CP032617.1)正义链第1105945-1106673bp。就本发明的嵌合型别构转录因子、调控元件组以及诱导系统在构建细胞间信号通讯系统的用途而言,可由支链α-酮酸脱氢酶复合物BCDH和异戊酰-高丝氨酸内酯合成酶BjaI催化由异亮氨酸合成IV-HSL [8]In some embodiments, the induction system is an IV induction system. Wherein, the inducer is IV-HSL, the regulatory domain of the chimeric allosteric transcription factor is BjaR, and the operator sequence contained in the inducible promoter is bjaO (Table 1). IV response elements exist in a variety of Bradyrhizobium japonicum, Bradyrhizobium diazoefficiens, Bradyrhizobium niftali, Bradyrhizobium liaoningense, etc. In Bradyrhizobium japonicum, the first discovered natural host, its quorum sensing system includes BjaI of the LuxI family and BjaR of the LuxR family. The transcription of BjaI is jointly activated by IV-HSL and BjaR. In Bradyrhizobium japonicus, the BjaI-lacZ reporter can respond to IV-HSL as low as 10pM, while the concentration of linear fatty acid-type AHL required to produce quorum sensing signals is nM, indicating that the working concentration is low. There is good orthogonality between the IV induction system and the linear fatty acid type AHL induction system [5] . In Diazorhizobium amplified, the coding sequence of BjaR is the sense strand 1105945-1106673 bp of its genome (the whole genome NCBI accession number of 110spc4 strain: NZ_CP032617.1). In terms of the use of the chimeric allosteric transcription factor, regulatory element group and induction system of the present invention in the construction of an intercellular signal communication system, the branched chain α-ketoacid dehydrogenase complex BCDH and isovaleryl-homoserine Lactone synthase BjaI catalyzes the synthesis of IV-HSL from isoleucine [8] .
表1 pC、Cinn和IV诱导系统的各元件Table 1 Elements of pC, Cinn and IV induction systems
Figure PCTCN2022071023-appb-000001
Figure PCTCN2022071023-appb-000001
本领域已知的是,可将原核来源的转录调控因子与真核转录激活结构域融合,对真核细胞中基因表达进行调控。US8383364B2公开了将单纯疱疹病毒的VP16激活域与原核阻遏蛋白TetR融合制备转录激活因子tTA,在真核细胞中tTA调控含有tetO的启动子P tet的转录活性。Neddermann等将LuxR家族别构转录因子TraR与真核转录激活结构域p65融合,该转录因子TraR-p65可对哺乳动物的基因进行诱导表达 [3]。此外,可将四个重复串联的VP16(即VP64)、p65 [10]或SAM [11]等真核转录激活结构域融合至dCas9,由sgRNA引导靶基因激活。Ma等将已知的真核转录激活结构域p65、RTA和VP64融合,构建一系列VTR结构域,其与dCas9的融合蛋白能够用于调控真核基因转录 [12]。可用作本发明嵌合型别构转录因子的转录激活结构域包括但不限于上述p65、VP16、VP64、VTR1、VTR2以及VTR3,优选VP64或VTR3。 It is known in the art that transcriptional regulators of prokaryotic origin can be fused to eukaryotic transcriptional activation domains to regulate gene expression in eukaryotic cells. US8383364B2 discloses that the VP16 activation domain of the herpes simplex virus is fused with the prokaryotic repressor protein TetR to prepare the transcriptional activator tTA. In eukaryotic cells, tTA regulates the transcriptional activity of the promoter P tet containing tetO. Neddermann et al. fused the LuxR family allosteric transcription factor TraR with the eukaryotic transcriptional activation domain p65, and the transcription factor TraR-p65 can induce the expression of mammalian genes [3] . In addition, eukaryotic transcriptional activation domains such as VP16 (ie VP64), p65 [10] or SAM [11] with four repeats in tandem can be fused to dCas9 to guide target gene activation by sgRNA. Ma et al. fused the known eukaryotic transcription activation domains p65, RTA, and VP64 to construct a series of VTR domains, and their fusion proteins with dCas9 can be used to regulate eukaryotic gene transcription [12] . Transcription activation domains that can be used as chimeric allosteric transcription factors of the present invention include but are not limited to the above-mentioned p65, VP16, VP64, VTR1, VTR2 and VTR3, preferably VP64 or VTR3.
本领域公知的是,如果核酸分子(如DNA)包含含有转录和翻译调控信息的核苷酸序列、而且这样的序列“可操作地连接”至编码多肽的核苷酸序列,则该核酸分子(如DNA)被称为“能够表达”所述多肽。可操作的连接是调控DNA序列与各结构域的编码序列通过“允许基因以可回收的量表达为肽或蛋白”的方式相连接的连接。就将调控结构域可操作地连接至转录激活结构域而言,只要不破坏调控结构域以及转录激活结构域的功能,可将调控结构域直接连接至转录激活结构域的N端或C端,或使用连接肽将调控结构域连接至转录激活结构域的N端或C端。连接肽为柔性,不包含任何活性结构域,优选由甘氨酸和/或丝氨酸组成。事实上连接肽的长度对嵌合蛋白的影响极其有限,并可为任意长度。在一个实施方式中,连接肽为20-60bp。在一个实施方式中,连接肽的序列为GGAGGAGGCGGCTCCGGAGGCGGAGGAAGC(SEQ ID NO:16)。It is well known in the art that if a nucleic acid molecule (such as DNA) comprises a nucleotide sequence containing transcriptional and translational regulatory information, and such a sequence is "operably linked" to a nucleotide sequence encoding a polypeptide, then the nucleic acid molecule ( such as DNA) is said to be "capable of expressing" the polypeptide. An operable link is a link between the regulatory DNA sequence and the coding sequence of each structural domain in a manner that "allows the gene to be expressed as a peptide or protein in a recoverable amount". As far as the regulatory domain is operably linked to the transcriptional activation domain, as long as the functions of the regulatory domain and the transcriptional activation domain are not destroyed, the regulatory domain can be directly linked to the N-terminal or C-terminal of the transcriptional activation domain, Or a linker peptide is used to link the regulatory domain to the N- or C-terminus of the transcriptional activation domain. The connecting peptide is flexible, does not contain any active domains, and preferably consists of glycine and/or serine. In fact the length of the connecting peptide has very limited effect on the chimeric protein and can be of any length. In one embodiment, the connecting peptide is 20-60 bp. In one embodiment, the sequence of the connecting peptide is GGAGGAGGCGGCTCCGGAGGCGGAGGAAGC (SEQ ID NO: 16).
在优选的实施方式中,本发明的嵌合型别构转录因子还可包含核定位信号肽(NLS),所述核定位信号肽将别构转录因子定位至真核细胞的核内。事实上,在最初的报道中,是否添加NLS对于Tet诱导系统的性能影响不大;然而在将Tet用于腺病毒表达系统时,加入的NLS极大提高了诱导系统的性能(US 7745592B2)。此外,将TraR-p65连接区域中包含的NLS删除对嵌合蛋白的性能影响不大 [3]。本发明的嵌合型别构转录因子可包含核定位信号肽,所述核定位信号肽包含1-4个核定位序列,各核定位序列的两端可具有5-40bp的侧翼序列,所述侧翼序列为柔性。各侧翼序列的长度可相同或不同。事实上,侧翼序列对于嵌合蛋白的功能影响极其有限,可为任意序列。所述核定位信号肽可位于所述嵌合型别构转录因子的N端或C端,或者位于调控结构域与转录激活结构域之间。在优选的实施方式中,所述核定位信号肽位于调控结构域与转录激活结构域之间,所述核定位信号肽具有2个核定位序列,在核定位序列的两端和之间具有5-20bp的侧翼序列。可用于本发明的核定位序列包括:具有PKKKRKV的氨基酸序列的SV40病毒大T抗原的NLS;衍生于核质蛋白的NLS(例如具有KRPAATKKAGQAKKKK序列的双分型核质蛋白NLS);具有PAAKRVKLD或RQRRNELKRSP氨基酸序列的c-myc的NLS;具有氨基酸序列NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY的hRNPA1M9的NLS;输入蛋白(importin-α)衍生的IBB结构域的序列RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV;肌瘤T蛋白的序列VSRKRPRP和PPKKARED;人p53的序列POPKKKPL;小鼠c-abl IV的序列SALIKKKKKMAP;流感病毒NS1的序列DRLRR和PKQKKRK;丁型肝炎抗原的序列RKLKKKIKKL;小鼠Mx1蛋白的序列REKKKFLKRR;人多聚(ADP-核糖)聚合酶的序列KRKGDEVDGVDEVAKKKSKK;或者类固醇激素受体糖皮质激素序列RKCLQAGMNLEA RKTKK,但不限于此。在优选的实施方式中,所述核定位序列为gctgaccccaagaagaagaggaaggtg(SEQ ID NO:17)。 In a preferred embodiment, the chimeric allosteric transcription factor of the present invention may further comprise a nuclear localization signal peptide (NLS), which localizes the allosteric transcription factor to the nucleus of eukaryotic cells. In fact, in the original report, adding NLS had little effect on the performance of the Tet induction system; however, when Tet was used in the adenovirus expression system, the addition of NLS greatly improved the performance of the induction system (US 7745592B2). Furthermore, deletion of the NLS included in the TraR-p65 junction region had little effect on the performance of the chimeric protein [3] . The chimeric allosteric transcription factor of the present invention may comprise a nuclear localization signal peptide, the nuclear localization signal peptide comprises 1-4 nuclear localization sequences, and each nuclear localization sequence may have a flanking sequence of 5-40bp at both ends, said The flanking sequences are flexible. Each flanking sequence may be the same or different in length. In fact, the flanking sequence has very limited influence on the function of the chimeric protein and can be any sequence. The nuclear localization signal peptide can be located at the N-terminal or C-terminal of the chimeric allosteric transcription factor, or between the regulatory domain and the transcriptional activation domain. In a preferred embodiment, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain, and the nuclear localization signal peptide has two nuclear localization sequences, with 5 -20bp flanking sequence. Nuclear localization sequences that can be used in the present invention include: the NLS of the SV40 virus large T antigen with the amino acid sequence of PKKKRKV; the NLS derived from the nucleoplasmic protein (such as the bipartite nucleoplasmic protein NLS with the KRPAATKKAGQAKKKK sequence); with PAAKRVKLD or RQRRNELKRSP NLS of c-myc with amino acid sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY NLS of hRNPA1M9; sequence of importin-α derived IBB domain RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV; sequence of myoma T protein VSRKRPRP and PPKKARED; sequence SALIKKKKKMAP of mouse c-abl IV; sequence DRLRR and PKQKKRK of influenza virus NS1; sequence RKLKKKIKKL of hepatitis D antigen; sequence REKKKFLKRR of mouse Mx1 protein; sequence KRKGDEVDGVDEVAKKKKSKK of human poly(ADP-ribose) polymerase; or Steroid hormone receptor glucocorticoid sequence RKCLQAGMNLEA RKTKK, but not limited thereto. In a preferred embodiment, the nuclear localization sequence is gctgaccccaagaagaagaggaaggtg (SEQ ID NO: 17).
在优选的实施方式中,本发明的嵌合型别构转录因子还可包含多聚化结构域,使得嵌合型别构转录因子能够发生多聚化。这种多聚化的能力并不依赖于调控结构域或转录激活结构域的结构,且并不影响调控结构域与相应操纵位点的相互作用。如本发明实施例所展示的,即便在启动子区上仅存在一对操纵位点,别构转录因子中包含多聚化结构域的诱导系统开关性能也更优。在一些实施方式中,所述多聚化结构域为CarH(SEQ ID NO:18)。CarH是一种辅酶B12依赖的细菌光受体蛋白,可对光产生应答而控制类胡萝卜素合成。在黑暗中与辅酶B12结合后单体CarH可形成四聚体,而光照后四聚体分解。本领域已报道利用CarH的四聚化性质,将其C端腺苷钴胺素结合域用于制备凝胶 [13]。就本发明而言,在嵌合型别构转录因子的调控结构域和转录激活结构域之间添加多聚化结构域能够增加诱导曲线的陡峭度(steepness),并降低本底表达。在一些实施方式中,所述多聚化域位于所述调控结构域与所述转录激活结构域之间。优选地,利用连接肽将调控结构域、多聚化结构域与转录激活结构域相连。在优选的实施方式中,所述嵌合型别构转录因子还包含核定位信号肽,所述核定位信号肽可位于嵌合型别构转录因子的N端或C端;或者位于调控结构域与多聚化结构域之间、或者位于转录激活结构域与多聚化结构域之间。连接肽与核定位信号肽的特征如上文所述。 In a preferred embodiment, the chimeric allosteric transcription factor of the present invention may further comprise a multimerization domain, enabling the multimerization of the chimeric allosteric transcription factor. This ability to multimerize does not depend on the structure of the regulatory domain or the transcriptional activation domain, and does not affect the interaction of the regulatory domain with the corresponding operator site. As shown in the examples of the present invention, even if there is only a pair of operator sites on the promoter region, the inducible system switch performance of the allosteric transcription factor containing the multimerization domain is better. In some embodiments, the multimerization domain is CarH (SEQ ID NO: 18). CarH is a coenzyme B12-dependent bacterial photoreceptor protein that controls carotenoid synthesis in response to light. Monomer CarH can form a tetramer after combining with coenzyme B12 in the dark, and the tetramer disintegrates under light. It has been reported in the field that the C-terminal adenosylcobalamin binding domain of CarH was used to prepare gels by utilizing the tetramerization property of CarH [13] . For the purposes of the present invention, the addition of a multimerization domain between the regulatory domain and the transcriptional activation domain of a chimeric allosteric transcription factor increases the steepness of the induction curve and reduces background expression. In some embodiments, the multimerization domain is located between the regulatory domain and the transcriptional activation domain. Preferably, the regulatory domain, the multimerization domain and the transcriptional activation domain are linked using a linker peptide. In a preferred embodiment, the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, and the nuclear localization signal peptide may be located at the N-terminal or C-terminal of the chimeric allosteric transcription factor; or in the regulatory domain Between the multimerization domain, or between the transcription activation domain and the multimerization domain. The features of the linker peptide and the nuclear localization signal peptide are as described above.
本领域技术人员能够理解的是,可根据所需的转录因子表达量选择本发明诱导系统的第一启动子(即,驱动嵌合型别构转录因子表达的启动子)。第一启动子可为常用于哺乳动物的启动子,可为组成型启动子、诱导型启动子,也可为组织特异性启动子。在本发明中,第一启动子可与驱动感兴趣的基因表达的所述诱导型启动子相同或不同。在一些实施方式中,第一启动子选自于CMV1启动子、CMVmini启动子、TRE3G启动子、EF1a核心启动子或hEF1a启动子。Those skilled in the art can understand that the first promoter of the induction system of the present invention (ie, the promoter driving the expression of chimeric allosteric transcription factors) can be selected according to the desired expression level of transcription factors. The first promoter may be a promoter commonly used in mammals, a constitutive promoter, an inducible promoter, or a tissue-specific promoter. In the present invention, the first promoter may be the same as or different from the inducible promoter driving the expression of the gene of interest. In some embodiments, the first promoter is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a promoter.
在优选的实施方式中,在本发明的调控元件组以及诱导系统中添加负反馈,能够实现报告子水平的线性变换。在无负反馈的设计中,剂量响应曲线表现为S型,即,在诱导物浓度升高时报告基因表达量迅速达到饱和。而在添加负反馈后,剂量响应曲线在较宽的诱导物浓度范围内均表现为线性,且动态范围增大。不希望被理论所限地,线性化基因线路的结构类似于由阻遏蛋白调控的细菌操纵子。然而,由于在细菌中效应蛋白的功能通常在于催化生化反应,或者效应蛋白编码影响诱导物水平的运载蛋白,会产生其它反馈,因此在天然的基因线路中几乎不存在完全线性的应答 [14]。考虑到原核基因线路与真核宿主的信号通路正交,可在本发明的诱导系统中增加负反馈,将S型(sigmoidal)剂量响应曲线线性化并降低双峰(bimodal)状态,从而获得具有较大线性范围的诱导系统。这些线性诱导系统对于构建复杂转录调控网络是有益的。在此类实施方式中,本发明的调控元件组进一步包含阻遏蛋白(例如TetR(US8383364B2)或CymR(WO2019175600A1))以及调控该阻遏蛋白的第三启动子,所述第三启动子包含与所述阻遏蛋白相互作用的第二操纵位点(例如tetO或cymO)。此外,所述诱导型启动子除包含与嵌合型别构转录因子相互作用的操纵位点外,进一步包含与阻遏蛋白相互作用的第二操纵位点。第二操纵位点可为一个或多个,可位于转录起始位点的上游或下游,或者多个第二操纵位点分布于转录起始位点的上游或下游。在此类实施方式中,本发明的诱导表达系统进一步包含第三基因表达盒,所述第三基因表达盒包含第三启动子序列以及阻遏蛋白的编码序列,用于在宿主细胞中表达所述阻遏蛋白,所述第三启动子序列包含核心序列以及与所述阻遏蛋白相互作用的第二操纵位点;所述诱导型启动子序列除包含与嵌合型别构转录因子相互作用的操纵位点 外,进一步包含与阻遏蛋白相互作用的第二操纵位点。在一个优选的实施方式中,所述阻遏蛋白为TetR,所述第二操纵位点为tetO。所述TetR及其操纵序列tetO参见例如US8383364B2。在本发明的实施方式中,tetO为tccctatcagtgatagaga(SEQ ID NO:37的第173-191位核苷酸)。TetR的基因序列为如文献 [17]所述。在一个实施方式中,所述阻遏蛋白为CymR,所述第二操纵位点为cymO。所述CymR及其操纵序列cymO例如参见CN107344962A。 In a preferred embodiment, negative feedback is added to the set of regulatory elements and the induction system of the present invention, so that the linear transformation of the reporter level can be realized. In a design without negative feedback, the dose-response curve exhibits an S-shape, that is, reporter gene expression saturates rapidly as the inducer concentration increases. However, after adding negative feedback, the dose-response curves were linear in a wide range of inducer concentrations, and the dynamic range increased. Without wishing to be bound by theory, the structure of the linearized gene circuit resembles a bacterial operon regulated by a repressor protein. However, since in bacteria the function of effector proteins is often to catalyze biochemical reactions, or the effector proteins encode carrier proteins that affect the level of inducers, other feedbacks will occur, so that a completely linear response rarely exists in natural gene circuits [14] . Considering that the prokaryotic gene circuit is orthogonal to the signal pathway of the eukaryotic host, negative feedback can be added in the induction system of the present invention to linearize the sigmoidal dose-response curve and reduce the bimodal state, thereby obtaining Induced systems with a large linear range. These linear induction systems are beneficial for constructing complex transcriptional regulatory networks. In such embodiments, the set of regulatory elements of the present invention further comprises a repressor protein (such as TetR (US8383364B2) or CymR (WO2019175600A1)) and a third promoter regulating the repressor protein, said third promoter comprising the same A second operator site for repressor interaction (eg tetO or cymO). In addition, besides the operating site interacting with the chimeric allosteric transcription factor, the inducible promoter further includes a second operating site interacting with the repressor protein. There can be one or more second manipulation sites, and they can be located upstream or downstream of the transcription initiation site, or multiple second manipulation sites are distributed upstream or downstream of the transcription initiation site. In such embodiments, the inducible expression system of the present invention further comprises a third gene expression cassette comprising a third promoter sequence and a coding sequence of a repressor protein for expressing said gene expression cassette in a host cell. repressor protein, the third promoter sequence includes a core sequence and a second operator site interacting with the repressor protein; the inducible promoter sequence includes an operator site interacting with a chimeric allosteric transcription factor In addition to the site, it further contains a second operator site that interacts with the repressor protein. In a preferred embodiment, the repressor protein is TetR, and the second operator site is tetO. For the TetR and its operating sequence tetO, see, for example, US8383364B2. In an embodiment of the invention, tetO is tccctatcagtgatagaga (nucleotides 173-191 of SEQ ID NO: 37). The gene sequence of TetR is as described in literature [17] . In one embodiment, the repressor protein is CymR and the second operator site is cymO. For the CymR and its operating sequence cymO, see CN107344962A, for example.
鉴于本发明的嵌合型别构转录因子中调控结构域来源于群体感应系统的信号检测和应答蛋白,可将本发明的诱导系统用作细胞间通讯系统的信号接收模块。具体而言,将信号分子的生物合成基因表达盒作为信号发送元件整体置于一种细胞,使得信号发送细胞表达该信号分子体内合成所需的全部酶。另一方面,将本发明的诱导系统作为信号接收元件置于另一种细胞,从而利用信号分子作为“电线”偶联信号发送模块和信号接收模块,构建多细胞基因线路。也可模仿原核群体感应系统,将信号发送模块与本发明的诱导系统导入相同的哺乳动物细胞内,调控哺乳动物细胞在群体水平的功能。与pC、Cinn和IV诱导系统相对应的信号发送模块(信号分子的完整生物合成通路)如上文所述,并总结于表2中。In view of the fact that the regulatory domain of the chimeric allosteric transcription factor of the present invention is derived from the signal detection and response protein of the quorum sensing system, the induction system of the present invention can be used as a signal receiving module of the intercellular communication system. Specifically, the biosynthetic gene expression cassette of the signal molecule is placed in a cell as a signal sending element as a whole, so that the signal sending cell expresses all the enzymes required for the synthesis of the signal molecule in vivo. On the other hand, the induction system of the present invention is placed in another cell as a signal receiving element, so that the signal molecule is used as a "wire" to couple the signal sending module and the signal receiving module to construct a multicellular gene circuit. The prokaryotic quorum sensing system can also be imitated, and the signal sending module and the induction system of the present invention can be introduced into the same mammalian cells to regulate the functions of the mammalian cells at the population level. The signaling modules (complete biosynthetic pathways of signaling molecules) corresponding to the pC, Cinn and IV induction systems are described above and summarized in Table 2.
表2 pC、Cinn和IV信号发送模块Table 2 pC, Cinn and IV signal transmission modules
Figure PCTCN2022071023-appb-000002
Figure PCTCN2022071023-appb-000002
由于其正交性、模块化、可调性和可组合性,可将本发明的嵌合型别构转录因子、调控元件组以及诱导系统瞬时或稳定转染至哺乳动物细胞,对宿主的基因表达进行调控。瞬时转染和稳定转染的方法为本领域公知。基于此,本发明的嵌合型别构转录因子、调控元件组以及诱导系统在构建人工组织、人工器官、人类疾病的动物模型,以及基因功能分析、生物制药、疫苗制备、基因治疗等方面也是有利的。Due to its orthogonality, modularity, adjustability and combinability, the chimeric allosteric transcription factor, regulatory element group and induction system of the present invention can be transiently or stably transfected into mammalian cells, and the host gene Expression is regulated. Methods of transient transfection and stable transfection are well known in the art. Based on this, the chimeric allosteric transcription factor, regulatory element group and induction system of the present invention are also useful in the construction of artificial tissues, artificial organs, animal models of human diseases, gene function analysis, biopharmaceuticals, vaccine preparation, gene therapy, etc. advantageous.
本领域技术人员能够理解的是,就本发明的蛋白而言,也可采用与上述催化信号分子生物合成的酶以及别构转录因子基本同源的蛋白。术语“基本同源”是指蛋白序列水平至少95%、至少98%或至少99%相同。同源序列可以是不同物种中的功能相同的序列本发明的基因或蛋白的同源物可由本领域技术人员容易地确定。此外,可基于蛋白序列对密码子进行人源化优化,提高蛋白在异源宿主中的表达强度,这是本领域公知的。Those skilled in the art can understand that, as far as the protein of the present invention is concerned, proteins that are substantially homologous to the above-mentioned enzymes that catalyze the biosynthesis of signal molecules and allosteric transcription factors can also be used. The term "substantially homologous" means that the protein sequences are at least 95%, at least 98% or at least 99% identical at the sequence level. A homologous sequence may be a functionally identical sequence in a different species. Homologues of a gene or protein of the present invention can be readily determined by those skilled in the art. In addition, codons can be humanized and optimized based on the protein sequence to improve the expression strength of the protein in a heterologous host, which is well known in the art.
本文使用的术语“表达盒(expression cassette)”是指由一个或多个基因及控制该基因表达的序列组成的DNA片段,从而该基因编码的蛋白能够在期望的宿主细胞(即,真核细胞)中表达。本领域已知的是,表达盒中包含的控制表达的序列任选包含启动子序列、3’非编码区、转录终止位点以及多聚腺苷酸等。在本发明中,包含启动子和编码序列的表达盒可被整合至基因组或存在于质粒载体。The term "expression cassette" as used herein refers to a DNA segment consisting of one or more genes and sequences that control the expression of the genes so that the protein encoded by the genes can be expressed in a desired host cell (i.e., a eukaryotic cell). ) expression. It is known in the art that the expression control sequences included in the expression cassette optionally include promoter sequences, 3' non-coding regions, transcription termination sites, polyadenylation and the like. In the present invention, an expression cassette comprising a promoter and a coding sequence can be integrated into the genome or present in a plasmid vector.
本文所述各方面的实施方式可由如下编号的段落说明:Implementations of the aspects described herein can be illustrated by the following numbered paragraphs:
1.一种用于对哺乳动物细胞中感兴趣的基因进行调控的嵌合型别构转录因子,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;其中,所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;所述嵌合型别构转录因子以天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂;所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯(pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL)。1. A chimeric allosteric transcription factor for regulating a gene of interest in a mammalian cell, said chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation domain; wherein said regulation The structural domain is selected from the group consisting of RpaR, BjaR and BraR; the transcription activation domain is selected from P65, VP16, VP64, VTR1, VTR2 and VTR3; the chimeric allosteric transcription factor is derived from natural amino acids The acyl homoserine lactone derived from a natural amino acid is selected from the group consisting of the following compounds: p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl - homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL).
2.如段落1所述的嵌合型别构转录因子,其中,所述转录激活结构域为VTR3。2. The chimeric allosteric transcription factor as described in paragraph 1, wherein the transcriptional activation domain is VTR3.
3.如段落1或2所述的嵌合型别构转录因子,其中,所述调控结构域为RpaR,所述诱导剂为pC-HSL或Cinn-HSL;优选地,所述诱导剂为pC-HSL。3. The chimeric allosteric transcription factor as described in paragraph 1 or 2, wherein the regulatory domain is RpaR, and the inducer is pC-HSL or Cinn-HSL; preferably, the inducer is pC -HSL.
4.如段落1或2所述的嵌合型别构转录因子,其中,所述调控结构域BjaR,所述诱导剂为IV-HSL。4. The chimeric allosteric transcription factor as described in paragraph 1 or 2, wherein the regulatory domain is BjaR, and the inducer is IV-HSL.
5.如段落1或2所述的嵌合型别构转录因子,其中,所述调控结构域为BraR,所述诱导剂为Cinn-HSL。5. The chimeric allosteric transcription factor according to paragraph 1 or 2, wherein the regulatory domain is BraR, and the inducer is Cinn-HSL.
6.如段落1-5中任一项所述的嵌合型别构转录因子,其中,使用连接肽将所述调控结构域与所述转录激活结构域相连;优选地,所述连接肽的长度为20-60bp;优选地,所述连接肽具有SEQ ID NO:16的序列。6. The chimeric allosteric transcription factor of any one of paragraphs 1-5, wherein a connecting peptide is used to connect the regulatory domain to the transcriptional activation domain; preferably, the connecting peptide The length is 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
7.如段落1-5中任一项所述的嵌合型别构转录因子,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端或位于所述调控结构域与所述转录激活结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,各核定位序列的5’端和3’端独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽位于所述调控结构域与所述转录激活结构域之间;优选地,所述核定位信号肽包含2个核定位序列,各核定位序列的5’端和3’端独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列。7. The chimeric allosteric transcription factor of any one of paragraphs 1-5, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at the The N-terminal or C-terminal of the chimeric allosteric transcription factor or between the regulatory domain and the transcriptional activation domain, the nuclear localization signal peptide comprises 1-4 nuclear localization sequences; preferably, each The 5' end and the 3' end of the nuclear localization sequence independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; preferably, the The nuclear localization signal peptide comprises two nuclear localization sequences, and the 5' end and the 3' end of each nuclear localization sequence independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17 .
8.如段落1-5中任一项所述的嵌合型别构转录因子,其中,所述嵌合型别构转录因子进一步包含多聚化结构域,优选地,所述多聚化结构域为CarH。8. The chimeric allosteric transcription factor according to any one of paragraphs 1-5, wherein the chimeric allosteric transcription factor further comprises a multimerization domain, preferably, the multimerization structure The domain is CarH.
9.如段落8所述的嵌合型别构转录因子,其中,使用连接肽将所述调控结构域、所述多聚化结构域与所述转录激活结构域相连;优选地,所述连接肽的长度为20-60bp;优选地,所述连接肽具有SEQ ID NO:16的序列。9. The chimeric allosteric transcription factor of paragraph 8, wherein the regulatory domain, the multimerization domain and the transcriptional activation domain are linked using a linking peptide; preferably, the linking The length of the peptide is 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
10.如段落8所述的嵌合型别构转录因子,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端,或者位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,各核定位序列的5’端和3’端独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽包含2个核定位序列,各核定位序列的5’端和3’端独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列;优选地,所述核定位信号肽位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间;优选地,所述多聚化结构域的5’端和3’端进一步包含连接肽,优选地,所述连接肽各自具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。10. The chimeric allosteric transcription factor as described in paragraph 8, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located in the chimeric allosteric The N-terminal or C-terminal of the transcription factor is either located between the regulatory domain and the multimerization domain, or between the transcription activation domain and the multimerization domain, and the nuclear localization The signal peptide contains 1-4 nuclear localization sequences; preferably, the 5' and 3' ends of each nuclear localization sequence independently have flanking sequences of 5-40bp; preferably, the nuclear localization signal peptide contains 2 nuclear localization sequences Sequences, the 5' and 3' ends of each nuclear localization sequence independently have flanking sequences of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization signal peptide Located between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain; preferably, the 5' end of the multimerization domain And 3' end further comprises connecting peptide, preferably, described connecting peptide each has the length of 20-60bp; Preferably, described connecting peptide has the sequence of SEQ ID NO:16.
11.如段落1-10中任一项所述的嵌合型别构转录因子,其中,所述哺乳动物细胞为人细胞。11. The chimeric allosteric transcription factor of any of paragraphs 1-10, wherein the mammalian cell is a human cell.
12.如段落1-10中任一项所述的嵌合型别构转录因子,其中,所述哺乳动物细胞为永生化细胞、原代细胞或干细胞。12. The chimeric allosteric transcription factor of any of paragraphs 1-10, wherein the mammalian cell is an immortalized cell, a primary cell, or a stem cell.
13.一种以天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂的调控元件组,所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯(pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL);所述调控元件组包含:13. A set of regulatory elements using natural amino acid-derived acyl homoserine lactones as inducers selected from the group consisting of p-coumaroyl-homoserine lactones Serine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL); the set of regulatory elements comprises:
嵌合型别构转录因子,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;其中,所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;以及A chimeric allosteric transcription factor, said chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation domain; wherein said regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; said The transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3; and
在哺乳动物细胞中具有活性的诱导型启动子,所述诱导型启动子包含核心序列以及与所述嵌合型别构转录因子相互作用的至少一个操纵位点对。An inducible promoter active in mammalian cells, said inducible promoter comprising a core sequence and at least one operator site pair interacting with said chimeric allosteric transcription factor.
14.如段落13所述的调控元件组,其中,所述嵌合型别构转录因子的转录激活结构域为VTR3。14. The set of regulatory elements of paragraph 13, wherein the transcriptional activation domain of the chimeric allosteric transcription factor is VTR3.
15.如段落13或14所述的调控元件组,其中,所述嵌合型别构转录因子的调控结构域为RpaR,所述诱导型启动子的操纵位点为rpaO,所述诱导剂为pC-HSL或Cinn-HSL;优选地,所述诱导剂为pC-HSL。15. The set of regulatory elements as described in paragraph 13 or 14, wherein the regulatory domain of the chimeric allosteric transcription factor is RpaR, the operating site of the inducible promoter is rpaO, and the inducer is pC-HSL or Cinn-HSL; preferably, the inducer is pC-HSL.
16.如段落13或14所述的调控元件组,其中,所述嵌合型别构转录因子的调控结构域为BjaR,所述诱导型启动子的操纵位点为bjaO,所述诱导剂为IV-HSL。16. The set of regulatory elements as described in paragraph 13 or 14, wherein the regulatory domain of the chimeric allosteric transcription factor is BjaR, the operator site of the inducible promoter is bjaO, and the inducer is IV-HSL.
17.如段落13或14所述的调控元件组,其中,所述嵌合型别构转录因子的调控结构域为BraR,所述诱导型启动子的操纵位点为braO、优选为R8;所述诱导剂为Cinn-HSL。17. The set of regulatory elements as described in paragraph 13 or 14, wherein the regulatory domain of the chimeric allosteric transcription factor is BraR, and the operating site of the inducible promoter is braO, preferably R8; The inducer is Cinn-HSL.
18.如段落13-17中任一项所述的调控元件组,其中,使用连接肽将所述嵌合型别构转录因子中的所述调控结构域与所述转录激活结构域相连;优选地,所述连接肽的长度为20-60bp;优选地,所述连接肽具有SEQ ID NO:16的序列。18. The set of regulatory elements of any one of paragraphs 13-17, wherein the regulatory domain in the chimeric allosteric transcription factor is linked to the transcriptional activation domain using a linker peptide; preferably Preferably, the length of the connecting peptide is 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
19.如段落13-17中任一项所述的调控元件组,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端或位于所述调控结构域与所述转录激活结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,各核定位序列的5’端和3’端独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽位于所述调控结构域与所述转录激活结构域之间;优选地,所述核定位信号肽包含2个核定位序列,各核定位序列的5’端和3’端独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列。19. The set of regulatory elements of any one of paragraphs 13-17, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located in the chimeric allosteric transcription factor The N-terminal or C-terminal of the allosteric transcription factor or located between the regulatory domain and the transcriptional activation domain, the nuclear localization signal peptide comprises 1-4 nuclear localization sequences; preferably, each nuclear localization sequence The 5' end and the 3' end independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; preferably, the nuclear localization signal The peptide comprises two nuclear localization sequences, and the 5' and 3' ends of each nuclear localization sequence independently have flanking sequences of 5-20 bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17.
20.如段落13-19中任一项所述的调控元件组,其中,所述嵌合型别构转录因子进一步包含多聚化结构域,优选地,所述多聚化结构域为CarH。20. The set of regulatory elements of any of paragraphs 13-19, wherein the chimeric allosteric transcription factor further comprises a multimerization domain, preferably, the multimerization domain is CarH.
21.如段落20所述的调控元件组,其中,使用连接肽将所述嵌合型别构转录因子中的所述调控结构域、 所述多聚化结构域与所述转录激活结构域相连;优选地,所述连接肽各自具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。21. The set of regulatory elements of paragraph 20, wherein the regulatory domain, the multimerization domain, and the transcriptional activation domain in the chimeric allosteric transcription factor are linked using a linker peptide ; Preferably, each of the connecting peptides has a length of 20-60bp; Preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
22.如段落20所述的调控元件组,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端,或者位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,各核定位序列的5’端和3’端独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽包含2个核定位序列,各核定位序列的5’端和3’端独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列;优选地,所述核定位信号肽位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间;优选地,所述多聚化结构域的5’端和3’端进一步包含连接肽,优选地,所述连接肽的各自具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。22. The set of regulatory elements as described in paragraph 20, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at the N of the chimeric allosteric transcription factor terminal or C-terminal, or between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain, and the nuclear localization signal peptide contains 1 - 4 nuclear localization sequences; preferably, the 5' end and 3' end of each nuclear localization sequence independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide comprises 2 nuclear localization sequences, each nuclear localization sequence The 5' end and the 3' end of the localization sequence independently have flanking sequences of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization signal peptide is located in the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain; preferably, the 5' end and the 3' end of the multimerization domain Further comprising a connecting peptide, preferably, each of the connecting peptides has a length of 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
23.如段落13-22中任一项所述的调控元件组,其中,所述诱导型启动子的核心序列选自于CMV1启动子、CMVmini启动子、TRE3G启动子、EF1a核心启动子或hEF1a启动子;优选地,所述核心序列为CMV1启动子或TRE3G启动子。23. The set of regulatory elements according to any one of paragraphs 13-22, wherein the core sequence of the inducible promoter is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a Promoter; preferably, the core sequence is a CMV1 promoter or a TRE3G promoter.
24.如段落13-23中任一项所述的调控元件组,其中,所述操纵位点对为1对或多对;优选地,所述操纵位点对为1对;优选地,各操纵位点分别位于转录起始位点的上游和下游。24. The set of regulatory elements of any one of paragraphs 13-23, wherein the pair of manipulation sites is one or more pairs; preferably, the pair of manipulation sites is one pair; preferably, each The operator sites are located upstream and downstream of the transcription start site, respectively.
25.如段落13-24中任一项所述的调控元件组,所述调控元件组进一步包含阻遏蛋白以及第三启动子,其中,所述第三启动子驱动所述阻遏蛋白的表达,所述第三启动子包含与所述阻遏蛋白相互作用的第二操纵位点;其中,所述诱导型启动子进一步包含所述第二操纵位点。25. The set of regulatory elements of any one of paragraphs 13-24, further comprising a repressor protein and a third promoter, wherein the third promoter drives expression of the repressor protein, wherein The third promoter comprises a second operator site interacting with the repressor protein; wherein the inducible promoter further comprises the second operator site.
26.如段落25所述的调控元件组,其中,所述阻遏蛋白为TetR,所述第二操纵位点为tetO;或者,所述阻遏蛋白为CymR,所述第二操纵位点为cymO。26. The set of regulatory elements of paragraph 25, wherein the repressor protein is TetR and the second operator site is tetO; or, the repressor protein is CymR and the second operator site is cymO.
27.如段落13-26中任一项所述的调控元件组,其中,所述哺乳动物细胞为人细胞。27. The set of regulatory elements of any of paragraphs 13-26, wherein the mammalian cells are human cells.
28.如段落13-26中任一项所述的调控元件组,其中,所述哺乳动物细胞为永生化细胞、原代细胞或干细胞。28. The set of regulatory elements of any of paragraphs 13-26, wherein the mammalian cell is an immortalized cell, a primary cell, or a stem cell.
29.一种对哺乳动物细胞中感兴趣的基因进行诱导表达的系统,所述诱导表达系统利用天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂,所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯(pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL);所述诱导表达系统包含:29. A system for inducing expression of genes of interest in mammalian cells, the inducible expression system utilizing natural amino acid-derived acyl homoserine lactone as an inducer, the natural amino acid-derived acyl homoserine lactone selected from From the group consisting of the following compounds: p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL ); The inducible expression system comprises:
第一基因表达盒,所述第一基因表达盒包含第一启动子序列以及嵌合型别构转录因子的编码序列,其中,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;以及A first gene expression cassette, the first gene expression cassette comprising a first promoter sequence and a coding sequence of a chimeric allosteric transcription factor, wherein the chimeric allosteric transcription factor comprises a regulatory domain and a transcriptional activation structure domain; the regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; the transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3; and
第二基因表达盒,所述第二基因表达盒包含诱导型启动子序列以及所述感兴趣的基因的编码序列,所述诱导型启动子序列包含与所述嵌合型别构转录因子相互作用的至少一个操纵位点对。A second gene expression cassette, the second gene expression cassette comprising an inducible promoter sequence and the coding sequence of the gene of interest, the inducible promoter sequence comprising an allosteric transcription factor interacting with the chimeric type At least one operator site pair for .
30.如段落29所述的诱导表达系统,其中,所述第一基因中所述嵌合型别构转录因子的转录激活结构域为VTR3。30. The inducible expression system according to paragraph 29, wherein the transcriptional activation domain of the chimeric allosteric transcription factor in the first gene is VTR3.
31.如段落29或30所述的诱导表达系统,其中,所述第一基因中所述嵌合型别构转录因子的调控结构域为RpaR,所述第二基因中所述诱导型启动子的操纵位点为rpaO,所述诱导剂为pC-HSL或Cinn-HSL;优选地,所述诱导剂为pC-HSL。31. The inducible expression system as described in paragraph 29 or 30, wherein the regulatory domain of the chimeric allosteric transcription factor in the first gene is RpaR, and the inducible promoter in the second gene is The operator site is rpaO, and the inducer is pC-HSL or Cinn-HSL; preferably, the inducer is pC-HSL.
32.如段落29或30所述的诱导表达系统,其中,所述第一基因中所述嵌合型别构转录因子的调控结构域为BjaR,所述第二基因中所述诱导型启动子的操纵位点为bjaO,所述诱导剂为IV-HSL。32. The inducible expression system as described in paragraph 29 or 30, wherein the regulatory domain of the chimeric allosteric transcription factor in the first gene is BjaR, and the inducible promoter in the second gene The operator site is bjaO, and the inducer is IV-HSL.
33.如段落29或30所述的诱导表达系统,其中,所述第一基因中所述嵌合型别构转录因子的调控结构域为BraR,所述第二基因中所述诱导型启动子的操纵位点为braO、优选R8,所述诱导剂为Cinn-HSL。33. The inducible expression system as described in paragraph 29 or 30, wherein the regulatory domain of the chimeric allosteric transcription factor in the first gene is BraR, and the inducible promoter in the second gene The operator site is braO, preferably R8, and the inducer is Cinn-HSL.
34.如段落29-33中任一项所述的诱导表达系统,其中,在所述第一基因中,使用连接肽将所述调控结构域与所述转录激活结构域相连;优选地,所述连接肽的长度为20-60bp;优选地,所述连接肽具有SEQ ID NO:16的序列。34. The inducible expression system of any one of paragraphs 29-33, wherein, in the first gene, a linker peptide is used to link the regulatory domain to the transcriptional activation domain; preferably, the The length of the connecting peptide is 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
35.如段落29-33中任一项所述的诱导表达系统,其中,所述第一基因的所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端或位于所述调控结构域与所述转录激活结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,各核定位序列的5’端和3’端独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽位于所述调控结构域与所述转录激活结构域之间;优选地,所述核定位信号肽包含2个核定位序列,各核定位序列的5’端和3’端独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列。35. The inducible expression system according to any one of paragraphs 29-33, wherein the chimeric allosteric transcription factor of the first gene further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide Located at the N-terminal or C-terminal of the chimeric allosteric transcription factor or between the regulatory domain and the transcriptional activation domain, the nuclear localization signal peptide comprises 1-4 nuclear localization sequences; preferably , the 5' and 3' ends of each nuclear localization sequence independently have flanking sequences of 5-40 bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; preferably , the nuclear localization signal peptide comprises two nuclear localization sequences, the 5' end and the 3' end of each nuclear localization sequence independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has SEQ ID NO: 17 the sequence of.
36.如段落29-35中任一项所述的诱导表达系统,其中,所述第一基因的所述嵌合型别构转录因子进 一步包含多聚化结构域,优选地,所述多聚化结构域为CarH。36. The inducible expression system according to any one of paragraphs 29-35, wherein said chimeric allosteric transcription factor of said first gene further comprises a multimerization domain, preferably said multimerization The K domain is CarH.
37.如段落36所述的诱导表达系统,其中,使用连接肽将所述嵌合型别构转录因子中的所述调控结构域、所述多聚化结构域与所述转录激活结构域相连;优选地,所述连接肽各自具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。37. The inducible expression system of paragraph 36, wherein the regulatory domain, the multimerization domain and the transcriptional activation domain in the chimeric allosteric transcription factor are linked using a linking peptide ; Preferably, each of the connecting peptides has a length of 20-60bp; Preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
38.如段落36所述的诱导表达系统,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端,或者位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,各核定位序列的5’端和3’端独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽包含2个核定位序列,各核定位序列的5’端和3’端独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列;优选地,所述核定位信号肽位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间;优选地,所述多聚化结构域的5’端和3’端进一步包含连接肽,优选地,所述连接肽各自具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。38. The inducible expression system as described in paragraph 36, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at the N of the chimeric allosteric transcription factor terminal or C-terminal, or between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain, and the nuclear localization signal peptide contains 1 - 4 nuclear localization sequences; preferably, the 5' end and 3' end of each nuclear localization sequence independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide comprises 2 nuclear localization sequences, each nuclear localization sequence The 5' end and the 3' end of the localization sequence independently have flanking sequences of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization signal peptide is located in the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain; preferably, the 5' end and the 3' end of the multimerization domain Further comprising a connecting peptide, preferably, each of the connecting peptides has a length of 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
39.如段落29-38中任一项所述的诱导表达系统,其中,所述第一基因中所述第一启动子为组成型启动子或诱导型启动子。39. The inducible expression system of any of paragraphs 29-38, wherein the first promoter in the first gene is a constitutive promoter or an inducible promoter.
40.如段落29-39中任一项所述的诱导表达系统,其中,所述第二基因中所述诱导型启动子的核心序列选自于CMV1启动子、CMVmini启动子、TRE3G启动子、EF1a核心启动子或hEF1a启动子;优选地,所述核心序列为CMV1启动子或TRE3G启动子。40. The inducible expression system according to any one of paragraphs 29-39, wherein the core sequence of the inducible promoter in the second gene is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a promoter; preferably, the core sequence is CMV1 promoter or TRE3G promoter.
41.如段落29-40中任一项所述的诱导表达系统,其中,所述第二基因中所述操纵位点对为1对或多对;优选地,所述操纵位点对为1对;优选地,各操纵位点分别位于转录起始位点的上游和下游。41. The inducible expression system according to any one of paragraphs 29-40, wherein the pair of operating sites in the second gene is 1 or more pairs; preferably, the pair of operating sites is 1 Right; preferably, each operator site is located upstream and downstream of the transcription initiation site, respectively.
42.如段落29-41中任一项所述的诱导表达系统,所述诱导表达系统进一步包含第三基因表达盒,其中,所述第三基因表达盒包含第三启动子序列以及阻遏蛋白的编码序列,用于在宿主细胞中表达所述阻遏蛋白,所述第三启动子序列包含第二操纵位点,所述第二操纵位点与所述阻遏蛋白相互作用;所述第二基因的诱导型启动子序列进一步包含第二操纵位点。42. The inducible expression system of any one of paragraphs 29-41, further comprising a third gene expression cassette, wherein the third gene expression cassette comprises a third promoter sequence and a repressor protein A coding sequence for expressing the repressor protein in a host cell, the third promoter sequence comprising a second operator site interacting with the repressor protein; the second gene's The inducible promoter sequence further comprises a second operator site.
43.如段落42所述的诱导表达系统,其中,所述阻遏蛋白为TetR,所述第二操纵位点为tetO;或者,所述阻遏蛋白为CymR,所述第二操纵位点为cymO。43. The inducible expression system of paragraph 42, wherein the repressor protein is TetR, and the second operator site is tetO; or, the repressor protein is CymR, and the second operator site is cymO.
44.如段落42或43所述的诱导表达系统,其中,所述第三基因中所述第三启动子的核心序列选自于CMV1启动子、CMVmini启动子、TRE3G启动子、EF1a核心启动子或hEF1a启动子;优选地,所述核心序列为CMV1启动子或TRE3G启动子。44. The inducible expression system as described in paragraph 42 or 43, wherein the core sequence of the third promoter in the third gene is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a promoter; preferably, the core sequence is CMV1 promoter or TRE3G promoter.
45.如段落29-44中任一项所述的诱导表达系统,其中,所述哺乳动物细胞为人细胞。45. The inducible expression system of any of paragraphs 29-44, wherein the mammalian cells are human cells.
46.如段落29-44中任一项所述的诱导表达系统,其中,所述哺乳动物细胞为永生化细胞、原代细胞或干细胞。46. The inducible expression system of any of paragraphs 29-44, wherein the mammalian cells are immortalized cells, primary cells or stem cells.
47.段落1-12中任一项所述的嵌合型别构转录因子、段落13-28中任一项所述的调控元件组、段落29-46中任一项所述的诱导表达系统在构建单细胞或多细胞信号通讯系统中的用途,其中,所述信号通讯系统包含信号发送元件和信号接收元件,所述信号接收元件包含嵌合型别构转录因子以及在哺乳动物细胞中具有活性的诱导型启动子。47. The chimeric allosteric transcription factor of any one of paragraphs 1-12, the set of regulatory elements of any one of paragraphs 13-28, the inducible expression system of any one of paragraphs 29-46 Use in the construction of a single-cell or multi-cell signal communication system, wherein the signal communication system includes a signal sending element and a signal receiving element, and the signal receiving element includes a chimeric allosteric transcription factor and a mammalian cell having active inducible promoter.
48.如段落47所述的用途,其中,所述信号通讯系统的信号分子为pC-HSL;所述嵌合型别构转录因子中所述调控结构域为RpaR,所述诱导型启动子包含核心序列以及至少一对rpaO;所述信号发送元件包含TAL、4CL以及RpaI。48. The use as described in paragraph 47, wherein the signal molecule of the signal communication system is pC-HSL; the regulatory domain in the chimeric allosteric transcription factor is RpaR, and the inducible promoter comprises A core sequence and at least one pair of rpaO; the signaling elements include TAL, 4CL and RpaI.
49.如段落47所述的用途,其中,所述信号通讯系统的信号分子为Cinn-HSL;所述嵌合型别构转录因子中所述调控结构域为BraR,所述诱导型启动子包含核心序列以及至少一对braO;所述信号发送元件包含PAL、4CL以及BraI。49. The use as described in paragraph 47, wherein the signal molecule of the signal communication system is Cinn-HSL; the regulatory domain in the chimeric allosteric transcription factor is BraR, and the inducible promoter comprises A core sequence and at least one pair of braO; the signaling elements include PAL, 4CL and BraI.
50.如段落47所述的用途,其中,所述信号通讯系统的信号分子为IV-HSL;所述嵌合型别构转录因子中所述调控结构域为BjaR,所述诱导型启动子包含核心序列以及至少一对bjaO;所述信号发送元件包含支链α-酮酸脱氢酶复合物BCDH以及BjaI。50. purposes as described in paragraph 47, wherein, the signal molecule of described signal communication system is IV-HSL; Said regulatory domain in said chimeric allosteric transcription factor is BjaR, and said inducible promoter comprises A core sequence and at least one pair of bjaO; the signaling element includes a branched-chain α-ketoacid dehydrogenase complex BCDH and BjaI.
51.段落1-12中任一项所述的嵌合型别构转录因子、段落13-28中任一项所述的调控元件组、段落29-46中任一项所述的诱导表达系统在基因功能分析、构建人类疾病模型、人造组织或人造器官方面的用途。51. The chimeric allosteric transcription factor of any one of paragraphs 1-12, the set of regulatory elements of any one of paragraphs 13-28, the inducible expression system of any one of paragraphs 29-46 Uses in gene function analysis, construction of human disease models, artificial tissues or organs.
52如段落51所述的用途,其中,将所述嵌合型别构转录因子、所述调控元件组或所述诱导表达系统稳定转染至哺乳动物细胞。52. The use according to paragraph 51, wherein the chimeric allosteric transcription factor, the set of regulatory elements or the inducible expression system are stably transfected into mammalian cells.
53如段落51所述的用途,其中,将所述嵌合型别构转录因子、所述调控元件组或所述诱导表达系统瞬时转染至哺乳动物细胞。53. The use according to paragraph 51, wherein the chimeric allosteric transcription factor, the set of regulatory elements or the inducible expression system is transiently transfected into mammalian cells.
实施例Example
如图1A所示,本发明的嵌合型别构转录因子具有原核来源的调控结构域和真核来源的转录激活结构 域,由调控结构域实现对诱导剂的应答,由转录激活结构域触发哺乳动物细胞中基因的转录。As shown in Figure 1A, the chimeric allosteric transcription factor of the present invention has a regulatory domain of prokaryotic origin and a transcriptional activation domain of eukaryotic origin, the response to the inducer is realized by the regulatory domain, and the transcriptional activation domain is triggered. Transcription of genes in mammalian cells.
Cinn诱导系统操纵序列的识别和优化Identification and optimization of the operator sequence for the Cinn inducible system
在慢生根瘤菌(Bradyrhizobium)中,由LuxI的同源蛋白BraI催化Cinn-HSL的生物合成,并由LuxR同源蛋白BraR检测该信号。在群体感应系统中,为实现对群体密度的应答,诱导物小分子通常能够对自诱导物合成酶LuxI的转录进行正调控,换言之,LuxI的启动子区应包含LuxR的操纵位点,LuxI的转录由AHL和LuxR共同激活。酰基高丝氨酸内酯依赖型基因的启动子区通常具有18-20bp的倒置重复序列,然而,未能根据上述特征在BraI启动子区识别到与BraR相互作用的操纵位点 [5]In Bradyrhizobium, the biosynthesis of Cinn-HSL is catalyzed by the LuxI homolog protein BraI, and the signal is detected by the LuxR homolog protein BraR. In the quorum sensing system, in order to respond to the population density, the inducer small molecule can usually positively regulate the transcription of the autoinducer synthase LuxI, in other words, the promoter region of LuxI should contain the LuxR operating site, and the LuxI Transcription is co-activated by AHL and LuxR. The promoter region of acyl homoserine lactone-dependent genes usually has an inverted repeat sequence of 18-20 bp, however, the operator site interacting with BraR could not be identified in the BraI promoter region according to the above characteristics [5] .
针对这一问题,我们利用BLAST方法比较了四种慢生根瘤菌BTAi1、ORS285、Bradyrhizobium oligotrophicum S58以及ORS278的BraR-BraI操纵子区域(NCBI序列号分别为CP000494.1、LT859959.1、AP012603.1以及CU234118)的序列相似度。图2A显示了BTAi1、ORS285、Bradyrhizobium oligotrophicum S58以及ORS278四种慢生根瘤菌中BraR终止子至BraI启动子-10区之间的区域(分别以NC、ORS、APO以及CUS表示),我们推测该序列包含与BraR相互作用的操纵位点braO。此外,为确保操纵位点完全包括在推定区域内,进一步测试了CUL序列,其为包含上游45bp的CUS。将NC、ORS、APO、CUS以及CUL分别成对插入至TRE3G核心启动子的上游,并在下游连接Citrine黄色荧光蛋白编码序列作为报告基因,利用Cinn-HSL对上述各诱导系统进行诱导发现,其灵敏度和动态范围差别不大(图2B,EC50为1.1-1.4nM,动态范围为18.5-37.7),表明不同慢生根瘤菌的Bra操纵序列性能相近。在进一步的实施例中,基于CUS序列进行进一步优化。In response to this problem, we used the BLAST method to compare the BraR-BraI operon regions of four Bradyrhizobium bacterium BTAi1, ORS285, Bradyrhizobium oligotrophicum S58 and ORS278 (NCBI sequence numbers are CP000494.1, LT859959.1, AP012603.1 and sequence similarity of CU234118). Figure 2A shows the regions between the BraR terminator and the BraI promoter-10 region in BTAi1, ORS285, Bradyrhizobium oligotrophicum S58 and ORS278 (indicated by NC, ORS, APO and CUS, respectively), we speculate that the The sequence contains the operator site braO that interacts with BraR. In addition, to ensure that the operator site was fully included within the putative region, the CUL sequence was further tested, which is the CUS comprising the upstream 45 bp. NC, ORS, APO, CUS and CUL were respectively inserted into the upstream of the TRE3G core promoter in pairs, and the Citrine yellow fluorescent protein coding sequence was connected downstream as a reporter gene, and Cinn-HSL was used to induce and discover the above-mentioned induction systems. There was little difference in sensitivity and dynamic range (Fig. 2B, EC50 was 1.1-1.4nM, dynamic range was 18.5-37.7), indicating that the performance of the Bra operator sequence of different Bradyrhizobium bacteria was similar. In a further embodiment, further optimization is performed based on the CUS sequence.
如图2C所示,从不同位置将CUS截短。其中,F1、F2、F4、F7从5’端截短;R1、R4、R6、R7以及R8从3’端截短。将上述操纵序列插入TRE3G核心启动子的上游构建诱导型启动子,并进行诱导表达。在0和10 -6mol/L的Cinn-HSL诱导条件下,上述9种启动子下游报告基因的荧光强度改变如图2D所示。包含F2、F4和F7启动子的诱导系统不响应Cinn-HSL的诱导,表明其Bra操纵位点被破坏,而由R1、R4、R7以及R8诱导的荧光强度改变不大,表明它们几乎完全保留了操纵序列。如图2E所示,包含一对R8操纵序列的启动子P 2XR8-TRE3G与包含一对全长CUS操纵序列的启动子P CUS-TRE3G诱导性能相近(EC50分别为4.9nM和9.3nM,动态范围分别为35.5和74.1)。综合考虑操纵序列长度和诱导强度,将R8作为最小操纵序列(18bp)。在进一步的实施方式中,对于Cinn诱导系统,将包含两个R8操纵序列的P 2XR8-TRE3G启动子用作诱导型启动子;将真核转录激活结构域VTR3与调控结构域BraR融合构建VTR3-BraR嵌合型别构转录因子,借助流式细胞术检测各Cinn-HSL浓度下的剂量响应曲线。 As shown in Figure 2C, the CUS was truncated from different positions. Among them, F1, F2, F4, F7 are truncated from the 5'end; R1, R4, R6, R7 and R8 are truncated from the 3' end. Insert the above operating sequence into the upstream of the TRE3G core promoter to construct an inducible promoter, and perform inducible expression. Under the induction conditions of 0 and 10 -6 mol/L Cinn-HSL, the fluorescence intensity changes of the reporter genes downstream of the above nine promoters are shown in Figure 2D. Inducible systems containing F2, F4, and F7 promoters did not respond to induction by Cinn-HSL, indicating that its Bra operator site was disrupted, while fluorescence intensities induced by R1, R4, R7, and R8 did not change much, indicating that they were almost completely retained manipulated sequence. As shown in Figure 2E, the promoter P 2XR8-TRE3G containing a pair of R8 operating sequences was similar to the promoter P CUS-TRE3G containing a pair of full-length CUS operating sequences (EC50 were 4.9nM and 9.3nM, dynamic range 35.5 and 74.1, respectively). Considering the length of the operator sequence and the induction strength, R8 was used as the minimum operator sequence (18bp). In a further embodiment, for the Cinn induction system, the P 2XR8-TRE3G promoter containing two R8 operator sequences is used as an inducible promoter; the eukaryotic transcription activation domain VTR3 is fused with the regulatory domain BraR to construct VTR3- BraR chimeric allosteric transcription factor, the dose-response curve of each Cinn-HSL concentration was detected by flow cytometry.
pC与IV诱导系统的设计Design of pC and IV induction system
对于pC诱导系统,将真核转录激活结构域VTR3与调控结构域RpaR融合构建VTR3-RpaR嵌合型别构转录因子;将两个rpaO操纵序列插入至TRE3G核心启动子的上游,构建获得P rpaO-TRE3G启动子;并在下游连接Citrine黄色荧光蛋白编码序列作为报告基因。诱导剂pC-HSL不存在的情况下,VTR3-RpaR不能与rpaO结合,报告基因的表达处于关闭状态。pC-HSL与VTR3-RpaR的结合将该嵌合型别构转录因子引导至P rpaO-TRE3G启动子的rpaO位点处,转录激活结构域VTR3触发报告基因Citrine的转录,借助流式细胞术检测各诱导物浓度下该系统的剂量响应曲线。对于IV诱导系统,将真核转录激活结构域VTR3与调控结构域BjaR融合构建VTR3-BjaR嵌合型别构转录因子;将两个bjaO操纵序列插入至TRE3G核心启动子的上游,构建获得P bjaO-TRE3G启动子;并在下游连接Citrine黄色荧光蛋白编码序列作为报告基因。诱导剂IV-HSL不存在的情况下,VTR3-BjaR不能与bjaO结合,报告基因的表达处于关闭状态。IV-HSL与VTR3-BjaR的结合将该嵌合型别构转录因子引导至P bjaO-TRE3G启动子的bjaO位点处,转录激活结构域VTR3触发报告基因Citrine的转录,借助流式细胞术检测各诱导物浓度下该系统的剂量响应曲线。 For the pC induction system, the eukaryotic transcription activation domain VTR3 was fused with the regulatory domain RpaR to construct a VTR3-RpaR chimeric allosteric transcription factor; two rpaO operating sequences were inserted into the upstream of the TRE3G core promoter to construct P rpaO -TRE3G promoter; and the Citrine yellow fluorescent protein coding sequence is connected downstream as a reporter gene. In the absence of the inducer pC-HSL, VTR3-RpaR could not bind to rpaO, and the expression of the reporter gene was turned off. The combination of pC-HSL and VTR3-RpaR guides the chimeric allosteric transcription factor to the rpaO site of the P rpaO-TRE3G promoter, and the transcription activation domain VTR3 triggers the transcription of the reporter gene Citrine, which is detected by flow cytometry Dose-response curves of the system at various inducer concentrations. For the IV induction system, the eukaryotic transcription activation domain VTR3 was fused with the regulatory domain BjaR to construct a VTR3-BjaR chimeric allosteric transcription factor; two bjaO operator sequences were inserted into the upstream of the TRE3G core promoter to construct P bjaO -TRE3G promoter; and the Citrine yellow fluorescent protein coding sequence is connected downstream as a reporter gene. In the absence of inducer IV-HSL, VTR3-BjaR could not bind to bjaO, and the expression of the reporter gene was turned off. The combination of IV-HSL and VTR3-BjaR guides the chimeric allosteric transcription factor to the bjaO site of the P bjaO-TRE3G promoter, and the transcription activation domain VTR3 triggers the transcription of the reporter gene Citrine, which is detected by flow cytometry Dose-response curves of the system at various inducer concentrations.
各诱导系统的表征Characterization of each induction system
对于各诱导系统,测定其在一系列诱导浓度下报告子的荧光强度(图1C)。利用希尔方程对诱导曲线进行拟合来计算EC50值,即半最大有效浓度,从而对诱导系统的灵敏度进行定量。如图1B和1C所示,pC、Cinn和IV诱导系统的EC50分别为8×10 -7mol/L、1×10 -9mol/L、2.5×10 -5mol/L。不希望被理论所限地,较低的EC50值对应于较高的灵敏度。这种信号超敏和极低的诱导物工作浓度并未在已知的其它基于AHL的哺乳动物诱导系统中发现。目前已报道的直链脂肪酸型AHL诱导系统利用VP16-TraR融合蛋白对3OC8信号进行应答,该系统的EC50为7-14μM [3],而本发明的Cinn-HSL诱导系统的EC50为1.25nM,比现有技术降低3个量级。由于可在较低的工作浓度下诱导,有效降低了诱导剂对细胞的细胞毒性。此外,3OC8诱导系统的最大诱导倍数约为20倍,而本发明的pC-HSL诱导系统的最大诱导倍数可达100倍,信噪比更高。虽然本发明的IV诱导系统本底表达较高,因此灵敏度方面不如pC诱导系统和Cinn诱导系统,然而IV诱导系统与pC诱导系统以及Cinn诱导系统的串扰均较小,可将IV诱导系统与pC或Cinn诱导系统组合使用。此外,由于本发明的诱导系统与直链脂肪酸型AHL诱导系统不存在信号串扰 [5],可将本发明的诱导系统与直链脂肪酸型AHL诱导系统组合使用,在哺乳动物宿主中构建多重信号复杂调控网络或基因线路。 For each inducible system, the fluorescence intensity of the reporter was measured at a range of inducing concentrations (Fig. 1C). The sensitivity of the induction system was quantified by fitting the induction curve with the Hill equation to calculate the EC50 value, that is, the half-maximal effective concentration. As shown in Figure 1B and 1C, the EC50 of pC, Cinn and IV induction systems were 8×10 -7 mol/L, 1×10 -9 mol/L, 2.5×10 -5 mol/L, respectively. Without wishing to be bound by theory, lower EC50 values correspond to higher sensitivity. Such signal hypersensitivity and extremely low working concentrations of inducer are not found in other known AHL-based mammalian induction systems. The reported straight-chain fatty acid-type AHL induction system uses VP16-TraR fusion protein to respond to 3OC8 signal, and the EC50 of this system is 7-14μM [3] , while the EC50 of the Cinn-HSL induction system of the present invention is 1.25nM, 3 orders of magnitude lower than the prior art. Since it can be induced at a lower working concentration, the cytotoxicity of the inducer to cells is effectively reduced. In addition, the maximum induction multiple of the 3OC8 induction system is about 20 times, while the maximum induction multiple of the pC-HSL induction system of the present invention can reach 100 times, and the signal-to-noise ratio is higher. Although the background expression of the IV induction system of the present invention is higher, so the sensitivity is not as good as that of the pC induction system and the Cinn induction system, but the crosstalk between the IV induction system and the pC induction system and the Cinn induction system is all small, and the IV induction system can be combined with the pC induction system. Or use in combination with the Cinn induction system. In addition, since there is no signal crosstalk between the induction system of the present invention and the straight-chain fatty acid-type AHL induction system [5] , the induction system of the present invention can be used in combination with the straight-chain fatty acid-type AHL induction system to construct multiple signals in mammalian hosts. Complex regulatory networks or gene circuits.
不同的转录激活结构域对调控性能的影响Effect of different transcriptional activation domains on regulatory properties
本领域已报道能够将转录调控因子VP16、VP64、p65、SAM、VTR等作为转录激活结构域与调控结构域融合,对真核细胞中基因表达进行调控。为了比较不同转录激活结构域的效率,我们构建了VP64-BraR和VTR3-BraR两种Cinn诱导系统的别构转录因子。其中,VP64为四个重复串联的VP16,所述VP16为单纯疱疹病毒的真核转录激活结构域(WO2007058527A2)。VTR3是一种优化的VTR系列结构域。具体而言,首先将VP64、p65和RTA融合为真核转录激活结构域VTR;作为改进,第一步删除p65N端的DNA结合域(VTR1),降低与基因组的非靶向作用;进而对于p65C端的两个反式激活域TA1和TA2,仅保留完整TA2和部分TA1(VTR2),以便减小该结构域的尺寸;在VTR2的基础上,进一步删除部分RTA结构域,获得仅0.8kb的VTR3激活域。在哺乳动物细胞中,dCas9-VTR3的激活效率为dCas9-VP64的10倍 [12]。就本发明的Cinn诱导系统而言,如图3所示,在转录因子分别为VTR3-BraR以及VP64-BraR的情况下,Cinn-HSL信号对P 2XR8-TRE3G启动子激活的EC50分别为3.8×10 -9mol/L和1×10 -8mol/L,动态范围分别为32.2和9.5。在Cinn诱导系统中,VTR3作为转录激活结构域能够实现更好的转录开关效果。 It has been reported in the art that transcriptional regulatory factors VP16, VP64, p65, SAM, VTR, etc. can be fused as transcriptional activation domains and regulatory domains to regulate gene expression in eukaryotic cells. To compare the efficiency of different transcriptional activation domains, we constructed allosteric transcription factors of two Cinn-inducing systems, VP64-BraR and VTR3-BraR. Wherein, VP64 is four repeated tandem VP16, and said VP16 is the eukaryotic transcriptional activation domain of herpes simplex virus (WO2007058527A2). VTR3 is an optimized VTR series domain. Specifically, VP64, p65 and RTA were first fused into the eukaryotic transcriptional activation domain VTR; as an improvement, the first step was to delete the DNA binding domain (VTR1) at the N-terminal of p65 to reduce the non-targeting effect on the genome; and then for the C-terminal of p65 Two transactivation domains, TA1 and TA2, retain only the complete TA2 and part of TA1 (VTR2) in order to reduce the size of the domain; on the basis of VTR2, part of the RTA domain is further deleted to obtain only 0.8kb of VTR3 activation area. In mammalian cells, the activation efficiency of dCas9-VTR3 is 10 times that of dCas9-VP64 [12] . As far as the Cinn induction system of the present invention is concerned, as shown in Figure 3, when the transcription factors are VTR3-BraR and VP64-BraR respectively, the EC50 of the Cinn-HSL signal on the activation of the P 2XR8-TRE3G promoter is 3.8× 10 -9 mol/L and 1×10 -8 mol/L, the dynamic ranges are 32.2 and 9.5 respectively. In the Cinn induction system, VTR3 as a transcriptional activation domain can achieve a better transcriptional switching effect.
多聚化对调控性能的影响Effect of multimerization on regulatory properties
在原核宿主中,将多聚化结构域与阻遏蛋白融合并在启动子处设置一对或多对操纵位点,可增强调控元件的阻遏性能(CN107344962A)。为了研究在本发明的正控诱导系统中是否可采取类似的策略,我们利用连接肽将多聚化结构域CarH连接至转录因子VTR3-RpaR的转录激活结构域和调控结构域间之间,构建VTR3-CarH-RpaR嵌合转录因子。如图4B所示,在不包含多聚化结构域的情况下,pC诱导系统的EC50为8.7×10 -7mol/L、动态范围为52.6;而转录因子包含CarH结构域时,pC诱导系统的EC50为1×10 -7mol/L、动态范围为30.8,表明多聚化带来的协同性提高了诱导系统的灵敏度,降低诱导物的工作浓度。本领域已知的是,在负控阻遏系统中,由于阻遏蛋白多聚化后,与该阻遏蛋白相互作用的启动子序列附近的空间位阻增大,对RNA聚合酶构成的竞争性抑制更大,RNA聚合酶结合的概率减小,因而通过增加操纵位点对的个数,能够有效地降低漏表达(CN107344962A)。我们发现在TRE3G启动子上游和/或下游分别插入1对至7对rpaO操纵序列其诱导性能差别不大,表明在正控诱导系统中增加操纵位点的数量未明显提高调控元件组的诱导性能。此外,发明人还尝试利用多聚化结构域CI434构造可多聚化的转录因子,然而该设计不能提高pC诱导系统的灵敏度。 In prokaryotic hosts, the repression performance of regulatory elements can be enhanced by fusing the multimerization domain with a repressor protein and setting one or more pairs of operating sites at the promoter (CN107344962A). In order to investigate whether a similar strategy can be adopted in the positive control induction system of the present invention, we used a connecting peptide to connect the multimerization domain CarH to between the transcriptional activation domain and the regulatory domain of the transcription factor VTR3-RpaR, constructing VTR3-CarH-RpaR chimeric transcription factor. As shown in Figure 4B, without the multimerization domain, the EC50 of the pC-inducible system was 8.7×10 -7 mol/L, and the dynamic range was 52.6; while the transcription factor contained the CarH domain, the pC-inducible system The EC50 is 1×10 -7 mol/L, and the dynamic range is 30.8, indicating that the synergy brought about by multimerization improves the sensitivity of the induction system and reduces the working concentration of the inducer. It is known in the art that in a negative control repression system, due to the increased steric hindrance near the promoter sequence interacting with the repressor protein after multimerization of the repressor protein, the competitive inhibition of RNA polymerase is more severe. Larger, the probability of RNA polymerase binding is reduced, so by increasing the number of manipulation site pairs, the expression leakage can be effectively reduced (CN107344962A). We found that inserting 1 to 7 pairs of rpaO operator sequences upstream and/or downstream of the TRE3G promoter had little difference in the induction performance, indicating that increasing the number of operator sites in the positive control induction system did not significantly improve the induction performance of the regulatory element set . In addition, the inventors also tried to use the multimerization domain CI434 to construct a multimerizable transcription factor, but this design could not improve the sensitivity of the pC induction system.
调控系统的线性化Linearization of Regulatory Systems
本领域已知的是,在诱导系统中添加负反馈能够实现报告子水平的线性变换。在本发明中,将启动子区包含负反馈(包含tetO的P tet-CMV D2i启动子)的TetR表达盒导入宿主细胞,并在诱导型启动子P rpaO-CMV1转录起始位点(Inr)的上游和下游分别插入一个tetO操纵位点,构建双输入启动子pC-dox,该启动子能够对pC-HSL和强力霉素双重信号产生应答。如图5A和5B所示,阻遏蛋白TetR与tetO的结合封闭了pC-dox的rpaO操纵位点,阻抑报告基因的转录;在dox存在下TetR从tetO上脱离,暴露操纵位点rpaO,从而与pC-HSL结合的RpaR能够结合至rpaO,进而募集RNA聚合酶,报告基因表达;由于P tet-CMV D2i启动子也包含tetO,dox对TetR表达的负调控使得在每一个dox浓度下都出现TetR的自适应(adaptation),因此报告基因的表达量随dox和pC-HSL的升高线性化。如图5C所示,该诱导系统在较宽的诱导物浓度范围下(0-100ng/mL的dox,10nM-10μM的pC-HSL)保持线性化应答,动态范围高达500,能够实现基因表达的精细调控。 It is known in the art that adding negative feedback to the induction system enables a linear transformation of reporter levels. In the present invention, the TetR expression cassette containing negative feedback (P tet-CMV D2i promoter containing tetO) in the promoter region is introduced into the host cell, and the transcription start site (Inr) of the inducible promoter P rpaO-CMV1 A tetO operator site was inserted in the upstream and downstream of , respectively, to construct a double-input promoter pC-dox, which can respond to the dual signals of pC-HSL and doxycycline. As shown in Figures 5A and 5B, the combination of the repressor protein TetR and tetO blocked the rpaO operating site of pC-dox and repressed the transcription of the reporter gene; in the presence of dox, TetR detached from tetO, exposing the operating site rpaO, thereby RpaR bound to pC-HSL can bind to rpaO, thereby recruiting RNA polymerase and reporting gene expression; since the P tet-CMV D2i promoter also contains tetO, the negative regulation of TetR expression by dox makes it appear at every dox concentration TetR adaptation (adaptation), so the expression of the reporter gene is linearized with the increase of dox and pC-HSL. As shown in Figure 5C, the induction system maintains a linear response in a wide range of inducer concentration (0-100ng/mL dox, 10nM-10μM pC-HSL), and the dynamic range is as high as 500, which can realize the control of gene expression. Fine regulation.
诱导系统的正交性Orthogonality of the induced system
由于pC-HSL与Cinn-HSL的化学结构差别仅在芳香环上的一个羟基,在天然宿主中由pC-HSL与Cinn-HSL介导的群体感应存在串扰。然而,只要同一诱导剂针对两种系统的工作浓度差异较大,就可通过降低诱导剂浓度的方式使得仅较灵敏的系统被诱导,从而将两个信号通路的正交化。为检验在哺乳动物体系下这两种诱导系统的串扰程度,我们分别以pC-HSL和Cinn-HSL作为诱导剂,对pC和Cinn两种诱导系统的信号水平串扰进行研究。如图6A所示,在pC-HSL和Cinn-HSL诱导下,Cinn信号系统的EC50分别为0.1μM和1.3nM。如图6B所示,在pC-HSL和Cinn-HSL诱导下,pC信号系统的EC50分别为0.7μM和6μM。可见,较低的Cinn-HSL浓度仅能成功诱导Cinn系统,因此可通过调节Cinn-HSL的浓度,对同时存在pC诱导系统和Cinn诱导系统的细胞或组织进行时空特异性调控。当同时存在pC和Cinn诱导系统时,采用pC-HSL作为诱导剂将导致两个系统发生信号串扰。然而,对于pC诱导系统,采用Cinn-HSL作为诱导剂能够在较低诱导剂浓度下获得更强的应答,因此也可单独使用Cinn-HSL来调控pC诱导系统。Since the chemical structure difference between pC-HSL and Cinn-HSL is only one hydroxyl group on the aromatic ring, there is crosstalk in the quorum sensing mediated by pC-HSL and Cinn-HSL in the natural host. However, as long as the same inducer has a large difference in the working concentration of the two systems, the concentration of the inducer can be reduced so that only the more sensitive system is induced, thereby orthogonalizing the two signaling pathways. In order to test the degree of crosstalk between the two induction systems in mammalian system, we used pC-HSL and Cinn-HSL as inducers to study the signal level crosstalk between pC and Cinn induction systems. As shown in Fig. 6A, the EC50 of Cinn signaling system was 0.1 μM and 1.3 nM under the induction of pC-HSL and Cinn-HSL, respectively. As shown in Figure 6B, the EC50 of the pC signaling system was 0.7 μM and 6 μM induced by pC-HSL and Cinn-HSL, respectively. It can be seen that the lower concentration of Cinn-HSL can only successfully induce the Cinn system, so by adjusting the concentration of Cinn-HSL, the cells or tissues with both the pC-inducible system and the Cinn-inducible system can be regulated spatiotemporally. When both pC and Cinn induction systems exist, the use of pC-HSL as an inducer will lead to signal crosstalk between the two systems. However, for the pC-inducible system, using Cinn-HSL as an inducer can obtain a stronger response at a lower inducer concentration, so Cinn-HSL can also be used alone to regulate the pC-inducible system.
哺乳动物细胞间通讯系统Mammalian cell-to-cell communication system
作为一个应用实例,我们在哺乳动物细胞中重构了群体感应信号系统。如上文所述,细菌的群体感应系统利用可自由扩散的AHL为信号分子,该信号分子由LuxI家族蛋白催化合成(信号发送),并由转录因子LuxR家族蛋白检测并转导(信号接收)。类似地,本发明构建的哺乳动物细胞通讯系统中包含用于生成信号分子的信号发送模块和响应信号的信号接收模块(图7A)。其中,信号发送模块B4YP由苯丙氨酸解氨酶PAL [9]、4-香豆酸辅酶A连接酶4CL [9]以及LuxI家族蛋白BraI组成。信号接收模块2XR8为本发明 的Cinn诱导系统。其中,第一基因包含启动子hEF1a以及嵌合型别构转录因子VTR3-BraR的编码序列,第二基因包含诱导型启动子P 2XR8-TRE3G以及报告基因Citrine。阴性对照为仅包含信号接收模块2XR8(图7B)。如图7C的FITC通道所示,在转染B4YP-2XR8模块的细胞中,通讯系统的报告基因Citrine有很好的表达。由于不能合成信号分子,仅包含接收模块2XR8的对照细胞没有Citrine的表达。 As an applied example, we reconstituted the quorum-sensing signaling system in mammalian cells. As mentioned above, the bacterial quorum sensing system uses freely diffusible AHL as a signal molecule, which is catalyzed by LuxI family proteins (signal sending), and detected and transduced by transcription factor LuxR family proteins (signal reception). Similarly, the mammalian cell communication system constructed in the present invention includes a signal sending module for generating signal molecules and a signal receiving module for responding to signals ( FIG. 7A ). Among them, the signaling module B4YP is composed of phenylalanine ammonia lyase PAL [9] , 4-coumaric acid-CoA ligase 4CL [9] and LuxI family protein BraI. The signal receiving module 2XR8 is the Cinn induction system of the present invention. Wherein, the first gene includes the coding sequence of the promoter hEF1a and the chimeric allosteric transcription factor VTR3-BraR, and the second gene includes the inducible promoter P 2XR8-TRE3G and the reporter gene Citrine. The negative control contained only the signal receiving module 2XR8 (Fig. 7B). As shown in the FITC channel of Figure 7C, the reporter gene Citrine of the communication system was well expressed in the cells transfected with the B4YP-2XR8 module. Control cells containing only the receptor module 2XR8 had no expression of Citrine due to the inability to synthesize signaling molecules.
方法method
表3实施例中涉及的诱导系统The induction system involved in the embodiment of table 3
Figure PCTCN2022071023-appb-000003
Figure PCTCN2022071023-appb-000003
上述诱导系统中的报告基因均为Citrine [15]The reporter gene in the above induction system is Citrine [15] .
质粒的构建Plasmid construction
对于pC诱导系统,首先将VTR3(SEQ ID NO:41)连接至核定位信号肽(SEQ ID NO:38)的5’端并将RpaR(NCBI登录号为WP_011155889.1反义链第348672-349403bp)连接至该核定位信号肽的3’端,来构建别构转录因子VTR3-RpaR的编码序列。该核定位信号肽包含两个核定位序列(SEQ ID NO:17),在两个核定位序列之间、5’端以及3’端各具有20bp随机序列。将两个rpaO操纵序列插入至TRE3G核心启动子的上游,获得P rpaO-TRE3G启动子;并在下游连接Citrine黄色荧光蛋白编码序列作为报告基因。包含VTR3-RpaR以及诱导型启动子-报告基因的质粒还组成型表达mCherry红色荧光蛋白,以便对质粒转染进行内部监控。当pC-HSL存在时,VTR3-RpaR能够与rpaO结合从而激活Citrine报告基因的表达。具体而言,利用Gibson assembly按照以下顺序将各元件组装到pcDNA3.1(+)质粒(Invitrogen,catalog#V790-20)上:P rpaO-TRE3G启动子;Citrine报告基因 [15];hEF1a启动子(来自人克隆RP11-505P4 6号染色体DNA(NCBI登录号AL603910.6)的114743-113562bp区段);VTR3-RpaR;EF1启动子 [8];mCherry报告基因 [16]。其中,每个蛋白编码序列之后都带有poly(A)序列。 For the pC induction system, VTR3 (SEQ ID NO: 41) was first connected to the 5' end of the nuclear localization signal peptide (SEQ ID NO: 38) and RpaR (NCBI accession number is WP_011155889.1 antisense chain 348672-349403bp ) is connected to the 3' end of the nuclear localization signal peptide to construct the coding sequence of the allosteric transcription factor VTR3-RpaR. The nuclear localization signal peptide comprises two nuclear localization sequences (SEQ ID NO: 17), with 20 bp random sequences between the two nuclear localization sequences, 5' end and 3' end. The two rpaO operating sequences were inserted into the upstream of the TRE3G core promoter to obtain the P rpaO-TRE3G promoter; and the Citrine yellow fluorescent protein coding sequence was connected downstream as a reporter gene. Plasmids containing VTR3-RpaR and an inducible promoter-reporter gene also constitutively express mCherry red fluorescent protein for internal monitoring of plasmid transfection. When pC-HSL exists, VTR3-RpaR can bind to rpaO to activate the expression of Citrine reporter gene. Specifically, each element was assembled onto the pcDNA3.1(+) plasmid (Invitrogen, catalog #V790-20) using Gibson assembly in the following order: P rpaO-TRE3G promoter; Citrine reporter gene [15] ; hEF1a promoter (114743-113562bp segment from human clone RP11-505P4 chromosome 6 DNA (NCBI accession number AL603910.6)); VTR3-RpaR; EF1 promoter [8] ; mCherry reporter gene [16] . Wherein, each protein coding sequence is followed by a poly(A) sequence.
就具有多聚化结构域的别构转录因子VTR3-CarH-RpaR而言,其5’端至3’端分别是VTR3(SEQ ID NO:41)、核定位信号肽(SEQ ID NO:38)、连接肽(SEQ ID NO:16)、CarH(SEQ ID NO:18)、连接肽(SEQ ID NO:16)以及RpaR(NCBI登录号为WP_011155889.1反义链第348672-349403bp)。As far as the allosteric transcription factor VTR3-CarH-RpaR with multimerization domain is concerned, its 5' end to 3' end are respectively VTR3 (SEQ ID NO: 41), nuclear localization signal peptide (SEQ ID NO: 38) , connecting peptide (SEQ ID NO: 16), CarH (SEQ ID NO: 18), connecting peptide (SEQ ID NO: 16) and RpaR (NCBI accession number is WP_011155889.1 antisense strand No. 348672-349403bp).
就IV诱导系统而言,嵌合型别构转录因子VTR3-BjaR的5’端至3’端分别是VTR3(SEQ ID NO:41)、核定位信号肽(SEQ ID NO:38)以及BjaR(SEQ ID NO:43)。其中,BjaR为根据BjaR蛋白序列(WP_011083882.1)经人源密码子优化获得。类似于pC诱导系统的构建,利用Gibson assembly按照以下 顺序将各元件组装到pcDNA3.1(+)质粒(Invitrogen,catalog#V790-20)上:P bjaO-TRE3G启动子;Citrine报告基因;hEF1a启动子;VTR3-BjaR;EF1启动子;mCherry报告基因,得到IV诱导系统质粒。 As far as the IV induction system is concerned, the 5' end to the 3' end of the chimeric allosteric transcription factor VTR3-BjaR are VTR3 (SEQ ID NO: 41), nuclear localization signal peptide (SEQ ID NO: 38) and BjaR ( SEQ ID NO: 43). Among them, BjaR is obtained through human codon optimization according to the BjaR protein sequence (WP_011083882.1). Similar to the construction of the pC induction system, each element was assembled on the pcDNA3.1(+) plasmid (Invitrogen, catalog #V790-20) using Gibson assembly in the following order: P bjaO-TRE3G promoter; Citrine reporter gene; hEF1a promoter sub; VTR3-BjaR; EF1 promoter; mCherry reporter gene, get IV induction system plasmid.
对于Cinn诱导系统,嵌合型别构转录因子VTR3-BraR的5’端至3’端分别是VTR3(SEQ ID NO:41)、核定位信号肽(SEQ ID NO:38)以及BraR(SEQ ID NO:42)。其中,BraR为根据BraR蛋白序列(CAL74858.1)经人源密码子优化获得。利用Gibson assembly按照以下顺序将各元件组装到pcDNA3.1(+)质粒(Invitrogen,catalog#V790-20)上:Cinn-HSL诱导型启动子;Citrine报告基因;hEF1a启动子;VTR3-BraR或VP64-BraR(VP64的NCBI登录号:ASW25882.1);EF1启动子;mCherry报告基因,得到Cinn诱导系统质粒。其中,Cinn-HSL诱导型启动子为其操纵序列来自于不同菌株的启动子P CUS-TRE3G、P APO-TRE3G、P NC-TRE3G、P ORS-TRE3G、P CUL-TRE3G;上游截短的启动子P F1-TRE3G、P F2-TRE3G、P F4-TRE3G、P F7-TRE3G;下游截短的启动子P R1-TRE3G、P R4-TRE3G、P R6-TRE3G、P R7-TRE3G、P R8-TRE3G以及优化后的启动子P 2XR8-TRE3GFor the Cinn induction system, the 5' end to the 3' end of the chimeric allosteric transcription factor VTR3-BraR are VTR3 (SEQ ID NO: 41), nuclear localization signal peptide (SEQ ID NO: 38) and BraR (SEQ ID NO: 42). Among them, BraR is obtained through human codon optimization according to the BraR protein sequence (CAL74858.1). The components were assembled into the pcDNA3.1(+) plasmid (Invitrogen, catalog #V790-20) using Gibson assembly in the following order: Cinn-HSL inducible promoter; Citrine reporter gene; hEF1a promoter; VTR3-BraR or VP64 - BraR (NCBI accession number of VP64: ASW25882.1); EF1 promoter; mCherry reporter gene, resulting in Cinn induction system plasmid. Among them, the Cinn-HSL inducible promoter has its operating sequence from the promoters P CUS-TRE3G , P APO-TRE3G , P NC-TRE3G , P ORS-TRE3G , P CUL-TRE3G of different strains; the upstream truncated promoter Promoters P F1-TRE3G , P F2-TRE3G , P F4-TRE3G , P F7-TRE3G ; downstream truncated promoters P R1-TRE3G , P R4-TRE3G , P R6-TRE3G , P R7-TRE3G , P R8- TRE3G and the optimized promoter P 2XR8-TRE3G .
对于pC-dox双诱导系统,首先通过反向PCR将pC诱导系统的P RpaO-TRE3G启动子换成P RpaO-CMV1启动子(SEQ ID NO:39),然后用反向PCR在P RpaO-CMV1启动子的转录起始位点(Inr)上下游各加入一个tetO,得到能够同时响应pC-HSL和dox信号的pC-dox诱导型启动子。用反向PCR去除质粒pDN-D2irTNG4kwh(addgen#44722)上的GFP编码区域,然后PCR扩增该质粒整个TetR表达元件(从CMV enhancer开始到bGH poly A结束,该表达元件包含启动子P tet-CMV D2i)。接下来通过NotI单酶切,将PCR扩增的两端带有NotI位点的TetR表达元件插入含有pC-dox启动子的pC诱导系统中,构建成最终的pC-dox诱导系统的接收器。 For the pC-dox double-inducible system, the PRpaO-TRE3G promoter of the pC induction system was replaced with the PRpaO-CMV1 promoter (SEQ ID NO: 39) by inverse PCR first, and then the P RpaO-CMV1 A tetO was added to the upstream and downstream of the transcription initiation site (Inr) of the promoter, and a pC-dox inducible promoter capable of simultaneously responding to pC-HSL and dox signals was obtained. The GFP coding region on the plasmid pDN-D2irTNG4kwh (addgen#44722) was removed by inverse PCR, and then the entire TetR expression element of the plasmid (from the beginning of CMV enhancer to the end of bGH poly A, which contained the promoter P tet- CMV D2i ). Next, the PCR-amplified TetR expression element with NotI sites at both ends was inserted into the pC inducible system containing the pC-dox promoter through NotI single enzyme digestion to construct the final receiver of the pC-dox inducible system.
B4YP-2XR8信号通讯系统:如图7所示,该信号通讯系统的信号接收模块2XR8为Cinn诱导系统(诱导型启动子为P 2XR8-TRE3G)。该信号通讯系统的信号发送模块B4YP包含BraI、4CL及PAL三个基因。利用Gibson assembly将PB载体(PB513B-1)上的CMV启动子到SV40为止整个基因表达框序列替换为EF1a核心启动子(SEQ ID NO:40)、BraI(SEQ ID NO:44)和hGH-PA(polyA区),得到EF1a-BraI表达质粒。类似地,利用Gibson assembly将PB载体上的CMV启动子到SV40为止整个基因表达框序列替换为EF1a核心启动子(SEQ ID NO:40)、4CL(NCBI登录号:CP066699.1第1990899-1992794bp)和hGH-PA,得到EF1a-4CL表达质粒。利用Gibson assembly将PB载体上的CMV启动子到SV40为止整个基因表达框序列替换为EF1a核心启动子(SEQ ID NO:40)、PAL(SEQ ID NO:45)和hGH-PA,得到EF1a-PAL表达质粒。其中,BraI为根据BraR蛋白序列(CAL74857.1)经人源密码子优化获得;PAL为根据PAL蛋白序列(XP_016272209.1)经人源密码子优化获得。就三个表达质粒的组装而言,首先构建ES质粒:在PB质粒5’绝缘子和CMV启动子之间插入RFP表达盒作为细菌筛选标记,在SV40PA和3’绝缘子之间插入EF1启动子、mCherry和SV40PA作为哺乳动物筛选标记,表达元件两端的酶切位点为I-CeuI和I-SceI;同时删除PB载体上从CMV启动子开始到SV40PA止的整段序列。对于表达质粒EF1a-BraI、EF1a-4CL以及EF1a-PAL,通过PCR扩增得到EF1a-BraI-hGH-PA、EF1a-4CL-hGH-PA以及EF1a-PAL-hGH-PA和三个片段,将这三个片段按照BraI-4CL-PAL的顺序通过golden gate组装整合在ES载体上,最终得到PB-EF1a-BraI-EFa-4cl-EF1a-PAL(B4YP)。就信号发送和接收模块的组装而言,扩增2XR8模块的Citrine/VTR3-BraR/mCherry表达盒并在两端添加I-CeuI和I-SceI酶切位点。对上述扩增产物和质粒pEF1a-B4YP进行I-CeuI和I-SceI双酶切并用T4DNA连接酶连接,即得到单质粒的细胞通讯系统p2XR8-B4YP。 B4YP-2XR8 signal communication system: as shown in Figure 7, the signal receiving module 2XR8 of the signal communication system is a Cinn inducible system (the inducible promoter is P 2XR8-TRE3G ). The signal sending module B4YP of the signal communication system includes three genes of BraI, 4CL and PAL. Use Gibson assembly to replace the entire gene expression frame sequence from the CMV promoter on the PB vector (PB513B-1) to SV40 with the EF1a core promoter (SEQ ID NO: 40), BraI (SEQ ID NO: 44) and hGH-PA (polyA region) to obtain the EF1a-Bral expression plasmid. Similarly, use Gibson assembly to replace the entire gene expression frame sequence from the CMV promoter on the PB vector to SV40 with the EF1a core promoter (SEQ ID NO: 40), 4CL (NCBI accession number: CP066699.1 No. 1990899-1992794bp) and hGH-PA to obtain the EF1a-4CL expression plasmid. Use Gibson assembly to replace the entire gene expression frame sequence from the CMV promoter on the PB vector to SV40 with the EF1a core promoter (SEQ ID NO: 40), PAL (SEQ ID NO: 45) and hGH-PA to obtain EF1a-PAL expression plasmid. Among them, BraI was obtained by human codon optimization based on the BraR protein sequence (CAL74857.1); PAL was obtained by human codon optimization based on the PAL protein sequence (XP_016272209.1). As far as the assembly of the three expression plasmids was concerned, ES plasmids were constructed first: RFP expression cassettes were inserted between the 5' insulator of the PB plasmid and the CMV promoter as bacterial selection markers, and the EF1 promoter, mCherry and SV40PA are used as mammalian screening markers, and the restriction sites at both ends of the expression element are I-CeuI and I-SceI; at the same time, the entire sequence from the CMV promoter to SV40PA on the PB vector is deleted. For the expression plasmids EF1a-BraI, EF1a-4CL and EF1a-PAL, EF1a-BraI-hGH-PA, EF1a-4CL-hGH-PA and EF1a-PAL-hGH-PA and three fragments were obtained by PCR amplification. The three fragments were assembled and integrated on the ES vector through the golden gate according to the sequence of BraI-4CL-PAL, and finally PB-EF1a-BraI-EFa-4cl-EF1a-PAL (B4YP) was obtained. For the assembly of the signaling and receiving modules, the Citrine/VTR3-BraR/mCherry expression cassette of the 2XR8 module was amplified and I-CeuI and I-SceI restriction sites were added at both ends. The amplified product and plasmid pEF1a-B4YP were digested with I-CeuI and I-SceI and ligated with T4DNA ligase to obtain the single-plasmid cell communication system p2XR8-B4YP.
哺乳动物细胞的培养及转染Mammalian cell culture and transfection
小分子诱导物pC-HSL购自Sigma(货号7077),Cinn-HSL和IV-HSL为由深圳乾延药物研发科技有限公司合成。The small molecule inducer pC-HSL was purchased from Sigma (Product No. 7077), and Cinn-HSL and IV-HSL were synthesized by Shenzhen Qianyan Drug Research and Development Technology Co., Ltd.
在37℃、5%CO 2的培养条件下,将人胚胎肾细胞系293T(HEK-293T,ATCC:CRL-11268)接种于添加有10%胎牛血清(Gibco)和1%青链霉素(Hyclone)的DMEM高糖培养基(Hyclone,含丙酮酸钠)中进行培养。在24孔细胞培养板的各孔中按照2×10 5细胞/ml接种,培养1天后使用Lipofectamine(Thermo)标准方案实施转染:将1μg质粒DNA与25μl DMEM混匀,然后加入2μl P3000混匀,将混匀后的DNA/P3000/DMEM加入混有2μl Lipofectamine 3000的25μl DMEM中,室温孵育15min。将孵育好的DNA/Lipofectamine复合物滴加入培养有293T细胞的24孔板各孔中,将细胞放回培养箱继续培养。在转染6h后向细胞加入不同浓度的小分子诱导物,在5%CO 2、37℃培养箱中诱导40h左右后收集样品进行流式分析。就包含多聚化结构域的pC诱导系统而言,在转染后6h向细胞加入不同浓度的pC-HSL并同时加入终浓度为15nM的维生素B12。 The human embryonic kidney cell line 293T (HEK-293T, ATCC: CRL-11268) was inoculated in the culture medium supplemented with 10% fetal bovine serum (Gibco) and 1% penicillin and streptomycin under the culture condition of 37°C and 5% CO 2 . (Hyclone) DMEM high glucose medium (Hyclone, containing sodium pyruvate) for culture. Inoculate 2×10 5 cells/ml in each well of a 24-well cell culture plate, and use Lipofectamine (Thermo) standard protocol for transfection after 1 day of culture: Mix 1 μg plasmid DNA with 25 μl DMEM, then add 2 μl P3000 and mix well , Add the mixed DNA/P3000/DMEM into 25 μl DMEM mixed with 2 μl Lipofectamine 3000, and incubate at room temperature for 15 minutes. The incubated DNA/Lipofectamine complex was added dropwise into each well of the 24-well plate in which 293T cells were cultured, and the cells were returned to the incubator to continue culturing. Six hours after transfection, different concentrations of small molecule inducers were added to the cells, and the samples were collected for flow cytometric analysis after induction in a 5% CO 2 , 37° C. incubator for about 40 hours. For the pC-inducible system containing the multimerization domain, different concentrations of pC-HSL were added to the cells 6 h after transfection together with vitamin B12 at a final concentration of 15 nM.
流式样品的收集及数据处理Flow sample collection and data processing
对于经诱导的细胞,用PBS缓冲液清洗两次除去残留的培养基后,在各孔中加入25μl 0.25%的胰酶(Gibco)室温孵育1min,随后加入150μl DMEM完全培养基,反复吹打细胞使其成为单细胞状态。在96孔板中加入经分散处理的细胞悬液,1000rpm离心5min,吸去130μl上清,用110μl 4%多聚甲醛固定液(博士德)重悬细胞,随后用Beckman CytoFLEX S流式细胞仪的FITC通道和ECD通道检测全部样品的荧光 信号。For the induced cells, after washing twice with PBS buffer to remove the residual medium, add 25 μl 0.25% trypsin (Gibco) to each well and incubate at room temperature for 1 min, then add 150 μl DMEM complete medium, blow and beat the cells repeatedly It becomes a unicellular state. Add the dispersed cell suspension to the 96-well plate, centrifuge at 1000rpm for 5min, suck off 130μl supernatant, resuspend the cells with 110μl 4% paraformaldehyde fixative (Boster), and then use Beckman CytoFLEX S flow cytometer The FITC channel and ECD channel detect the fluorescence signals of all samples.
使用Beckman CytoFLEX S流式细胞仪自带软件CytExpert对流式数据进行分析,基于未转染的293T细胞的荧光强度设门。将ECD+群体的平均FITC作为各样品的输出值。对于pC-dox诱导系统,减掉背景自发荧光。本文示出的全部流式细胞数据为至少三次重复的平均值,S.D.为各次测量的标准差。The flow cytometry data were analyzed using CytExpert software that comes with the Beckman CytoFLEX S flow cytometer, and the gate was set based on the fluorescence intensity of untransfected 293T cells. The average FITC of the ECD+ population was used as the output for each sample. For pC-dox inducible systems, background autofluorescence was subtracted. All flow cytometric data presented herein are the mean of at least three replicates, S.D. is the standard deviation of individual measurements.
显微镜观察microscope observation
按照本文所述的Lipofectamine转染方法将B4YP-2XR8和2XR8分别转染至293T细胞,转染48h后用荧光显微镜EVOS7000(Thermo)检测接收器报告基因Citrine的表达。B4YP-2XR8 and 2XR8 were respectively transfected into 293T cells according to the Lipofectamine transfection method described herein, and the expression of the receiver reporter gene Citrine was detected with a fluorescence microscope EVOS7000 (Thermo) 48 hours after transfection.
诱导系统灵敏度的定量拟合Quantitative fitting of induced system sensitivity
为了确定各诱导系统的灵敏度和动态范围,利用如下的希尔方程对实验测得的剂量响应曲线进行拟合。In order to determine the sensitivity and dynamic range of each induction system, the following Hill equation was used to fit the experimentally measured dose-response curves.
Figure PCTCN2022071023-appb-000004
Figure PCTCN2022071023-appb-000004
其中,y是报告子Citrine的荧光强度(输出),y min为输出的最小值,y max为输出的最大值。拟合得到的K为当输出值为半最大值时的输入信号浓度,即EC50理论值。该诱导曲线的动态范围为y max/y minWherein, y is the fluorescence intensity (output) of the reporter Citrine, y min is the minimum value of the output, and y max is the maximum value of the output. The K obtained by fitting is the input signal concentration when the output value is at half maximum, that is, the theoretical value of EC50. The dynamic range of this induction curve is y max /y min .
参考文献references
1 Weber W.,et.al.,NAR 2003 31(14)e71,https://doi.org/10.1093/nar/gng071.1 Weber W., et.al., NAR 2003 31(14)e71, https://doi.org/10.1093/nar/gng071.
2.Shiner E.,et.al.,Biol.Proced.Online 2004;6(1):268-276,doi:10.1251/bpo98.2. Shiner E., et.al., Biol.Proced.Online 2004; 6(1):268-276, doi:10.1251/bpo98.
3.Neddermann P.,et.al.,EMBO reports Vol 4,NO 2,2003.3. Neddermann P., et.al., EMBO reports Vol 4, NO 2, 2003.
4.Loew R.,et al.,BMC Biotechnology 2010,10:81.4. Loew R., et al., BMC Biotechnology 2010, 10:81.
5.Lindemann A.,et.al.,PNAS 2011 108(40)16765-16770.5. Lindemann A., et.al., PNAS 2011 108(40) 16765-16770.
6.Fuqua C.,et.al.,Annu Rev Microbiol.1996;50:727-51.doi:10.1146/annurev.micro.50.1.727.6. Fuqua C., et.al., Annu Rev Microbiol.1996; 50:727-51. doi:10.1146/annurev.micro.50.1.727.
7.Ahlgren N.,et.al.,PNAS 2011 108(17)7183-7188.7. Ahlgren N., et.al., PNAS 2011 108(17)7183-7188.
8.Du P.,et.al.,Nature Communications Vol 11,Article number:4226(2020)8. Du P., et.al., Nature Communications Vol 11, Article number: 4226 (2020)
9.Jiang H.,et.al.,Appl Environ Microbiol.2005;71(6):2962–2969.9. Jiang H., et.al., Appl Environ Microbiol. 2005; 71(6):2962–2969.
10.Gilbert L.,et.al.,Cell 2013;154:442-451.10. Gilbert L., et.al., Cell 2013; 154:442-451.
11.Chavez A.,et.al.,Nat Methods.2015;12(4):326-8.11. Chavez A., et.al., Nat Methods. 2015; 12(4):326-8.
12.Ma D.,et.al.,ACS Synth.Biol.2018,7,978-985.12. Ma D., et.al., ACS Synth. Biol. 2018, 7, 978-985.
13.Wang R.,et.al.,PNAS 2017;114(23):5912–5917.13. Wang R., et.al., PNAS 2017; 114(23):5912–5917.
14.Nevozhay D.,et.al.,PNAS 2009 106(13)5123-5128.14.Nevozhay D., et.al., PNAS 2009 106(13)5123-5128.
15Ahmed,A.,et.al.,PNAS 97(22)11996-12001;https://doi.org/10.1073/pnas.97.22.11996(2000).15 Ahmed, A., et.al., PNAS 97(22) 11996-12001; https://doi.org/10.1073/pnas.97.22.11996(2000).
16.Shaner NC.,et.al.,Nat Biotechnol.2004;22(12):1567-72.16. Shaner NC., et.al., Nat Biotechnol. 2004; 22(12):1567-72.
17.Stanton,BC.,et al.,Nat Chem Biol 10,99-105,doi:10.1038/nchembio.1411(2014).17. Stanton, BC., et al., Nat Chem Biol 10, 99-105, doi: 10.1038/nchembio.1411 (2014).

Claims (53)

  1. 一种用于对哺乳动物细胞中感兴趣的基因进行调控的嵌合型别构转录因子,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;其中,所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;所述嵌合型别构转录因子以天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂;所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯(pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL)。A chimeric allosteric transcription factor for regulating a gene of interest in mammalian cells, said chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation domain; wherein said regulatory domain selected from the group consisting of RpaR, BjaR, and BraR; the transcriptional activation domain is selected from P65, VP16, VP64, VTR1, VTR2, and VTR3; the chimeric allosteric transcription factor is derived from an acyl group derived from a natural amino acid Homoserine lactone is used as an inducer; the acyl homoserine lactone derived from a natural amino acid is selected from the group consisting of p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone Serine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL).
  2. 如权利要求1所述的嵌合型别构转录因子,其中,所述转录激活结构域为VTR3。The chimeric allosteric transcription factor according to claim 1, wherein the transcriptional activation domain is VTR3.
  3. 如权利要求1或2所述的嵌合型别构转录因子,其中,所述调控结构域为RpaR,所述诱导剂为pC-HSL或Cinn-HSL;优选地,所述诱导剂为pC-HSL。The chimeric allosteric transcription factor according to claim 1 or 2, wherein the regulatory domain is RpaR, and the inducer is pC-HSL or Cinn-HSL; preferably, the inducer is pC- HSL.
  4. 如权利要求1或2所述的嵌合型别构转录因子,其中,所述调控结构域BjaR,所述诱导剂为IV-HSL。The chimeric allosteric transcription factor according to claim 1 or 2, wherein the regulatory domain is BjaR, and the inducer is IV-HSL.
  5. 如权利要求1或2所述的嵌合型别构转录因子,其中,所述调控结构域为BraR,所述诱导剂为Cinn-HSL。The chimeric allosteric transcription factor according to claim 1 or 2, wherein the regulatory domain is BraR, and the inducer is Cinn-HSL.
  6. 如权利要求1-5中任一项所述的嵌合型别构转录因子,其中,使用连接肽将所述调控结构域与所述转录激活结构域相连;优选地,所述连接肽的长度为20-60bp;优选地,所述连接肽具有SEQ ID NO:16的序列。The chimeric allosteric transcription factor according to any one of claims 1-5, wherein the regulatory domain is connected to the transcriptional activation domain using a connecting peptide; preferably, the length of the connecting peptide is It is 20-60bp; Preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
  7. 如权利要求1-5中任一项所述的嵌合型别构转录因子,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端、或位于所述调控结构域与所述转录激活结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,所述核定位序列的5’端和3’端各自独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽位于所述调控结构域与所述转录激活结构域之间;优选地,所述核定位信号肽包含2个核定位序列,所述核定位序列的5’端和3’端各自独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列。The chimeric allosteric transcription factor according to any one of claims 1-5, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located in the The N-terminal or C-terminal of the chimeric allosteric transcription factor, or between the regulatory domain and the transcriptional activation domain, the nuclear localization signal peptide contains 1-4 nuclear localization sequences; preferably, the The 5' end and the 3' end of the nuclear localization sequence each independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; preferably , the nuclear localization signal peptide comprises two nuclear localization sequences, the 5' end and the 3' end of the nuclear localization sequence each independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has SEQ ID NO :17 sequence.
  8. 如权利要求1-5中任一项所述的嵌合型别构转录因子,其中,所述嵌合型别构转录因子进一步包含多聚化结构域,优选地,所述多聚化结构域为CarH。The chimeric allosteric transcription factor according to any one of claims 1-5, wherein the chimeric allosteric transcription factor further comprises a multimerization domain, preferably, the multimerization domain For CarH.
  9. 如权利要求8所述的嵌合型别构转录因子,其中,使用连接肽将所述调控结构域、所述多聚化结构域与所述转录激活结构域相连;优选地,所述连接肽各自具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。The chimeric allosteric transcription factor according to claim 8, wherein the regulatory domain, the multimerization domain and the transcriptional activation domain are connected using a connecting peptide; preferably, the connecting peptide Each has a length of 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
  10. 如权利要求8所述的嵌合型别构转录因子,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端、或者位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,所述核定位序列的5’端和3’端各自独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽包含2个核定位序列,所述核定位序列的5’端和3’端各自独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列;优选地,所述核定位信号肽位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间;优选地,所述多聚化结构域的5’端和3’端进一步包含连接肽,优选地,所述连接肽的各自独立地具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。The chimeric allosteric transcription factor according to claim 8, wherein said chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein said nuclear localization signal peptide is located at said chimeric allosteric transcription The N-terminal or C-terminal of the factor, or between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain, the nuclear localization signal The peptide contains 1-4 nuclear localization sequences; preferably, the 5' end and the 3' end of the nuclear localization sequence each independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide contains 2 nuclear Positioning sequence, the 5' end and the 3' end of the nuclear localization sequence each independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization sequence The positioning signal peptide is located between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain; preferably, the multimerization domain The 5' end and the 3' end further comprise a connecting peptide, preferably, each of the connecting peptides independently has a length of 20-60bp; preferably, the connecting peptide has a sequence of SEQ ID NO: 16.
  11. 如权利要求1-10中任一项所述的嵌合型别构转录因子,其中,所述哺乳动物细胞为人细胞。The chimeric allosteric transcription factor according to any one of claims 1-10, wherein the mammalian cells are human cells.
  12. 如权利要求1-10中任一项所述的嵌合型别构转录因子,其中,所述哺乳动物细胞为永生化细胞、原代细胞或干细胞。The chimeric allosteric transcription factor according to any one of claims 1-10, wherein the mammalian cells are immortalized cells, primary cells or stem cells.
  13. 一种以天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂的调控元件组,所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯 (pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL);所述调控元件组包含:A set of regulatory elements using natural amino acid-derived acyl homoserine lactones as inducers selected from the group consisting of p-coumaroyl-homoserine ester (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL); the set of regulatory elements comprises:
    嵌合型别构转录因子,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;其中,所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;以及A chimeric allosteric transcription factor, said chimeric allosteric transcription factor comprising a regulatory domain and a transcriptional activation domain; wherein said regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; said The transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3; and
    在哺乳动物细胞中具有活性的诱导型启动子,所述诱导型启动子包含核心序列以及与所述嵌合型别构转录因子相互作用的至少一对操纵位点对。An inducible promoter active in mammalian cells, said inducible promoter comprising a core sequence and at least one operator site pair interacting with said chimeric allosteric transcription factor.
  14. 如权利要求13所述的调控元件组,其中,所述嵌合型别构转录因子的转录激活结构域为VTR3。The set of regulatory elements according to claim 13, wherein the transcriptional activation domain of the chimeric allosteric transcription factor is VTR3.
  15. 如权利要求13或14所述的调控元件组,其中,所述嵌合型别构转录因子的调控结构域为RpaR,所述诱导型启动子的操纵位点为rpaO,所述诱导剂为pC-HSL或Cinn-HSL;优选地,所述诱导剂为pC-HSL。The set of regulatory elements according to claim 13 or 14, wherein the regulatory domain of the chimeric allosteric transcription factor is RpaR, the operating site of the inducible promoter is rpaO, and the inducer is pC -HSL or Cinn-HSL; preferably, the inducer is pC-HSL.
  16. 如权利要求13或14所述的调控元件组,其中,所述嵌合型别构转录因子的调控结构域为BjaR,所述诱导型启动子的操纵位点为bjaO,所述诱导剂为IV-HSL。The set of regulatory elements according to claim 13 or 14, wherein the regulatory domain of the chimeric allosteric transcription factor is BjaR, the operating site of the inducible promoter is bjaO, and the inducer is IV -HSL.
  17. 如权利要求13或14所述的调控元件组,其中,所述嵌合型别构转录因子的调控结构域为BraR,所述诱导型启动子的操纵位点为braO、优选为R8;所述诱导剂为Cinn-HSL。The set of regulatory elements according to claim 13 or 14, wherein the regulatory domain of the chimeric allosteric transcription factor is BraR, and the operating site of the inducible promoter is braO, preferably R8; The inducer was Cinn-HSL.
  18. 如权利要求13-17中任一项所述的调控元件组,其中,使用连接肽将所述嵌合型别构转录因子中的所述调控结构域与所述转录激活结构域相连;优选地,所述连接肽的长度为20-60bp;优选地,所述连接肽具有SEQ ID NO:16的序列。The set of regulatory elements according to any one of claims 13-17, wherein the regulatory domain in the chimeric allosteric transcription factor is connected to the transcriptional activation domain using a connecting peptide; preferably , the length of the connecting peptide is 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
  19. 如权利要求13-17中任一项所述的调控元件组,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端、或位于所述调控结构域与所述转录激活结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,所述核定位序列的5’端和3’端各自独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽位于所述调控结构域与所述转录激活结构域之间;优选地,所述核定位信号肽包含2个核定位序列,所述核定位序列的5’端和3’端各自独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列。The set of regulatory elements according to any one of claims 13-17, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located in the chimeric type The N-terminal or C-terminal of the constitutive transcription factor, or between the regulatory domain and the transcriptional activation domain, the nuclear localization signal peptide comprises 1-4 nuclear localization sequences; preferably, the nuclear localization sequence The 5' end and the 3' end each independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; preferably, the nuclear The localization signal peptide comprises two nuclear localization sequences, the 5' end and the 3' end of the nuclear localization sequence each independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17 .
  20. 如权利要求13-19中任一项所述的调控元件组,其中,所述嵌合型别构转录因子进一步包含多聚化结构域,优选地,所述多聚化结构域为CarH。The set of regulatory elements according to any one of claims 13-19, wherein the chimeric allosteric transcription factor further comprises a multimerization domain, preferably, the multimerization domain is CarH.
  21. 如权利要求20所述的调控元件组,其中,使用连接肽将所述嵌合型别构转录因子中的所述调控结构域、所述多聚化结构域与所述转录激活结构域相连;优选地,所述连接肽各自独立地具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。The set of regulatory elements according to claim 20, wherein the regulatory domain, the multimerization domain and the transcriptional activation domain in the chimeric allosteric transcription factor are connected using a connecting peptide; Preferably, the connecting peptides each independently have a length of 20-60bp; preferably, the connecting peptides have the sequence of SEQ ID NO: 16.
  22. 如权利要求20所述的调控元件组,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端、或者位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,所述核定位序列的5’端和3’端各自独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽包含2个核定位序列,所述核定位序列的5’端和3’端各自独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列;优选地,所述核定位信号肽位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间;优选地,所述多聚化结构域的5’端和3’端进一步包含连接肽,优选地,所述连接肽各自独立地具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。The set of regulatory elements according to claim 20, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at the N-terminal of the chimeric allosteric transcription factor Or the C-terminus, or between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain, the nuclear localization signal peptide comprises 1- 4 nuclear localization sequences; preferably, the 5' end and the 3' end of the nuclear localization sequence each independently have a 5-40bp flanking sequence; preferably, the nuclear localization signal peptide contains 2 nuclear localization sequences, so The 5' end and the 3' end of the nuclear localization sequence each independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization signal peptide is located at between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain; preferably, the 5' end of the multimerization domain and The 3' end further comprises a connecting peptide, preferably, each of the connecting peptides independently has a length of 20-60bp; preferably, the connecting peptide has a sequence of SEQ ID NO: 16.
  23. 如权利要求13-22中任一项所述的调控元件组,其中,所述诱导型启动子的核心序列选自于CMV1启动子、CMVmini启动子、TRE3G启动子、EF1a核心启动子或hEF1a启动子;优选地,所述核心序列为CMV1启动子或TRE3G启动子。The set of regulatory elements according to any one of claims 13-22, wherein the core sequence of the inducible promoter is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a promoter promoter; preferably, the core sequence is a CMV1 promoter or a TRE3G promoter.
  24. 如权利要求13-23中任一项所述的调控元件组,其中,所述操纵位点对为1对或多对;优选地,所述操纵位点对为1对;优选地,各操纵位点独立地位于转录起始位点的上游和下 游。The set of regulatory elements according to any one of claims 13-23, wherein the pair of manipulation sites is 1 or more pairs; preferably, the pair of manipulation sites is 1 pair; preferably, each manipulation site sites independently located upstream and downstream of the transcription initiation site.
  25. 如权利要求13-24中任一项所述的调控元件组,所述调控元件组进一步包含阻遏蛋白以及第三启动子,其中,所述第三启动子驱动所述阻遏蛋白的表达,所述第三启动子包含与所述阻遏蛋白相互作用的第二操纵位点;其中,所述诱导型启动子进一步包含所述第二操纵位点。The set of regulatory elements according to any one of claims 13-24, said set of regulatory elements further comprising a repressor protein and a third promoter, wherein said third promoter drives the expression of said repressor protein, said A third promoter comprises a second operator site that interacts with said repressor protein; wherein said inducible promoter further comprises said second operator site.
  26. 如权利要求25所述的调控元件组,其中,所述阻遏蛋白为TetR,所述第二操纵位点为tetO;或者,所述阻遏蛋白为CymR,所述第二操纵位点为cymO。The set of regulatory elements according to claim 25, wherein the repressor protein is TetR, and the second operating site is tetO; or, the repressor protein is CymR, and the second operating site is cymO.
  27. 如权利要求13-26中任一项所述的调控元件组,其中,所述哺乳动物细胞为人细胞。The set of regulatory elements of any one of claims 13-26, wherein the mammalian cells are human cells.
  28. 如权利要求13-26中任一项所述的调控元件组,其中,所述哺乳动物细胞为永生化细胞、原代细胞或干细胞。The set of regulatory elements according to any one of claims 13-26, wherein said mammalian cells are immortalized cells, primary cells or stem cells.
  29. 一种对哺乳动物细胞中感兴趣的基因进行诱导表达的系统,所述诱导表达系统利用天然氨基酸衍生的酰基高丝氨酸内酯作为诱导剂,所述天然氨基酸衍生的酰基高丝氨酸内酯选自于由如下化合物所组成的组:对香豆酰基-高丝氨酸内酯(pC-HSL)、异戊酰基-高丝氨酸内酯(IV-HSL)以及肉桂酰基-高丝氨酸内酯(Cinn-HSL);所述诱导表达系统包含:A system for inducing expression of a gene of interest in mammalian cells, the inducible expression system utilizing acyl homoserine lactone derived from a natural amino acid as an inducer, the acyl homoserine lactone derived from a natural amino acid being selected from The group consisting of the following compounds: p-coumaroyl-homoserine lactone (pC-HSL), isovaleryl-homoserine lactone (IV-HSL) and cinnamoyl-homoserine lactone (Cinn-HSL); The inducible expression system comprises:
    第一基因表达盒,所述第一基因表达盒包含第一启动子序列以及嵌合型别构转录因子的编码序列,其中,所述嵌合型别构转录因子包含调控结构域以及转录激活结构域;所述调控结构域选自于由RpaR、BjaR以及BraR所组成的组;所述转录激活结构域选自于P65、VP16、VP64、VTR1、VTR2以及VTR3;以及A first gene expression cassette, the first gene expression cassette comprising a first promoter sequence and a coding sequence of a chimeric allosteric transcription factor, wherein the chimeric allosteric transcription factor comprises a regulatory domain and a transcriptional activation structure domain; the regulatory domain is selected from the group consisting of RpaR, BjaR and BraR; the transcriptional activation domain is selected from the group consisting of P65, VP16, VP64, VTR1, VTR2 and VTR3; and
    第二基因表达盒,所述第二基因表达盒包含诱导型启动子序列以及所述感兴趣的基因的编码序列,所述诱导型启动子序列包含与所述嵌合型别构转录因子相互作用的至少一对操纵位点对。A second gene expression cassette, the second gene expression cassette comprising an inducible promoter sequence and the coding sequence of the gene of interest, the inducible promoter sequence comprising an allosteric transcription factor interacting with the chimeric type at least one pair of operator sites.
  30. 如权利要求29所述的诱导表达系统,其中,所述第一基因中所述嵌合型别构转录因子的转录激活结构域为VTR3。The inducible expression system according to claim 29, wherein the transcriptional activation domain of the chimeric allosteric transcription factor in the first gene is VTR3.
  31. 如权利要求29或30所述的诱导表达系统,其中,所述第一基因中所述嵌合型别构转录因子的调控结构域为RpaR,所述第二基因中所述诱导型启动子的操纵位点为rpaO,所述诱导剂为pC-HSL或Cinn-HSL;优选地,所述诱导剂为pC-HSL。The inducible expression system according to claim 29 or 30, wherein the regulatory domain of the chimeric allosteric transcription factor in the first gene is RpaR, and the inducible promoter in the second gene is The manipulation site is rpaO, and the inducer is pC-HSL or Cinn-HSL; preferably, the inducer is pC-HSL.
  32. 如权利要求29或30所述的诱导表达系统,其中,所述第一基因中所述嵌合型别构转录因子的调控结构域为BjaR,所述第二基因中所述诱导型启动子的操纵位点为bjaO,所述诱导剂为IV-HSL。The inducible expression system according to claim 29 or 30, wherein the regulatory domain of the chimeric allosteric transcription factor in the first gene is BjaR, and the inducible promoter in the second gene is The operator site is bjaO, and the inducer is IV-HSL.
  33. 如权利要求29或30所述的诱导表达系统,其中,所述第一基因中所述嵌合型别构转录因子的调控结构域为BraR,所述第二基因中所述诱导型启动子的操纵位点为braO、优选R8,所述诱导剂为Cinn-HSL。The inducible expression system according to claim 29 or 30, wherein the regulatory domain of the chimeric allosteric transcription factor in the first gene is BraR, and the inducible promoter in the second gene is The operator site is braO, preferably R8, and the inducer is Cinn-HSL.
  34. 如权利要求29-33中任一项所述的诱导表达系统,其中,在所述第一基因中,使用连接肽将所述调控结构域与所述转录激活结构域相连;优选地,所述连接肽的长度为20-60bp;优选地,所述连接肽具有SEQ ID NO:16的序列。The inducible expression system according to any one of claims 29-33, wherein, in the first gene, the regulatory domain is connected to the transcriptional activation domain using a connecting peptide; preferably, the The length of the connecting peptide is 20-60bp; preferably, the connecting peptide has the sequence of SEQ ID NO: 16.
  35. 如权利要求29-33中任一项所述的诱导表达系统,其中,所述第一基因的所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端、或位于所述调控结构域与所述转录激活结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,所述核定位序列的5’端和3’端各自独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽位于所述调控结构域与所述转录激活结构域之间;优选地,所述核定位信号肽包含2个核定位序列,所述核定位序列的5’端和3’端各自独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列。The inducible expression system according to any one of claims 29-33, wherein the chimeric allosteric transcription factor of the first gene further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at The N-terminal or C-terminal of the chimeric allosteric transcription factor, or between the regulatory domain and the transcriptional activation domain, the nuclear localization signal peptide contains 1-4 nuclear localization sequences; preferably , the 5' end and the 3' end of the nuclear localization sequence each independently have a flanking sequence of 5-40bp; preferably, the nuclear localization signal peptide is located between the regulatory domain and the transcriptional activation domain; Preferably, the nuclear localization signal peptide comprises two nuclear localization sequences, and the 5' and 3' ends of the nuclear localization sequences each independently have a 5-20bp flanking sequence; preferably, the nuclear localization sequence has SEQ ID NO: Sequence of 17.
  36. 如权利要求29-35中任一项所述的诱导表达系统,其中,所述第一基因的所述嵌合型别构转录因子进一步包含多聚化结构域,优选地,所述多聚化结构域为CarH。The inducible expression system according to any one of claims 29-35, wherein the chimeric allosteric transcription factor of the first gene further comprises a multimerization domain, preferably, the multimerization The structural domain is CarH.
  37. 如权利要求36所述的诱导表达系统,其中,使用连接肽将所述嵌合型别构转录因子中的所述调控结构域、所述多聚化结构域与所述转录激活结构域相连;优选地,所述连接肽 各自独立地具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。The inducible expression system according to claim 36, wherein the regulatory domain, the multimerization domain and the transcriptional activation domain in the chimeric allosteric transcription factor are connected using a connecting peptide; Preferably, the connecting peptides each independently have a length of 20-60bp; preferably, the connecting peptides have the sequence of SEQ ID NO: 16.
  38. 如权利要求36所述的诱导表达系统,其中,所述嵌合型别构转录因子进一步包含核定位信号肽,其中,所述核定位信号肽位于所述嵌合型别构转录因子的N端或C端、或者位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间,所述核定位信号肽包含1-4个核定位序列;优选地,所述核定位序列的5’端和3’端各自独立地具有5-40bp的侧翼序列;优选地,所述核定位信号肽包含2个核定位序列,所述核定位序列的5’端和3’端各自独立地具有5-20bp的侧翼序列;优选地,所述核定位序列具有SEQ ID NO:17的序列;优选地,所述核定位信号肽位于所述调控结构域与所述多聚化结构域之间、或者位于所述转录激活结构域与所述多聚化结构域之间;优选地,所述多聚化结构域的5’端和3’端进一步包含连接肽,优选地,所述连接肽各自独立地具有20-60bp的长度;优选地,所述连接肽具有SEQ ID NO:16的序列。The inducible expression system according to claim 36, wherein the chimeric allosteric transcription factor further comprises a nuclear localization signal peptide, wherein the nuclear localization signal peptide is located at the N-terminal of the chimeric allosteric transcription factor Or the C-terminus, or between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain, the nuclear localization signal peptide comprises 1- 4 nuclear localization sequences; preferably, the 5' end and the 3' end of the nuclear localization sequence each independently have a 5-40bp flanking sequence; preferably, the nuclear localization signal peptide contains 2 nuclear localization sequences, so The 5' end and the 3' end of the nuclear localization sequence each independently have a flanking sequence of 5-20bp; preferably, the nuclear localization sequence has the sequence of SEQ ID NO: 17; preferably, the nuclear localization signal peptide is located at between the regulatory domain and the multimerization domain, or between the transcriptional activation domain and the multimerization domain; preferably, the 5' end of the multimerization domain and The 3' end further comprises a connecting peptide, preferably, each of the connecting peptides independently has a length of 20-60bp; preferably, the connecting peptide has a sequence of SEQ ID NO: 16.
  39. 如权利要求29-38中任一项所述的诱导表达系统,其中,所述第一基因中所述第一启动子为组成型启动子或诱导型启动子。The inducible expression system according to any one of claims 29-38, wherein the first promoter in the first gene is a constitutive promoter or an inducible promoter.
  40. 如权利要求29-39中任一项所述的诱导表达系统,其中,所述第二基因中所述诱导型启动子的核心序列选自于CMV1启动子、CMVmini启动子、TRE3G启动子、EF1a核心启动子或hEF1a启动子;优选地,所述核心序列为CMV1启动子或TRE3G启动子。The inducible expression system according to any one of claims 29-39, wherein the core sequence of the inducible promoter in the second gene is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a Core promoter or hEF1a promoter; preferably, the core sequence is CMV1 promoter or TRE3G promoter.
  41. 如权利要求29-40中任一项所述的诱导表达系统,其中,所述第二基因中所述操纵位点对为1对或多对;优选地,所述操纵位点对为1对;优选地,各操纵位点独立地位于转录起始位点的上游和下游。The inducible expression system according to any one of claims 29-40, wherein the pair of operating sites in the second gene is one or more pairs; preferably, the pair of operating sites is one pair ; Preferably, each operator site is independently located upstream and downstream of the transcription start site.
  42. 如权利要求29-41中任一项所述的诱导表达系统,所述诱导表达系统进一步包含第三基因表达盒,其中,所述第三基因表达盒包含第三启动子序列以及阻遏蛋白的编码序列,用于在宿主细胞中表达所述阻遏蛋白,所述第三启动子序列包含第二操纵位点,所述第二操纵位点与所述阻遏蛋白相互作用;所述第二基因的诱导型启动子序列进一步包含第二操纵位点。The inducible expression system according to any one of claims 29-41, said inducible expression system further comprising a third gene expression cassette, wherein said third gene expression cassette comprises a third promoter sequence and a coding repressor protein sequence for expressing said repressor protein in a host cell, said third promoter sequence comprising a second operator site, said second operator site interacting with said repressor protein; induction of said second gene The type promoter sequence further comprises a second operator site.
  43. 如权利要求42所述的诱导表达系统,其中,所述阻遏蛋白为TetR,所述第二操纵位点为tetO;或者,所述阻遏蛋白为CymR,所述第二操纵位点为cymO。The inducible expression system according to claim 42, wherein the repressor protein is TetR, and the second operating site is tetO; or, the repressor protein is CymR, and the second operating site is cymO.
  44. 如权利要求42或43所述的诱导表达系统,其中,所述第三基因中所述第三启动子的核心序列选自于CMV1启动子、CMVmini启动子、TRE3G启动子、EF1a核心启动子或hEF1a启动子;优选地,所述核心序列为CMV1启动子或TRE3G启动子。The inducible expression system according to claim 42 or 43, wherein the core sequence of the third promoter in the third gene is selected from CMV1 promoter, CMVmini promoter, TRE3G promoter, EF1a core promoter or hEF1a promoter; preferably, the core sequence is CMV1 promoter or TRE3G promoter.
  45. 如权利要求29-44中任一项所述的诱导表达系统,其中,所述哺乳动物细胞为人细胞。The inducible expression system according to any one of claims 29-44, wherein the mammalian cells are human cells.
  46. 如权利要求29-44中任一项所述的诱导表达系统,其中,所述哺乳动物细胞为永生化细胞、原代细胞或干细胞。The inducible expression system according to any one of claims 29-44, wherein the mammalian cells are immortalized cells, primary cells or stem cells.
  47. 权利要求1-12中任一项所述的嵌合型别构转录因子、权利要求13-28中任一项所述的调控元件组、权利要求29-46中任一项所述的诱导表达系统在构建单细胞或多细胞信号通讯系统中的用途,其中,所述信号通讯系统包含信号发送元件和信号接收元件,所述信号接收元件包含嵌合型别构转录因子以及在哺乳动物细胞中具有活性的诱导型启动子。The chimeric allosteric transcription factor according to any one of claims 1-12, the set of regulatory elements according to any one of claims 13-28, the induced expression according to any one of claims 29-46 The application of the system in the construction of a single cell or multi-cell signal communication system, wherein the signal communication system includes a signal sending element and a signal receiving element, and the signal receiving element includes a chimeric allosteric transcription factor and in a mammalian cell Active inducible promoter.
  48. 如权利要求47所述的用途,其中,所述信号通讯系统的信号分子为pC-HSL;所述嵌合型别构转录因子中所述调控结构域为RpaR,所述诱导型启动子包含核心序列以及至少一对rpaO;所述信号发送元件包含TAL、4CL以及RpaI。The use according to claim 47, wherein the signal molecule of the signal communication system is pC-HSL; the regulatory domain in the chimeric allosteric transcription factor is RpaR, and the inducible promoter comprises a core sequence and at least one pair of rpaO; the signaling elements include TAL, 4CL and RpaI.
  49. 如权利要求47所述的用途,其中,所述信号通讯系统的信号分子为Cinn-HSL;所述嵌合型别构转录因子中所述调控结构域为BraR,所述诱导型启动子包含核心序列以及至少一对braO;所述信号发送元件包含PAL、4CL以及BraI。The use according to claim 47, wherein the signaling molecule of the signal communication system is Cinn-HSL; the regulatory domain in the chimeric allosteric transcription factor is BraR, and the inducible promoter comprises a core sequence and at least one pair of braO; the signaling elements include PAL, 4CL and BraI.
  50. 如权利要求47所述的用途,其中,所述信号通讯系统的信号分子为IV-HSL;所述嵌合型别构转录因子中所述调控结构域为BjaR,所述诱导型启动子包含核心序列以及至少一对bjaO;所述信号发送元件包含支链α-酮酸脱氢酶复合物BCDH以及BjaI。The use according to claim 47, wherein the signal molecule of the signal communication system is IV-HSL; the regulatory domain in the chimeric allosteric transcription factor is BjaR, and the inducible promoter comprises a core sequence and at least one pair of bjaO; the signaling element comprises a branched-chain α-ketoacid dehydrogenase complex BCDH and BjaI.
  51. 权利要求1-12中任一项所述的嵌合型别构转录因子、权利要求13-28中任一项所述的调控元件组、权利要求29-46中任一项所述的诱导表达系统在基因功能分析、或者构建人 类疾病模型、人造组织或人造器官方面的用途。The chimeric allosteric transcription factor according to any one of claims 1-12, the set of regulatory elements according to any one of claims 13-28, the induced expression according to any one of claims 29-46 The use of the system in the analysis of gene function, or in the construction of human disease models, artificial tissues or artificial organs.
  52. 如权利要求51所述的用途,其中,将所述嵌合型别构转录因子、所述调控元件组或所述诱导表达系统稳定转染至哺乳动物细胞。The use according to claim 51, wherein the chimeric allosteric transcription factor, the set of regulatory elements or the inducible expression system are stably transfected into mammalian cells.
  53. 如权利要求51所述的用途,其中,将所述嵌合型别构转录因子、所述调控元件组或所述诱导表达系统瞬时转染至哺乳动物细胞。The use according to claim 51, wherein the chimeric allosteric transcription factor, the set of regulatory elements or the inducible expression system is transiently transfected into mammalian cells.
PCT/CN2022/071023 2021-09-29 2022-01-10 Chimeric allosteric transcription factor for mammalian system, regulatory element group, and inducible expression system WO2023050644A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111150967.4A CN115873125A (en) 2021-09-29 2021-09-29 Chimeric allosteric transcription factors, sets of regulatory elements and inducible expression systems for mammalian systems
CN202111150967.4 2021-09-29

Publications (1)

Publication Number Publication Date
WO2023050644A1 true WO2023050644A1 (en) 2023-04-06

Family

ID=85756104

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/071023 WO2023050644A1 (en) 2021-09-29 2022-01-10 Chimeric allosteric transcription factor for mammalian system, regulatory element group, and inducible expression system

Country Status (2)

Country Link
CN (1) CN115873125A (en)
WO (1) WO2023050644A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000009704A1 (en) * 1998-08-13 2000-02-24 Syngenta Limited Gene switch
US20100285584A1 (en) * 2006-11-03 2010-11-11 Ron Weiss Engineered cellular pathways for programmed autoregulation of differentiation
US20200143255A1 (en) * 2017-06-27 2020-05-07 Technion Research & Development Foundation Limited Dna-based neural network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000009704A1 (en) * 1998-08-13 2000-02-24 Syngenta Limited Gene switch
US20100285584A1 (en) * 2006-11-03 2010-11-11 Ron Weiss Engineered cellular pathways for programmed autoregulation of differentiation
US20200143255A1 (en) * 2017-06-27 2020-05-07 Technion Research & Development Foundation Limited Dna-based neural network

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DAER RENÉ, BARRETT CASSANDRA M., MELENDEZ ERNESTO LUNA, WU JIAQI, TEKEL STEFAN J., XU JIMMY, DENNISON BRADY, MULLER RYAN, HAYNES K: "Characterization of diverse homoserine lactone synthases in Escherichia coli", PLOS ONE, vol. 13, no. 8, 23 August 2018 (2018-08-23), pages e0202294, XP093054645, DOI: 10.1371/journal.pone.0202294 *
DU PEI, ZHAO HUIWEI, ZHANG HAOQIAN, WANG RUISHA, HUANG JIANYI, TIAN YE, LUO XUDONG, LUO XUNXUN, WANG MIN, XIANG YANHUI, QIAN LONG,: "De novo design of an intercellular signaling toolbox for multi-channel cell–cell communication and biological computation", NATURE COMMUNICATIONS, vol. 11, no. 1, 24 August 2020 (2020-08-24), pages 4226, XP093054640, DOI: 10.1038/s41467-020-17993-w *
JIANG, H. X. ET AL.: "Metabolic Engineering of the Phenylpropanoid Pathway in Saccharomyces cerevisiae", APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 71, no. 6, 30 June 2005 (2005-06-30), XP008053136, DOI: 10.1128/AEM.71.6.2962-2969.2005 *

Also Published As

Publication number Publication date
CN115873125A (en) 2023-03-31

Similar Documents

Publication Publication Date Title
Zhou et al. Optimization of the Tet-On system for regulated gene expression through viral evolution
Mullick et al. The cumate gene-switch: a system for regulated expression in mammalian cells
Weber et al. Macrolide-based transgene control in mammalian cells and mice
JP4585435B2 (en) Circular DNA molecules with conditional origins of replication, methods for their production, and their use in gene therapy
AU781628B2 (en) Recombinase-based methods for producing expression vectors and compositions for use in practicing the same
Damke et al. [24] Tightly regulated and inducible expression of dominant interfering dynamin mutant in stably transformed HeLa cells
Weber et al. Streptomyces‐derived quorum‐sensing systems engineered for adjustable transgene expression in mammalian cells and mice
EP3858992A1 (en) A three-component crispr/cas complex system and uses thereof
WO2017215619A1 (en) Fusion protein producing point mutation in cell, and preparation and use thereof
Attallah et al. A highly efficient modified human serum albumin signal peptide to secrete proteins in cells derived from different mammalian species
JPH11506901A (en) Tetracycline-regulated transcription modulator
CN107523588B (en) Tetracycline lentivirus induced expression vector and establishment method and application thereof
CN110551753B (en) Construction of doxycycline/mifepristone induced over-expression double-induced expression vector with fluorescent protein marker gene
EP3152326B1 (en) Low-leakage cellular biosensor system
Karlsson et al. De novo design and construction of an inducible gene expression system in mammalian cells
JP2003521941A (en) vector
WO2023050644A1 (en) Chimeric allosteric transcription factor for mammalian system, regulatory element group, and inducible expression system
JP2019505209A (en) Gene modification assay
Zhang et al. Conditional control of suicide gene expression in tumor cells with theophylline-responsive ribozyme
SG186788A1 (en) Dna element having the activity of enhancing foreign gene expression
JP2023528291A (en) Co-regulatory sequences based on tetracycline and Comate
Zhang et al. Construction of an inducible stable cell line for efficient incorporation of unnatural amino acids in mammalian cells
US7273723B2 (en) Antibiotic-based gene regulation system
Sun et al. The piggyBac-based double-inducible binary vector system: a novel universal platform for studying gene functions and interactions
RU2712838C1 (en) Gene-therapeutic dna-vector gdtt1.8nas12, method for its preparation, escherichia coli jm110-nas strain, a method for its preparation, escherichia coli jm110-nas/gdtt1.8nas12 strain, carrying a gene-therapeutic dna-vector gdtt1.8nas12, a method for production thereof, a method for industrial production of a gene-therapeutic dna vector

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22874047

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE