WO2024092769A1 - Pili à liaison covalente modifiée et bactéries recombinantes les comprenant - Google Patents

Pili à liaison covalente modifiée et bactéries recombinantes les comprenant Download PDF

Info

Publication number
WO2024092769A1
WO2024092769A1 PCT/CN2022/130033 CN2022130033W WO2024092769A1 WO 2024092769 A1 WO2024092769 A1 WO 2024092769A1 CN 2022130033 W CN2022130033 W CN 2022130033W WO 2024092769 A1 WO2024092769 A1 WO 2024092769A1
Authority
WO
WIPO (PCT)
Prior art keywords
polypeptide
gca
carrier protein
seq
spa2
Prior art date
Application number
PCT/CN2022/130033
Other languages
English (en)
Inventor
Chao Zhong
Yuanyuan Huang
Original Assignee
Shenzhen Institute Of Advanced Technology Chinese Academy Of Sciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Institute Of Advanced Technology Chinese Academy Of Sciences filed Critical Shenzhen Institute Of Advanced Technology Chinese Academy Of Sciences
Priority to PCT/CN2022/130033 priority Critical patent/WO2024092769A1/fr
Publication of WO2024092769A1 publication Critical patent/WO2024092769A1/fr

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/34Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Corynebacterium (G)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K19/00Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins

Definitions

  • the present disclosure relates to biological engineering.
  • the present disclosure relates to engineered bacteria, such as Corynebacterium glutamicum comprising modified covalently-linked pili (CLP) .
  • CLP covalently-linked pili
  • the engineered living materials relate to engineered biomaterials with distinctive “living” attributes such as autonomous growth, self-healing and environmental responsiveness that are only found in natural living materials, a wide range of remarkable ELMs had been developed for the applications in biosensors, bioremediation, biomedicine, biomanufacturing, wearable devices, and electronics.
  • ELMs can be produced either by harnessing engineered cells to simultaneously make the material and incorporate novel functionalities into it (known as self-organizing living materials or biological ELMs) or by embedding living cells in an organic or inorganic matrix (referred to as hybrid living materials) .
  • Self-organizing living materials aim to recapitulate the autonomous, adaptive, and versatile properties of natural living materials, and represent opportunities to harness engineered biological systems for new capabilities.
  • Some Gram-positive bacteria comprise covalently-linked pili (CLP) .
  • CLP covalently-linked pili
  • the CLP monomer subunits are typically joined via intermolecular isopeptide bond catalyzed by sortase conferring enormous tensile strength (McConnell, S. A. et al., Protein labeling via a specific lysine-isopeptide bond using the pilin polymerizing sortase from Corynebacterium diphtheriae. J. Am.
  • the CLP subunits contain auto-catalyzed intramolecular isopeptide bonds that are less susceptible to proteolytic cleavage and can dissipate mechanical energy (Ramirez, N.A. et al., 2020) imparting the robustness of CLP.
  • several pilin proteins in the CLP structure of different strains contain additional disulfide bonds that further enhance stability (Kang, H. J. et al., The Corynebacterium diphtheriae shaft pilin SpaA is built of tandem Ig-like modules with stabilizing isopeptide and disulfide bonds. Proc. Natl. Acad. Sci. U.S.A. 106, 16967-16971, 2009) .
  • the inventors develop an integrative technological platform for ELMs based on the discovary of the biosynthetic gene cluster (BGC) of the covalently-linked pili (CLP) fiber in the industrial workhorse Corynebacterium glutamicum.
  • BGC biosynthetic gene cluster
  • CLP covalently-linked pili
  • the present disclosure provides a fusion polypeptide comprising a carrier protein and a polypeptide of interest, wherein the polypeptide of interest is fused to a terminus of the carrier protein or inserted into the carrier protein, and wherein the carrier protein is a pilin of covalently-linked pili (CLP) from a microorganism.
  • CLP covalently-linked pili
  • the microorganism is a gram-positive bacterium, such as a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
  • the carrier protein is a major pilin.
  • the polypeptide of interest is fused to a terminus of the carrier protein. In some embodiments, the polypeptide of interest is fused to the N terminus of the carrier protein.
  • the polypeptide of interest is inserted into the carrier protein. In some embodiments, the polypeptide of interest is inserted into a loop in the carrier protein.
  • the carrier protein is a major pilin from Corynebacterium glutamicum.
  • the polypeptide of interest is inserted into the M domain of the major pilin.
  • the polypeptide of interest replaces the M domain of the major pilin or a part thereof.
  • the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4.
  • the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between positions corresponding to G215 and L216 of SEQ ID NO: 1, between positions corresponding to G236 and E237 of SEQ ID NO: 1, or between positions corresponding to G336 and T337 of SEQ ID NO: 1.
  • the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1.
  • the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between G215 and L216, between G236 and E237, or between G336 and T337 of SEQ ID NO: 1.
  • the present disclosure provides a polynucleotide encoding the fusion polypeptide of the present disclosure, and a vector comprising the polynucleotide, as well as a host cell comprising the polypeptide, the polynucleotide or the vector of the present disclosure.
  • the present disclosure provides a recombinant cell comprising a polynucleotide encoding a fusion polypeptide, wherein the fusion polypeptide comprises a carrier protein and a polypeptide of interest, wherein the polypeptide of interest is fused to a terminus of the carrier protein or inserted into the carrier protein, wherein the carrier protein is a pilin of CLP, and wherein the recombinant cell is capable of expressing the polynucleotide and displaying a modified CLP comprising the fusion polypeptide.
  • the recombinant cell is a gram-positive bacterium, such as a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
  • the carrier protein is a major pilin.
  • the polypeptide of interest is fused to a terminus of the carrier protein. In some embodiments, the polypeptide of interest is fused to the N terminus of the carrier protein.
  • the polypeptide of interest is inserted into the carrier protein. In some embodiments, the polypeptide of interest is inserted into a loop in the carrier protein.
  • the carrier protein is a major pilin from Corynebacterium glutamicum.
  • the polypeptide of interest is inserted into the M domain of the major pilin.
  • the polypeptide of interest replaces the M domain of the major pilin or a part thereof.
  • the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4.
  • the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between positions corresponding to G215 and L216 of SEQ ID NO: 1, between positions corresponding to G236 and E237 of SEQ ID NO: 1, or between positions corresponding to G336 and T337 of SEQ ID NO: 1.
  • the carrier protein comprises amino acids 35-509 of SEQ ID NO: 1, and the polypeptide of interest is fused to the N terminus of carrier protein, or is inserted between G215 and L216, between G236 and E237, or between G336 and T337 of SEQ ID NO: 1.
  • the recombinant cell comprises two or more polynucleotide respectively encoding two or more fusion polypeptides each comprising a different polypeptide of interest, and the modified CLP comprises the two or more polypeptides.
  • the present disclosure provides a method of preparing the recombinant cell of present disclosure, comprising introducing a polynucleotide encoding the fusion polypeptide of the present disclosure into a host cell derived from a microorganism having CLP.
  • the host cell is knock-out of native major pilin.
  • the method comprises a step of native major pilin knock-out.
  • the present disclosure provides a modified covalently-linked pili (CLP) comprising a plurality of the fusion polypeptides of the present disclosure.
  • CLP covalently-linked pili
  • the present disclosure provides a method of preparing a modified CLP comprising the steps of
  • the fusion polypeptide is provided by transcribing and/or translalting the polynucleotide of the present disclosure.
  • the activity of sortase is provided by transcribing and/or translalting one or more polynucleotides encoding a sortase.
  • the sortase is encoded by a gene which is identified to be present in the same cluster with the gene encoding the carrier protein in nature.
  • the sortase is class C type sortase, such as srtC1 and/or srtC2, preferably wherein the srtC1 and srtC2 are encoded by genes from the same cluster.
  • the method is an in vitro method.
  • the present disclosure provides a polynucleotide construct or a combination of polynucleotide constructs comprising the polynucleotide of the present disclosure, and one or more polynucleotides encoding a sortase.
  • the sortase is encoded by a gene which is identified to be present in the same cluster with the gene encoding the carrier protein in nature.
  • the sortase is class C type sortase, such as srtC1 and/or srtC2, preferably wherein the srtC1 and srtC2 are encoded by genes from the same cluster.
  • Fig. 1 shows the map of plasmid pEK-spa2.
  • Fig. 2 shows the workflow for constructing the tandem of two cassettes.
  • Fig. 3 shows the maps of plasmids comprising the tandem of two cassettes.
  • Fig. 4 shows the map of plasmid pZ9-dxs_crtEBI.
  • Fig. 5 shows the map of plasmid pET-28a-Spa2.
  • Fig. 6 shows the Cg CLP biosynthetic gene cluster (BGC) encoding the sortase genes srtC1 and srtC2, and the sortase-catalyzed pilin genes spa1, spa2, and spa3.
  • BGC Cg CLP biosynthetic gene cluster
  • Fig. 7 is the TEM and AFM images showing that the major pilin Spa2 is indispensable for Cg CLP fiber structure formation.
  • the bars in the TEM and AFM images are 200 nm and 400 nm, respectively
  • Fig. 8 shows the identification of the composition of CLP in C. glutamicum (CgCLP) by immunogold labelling.
  • the cartoon shows that Cg CLP fibers comprise two minor pilins (Spa1 and Spa3) and a major pilin of Spa2.
  • the immunogold labelling and TEM images show the constitution and distribution of Cg CLP pilins indicating that Spa2 is the major pilin.
  • For single immunogold labelling of Cg CLP with primary polyclonal antibodies of Spa1, Spa2, and Spa3 ( ⁇ -Spa1, ⁇ -Spa2, and ⁇ -Spa3, respectively) ; gold-decorated goat anti-rabbit IgG was used as the secondary antibody for labelling target pilin.
  • Fig. 9 shows the deletion of both the srtC1 and srtC2 genes abrogates pili formation.
  • the bars in the TEM (a) and AFM (b) images are 200 nm and 400 nm, respectively.
  • ⁇ -Spa2 is the primary antibody
  • the 10 nm gold-decorated goat anti-rabbit IgG is the secondary antibody.
  • Each ELISA experiment was performed at least in triplicate, and the standard error was shown.
  • Fig. 10 shows the isolation of Cg CLP fibers for mass spectrometry analysis.
  • SDS-PAGE gel electrophoresis analysis of the nickel affinity chromatography purified Cg CLP fibers showed the high-molecular Cg CLP polymers were eluted under 100 mM imidazole.
  • Fig. 11 shows the identification of intermolecular isopeptide bonds for the polymerization of Spa2 monomers in Cg CLP. Fragmentation spectra of the parent ion at m/z 832.9 2+ containing the intermolecular isopeptide bond (green font) between Spa2 i Lys194 (blue font) and Spa2 i+1 Thr477 (red font) are shown.
  • Fig. 12 shows the liquid chromatography-tandem mass spectrometry (LC-MS/MS) identifies the signal peptide of Spa2.
  • the cartoon shows the amino acid sequence of Spa2 cut (replacing the 470-509 residues at the C-terminus of Spa2 with 6His) , enabling the Spa2 monomer not to be polymerized and to be secreted as a monomer in the medium.
  • SDS-PAGE gel electrophoresis indicates the purified Spa2 cut .
  • the LC-MS/MS identified that the residues 1-34 at the N-terminus of Spa2 are the signal peptide.
  • This figure shows an MS/MS spectrum of the peptide with m/z 916.4538 2+ generated from chymotrypsin digest of Spa2.
  • Predicted b-and y-type ions (not all included) are listed above and below the peptide sequence, respectively. Matched ions are labelled in the spectrum.
  • Fig. 13 shows the Quadrupole time-of-flight mass spectrometry measured the accurate molecular weight of Spa2 cut .
  • the measured molecular weight is ⁇ 54.7 Da less than the calculated value of Spa2 cut , indicating that three intramolecular isopeptide bonds and two disulfide bonds exist in the monomeric Spa2.
  • An intramolecular isopeptide bond formation will lose one molecule of ammonia, ⁇ 17 Da; A disulfide bond formation will lose two hydrogen atoms, ⁇ 2 Da.
  • Fig. 14 shows crystals of Spa2 diffracted to resolution on the BL18U1 beamline at the Shanghai Synchrotron Radiation Facility (Shanghai, China) .
  • Fig. 15 shows the X-ray crystal structure of Spa2 which is arranged in three tandem Ig-like domains, N-domain (pink) , M-domain (blue) , and C-domain (green) . Residues involved in the formation of three intramolecular isopeptide bonds (yellow) and two disulfide bonds (red) are shown as sticks.
  • Fig. 16 shows the comparison of Spa2 in the crystal structure with the prediction from AlphaFold2 and crystal structure of 3HR6 and 4HSS.
  • C ⁇ alpha-carbon
  • RMSD root-mean-square deviation
  • Fig. 17 shows the Omit electron density maps showing the presence of internal covalent bonds in the crystal structure of Spa2.2mFo-DFc omit electron density maps of three isopeptide bonds (a) and two disulfide bonds (b) were shown in blue mesh, contoured at 1.0 ⁇ .
  • the omit electron density maps were generated using Phenix composite omit map.
  • Fig. 18 shows Identification of the disulfide bonds and intramolecular isopeptide bonds formation at appropriate sequence locations in Spa2 by LC-MS/MS analysis.
  • the cartoon shows the critical features in Spa2, including three intramolecular isopeptide bonds in individual domains, two disulfide bonds in the N-domain (C97-C128) and the C-domain (C380-C432) , the pilin motif of YPKN in N-domain, and the sortase cleavage sorting signal motif of LPLTG in C-domain.
  • Figs. 19 and 20 show the genetic manipulation in ⁇ spa2 strains (harboring a plasmid that expressed Spa2 or Spa2 variants of K194A, LPLTG 474LALAA478 , E158A, D246A, E435A, D246A/E435A, C97A, C380A, and C97A/C380A, respectively) to assess the key residues promoting the formation of inter-and intra-molecular isopeptide bonds, and disulfide bonds, in Spa2 by TEM bio-imaging (Fig. 19) and quantitative analysis of the amount of Cg CLP fiber by whole-cell filtration ELISA (detection by anti-Spa2 antibody) (Fig. 20) .
  • Results are presented as mean ⁇ s.d in Fig. 20.
  • Not significant (NS) P >0.05, *P ⁇ 0.05, **P ⁇ 0.01, ***P ⁇ 0.001, ****P ⁇ 0.0001.
  • Statistics were derived using a t-test. The bars in Fig. 19 are 200nm.
  • Fig. 21 shows the accurate molecular weight of Spa2 cut mutant variants determined by quadrupole time-of-flight mass spectrometry.
  • the measured molecular weight of E158A cut (a) , D246A cut (b) , E435A cut (c) , and D246A/E435A cut (d) are ⁇ 54.9, 37.3, 21.4, and 4.0 Da less than the calculated value of related variants, indicating that three, two, one and no intramolecular isopeptide bonds are retained in the corresponding monomeric mutants, respectively.
  • Spa2cut mutant variants E158A cut , D246A cut , E435A cut , and D246A/E435A cut were expressed in ⁇ spa2 and purified by nickel-affinity chromatography.
  • Fig. 22 shows the rational engineering of the Cg CLP protein scaffold through a modular genetic design strategy: the cartoon shows a polymerized Spa2 major pilin functionalized by incorporating a protein-of-interest (POI) (e.g., mCherry, a fluorescent reporter protein) at candidate insertion sites (including Q35 (E1) at the N-terminus, and G215 (E2) , G236 (E3) and G336 (E4) in the M-domain lacking a disulfide bond) based on structural verification.
  • POI protein-of-interest
  • Fig. 24 shows the TEM morphologies of the assembled mCherry-Spa2 fusion proteins associated with cell surfaces based on immunogold labelling.
  • TEM images of ⁇ spa2 cells (a) , E1 cells (b) , E2 cells (c) , E3 cells (d) and E4 cells (e) .
  • the TEM samples were collected from the ⁇ spa2 strain harboring a plasmid that expresses various mCherry-Spa2 fusions under the native constitutive promoter of the spa2 gene.
  • ⁇ -Spa2 is the primary antibody
  • the 10 nm gold-decorated goat anti-rabbit IgG is the secondary antibody. Scale bars, 200 nm.
  • Fig. 25 shows the extracellular secretion and assembly of R-Spa2 pilins into CgCLP fiber at the cell-surfaces of engineered C. glutamicum cells: a series of R-Spa2 fusion protein constructs comprising functional R peptides/proteins with different amino acid sequences.
  • Fig. 27 shows the Functional characterization of engineered Cg CLP with various fusion domains.
  • (a) TEM images showed that Ni-NTA-decorated AuNPs were anchored onto 6His-Spa2 Cg CLP.
  • (b) Confocal microscopic images showed the green fluorescence emitted from SpyTag-Spa2 Cg CLP cells to which SpyCatcher-EGFP protein binding partners were covalently attached via Spytag-SpyCatcher interaction pairs.
  • (c) Confocal microscopic images show the green fluorescence emitted from SpyCatcher-Spa2 Cg CLP cells to which SpyTag-EGFP protein binding partners were covalently attached via Spytag-SpyCatcher interaction pairs.
  • Fig. 28 shows the schematic showing simultaneous expression of the two Spa2 pilin fusion proteins, N-Ven-Spa2 and C-Ven-Spa2 (N-Ven-Spa2+C-Ven-Spa2 strain) , containing the N-terminus (N-Ven) and C-terminus (C-Ven) module of the split-Venus system, resulting in co-assembly of the split-Venus components into the final functional Cg CLP structures.
  • Fig. 29 shows the TEM morphologies of the assembled split-Venus components fused with Spa2 associated with cell surfaces based on immunogold labelling.
  • N-Ven+C-Ven cells expressing co-secreted split-Venus system (a) , N-Ven-Spa2 cells expressing the Spa2 pilin fusion protein of N-Venus-Spa2 (b) , C-Ven-Spa2 cells expressing the Spa2 pilin fusion protein of C-Venus-Spa2 (c) , and N-Ven-Spa2+C-Ven-Spa2 cells for simultaneous expression of two Spa2 pilin fusion proteins, N-Ven-Spa2 and C-Ven-Spa2 (d) .
  • TEM samples were collected from the ⁇ spa2 strain harboring a plasmid that expresses various Spa2 fusion proteins under the native constitutive promoter of the spa2 gene.
  • ⁇ -Spa2 is the primary antibody
  • 10 nm gold-decorated goat anti-rabbit IgG is the secondary antibody.
  • Scale bars 200 nm.
  • Fig. 30 shows the co-assembly of split-Venus components into the Cg CLP fibers leading to increased fluorescence intensity.
  • the engineered C. glutamicum cells show greater fluorescence intensity only in the N-Ven-Spa2+C-Ven-Spa2 strain, and
  • (b) confocal microscopy of C. glutamicum cells showing that the strongest Venus fluorescence signal appeared at the extracellular sites of the N-Ven-Spa2+C-Ven-Spa2 strain (scale bar 2 ⁇ m) .
  • Fig. 31 shows the schematic illustrating of engineered C. glutamicum living materials transforming cellulosic biomass into a value-added product of lycopene by combining the extracellular cellulose degradation capacity and intracellular bioconversion ability.
  • extracellular cellulose degradation (Step1) , endo-1, 4- ⁇ -glucanase from T. reesei (TrEgl) and a ⁇ -glucosidase from S.
  • SdBgl Spa2 pilin
  • TrEgl-Spa2+SdBgl-Spa2 Spa2 pilin
  • Step2 the glucose was used for lycopene production in the pathway engineered C. glutamicum of C003 strain by inducing IPTG.
  • G3P glyceraldehyde-3-phosphate
  • IPP isopentenyl phosphate.
  • Fig. 32 shows the lycopene production from biowastes with engineered C. glutamicum harboring modified CLPs.
  • a TEM images show that cells of C003, which contain the P2 plasmid, enabled co-assembly of TrEgl and SdBgl into Cg CLP structure, while the cells of C001, C002, and C004 did not.
  • Cg CLP was labeled with 10 nm gold particles by immunogold labelling. Scale bars, 200 nm.
  • ELMs can degrade CMC-Na in a medium from a viscous gel to a thin solution only when both TrEgl and SdBgl were co-assembled into the CgCLP structure (TrEgl-Spa2+SdBgl-Spa2, C003 strain) , outperforming the case of the secreted free enzymes (TrEgl+SdBgl, C004 strain) .
  • ⁇ spa2 ⁇ dec (C001 strain) is the negative control strain.
  • the C003 strain showed 4-fold higher enzymeactivity than the C004 strain.
  • covalently-linked pili or “CLP” refers to pili in which the monomers are linked to each other via covalent bonds.
  • the engineered living materials herein refers to the pili formed by the engineered monomers, i.e., the fusion polypeptide of the present disclosure, or recombinant bacterium forming the pili.
  • C. glutamicum a Gram-positive bacterium
  • GRAS general regarded as safe
  • peptide can be exchanged with “polypeptide” and “protein” , means a chain comprising at least two amino acids linked by peptide bond, such as ten or more amino acid residues.
  • the chemical formulas or sequences of all the peptides and polypeptide herein are written in left-to-right order, showing the direction from the amino terminal to the carboxyl terminal.
  • “Peptide” , “polypeptide” and “protein” can include, but are not limited to, an enzyme, an antibody, a hormone, a ligand, a receptor, etc.
  • amino acid includes amino acids naturally occurred in proteins and the unnatural amino acids.
  • the conventional nomenclature one-letter and three-letter of the amino acids naturally occurred in proteins is employed, which can be seen in Sambrook, et al. (Molecular Cloning: A Laboratory Manual, 2nd, ed. Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1989) .
  • fusion polypeptide is a recombinant product comprising two or more peptide fragments which are not present in a single natural polypeptide.
  • the fragments can be fused directly or via a linker, such as a flexible linker, e.g., GS linkers.
  • a fusion polypeptide can be produced by the expression of a polynucleotide comprising nucleotide sequences encoding the two or more peptide fragments and the linker, if present, in desired order.
  • polynucleotide usually refers to generally a nucleic acid molecule (e.g., 100 nucleotides and up to 30k nucleotides in length) and a sequence that is either complementary (antisense) or identical (sense) to the sequence of a messenger RNA (mRNA) or miRNA fragment or molecule.
  • mRNA messenger RNA
  • miRNA fragment or molecule usually refers to DNA or RNA molecules that are either transcribed or non-transcribed.
  • polynucleotide construct refers to a single-stranded or double-stranded polynucleotide, which is isolated from a naturally occurring gene or modified to contain a nucleic acid segment that does not naturally occur.
  • polynucleotide construct contains the control sequences required to express the coding sequence of the present disclosure, the polynucleotide construct comprises an “expression cassette” .
  • exogenous polynucleotide refers to a nucleotide sequence that does not originate from the host in which it is placed. It may be identical or heterologous to the host’s DNA. An example is a sequence of interest inserted into a vector. Such exogenous DNA sequences may be derived from a variety of sources including DNA, cDNA, synthetic DNA, and RNA. Exogenous polynucleotides also encompass DNA sequences that encode antisense oligonucleotides.
  • expression cassette refers to a polynucleotide segment comprising a polynucleotide encoding a polypeptide operably linked to additional nucleotides provided for the expression of the polynucleotide, for example, control sequence.
  • the term “encoding” means that a polynucleotide directly specifies the amino acid sequence of its protein product.
  • the boundaries of the coding sequence are generally determined by an open reading frame, which generally starts with the ATG start codon or other start codons such as GTG and TTG, and ends with a stop codon such as TAA, TAG and TGA.
  • the coding sequence can be a DNA, cDNA or recombinant nucleotide sequence.
  • expression includes any step involved in the production of a polypeptide, including but not limited to transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
  • control sequence includes all elements necessary or beneficial for the expression of the polynucleotide encoding the polypeptide of the present disclosure.
  • Each control sequence may be natural or foreign to the nucleotide sequence encoding the polypeptide, or natural or foreign to each other.
  • control sequences include, but are not limited to, leader sequence, polyadenylation sequence, propeptide sequence, promoter, enhancer, signal peptide sequence, and transcription terminator.
  • control sequences include a promoter and signals for the termination of transcription and translation.
  • control sequence may be a suitable promoter sequence, a nucleotide sequence recognized by the host cell to express the polynucleotide encoding the polypeptide of the present disclosure.
  • the promoter sequence contains a transcription control sequence that mediates the expression of the polypeptide.
  • the promoter may be any nucleotide sequence that exhibits transcriptional activity in the selected host cell, for example, lac operon of E. coli.
  • the promoters also include mutant, truncated and hybrid promoters, and can be obtained from genes encoding extracellular or intracellular polypeptides, which are homologous or heterologous to the host cell.
  • operably linked refers to a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of the polynucleotide sequence, whereby the control sequence directs the expression of the polypeptide coding sequence.
  • the polynucleotide encoding a polypeptide of interest can be subjected to various manipulations to improve the expression of the polypeptide. Before the insertion thereof into a vector, manipulation of the polynucleotide according to the expression vector or the host, such as codon optimization, is desirable or necessary. Techniques for modifying polynucleotide sequences with recombinant DNA methods are well known in the art.
  • recombinant refers to nucleic acids, vectors, polypeptides, or proteins that have been generated using DNA recombination (cloning) methods and are distinguishable from native or wild-type nucleic acids, vectors, polypeptides, or proteins.
  • hybridization that nucleotides sequences, which are at least about 90%, preferably at least about 95%, more preferably at least about 96%, and more preferably at least 98%homologous to each other, generally maintain hybridization with each other under given stringent hybridization and washing conditions.
  • the sequences are aligned for the purpose of optimal comparison (e.g., a gap can be introduced into the first amino acid or nucleic acid sequence for the optimal alignment with the second amino acid or nucleic acid sequence) . Then, the amino acid residues or nucleotides at the corresponding amino acid positions or nucleotide positions are compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide at the corresponding position in the second sequence, these molecules are identical at this position.
  • the two sequences are identical in length.
  • Identity percentage or “sequence identity percentage” refers to the comparison between the amino acids of two polypeptides or nucleotides between two polynucleotides, and when optimally aligned, the two polypeptides or polynucleotides have approximately the specified percentage of identical amino acids.
  • 95% identity refers to the comparison between the amino acids of two polypeptides or nucleotides between two polynucleotides, and when optimally aligned, 95%of the amino acids in the two polypeptides or 95%of the nucleotides in the two polynucleotides are identical.
  • polynucleotide of the present disclosure does not include a polynucleotide that only hybridizes to a poly A sequence (such as the 3' end poly (A) of mRNA) or a complementary stretch of poly T (or U) residues.
  • the term “host cell” refers to, for example microorganisms, yeast cells, insect cells, and mammalian cells, that can be, or have been, used as recipients of vectors.
  • the term includes the progeny of the original cell which has been transduced.
  • a “host cell” as used herein generally refers to a cell which has been transduced with an exogenous DNA sequence. It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic or total DNA complement to the original parent, due to natural, accidental, or deliberate mutation.
  • Spa2 protein is identified as the major pilin of the CLP fiber structure.
  • structure-guided design the inventor developed a new type of engineerable extracellular protein scaffold that can be genetically appended with diverse functional peptides or proteins at multiple sites of Spa2 protein.
  • the present disclosure provides a fusion polypeptide comprising a carrier protein and a polypeptide of interest, wherein the polypeptide of interest is fused to a terminus of the carrier protein or inserted into the carrier protein, and wherein the carrier protein is a pilin of covalently-linked pili (CLP) from a microorganism.
  • CLP covalently-linked pili
  • the microorganism is a gram-positive bacterium, such as a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
  • the bacterium can include, but are not limited to, a bacterium selected from Corynebacterium glutamicum strain BE (GenBank assembly accession: GCA_013046805.1) , Corynebacterium glutamicum ATCC 14067 (GenBank assembly accession: GCA_002243555.1) , Corynebacterium glutamicum strain YI (GenBank assembly accession: GCA_001643035.1) , Corynebacterium glutamicum strain ATCC 13869 (GenBank assembly accession: GCA_001687645.1) , Corynebacterium glutamicum AJ1511 (GenBank assembly accession: GCA_002355675.1) , Corynebacterium glutamicum strain XV (GenBank assembly accession: GCA_001936195.1) , Corynebacterium glutamicum strain CP (GenBank assembly accession: GCA_001447865.2) , Corynebacterium glutamicum R (GenBank assembly accession: GCA
  • cremoris NZ9000 GenBank assembly accession: GCA_000143205.1
  • cremoris MG1363 GenBank assembly accession: GCA_000009425.1
  • cremoris A76 (GenBank assembly accession: GCA_000236475.1) , Lactococcus lactis strain SRCM103457 (GenBank assembly accession: GCA_004194355.1) , Lactococcus lactis strain CBA3619 (GenBank assembly accession: GCA_007954765.1) , Lactococcus lactis strain WiKim0098 (GenBank assembly accession: GCA_016406265.1) , Lactococcus lactis strain K_LL005 (GenBank assembly accession: GCA_014334715.1) , Lactococcus lactis subsp.
  • lactis strain G121 (GenBank assembly accession: GCA_013395015.1) , Lactococcus lactis strain N8 (GenBank assembly accession: GCA_014884605.1) , Lactococcus lactis subsp. lactis IO-1 (GenBank assembly accession: GCA_000344575.1) , Lactococcus lactis subsp. lactis strain F44 (GenBank assembly accession: GCA_002804185.1) , Lactococcus lactis subsp. lactis bv.
  • Lactococcus lactis strain S50 (GenBank assembly accession: GCA_003627395.2) , Lactococcus lactis strain FDAARGOS_1064 (GenBank assembly accession: GCA_016127135.1) , Lactococcus lactis strain FDAARGOS_887 (GenBank assembly accession: GCA_016027975.1) , Lactococcus lactis subsp.
  • lactis strain UC77 (GenBank assembly accession: GCA_002078615.2) , Lactococcus lactis strain FDAARGOS_866 (GenBank assembly accession: GCA_016028815.1) , Lactococcus lactis strain IL1403 (GenBank assembly accession: GCA_003722275.1) , Lactococcus lactis strain FDAARGOS_865 (GenBank assembly accession: GCA_016028835.1) , Lactococcus lactis subsp.
  • cremoris IBB477 (GenBank assembly accession: GCA_001856165.1) , Lacticaseibacillus paracasei strain TD 062 (GenBank assembly accession: GCA_009834405.1) , Lacticaseibacillus paracasei strain HM1 (GenBank assembly accession: GCA_018064185.1) , Bacillus thuringiensis strain FDAARGOS_794 (GenBank assembly accession: GCA_013267795.1) , Bacillus thuringiensis strain XL6 (GenBank assembly accession: GCA_000774075.2) , Bacillus thuringiensis strain Bt-GS57 (GenBank assembly accession: GCA_017751245.1) , Bacillus thuringiensis strain HER1410 (GenBank assembly accession: GCA_013340745.1) , Bacillus thuringiensis serovar tolworthi (GenBank assembly accession: GCA_001548175.1) , Bac
  • tolerans strain MGB0734 (GenBank assembly accession: GCA_015476135.1) , Lacticaseibacillus paracasei subsp. tolerans strain MGB0747 (GenBank assembly accession: GCA_015476175.1) , Lacticaseibacillus paracasei strain CBA3611 (GenBank assembly accession: GCA_007292115.1) , Lacticaseibacillus paracasei subsp. paracasei strain GR0548 (GenBank assembly accession: GCA_019175405.1) , Lacticaseibacillus paracasei subsp.
  • tolerans strain MGB0625 (GenBank assembly accession: GCA_015476155.1) , Lacticaseibacillus paracasei strain 10266 (GenBank assembly accession: GCA_008329845.1) , Lacticaseibacillus paracasei subsp. tolerans strain S-NB (GenBank assembly accession: GCA_016757695.1) , Lacticaseibacillus paracasei strain Lp02 (GenBank assembly accession: GCA_013307125.1) , Lacticaseibacillus paracasei strain ZFM54 (GenBank assembly accession: GCA_003627255.1) , Lacticaseibacillus paracasei subsp.
  • paracasei strain BD5115 GenBank assembly accession: GCA_018596415.1
  • Paracasei JCM 8130 GenBank assembly accession: GCA_000829035.1
  • Corynebacterium glutamicum ATCC 14067 preferably, Corynebacterium glutamicum ATCC 14067.
  • the carrier protein is a major pilin.
  • the fusion of insertion of the polypeptide of interest does not influence the formation of intermolecular isopeptide bond, disulfide bond, or intramolecular isopeptide bond in the carrier protein.
  • the polypeptide of interest is fused to a terminus of the carrier protein. In some embodiments, the polypeptide of interest is fused to the N terminus of the carrier protein.
  • the polypeptide of interest is inserted into the carrier protein. In some embodiments, the polypeptide of interest is inserted into a loop in the carrier protein.
  • the carrier protein is a major pilin from Corynebacterium glutamicum (Spa2 protein) . It is observed that the Spa2 protein (SEQ ID NO: 1) comprises three tandem Ig-like domains, including N-domain (residues 36-197) , M-domain (residues 198-343) , and C-domain (residues 344-469) which is consistent with other major pilin. It is also observed that the deletion of M-domain does not influence the formation of CLP.
  • the polypeptide of interest is inserted into the M domain of the major pilin. In some embodiments, the polypeptide of interest replaces the M domain of the major pilin or a part thereof.
  • the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to SEQ ID NO: 1, 2, 3, or 4.
  • the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to SEQ ID NO: 1, 2, 3, or 4 with the residues corresponding to residues C97, C128, K194, C380, C432, and LPLTG (474-478) , and optionally E158, D246, and/or E435 of SEQ ID NO: 1 unchanged.
  • the carrier protein can be the mature form of SEQ ID NO: 1, 2, 3, or 4, i.e., with the deletion of the signal peptide.
  • the carrier protein comprises amino acids 36 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4.
  • the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4with the residues corresponding to residues C97, C128, E158, K194, D246, C380, C432, E435, and LPLTGT (474-478) , and optionally E158, D246, and/or E435 of SEQ ID NO: 1 unchanged.
  • the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between positions corresponding to G215 and L216 of SEQ ID NO: 1, between positions corresponding to G236 and E237 of SEQ ID NO: 1, or between positions corresponding to G336 and T337 of SEQ ID NO: 1.
  • the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1.
  • the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between G215 and L216, between G236 and E237, or between G336 and T337 of SEQ ID NO: 1.
  • the polypeptide of interest is directly linked to the N terminal of the carrier polypeptide. In some embodiments, the polypeptide of interest is linked to the N terminal of the carrier polypeptide via a peptide linker such as a flexible linker.
  • a peptide linker can be generally short peptides with about 4-20 or more amino acids, such as combinations of Ser and Gly residues, which is a conventional flexible linker.
  • the peptide linker used in the present disclosure is (G4S) 2 i.e., SEQ ID NO: 22.
  • the peptide linker is a C10 linker of SEQ ID NO: 23.
  • the polypeptide of interest can be selected according to the desired application of the fusion polypeptide.
  • the fusion polypeptide is provided to bind, capture or enrich a target molecule
  • the polypeptide of interest is a polypeptide that can recognize a target peptide, including but not limited to a ligand, a receptor, an antigen and an antibody such as scFV and nanobody.
  • the fusion polypeptide is provided to capture a protein comprising a SpyTag (SEQ ID NO: 37)
  • the polypeptide of interest comprises SpyCatcher (SEQ ID NO: 15) , vice versa.
  • the fusion polypeptide is provided as an adhesive agent, and the polypeptide of interest is an adhesive peptide, e.g., Mfp35 (SEQ ID NO: 38) .
  • the fusion polypeptide is provided to catalyze chemical or biochemical reactions, and the polypeptide of interest is an enzyme.
  • the fusion polypeptide is provided to degrade carbohydrates such as cellulose, and the polypeptide of interest can be the endo-1, 4- ⁇ -glucanase, e.g., from Trichoderma reesei (TrEgl, SEQ ID NO: 19) and/or ⁇ -glucosidase, e.g., from Saccharophagus degradans (SdBgl, SEQ ID NO: 21) .
  • the fusion polypeptide is provided to degrade refractory organics, such as plastics, and the polypeptide of interest is an enzyme responsible for the degradation, such as a PETase.
  • the present disclosure provides a polynucleotide encoding the fusion polypeptide of the present disclosure.
  • the polynucleotide of the present disclosure can be amplified with cDNA, mRNA or genomic DNA as the template and suitable oligonucleotide primers according to standard PCR amplification techniques.
  • the nucleic acid amplified as above can be cloned into a suitable vector and characterized by DNA sequence analysis.
  • the polynucleotide of the present disclosure can be prepared by standard synthesis techniques, for example, by using an automated DNA synthesizer.
  • a nucleic acid molecule that is complementary to other nucleotide sequence is a molecule that is sufficiently complementary to the nucleotide sequence so that it can hybridize with the other nucleotide sequences to form a stable duplex.
  • a polynucleotide construct and a vector comprising the polynucleotide of the present disclosure, such as an expression vector.
  • the polynucleotide of the present disclosure is operably linked to a promoter.
  • the promoter is a constitutive promoter, such as the native promoter driving Spa2 gene in Corynebacterium glutamicum.
  • the promoter is an inducible promoter.
  • the expression vector comprises a Lac operon.
  • the polynucleotide encoding the polypeptide of the present disclosure can be subjected to various manipulations to allow the expression of the polypeptide. Before the insertion thereof into a vector, manipulation of the polynucleotide according to the expression vector is desirable or necessary. Techniques for modifying polynucleotide sequences with recombinant DNA methods are well known in the art.
  • the vector of the present disclosure preferably contains one or more selectable markers, which allow simple selection of transformed, transfected, transduced, etc. cells.
  • a selectable marker is a gene, of which the product provides biocide or virus resistance, heavy metal resistance, supplemental auxotrophs, etc.
  • the bacterial selectable marker is the dal gene from Bacillus subtilis or Bacillus licheniformis, or a marker that confers antibiotic resistance such as ampicillin, kanamycin, chloramphenicol or tetracycline resistance.
  • the vector of the present disclosure can be integrated into the genome of the host cell or autonomously replicate in the cell, which is independent of the genome.
  • the elements required for the integration into the genome of the host cell or the autonomous replication are known in the art (see, for example, the aforementioned Sambrook et al., 1989) .
  • the present disclosure provides a recombinant cell comprising a polynucleotide encoding a fusion polypeptide, wherein the fusion polypeptide comprises a carrier protein and a polypeptide of interest, wherein the polypeptide of interest is fused to a terminus of the carrier protein or inserted into the carrier protein, wherein the carrier protein is a pilin of CLP, and wherein the recombinant cell is capable of expressing the polynucleotide and displaying a modified CLP comprising the fusion polypeptide.
  • the carrier protein in the fusion polypeptide is the native major pilin of the recombinant cell.
  • the recombinant cell is a recombinant gram-positive bacterium, such as a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
  • a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei preferably, Corynebacterium glutamicum.
  • the bacterium can include, but are not limited to, a bacterium selected from Corynebacterium glutamicum strain BE (GenBank assembly accession: GCA_013046805.1) , Corynebacterium glutamicum ATCC 14067 (GenBank assembly accession: GCA_002243555.1) , Corynebacterium glutamicum strain YI (GenBank assembly accession: GCA_001643035.1) , Corynebacterium glutamicum strain ATCC 13869 (GenBank assembly accession: GCA_001687645.1) , Corynebacterium glutamicum AJ1511 (GenBank assembly accession: GCA_002355675.1) , Corynebacterium glutamicum strain XV (GenBank assembly accession: GCA_001936195.1) , Corynebacterium glutamicum strain CP (GenBank assembly accession: GCA_001447865.2) , Corynebacterium glutamicum R (GenBank assembly accession: GCA
  • cremoris NZ9000 GenBank assembly accession: GCA_000143205.1
  • cremoris MG1363 GenBank assembly accession: GCA_000009425.1
  • cremoris A76 (GenBank assembly accession: GCA_000236475.1) , Lactococcus lactis strain SRCM103457 (GenBank assembly accession: GCA_004194355.1) , Lactococcus lactis strain CBA3619 (GenBank assembly accession: GCA_007954765.1) , Lactococcus lactis strain WiKim0098 (GenBank assembly accession: GCA_016406265.1) , Lactococcus lactis strain K_LL005 (GenBank assembly accession: GCA_014334715.1) , Lactococcus lactis subsp.
  • lactis strain G121 (GenBank assembly accession: GCA_013395015.1) , Lactococcus lactis strain N8 (GenBank assembly accession: GCA_014884605.1) , Lactococcus lactis subsp. lactis IO-1 (GenBank assembly accession: GCA_000344575.1) , Lactococcus lactis subsp. lactis strain F44 (GenBank assembly accession: GCA_002804185.1) , Lactococcus lactis subsp. lactis bv.
  • Lactococcus lactis strain S50 (GenBank assembly accession: GCA_003627395.2) , Lactococcus lactis strain FDAARGOS_1064 (GenBank assembly accession: GCA_016127135.1) , Lactococcus lactis strain FDAARGOS_887 (GenBank assembly accession: GCA_016027975.1) , Lactococcus lactis subsp.
  • lactis strain UC77 (GenBank assembly accession: GCA_002078615.2) , Lactococcus lactis strain FDAARGOS_866 (GenBank assembly accession: GCA_016028815.1) , Lactococcus lactis strain IL1403 (GenBank assembly accession: GCA_003722275.1) , Lactococcus lactis strain FDAARGOS_865 (GenBank assembly accession: GCA_016028835.1) , Lactococcus lactis subsp.
  • cremoris IBB477 (GenBank assembly accession: GCA_001856165.1) , Lacticaseibacillus paracasei strain TD 062 (GenBank assembly accession: GCA_009834405.1) , Lacticaseibacillus paracasei strain HM1 (GenBank assembly accession: GCA_018064185.1) , Bacillus thuringiensis strain FDAARGOS_794 (GenBank assembly accession: GCA_013267795.1) , Bacillus thuringiensis strain XL6 (GenBank assembly accession: GCA_000774075.2) , Bacillus thuringiensis strain Bt-GS57 (GenBank assembly accession: GCA_017751245.1) , Bacillus thuringiensis strain HER1410 (GenBank assembly accession: GCA_013340745.1) , Bacillus thuringiensis serovar tolworthi (GenBank assembly accession: GCA_001548175.1) , Bac
  • tolerans strain MGB0734 (GenBank assembly accession: GCA_015476135.1) , Lacticaseibacillus paracasei subsp. tolerans strain MGB0747 (GenBank assembly accession: GCA_015476175.1) , Lacticaseibacillus paracasei strain CBA3611 (GenBank assembly accession: GCA_007292115.1) , Lacticaseibacillus paracasei subsp. paracasei strain GR0548 (GenBank assembly accession: GCA_019175405.1) , Lacticaseibacillus paracasei subsp.
  • tolerans strain MGB0625 (GenBank assembly accession: GCA_015476155.1) , Lacticaseibacillus paracasei strain 10266 (GenBank assembly accession: GCA_008329845.1) , Lacticaseibacillus paracasei subsp. tolerans strain S-NB (GenBank assembly accession: GCA_016757695.1) , Lacticaseibacillus paracasei strain Lp02 (GenBank assembly accession: GCA_013307125.1) , Lacticaseibacillus paracasei strain ZFM54 (GenBank assembly accession: GCA_003627255.1) , Lacticaseibacillus paracasei subsp.
  • paracasei strain BD5115 GenBank assembly accession: GCA_018596415.1
  • Paracasei JCM 8130 GenBank assembly accession: GCA_000829035.1
  • Corynebacterium glutamicum ATCC 14067 preferably, Corynebacterium glutamicum ATCC 14067.
  • the carrier protein is a major pilin. In some embodiments, the carrier protein is the native major pilin of the bacterium.
  • the fusion of insertion of the polypeptide of interest does not influence the formation of intermolecular isopeptide bond, disulfide bond, or intramolecular isopeptide bond in the carrier protein.
  • the polypeptide of interest is fused to a terminus of the carrier protein. In some embodiments, the polypeptide of interest is fused to the N terminus of the carrier protein.
  • the polypeptide of interest is inserted into the carrier protein. In some embodiments, the polypeptide of interest is inserted into a loop in the carrier protein.
  • the carrier protein is a major pilin from Corynebacterium glutamicum (Spa2 protein) . It is observed that the Spa2 protein (SEQ ID NO: 1) comprises three tandem Ig-like domains, including N-domain (residues 36-197) , M-domain (residues 198-343) , and C-domain (residues 344-469) which is consistent with other major pilin. It is also observed that the deletion of M-domain does not influence the formation of CLP.
  • the polypeptide of interest is inserted into the M domain of the major pilin. In some embodiments, the polypeptide of interest replaces the M domain of the major pilin or a part thereof.
  • the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to SEQ ID NO: 1, 2, 3, or 4.
  • the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to SEQ ID NO: 1, 2, 3, or 4 with the residues corresponding to residues C97, C128, K194, C380, C432, and LPLTG (474-478) , and optionally E158, D246, and/or E435 of SEQ ID NO: 1 unchanged.
  • the carrier protein can be the mature form of SEQ ID NO: 1, 2, 3, or 4, i.e., with the deletion of the signal peptide.
  • the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4.
  • the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4with the residues corresponding to residues C97, C128, E158, K194, D246, C380, C432, E435, and LPLTGT (474-478) , and optionally E158, D246, and/or E435 of SEQ ID NO: 1 unchanged.
  • the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between positions corresponding to G215 and L216 of SEQ ID NO: 1, between positions corresponding to G236 and E237 of SEQ ID NO: 1, or between positions corresponding to G336 and T337 of SEQ ID NO: 1.
  • the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1.
  • the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between G215 and L216, between G236 and E237, or between G336 and T337 of SEQ ID NO: 1.
  • the polypeptide of interest is directly linked to the N terminal of the carrier polypeptide. In some embodiments, the polypeptide of interest is linked to the N terminal of the carrier polypeptide via a peptide linker such as a flexible linker.
  • a peptide linker can be generally short peptides with about 4-20 or more amino acids, such as combinations of Ser and Gly residues, which is a conventional flexible linker.
  • the peptide linker used in the present disclosure is (G4S) 2 i.e., SEQ ID NO: 22.
  • the peptide linker is a C10 linker of SEQ ID NO: 23.
  • the polypeptide of interest can be selected according to the desired application of the fusion polypeptide.
  • the fusion polypeptide is provided to degrade carbohydrates such as cellulose, and the polypeptide of interest can be the endo-1, 4- ⁇ -glucanase from Trichoderma reesei (TrEgl, SEQ ID NO: 19) and/or ⁇ -glucosidase from Saccharophagus degradans (SdBgl, SEQ ID NO: 21) .
  • the recombinant cell comprises two or more polynucleotide respectively encoding two or more fusion polypeptides each comprising a different polypeptide of interest, and the modified CLP comprises the two or more polypeptides.
  • the recombinant cell is provided to bind, capture or enrich a target molecule
  • the polypeptide of interest is a polypeptide that can recognize a target peptide, including but not limited to a ligand, a receptor, an antigen and an antibody such as scFV and nanobody.
  • the recombinant cell is provided to capture a protein comprising a SpyTag (SEQ ID NO: 37)
  • the polypeptide of interest comprises SpyCatcher (SEQ ID NO: 15) , vice versa.
  • the recombinant cell is provided as an adhesive agent, and the polypeptide of interest is an adhesive peptide, e.g., Mfp35 (SEQ ID NO: 38) .
  • the recombinant cell is provided to catalyze chemical or biochemical reactions, and the polypeptide of interest is an enzyme.
  • the recombinant cell is provided to degrade carbohydrates such as cellulose, and the polypeptide of interest can be the endo-1, 4- ⁇ -glucanase, e.g., from Trichoderma reesei (TrEgl, SEQ ID NO: 19) and/or ⁇ -glucosidase, e.g., from Saccharophagus degradans (SdBgl, SEQ ID NO: 21) .
  • the recombinant cell is provided to degrade refractory organics, such as plastics, and the polypeptide of interest is an enzyme responsible for the degradation, such as a PETase.
  • the present disclosure provides a method of preparing the recombinant cell of present disclosure, comprising introducing a polynucleotide encoding the fusion polypeptide of the present disclosure into a host cell.
  • the carrier protein in the fusion polypeptide is the native major pilin of the host cell.
  • the host cell is a gram-positive bacterium. In some embodiments, the host cell is a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
  • the bacterium can include, but are not limited to, a bacterium selected from Corynebacterium glutamicum strain BE (GenBank assembly accession: GCA_013046805.1) , Corynebacterium glutamicum ATCC 14067 (GenBank assembly accession: GCA_002243555.1) , Corynebacterium glutamicum strain YI (GenBank assembly accession: GCA_001643035.1) , Corynebacterium glutamicum strain ATCC 13869 (GenBank assembly accession: GCA_001687645.1) , Corynebacterium glutamicum AJ1511 (GenBank assembly accession: GCA_002355675.1) , Corynebacterium glutamicum strain XV (GenBank assembly accession: GCA_001936195.1) , Corynebacterium glutamicum strain CP (GenBank assembly accession: GCA_001447865.2) , Corynebacterium glutamicum R (GenBank assembly accession: GCA
  • cremoris NZ9000 GenBank assembly accession: GCA_000143205.1
  • cremoris MG1363 GenBank assembly accession: GCA_000009425.1
  • cremoris A76 (GenBank assembly accession: GCA_000236475.1) , Lactococcus lactis strain SRCM103457 (GenBank assembly accession: GCA_004194355.1) , Lactococcus lactis strain CBA3619 (GenBank assembly accession: GCA_007954765.1) , Lactococcus lactis strain WiKim0098 (GenBank assembly accession: GCA_016406265.1) , Lactococcus lactis strain K_LL005 (GenBank assembly accession: GCA_014334715.1) , Lactococcus lactis subsp.
  • lactis strain G121 (GenBank assembly accession: GCA_013395015.1) , Lactococcus lactis strain N8 (GenBank assembly accession: GCA_014884605.1) , Lactococcus lactis subsp. lactis IO-1 (GenBank assembly accession: GCA_000344575.1) , Lactococcus lactis subsp. lactis strain F44 (GenBank assembly accession: GCA_002804185.1) , Lactococcus lactis subsp. lactis bv.
  • Lactococcus lactis strain S50 (GenBank assembly accession: GCA_003627395.2) , Lactococcus lactis strain FDAARGOS_1064 (GenBank assembly accession: GCA_016127135.1) , Lactococcus lactis strain FDAARGOS_887 (GenBank assembly accession: GCA_016027975.1) , Lactococcus lactis subsp.
  • lactis strain UC77 (GenBank assembly accession: GCA_002078615.2) , Lactococcus lactis strain FDAARGOS_866 (GenBank assembly accession: GCA_016028815.1) , Lactococcus lactis strain IL1403 (GenBank assembly accession: GCA_003722275.1) , Lactococcus lactis strain FDAARGOS_865 (GenBank assembly accession: GCA_016028835.1) , Lactococcus lactis subsp.
  • cremoris IBB477 (GenBank assembly accession: GCA_001856165.1) , Lacticaseibacillus paracasei strain TD 062 (GenBank assembly accession: GCA_009834405.1) , Lacticaseibacillus paracasei strain HM1 (GenBank assembly accession: GCA_018064185.1) , Bacillus thuringiensis strain FDAARGOS_794 (GenBank assembly accession: GCA_013267795.1) , Bacillus thuringiensis strain XL6 (GenBank assembly accession: GCA_000774075.2) , Bacillus thuringiensis strain Bt-GS57 (GenBank assembly accession: GCA_017751245.1) , Bacillus thuringiensis strain HER1410 (GenBank assembly accession: GCA_013340745.1) , Bacillus thuringiensis serovar tolworthi (GenBank assembly accession: GCA_001548175.1) , Bac
  • tolerans strain MGB0734 (GenBank assembly accession: GCA_015476135.1) , Lacticaseibacillus paracasei subsp. tolerans strain MGB0747 (GenBank assembly accession: GCA_015476175.1) , Lacticaseibacillus paracasei strain CBA3611 (GenBank assembly accession: GCA_007292115.1) , Lacticaseibacillus paracasei subsp. paracasei strain GR0548 (GenBank assembly accession: GCA_019175405.1) , Lacticaseibacillus paracasei subsp.
  • tolerans strain MGB0625 (GenBank assembly accession: GCA_015476155.1) , Lacticaseibacillus paracasei strain 10266 (GenBank assembly accession: GCA_008329845.1) , Lacticaseibacillus paracasei subsp. tolerans strain S-NB (GenBank assembly accession: GCA_016757695.1) , Lacticaseibacillus paracasei strain Lp02 (GenBank assembly accession: GCA_013307125.1) , Lacticaseibacillus paracasei strain ZFM54 (GenBank assembly accession: GCA_003627255.1) , Lacticaseibacillus paracasei subsp.
  • paracasei strain BD5115 GenBank assembly accession: GCA_018596415.1
  • Paracasei JCM 8130 GenBank assembly accession: GCA_000829035.1
  • Corynebacterium glutamicum ATCC 14067 preferably, Corynebacterium glutamicum ATCC 14067.
  • the host cell is modified to inactivate the native major pilin.
  • the method comprises a step of knocking out the native major pilin.
  • the endogenous polynucleotide encoding the major pilin can also be replaced by the polynucleotide encoding the fusion polypeptide via homologous recombination.
  • the present disclosure provides a modified covalently-linked pili (CLP) comprising a plurality of the fusion polypeptides of the present disclosure.
  • the modified CLP is cell-free.
  • the present disclosure further provides a method of preparing a modified CLP comprising the steps of a) providing the fusion polypeptide of the present disclosure; and b) providing an activity of sortase.
  • the modified CLP is cell-free.
  • the fusion polypeptide is provided by transcribing and/or translalting the polynucleotide of the present disclosure.
  • the activity of sortase is provided by transcribing and/or translalting one or more polynucleotides encoding a sortase.
  • the sortase is encoded by a gene which is identified to be present in the same cluster with the gene encoding the carrier protein in nature.
  • the method comprises contacting the fusion polypeptide of the present disclosure with the sortase protein.
  • the sortase is class C type sortase, such as srtC1 and/or srtC2, preferably wherein the srtC1 and srtC2 are encoded by genes from the same cluster.
  • the method is an in vitro method.
  • the present disclosure provides a polynucleotide construct or a combination of polynucleotide constructs comprising the polynucleotide of the present disclosure, and one or more polynucleotides encoding a sortase.
  • the sortase is encoded by a gene which is identified to be present in the same cluster with the gene encoding the carrier protein in nature.
  • the sortase is class C type sortase, such as srtC1 and/or srtC2, preferably wherein the srtC1 and srtC2 are encoded by genes from the same cluster.
  • the modified CLP and recombinant cell achieve the cascade reaction of enzymes, and improves the catalytic efficiency of a multi-enzyme system.
  • the immobilization of enzymes onto CLP and recombinant cells can achieve a whole-cell catalyzation.
  • the original DNA sequence was fully synthesized (Genewiz, Nanjing, China) or PCR-generated. All PCR products were generated by KOD DNA polymerase (TOYOBO, Japan) . All plasmid construction was performed using the T4 DNA ligase (New England BioLabs, Boston, MA) for ligations or the NEB Builder HiFi DNA Assembly Master Mix (New England BioLabs, Boston, MA) for assembly. All plasmids or markerless strains were confirmed by DNA sequencing (GENEWIZ, Guangzhou, China) . Primers used in the Examples are listed in Table 1.
  • C. glutamicum ATCC140675 was provided by Dr. Zheng’s research group at the South China University of Technology.
  • C. glutamicum ATCC14067 was grown in BHI liquid medium for recovery (37 g L -1 brain heart infusion (Becton, Dickinson and company) ) at 30 °C, 250 rpm, overnight.
  • BHI liquid medium for recovery 37 g L -1 brain heart infusion (Becton, Dickinson and company)
  • C. glutamicum ATCC14067 was inoculated into M63 liquid medium (15.6 g L -1 M63 Broth (Sangon Biotech, Guangzhou, China) , supplemented with 1 mM MgSO4, 0.2% (wt/vol) glucose) and cultivated in an incubator at 30 °C without shaking for 2-3 days.
  • Antibiotics for C. glutamicum culture were kanamycin (25 ⁇ g mL -1 ) and hloramphenicol (7.5 ⁇ g mL -1 )
  • Isopropyl- ⁇ -d-thiogalactoside (IPTG) at 1 mM/0.5mM or theophylline at 1mM was used to induce gene expression.
  • Trans1-T1 TransGen Biotech, Shenzhen, China
  • E. coli BL21 DE3 (New England BioLabs, Boston, MA) was used for protein expression.
  • E. coli was cultured in Luria-Bertani medium (10 g L -1 peptone, 5 g L -1 yeast extract, 10 g L -1 NaCl) at 37 °C or 16 °C when applicable for protein expression.
  • Antibiotics for E. coli culture were kanamycin (50 ⁇ g mL -1 ) and chloramphenicol (30 ⁇ g mL -1 ) .
  • the markerless deletion strains of C. glutamicum ATCC 14067 were achieved by the RecET-Cre/loxP system. Detailed methods for markerless deletion are described in Huang, Y. et al. (Recombineering using RecET in Corynebacterium glutamicum ATCC14067 via a self-excisable cassette. Sci. Rep. 7, 1-8, 2017) .
  • dsDNA fragments including the Cre-Kan cassette, the left and right homologous fragments, were used for subsequent fusion PCR to generate a ⁇ 4, 385 bp linear self-excisable dsDNA cassette with primer pairs clpL-S/clpR-A.
  • primer pairs spa1L-S/A, spa1R-S/A, ck-S/A and spa1L-S/spa1R-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
  • primer pairs spa2L-S/A, spa2R-S/A, ck-S/A and spa2L-S/spa2R-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
  • primer pairs spa3L-S/A, spa3R-S/A, ck-S/A and spa3L-S/spa3R-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
  • primer pairs srtC1L-S/A, srtC2R-S/A, ck-S/A and srtC1L-S/srtC2R-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
  • primer pairs srtAL-S/A, srtAR-S/A, ck-S/A and srtAL-S/srtAR-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
  • primer pairs decL-S/A, decR-S/A, ck-S/A and decL-S/decR-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassette, respectively.
  • the self-excisable dsDNA cassettes for markerless deletion of different genes were transformed into exonuclease-recombinase RecE/T expressed competent cells (C. glutamicum ATCC 1406) by electroporation, yielding multiple Kan-resistant colonies on BHI agar plates.
  • the cell-plasmid DNA/dsDNA mixture was transferred to an ice-cold electroporation cuvette (0.1 cm electrode gap) .
  • Electroporation was performed with a Bio-Rad Micropulser set by three times 1.8 KV/cm (Ec1) pulse (see Huang et al., Recombineering using RecET in Corynebacterium glutamicum ATCC14067 via a self-excisable cassette, Sci Rep 7, 7916 (2017) )
  • Cre enzyme was used to induce expression by adding 1 mM theophylline and excising selectable marker by Cre/lox site specific recombination. Finally, sequencing of the PCR fragments from the genomic of mutants was performed for further identification.
  • the resultant mutant strains used in this study were referred to as C. glutamicum ATCC 14067 ⁇ clp ( ⁇ clp) , C. glutamicum ATCC 14067 ⁇ spa1 ( ⁇ spa1) , C. glutamicum ATCC 14067 ⁇ spa2 ( ⁇ spa2) , C. glutamicum ATCC 14067 ⁇ spa3 ( ⁇ spa3) , and C.
  • glutamicum ATCC 14067 ⁇ srtC1 ⁇ srtC2 ( ⁇ srtC1 ⁇ srtC2) .
  • C. glutamicum ATCC 14067 ⁇ spa1 ⁇ spa3 ( ⁇ spa1 ⁇ spa3) mutant was constructed by transforming ⁇ spa3-cassette into ⁇ spa1 strain.
  • C. glutamicum ATCC 14067 ⁇ spa2 ⁇ srtA ( ⁇ spa2 ⁇ srtA) and C. glutamicum ATCC 14067 ⁇ spa2 ⁇ dec ( ⁇ spa2 ⁇ dec) mutants were constructed by transforming ⁇ srtA-cassette and ⁇ dec-cassette into ⁇ spa2 strain, respectively, as described above.
  • the pEC-XK99E plasmid was used as an original plasmid.
  • DNA fragments of the pEC-XK99E backbone (GNENWIZ, China) the coding sequence of Spa2 or various recombinant Spa2 (SEQ ID NOs: 1, 5, 8-14, and 24, respectively) , and the native promoter (SEQ ID NO: 25) of spa2 gene via PCR, and then all the DNA fragments were assembled by NEB Builder HiFi DNA Assembly Master Mix to construct the plasmids pEK-spa2, pEK-spa2cut, pEK-E1/mCherry-spa2, pEK-E2/mCherry-spa2, pEK-E3/mCherry-spa2, pEK-E4/mCherry-spa2, pEK-6his-spa2, pEK-SpyTagSpa2, pEK-Mfp3Spep-Spa2,
  • the two basic plasmids 203 and 204 were constructed based on pEC-XK99E backbone with additional restriction sites of SmaI, XbaI, NcoI, BamHI, SpeI and SalI by Gibson assembly with NEB Builder HiFi DNA Assembly Master Mix.
  • SmaI, XbaI, and NcoI were used to fuse proteins with Spa2 pilin, and SpeI and SalI (Takara) were used to insert another independent expression cassette for fusion protein.
  • CDSs coding sequences of SpyCatcher, Venus, CcEgl, N-Ven, and TrEgl
  • the CDSs of N-Ven and TrEgl were inserted into the linearized backbone of 203 (digestion with SmaI and SpeI, Takara) via Gibson assembly.
  • CDSs of C-Ven and SdBgl were cloned into the SmaI and XbaI sites in 204 by ligation.
  • CDSs of C-Ven and SdBgl were inserted into the linearized backbone of 204 (digestion with SmaI and SalI, Takara) via Gibson assembly.
  • the C-Ven-Spa2 cassette was obtained by digesting pEK-C-Ven-Spa2 with SpeI and SalI, and then, cloned into the plasmid of pEK-N-Ven-Spa2 (digested with SpeI and SalI, Takara) to construct tandem expression plasmids of pEK-N-Ven-Spa2_C-Ven-Spa2 (see Fig. 3) .
  • Spa2 The coding sequence of Spa2 (SEQ ID NO: 6) was amplified from the genome of C. glutamicum ATCC 14067, and then assembled into the pET-28a (+) backbone (Novagen, Madison, WI) by Gibson assembly (see Fig. 5) .
  • C. glutamicum cells cultured 2-3 days in M63 medium were collected and washed twice in PBS buffer, and 20 ⁇ L of liquid culture in M63 (OD600 ⁇ 1) were deposited onto carbon-coated TEM grids for 5-10 min.
  • the samples were washed two times with 50 ⁇ L PBS buffer and three times with 20 ⁇ L water, and then, the excessive solution was quickly wicked away with filter paper.
  • the cells were deposited onto the cropper wire mesh, and were negatively stained with 15 ⁇ L 2 w/v%uranyl acetate solutions for 1 min and dried for 10 min under an infrared lamp. Samples were examined in a JEOL JEM-1400 transmission electron microscope at an accelerating voltage of 120 kv.
  • C. glutamicum strains were cultured for 48 h in M63 liquid medium, and the cultures were collected, washed and diluted to an OD600 of 0.1 in Tris-buffered saline with 0.1%ProclinTM 300 (Sigma, 48912-U) on ice.
  • the recombinant Spa2 was expressed as an N-terminus His-tagged protein.
  • E. coli BL21 (DE3) transformed with plasmid PET-28a-Spa2 (CaCl 2 process) were grown overnight at 37°C to provide a starter culture for expression.
  • a total of 1 L medium with 50 ⁇ g mL -1 kanamycin was inoculated with 1% (v/v) of the starter culture and grown at 37°C.
  • the cultivation temperature was lowered to 16°C and IPTG was added to a final concentration of 0.5 mM to induce protein overexpression.
  • cells were collected by centrifugation, and the cell pellets were suspended in buffer A (50 mM Tris-HCl, 150 mM NaCl, pH 8.0) and lysed by high pressure homogenization. The cell lysates were centrifuged at 12, 000 rpm for 30 min at 4°C.
  • buffer A 50 mM Tris-HCl, 150 mM NaCl, pH 8.0
  • the resulting supernatant was loaded onto a Nickel-affinity column (5 mL, GE) pre-equilibrated with buffer A (50 mM Tris-HCl, 150 mM NaCl, pH 8.0) .
  • His-tagged Spa2 protein was eluted with buffer A with 50 mM imidazole.
  • the His-tagged Spa2 protein was buffer-exchanged into buffer A and subjected to tag removal by HRV3c (SEQ ID NO: 34, 1 mg/50 mg Spa2) at 4 °C overnight.
  • the digested product was loaded onto the 5-mL Ni-NTA column (GE) and eluted with a buffer A/buffer B (buffer A + 500 mM imidazole) gradient (5%buffer B, 10%buffer B, 20%buffer B and 100%buffer B) .
  • the flow-through at 10% buffer B was collected.
  • the final purified protein was concentrated to 20 mg mL-1 in 10 mM Tris-HCl pH 8.0 and 50 mM NaCl for crystallization.
  • the sitting drop vapor diffusion technique http: //soft-matter. seas. harvard. edu/index. php/Vapor_Diffusion_Method) was used to crystallize the Spa2 protein. Crystals were obtained by mixing 4 ⁇ L of Spa2 protein with 4 ⁇ L reservoir solution (0.2 M sodium sulfate, 0.1 M Bis-Tris propane pH 7.5, 20 %w/v PEG 3350) and incubating the mixture at 18 °C for 1-2 weeks.
  • the crystals were soaked in a cryo-protectant solution consisting of the reservoir solution and 20% (v/v) glycerol and then quickly frozen with liquid nitrogen. Diffraction data were collected on the BL18U1 beamline at the Shanghai Synchrotron Radiation Facility (Shanghai, China) with flash frozen crystals (at 100 K in a stream of nitrogen gas) . The data were processed by XDS9 and then further processed using STARANISO10 (aserver of Global Phasing Company) .
  • the structure was solved by the molecular replacement method using PHASER11 and the predicted Spa2 coordinates by Alphafold Colab12 as template. Further manual model building was carried out using COOT13. The model was refined by PHENLX14. Data collection, phasing and refinement statistics are given in Table 3. Structure figures were prepared using PyMOL2.3.4 (https: //pymol. org/2/) .
  • C. glutamicum colonies were inoculated into 10 mL BHI and cultured for 12 h. Then cells were transferred into M63 medium with an initial OD600 of 0.1 for 3 days at 30°C without shaking. Cells were collected by centrifugation at 5, 000 rpm, washed three times with PBS and diluted with PBS (OD600 ⁇ 0.5) . Exactly 200 ⁇ L of the samples were transferred to a flat-bottom 96-well black plate and analyzed on a Tecan Infinite Pro 200 Plate Reader, with excitation/emission wavelengths of 580/610 nm for mCherry fluorescence intensity, and 510/545 nm for Venus fluorescence intensity. The fluorescence intensity divided by the absorbance of OD is the normalized fluorescence intensity.
  • Fluorescence (confocal) microscopy imaging Cells prepared for plate-reader measurements were dripped on a glass slide and imaged under a Nikon TI2-E inverted microscope. Microscope light source power, detector gain, and image processing settings were consistent among different samples.
  • Stains expressing SpyTag-Spa2, SpyCatcher-Spa2 and Spa2 (strain ⁇ spa2 transformed with pEK-SpyTagSpa2, pEK-SpyCatcherSpa2, and pEK-spa2, respectively) were cultured in glass-bottom dishes in M63 for 3 days. The dishes were then gently washed three times with PBS containing 0.5%Tween80 (PBST) and blocked in PBST with 1%BSA for 1 h.
  • PBST PBS containing 0.5%Tween80
  • the group of SpyTag-Spa2 and Spa2 were incubated with purified GFP-SpyCatcher (SEQ ID NO: 35) , and the group of SpyCatcher-Spa2 and Spa2 were incubated with purified GFP-SpyTag (SEQ ID NO: 36) for 1 h at room temperature. All samples were washed three times with PBS buffer and imaged under a Nikon TI2-E inverted microscope.
  • Spa2 strain or the Mfp3Spep-Spa2 strain was cultured in the M63 medium (3 mL) supplemented with 200 ⁇ L of green-fluorescent PS microsphere solution in 35-mm Petri dishes containing 2-3 glass slides for 3 days at 30°C without shaking. The settled glass slides were then taken out and gently flushed to wash away the microspheres that had not adhered. The binding capacity of different samples was compared with water jetting at a constant discharge pressure of 5 psi for 15 s, performed on a pressure-flow controller (PG-MFC-8CH, PreciGenome) . Fluorescence images were recorded before and after the mechanical challenge with water jetting.
  • PG-MFC-8CH pressure-flow controller
  • the pEK-spa2cut plasmid was transferred into ⁇ spa2 by electroporation as described above to construct the strain ⁇ spa2-pEK-spa2cut, which was used to express the monomer of Spa2cut (SEQ ID NO: 5) .
  • Cells were inoculated into M63 medium with 25 ⁇ g mL-1 kanamycin and cultured for 3 days.
  • Supernatants 200 mL were collected and concentrated into 1 mL and then purified by nickel-affinity chromatography as previously described in the section of “Expression and purification of recombinant Spa2” .
  • Spa2cut was eluted with 100 mM imidazole.
  • the final purified protein was buffer-exchanged into 10 mM Tris-HCl, 100 mM NaCl, pH 8.0.
  • a similar process was followed for expression and purification of Spa2cut mutant variants of E158Acut, D246Acut, E435Acut, and D246A/E435Acut.
  • ⁇ spa2 ⁇ srtA-pEK-6his-spa2 strain enables secretion of the expressed 6His- Cg CLP into the culture medium due to lacking sortase A.
  • 6His- Cg CLP polymers ⁇ spa2 ⁇ srtA-pEK-6his-spa2 cells were inoculated into M63 medium with 25 ⁇ g mL -1 kanamycin and cultured for 3 days.
  • 6His- Cg CLP purification 500 mL supernatants were collected and concentrated to 5mL in buffer of 10 mM Tris-HCl, 100 mM NaCl, pH 8.0 and were purified by nickel affinity chromatography.
  • the 6His- Cg CLP polymers were eluted with 100 mM imidazole. Purified 6His- Cg CLP fibers were then boiled in SDS sample buffer (6 ⁇ Protein Loading Buffer, TransGen Biotech, DL101-02) and subjected to an SDS-PAGE gel. The high-molecular-weight Cg CLP polymer bands were excised from Coomassie brilliant blue stained SDS-PAGE gels and prepared for intermolecular isopeptide bond identification.
  • the Spa2cut solution was precipitated with acetone (1: 4) and the pellets were dried using a Speedvac (room temperature) for 1-2 min. The pellets were then dissolved in 100 mM Tris-HCl (pH 8.5) supplemented with 8 M urea. 5mM TCEP (Thermo Scientific) for reduction and 10 mM iodoacetamide (Sigma) for alkylation were added and incubated at room temperature for 30 min. The protein mixture was diluted (1: 4) and digested overnight with chymotrypsin at 1: 40 (w/w) . The protease-digested peptide solution was desalted using a MonoSpinTM C18 column (GL Science, Tokyo, Japan) and dried with a SpeedVac.
  • the Spa2cut sample was processed following the same protocol as previously described for signal peptide identification.
  • the Spa2cut sample was processed following a similar protocol except that pepsin (Promega) was purposely added for digestion, while addition of 5mM TCEP (Thermo Scientific) was avoided to ensure that the disulfide bond, if any, was kept intact.
  • the Coomassie brilliant blue stained SDS-PAGE gel band of Cg CLP fibers was excised into small pieces and washed in water, followed by 50 mM NH 4 HCO 3 in 50%acetonitrile and 100%acetonitrile.
  • the sample was reduced with 10 mM TCEP (Thermo Scientific) in 100 mM NH 4 HCO 3 at 55 °C for 1 h and alkylated with 55 mM iodoacetamide (Sigma) in 100 mM NH 4 HCO 3 at 37 °C in the dark for 30 min.
  • the gel pieces were then washed with 100 mM NH 4 HCO 3 and 100%acetonitrile, and dried.
  • the sample was primarily digested with 3 ⁇ g trypsin (Promega) in 50 mM NH 4 HCO 3 at 37 °C overnight, then 1 ⁇ g of Asp-N endoproteinase (Promega) was added for another overnight incubation. Digested peptides were extracted twice with 50%acetonitrile containing 5%formic acid.
  • protease-digested peptides were analyzed by LCMS/MS using an Easy-nLC 1200 nano HPLC (Thermo Scientific) hybrid of a Q Exactive Orbitrap mass spectrometer (Thermo Scientific) system. Peptides were separated on a 30 cm-long pulled-tip analytical column (75 ⁇ m ID packed with ReproSil-Pur C18-AQ 1.9 ⁇ m resin, Dr. Maisch GmbH) in 0.1%aqueous formic acid (buffer A) and 0.1%formic acid in 80%acetonitrile (buffer B) at 55 °C with a flow rate of 300 nl/min using a 120 min linear gradient.
  • Buffer A 0.1%aqueous formic acid
  • buffer B 0.1%formic acid in 80%acetonitrile
  • CMC-Na carboxymethylcellulose sodium salt
  • DMS 3,5dinitrosaloculoc acid
  • TrEgl-Spa2_SdBgl-Spa2 C003 strain
  • TrEgl_SdBgl C004 strain
  • the lycopene producing plasmid of pZ9-dxs_crtEBI was transferred into strain TrEgl_SdBgl to construct the recombinant strains of C003 and C004 for the utilization of cellulose to produce lycopene.
  • C003 and C004 strains were inoculated into 10 mL BHI with 25 ⁇ g mL -1 kanamycin and 7.5 ⁇ g mL -1 chloramphenicol, and cultured for 12 h at 30 °C at a stirring speed at 200 rpm.
  • modified M63 medium (15.6 g L -1 M63 broth, supplemented with 1 mM MgSO 4 , 2% (wt/vol) CMC-Na) with initial OD600 of 3 for 2 days at 30°C and 1 mM IPTG was added or not.
  • lycopene production was carried out according to Li, C. et al. (Heterologous production of ⁇ -Carotene in Corynebacterium glutamicum using a multi-copy chromosomal integration method. Bioresour. Technol. 341, 125782, 2021) .
  • IPTG induced and un-induced cells (1 mL) were separately collected into 2 mL tubes of lysing matrix Y (M. P. Biomedicals) by centrifugation at 12, 000 rpm for 5 min.
  • the pellets were resuspended in a 60%hexane and 40%acetone mixture and lysed using the FastPrepR-24 5G bead beating grinder and lysis system (M. P. Biomedicals) for lycopene extraction.
  • the lysis condition is 30 s once with a 1 min interval, for 6 times.
  • the samples were centrifuged at 14, 000 rpm for 10 min at 4 °C, and the resulting supernatant was then transferred to brown 2 mL screw cap glass vials (Agilent Technologies) and directly subjected to HPLC analysis.
  • the quantification of lycopene was performed on an Agilent 1260 series HPLC system (Agilent Technologies) using YMC Carotenoid (250 ⁇ 4.6 mml. D., YMC) and detected via a diode array detector (DAD) at 450 nm.
  • binary gradient elution was applied to change the eluent from 100%eluent A of methanol/Methyl tert-butyl ether/water (81/15/4) to 100%eluent B of methanol/Methyl tert-butyl ether/water (7/90/3) over 90 min at a flow rate of 1.0 mL ⁇ min-1 at 20 °C with an injection volume of 10 ⁇ L (eluent A for 2min, eluent B 2min-95min, and eluent A 95min-100min.
  • This Example was carried out to investigate the CLP assembly in the industrial workhorse C. glutamicum ATCC 14067 (referred to as Cg CLP) .
  • the industrial workhorse C. glutamicum is a ‘generally recognized as safe’ (GRAS) strain with well-established gene editing tools that is widely used for the industrial-scale production of valued products such as amino acids, diamines, terpenoids, and other chemicals (Zhao, N. et al. Development of a Transcription Factor-Based Diamine Biosensor in Corynebacterium glutamicum. ACS Synth. Biol. 10, 3074-3083, 2021; and Xu, X. et al., Ledesma-Amaro, R. &Liu, L. Microbial chassis development for natural product biosynthesis. Trends Biotechnol. 38, 779-796, 2020) .
  • GRAS generally recognized as safe
  • CLP BGC contains three pilin-encoding genes, spa1, spa2, and spa3, as well as two sortase coding genes of srtC1, and srtC2 (Fig. 6) , which is similar to the SpaH-type (arelatively less well-studied pili type) CLP gene cluster in the pathogenic C. diphtheriae (Mandlik, A. et al., Pili in Gram-positive bacteria: assembly, involvement in colonization and biofilm development. Trends Microbiol. 16, 33-40, 2008) .
  • the composition of Cg CLP was determined with polyclonal antibodies against Spa1, Spa2, and Spa3, respectively.
  • TEM images of the Cg CLP with immunogold labelling showed that the Cg CLP fibers comprise two minor pilins of Spa1 and Spa3 and a major pilin of Spa2 (Fig. 8) .
  • TEM and AFM imaging used to assess the specific roles of the three pilins in the Cg CLP assembly showed that the cells, which were defective for Spa1 ( ⁇ spa1 strain) , Spa3 ( ⁇ spa3 strain) , or both ( ⁇ spa1 ⁇ spa3 strain) , could still produce fibers (Fig. 7) .
  • cells lacking Spa2 ( ⁇ spa2) could not produce any fiber, and overexpression of Spa2 (Spa2) promoted the formation of abundant long fibers throughout the cell surface (Fig. 7) .
  • TEM and AFM images also showed that cells lacking both SrtC1 and SrtC2 ( ⁇ srtC1 ⁇ srtC2) completely blocked fiber formation (Fig. 9) .
  • the purified Cg CLP polymers were excised from Coomassie blue-stained SDS-PAGE gels (Fig. 10) and then digested in-gel with trypsin (Promega) and AspN endoproteinase (Promega) .
  • Liquid chromatography-tandem mass spectrometry was used to analyze the digestion products, and verify the presence of the intermolecular isopeptide bond (bond formation results in the elimination of a water molecule and thus a slight decrease of molecular weight) .
  • the peptide peak with m/z 832.9 2+ (Fig. 11 and Table 2) suggested that the major pilin of Spa2 was cross-linked between K194 in the N-terminus of Spa2 i and T477 in the C-terminus of Spa2 i+1 (Lys194-Thr477) .
  • This detected mass is consistent with the loss of three NH 3 units and two H 2 units, indicating the formation of three intramolecular isopeptide bonds (loss of one molecule of ammonia, ⁇ 17 Da) and two disulfide bonds (loss of two hydrogen atoms, ⁇ 2 Da) in Spa2.
  • a Values in parentheses correspond to the outermost shell of data.
  • d R free
  • Spa2 is arranged in three tandem Ig-like domains, including N-domain (residues 36-197, pink) , M-domain (residues 198-343, blue) , and C-domain (residues 344-469, green) , giving an elongated molecule in length (Fig. 15) .
  • These three tandem Ig-like domains of Spa2 are similar to the major pilin of SpaA (PDB ID: 3HR6, root-mean-square deviation (RMSD) over 270 alpha-carbon (C ⁇ ) atoms, Fig. 16b) and SpaD (PDB ID: 4HSS, RMSD over 311 C ⁇ atom, Fig. 16c) from human pathogen C.
  • glutamicum is similar to the feature of the major pilin SpaD from the pathogenic C. diphtheriae (Kang, H. J. et al., 2014 above) , but is quite different from the major pilin SpaA from the pathogenic C. diphtheriae lacking isopeptide bonds in the N-terminal domain (Kang, H.J. et al., 2009 above) .
  • two disulfide bonds were formed in the N-domain between Cys97 and Cys128 and the C-domain between Cys380 and Cys432, respectively (Fig. 17b) .
  • Spa2 the presence of two disulfide bonds in Spa2 is very unique in comparison with other major pilins in human pathogens, such as Spy0128 (PDB ID: 3B2M) from Streptococcus pyogenes 37 and BcpA (PDB ID: 3KPT) from Bacillus cereus 38 lacking disulfide bond, and the SpaA and SpaD from C. diphtheriae containing only one disulfide bond in the C-terminal domain (Kang, H. J. et al., 2009 and 2014 above) .
  • PDB ID: 3B2M Speptococcus pyogenes 37
  • BcpA PBD ID: 3KPT
  • SpaA and SpaD from C. diphtheriae containing only one disulfide bond in the C-terminal domain
  • the CLP structure may serve as an attractive building block for various applications because these extracellular fibers have extraordinarily high tensile strength owing to their extensive inter-and intra-molecular isopeptide bonds. Moreover, as an extracellular matrix, CLP fibers can be conveniently and reliably positioned directly outside cells. Finally, their proteinaceous nature makes them potentially amenable for elaboration using genetic engineering.
  • This Example was carried out to determine suitable fusion sites to append peptides/proteins to Spa2. According to both the Spa2 crystal structure and the characterization of specific functional domains within Spa2 observed in Example 2, four different positions to test the fusion of a protein-of-interest (POI) , with one site in the N-terminus of Spa2 and three sites in the M-domain lacking a disulfide bond (Fig. 22) .
  • POI protein-of-interest
  • the CLP-defective strain C. glutamicum ATCC 14067 ⁇ spa2 ( ⁇ spa2) with abrogated extracellular Cg CLP formation was transformed with the exogenous expression plasmid (pEK-E1/mCherry-spa2, pEK-E2/mCherry-spa2, pEK-E3/mCherry-spa2, or pEK-E4/mCherry-spa2) for Spa2 fusion protein expression to test the restored Cg CLP fiber production.
  • the fluorescent reporter protein mCherry was fused at the interrogated positions for generating functional fusion proteins (SEQ ID NOs: 8-11) while retaining the sortase-catalyzed covalently-linked pili formation capacity of Spa2.
  • SEQ ID NOs: 8-11 functional fusion proteins
  • four sites were tested for mCherry addition/insertion, including Q35 (E1) at the N-terminus of Spa2, G215 in loop 1 of the M-domain (E2) , G236 in the loop 2 of the M-domain (E3) , and G336 in the ⁇ 23-sheet of the M-domain (E4) .
  • Quantitative analysis showed that the cells expressing each of the fusion proteins fluoresced and enabled the formation of fiber (Fig. 23a) .
  • Spa2 fusion proteins (six POIs, each fused at the E1 position via a linker of SEQ ID NO: 23) (see Fig. 25) were expressed by ⁇ spa2 strains transformed with plasmids pEK-6his-spa2, pEK-SpyTagSpa2, pEK-Mfp3Spep-Spa2, pEK-SpyCatcher-Spa2, pEK-Venus-Spa2, and pEK-CcEgl-Spa2, respectively. All of these fusion proteins were successfully expressed, secreted, and formed Cg CLP (Fig. 26) .
  • TEM images showed that Ni-NTA-decorated AuNPs were anchored onto 6His-Spa2 Cg CLP (Fig. 27a) .
  • Confocal microscopic images showed the green fluorescence emitted from SpyTag-Spa2 Cg CLP cells to which SpyCatcher-EGFP protein binding partners were covalently attached via Spytag-SpyCatcher interaction pairs (Fig. 27b) .
  • Confocal microscopic images show the green fluorescence emitted from SpyCatcher-Spa2 Cg CLP cells to which SpyTag-EGFP protein binding partners were covalently attached via Spytag-SpyCatcher interaction pairs (Fig. 27c) .
  • ⁇ spa2 strain was transformed with plasmids pEK-N-Ven-Spa2, pEK-C-Ven-Spa2 and pEK-N-Ven-Spa2_C-Ven-Spa2, respectively, ⁇ spa2 strain transformed with pEK-N-Ven_C-Ven was used as a control.
  • This Example was carried out to verify the co-assembly of multiple cellulases into a catalytic cascade for extracellular degradation of cellulose into glucose to support production of specific chemicals of interest (e.g., lycopene) in C. glutamicum ATCC 14067 ⁇ spa2 (Fig. 31) .
  • specific chemicals of interest e.g., lycopene
  • endo-1, 4- ⁇ -glucanase from Trichoderma reesei (TrEgl, SEQ ID NO: 19) and ⁇ -glucosidase from Saccharophagus degradans (SdBgl, SEQ ID NO: 21) were co-assembled in the Cg CLP fiber; these two enzymes are known to work in concert to degrade cellulose into glucose via enzyme cascade reactions.
  • Lycopene can be produced via the methylerythritol phosphate (MEP) pathway by engineered C. glutamicum (Li, C. et al. Heterologous production of ⁇ -Carotene in Corynebacterium glutamicum using a multi-copy chromosomal integration method. Bioresour. Technol. 341, 125782, 2021) .
  • a C001 chassis ⁇ spa2 ⁇ dec
  • spa2 spa2 ⁇ dec
  • CEY17_RS03380 for the abrogation Cg CLP formation
  • CEY17_RS03560 ⁇ dec, for accumulation of the precursor for lycopene production
  • the basal lycopene-producing strain C002 was constructed by transforming strain C001 with plasmid pZ9-dxs_crtEBI for IPTG-inducible expression of the dxs gene and crtEBI gene cluster. Then, the C002 strain was transformed with plasmids pEC-TrEgl-Spa2_SdBgl-Spa2, and pEC-TrEgl_SdBgl, respectively, resulting in the strains C003 and C004.
  • the C003 strain co-assembled TrEgl and SdBgl in Cg CLP fiber on the cell surface (Fig. 32a) and enabled the degradation of carboxymethylcellulose sodium (CMC-Na, the ether derivate of cellulose) in medium, based on the medium turning from a viscous gel to a thin solution (Fig. 32b) .
  • Strain C004, which only simultaneously secreted both TrEgl and SdBgl without anchoring to the Cg CLP scaffold did not show similar behavior.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Biochemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medicinal Chemistry (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Microbiology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

L'invention concerne un polypeptide de fusion comprenant une protéine porteuse et un polypeptide d'intérêt, le polypeptide d'intérêt étant fusionné à une extrémité de la protéine porteuse ou inséré dans la protéine porteuse, et la protéine porteuse étant une piline de pili liés de manière covalente (CLP) à partir d'un micro-organisme. L'invention concerne également une cellule recombinante comprenant un CLP modifié comprenant le polypeptide de fusion, ainsi que le CLP modifié.
PCT/CN2022/130033 2022-11-04 2022-11-04 Pili à liaison covalente modifiée et bactéries recombinantes les comprenant WO2024092769A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/130033 WO2024092769A1 (fr) 2022-11-04 2022-11-04 Pili à liaison covalente modifiée et bactéries recombinantes les comprenant

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/130033 WO2024092769A1 (fr) 2022-11-04 2022-11-04 Pili à liaison covalente modifiée et bactéries recombinantes les comprenant

Publications (1)

Publication Number Publication Date
WO2024092769A1 true WO2024092769A1 (fr) 2024-05-10

Family

ID=90929428

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/130033 WO2024092769A1 (fr) 2022-11-04 2022-11-04 Pili à liaison covalente modifiée et bactéries recombinantes les comprenant

Country Status (1)

Country Link
WO (1) WO2024092769A1 (fr)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009137763A2 (fr) * 2008-05-08 2009-11-12 Emory University Procédés et compositions pour l’affichage de polypeptides sur les fimbriae de bactéries gram-positives
WO2017003305A1 (fr) * 2015-07-01 2017-01-05 Auckland Uniservices Limited Peptides et leurs utilisations
WO2019213262A1 (fr) * 2018-05-01 2019-11-07 The Regents Of The University Of California Réactif pour le marquage de protéines par liaison isopeptidique à la lysine

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160304567A1 (en) * 2007-12-19 2016-10-20 Emory University Methods and compositions for the display of polypeptides on the pili of gram- positive bacteria
WO2009137763A2 (fr) * 2008-05-08 2009-11-12 Emory University Procédés et compositions pour l’affichage de polypeptides sur les fimbriae de bactéries gram-positives
US20110189236A1 (en) * 2008-05-08 2011-08-04 Emory University Methods and Compositions for the Display of Polypeptides on the Pili of Gram-Positive Bacteria
WO2017003305A1 (fr) * 2015-07-01 2017-01-05 Auckland Uniservices Limited Peptides et leurs utilisations
WO2019213262A1 (fr) * 2018-05-01 2019-11-07 The Regents Of The University Of California Réactif pour le marquage de protéines par liaison isopeptidique à la lysine

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
HUNG TON‐THAT: "Sortases and pilin elements involved in pilus assembly of Corynebacterium diphtheriae", MOLECULAR MICROBIOLOGY, WILEY-BLACKWELL PUBLISHING LTD, GB, vol. 53, no. 1, 1 July 2004 (2004-07-01), GB , pages 251 - 261, XP093168778, ISSN: 0950-382X, DOI: 10.1111/j.1365-2958.2004.04117.x *

Similar Documents

Publication Publication Date Title
Zou et al. Construction of a cellulase hyper-expression system in Trichoderma reesei by promoter and enzyme engineering
US20220002773A1 (en) Production of 3-fucosyllactose and lactose converting alpha-1,3-fucosyltransferase enzymes
Biedendieck et al. Plasmid system for the intracellular production and purification of affinity‐tagged proteins in Bacillus megaterium
EA017803B1 (ru) Система экспрессии
CN108103039B (zh) 一组岩藻糖基转移酶突变体及其筛选方法和应用
KR20220116243A (ko) 락토오스 전환 알파-1,2-푸코실트랜스퍼라제 효소
US10683509B2 (en) Surface display of functional proteins in a broad range of gram negative bacteria
WO2012118900A2 (fr) Présentation d'enzymes cellulolytiques et de complexes enzymatiques à la surface de microorganismes à gram positif
KR101481142B1 (ko) 코리네박테리아 발현용 합성프로모터
WO2014170460A2 (fr) Procede de production de proteines de collagene issues d'eponges marines et organisme apte a produire lesdites proteines
KR102350425B1 (ko) 프테로스틸벤의 생합성 제조를 위한 o-메틸트랜스퍼라제의 사용 방법
CN114196646B (zh) 一种橄榄醇合成酶变体a及其用途
EP3330282A1 (fr) Cipa et cipb pixa comme échafaudages pour organiser des protéines dans des inclusions cristallines
US20140011235A1 (en) Release factor 1 (rf1) in escherichia coli
WO2024092769A1 (fr) Pili à liaison covalente modifiée et bactéries recombinantes les comprenant
WO2023197692A1 (fr) Souche modifiée de levure ayant une voie tca réductrice positionnée sur les mitochondries et produisant efficacement de l'acide succinique, son procédé de construction et son utilisation
CN112342178A (zh) 重组微生物、其制备方法及在生产塔格糖中的应用
CN114032222B (zh) 糖链延伸糖基转移酶突变体及其编码基因以及基因工程菌和它们的应用
CN111363709B (zh) 一种提高异戊二烯产量的基因工程菌及其构建方法与应用
US20110262971A1 (en) Genetically Modified E. coli Strains for Producing Erythromycin
Zhang et al. Characterization of the complex involved in regulating V-ATPase activity of the vacuolar and endosomal membrane
KR102194697B1 (ko) 3-히드록시프로피온산 반응 전사인자를 이용한 3-하이드록시프로피온산 선택성 유전자회로 및 이를 이용한 3-히드록시프로피온산 생산 균주의 스크리닝 방법
US8636999B2 (en) Stable plasmid expression vector for bacteria
WO2012067220A1 (fr) Procédé d'expression d'une protéine utile à des taux élevés
WO2021188816A1 (fr) Procédés et systèmes biologiques de découverte et d'optimisation de peptides lasso