WO2024092769A1 - Pili à liaison covalente modifiée et bactéries recombinantes les comprenant - Google Patents
Pili à liaison covalente modifiée et bactéries recombinantes les comprenant Download PDFInfo
- Publication number
- WO2024092769A1 WO2024092769A1 PCT/CN2022/130033 CN2022130033W WO2024092769A1 WO 2024092769 A1 WO2024092769 A1 WO 2024092769A1 CN 2022130033 W CN2022130033 W CN 2022130033W WO 2024092769 A1 WO2024092769 A1 WO 2024092769A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- polypeptide
- gca
- carrier protein
- seq
- spa2
- Prior art date
Links
- 241000894006 Bacteria Species 0.000 title claims description 32
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 217
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 186
- 229920001184 polypeptide Polymers 0.000 claims abstract description 175
- 102000014914 Carrier Proteins Human genes 0.000 claims abstract description 107
- 108010078791 Carrier Proteins Proteins 0.000 claims abstract description 107
- 230000004927 fusion Effects 0.000 claims abstract description 68
- 108010000916 Fimbriae Proteins Proteins 0.000 claims abstract description 30
- 244000005700 microbiome Species 0.000 claims abstract description 10
- 241000186012 Bifidobacterium breve Species 0.000 claims description 106
- 108090000623 proteins and genes Proteins 0.000 claims description 80
- 241000186226 Corynebacterium glutamicum Species 0.000 claims description 75
- 102000040430 polynucleotide Human genes 0.000 claims description 74
- 108091033319 polynucleotide Proteins 0.000 claims description 74
- 239000002157 polynucleotide Substances 0.000 claims description 74
- 150000001413 amino acids Chemical class 0.000 claims description 61
- 101710082149 Major fimbrial subunit Proteins 0.000 claims description 49
- 238000000034 method Methods 0.000 claims description 42
- 241000194035 Lactococcus lactis Species 0.000 claims description 37
- 235000014897 Streptococcus lactis Nutrition 0.000 claims description 37
- 241000193388 Bacillus thuringiensis Species 0.000 claims description 25
- 229940097012 bacillus thuringiensis Drugs 0.000 claims description 25
- 239000013598 vector Substances 0.000 claims description 16
- 239000004280 Sodium formate Substances 0.000 claims description 12
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 11
- 229910052799 carbon Inorganic materials 0.000 claims description 8
- 230000000694 effects Effects 0.000 claims description 8
- 238000000338 in vitro Methods 0.000 claims description 3
- 229910052757 nitrogen Inorganic materials 0.000 claims description 3
- 210000004027 cell Anatomy 0.000 description 106
- 235000001014 amino acid Nutrition 0.000 description 52
- 102000004169 proteins and genes Human genes 0.000 description 46
- 235000018102 proteins Nutrition 0.000 description 43
- 239000013612 plasmid Substances 0.000 description 37
- 230000015572 biosynthetic process Effects 0.000 description 29
- 239000000835 fiber Substances 0.000 description 29
- 230000014509 gene expression Effects 0.000 description 25
- 241000807905 Corynebacterium glutamicum ATCC 14067 Species 0.000 description 24
- 108020004414 DNA Proteins 0.000 description 21
- 125000003275 alpha amino acid group Chemical group 0.000 description 20
- 238000002372 labelling Methods 0.000 description 20
- 239000000872 buffer Substances 0.000 description 19
- 239000012634 fragment Substances 0.000 description 19
- 239000000463 material Substances 0.000 description 18
- UPYKUZBSLRQECL-UKMVMLAPSA-N Lycopene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1C(=C)CCCC1(C)C)C=CC=C(/C)C=CC2C(=C)CCCC2(C)C UPYKUZBSLRQECL-UKMVMLAPSA-N 0.000 description 17
- JEVVKJMRZMXFBT-XWDZUXABSA-N Lycophyll Natural products OC/C(=C/CC/C(=C\C=C\C(=C/C=C/C(=C\C=C\C=C(/C=C/C=C(\C=C\C=C(/CC/C=C(/CO)\C)\C)/C)\C)/C)\C)/C)/C JEVVKJMRZMXFBT-XWDZUXABSA-N 0.000 description 17
- OAIJSZIZWZSQBC-GYZMGTAESA-N lycopene Chemical compound CC(C)=CCC\C(C)=C\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C=C(/C)CCC=C(C)C OAIJSZIZWZSQBC-GYZMGTAESA-N 0.000 description 17
- 239000001751 lycopene Substances 0.000 description 17
- 229960004999 lycopene Drugs 0.000 description 17
- 235000012661 lycopene Nutrition 0.000 description 17
- 238000004519 manufacturing process Methods 0.000 description 17
- 239000002609 medium Substances 0.000 description 17
- 239000002773 nucleotide Substances 0.000 description 17
- 125000003729 nucleotide group Chemical group 0.000 description 17
- ZCIHMQAPACOQHT-ZGMPDRQDSA-N trans-isorenieratene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/c1c(C)ccc(C)c1C)C=CC=C(/C)C=Cc2c(C)ccc(C)c2C ZCIHMQAPACOQHT-ZGMPDRQDSA-N 0.000 description 17
- 238000003384 imaging method Methods 0.000 description 16
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 15
- 239000013078 crystal Substances 0.000 description 15
- 108020001507 fusion proteins Proteins 0.000 description 15
- 102000037865 fusion proteins Human genes 0.000 description 15
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 15
- 229940088598 enzyme Drugs 0.000 description 14
- 239000013615 primer Substances 0.000 description 14
- 102000004190 Enzymes Human genes 0.000 description 13
- 108090000790 Enzymes Proteins 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 13
- 239000000178 monomer Substances 0.000 description 13
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 12
- 238000003917 TEM image Methods 0.000 description 12
- 239000000243 solution Substances 0.000 description 12
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 12
- 241000186227 Corynebacterium diphtheriae Species 0.000 description 11
- 229920002678 cellulose Polymers 0.000 description 11
- 239000001913 cellulose Substances 0.000 description 11
- 238000012217 deletion Methods 0.000 description 11
- 230000037430 deletion Effects 0.000 description 11
- 150000002500 ions Chemical class 0.000 description 11
- 239000000047 product Substances 0.000 description 11
- 238000004885 tandem mass spectrometry Methods 0.000 description 11
- 102000053602 DNA Human genes 0.000 description 10
- 238000002965 ELISA Methods 0.000 description 10
- 230000015556 catabolic process Effects 0.000 description 10
- 238000006731 degradation reaction Methods 0.000 description 10
- 238000002474 experimental method Methods 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- 238000001228 spectrum Methods 0.000 description 10
- 241000283707 Capra Species 0.000 description 9
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 9
- 241000194041 Lactococcus lactis subsp. lactis Species 0.000 description 9
- 235000014969 Streptococcus diacetilactis Nutrition 0.000 description 9
- 238000003556 assay Methods 0.000 description 9
- 239000008103 glucose Substances 0.000 description 9
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 9
- 150000007523 nucleic acids Chemical class 0.000 description 9
- 102220237269 rs201396897 Human genes 0.000 description 9
- 108010076504 Protein Sorting Signals Proteins 0.000 description 8
- 239000000499 gel Substances 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- 238000011002 quantification Methods 0.000 description 8
- 239000011734 sodium Substances 0.000 description 8
- 108091026890 Coding region Proteins 0.000 description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- 101100149471 Rattus norvegicus Sipa1l1 gene Proteins 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 7
- 239000004005 microsphere Substances 0.000 description 7
- 102000039446 nucleic acids Human genes 0.000 description 7
- 108020004707 nucleic acids Proteins 0.000 description 7
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 6
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- 108010059892 Cellulase Proteins 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 102000057297 Pepsin A Human genes 0.000 description 6
- 108090000284 Pepsin A Proteins 0.000 description 6
- 230000003197 catalytic effect Effects 0.000 description 6
- 238000001553 co-assembly Methods 0.000 description 6
- 230000000295 complement effect Effects 0.000 description 6
- 239000003480 eluent Substances 0.000 description 6
- 238000001914 filtration Methods 0.000 description 6
- 238000009396 hybridization Methods 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 230000037431 insertion Effects 0.000 description 6
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 6
- 229940111202 pepsin Drugs 0.000 description 6
- 229920000642 polymer Polymers 0.000 description 6
- 239000001818 polyoxyethylene sorbitan monostearate Substances 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 239000000758 substrate Substances 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 5
- 241001670248 Saccharophagus degradans Species 0.000 description 5
- 241000499912 Trichoderma reesei Species 0.000 description 5
- 125000004429 atom Chemical group 0.000 description 5
- 108010047754 beta-Glucosidase Proteins 0.000 description 5
- 102000006995 beta-Glucosidase Human genes 0.000 description 5
- 238000004624 confocal microscopy Methods 0.000 description 5
- 230000029087 digestion Effects 0.000 description 5
- 101150056470 dxs gene Proteins 0.000 description 5
- 239000013604 expression vector Substances 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 238000006116 polymerization reaction Methods 0.000 description 5
- 238000004445 quantitative analysis Methods 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 230000001131 transforming effect Effects 0.000 description 5
- 238000005406 washing Methods 0.000 description 5
- YMHOBZXQZVXHBM-UHFFFAOYSA-N 2,5-dimethoxy-4-bromophenethylamine Chemical compound COC1=CC(CCN)=C(OC)C=C1Br YMHOBZXQZVXHBM-UHFFFAOYSA-N 0.000 description 4
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 102220518600 Baculoviral IAP repeat-containing protein 6_C97A_mutation Human genes 0.000 description 4
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 4
- 108050001049 Extracellular proteins Proteins 0.000 description 4
- BZLVMXJERCGZMT-UHFFFAOYSA-N Methyl tert-butyl ether Chemical compound COC(C)(C)C BZLVMXJERCGZMT-UHFFFAOYSA-N 0.000 description 4
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 4
- 239000006180 TBST buffer Substances 0.000 description 4
- 241000545067 Venus Species 0.000 description 4
- 239000000853 adhesive Substances 0.000 description 4
- 238000001042 affinity chromatography Methods 0.000 description 4
- ANVAOWXLWRTKGA-XHGAXZNDSA-N all-trans-alpha-carotene Chemical compound CC=1CCCC(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1C(C)=CCCC1(C)C ANVAOWXLWRTKGA-XHGAXZNDSA-N 0.000 description 4
- 238000000089 atomic force micrograph Methods 0.000 description 4
- 238000006664 bond formation reaction Methods 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 229960005091 chloramphenicol Drugs 0.000 description 4
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 239000011521 glass Substances 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 230000001717 pathogenic effect Effects 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 102200066074 rs387906774 Human genes 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- ZFXYFBGIUFBOJW-UHFFFAOYSA-N theophylline Chemical compound O=C1N(C)C(=O)N(C)C2=C1NC=N2 ZFXYFBGIUFBOJW-UHFFFAOYSA-N 0.000 description 4
- 238000001269 time-of-flight mass spectrometry Methods 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 244000063299 Bacillus subtilis Species 0.000 description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 description 3
- 241000754941 Bacillus thuringiensis LM1212 Species 0.000 description 3
- 241000193372 Bacillus thuringiensis serovar alesti Species 0.000 description 3
- 241000193370 Bacillus thuringiensis serovar tolworthi Species 0.000 description 3
- 241000277609 Bifidobacterium breve 12L Species 0.000 description 3
- 241000277623 Bifidobacterium breve 689b Species 0.000 description 3
- 241000025031 Bifidobacterium breve ACS-071-V-Sch8b Species 0.000 description 3
- 241000741973 Bifidobacterium breve DSM 20213 = JCM 1192 Species 0.000 description 3
- 241000277612 Bifidobacterium breve JCM 7017 Species 0.000 description 3
- 241000277615 Bifidobacterium breve NCFB 2258 Species 0.000 description 3
- 241000302944 Bifidobacterium breve S27 Species 0.000 description 3
- 241000003117 Bifidobacterium breve UCC2003 Species 0.000 description 3
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 3
- 241000671338 Corynebacterium glutamicum R Species 0.000 description 3
- 238000007702 DNA assembly Methods 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 235000000375 Lactococcus lactis subsp cremoris MG1363 Nutrition 0.000 description 3
- 244000063988 Lactococcus lactis subsp cremoris NZ9000 Species 0.000 description 3
- 235000012521 Lactococcus lactis subsp cremoris NZ9000 Nutrition 0.000 description 3
- 235000001252 Lactococcus lactis subsp lactis bv diacetylactis Nutrition 0.000 description 3
- 241001223921 Lactococcus lactis subsp. cremoris A76 Species 0.000 description 3
- 241000208789 Lactococcus lactis subsp. cremoris IBB477 Species 0.000 description 3
- 241001017508 Lactococcus lactis subsp. cremoris MG1363 Species 0.000 description 3
- 241001374059 Lactococcus lactis subsp. lactis IO-1 Species 0.000 description 3
- 241000168725 Lactococcus lactis subsp. lactis bv. diacetylactis Species 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 230000000903 blocking effect Effects 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 239000001768 carboxy methyl cellulose Substances 0.000 description 3
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 3
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 3
- 229940105329 carboxymethylcellulose Drugs 0.000 description 3
- -1 cellulose Chemical class 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 239000013256 coordination polymer Substances 0.000 description 3
- 239000013613 expression plasmid Substances 0.000 description 3
- 235000019253 formic acid Nutrition 0.000 description 3
- 108091008053 gene clusters Proteins 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 101150109249 lacI gene Proteins 0.000 description 3
- 239000003446 ligand Substances 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- LWGJTAZLEJHCPA-UHFFFAOYSA-N n-(2-chloroethyl)-n-nitrosomorpholine-4-carboxamide Chemical compound ClCCN(N=O)C(=O)N1CCOCC1 LWGJTAZLEJHCPA-UHFFFAOYSA-N 0.000 description 3
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 241000193755 Bacillus cereus Species 0.000 description 2
- 229920002749 Bacterial cellulose Polymers 0.000 description 2
- 239000002028 Biomass Substances 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 241000010804 Caulobacter vibrioides Species 0.000 description 2
- 108010084185 Cellulases Proteins 0.000 description 2
- 102000005575 Cellulases Human genes 0.000 description 2
- 108090000317 Chymotrypsin Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- 108091029865 Exogenous DNA Proteins 0.000 description 2
- 241000192125 Firmicutes Species 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 101000693878 Ideonella sakaiensis (strain NBRC 110686 / TISTR 2288 / 201-F6) Poly(ethylene terephthalate) hydrolase Proteins 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- 108010033276 Peptide Fragments Proteins 0.000 description 2
- 102000007079 Peptide Fragments Human genes 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 101100114901 Streptomyces griseus crtI gene Proteins 0.000 description 2
- 238000000692 Student's t-test Methods 0.000 description 2
- 108090000631 Trypsin Proteins 0.000 description 2
- 102000004142 Trypsin Human genes 0.000 description 2
- 101000693873 Unknown prokaryotic organism Leaf-branch compost cutinase Proteins 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 2
- 230000001070 adhesive effect Effects 0.000 description 2
- 239000011795 alpha-carotene Substances 0.000 description 2
- ANVAOWXLWRTKGA-HLLMEWEMSA-N alpha-carotene Natural products C(=C\C=C\C=C(/C=C/C=C(\C=C\C=1C(C)(C)CCCC=1C)/C)\C)(\C=C\C=C(/C=C/[C@H]1C(C)=CCCC1(C)C)\C)/C ANVAOWXLWRTKGA-HLLMEWEMSA-N 0.000 description 2
- 235000003903 alpha-carotene Nutrition 0.000 description 2
- 229910021529 ammonia Inorganic materials 0.000 description 2
- 238000004873 anchoring Methods 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 239000005016 bacterial cellulose Substances 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000005842 biochemical reaction Methods 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 235000021466 carotenoid Nutrition 0.000 description 2
- 150000001747 carotenoids Chemical class 0.000 description 2
- 238000010523 cascade reaction Methods 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 229960002376 chymotrypsin Drugs 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 2
- 101150000046 crtE gene Proteins 0.000 description 2
- 238000002425 crystallisation Methods 0.000 description 2
- 230000008025 crystallization Effects 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 150000004985 diamines Chemical class 0.000 description 2
- 238000009792 diffusion process Methods 0.000 description 2
- 229910001873 dinitrogen Inorganic materials 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 238000007380 fibre production Methods 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 244000052637 human pathogen Species 0.000 description 2
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 2
- 238000009630 liquid culture Methods 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 229910052759 nickel Inorganic materials 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 239000004033 plastic Substances 0.000 description 2
- 229920003023 plastic Polymers 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 238000012207 quantitative assay Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 101150073162 spa1 gene Proteins 0.000 description 2
- 230000006641 stabilisation Effects 0.000 description 2
- 238000011105 stabilization Methods 0.000 description 2
- 230000000087 stabilizing effect Effects 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 230000005469 synchrotron radiation Effects 0.000 description 2
- 238000012353 t test Methods 0.000 description 2
- 229960000278 theophylline Drugs 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 239000012588 trypsin Substances 0.000 description 2
- MAKBWIUHFAVVJP-HAXARLPTSA-N (2R,3S)-pentane-1,2,3,4-tetrol phosphoric acid Chemical compound OP(O)(O)=O.CC(O)[C@H](O)[C@H](O)CO MAKBWIUHFAVVJP-HAXARLPTSA-N 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-L (R)-2-Hydroxy-3-(phosphonooxy)-propanal Natural products O=C[C@H](O)COP([O-])([O-])=O LXJXRIRHZLFYRP-VKHMYHEASA-L 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- 241001156739 Actinobacteria <phylum> Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 1
- 239000007989 BIS-Tris Propane buffer Substances 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 210000003967 CLP Anatomy 0.000 description 1
- 101150082297 Clp gene Proteins 0.000 description 1
- 241001485655 Corynebacterium glutamicum ATCC 13032 Species 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-N D-glyceraldehyde 3-phosphate Chemical compound O=C[C@H](O)COP(O)(O)=O LXJXRIRHZLFYRP-VKHMYHEASA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 238000012270 DNA recombination Methods 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 238000010268 HPLC based assay Methods 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 108091006054 His-tagged proteins Proteins 0.000 description 1
- 241001596500 Komagataeibacter rhaeticus Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- 239000006391 Luria-Bertani Medium Substances 0.000 description 1
- JPNRPAJITHRXRH-BQBZGAKWSA-N Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O JPNRPAJITHRXRH-BQBZGAKWSA-N 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 241000588912 Pantoea agglomerans Species 0.000 description 1
- 229930040373 Paraformaldehyde Natural products 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 229920002562 Polyethylene Glycol 3350 Polymers 0.000 description 1
- 108010026552 Proteome Proteins 0.000 description 1
- 101800001693 R-peptide Proteins 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 101710099182 S-layer protein Proteins 0.000 description 1
- 239000012722 SDS sample buffer Substances 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 101710183296 Surface layer protein Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- COQLPRJCUIATTQ-UHFFFAOYSA-N Uranyl acetate Chemical compound O.O.O=[U]=O.CC(O)=O.CC(O)=O COQLPRJCUIATTQ-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 206010002022 amyloidosis Diseases 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 238000010009 beating Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010378 bimolecular fluorescence complementation Methods 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 239000002551 biofuel Substances 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 238000013406 biomanufacturing process Methods 0.000 description 1
- 229920001222 biopolymer Polymers 0.000 description 1
- HHKZCCWKTZRCCL-UHFFFAOYSA-N bis-tris propane Chemical compound OCC(CO)(CO)NCCCNC(CO)(CO)CO HHKZCCWKTZRCCL-UHFFFAOYSA-N 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 229920003123 carboxymethyl cellulose sodium Polymers 0.000 description 1
- 229940063834 carboxymethylcellulose sodium Drugs 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 101150038575 clpS gene Proteins 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000002577 cryoprotective agent Substances 0.000 description 1
- 238000002447 crystallographic data Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000008021 deposition Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 108010003914 endoproteinase Asp-N Proteins 0.000 description 1
- RDYMFSUJUZBWLH-UHFFFAOYSA-N endosulfan Chemical compound C12COS(=O)OCC2C2(Cl)C(Cl)=C(Cl)C1(Cl)C2(Cl)Cl RDYMFSUJUZBWLH-UHFFFAOYSA-N 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000002073 fluorescence micrograph Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 238000002546 full scan Methods 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 235000021472 generally recognized as safe Nutrition 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 244000000059 gram-positive pathogen Species 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- NUHSROFQTUXZQQ-UHFFFAOYSA-N isopentenyl diphosphate Chemical compound CC(=C)CCO[P@](O)(=O)OP(O)(O)=O NUHSROFQTUXZQQ-UHFFFAOYSA-N 0.000 description 1
- QMZRXYCCCYYMHF-UHFFFAOYSA-N isopentenyl phosphate Chemical compound CC(=C)CCOP(O)(O)=O QMZRXYCCCYYMHF-UHFFFAOYSA-N 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000010445 mica Substances 0.000 description 1
- 229910052618 mica group Inorganic materials 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 238000005319 nano flow HPLC Methods 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000012261 overproduction Methods 0.000 description 1
- 229920002866 paraformaldehyde Polymers 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000013777 protein digestion Effects 0.000 description 1
- 230000006920 protein precipitation Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000004043 responsiveness Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 229910052938 sodium sulfate Inorganic materials 0.000 description 1
- 235000011152 sodium sulphate Nutrition 0.000 description 1
- 108090000250 sortase A Proteins 0.000 description 1
- 238000012409 standard PCR amplification Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 230000000153 supplemental effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 150000003505 terpenes Chemical class 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 238000004627 transmission electron microscopy Methods 0.000 description 1
- 239000003656 tris buffered saline Substances 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/34—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Corynebacterium (G)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
Definitions
- the present disclosure relates to biological engineering.
- the present disclosure relates to engineered bacteria, such as Corynebacterium glutamicum comprising modified covalently-linked pili (CLP) .
- CLP covalently-linked pili
- the engineered living materials relate to engineered biomaterials with distinctive “living” attributes such as autonomous growth, self-healing and environmental responsiveness that are only found in natural living materials, a wide range of remarkable ELMs had been developed for the applications in biosensors, bioremediation, biomedicine, biomanufacturing, wearable devices, and electronics.
- ELMs can be produced either by harnessing engineered cells to simultaneously make the material and incorporate novel functionalities into it (known as self-organizing living materials or biological ELMs) or by embedding living cells in an organic or inorganic matrix (referred to as hybrid living materials) .
- Self-organizing living materials aim to recapitulate the autonomous, adaptive, and versatile properties of natural living materials, and represent opportunities to harness engineered biological systems for new capabilities.
- Some Gram-positive bacteria comprise covalently-linked pili (CLP) .
- CLP covalently-linked pili
- the CLP monomer subunits are typically joined via intermolecular isopeptide bond catalyzed by sortase conferring enormous tensile strength (McConnell, S. A. et al., Protein labeling via a specific lysine-isopeptide bond using the pilin polymerizing sortase from Corynebacterium diphtheriae. J. Am.
- the CLP subunits contain auto-catalyzed intramolecular isopeptide bonds that are less susceptible to proteolytic cleavage and can dissipate mechanical energy (Ramirez, N.A. et al., 2020) imparting the robustness of CLP.
- several pilin proteins in the CLP structure of different strains contain additional disulfide bonds that further enhance stability (Kang, H. J. et al., The Corynebacterium diphtheriae shaft pilin SpaA is built of tandem Ig-like modules with stabilizing isopeptide and disulfide bonds. Proc. Natl. Acad. Sci. U.S.A. 106, 16967-16971, 2009) .
- the inventors develop an integrative technological platform for ELMs based on the discovary of the biosynthetic gene cluster (BGC) of the covalently-linked pili (CLP) fiber in the industrial workhorse Corynebacterium glutamicum.
- BGC biosynthetic gene cluster
- CLP covalently-linked pili
- the present disclosure provides a fusion polypeptide comprising a carrier protein and a polypeptide of interest, wherein the polypeptide of interest is fused to a terminus of the carrier protein or inserted into the carrier protein, and wherein the carrier protein is a pilin of covalently-linked pili (CLP) from a microorganism.
- CLP covalently-linked pili
- the microorganism is a gram-positive bacterium, such as a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
- the carrier protein is a major pilin.
- the polypeptide of interest is fused to a terminus of the carrier protein. In some embodiments, the polypeptide of interest is fused to the N terminus of the carrier protein.
- the polypeptide of interest is inserted into the carrier protein. In some embodiments, the polypeptide of interest is inserted into a loop in the carrier protein.
- the carrier protein is a major pilin from Corynebacterium glutamicum.
- the polypeptide of interest is inserted into the M domain of the major pilin.
- the polypeptide of interest replaces the M domain of the major pilin or a part thereof.
- the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4.
- the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between positions corresponding to G215 and L216 of SEQ ID NO: 1, between positions corresponding to G236 and E237 of SEQ ID NO: 1, or between positions corresponding to G336 and T337 of SEQ ID NO: 1.
- the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1.
- the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between G215 and L216, between G236 and E237, or between G336 and T337 of SEQ ID NO: 1.
- the present disclosure provides a polynucleotide encoding the fusion polypeptide of the present disclosure, and a vector comprising the polynucleotide, as well as a host cell comprising the polypeptide, the polynucleotide or the vector of the present disclosure.
- the present disclosure provides a recombinant cell comprising a polynucleotide encoding a fusion polypeptide, wherein the fusion polypeptide comprises a carrier protein and a polypeptide of interest, wherein the polypeptide of interest is fused to a terminus of the carrier protein or inserted into the carrier protein, wherein the carrier protein is a pilin of CLP, and wherein the recombinant cell is capable of expressing the polynucleotide and displaying a modified CLP comprising the fusion polypeptide.
- the recombinant cell is a gram-positive bacterium, such as a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
- the carrier protein is a major pilin.
- the polypeptide of interest is fused to a terminus of the carrier protein. In some embodiments, the polypeptide of interest is fused to the N terminus of the carrier protein.
- the polypeptide of interest is inserted into the carrier protein. In some embodiments, the polypeptide of interest is inserted into a loop in the carrier protein.
- the carrier protein is a major pilin from Corynebacterium glutamicum.
- the polypeptide of interest is inserted into the M domain of the major pilin.
- the polypeptide of interest replaces the M domain of the major pilin or a part thereof.
- the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4.
- the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between positions corresponding to G215 and L216 of SEQ ID NO: 1, between positions corresponding to G236 and E237 of SEQ ID NO: 1, or between positions corresponding to G336 and T337 of SEQ ID NO: 1.
- the carrier protein comprises amino acids 35-509 of SEQ ID NO: 1, and the polypeptide of interest is fused to the N terminus of carrier protein, or is inserted between G215 and L216, between G236 and E237, or between G336 and T337 of SEQ ID NO: 1.
- the recombinant cell comprises two or more polynucleotide respectively encoding two or more fusion polypeptides each comprising a different polypeptide of interest, and the modified CLP comprises the two or more polypeptides.
- the present disclosure provides a method of preparing the recombinant cell of present disclosure, comprising introducing a polynucleotide encoding the fusion polypeptide of the present disclosure into a host cell derived from a microorganism having CLP.
- the host cell is knock-out of native major pilin.
- the method comprises a step of native major pilin knock-out.
- the present disclosure provides a modified covalently-linked pili (CLP) comprising a plurality of the fusion polypeptides of the present disclosure.
- CLP covalently-linked pili
- the present disclosure provides a method of preparing a modified CLP comprising the steps of
- the fusion polypeptide is provided by transcribing and/or translalting the polynucleotide of the present disclosure.
- the activity of sortase is provided by transcribing and/or translalting one or more polynucleotides encoding a sortase.
- the sortase is encoded by a gene which is identified to be present in the same cluster with the gene encoding the carrier protein in nature.
- the sortase is class C type sortase, such as srtC1 and/or srtC2, preferably wherein the srtC1 and srtC2 are encoded by genes from the same cluster.
- the method is an in vitro method.
- the present disclosure provides a polynucleotide construct or a combination of polynucleotide constructs comprising the polynucleotide of the present disclosure, and one or more polynucleotides encoding a sortase.
- the sortase is encoded by a gene which is identified to be present in the same cluster with the gene encoding the carrier protein in nature.
- the sortase is class C type sortase, such as srtC1 and/or srtC2, preferably wherein the srtC1 and srtC2 are encoded by genes from the same cluster.
- Fig. 1 shows the map of plasmid pEK-spa2.
- Fig. 2 shows the workflow for constructing the tandem of two cassettes.
- Fig. 3 shows the maps of plasmids comprising the tandem of two cassettes.
- Fig. 4 shows the map of plasmid pZ9-dxs_crtEBI.
- Fig. 5 shows the map of plasmid pET-28a-Spa2.
- Fig. 6 shows the Cg CLP biosynthetic gene cluster (BGC) encoding the sortase genes srtC1 and srtC2, and the sortase-catalyzed pilin genes spa1, spa2, and spa3.
- BGC Cg CLP biosynthetic gene cluster
- Fig. 7 is the TEM and AFM images showing that the major pilin Spa2 is indispensable for Cg CLP fiber structure formation.
- the bars in the TEM and AFM images are 200 nm and 400 nm, respectively
- Fig. 8 shows the identification of the composition of CLP in C. glutamicum (CgCLP) by immunogold labelling.
- the cartoon shows that Cg CLP fibers comprise two minor pilins (Spa1 and Spa3) and a major pilin of Spa2.
- the immunogold labelling and TEM images show the constitution and distribution of Cg CLP pilins indicating that Spa2 is the major pilin.
- For single immunogold labelling of Cg CLP with primary polyclonal antibodies of Spa1, Spa2, and Spa3 ( ⁇ -Spa1, ⁇ -Spa2, and ⁇ -Spa3, respectively) ; gold-decorated goat anti-rabbit IgG was used as the secondary antibody for labelling target pilin.
- Fig. 9 shows the deletion of both the srtC1 and srtC2 genes abrogates pili formation.
- the bars in the TEM (a) and AFM (b) images are 200 nm and 400 nm, respectively.
- ⁇ -Spa2 is the primary antibody
- the 10 nm gold-decorated goat anti-rabbit IgG is the secondary antibody.
- Each ELISA experiment was performed at least in triplicate, and the standard error was shown.
- Fig. 10 shows the isolation of Cg CLP fibers for mass spectrometry analysis.
- SDS-PAGE gel electrophoresis analysis of the nickel affinity chromatography purified Cg CLP fibers showed the high-molecular Cg CLP polymers were eluted under 100 mM imidazole.
- Fig. 11 shows the identification of intermolecular isopeptide bonds for the polymerization of Spa2 monomers in Cg CLP. Fragmentation spectra of the parent ion at m/z 832.9 2+ containing the intermolecular isopeptide bond (green font) between Spa2 i Lys194 (blue font) and Spa2 i+1 Thr477 (red font) are shown.
- Fig. 12 shows the liquid chromatography-tandem mass spectrometry (LC-MS/MS) identifies the signal peptide of Spa2.
- the cartoon shows the amino acid sequence of Spa2 cut (replacing the 470-509 residues at the C-terminus of Spa2 with 6His) , enabling the Spa2 monomer not to be polymerized and to be secreted as a monomer in the medium.
- SDS-PAGE gel electrophoresis indicates the purified Spa2 cut .
- the LC-MS/MS identified that the residues 1-34 at the N-terminus of Spa2 are the signal peptide.
- This figure shows an MS/MS spectrum of the peptide with m/z 916.4538 2+ generated from chymotrypsin digest of Spa2.
- Predicted b-and y-type ions (not all included) are listed above and below the peptide sequence, respectively. Matched ions are labelled in the spectrum.
- Fig. 13 shows the Quadrupole time-of-flight mass spectrometry measured the accurate molecular weight of Spa2 cut .
- the measured molecular weight is ⁇ 54.7 Da less than the calculated value of Spa2 cut , indicating that three intramolecular isopeptide bonds and two disulfide bonds exist in the monomeric Spa2.
- An intramolecular isopeptide bond formation will lose one molecule of ammonia, ⁇ 17 Da; A disulfide bond formation will lose two hydrogen atoms, ⁇ 2 Da.
- Fig. 14 shows crystals of Spa2 diffracted to resolution on the BL18U1 beamline at the Shanghai Synchrotron Radiation Facility (Shanghai, China) .
- Fig. 15 shows the X-ray crystal structure of Spa2 which is arranged in three tandem Ig-like domains, N-domain (pink) , M-domain (blue) , and C-domain (green) . Residues involved in the formation of three intramolecular isopeptide bonds (yellow) and two disulfide bonds (red) are shown as sticks.
- Fig. 16 shows the comparison of Spa2 in the crystal structure with the prediction from AlphaFold2 and crystal structure of 3HR6 and 4HSS.
- C ⁇ alpha-carbon
- RMSD root-mean-square deviation
- Fig. 17 shows the Omit electron density maps showing the presence of internal covalent bonds in the crystal structure of Spa2.2mFo-DFc omit electron density maps of three isopeptide bonds (a) and two disulfide bonds (b) were shown in blue mesh, contoured at 1.0 ⁇ .
- the omit electron density maps were generated using Phenix composite omit map.
- Fig. 18 shows Identification of the disulfide bonds and intramolecular isopeptide bonds formation at appropriate sequence locations in Spa2 by LC-MS/MS analysis.
- the cartoon shows the critical features in Spa2, including three intramolecular isopeptide bonds in individual domains, two disulfide bonds in the N-domain (C97-C128) and the C-domain (C380-C432) , the pilin motif of YPKN in N-domain, and the sortase cleavage sorting signal motif of LPLTG in C-domain.
- Figs. 19 and 20 show the genetic manipulation in ⁇ spa2 strains (harboring a plasmid that expressed Spa2 or Spa2 variants of K194A, LPLTG 474LALAA478 , E158A, D246A, E435A, D246A/E435A, C97A, C380A, and C97A/C380A, respectively) to assess the key residues promoting the formation of inter-and intra-molecular isopeptide bonds, and disulfide bonds, in Spa2 by TEM bio-imaging (Fig. 19) and quantitative analysis of the amount of Cg CLP fiber by whole-cell filtration ELISA (detection by anti-Spa2 antibody) (Fig. 20) .
- Results are presented as mean ⁇ s.d in Fig. 20.
- Not significant (NS) P >0.05, *P ⁇ 0.05, **P ⁇ 0.01, ***P ⁇ 0.001, ****P ⁇ 0.0001.
- Statistics were derived using a t-test. The bars in Fig. 19 are 200nm.
- Fig. 21 shows the accurate molecular weight of Spa2 cut mutant variants determined by quadrupole time-of-flight mass spectrometry.
- the measured molecular weight of E158A cut (a) , D246A cut (b) , E435A cut (c) , and D246A/E435A cut (d) are ⁇ 54.9, 37.3, 21.4, and 4.0 Da less than the calculated value of related variants, indicating that three, two, one and no intramolecular isopeptide bonds are retained in the corresponding monomeric mutants, respectively.
- Spa2cut mutant variants E158A cut , D246A cut , E435A cut , and D246A/E435A cut were expressed in ⁇ spa2 and purified by nickel-affinity chromatography.
- Fig. 22 shows the rational engineering of the Cg CLP protein scaffold through a modular genetic design strategy: the cartoon shows a polymerized Spa2 major pilin functionalized by incorporating a protein-of-interest (POI) (e.g., mCherry, a fluorescent reporter protein) at candidate insertion sites (including Q35 (E1) at the N-terminus, and G215 (E2) , G236 (E3) and G336 (E4) in the M-domain lacking a disulfide bond) based on structural verification.
- POI protein-of-interest
- Fig. 24 shows the TEM morphologies of the assembled mCherry-Spa2 fusion proteins associated with cell surfaces based on immunogold labelling.
- TEM images of ⁇ spa2 cells (a) , E1 cells (b) , E2 cells (c) , E3 cells (d) and E4 cells (e) .
- the TEM samples were collected from the ⁇ spa2 strain harboring a plasmid that expresses various mCherry-Spa2 fusions under the native constitutive promoter of the spa2 gene.
- ⁇ -Spa2 is the primary antibody
- the 10 nm gold-decorated goat anti-rabbit IgG is the secondary antibody. Scale bars, 200 nm.
- Fig. 25 shows the extracellular secretion and assembly of R-Spa2 pilins into CgCLP fiber at the cell-surfaces of engineered C. glutamicum cells: a series of R-Spa2 fusion protein constructs comprising functional R peptides/proteins with different amino acid sequences.
- Fig. 27 shows the Functional characterization of engineered Cg CLP with various fusion domains.
- (a) TEM images showed that Ni-NTA-decorated AuNPs were anchored onto 6His-Spa2 Cg CLP.
- (b) Confocal microscopic images showed the green fluorescence emitted from SpyTag-Spa2 Cg CLP cells to which SpyCatcher-EGFP protein binding partners were covalently attached via Spytag-SpyCatcher interaction pairs.
- (c) Confocal microscopic images show the green fluorescence emitted from SpyCatcher-Spa2 Cg CLP cells to which SpyTag-EGFP protein binding partners were covalently attached via Spytag-SpyCatcher interaction pairs.
- Fig. 28 shows the schematic showing simultaneous expression of the two Spa2 pilin fusion proteins, N-Ven-Spa2 and C-Ven-Spa2 (N-Ven-Spa2+C-Ven-Spa2 strain) , containing the N-terminus (N-Ven) and C-terminus (C-Ven) module of the split-Venus system, resulting in co-assembly of the split-Venus components into the final functional Cg CLP structures.
- Fig. 29 shows the TEM morphologies of the assembled split-Venus components fused with Spa2 associated with cell surfaces based on immunogold labelling.
- N-Ven+C-Ven cells expressing co-secreted split-Venus system (a) , N-Ven-Spa2 cells expressing the Spa2 pilin fusion protein of N-Venus-Spa2 (b) , C-Ven-Spa2 cells expressing the Spa2 pilin fusion protein of C-Venus-Spa2 (c) , and N-Ven-Spa2+C-Ven-Spa2 cells for simultaneous expression of two Spa2 pilin fusion proteins, N-Ven-Spa2 and C-Ven-Spa2 (d) .
- TEM samples were collected from the ⁇ spa2 strain harboring a plasmid that expresses various Spa2 fusion proteins under the native constitutive promoter of the spa2 gene.
- ⁇ -Spa2 is the primary antibody
- 10 nm gold-decorated goat anti-rabbit IgG is the secondary antibody.
- Scale bars 200 nm.
- Fig. 30 shows the co-assembly of split-Venus components into the Cg CLP fibers leading to increased fluorescence intensity.
- the engineered C. glutamicum cells show greater fluorescence intensity only in the N-Ven-Spa2+C-Ven-Spa2 strain, and
- (b) confocal microscopy of C. glutamicum cells showing that the strongest Venus fluorescence signal appeared at the extracellular sites of the N-Ven-Spa2+C-Ven-Spa2 strain (scale bar 2 ⁇ m) .
- Fig. 31 shows the schematic illustrating of engineered C. glutamicum living materials transforming cellulosic biomass into a value-added product of lycopene by combining the extracellular cellulose degradation capacity and intracellular bioconversion ability.
- extracellular cellulose degradation (Step1) , endo-1, 4- ⁇ -glucanase from T. reesei (TrEgl) and a ⁇ -glucosidase from S.
- SdBgl Spa2 pilin
- TrEgl-Spa2+SdBgl-Spa2 Spa2 pilin
- Step2 the glucose was used for lycopene production in the pathway engineered C. glutamicum of C003 strain by inducing IPTG.
- G3P glyceraldehyde-3-phosphate
- IPP isopentenyl phosphate.
- Fig. 32 shows the lycopene production from biowastes with engineered C. glutamicum harboring modified CLPs.
- a TEM images show that cells of C003, which contain the P2 plasmid, enabled co-assembly of TrEgl and SdBgl into Cg CLP structure, while the cells of C001, C002, and C004 did not.
- Cg CLP was labeled with 10 nm gold particles by immunogold labelling. Scale bars, 200 nm.
- ELMs can degrade CMC-Na in a medium from a viscous gel to a thin solution only when both TrEgl and SdBgl were co-assembled into the CgCLP structure (TrEgl-Spa2+SdBgl-Spa2, C003 strain) , outperforming the case of the secreted free enzymes (TrEgl+SdBgl, C004 strain) .
- ⁇ spa2 ⁇ dec (C001 strain) is the negative control strain.
- the C003 strain showed 4-fold higher enzymeactivity than the C004 strain.
- covalently-linked pili or “CLP” refers to pili in which the monomers are linked to each other via covalent bonds.
- the engineered living materials herein refers to the pili formed by the engineered monomers, i.e., the fusion polypeptide of the present disclosure, or recombinant bacterium forming the pili.
- C. glutamicum a Gram-positive bacterium
- GRAS general regarded as safe
- peptide can be exchanged with “polypeptide” and “protein” , means a chain comprising at least two amino acids linked by peptide bond, such as ten or more amino acid residues.
- the chemical formulas or sequences of all the peptides and polypeptide herein are written in left-to-right order, showing the direction from the amino terminal to the carboxyl terminal.
- “Peptide” , “polypeptide” and “protein” can include, but are not limited to, an enzyme, an antibody, a hormone, a ligand, a receptor, etc.
- amino acid includes amino acids naturally occurred in proteins and the unnatural amino acids.
- the conventional nomenclature one-letter and three-letter of the amino acids naturally occurred in proteins is employed, which can be seen in Sambrook, et al. (Molecular Cloning: A Laboratory Manual, 2nd, ed. Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1989) .
- fusion polypeptide is a recombinant product comprising two or more peptide fragments which are not present in a single natural polypeptide.
- the fragments can be fused directly or via a linker, such as a flexible linker, e.g., GS linkers.
- a fusion polypeptide can be produced by the expression of a polynucleotide comprising nucleotide sequences encoding the two or more peptide fragments and the linker, if present, in desired order.
- polynucleotide usually refers to generally a nucleic acid molecule (e.g., 100 nucleotides and up to 30k nucleotides in length) and a sequence that is either complementary (antisense) or identical (sense) to the sequence of a messenger RNA (mRNA) or miRNA fragment or molecule.
- mRNA messenger RNA
- miRNA fragment or molecule usually refers to DNA or RNA molecules that are either transcribed or non-transcribed.
- polynucleotide construct refers to a single-stranded or double-stranded polynucleotide, which is isolated from a naturally occurring gene or modified to contain a nucleic acid segment that does not naturally occur.
- polynucleotide construct contains the control sequences required to express the coding sequence of the present disclosure, the polynucleotide construct comprises an “expression cassette” .
- exogenous polynucleotide refers to a nucleotide sequence that does not originate from the host in which it is placed. It may be identical or heterologous to the host’s DNA. An example is a sequence of interest inserted into a vector. Such exogenous DNA sequences may be derived from a variety of sources including DNA, cDNA, synthetic DNA, and RNA. Exogenous polynucleotides also encompass DNA sequences that encode antisense oligonucleotides.
- expression cassette refers to a polynucleotide segment comprising a polynucleotide encoding a polypeptide operably linked to additional nucleotides provided for the expression of the polynucleotide, for example, control sequence.
- the term “encoding” means that a polynucleotide directly specifies the amino acid sequence of its protein product.
- the boundaries of the coding sequence are generally determined by an open reading frame, which generally starts with the ATG start codon or other start codons such as GTG and TTG, and ends with a stop codon such as TAA, TAG and TGA.
- the coding sequence can be a DNA, cDNA or recombinant nucleotide sequence.
- expression includes any step involved in the production of a polypeptide, including but not limited to transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
- control sequence includes all elements necessary or beneficial for the expression of the polynucleotide encoding the polypeptide of the present disclosure.
- Each control sequence may be natural or foreign to the nucleotide sequence encoding the polypeptide, or natural or foreign to each other.
- control sequences include, but are not limited to, leader sequence, polyadenylation sequence, propeptide sequence, promoter, enhancer, signal peptide sequence, and transcription terminator.
- control sequences include a promoter and signals for the termination of transcription and translation.
- control sequence may be a suitable promoter sequence, a nucleotide sequence recognized by the host cell to express the polynucleotide encoding the polypeptide of the present disclosure.
- the promoter sequence contains a transcription control sequence that mediates the expression of the polypeptide.
- the promoter may be any nucleotide sequence that exhibits transcriptional activity in the selected host cell, for example, lac operon of E. coli.
- the promoters also include mutant, truncated and hybrid promoters, and can be obtained from genes encoding extracellular or intracellular polypeptides, which are homologous or heterologous to the host cell.
- operably linked refers to a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of the polynucleotide sequence, whereby the control sequence directs the expression of the polypeptide coding sequence.
- the polynucleotide encoding a polypeptide of interest can be subjected to various manipulations to improve the expression of the polypeptide. Before the insertion thereof into a vector, manipulation of the polynucleotide according to the expression vector or the host, such as codon optimization, is desirable or necessary. Techniques for modifying polynucleotide sequences with recombinant DNA methods are well known in the art.
- recombinant refers to nucleic acids, vectors, polypeptides, or proteins that have been generated using DNA recombination (cloning) methods and are distinguishable from native or wild-type nucleic acids, vectors, polypeptides, or proteins.
- hybridization that nucleotides sequences, which are at least about 90%, preferably at least about 95%, more preferably at least about 96%, and more preferably at least 98%homologous to each other, generally maintain hybridization with each other under given stringent hybridization and washing conditions.
- the sequences are aligned for the purpose of optimal comparison (e.g., a gap can be introduced into the first amino acid or nucleic acid sequence for the optimal alignment with the second amino acid or nucleic acid sequence) . Then, the amino acid residues or nucleotides at the corresponding amino acid positions or nucleotide positions are compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide at the corresponding position in the second sequence, these molecules are identical at this position.
- the two sequences are identical in length.
- Identity percentage or “sequence identity percentage” refers to the comparison between the amino acids of two polypeptides or nucleotides between two polynucleotides, and when optimally aligned, the two polypeptides or polynucleotides have approximately the specified percentage of identical amino acids.
- 95% identity refers to the comparison between the amino acids of two polypeptides or nucleotides between two polynucleotides, and when optimally aligned, 95%of the amino acids in the two polypeptides or 95%of the nucleotides in the two polynucleotides are identical.
- polynucleotide of the present disclosure does not include a polynucleotide that only hybridizes to a poly A sequence (such as the 3' end poly (A) of mRNA) or a complementary stretch of poly T (or U) residues.
- the term “host cell” refers to, for example microorganisms, yeast cells, insect cells, and mammalian cells, that can be, or have been, used as recipients of vectors.
- the term includes the progeny of the original cell which has been transduced.
- a “host cell” as used herein generally refers to a cell which has been transduced with an exogenous DNA sequence. It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic or total DNA complement to the original parent, due to natural, accidental, or deliberate mutation.
- Spa2 protein is identified as the major pilin of the CLP fiber structure.
- structure-guided design the inventor developed a new type of engineerable extracellular protein scaffold that can be genetically appended with diverse functional peptides or proteins at multiple sites of Spa2 protein.
- the present disclosure provides a fusion polypeptide comprising a carrier protein and a polypeptide of interest, wherein the polypeptide of interest is fused to a terminus of the carrier protein or inserted into the carrier protein, and wherein the carrier protein is a pilin of covalently-linked pili (CLP) from a microorganism.
- CLP covalently-linked pili
- the microorganism is a gram-positive bacterium, such as a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
- the bacterium can include, but are not limited to, a bacterium selected from Corynebacterium glutamicum strain BE (GenBank assembly accession: GCA_013046805.1) , Corynebacterium glutamicum ATCC 14067 (GenBank assembly accession: GCA_002243555.1) , Corynebacterium glutamicum strain YI (GenBank assembly accession: GCA_001643035.1) , Corynebacterium glutamicum strain ATCC 13869 (GenBank assembly accession: GCA_001687645.1) , Corynebacterium glutamicum AJ1511 (GenBank assembly accession: GCA_002355675.1) , Corynebacterium glutamicum strain XV (GenBank assembly accession: GCA_001936195.1) , Corynebacterium glutamicum strain CP (GenBank assembly accession: GCA_001447865.2) , Corynebacterium glutamicum R (GenBank assembly accession: GCA
- cremoris NZ9000 GenBank assembly accession: GCA_000143205.1
- cremoris MG1363 GenBank assembly accession: GCA_000009425.1
- cremoris A76 (GenBank assembly accession: GCA_000236475.1) , Lactococcus lactis strain SRCM103457 (GenBank assembly accession: GCA_004194355.1) , Lactococcus lactis strain CBA3619 (GenBank assembly accession: GCA_007954765.1) , Lactococcus lactis strain WiKim0098 (GenBank assembly accession: GCA_016406265.1) , Lactococcus lactis strain K_LL005 (GenBank assembly accession: GCA_014334715.1) , Lactococcus lactis subsp.
- lactis strain G121 (GenBank assembly accession: GCA_013395015.1) , Lactococcus lactis strain N8 (GenBank assembly accession: GCA_014884605.1) , Lactococcus lactis subsp. lactis IO-1 (GenBank assembly accession: GCA_000344575.1) , Lactococcus lactis subsp. lactis strain F44 (GenBank assembly accession: GCA_002804185.1) , Lactococcus lactis subsp. lactis bv.
- Lactococcus lactis strain S50 (GenBank assembly accession: GCA_003627395.2) , Lactococcus lactis strain FDAARGOS_1064 (GenBank assembly accession: GCA_016127135.1) , Lactococcus lactis strain FDAARGOS_887 (GenBank assembly accession: GCA_016027975.1) , Lactococcus lactis subsp.
- lactis strain UC77 (GenBank assembly accession: GCA_002078615.2) , Lactococcus lactis strain FDAARGOS_866 (GenBank assembly accession: GCA_016028815.1) , Lactococcus lactis strain IL1403 (GenBank assembly accession: GCA_003722275.1) , Lactococcus lactis strain FDAARGOS_865 (GenBank assembly accession: GCA_016028835.1) , Lactococcus lactis subsp.
- cremoris IBB477 (GenBank assembly accession: GCA_001856165.1) , Lacticaseibacillus paracasei strain TD 062 (GenBank assembly accession: GCA_009834405.1) , Lacticaseibacillus paracasei strain HM1 (GenBank assembly accession: GCA_018064185.1) , Bacillus thuringiensis strain FDAARGOS_794 (GenBank assembly accession: GCA_013267795.1) , Bacillus thuringiensis strain XL6 (GenBank assembly accession: GCA_000774075.2) , Bacillus thuringiensis strain Bt-GS57 (GenBank assembly accession: GCA_017751245.1) , Bacillus thuringiensis strain HER1410 (GenBank assembly accession: GCA_013340745.1) , Bacillus thuringiensis serovar tolworthi (GenBank assembly accession: GCA_001548175.1) , Bac
- tolerans strain MGB0734 (GenBank assembly accession: GCA_015476135.1) , Lacticaseibacillus paracasei subsp. tolerans strain MGB0747 (GenBank assembly accession: GCA_015476175.1) , Lacticaseibacillus paracasei strain CBA3611 (GenBank assembly accession: GCA_007292115.1) , Lacticaseibacillus paracasei subsp. paracasei strain GR0548 (GenBank assembly accession: GCA_019175405.1) , Lacticaseibacillus paracasei subsp.
- tolerans strain MGB0625 (GenBank assembly accession: GCA_015476155.1) , Lacticaseibacillus paracasei strain 10266 (GenBank assembly accession: GCA_008329845.1) , Lacticaseibacillus paracasei subsp. tolerans strain S-NB (GenBank assembly accession: GCA_016757695.1) , Lacticaseibacillus paracasei strain Lp02 (GenBank assembly accession: GCA_013307125.1) , Lacticaseibacillus paracasei strain ZFM54 (GenBank assembly accession: GCA_003627255.1) , Lacticaseibacillus paracasei subsp.
- paracasei strain BD5115 GenBank assembly accession: GCA_018596415.1
- Paracasei JCM 8130 GenBank assembly accession: GCA_000829035.1
- Corynebacterium glutamicum ATCC 14067 preferably, Corynebacterium glutamicum ATCC 14067.
- the carrier protein is a major pilin.
- the fusion of insertion of the polypeptide of interest does not influence the formation of intermolecular isopeptide bond, disulfide bond, or intramolecular isopeptide bond in the carrier protein.
- the polypeptide of interest is fused to a terminus of the carrier protein. In some embodiments, the polypeptide of interest is fused to the N terminus of the carrier protein.
- the polypeptide of interest is inserted into the carrier protein. In some embodiments, the polypeptide of interest is inserted into a loop in the carrier protein.
- the carrier protein is a major pilin from Corynebacterium glutamicum (Spa2 protein) . It is observed that the Spa2 protein (SEQ ID NO: 1) comprises three tandem Ig-like domains, including N-domain (residues 36-197) , M-domain (residues 198-343) , and C-domain (residues 344-469) which is consistent with other major pilin. It is also observed that the deletion of M-domain does not influence the formation of CLP.
- the polypeptide of interest is inserted into the M domain of the major pilin. In some embodiments, the polypeptide of interest replaces the M domain of the major pilin or a part thereof.
- the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to SEQ ID NO: 1, 2, 3, or 4.
- the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to SEQ ID NO: 1, 2, 3, or 4 with the residues corresponding to residues C97, C128, K194, C380, C432, and LPLTG (474-478) , and optionally E158, D246, and/or E435 of SEQ ID NO: 1 unchanged.
- the carrier protein can be the mature form of SEQ ID NO: 1, 2, 3, or 4, i.e., with the deletion of the signal peptide.
- the carrier protein comprises amino acids 36 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4.
- the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4with the residues corresponding to residues C97, C128, E158, K194, D246, C380, C432, E435, and LPLTGT (474-478) , and optionally E158, D246, and/or E435 of SEQ ID NO: 1 unchanged.
- the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between positions corresponding to G215 and L216 of SEQ ID NO: 1, between positions corresponding to G236 and E237 of SEQ ID NO: 1, or between positions corresponding to G336 and T337 of SEQ ID NO: 1.
- the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1.
- the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between G215 and L216, between G236 and E237, or between G336 and T337 of SEQ ID NO: 1.
- the polypeptide of interest is directly linked to the N terminal of the carrier polypeptide. In some embodiments, the polypeptide of interest is linked to the N terminal of the carrier polypeptide via a peptide linker such as a flexible linker.
- a peptide linker can be generally short peptides with about 4-20 or more amino acids, such as combinations of Ser and Gly residues, which is a conventional flexible linker.
- the peptide linker used in the present disclosure is (G4S) 2 i.e., SEQ ID NO: 22.
- the peptide linker is a C10 linker of SEQ ID NO: 23.
- the polypeptide of interest can be selected according to the desired application of the fusion polypeptide.
- the fusion polypeptide is provided to bind, capture or enrich a target molecule
- the polypeptide of interest is a polypeptide that can recognize a target peptide, including but not limited to a ligand, a receptor, an antigen and an antibody such as scFV and nanobody.
- the fusion polypeptide is provided to capture a protein comprising a SpyTag (SEQ ID NO: 37)
- the polypeptide of interest comprises SpyCatcher (SEQ ID NO: 15) , vice versa.
- the fusion polypeptide is provided as an adhesive agent, and the polypeptide of interest is an adhesive peptide, e.g., Mfp35 (SEQ ID NO: 38) .
- the fusion polypeptide is provided to catalyze chemical or biochemical reactions, and the polypeptide of interest is an enzyme.
- the fusion polypeptide is provided to degrade carbohydrates such as cellulose, and the polypeptide of interest can be the endo-1, 4- ⁇ -glucanase, e.g., from Trichoderma reesei (TrEgl, SEQ ID NO: 19) and/or ⁇ -glucosidase, e.g., from Saccharophagus degradans (SdBgl, SEQ ID NO: 21) .
- the fusion polypeptide is provided to degrade refractory organics, such as plastics, and the polypeptide of interest is an enzyme responsible for the degradation, such as a PETase.
- the present disclosure provides a polynucleotide encoding the fusion polypeptide of the present disclosure.
- the polynucleotide of the present disclosure can be amplified with cDNA, mRNA or genomic DNA as the template and suitable oligonucleotide primers according to standard PCR amplification techniques.
- the nucleic acid amplified as above can be cloned into a suitable vector and characterized by DNA sequence analysis.
- the polynucleotide of the present disclosure can be prepared by standard synthesis techniques, for example, by using an automated DNA synthesizer.
- a nucleic acid molecule that is complementary to other nucleotide sequence is a molecule that is sufficiently complementary to the nucleotide sequence so that it can hybridize with the other nucleotide sequences to form a stable duplex.
- a polynucleotide construct and a vector comprising the polynucleotide of the present disclosure, such as an expression vector.
- the polynucleotide of the present disclosure is operably linked to a promoter.
- the promoter is a constitutive promoter, such as the native promoter driving Spa2 gene in Corynebacterium glutamicum.
- the promoter is an inducible promoter.
- the expression vector comprises a Lac operon.
- the polynucleotide encoding the polypeptide of the present disclosure can be subjected to various manipulations to allow the expression of the polypeptide. Before the insertion thereof into a vector, manipulation of the polynucleotide according to the expression vector is desirable or necessary. Techniques for modifying polynucleotide sequences with recombinant DNA methods are well known in the art.
- the vector of the present disclosure preferably contains one or more selectable markers, which allow simple selection of transformed, transfected, transduced, etc. cells.
- a selectable marker is a gene, of which the product provides biocide or virus resistance, heavy metal resistance, supplemental auxotrophs, etc.
- the bacterial selectable marker is the dal gene from Bacillus subtilis or Bacillus licheniformis, or a marker that confers antibiotic resistance such as ampicillin, kanamycin, chloramphenicol or tetracycline resistance.
- the vector of the present disclosure can be integrated into the genome of the host cell or autonomously replicate in the cell, which is independent of the genome.
- the elements required for the integration into the genome of the host cell or the autonomous replication are known in the art (see, for example, the aforementioned Sambrook et al., 1989) .
- the present disclosure provides a recombinant cell comprising a polynucleotide encoding a fusion polypeptide, wherein the fusion polypeptide comprises a carrier protein and a polypeptide of interest, wherein the polypeptide of interest is fused to a terminus of the carrier protein or inserted into the carrier protein, wherein the carrier protein is a pilin of CLP, and wherein the recombinant cell is capable of expressing the polynucleotide and displaying a modified CLP comprising the fusion polypeptide.
- the carrier protein in the fusion polypeptide is the native major pilin of the recombinant cell.
- the recombinant cell is a recombinant gram-positive bacterium, such as a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
- a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei preferably, Corynebacterium glutamicum.
- the bacterium can include, but are not limited to, a bacterium selected from Corynebacterium glutamicum strain BE (GenBank assembly accession: GCA_013046805.1) , Corynebacterium glutamicum ATCC 14067 (GenBank assembly accession: GCA_002243555.1) , Corynebacterium glutamicum strain YI (GenBank assembly accession: GCA_001643035.1) , Corynebacterium glutamicum strain ATCC 13869 (GenBank assembly accession: GCA_001687645.1) , Corynebacterium glutamicum AJ1511 (GenBank assembly accession: GCA_002355675.1) , Corynebacterium glutamicum strain XV (GenBank assembly accession: GCA_001936195.1) , Corynebacterium glutamicum strain CP (GenBank assembly accession: GCA_001447865.2) , Corynebacterium glutamicum R (GenBank assembly accession: GCA
- cremoris NZ9000 GenBank assembly accession: GCA_000143205.1
- cremoris MG1363 GenBank assembly accession: GCA_000009425.1
- cremoris A76 (GenBank assembly accession: GCA_000236475.1) , Lactococcus lactis strain SRCM103457 (GenBank assembly accession: GCA_004194355.1) , Lactococcus lactis strain CBA3619 (GenBank assembly accession: GCA_007954765.1) , Lactococcus lactis strain WiKim0098 (GenBank assembly accession: GCA_016406265.1) , Lactococcus lactis strain K_LL005 (GenBank assembly accession: GCA_014334715.1) , Lactococcus lactis subsp.
- lactis strain G121 (GenBank assembly accession: GCA_013395015.1) , Lactococcus lactis strain N8 (GenBank assembly accession: GCA_014884605.1) , Lactococcus lactis subsp. lactis IO-1 (GenBank assembly accession: GCA_000344575.1) , Lactococcus lactis subsp. lactis strain F44 (GenBank assembly accession: GCA_002804185.1) , Lactococcus lactis subsp. lactis bv.
- Lactococcus lactis strain S50 (GenBank assembly accession: GCA_003627395.2) , Lactococcus lactis strain FDAARGOS_1064 (GenBank assembly accession: GCA_016127135.1) , Lactococcus lactis strain FDAARGOS_887 (GenBank assembly accession: GCA_016027975.1) , Lactococcus lactis subsp.
- lactis strain UC77 (GenBank assembly accession: GCA_002078615.2) , Lactococcus lactis strain FDAARGOS_866 (GenBank assembly accession: GCA_016028815.1) , Lactococcus lactis strain IL1403 (GenBank assembly accession: GCA_003722275.1) , Lactococcus lactis strain FDAARGOS_865 (GenBank assembly accession: GCA_016028835.1) , Lactococcus lactis subsp.
- cremoris IBB477 (GenBank assembly accession: GCA_001856165.1) , Lacticaseibacillus paracasei strain TD 062 (GenBank assembly accession: GCA_009834405.1) , Lacticaseibacillus paracasei strain HM1 (GenBank assembly accession: GCA_018064185.1) , Bacillus thuringiensis strain FDAARGOS_794 (GenBank assembly accession: GCA_013267795.1) , Bacillus thuringiensis strain XL6 (GenBank assembly accession: GCA_000774075.2) , Bacillus thuringiensis strain Bt-GS57 (GenBank assembly accession: GCA_017751245.1) , Bacillus thuringiensis strain HER1410 (GenBank assembly accession: GCA_013340745.1) , Bacillus thuringiensis serovar tolworthi (GenBank assembly accession: GCA_001548175.1) , Bac
- tolerans strain MGB0734 (GenBank assembly accession: GCA_015476135.1) , Lacticaseibacillus paracasei subsp. tolerans strain MGB0747 (GenBank assembly accession: GCA_015476175.1) , Lacticaseibacillus paracasei strain CBA3611 (GenBank assembly accession: GCA_007292115.1) , Lacticaseibacillus paracasei subsp. paracasei strain GR0548 (GenBank assembly accession: GCA_019175405.1) , Lacticaseibacillus paracasei subsp.
- tolerans strain MGB0625 (GenBank assembly accession: GCA_015476155.1) , Lacticaseibacillus paracasei strain 10266 (GenBank assembly accession: GCA_008329845.1) , Lacticaseibacillus paracasei subsp. tolerans strain S-NB (GenBank assembly accession: GCA_016757695.1) , Lacticaseibacillus paracasei strain Lp02 (GenBank assembly accession: GCA_013307125.1) , Lacticaseibacillus paracasei strain ZFM54 (GenBank assembly accession: GCA_003627255.1) , Lacticaseibacillus paracasei subsp.
- paracasei strain BD5115 GenBank assembly accession: GCA_018596415.1
- Paracasei JCM 8130 GenBank assembly accession: GCA_000829035.1
- Corynebacterium glutamicum ATCC 14067 preferably, Corynebacterium glutamicum ATCC 14067.
- the carrier protein is a major pilin. In some embodiments, the carrier protein is the native major pilin of the bacterium.
- the fusion of insertion of the polypeptide of interest does not influence the formation of intermolecular isopeptide bond, disulfide bond, or intramolecular isopeptide bond in the carrier protein.
- the polypeptide of interest is fused to a terminus of the carrier protein. In some embodiments, the polypeptide of interest is fused to the N terminus of the carrier protein.
- the polypeptide of interest is inserted into the carrier protein. In some embodiments, the polypeptide of interest is inserted into a loop in the carrier protein.
- the carrier protein is a major pilin from Corynebacterium glutamicum (Spa2 protein) . It is observed that the Spa2 protein (SEQ ID NO: 1) comprises three tandem Ig-like domains, including N-domain (residues 36-197) , M-domain (residues 198-343) , and C-domain (residues 344-469) which is consistent with other major pilin. It is also observed that the deletion of M-domain does not influence the formation of CLP.
- the polypeptide of interest is inserted into the M domain of the major pilin. In some embodiments, the polypeptide of interest replaces the M domain of the major pilin or a part thereof.
- the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to SEQ ID NO: 1, 2, 3, or 4.
- the carrier protein comprises an amino acid sequence of SEQ ID NO: 1, 2, 3, or 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to SEQ ID NO: 1, 2, 3, or 4 with the residues corresponding to residues C97, C128, K194, C380, C432, and LPLTG (474-478) , and optionally E158, D246, and/or E435 of SEQ ID NO: 1 unchanged.
- the carrier protein can be the mature form of SEQ ID NO: 1, 2, 3, or 4, i.e., with the deletion of the signal peptide.
- the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4.
- the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4, or an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%or 99.5%identical to amino acids 35 to 509 of SEQ ID NO: 1, amino acids 34 to 520 of SEQ ID NO: 2, amino acids 34 to 530 of SEQ ID NO: 3, or amino acids 34 to 519 of SEQ ID NO: 4with the residues corresponding to residues C97, C128, E158, K194, D246, C380, C432, E435, and LPLTGT (474-478) , and optionally E158, D246, and/or E435 of SEQ ID NO: 1 unchanged.
- the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between positions corresponding to G215 and L216 of SEQ ID NO: 1, between positions corresponding to G236 and E237 of SEQ ID NO: 1, or between positions corresponding to G336 and T337 of SEQ ID NO: 1.
- the carrier protein comprises amino acids 35 to 509 of SEQ ID NO: 1.
- the polypeptide of interest is fused to the N terminus of the carrier protein, or is inserted between G215 and L216, between G236 and E237, or between G336 and T337 of SEQ ID NO: 1.
- the polypeptide of interest is directly linked to the N terminal of the carrier polypeptide. In some embodiments, the polypeptide of interest is linked to the N terminal of the carrier polypeptide via a peptide linker such as a flexible linker.
- a peptide linker can be generally short peptides with about 4-20 or more amino acids, such as combinations of Ser and Gly residues, which is a conventional flexible linker.
- the peptide linker used in the present disclosure is (G4S) 2 i.e., SEQ ID NO: 22.
- the peptide linker is a C10 linker of SEQ ID NO: 23.
- the polypeptide of interest can be selected according to the desired application of the fusion polypeptide.
- the fusion polypeptide is provided to degrade carbohydrates such as cellulose, and the polypeptide of interest can be the endo-1, 4- ⁇ -glucanase from Trichoderma reesei (TrEgl, SEQ ID NO: 19) and/or ⁇ -glucosidase from Saccharophagus degradans (SdBgl, SEQ ID NO: 21) .
- the recombinant cell comprises two or more polynucleotide respectively encoding two or more fusion polypeptides each comprising a different polypeptide of interest, and the modified CLP comprises the two or more polypeptides.
- the recombinant cell is provided to bind, capture or enrich a target molecule
- the polypeptide of interest is a polypeptide that can recognize a target peptide, including but not limited to a ligand, a receptor, an antigen and an antibody such as scFV and nanobody.
- the recombinant cell is provided to capture a protein comprising a SpyTag (SEQ ID NO: 37)
- the polypeptide of interest comprises SpyCatcher (SEQ ID NO: 15) , vice versa.
- the recombinant cell is provided as an adhesive agent, and the polypeptide of interest is an adhesive peptide, e.g., Mfp35 (SEQ ID NO: 38) .
- the recombinant cell is provided to catalyze chemical or biochemical reactions, and the polypeptide of interest is an enzyme.
- the recombinant cell is provided to degrade carbohydrates such as cellulose, and the polypeptide of interest can be the endo-1, 4- ⁇ -glucanase, e.g., from Trichoderma reesei (TrEgl, SEQ ID NO: 19) and/or ⁇ -glucosidase, e.g., from Saccharophagus degradans (SdBgl, SEQ ID NO: 21) .
- the recombinant cell is provided to degrade refractory organics, such as plastics, and the polypeptide of interest is an enzyme responsible for the degradation, such as a PETase.
- the present disclosure provides a method of preparing the recombinant cell of present disclosure, comprising introducing a polynucleotide encoding the fusion polypeptide of the present disclosure into a host cell.
- the carrier protein in the fusion polypeptide is the native major pilin of the host cell.
- the host cell is a gram-positive bacterium. In some embodiments, the host cell is a bacterium selected from Corynebacterium glutamicum, Bifidobacterium breve, Lactococcus lactis, Lacticaseibacillus paracasei, Bacillus thuringiensis, and Lacticaseibacillus paracasei; preferably, Corynebacterium glutamicum.
- the bacterium can include, but are not limited to, a bacterium selected from Corynebacterium glutamicum strain BE (GenBank assembly accession: GCA_013046805.1) , Corynebacterium glutamicum ATCC 14067 (GenBank assembly accession: GCA_002243555.1) , Corynebacterium glutamicum strain YI (GenBank assembly accession: GCA_001643035.1) , Corynebacterium glutamicum strain ATCC 13869 (GenBank assembly accession: GCA_001687645.1) , Corynebacterium glutamicum AJ1511 (GenBank assembly accession: GCA_002355675.1) , Corynebacterium glutamicum strain XV (GenBank assembly accession: GCA_001936195.1) , Corynebacterium glutamicum strain CP (GenBank assembly accession: GCA_001447865.2) , Corynebacterium glutamicum R (GenBank assembly accession: GCA
- cremoris NZ9000 GenBank assembly accession: GCA_000143205.1
- cremoris MG1363 GenBank assembly accession: GCA_000009425.1
- cremoris A76 (GenBank assembly accession: GCA_000236475.1) , Lactococcus lactis strain SRCM103457 (GenBank assembly accession: GCA_004194355.1) , Lactococcus lactis strain CBA3619 (GenBank assembly accession: GCA_007954765.1) , Lactococcus lactis strain WiKim0098 (GenBank assembly accession: GCA_016406265.1) , Lactococcus lactis strain K_LL005 (GenBank assembly accession: GCA_014334715.1) , Lactococcus lactis subsp.
- lactis strain G121 (GenBank assembly accession: GCA_013395015.1) , Lactococcus lactis strain N8 (GenBank assembly accession: GCA_014884605.1) , Lactococcus lactis subsp. lactis IO-1 (GenBank assembly accession: GCA_000344575.1) , Lactococcus lactis subsp. lactis strain F44 (GenBank assembly accession: GCA_002804185.1) , Lactococcus lactis subsp. lactis bv.
- Lactococcus lactis strain S50 (GenBank assembly accession: GCA_003627395.2) , Lactococcus lactis strain FDAARGOS_1064 (GenBank assembly accession: GCA_016127135.1) , Lactococcus lactis strain FDAARGOS_887 (GenBank assembly accession: GCA_016027975.1) , Lactococcus lactis subsp.
- lactis strain UC77 (GenBank assembly accession: GCA_002078615.2) , Lactococcus lactis strain FDAARGOS_866 (GenBank assembly accession: GCA_016028815.1) , Lactococcus lactis strain IL1403 (GenBank assembly accession: GCA_003722275.1) , Lactococcus lactis strain FDAARGOS_865 (GenBank assembly accession: GCA_016028835.1) , Lactococcus lactis subsp.
- cremoris IBB477 (GenBank assembly accession: GCA_001856165.1) , Lacticaseibacillus paracasei strain TD 062 (GenBank assembly accession: GCA_009834405.1) , Lacticaseibacillus paracasei strain HM1 (GenBank assembly accession: GCA_018064185.1) , Bacillus thuringiensis strain FDAARGOS_794 (GenBank assembly accession: GCA_013267795.1) , Bacillus thuringiensis strain XL6 (GenBank assembly accession: GCA_000774075.2) , Bacillus thuringiensis strain Bt-GS57 (GenBank assembly accession: GCA_017751245.1) , Bacillus thuringiensis strain HER1410 (GenBank assembly accession: GCA_013340745.1) , Bacillus thuringiensis serovar tolworthi (GenBank assembly accession: GCA_001548175.1) , Bac
- tolerans strain MGB0734 (GenBank assembly accession: GCA_015476135.1) , Lacticaseibacillus paracasei subsp. tolerans strain MGB0747 (GenBank assembly accession: GCA_015476175.1) , Lacticaseibacillus paracasei strain CBA3611 (GenBank assembly accession: GCA_007292115.1) , Lacticaseibacillus paracasei subsp. paracasei strain GR0548 (GenBank assembly accession: GCA_019175405.1) , Lacticaseibacillus paracasei subsp.
- tolerans strain MGB0625 (GenBank assembly accession: GCA_015476155.1) , Lacticaseibacillus paracasei strain 10266 (GenBank assembly accession: GCA_008329845.1) , Lacticaseibacillus paracasei subsp. tolerans strain S-NB (GenBank assembly accession: GCA_016757695.1) , Lacticaseibacillus paracasei strain Lp02 (GenBank assembly accession: GCA_013307125.1) , Lacticaseibacillus paracasei strain ZFM54 (GenBank assembly accession: GCA_003627255.1) , Lacticaseibacillus paracasei subsp.
- paracasei strain BD5115 GenBank assembly accession: GCA_018596415.1
- Paracasei JCM 8130 GenBank assembly accession: GCA_000829035.1
- Corynebacterium glutamicum ATCC 14067 preferably, Corynebacterium glutamicum ATCC 14067.
- the host cell is modified to inactivate the native major pilin.
- the method comprises a step of knocking out the native major pilin.
- the endogenous polynucleotide encoding the major pilin can also be replaced by the polynucleotide encoding the fusion polypeptide via homologous recombination.
- the present disclosure provides a modified covalently-linked pili (CLP) comprising a plurality of the fusion polypeptides of the present disclosure.
- the modified CLP is cell-free.
- the present disclosure further provides a method of preparing a modified CLP comprising the steps of a) providing the fusion polypeptide of the present disclosure; and b) providing an activity of sortase.
- the modified CLP is cell-free.
- the fusion polypeptide is provided by transcribing and/or translalting the polynucleotide of the present disclosure.
- the activity of sortase is provided by transcribing and/or translalting one or more polynucleotides encoding a sortase.
- the sortase is encoded by a gene which is identified to be present in the same cluster with the gene encoding the carrier protein in nature.
- the method comprises contacting the fusion polypeptide of the present disclosure with the sortase protein.
- the sortase is class C type sortase, such as srtC1 and/or srtC2, preferably wherein the srtC1 and srtC2 are encoded by genes from the same cluster.
- the method is an in vitro method.
- the present disclosure provides a polynucleotide construct or a combination of polynucleotide constructs comprising the polynucleotide of the present disclosure, and one or more polynucleotides encoding a sortase.
- the sortase is encoded by a gene which is identified to be present in the same cluster with the gene encoding the carrier protein in nature.
- the sortase is class C type sortase, such as srtC1 and/or srtC2, preferably wherein the srtC1 and srtC2 are encoded by genes from the same cluster.
- the modified CLP and recombinant cell achieve the cascade reaction of enzymes, and improves the catalytic efficiency of a multi-enzyme system.
- the immobilization of enzymes onto CLP and recombinant cells can achieve a whole-cell catalyzation.
- the original DNA sequence was fully synthesized (Genewiz, Nanjing, China) or PCR-generated. All PCR products were generated by KOD DNA polymerase (TOYOBO, Japan) . All plasmid construction was performed using the T4 DNA ligase (New England BioLabs, Boston, MA) for ligations or the NEB Builder HiFi DNA Assembly Master Mix (New England BioLabs, Boston, MA) for assembly. All plasmids or markerless strains were confirmed by DNA sequencing (GENEWIZ, Guangzhou, China) . Primers used in the Examples are listed in Table 1.
- C. glutamicum ATCC140675 was provided by Dr. Zheng’s research group at the South China University of Technology.
- C. glutamicum ATCC14067 was grown in BHI liquid medium for recovery (37 g L -1 brain heart infusion (Becton, Dickinson and company) ) at 30 °C, 250 rpm, overnight.
- BHI liquid medium for recovery 37 g L -1 brain heart infusion (Becton, Dickinson and company)
- C. glutamicum ATCC14067 was inoculated into M63 liquid medium (15.6 g L -1 M63 Broth (Sangon Biotech, Guangzhou, China) , supplemented with 1 mM MgSO4, 0.2% (wt/vol) glucose) and cultivated in an incubator at 30 °C without shaking for 2-3 days.
- Antibiotics for C. glutamicum culture were kanamycin (25 ⁇ g mL -1 ) and hloramphenicol (7.5 ⁇ g mL -1 )
- Isopropyl- ⁇ -d-thiogalactoside (IPTG) at 1 mM/0.5mM or theophylline at 1mM was used to induce gene expression.
- Trans1-T1 TransGen Biotech, Shenzhen, China
- E. coli BL21 DE3 (New England BioLabs, Boston, MA) was used for protein expression.
- E. coli was cultured in Luria-Bertani medium (10 g L -1 peptone, 5 g L -1 yeast extract, 10 g L -1 NaCl) at 37 °C or 16 °C when applicable for protein expression.
- Antibiotics for E. coli culture were kanamycin (50 ⁇ g mL -1 ) and chloramphenicol (30 ⁇ g mL -1 ) .
- the markerless deletion strains of C. glutamicum ATCC 14067 were achieved by the RecET-Cre/loxP system. Detailed methods for markerless deletion are described in Huang, Y. et al. (Recombineering using RecET in Corynebacterium glutamicum ATCC14067 via a self-excisable cassette. Sci. Rep. 7, 1-8, 2017) .
- dsDNA fragments including the Cre-Kan cassette, the left and right homologous fragments, were used for subsequent fusion PCR to generate a ⁇ 4, 385 bp linear self-excisable dsDNA cassette with primer pairs clpL-S/clpR-A.
- primer pairs spa1L-S/A, spa1R-S/A, ck-S/A and spa1L-S/spa1R-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
- primer pairs spa2L-S/A, spa2R-S/A, ck-S/A and spa2L-S/spa2R-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
- primer pairs spa3L-S/A, spa3R-S/A, ck-S/A and spa3L-S/spa3R-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
- primer pairs srtC1L-S/A, srtC2R-S/A, ck-S/A and srtC1L-S/srtC2R-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
- primer pairs srtAL-S/A, srtAR-S/A, ck-S/A and srtAL-S/srtAR-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassettes, respectively.
- primer pairs decL-S/A, decR-S/A, ck-S/A and decL-S/decR-A were used to amplify the left and right homologous fragments, Cre-Kan cassette, and the linear self-excisable dsDNA cassette, respectively.
- the self-excisable dsDNA cassettes for markerless deletion of different genes were transformed into exonuclease-recombinase RecE/T expressed competent cells (C. glutamicum ATCC 1406) by electroporation, yielding multiple Kan-resistant colonies on BHI agar plates.
- the cell-plasmid DNA/dsDNA mixture was transferred to an ice-cold electroporation cuvette (0.1 cm electrode gap) .
- Electroporation was performed with a Bio-Rad Micropulser set by three times 1.8 KV/cm (Ec1) pulse (see Huang et al., Recombineering using RecET in Corynebacterium glutamicum ATCC14067 via a self-excisable cassette, Sci Rep 7, 7916 (2017) )
- Cre enzyme was used to induce expression by adding 1 mM theophylline and excising selectable marker by Cre/lox site specific recombination. Finally, sequencing of the PCR fragments from the genomic of mutants was performed for further identification.
- the resultant mutant strains used in this study were referred to as C. glutamicum ATCC 14067 ⁇ clp ( ⁇ clp) , C. glutamicum ATCC 14067 ⁇ spa1 ( ⁇ spa1) , C. glutamicum ATCC 14067 ⁇ spa2 ( ⁇ spa2) , C. glutamicum ATCC 14067 ⁇ spa3 ( ⁇ spa3) , and C.
- glutamicum ATCC 14067 ⁇ srtC1 ⁇ srtC2 ( ⁇ srtC1 ⁇ srtC2) .
- C. glutamicum ATCC 14067 ⁇ spa1 ⁇ spa3 ( ⁇ spa1 ⁇ spa3) mutant was constructed by transforming ⁇ spa3-cassette into ⁇ spa1 strain.
- C. glutamicum ATCC 14067 ⁇ spa2 ⁇ srtA ( ⁇ spa2 ⁇ srtA) and C. glutamicum ATCC 14067 ⁇ spa2 ⁇ dec ( ⁇ spa2 ⁇ dec) mutants were constructed by transforming ⁇ srtA-cassette and ⁇ dec-cassette into ⁇ spa2 strain, respectively, as described above.
- the pEC-XK99E plasmid was used as an original plasmid.
- DNA fragments of the pEC-XK99E backbone (GNENWIZ, China) the coding sequence of Spa2 or various recombinant Spa2 (SEQ ID NOs: 1, 5, 8-14, and 24, respectively) , and the native promoter (SEQ ID NO: 25) of spa2 gene via PCR, and then all the DNA fragments were assembled by NEB Builder HiFi DNA Assembly Master Mix to construct the plasmids pEK-spa2, pEK-spa2cut, pEK-E1/mCherry-spa2, pEK-E2/mCherry-spa2, pEK-E3/mCherry-spa2, pEK-E4/mCherry-spa2, pEK-6his-spa2, pEK-SpyTagSpa2, pEK-Mfp3Spep-Spa2,
- the two basic plasmids 203 and 204 were constructed based on pEC-XK99E backbone with additional restriction sites of SmaI, XbaI, NcoI, BamHI, SpeI and SalI by Gibson assembly with NEB Builder HiFi DNA Assembly Master Mix.
- SmaI, XbaI, and NcoI were used to fuse proteins with Spa2 pilin, and SpeI and SalI (Takara) were used to insert another independent expression cassette for fusion protein.
- CDSs coding sequences of SpyCatcher, Venus, CcEgl, N-Ven, and TrEgl
- the CDSs of N-Ven and TrEgl were inserted into the linearized backbone of 203 (digestion with SmaI and SpeI, Takara) via Gibson assembly.
- CDSs of C-Ven and SdBgl were cloned into the SmaI and XbaI sites in 204 by ligation.
- CDSs of C-Ven and SdBgl were inserted into the linearized backbone of 204 (digestion with SmaI and SalI, Takara) via Gibson assembly.
- the C-Ven-Spa2 cassette was obtained by digesting pEK-C-Ven-Spa2 with SpeI and SalI, and then, cloned into the plasmid of pEK-N-Ven-Spa2 (digested with SpeI and SalI, Takara) to construct tandem expression plasmids of pEK-N-Ven-Spa2_C-Ven-Spa2 (see Fig. 3) .
- Spa2 The coding sequence of Spa2 (SEQ ID NO: 6) was amplified from the genome of C. glutamicum ATCC 14067, and then assembled into the pET-28a (+) backbone (Novagen, Madison, WI) by Gibson assembly (see Fig. 5) .
- C. glutamicum cells cultured 2-3 days in M63 medium were collected and washed twice in PBS buffer, and 20 ⁇ L of liquid culture in M63 (OD600 ⁇ 1) were deposited onto carbon-coated TEM grids for 5-10 min.
- the samples were washed two times with 50 ⁇ L PBS buffer and three times with 20 ⁇ L water, and then, the excessive solution was quickly wicked away with filter paper.
- the cells were deposited onto the cropper wire mesh, and were negatively stained with 15 ⁇ L 2 w/v%uranyl acetate solutions for 1 min and dried for 10 min under an infrared lamp. Samples were examined in a JEOL JEM-1400 transmission electron microscope at an accelerating voltage of 120 kv.
- C. glutamicum strains were cultured for 48 h in M63 liquid medium, and the cultures were collected, washed and diluted to an OD600 of 0.1 in Tris-buffered saline with 0.1%ProclinTM 300 (Sigma, 48912-U) on ice.
- the recombinant Spa2 was expressed as an N-terminus His-tagged protein.
- E. coli BL21 (DE3) transformed with plasmid PET-28a-Spa2 (CaCl 2 process) were grown overnight at 37°C to provide a starter culture for expression.
- a total of 1 L medium with 50 ⁇ g mL -1 kanamycin was inoculated with 1% (v/v) of the starter culture and grown at 37°C.
- the cultivation temperature was lowered to 16°C and IPTG was added to a final concentration of 0.5 mM to induce protein overexpression.
- cells were collected by centrifugation, and the cell pellets were suspended in buffer A (50 mM Tris-HCl, 150 mM NaCl, pH 8.0) and lysed by high pressure homogenization. The cell lysates were centrifuged at 12, 000 rpm for 30 min at 4°C.
- buffer A 50 mM Tris-HCl, 150 mM NaCl, pH 8.0
- the resulting supernatant was loaded onto a Nickel-affinity column (5 mL, GE) pre-equilibrated with buffer A (50 mM Tris-HCl, 150 mM NaCl, pH 8.0) .
- His-tagged Spa2 protein was eluted with buffer A with 50 mM imidazole.
- the His-tagged Spa2 protein was buffer-exchanged into buffer A and subjected to tag removal by HRV3c (SEQ ID NO: 34, 1 mg/50 mg Spa2) at 4 °C overnight.
- the digested product was loaded onto the 5-mL Ni-NTA column (GE) and eluted with a buffer A/buffer B (buffer A + 500 mM imidazole) gradient (5%buffer B, 10%buffer B, 20%buffer B and 100%buffer B) .
- the flow-through at 10% buffer B was collected.
- the final purified protein was concentrated to 20 mg mL-1 in 10 mM Tris-HCl pH 8.0 and 50 mM NaCl for crystallization.
- the sitting drop vapor diffusion technique http: //soft-matter. seas. harvard. edu/index. php/Vapor_Diffusion_Method) was used to crystallize the Spa2 protein. Crystals were obtained by mixing 4 ⁇ L of Spa2 protein with 4 ⁇ L reservoir solution (0.2 M sodium sulfate, 0.1 M Bis-Tris propane pH 7.5, 20 %w/v PEG 3350) and incubating the mixture at 18 °C for 1-2 weeks.
- the crystals were soaked in a cryo-protectant solution consisting of the reservoir solution and 20% (v/v) glycerol and then quickly frozen with liquid nitrogen. Diffraction data were collected on the BL18U1 beamline at the Shanghai Synchrotron Radiation Facility (Shanghai, China) with flash frozen crystals (at 100 K in a stream of nitrogen gas) . The data were processed by XDS9 and then further processed using STARANISO10 (aserver of Global Phasing Company) .
- the structure was solved by the molecular replacement method using PHASER11 and the predicted Spa2 coordinates by Alphafold Colab12 as template. Further manual model building was carried out using COOT13. The model was refined by PHENLX14. Data collection, phasing and refinement statistics are given in Table 3. Structure figures were prepared using PyMOL2.3.4 (https: //pymol. org/2/) .
- C. glutamicum colonies were inoculated into 10 mL BHI and cultured for 12 h. Then cells were transferred into M63 medium with an initial OD600 of 0.1 for 3 days at 30°C without shaking. Cells were collected by centrifugation at 5, 000 rpm, washed three times with PBS and diluted with PBS (OD600 ⁇ 0.5) . Exactly 200 ⁇ L of the samples were transferred to a flat-bottom 96-well black plate and analyzed on a Tecan Infinite Pro 200 Plate Reader, with excitation/emission wavelengths of 580/610 nm for mCherry fluorescence intensity, and 510/545 nm for Venus fluorescence intensity. The fluorescence intensity divided by the absorbance of OD is the normalized fluorescence intensity.
- Fluorescence (confocal) microscopy imaging Cells prepared for plate-reader measurements were dripped on a glass slide and imaged under a Nikon TI2-E inverted microscope. Microscope light source power, detector gain, and image processing settings were consistent among different samples.
- Stains expressing SpyTag-Spa2, SpyCatcher-Spa2 and Spa2 (strain ⁇ spa2 transformed with pEK-SpyTagSpa2, pEK-SpyCatcherSpa2, and pEK-spa2, respectively) were cultured in glass-bottom dishes in M63 for 3 days. The dishes were then gently washed three times with PBS containing 0.5%Tween80 (PBST) and blocked in PBST with 1%BSA for 1 h.
- PBST PBS containing 0.5%Tween80
- the group of SpyTag-Spa2 and Spa2 were incubated with purified GFP-SpyCatcher (SEQ ID NO: 35) , and the group of SpyCatcher-Spa2 and Spa2 were incubated with purified GFP-SpyTag (SEQ ID NO: 36) for 1 h at room temperature. All samples were washed three times with PBS buffer and imaged under a Nikon TI2-E inverted microscope.
- Spa2 strain or the Mfp3Spep-Spa2 strain was cultured in the M63 medium (3 mL) supplemented with 200 ⁇ L of green-fluorescent PS microsphere solution in 35-mm Petri dishes containing 2-3 glass slides for 3 days at 30°C without shaking. The settled glass slides were then taken out and gently flushed to wash away the microspheres that had not adhered. The binding capacity of different samples was compared with water jetting at a constant discharge pressure of 5 psi for 15 s, performed on a pressure-flow controller (PG-MFC-8CH, PreciGenome) . Fluorescence images were recorded before and after the mechanical challenge with water jetting.
- PG-MFC-8CH pressure-flow controller
- the pEK-spa2cut plasmid was transferred into ⁇ spa2 by electroporation as described above to construct the strain ⁇ spa2-pEK-spa2cut, which was used to express the monomer of Spa2cut (SEQ ID NO: 5) .
- Cells were inoculated into M63 medium with 25 ⁇ g mL-1 kanamycin and cultured for 3 days.
- Supernatants 200 mL were collected and concentrated into 1 mL and then purified by nickel-affinity chromatography as previously described in the section of “Expression and purification of recombinant Spa2” .
- Spa2cut was eluted with 100 mM imidazole.
- the final purified protein was buffer-exchanged into 10 mM Tris-HCl, 100 mM NaCl, pH 8.0.
- a similar process was followed for expression and purification of Spa2cut mutant variants of E158Acut, D246Acut, E435Acut, and D246A/E435Acut.
- ⁇ spa2 ⁇ srtA-pEK-6his-spa2 strain enables secretion of the expressed 6His- Cg CLP into the culture medium due to lacking sortase A.
- 6His- Cg CLP polymers ⁇ spa2 ⁇ srtA-pEK-6his-spa2 cells were inoculated into M63 medium with 25 ⁇ g mL -1 kanamycin and cultured for 3 days.
- 6His- Cg CLP purification 500 mL supernatants were collected and concentrated to 5mL in buffer of 10 mM Tris-HCl, 100 mM NaCl, pH 8.0 and were purified by nickel affinity chromatography.
- the 6His- Cg CLP polymers were eluted with 100 mM imidazole. Purified 6His- Cg CLP fibers were then boiled in SDS sample buffer (6 ⁇ Protein Loading Buffer, TransGen Biotech, DL101-02) and subjected to an SDS-PAGE gel. The high-molecular-weight Cg CLP polymer bands were excised from Coomassie brilliant blue stained SDS-PAGE gels and prepared for intermolecular isopeptide bond identification.
- the Spa2cut solution was precipitated with acetone (1: 4) and the pellets were dried using a Speedvac (room temperature) for 1-2 min. The pellets were then dissolved in 100 mM Tris-HCl (pH 8.5) supplemented with 8 M urea. 5mM TCEP (Thermo Scientific) for reduction and 10 mM iodoacetamide (Sigma) for alkylation were added and incubated at room temperature for 30 min. The protein mixture was diluted (1: 4) and digested overnight with chymotrypsin at 1: 40 (w/w) . The protease-digested peptide solution was desalted using a MonoSpinTM C18 column (GL Science, Tokyo, Japan) and dried with a SpeedVac.
- the Spa2cut sample was processed following the same protocol as previously described for signal peptide identification.
- the Spa2cut sample was processed following a similar protocol except that pepsin (Promega) was purposely added for digestion, while addition of 5mM TCEP (Thermo Scientific) was avoided to ensure that the disulfide bond, if any, was kept intact.
- the Coomassie brilliant blue stained SDS-PAGE gel band of Cg CLP fibers was excised into small pieces and washed in water, followed by 50 mM NH 4 HCO 3 in 50%acetonitrile and 100%acetonitrile.
- the sample was reduced with 10 mM TCEP (Thermo Scientific) in 100 mM NH 4 HCO 3 at 55 °C for 1 h and alkylated with 55 mM iodoacetamide (Sigma) in 100 mM NH 4 HCO 3 at 37 °C in the dark for 30 min.
- the gel pieces were then washed with 100 mM NH 4 HCO 3 and 100%acetonitrile, and dried.
- the sample was primarily digested with 3 ⁇ g trypsin (Promega) in 50 mM NH 4 HCO 3 at 37 °C overnight, then 1 ⁇ g of Asp-N endoproteinase (Promega) was added for another overnight incubation. Digested peptides were extracted twice with 50%acetonitrile containing 5%formic acid.
- protease-digested peptides were analyzed by LCMS/MS using an Easy-nLC 1200 nano HPLC (Thermo Scientific) hybrid of a Q Exactive Orbitrap mass spectrometer (Thermo Scientific) system. Peptides were separated on a 30 cm-long pulled-tip analytical column (75 ⁇ m ID packed with ReproSil-Pur C18-AQ 1.9 ⁇ m resin, Dr. Maisch GmbH) in 0.1%aqueous formic acid (buffer A) and 0.1%formic acid in 80%acetonitrile (buffer B) at 55 °C with a flow rate of 300 nl/min using a 120 min linear gradient.
- Buffer A 0.1%aqueous formic acid
- buffer B 0.1%formic acid in 80%acetonitrile
- CMC-Na carboxymethylcellulose sodium salt
- DMS 3,5dinitrosaloculoc acid
- TrEgl-Spa2_SdBgl-Spa2 C003 strain
- TrEgl_SdBgl C004 strain
- the lycopene producing plasmid of pZ9-dxs_crtEBI was transferred into strain TrEgl_SdBgl to construct the recombinant strains of C003 and C004 for the utilization of cellulose to produce lycopene.
- C003 and C004 strains were inoculated into 10 mL BHI with 25 ⁇ g mL -1 kanamycin and 7.5 ⁇ g mL -1 chloramphenicol, and cultured for 12 h at 30 °C at a stirring speed at 200 rpm.
- modified M63 medium (15.6 g L -1 M63 broth, supplemented with 1 mM MgSO 4 , 2% (wt/vol) CMC-Na) with initial OD600 of 3 for 2 days at 30°C and 1 mM IPTG was added or not.
- lycopene production was carried out according to Li, C. et al. (Heterologous production of ⁇ -Carotene in Corynebacterium glutamicum using a multi-copy chromosomal integration method. Bioresour. Technol. 341, 125782, 2021) .
- IPTG induced and un-induced cells (1 mL) were separately collected into 2 mL tubes of lysing matrix Y (M. P. Biomedicals) by centrifugation at 12, 000 rpm for 5 min.
- the pellets were resuspended in a 60%hexane and 40%acetone mixture and lysed using the FastPrepR-24 5G bead beating grinder and lysis system (M. P. Biomedicals) for lycopene extraction.
- the lysis condition is 30 s once with a 1 min interval, for 6 times.
- the samples were centrifuged at 14, 000 rpm for 10 min at 4 °C, and the resulting supernatant was then transferred to brown 2 mL screw cap glass vials (Agilent Technologies) and directly subjected to HPLC analysis.
- the quantification of lycopene was performed on an Agilent 1260 series HPLC system (Agilent Technologies) using YMC Carotenoid (250 ⁇ 4.6 mml. D., YMC) and detected via a diode array detector (DAD) at 450 nm.
- binary gradient elution was applied to change the eluent from 100%eluent A of methanol/Methyl tert-butyl ether/water (81/15/4) to 100%eluent B of methanol/Methyl tert-butyl ether/water (7/90/3) over 90 min at a flow rate of 1.0 mL ⁇ min-1 at 20 °C with an injection volume of 10 ⁇ L (eluent A for 2min, eluent B 2min-95min, and eluent A 95min-100min.
- This Example was carried out to investigate the CLP assembly in the industrial workhorse C. glutamicum ATCC 14067 (referred to as Cg CLP) .
- the industrial workhorse C. glutamicum is a ‘generally recognized as safe’ (GRAS) strain with well-established gene editing tools that is widely used for the industrial-scale production of valued products such as amino acids, diamines, terpenoids, and other chemicals (Zhao, N. et al. Development of a Transcription Factor-Based Diamine Biosensor in Corynebacterium glutamicum. ACS Synth. Biol. 10, 3074-3083, 2021; and Xu, X. et al., Ledesma-Amaro, R. &Liu, L. Microbial chassis development for natural product biosynthesis. Trends Biotechnol. 38, 779-796, 2020) .
- GRAS generally recognized as safe
- CLP BGC contains three pilin-encoding genes, spa1, spa2, and spa3, as well as two sortase coding genes of srtC1, and srtC2 (Fig. 6) , which is similar to the SpaH-type (arelatively less well-studied pili type) CLP gene cluster in the pathogenic C. diphtheriae (Mandlik, A. et al., Pili in Gram-positive bacteria: assembly, involvement in colonization and biofilm development. Trends Microbiol. 16, 33-40, 2008) .
- the composition of Cg CLP was determined with polyclonal antibodies against Spa1, Spa2, and Spa3, respectively.
- TEM images of the Cg CLP with immunogold labelling showed that the Cg CLP fibers comprise two minor pilins of Spa1 and Spa3 and a major pilin of Spa2 (Fig. 8) .
- TEM and AFM imaging used to assess the specific roles of the three pilins in the Cg CLP assembly showed that the cells, which were defective for Spa1 ( ⁇ spa1 strain) , Spa3 ( ⁇ spa3 strain) , or both ( ⁇ spa1 ⁇ spa3 strain) , could still produce fibers (Fig. 7) .
- cells lacking Spa2 ( ⁇ spa2) could not produce any fiber, and overexpression of Spa2 (Spa2) promoted the formation of abundant long fibers throughout the cell surface (Fig. 7) .
- TEM and AFM images also showed that cells lacking both SrtC1 and SrtC2 ( ⁇ srtC1 ⁇ srtC2) completely blocked fiber formation (Fig. 9) .
- the purified Cg CLP polymers were excised from Coomassie blue-stained SDS-PAGE gels (Fig. 10) and then digested in-gel with trypsin (Promega) and AspN endoproteinase (Promega) .
- Liquid chromatography-tandem mass spectrometry was used to analyze the digestion products, and verify the presence of the intermolecular isopeptide bond (bond formation results in the elimination of a water molecule and thus a slight decrease of molecular weight) .
- the peptide peak with m/z 832.9 2+ (Fig. 11 and Table 2) suggested that the major pilin of Spa2 was cross-linked between K194 in the N-terminus of Spa2 i and T477 in the C-terminus of Spa2 i+1 (Lys194-Thr477) .
- This detected mass is consistent with the loss of three NH 3 units and two H 2 units, indicating the formation of three intramolecular isopeptide bonds (loss of one molecule of ammonia, ⁇ 17 Da) and two disulfide bonds (loss of two hydrogen atoms, ⁇ 2 Da) in Spa2.
- a Values in parentheses correspond to the outermost shell of data.
- d R free ⁇
- Spa2 is arranged in three tandem Ig-like domains, including N-domain (residues 36-197, pink) , M-domain (residues 198-343, blue) , and C-domain (residues 344-469, green) , giving an elongated molecule in length (Fig. 15) .
- These three tandem Ig-like domains of Spa2 are similar to the major pilin of SpaA (PDB ID: 3HR6, root-mean-square deviation (RMSD) over 270 alpha-carbon (C ⁇ ) atoms, Fig. 16b) and SpaD (PDB ID: 4HSS, RMSD over 311 C ⁇ atom, Fig. 16c) from human pathogen C.
- glutamicum is similar to the feature of the major pilin SpaD from the pathogenic C. diphtheriae (Kang, H. J. et al., 2014 above) , but is quite different from the major pilin SpaA from the pathogenic C. diphtheriae lacking isopeptide bonds in the N-terminal domain (Kang, H.J. et al., 2009 above) .
- two disulfide bonds were formed in the N-domain between Cys97 and Cys128 and the C-domain between Cys380 and Cys432, respectively (Fig. 17b) .
- Spa2 the presence of two disulfide bonds in Spa2 is very unique in comparison with other major pilins in human pathogens, such as Spy0128 (PDB ID: 3B2M) from Streptococcus pyogenes 37 and BcpA (PDB ID: 3KPT) from Bacillus cereus 38 lacking disulfide bond, and the SpaA and SpaD from C. diphtheriae containing only one disulfide bond in the C-terminal domain (Kang, H. J. et al., 2009 and 2014 above) .
- PDB ID: 3B2M Speptococcus pyogenes 37
- BcpA PBD ID: 3KPT
- SpaA and SpaD from C. diphtheriae containing only one disulfide bond in the C-terminal domain
- the CLP structure may serve as an attractive building block for various applications because these extracellular fibers have extraordinarily high tensile strength owing to their extensive inter-and intra-molecular isopeptide bonds. Moreover, as an extracellular matrix, CLP fibers can be conveniently and reliably positioned directly outside cells. Finally, their proteinaceous nature makes them potentially amenable for elaboration using genetic engineering.
- This Example was carried out to determine suitable fusion sites to append peptides/proteins to Spa2. According to both the Spa2 crystal structure and the characterization of specific functional domains within Spa2 observed in Example 2, four different positions to test the fusion of a protein-of-interest (POI) , with one site in the N-terminus of Spa2 and three sites in the M-domain lacking a disulfide bond (Fig. 22) .
- POI protein-of-interest
- the CLP-defective strain C. glutamicum ATCC 14067 ⁇ spa2 ( ⁇ spa2) with abrogated extracellular Cg CLP formation was transformed with the exogenous expression plasmid (pEK-E1/mCherry-spa2, pEK-E2/mCherry-spa2, pEK-E3/mCherry-spa2, or pEK-E4/mCherry-spa2) for Spa2 fusion protein expression to test the restored Cg CLP fiber production.
- the fluorescent reporter protein mCherry was fused at the interrogated positions for generating functional fusion proteins (SEQ ID NOs: 8-11) while retaining the sortase-catalyzed covalently-linked pili formation capacity of Spa2.
- SEQ ID NOs: 8-11 functional fusion proteins
- four sites were tested for mCherry addition/insertion, including Q35 (E1) at the N-terminus of Spa2, G215 in loop 1 of the M-domain (E2) , G236 in the loop 2 of the M-domain (E3) , and G336 in the ⁇ 23-sheet of the M-domain (E4) .
- Quantitative analysis showed that the cells expressing each of the fusion proteins fluoresced and enabled the formation of fiber (Fig. 23a) .
- Spa2 fusion proteins (six POIs, each fused at the E1 position via a linker of SEQ ID NO: 23) (see Fig. 25) were expressed by ⁇ spa2 strains transformed with plasmids pEK-6his-spa2, pEK-SpyTagSpa2, pEK-Mfp3Spep-Spa2, pEK-SpyCatcher-Spa2, pEK-Venus-Spa2, and pEK-CcEgl-Spa2, respectively. All of these fusion proteins were successfully expressed, secreted, and formed Cg CLP (Fig. 26) .
- TEM images showed that Ni-NTA-decorated AuNPs were anchored onto 6His-Spa2 Cg CLP (Fig. 27a) .
- Confocal microscopic images showed the green fluorescence emitted from SpyTag-Spa2 Cg CLP cells to which SpyCatcher-EGFP protein binding partners were covalently attached via Spytag-SpyCatcher interaction pairs (Fig. 27b) .
- Confocal microscopic images show the green fluorescence emitted from SpyCatcher-Spa2 Cg CLP cells to which SpyTag-EGFP protein binding partners were covalently attached via Spytag-SpyCatcher interaction pairs (Fig. 27c) .
- ⁇ spa2 strain was transformed with plasmids pEK-N-Ven-Spa2, pEK-C-Ven-Spa2 and pEK-N-Ven-Spa2_C-Ven-Spa2, respectively, ⁇ spa2 strain transformed with pEK-N-Ven_C-Ven was used as a control.
- This Example was carried out to verify the co-assembly of multiple cellulases into a catalytic cascade for extracellular degradation of cellulose into glucose to support production of specific chemicals of interest (e.g., lycopene) in C. glutamicum ATCC 14067 ⁇ spa2 (Fig. 31) .
- specific chemicals of interest e.g., lycopene
- endo-1, 4- ⁇ -glucanase from Trichoderma reesei (TrEgl, SEQ ID NO: 19) and ⁇ -glucosidase from Saccharophagus degradans (SdBgl, SEQ ID NO: 21) were co-assembled in the Cg CLP fiber; these two enzymes are known to work in concert to degrade cellulose into glucose via enzyme cascade reactions.
- Lycopene can be produced via the methylerythritol phosphate (MEP) pathway by engineered C. glutamicum (Li, C. et al. Heterologous production of ⁇ -Carotene in Corynebacterium glutamicum using a multi-copy chromosomal integration method. Bioresour. Technol. 341, 125782, 2021) .
- a C001 chassis ⁇ spa2 ⁇ dec
- spa2 spa2 ⁇ dec
- CEY17_RS03380 for the abrogation Cg CLP formation
- CEY17_RS03560 ⁇ dec, for accumulation of the precursor for lycopene production
- the basal lycopene-producing strain C002 was constructed by transforming strain C001 with plasmid pZ9-dxs_crtEBI for IPTG-inducible expression of the dxs gene and crtEBI gene cluster. Then, the C002 strain was transformed with plasmids pEC-TrEgl-Spa2_SdBgl-Spa2, and pEC-TrEgl_SdBgl, respectively, resulting in the strains C003 and C004.
- the C003 strain co-assembled TrEgl and SdBgl in Cg CLP fiber on the cell surface (Fig. 32a) and enabled the degradation of carboxymethylcellulose sodium (CMC-Na, the ether derivate of cellulose) in medium, based on the medium turning from a viscous gel to a thin solution (Fig. 32b) .
- Strain C004, which only simultaneously secreted both TrEgl and SdBgl without anchoring to the Cg CLP scaffold did not show similar behavior.
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Biochemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Microbiology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
L'invention concerne un polypeptide de fusion comprenant une protéine porteuse et un polypeptide d'intérêt, le polypeptide d'intérêt étant fusionné à une extrémité de la protéine porteuse ou inséré dans la protéine porteuse, et la protéine porteuse étant une piline de pili liés de manière covalente (CLP) à partir d'un micro-organisme. L'invention concerne également une cellule recombinante comprenant un CLP modifié comprenant le polypeptide de fusion, ainsi que le CLP modifié.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2022/130033 WO2024092769A1 (fr) | 2022-11-04 | 2022-11-04 | Pili à liaison covalente modifiée et bactéries recombinantes les comprenant |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2022/130033 WO2024092769A1 (fr) | 2022-11-04 | 2022-11-04 | Pili à liaison covalente modifiée et bactéries recombinantes les comprenant |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024092769A1 true WO2024092769A1 (fr) | 2024-05-10 |
Family
ID=90929428
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2022/130033 WO2024092769A1 (fr) | 2022-11-04 | 2022-11-04 | Pili à liaison covalente modifiée et bactéries recombinantes les comprenant |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024092769A1 (fr) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009137763A2 (fr) * | 2008-05-08 | 2009-11-12 | Emory University | Procédés et compositions pour l’affichage de polypeptides sur les fimbriae de bactéries gram-positives |
WO2017003305A1 (fr) * | 2015-07-01 | 2017-01-05 | Auckland Uniservices Limited | Peptides et leurs utilisations |
WO2019213262A1 (fr) * | 2018-05-01 | 2019-11-07 | The Regents Of The University Of California | Réactif pour le marquage de protéines par liaison isopeptidique à la lysine |
-
2022
- 2022-11-04 WO PCT/CN2022/130033 patent/WO2024092769A1/fr unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160304567A1 (en) * | 2007-12-19 | 2016-10-20 | Emory University | Methods and compositions for the display of polypeptides on the pili of gram- positive bacteria |
WO2009137763A2 (fr) * | 2008-05-08 | 2009-11-12 | Emory University | Procédés et compositions pour l’affichage de polypeptides sur les fimbriae de bactéries gram-positives |
US20110189236A1 (en) * | 2008-05-08 | 2011-08-04 | Emory University | Methods and Compositions for the Display of Polypeptides on the Pili of Gram-Positive Bacteria |
WO2017003305A1 (fr) * | 2015-07-01 | 2017-01-05 | Auckland Uniservices Limited | Peptides et leurs utilisations |
WO2019213262A1 (fr) * | 2018-05-01 | 2019-11-07 | The Regents Of The University Of California | Réactif pour le marquage de protéines par liaison isopeptidique à la lysine |
Non-Patent Citations (1)
Title |
---|
HUNG TON‐THAT: "Sortases and pilin elements involved in pilus assembly of Corynebacterium diphtheriae", MOLECULAR MICROBIOLOGY, WILEY-BLACKWELL PUBLISHING LTD, GB, vol. 53, no. 1, 1 July 2004 (2004-07-01), GB , pages 251 - 261, XP093168778, ISSN: 0950-382X, DOI: 10.1111/j.1365-2958.2004.04117.x * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zou et al. | Construction of a cellulase hyper-expression system in Trichoderma reesei by promoter and enzyme engineering | |
US20220002773A1 (en) | Production of 3-fucosyllactose and lactose converting alpha-1,3-fucosyltransferase enzymes | |
Biedendieck et al. | Plasmid system for the intracellular production and purification of affinity‐tagged proteins in Bacillus megaterium | |
EA017803B1 (ru) | Система экспрессии | |
CN108103039B (zh) | 一组岩藻糖基转移酶突变体及其筛选方法和应用 | |
KR20220116243A (ko) | 락토오스 전환 알파-1,2-푸코실트랜스퍼라제 효소 | |
US10683509B2 (en) | Surface display of functional proteins in a broad range of gram negative bacteria | |
WO2012118900A2 (fr) | Présentation d'enzymes cellulolytiques et de complexes enzymatiques à la surface de microorganismes à gram positif | |
KR101481142B1 (ko) | 코리네박테리아 발현용 합성프로모터 | |
WO2014170460A2 (fr) | Procede de production de proteines de collagene issues d'eponges marines et organisme apte a produire lesdites proteines | |
KR102350425B1 (ko) | 프테로스틸벤의 생합성 제조를 위한 o-메틸트랜스퍼라제의 사용 방법 | |
CN114196646B (zh) | 一种橄榄醇合成酶变体a及其用途 | |
EP3330282A1 (fr) | Cipa et cipb pixa comme échafaudages pour organiser des protéines dans des inclusions cristallines | |
US20140011235A1 (en) | Release factor 1 (rf1) in escherichia coli | |
WO2024092769A1 (fr) | Pili à liaison covalente modifiée et bactéries recombinantes les comprenant | |
WO2023197692A1 (fr) | Souche modifiée de levure ayant une voie tca réductrice positionnée sur les mitochondries et produisant efficacement de l'acide succinique, son procédé de construction et son utilisation | |
CN112342178A (zh) | 重组微生物、其制备方法及在生产塔格糖中的应用 | |
CN114032222B (zh) | 糖链延伸糖基转移酶突变体及其编码基因以及基因工程菌和它们的应用 | |
CN111363709B (zh) | 一种提高异戊二烯产量的基因工程菌及其构建方法与应用 | |
US20110262971A1 (en) | Genetically Modified E. coli Strains for Producing Erythromycin | |
Zhang et al. | Characterization of the complex involved in regulating V-ATPase activity of the vacuolar and endosomal membrane | |
KR102194697B1 (ko) | 3-히드록시프로피온산 반응 전사인자를 이용한 3-하이드록시프로피온산 선택성 유전자회로 및 이를 이용한 3-히드록시프로피온산 생산 균주의 스크리닝 방법 | |
US8636999B2 (en) | Stable plasmid expression vector for bacteria | |
WO2012067220A1 (fr) | Procédé d'expression d'une protéine utile à des taux élevés | |
WO2021188816A1 (fr) | Procédés et systèmes biologiques de découverte et d'optimisation de peptides lasso |