WO2024056643A1 - Fungal signal peptides - Google Patents
Fungal signal peptides Download PDFInfo
- Publication number
- WO2024056643A1 WO2024056643A1 PCT/EP2023/074982 EP2023074982W WO2024056643A1 WO 2024056643 A1 WO2024056643 A1 WO 2024056643A1 EP 2023074982 W EP2023074982 W EP 2023074982W WO 2024056643 A1 WO2024056643 A1 WO 2024056643A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- polypeptide
- signal peptide
- nucleic acid
- polynucleotide
- Prior art date
Links
- 108010076504 Protein Sorting Signals Proteins 0.000 title claims abstract description 187
- 230000002538 fungal effect Effects 0.000 title claims abstract description 46
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 341
- 229920001184 polypeptide Polymers 0.000 claims abstract description 339
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 339
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 145
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 145
- 239000002157 polynucleotide Substances 0.000 claims abstract description 145
- 108090000942 Lactalbumin Proteins 0.000 claims abstract description 125
- 235000021241 α-lactalbumin Nutrition 0.000 claims abstract description 124
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 110
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 103
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 103
- 238000000034 method Methods 0.000 claims abstract description 46
- 239000013604 expression vector Substances 0.000 claims abstract description 23
- 102000004407 Lactalbumin Human genes 0.000 claims abstract 12
- 210000004027 cell Anatomy 0.000 claims description 167
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 23
- 230000004927 fusion Effects 0.000 claims description 23
- 238000004519 manufacturing process Methods 0.000 claims description 17
- 210000004899 c-terminal region Anatomy 0.000 claims description 12
- 102000007079 Peptide Fragments Human genes 0.000 claims description 4
- 235000004279 alanine Nutrition 0.000 claims description 4
- 108010033276 Peptide Fragments Chemical group 0.000 claims description 3
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 claims description 3
- 210000000170 cell membrane Anatomy 0.000 claims description 3
- 102000005744 Glycoside Hydrolases Human genes 0.000 abstract description 14
- 108010031186 Glycoside Hydrolases Proteins 0.000 abstract description 14
- 108020001507 fusion proteins Proteins 0.000 abstract description 3
- 102000037865 fusion proteins Human genes 0.000 abstract description 3
- 102100034668 Alpha-lactalbumin Human genes 0.000 description 114
- 108010011619 6-Phytase Proteins 0.000 description 106
- 229940085127 phytase Drugs 0.000 description 105
- 108091026890 Coding region Proteins 0.000 description 83
- 230000000694 effects Effects 0.000 description 77
- 230000014509 gene expression Effects 0.000 description 62
- 108090000623 proteins and genes Proteins 0.000 description 49
- 239000012634 fragment Substances 0.000 description 46
- 235000001014 amino acid Nutrition 0.000 description 43
- 150000001413 amino acids Chemical class 0.000 description 42
- 238000000855 fermentation Methods 0.000 description 36
- 230000004151 fermentation Effects 0.000 description 36
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 30
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 30
- 239000013598 vector Substances 0.000 description 30
- 239000000203 mixture Substances 0.000 description 27
- 241000459887 Aspergillus luchuensis Species 0.000 description 25
- 235000010633 broth Nutrition 0.000 description 24
- 108010070158 Lactose synthase Proteins 0.000 description 22
- 125000003729 nucleotide group Chemical group 0.000 description 22
- 108020004999 messenger RNA Proteins 0.000 description 21
- 239000002773 nucleotide Substances 0.000 description 21
- 102000004169 proteins and genes Human genes 0.000 description 21
- 108020004414 DNA Proteins 0.000 description 20
- 241000228245 Aspergillus niger Species 0.000 description 19
- 235000018102 proteins Nutrition 0.000 description 19
- 230000001105 regulatory effect Effects 0.000 description 19
- 238000006467 substitution reaction Methods 0.000 description 19
- 102000004190 Enzymes Human genes 0.000 description 18
- 108090000790 Enzymes Proteins 0.000 description 18
- 229940088598 enzyme Drugs 0.000 description 17
- 239000013612 plasmid Substances 0.000 description 15
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 15
- 241000228212 Aspergillus Species 0.000 description 14
- 230000004075 alteration Effects 0.000 description 14
- 102100032487 Beta-mannosidase Human genes 0.000 description 13
- 108010055059 beta-Mannosidase Proteins 0.000 description 13
- 238000012217 deletion Methods 0.000 description 13
- 230000037430 deletion Effects 0.000 description 13
- 239000003381 stabilizer Substances 0.000 description 13
- 238000003752 polymerase chain reaction Methods 0.000 description 12
- 238000013518 transcription Methods 0.000 description 12
- 230000035897 transcription Effects 0.000 description 12
- 230000009466 transformation Effects 0.000 description 12
- 241000233866 Fungi Species 0.000 description 11
- 108010005400 cutinase Proteins 0.000 description 11
- 238000003780 insertion Methods 0.000 description 11
- 230000037431 insertion Effects 0.000 description 11
- 230000028327 secretion Effects 0.000 description 11
- 239000000243 solution Substances 0.000 description 11
- 230000014616 translation Effects 0.000 description 11
- 240000006439 Aspergillus oryzae Species 0.000 description 10
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 10
- 101000946377 Bos taurus Alpha-lactalbumin Proteins 0.000 description 10
- 108091023040 Transcription factor Proteins 0.000 description 10
- 102000040945 Transcription factor Human genes 0.000 description 10
- 239000008367 deionised water Substances 0.000 description 10
- 229910021641 deionized water Inorganic materials 0.000 description 10
- 239000000833 heterodimer Substances 0.000 description 10
- 239000002609 medium Substances 0.000 description 10
- 238000011144 upstream manufacturing Methods 0.000 description 9
- 241000223218 Fusarium Species 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 8
- 241000499912 Trichoderma reesei Species 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 230000010076 replication Effects 0.000 description 8
- 150000003839 salts Chemical class 0.000 description 8
- 238000013519 translation Methods 0.000 description 8
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 7
- 239000000463 material Substances 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- 210000001938 protoplast Anatomy 0.000 description 7
- -1 1-3 amino acids Chemical class 0.000 description 6
- 241000351920 Aspergillus nidulans Species 0.000 description 6
- 241000146399 Ceriporiopsis Species 0.000 description 6
- 241000221779 Fusarium sambucinum Species 0.000 description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 239000008103 glucose Substances 0.000 description 6
- 230000000813 microbial effect Effects 0.000 description 6
- 150000007524 organic acids Chemical class 0.000 description 6
- 230000008488 polyadenylation Effects 0.000 description 6
- 238000011002 quantification Methods 0.000 description 6
- 238000001542 size-exclusion chromatography Methods 0.000 description 6
- 108010084185 Cellulases Proteins 0.000 description 5
- 102000005575 Cellulases Human genes 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- 241000223259 Trichoderma Species 0.000 description 5
- 229940041514 candida albicans extract Drugs 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 238000009472 formulation Methods 0.000 description 5
- 239000003550 marker Substances 0.000 description 5
- 239000012138 yeast extract Substances 0.000 description 5
- DLFVBJFMPXGRIB-UHFFFAOYSA-N Acetamide Chemical compound CC(N)=O DLFVBJFMPXGRIB-UHFFFAOYSA-N 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Natural products CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 4
- 229920000936 Agarose Polymers 0.000 description 4
- 239000004382 Amylase Substances 0.000 description 4
- 102000013142 Amylases Human genes 0.000 description 4
- 108010065511 Amylases Proteins 0.000 description 4
- 241001513093 Aspergillus awamori Species 0.000 description 4
- 241000892910 Aspergillus foetidus Species 0.000 description 4
- 241001225321 Aspergillus fumigatus Species 0.000 description 4
- 241001480052 Aspergillus japonicus Species 0.000 description 4
- 241000123346 Chrysosporium Species 0.000 description 4
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 4
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 4
- 241000567163 Fusarium cerealis Species 0.000 description 4
- 241000146406 Fusarium heterosporum Species 0.000 description 4
- 101000946384 Homo sapiens Alpha-lactalbumin Proteins 0.000 description 4
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 4
- 108010046068 N-Acetyllactosamine Synthase Proteins 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 235000019418 amylase Nutrition 0.000 description 4
- 229940091771 aspergillus fumigatus Drugs 0.000 description 4
- 230000027455 binding Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 229910052799 carbon Inorganic materials 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 239000012228 culture supernatant Substances 0.000 description 4
- 239000013613 expression plasmid Substances 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 239000011541 reaction mixture Substances 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 229910001868 water Inorganic materials 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 101000757144 Aspergillus niger Glucoamylase Proteins 0.000 description 3
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 3
- 229920002101 Chitin Polymers 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- 241000223221 Fusarium oxysporum Species 0.000 description 3
- 241000567178 Fusarium venenatum Species 0.000 description 3
- 241001480714 Humicola insolens Species 0.000 description 3
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 3
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 3
- 241000235648 Pichia Species 0.000 description 3
- 241000235403 Rhizomucor miehei Species 0.000 description 3
- 241001313536 Thermothelomyces thermophila Species 0.000 description 3
- 108090000637 alpha-Amylases Proteins 0.000 description 3
- 229940024171 alpha-amylase Drugs 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 239000006143 cell culture medium Substances 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 239000003797 essential amino acid Substances 0.000 description 3
- 235000020776 essential amino acid Nutrition 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- XRECTZIEBJDKEO-UHFFFAOYSA-N flucytosine Chemical compound NC1=NC(=O)NC=C1F XRECTZIEBJDKEO-UHFFFAOYSA-N 0.000 description 3
- 229960004413 flucytosine Drugs 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 229910052816 inorganic phosphate Inorganic materials 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 239000012137 tryptone Substances 0.000 description 3
- 241001019659 Acremonium <Plectosphaerellaceae> Species 0.000 description 2
- 102100034044 All-trans-retinol dehydrogenase [NAD(+)] ADH1B Human genes 0.000 description 2
- 101710193111 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 2
- 241000223651 Aureobasidium Species 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 241000222490 Bjerkandera Species 0.000 description 2
- 241000222478 Bjerkandera adusta Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 241001466517 Ceriporiopsis aneirina Species 0.000 description 2
- 241001646018 Ceriporiopsis gilvescens Species 0.000 description 2
- 241001277875 Ceriporiopsis rivulosa Species 0.000 description 2
- 241000524302 Ceriporiopsis subrufa Species 0.000 description 2
- 241000985909 Chrysosporium keratinophilum Species 0.000 description 2
- 241001674013 Chrysosporium lucknowense Species 0.000 description 2
- 241001556045 Chrysosporium merdarium Species 0.000 description 2
- 241000080524 Chrysosporium queenslandicum Species 0.000 description 2
- 241001674001 Chrysosporium tropicum Species 0.000 description 2
- 241000355696 Chrysosporium zonatum Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 241000222511 Coprinus Species 0.000 description 2
- 244000251987 Coprinus macrorhizus Species 0.000 description 2
- 235000001673 Coprinus macrorhizus Nutrition 0.000 description 2
- 241000222356 Coriolus Species 0.000 description 2
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 102000010911 Enzyme Precursors Human genes 0.000 description 2
- 108010062466 Enzyme Precursors Proteins 0.000 description 2
- 241000145614 Fusarium bactridioides Species 0.000 description 2
- 241000223194 Fusarium culmorum Species 0.000 description 2
- 241000223195 Fusarium graminearum Species 0.000 description 2
- 241001112697 Fusarium reticulatum Species 0.000 description 2
- 241001014439 Fusarium sarcochroum Species 0.000 description 2
- 241000223192 Fusarium sporotrichioides Species 0.000 description 2
- 241001465753 Fusarium torulosum Species 0.000 description 2
- 241000146398 Gelatoporia subvermispora Species 0.000 description 2
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 2
- 244000068988 Glycine max Species 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- 108700023372 Glycosyltransferases Proteins 0.000 description 2
- 102000051366 Glycosyltransferases Human genes 0.000 description 2
- 241000223198 Humicola Species 0.000 description 2
- 239000007836 KH2PO4 Substances 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- 102100033468 Lysozyme C Human genes 0.000 description 2
- 241001344133 Magnaporthe Species 0.000 description 2
- 241000235395 Mucor Species 0.000 description 2
- 241000226677 Myceliophthora Species 0.000 description 2
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 2
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 2
- 241000233892 Neocallimastix Species 0.000 description 2
- 241000221960 Neurospora Species 0.000 description 2
- 241000221961 Neurospora crassa Species 0.000 description 2
- 241000233654 Oomycetes Species 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 241001236817 Paecilomyces <Clavicipitaceae> Species 0.000 description 2
- 241000228143 Penicillium Species 0.000 description 2
- 241000222385 Phanerochaete Species 0.000 description 2
- 241000222393 Phanerochaete chrysosporium Species 0.000 description 2
- 241000222395 Phlebia Species 0.000 description 2
- 241000222397 Phlebia radiata Species 0.000 description 2
- 241000235379 Piromyces Species 0.000 description 2
- 241000222350 Pleurotus Species 0.000 description 2
- 244000252132 Pleurotus eryngii Species 0.000 description 2
- 235000001681 Pleurotus eryngii Nutrition 0.000 description 2
- 241000959173 Rasamsonia emersonii Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 241000222480 Schizophyllum Species 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 241000228341 Talaromyces Species 0.000 description 2
- 241001540751 Talaromyces ruber Species 0.000 description 2
- 241000228178 Thermoascus Species 0.000 description 2
- 241000223258 Thermomyces lanuginosus Species 0.000 description 2
- 241001494489 Thielavia Species 0.000 description 2
- 241001495429 Thielavia terrestris Species 0.000 description 2
- 241001149964 Tolypocladium Species 0.000 description 2
- 241000222354 Trametes Species 0.000 description 2
- 241000222357 Trametes hirsuta Species 0.000 description 2
- 241000222355 Trametes versicolor Species 0.000 description 2
- 241000217816 Trametes villosa Species 0.000 description 2
- 241000223260 Trichoderma harzianum Species 0.000 description 2
- 241000378866 Trichoderma koningii Species 0.000 description 2
- 241000223262 Trichoderma longibrachiatum Species 0.000 description 2
- 241000223261 Trichoderma viride Species 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 2
- 241000409279 Xerochrysium dermatitidis Species 0.000 description 2
- 229960000583 acetic acid Drugs 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 102000004139 alpha-Amylases Human genes 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid group Chemical group C(C1=CC=CC=C1)(=O)O WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 108010019077 beta-Amylase Proteins 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 229910000366 copper(II) sulfate Inorganic materials 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 2
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 2
- 229910001385 heavy metal Inorganic materials 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- BHEPBYXIRTUNPN-UHFFFAOYSA-N hydridophosphorus(.) (triplet) Chemical compound [PH] BHEPBYXIRTUNPN-UHFFFAOYSA-N 0.000 description 2
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 2
- 229910000359 iron(II) sulfate Inorganic materials 0.000 description 2
- FGKJLKRYENPLQH-UHFFFAOYSA-N isocaproic acid Chemical compound CC(C)CCC(O)=O FGKJLKRYENPLQH-UHFFFAOYSA-N 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 2
- 235000019341 magnesium sulphate Nutrition 0.000 description 2
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 2
- 229910000357 manganese(II) sulfate Inorganic materials 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 2
- 235000019796 monopotassium phosphate Nutrition 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 230000006780 non-homologous end joining Effects 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 229920002401 polyacrylamide Polymers 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 2
- FGIUAXJPYTZDNR-UHFFFAOYSA-N potassium nitrate Chemical compound [K+].[O-][N+]([O-])=O FGIUAXJPYTZDNR-UHFFFAOYSA-N 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000001243 protein synthesis Methods 0.000 description 2
- 239000011535 reaction buffer Substances 0.000 description 2
- 239000007974 sodium acetate buffer Substances 0.000 description 2
- 239000000600 sorbitol Substances 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 229910021654 trace metal Inorganic materials 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 230000009105 vegetative growth Effects 0.000 description 2
- 239000007221 ypg medium Substances 0.000 description 2
- CHHHXKFHOYLYRE-UHFFFAOYSA-M 2,4-Hexadienoic acid, potassium salt (1:1), (2E,4E)- Chemical compound [K+].CC=CC=CC([O-])=O CHHHXKFHOYLYRE-UHFFFAOYSA-M 0.000 description 1
- WLJVXDMOQOGPHL-PPJXEINESA-N 2-phenylacetic acid Chemical compound O[14C](=O)CC1=CC=CC=C1 WLJVXDMOQOGPHL-PPJXEINESA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- XZKIHKMTEMTJQX-UHFFFAOYSA-N 4-Nitrophenyl Phosphate Chemical compound OP(O)(=O)OC1=CC=C([N+]([O-])=O)C=C1 XZKIHKMTEMTJQX-UHFFFAOYSA-N 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- 108010037870 Anthranilate Synthase Proteins 0.000 description 1
- 241000235349 Ascomycota Species 0.000 description 1
- 102000004580 Aspartic Acid Proteases Human genes 0.000 description 1
- 108010017640 Aspartic Acid Proteases Proteins 0.000 description 1
- 101000690713 Aspergillus niger Alpha-glucosidase Proteins 0.000 description 1
- 101900127796 Aspergillus oryzae Glucoamylase Proteins 0.000 description 1
- 108090000145 Bacillolysin Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000193388 Bacillus thuringiensis Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- 108091005658 Basic proteases Proteins 0.000 description 1
- 241000221198 Basidiomycota Species 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 101710130006 Beta-glucanase Proteins 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 108091033409 CRISPR Proteins 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 229920001661 Chitosan Polymers 0.000 description 1
- 241000233652 Chytridiomycota Species 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 101100317179 Dictyostelium discoideum vps26 gene Proteins 0.000 description 1
- QXNVGIXVLWOKEQ-UHFFFAOYSA-N Disodium Chemical compound [Na][Na] QXNVGIXVLWOKEQ-UHFFFAOYSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 101100407639 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) prtB gene Proteins 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 102100028501 Galanin peptides Human genes 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 101100080316 Geobacillus stearothermophilus nprT gene Proteins 0.000 description 1
- 241000123313 Gloeophyllum sepiarium Species 0.000 description 1
- 229920001503 Glucan Polymers 0.000 description 1
- 102100022624 Glucoamylase Human genes 0.000 description 1
- 229960005541 HAMLET Drugs 0.000 description 1
- 101000882901 Homo sapiens Claudin-2 Proteins 0.000 description 1
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 102100027612 Kallikrein-11 Human genes 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- 241001099157 Komagataella Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- 241001099156 Komagataella phaffii Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 108010029541 Laccase Proteins 0.000 description 1
- 241000235087 Lachancea kluyveri Species 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 229920000057 Mannan Polymers 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 102100026367 Pancreatic alpha-amylase Human genes 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 102100039087 Peptidyl-alpha-hydroxyglycine alpha-amidating lyase Human genes 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 241000425347 Phyla <beetle> Species 0.000 description 1
- 229920001030 Polyethylene Glycol 4000 Polymers 0.000 description 1
- 238000012356 Product development Methods 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 108091036333 Rapid DNA Proteins 0.000 description 1
- 241000235402 Rhizomucor Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 description 1
- 235000001006 Saccharomyces cerevisiae var diastaticus Nutrition 0.000 description 1
- 244000206963 Saccharomyces cerevisiae var. diastaticus Species 0.000 description 1
- 241000204893 Saccharomyces douglasii Species 0.000 description 1
- 241001407717 Saccharomyces norbensis Species 0.000 description 1
- 241001123227 Saccharomyces pastorianus Species 0.000 description 1
- 241000235343 Saccharomycetales Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 239000008049 TAE buffer Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 101710152431 Trypsin-like protease Proteins 0.000 description 1
- TVXBFESIOXBWNM-UHFFFAOYSA-N Xylitol Natural products OCCC(O)C(O)C(O)CCO TVXBFESIOXBWNM-UHFFFAOYSA-N 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- 241000235015 Yarrowia lipolytica Species 0.000 description 1
- PTFCDOFLOPIGGS-UHFFFAOYSA-N Zinc dication Chemical compound [Zn+2] PTFCDOFLOPIGGS-UHFFFAOYSA-N 0.000 description 1
- 241000758405 Zoopagomycotina Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 235000011054 acetic acid Nutrition 0.000 description 1
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 1
- HGEVZDLYZYVYHD-UHFFFAOYSA-N acetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound CC(O)=O.OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O HGEVZDLYZYVYHD-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000012867 alanine scanning Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000000845 anti-microbial effect Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 101150009206 aprE gene Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 229940097012 bacillus thuringiensis Drugs 0.000 description 1
- 230000003385 bacteriostatic effect Effects 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 108010047754 beta-Glucosidase Proteins 0.000 description 1
- 102000006995 beta-Glucosidase Human genes 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 230000034303 cell budding Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- ARUVKPQLZAKDPS-UHFFFAOYSA-L copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 1
- 235000013365 dairy product Nutrition 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000012470 diluted sample Substances 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- KETSPIPODMGOEJ-UHFFFAOYSA-B dodecasodium;(2,3,4,5,6-pentaphosphonatooxycyclohexyl) phosphate Chemical compound [Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[O-]P([O-])(=O)OC1C(OP([O-])([O-])=O)C(OP([O-])([O-])=O)C(OP([O-])([O-])=O)C(OP([O-])([O-])=O)C1OP([O-])([O-])=O KETSPIPODMGOEJ-UHFFFAOYSA-B 0.000 description 1
- 238000011143 downstream manufacturing Methods 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 108010091371 endoglucanase 1 Proteins 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- FPIQZBQZKBKLEI-UHFFFAOYSA-N ethyl 1-[[2-chloroethyl(nitroso)carbamoyl]amino]cyclohexane-1-carboxylate Chemical compound ClCCN(N=O)C(=O)NC1(C(=O)OCC)CCCCC1 FPIQZBQZKBKLEI-UHFFFAOYSA-N 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 230000002550 fecal effect Effects 0.000 description 1
- 239000006052 feed supplement Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 239000012362 glacial acetic acid Substances 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 238000011090 industrial biotechnology method and process Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000017730 intein-mediated protein splicing Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- HEBKCHPVOIAQTA-UHFFFAOYSA-N meso ribitol Natural products OCC(O)C(O)C(O)CO HEBKCHPVOIAQTA-UHFFFAOYSA-N 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- MEFBJEMVZONFCJ-UHFFFAOYSA-N molybdate Chemical compound [O-][Mo]([O-])(=O)=O MEFBJEMVZONFCJ-UHFFFAOYSA-N 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 101150093025 pepA gene Proteins 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- JTJMJGYZQZDUJJ-UHFFFAOYSA-N phencyclidine Chemical compound C1CCCCN1C1(C=2C=CC=CC=2)CCCCC1 JTJMJGYZQZDUJJ-UHFFFAOYSA-N 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 235000010333 potassium nitrate Nutrition 0.000 description 1
- 239000004302 potassium sorbate Substances 0.000 description 1
- 235000010241 potassium sorbate Nutrition 0.000 description 1
- 229940069338 potassium sorbate Drugs 0.000 description 1
- OTYBMLCTZGSZBG-UHFFFAOYSA-L potassium sulfate Chemical compound [K+].[K+].[O-]S([O-])(=O)=O OTYBMLCTZGSZBG-UHFFFAOYSA-L 0.000 description 1
- 229910052939 potassium sulfate Inorganic materials 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 239000012521 purified sample Substances 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 238000007423 screening assay Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 239000012089 stop solution Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- LSGOVYNHVSXFFJ-UHFFFAOYSA-N vanadate(3-) Chemical compound [O-][V]([O-])([O-])=O LSGOVYNHVSXFFJ-UHFFFAOYSA-N 0.000 description 1
- 239000000811 xylitol Substances 0.000 description 1
- HEBKCHPVOIAQTA-SCDXWVJYSA-N xylitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)CO HEBKCHPVOIAQTA-SCDXWVJYSA-N 0.000 description 1
- 229960002675 xylitol Drugs 0.000 description 1
- 235000010447 xylitol Nutrition 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 1
- 229910000368 zinc sulfate Inorganic materials 0.000 description 1
- 239000011686 zinc sulphate Substances 0.000 description 1
- 235000009529 zinc sulphate Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/76—Albumins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y402/00—Carbon-oxygen lyases (4.2)
- C12Y402/02—Carbon-oxygen lyases (4.2) acting on polysaccharides (4.2.2)
- C12Y402/0201—Pectin lyase (4.2.2.10)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
Definitions
- the present invention relates to nucleic acid constructs comprising a first polynucleotide encoding a signal peptide, e.g., from a fungal glycosidase, and a second polynucleotide encoding an alphalactalbumin (ALAB) polypeptide; expression vectors and host cells comprising said nucleic acid constructs; methods for producing ALAB polypeptides; and fusion proteins comprising an ALAB polypeptide and a signal peptide.
- ALAB alphalactalbumin
- SPs Signal peptides
- Signal peptides are short amino acid sequences present in the amino terminus of many newly synthesized polypeptides that target these into or across cellular membranes, thereby aiding maturation and secretion.
- the amino acid sequence of the SP influences secretion efficiency and thereby the yield of the polypeptide manufacturing process.
- Bioinformatic tools such as SignalP and SignalP5 can predict SPs from amino acid sequences, but most cannot distinguish between various types of SPs (Armenteros et al., Nat. Biotechnol. 37: 420-423, 2019).
- a large degree of redundancy in the amino acid sequence of SPs makes it difficult to predict the efficiency of any given SP for production of recombinant proteins at industrial scale.
- Alpha-lactalbumin is a principal protein of milk.
- ALAB forms the regulatory subunit of the lactose synthase (LS) heterodimer and beta 1 ,4-galactosyltransferase (beta4Gal-T1) forms the catalytic component.
- LS lactose synthase
- beta4Gal-T1 beta 1 ,4-galactosyltransferase
- alpha-lactalbumin strongly binds calcium and zinc ions and may possess bactericidal or antitumor activity.
- a folding variant of alpha-lactalbumin, called HAMLET likely induces apoptosis in tumor and immature cells.
- Recombinant ALAB has the potential to improve the nutritional value in food, beverages, and feed.
- Recombinant ALAB expression has been reported previously in different organisms including transgenic goats (Yuan YG et al, J Anal Methods Chem. 2014; 2014:281031. doi: 10.1155/2014/281031).
- Recombinantly produced phytase is widely used as a feed supplement to effectively improve phosphorous utilization and reduce fecal phosphorous excretion in animals as poultry and pig.
- polypeptides ALAB and phytase there is a growing need for access thereto in the respective industries.
- the present invention is based on the surprising and inventive finding that expression of difficult- to-express proteins (alpha-lactalbumin and phytase) with a fungal signal peptide provides increased yield when expressed in fungal host cells.
- difficult- to-express proteins alpha-lactalbumin and phytase
- MTP microtiter plates
- SF shake flasks
- laboratory fermentation tanks laboratory fermentation tanks.
- the present invention relates a nucleic acid construct comprising: a first polynucleotide encoding a signal peptide having a sequence identity of at least 80% to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49; and a second polynucleotide encoding an alpha-lactalbumin (ALAB) polypeptide having a sequence identity of at least 70% to the polypeptide sequence of SEQ ID NO:6; wherein the first polynucleotide and the second polynucleotide are operably linked in translational fusion.
- ALAB alpha-lactalbumin
- the present invention relates to expression vectors comprising a nucleic acid construct of the first aspect.
- the present invention relates to a fungal host cell comprising in its genome: a) a nucleic acid construct according to the first aspect; and/or b) an expression vector according to the second aspect.
- the present invention relates to a method of producing a alpha-lactalbumin (ALAB) polypeptide, the method comprising: a) cultivating a host cell according to the third aspect under conditions conducive for production of the ALAB polypeptide; and optionally b) recovering the ALAB polypeptide.
- ALAB alpha-lactalbumin
- the present invention relates to a method of producing a polypeptide having phytase activity, the method comprising: a) cultivating a host cell according to the third aspect under conditions conducive for production of the polypeptide having phytase activity; and optionally b) recovering the polypeptide having phytase activity.
- the present invention relates to a fusion polypeptide, comprising: a signal peptide having at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, sequence to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49, and
- an alpha-lactalbumin polypeptide having at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, sequence identity to SEQ ID NO: 6, or
- a polypeptide having phytase activity having at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, sequence identity to SEQ ID NO:8.
- SEQ ID NO:1 is the GH26 (JSP 002) signal peptide coding sequence
- SEQ ID NO:2 is the GH26 (JSP 002) signal peptide (MFAKLSLLSLLFSSAALG).
- SEQ ID NO:3 is the GH16 signal peptide coding sequence.
- SEQ ID NO:4 is the GH16 signal peptide (MRLPLVSSTVALLSSASLVAA).
- SEQ ID NO:5 is the ALAB coding sequence, (without SP)
- SEQ ID NO:6 is the ALAB amino acid sequence, (without SP)
- SEQ ID NO:7 is the ALAB coding sequence, (with SP GH26)
- SEQ ID NO:8 is the ALAB amino acid sequence (with SP GH26)
- SEQ ID NO:9 is the ALAB coding sequence, (with SP GH16)
- SEQ ID NQ:10 is the ALAB amino acid sequence (with SP GH16)
- SEQ ID NO:11 is the phytase coding sequence, (without SP)
- SEQ ID NO: 12 is the phytase amino acid sequence, (without SP)
- SEQ ID NO:13 is the phytase coding sequence, (with SP GH26)
- SEQ ID NO: 14 is the phytase amino acid sequence (with SP GH26)
- SEQ ID NO:15 is the phytase coding sequence, (with SP GH16)
- SEQ ID NO:16 is the phytase amino acid sequence (with SP GH16)
- SEQ ID NO:17 is the intron between SP GH16/ SP GH26 and the ALAB coding sequence
- SEQ ID NO:20 is the GH26 polypeptide coding sequence from Aspergillus luchuensis
- SEQ ID NO:21 is the GH26 polypeptide from A. luchuensis
- SEQ ID NO:22 is the GH16 polypeptide coding sequence from Aspergillus luchuensis
- SEQ ID NO:23 is the GH16 polypeptide from A. luchuensis
- SEQ ID NO:24 is the reference GH13 signal peptide coding sequence
- SEQ ID NO:25 is the reference GH13 signal peptide
- SEQ ID NO:26 is the reference cutinase (JSP004) signal peptide coding sequence
- SEQ ID NO:27 is the reference cutinase (JSP004) signal peptide
- SEQ ID NO:28 is the reference GH72 signal peptide coding sequence
- SEQ ID NO:29 is the reference GH72 signal peptide
- SEQ ID NO:30 is the SP GH16-ALAB expression cassette (promoter-SPGH16-intron-ALAB- terminator)
- SEQ ID NO: 31 is the SP GH26-ALAB expression cassette (promoter-SPGH26-intron-ALAB- terminator)
- SEQ ID NO:32 is the SP GH72-ALAB expression cassette (promoter-SPGH72-intron-ALAB- terminator)
- SEQ ID NO:33 is the SP GH16-phytase expression cassette (promoter-SPGH16-phytase- terminator)
- SEQ ID NO: 34 is HA442 primer
- SEQ ID NO: 35 is HA451 primer
- SEQ ID NO: 36 is HA283 primer
- SEQ ID NO: 37 is HA444 primer
- SEQ ID NO: 38 is HA450 primer
- SEQ ID NO: 39 is HA445 primer
- SEQ ID NO: 40 is HA442 primer
- SEQ ID NO: 41 is HA233 primer
- SEQ ID NO: 42 is HA268 primer
- SEQ ID NO: 43 is HA489 primer
- SEQ ID NO: 44 is the LYA1_4 (JSP017, Aspergillus luchuensis) signal peptide coding sequence
- SEQ ID NO: 45 is the LYA1_4 (JSP017, Aspergillus luchuensis) signal peptide (MKYAAALTAVAALAARAAA)
- SEQ ID NO: 46 is the pepesin A (JSP019, Aspergillus niger) signal peptide coding sequence
- SEQ ID NO: 47 is the pepesin A (JSP019, Aspergillus niger) signal peptide (MVVFSKTAALVLGLSSAVSA)
- SEQ ID NO: 48 is the GH28_9 endo-1 ,4-alpha-polygalacturonase (JSP008, Aspergillus luchuensis) signal peptide coding sequence
- SEQ ID NO: 49 is the GH28_9 endo-1 ,4-alpha-polygalacturonase (JSP008, Aspergillus luchuensis) signal peptide (MHFLQNAFVAATMGAALAAA)
- Alpha-lactalbumin The term “alpha-lactalbumin”, “a-lactalbumin” “a-LA”, “a-LAB”, “ALAB”, or “LALBA” means a polypeptide which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer.
- ALAB forms the regulatory subunit of the lactose synthase (LS) heterodimer and beta 1 ,4- galactosyltransferase (beta4Gal-T1) forms the catalytic component.
- cDNA means a DNA molecule that can be prepared by reverse transcription from a mature, spliced, mRNA molecule obtained from a eukaryotic or prokaryotic cell. cDNA lacks intron sequences that may be present in the corresponding genomic DNA.
- the initial, primary RNA transcript is a precursor to mRNA that is processed through a series of steps, including splicing, before appearing as mature spliced mRNA.
- Coding sequence means a polynucleotide, which directly specifies the amino acid sequence of a polypeptide.
- the boundaries of the coding sequence are generally determined by an open reading frame, which begins with a start codon, such as ATG, GTG, or TTG, and ends with a stop codon, such as TAA, TAG, or TGA.
- the coding sequence may be a genomic DNA, cDNA, synthetic DNA, or a combination thereof.
- control sequences means nucleic acid sequences involved in regulation of expression of a polynucleotide in a specific organism or in vitro. Each control sequence may be native (/.e., from the same gene) or heterologous (/.e., from a different gene) to the polynucleotide encoding the polypeptide, and native or heterologous to each other. Such control sequences include, but are not limited to leader, polyadenylation, prepropeptide, propeptide, signal peptide, promoter, terminator, enhancer, and transcription or translation initiator and terminator sequences. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the polynucleotide encoding a polypeptide.
- expression means any step involved in the production of a polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
- Expression vector refers to a linear or circular DNA construct comprising a DNA sequence encoding a polypeptide, which coding sequence is operably linked to a suitable control sequence capable of effecting expression of the DNA in a suitable host.
- control sequences may include a promoter to effect transcription, an optional operator sequence to control transcription, a sequence encoding suitable ribosome binding sites on the mRNA, enhancers and sequences which control termination of transcription and translation.
- extension means an addition of one or more amino acids to the amino and/or carboxyl terminus of a polypeptide, wherein the “extended” polypeptide has phytase activity or wherein the extended polypeptide is an ALAB polypeptide which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer.
- LS lactose synthase
- fragment as used in the context of a polypeptide means a polypeptide having one or more amino acids absent from its amino and/or carboxyl terminus, wherein the fragment has phytase activity, or wherein the fragment is an ALAB fragment which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer.
- the fragment may be produced naturally during expression and/or purification of the polypeptide, or may be the result of expression of a modified nucleotide sequence expressing the fragment or of targeted removal of amino acids from the amino and/or carboxy terminus.
- Fungal glycosidase means any glycosidase (EC 3.2.1) that hydrolyses O- and S-glycosyl compounds and which is encoded with an N-terminal signal peptide and natively expressed by a fungal species.
- the term “fungal glycosidase” also includes, but is not limited to, fungal mannanases (EC 3.2.1.78) and fungal glucanases (EC 3.2.1.39).
- Other non-limiting examples are alpha-amylase (EC 3.2.1.1), beta-amylase (EC 3.2.1.2) and lysozyme (EC 3.2.1.17).
- Fusion polypeptide is a polypeptide in which one polypeptide is fused at the N-terminus and/or the C-terminus of a polypeptide of the present invention.
- a fusion polypeptide is produced by fusing a polynucleotide encoding another polypeptide to a polynucleotide of the present invention, or by fusing two or more polynucleotides of the present invention together.
- Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fusion polypeptide is under control of the same promoter(s) and terminator.
- Fusion polypeptides may also be constructed using intein technology in which fusion polypeptides are created post-translationally (Cooper et al., 1993, EMBO J. 12: 2575-2583; Dawson et al., 1994, Science 266: 776-779).
- a fusion polypeptide can further comprise a cleavage site between the two polypeptides. Upon secretion of the fusion protein, the site is cleaved releasing the two polypeptides. Examples of cleavage sites include, but are not limited to, the sites disclosed in Martin et al., 2003, J. Ind. Microbiol. Biotechnol. 3: 568-576; Svetina et al., 2000, J.
- Glucanase means a glucanase comprising a signal peptide at its N-terminal end, the glucanase having endo-b-1 ,3-glucanase activity (EC 3.2.1.39).
- a non limiting example for a glucanase is the Aspergillus luchuensis glucanase.
- the signal peptide sequence derived of the A. luchuensis glucanase is termed “GH16” or “GH16 SP” and is represented by SEQ ID NO:4, derived from the A. luchuensis glucanase polypeptide with SEQ ID NO: 23.
- heterologous means, with respect to a host cell, that a polypeptide or nucleic acid does not naturally occur in the host cell.
- heterologous means, with respect to a polypeptide or nucleic acid, that a control sequence, e.g., promoter, of a polypeptide or nucleic acid is not naturally associated with the polypeptide or nucleic acid, i.e., the control sequence is from a gene other than the gene encoding the mature polypeptide.
- Host Strain or Host Cell is an organism into which an expression vector, phage, virus, or other DNA construct, including a polynucleotide encoding a polypeptide of interest (e.g., an amylase) has been introduced.
- exemplary host strains are microorganism cells (e.g., bacteria, filamentous fungi, and yeast) capable of expressing the polypeptide of interest and/or fermenting saccharides.
- the term "host cell” includes protoplasts created from cells.
- Isolated means a polypeptide, nucleic acid, cell, or other specified material or component that has been separated from at least one other material or component, including but not limited to, other proteins, nucleic acids, cells, etc.
- An isolated polypeptide, nucleic acid, cell or other material is thus in a form that does not occur in nature.
- An isolated polypeptide includes, but is not limited to, a culture broth containing the secreted polypeptide expressed in a host cell.
- Mannanase means a mannanase comprising a signal peptide at its N- terminal end, the mannanase having endo-1 ,4-beta-mannanase activity (EC 3.2.1.78).
- a non limiting example for a manannase is the Aspergillus luchuensis mannanase.
- the signal peptide sequence derived of the Aspergillus luchuensis mannanase is termed “GH26” or “GH26 SP” and is represented by SEQ ID NO:2, derived from the A. luchuensis mannanase polypeptide with SEQ ID NO: 21 .
- Mature polypeptide means a polypeptide in its mature form following translation and any post-translational modifications such as N-terminal processing (e.g. removal of signal peptide), C-terminal truncation, glycosylation, phosphorylation, etc. It is known in the art that a host cell may produce a mixture of two of more different mature polypeptides (/.e., with a different C-terminal and/or N-terminal amino acid) expressed by the same polynucleotide. It is also known in the art that different host cells process polypeptides differently, and thus, one host cell expressing a polynucleotide may produce a different mature polypeptide (e.g.
- Mature polypeptides of the invention may therefore have slight differences at the N- and/or C-terminal due to such differentiated expression by the host cell.
- a mature polypeptide having one or more amino acids absent from the N- and/or C-terminal may be considered to be a “fragment” of the full-length polypeptide.
- the mature polypeptide is amino acids 1 to 123 of SEQ ID NO:6. In some aspects, the mature polypeptide is amino acids 19 to 141 of SEQ ID NO:8 and amino acids 1 to 18 of SEQ ID NO:8 are a signal peptide.
- the mature polypeptide is amino acids 22 to 144 of SEQ ID NQ:10 and amino acids 1 to 21 of SEQ ID NQ:10 are a signal peptide.
- the mature polypeptide is amino acids 1 to 411 of SEQ ID NO:12. In some aspects, the mature polypeptide is amino acids 19 to 429 of SEQ ID NO:14 and amino acids 1 to 18 of SEQ ID NO:14 are a signal peptide.
- the mature polypeptide is amino acids 22 to 432 of SEQ ID NO:16 and amino acids 1 to 21 of SEQ ID NO:16 are a signal peptide.
- Mature polypeptide coding sequence means a polynucleotide that encodes a mature polypeptide having phytase activity, or an ALAB polypeptide which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer.
- the mature polypeptide coding sequence is nucleotides 1 to 369 of SEQ ID NO:5.
- the mature polypeptide coding sequence is nucleotides 55 to 423 of SEQ ID NO: 7 and nucleotides 1 to 54 of SEQ ID NO: 7 encode a signal peptide.
- the mature polypeptide coding sequence is nucleotides 64 to 432 of SEQ ID NO: 9 and nucleotides 1 to 63 of SEQ ID NO: 9 encode a signal peptide.
- the mature polypeptide coding sequence is nucleotides 110 to 478 of SEQ ID NO:
- nucleotides 1 to 63 of SEQ ID NO:18 encode a signal peptide, and nucleotides 64 to 109 of SEQ ID NO:18 is an intron.
- the mature polypeptide coding sequence is nucleotides 101 to 469 of SEQ ID NO:
- nucleotides 1 to 54 of SEQ ID NO: 19 encode a signal peptide
- nucleotides 55 to 100 of SEQ ID NO:19 is an intron.
- the mature polypeptide coding sequence is nucleotides 1 to 1233 of SEQ ID NO:11.
- the mature polypeptide coding sequence is nucleotides 55 to 1287 of SEQ ID NO: 13 and nucleotides 1 to 54 of SEQ ID NO: 13 encode a signal.
- the mature polypeptide coding sequence is nucleotides 64 to 1296 of SEQ ID NO: 15 and nucleotides 1 to 63 of SEQ ID NO: 15 encode a signal peptide.
- Native means a nucleic acid or polypeptide naturally occurring in a host cell.
- Nucleic acid encompasses DNA, RNA, heteroduplexes, and synthetic molecules capable of encoding a polypeptide. Nucleic acids may be single stranded or double stranded, and may be chemical modifications. The terms “nucleic acid” and “polynucleotide” are used interchangeably. Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and the present compositions and methods encompass nucleotide sequences that encode a particular amino acid sequence. Unless otherwise indicated, nucleic acid sequences are presented in 5'-to-3' orientation.
- nucleic acid construct means a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or is modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature or which is synthetic, and which comprises one or more control sequences operably linked to the nucleic acid sequence.
- Obtained polypeptide/peptide/polynucleotide The term “obtained” or “derived” when used in reference to a polynucleotide sequence, ALAB sequence, polypeptide sequence, mannanase sequence, glucanase sequence, phytase sequence, variant sequence or signal peptide sequence, means that the molecule originally has been isolated from the given source and that the molecule can either be utilized in its native sequence or that the molecule is modified by methods known to the skilled person.
- operably linked means that specified components are in a relationship (including but not limited to juxtaposition) permitting them to function in an intended manner.
- a regulatory sequence is operably linked to a coding sequence such that expression of the coding sequence is under control of the regulatory sequence.
- parent means a polypeptide functioning as a signal peptide, or a polypeptide having phytase activity, or an ALAB polypeptide which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer, to which an alteration is made to produce variants of the present invention.
- the parent may be a naturally occurring (wild-type) polypeptide or a variant or fragment thereof.
- Phytase In the present context, a preferred Phytase according to the invention is classified as belonging to the EC 3.1 .3.26 group. The EC numbers referto Enzyme Nomenclature 1992 from NC-IUBMB, Academic Press, San Diego, California, including supplements 1 -5 published in Eur. J.
- Phytase activity is determined by the libertation of inorganic phosphate from Na-phytate solution, wherein one phytase activity unit is the amount of enzyme which liberates 1 pmol inorganic phosphate per min from a 0.0051 M Na-phytate solution in 0.25 M Na-acetate, pH 5.5 and at 37° C (Engelen, A. J., et al., 1994, "Simple and rapid determination of phytase activity", J. AOAC Int. 77:760-764).
- Examples of activity unit names are: FYT, FTU and U.
- Phytase activity may be determined using the assay as described in Example 1 ("Determination of phytase activity").
- the polypeptides of the present invention have at least 20%, e.g., at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 100% of the phytase activity of SEQ ID NO:12.
- pNP assay and “FYT(B) assay”.
- Specific activity is measured on highly purified samples (an SDS poly acryl amide gel should show the presence of only one component).
- the enzyme protein concentration may be determined by amino acid analysis, and the phytase activity in the units of FYT.
- Specific activity is a characteristic of the specific phytase variant in question, and it is calculated as the phytase activity measured in FYT units per mg phytase enzyme protein.
- Recombinant is used in its conventional meaning to refer to the manipulation, e.g., cutting and rejoining, of nucleic acid sequences to form constellations different from those found in nature.
- the term recombinant refers to a cell, nucleic acid, polypeptide or vector that has been modified from its native state.
- recombinant cells express genes that are not found within the native (non-recombinant) form of the cell, or express native genes at different levels or under different conditions than found in nature.
- the term “recombinant” is synonymous with “genetically modified” and “transgenic”.
- Sequence identity The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter “sequence identity”.
- the sequence identity between two amino acid sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably version 6.6.0 or later.
- the parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix.
- the -nobrief option must be specified in the command line.
- the output of Needle labeled “longest identity” is calculated as follows: (Identical Residues x 100)/(Length of Alignment - Total Number of Gaps in Alignment)
- the sequence identity between two polynucleotide sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, supra) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 6.6.0 or later.
- the parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NUC4.4) substitution matrix.
- the nobrief option must be specified in the command line.
- the output of Needle labeled “longest identity” is calculated as follows:
- Signal Peptide A "signal peptide” is a sequence of amino acids attached to the N-terminal portion of a protein, which facilitates the secretion of the protein outside the cell.
- the mature form of an extracellular protein lacks the signal peptide, which is cleaved off during the secretion process.
- Subsequence means a polynucleotide having one or more nucleotides absent from the 5' and/or 3' end of a mature polypeptide coding sequence; wherein the subsequence encodes a fragment having phytase activity, or wherein the subsequence forms a functional regulatory subunit of the lactose synthase (LS) heterodimer.
- LS lactose synthase
- variant means a polypeptide having phytase activity, or a polypeptide forming a functional regulatory subunit of the lactose synthase (LS) heterodimer, comprising a man-made mutation, i.e., a substitution, insertion (including extension), and/or deletion (e.g., truncation), at one or more positions.
- a substitution means replacement of the amino acid occupying a position with a different amino acid;
- a deletion means removal of the amino acid occupying a position;
- an insertion means adding 1 -5 amino acids (e.g., 1-3 amino acids, in particular, 1 amino acid) adjacent to and immediately following the amino acid occupying a position.
- Wild-type in reference to an amino acid sequence or nucleic acid sequence means that the amino acid sequence or nucleic acid sequence is a native or naturally-occurring sequence.
- naturally-occurring refers to anything (e.g., proteins, amino acids, or nucleic acid sequences) that is found in nature.
- non-naturally occurring refers to anything that is not found in nature (e.g., recombinant nucleic acids and protein sequences produced in the laboratory or modification of the wild-type sequence).
- the present invention is based on the surprising and inventive finding that expression of difficult- to-express proteins (alpha-lactalbumin and phytase) with signal peptides obtained from fungal polypeptides provides increased yield when expressed in fungal host cells.
- difficult- to-express proteins alpha-lactalbumin and phytase
- the present invention also relates to polynucleotides encoding a polypeptide of the present invention, as described herein.
- the polynucleotide may be a genomic DNA, a cDNA, a synthetic DNA, a synthetic RNA, a mRNA, or a combination thereof.
- the polynucleotide may be cloned from a strain of Aspergillus, or a related organism and thus, for example, may be a polynucleotide sequence encoding a variant of the polypeptide of the invention.
- the polynucleotide is a subsequence encoding a fragment having phytase activity or a fragment which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer of the present invention.
- polynucleotide encoding the signal peptide of the present invention is isolated from an Aspergillus cell, such as an Aspergillus luchuensis cell.
- the polynucleotide may also be mutated by introduction of nucleotide substitutions that do not result in a change in the amino acid sequence of the polypeptide, but which correspond to the codon usage of the host organism intended for production of the enzyme, or by introduction of nucleotide substitutions that may give rise to a different amino acid sequence.
- nucleotide substitutions see, e.g., Ford et al., 1991 , Protein Expression and Purification 2: 95-107.
- the polynucleotide is isolated.
- the polynucleotide is purified.
- the present invention also relates to nucleic acid constructs comprising a polynucleotide of the present invention, wherein the polynucleotide is operably linked to one or more control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
- the invention relates to a nucleic acid construct comprising: a first polynucleotide encoding a signal peptide having a sequence identity of at least 80% to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49; and a) a second polynucleotide encoding an alpha-lactalbumin (ALAB) polypeptide having a sequence identity of at least 70% to the polypeptide sequence of SEQ ID NO:6; or b) a second polynucleotide encoding a polypeptide having phytase activity; wherein the first polynucleotide and the second polynucleotide are operably linked in translational fusion.
- ALAB alpha-lactalbumin
- the second polynucleotide is located downstream from the first polynucleotide.
- the signal peptide is a naturally occurring signal peptide, or a functional fragment or functional variant of a naturally occurring signal peptide.
- the signal peptide is from a filamentous fungal glycosidase.
- the construct is further comprising a third polynucleotide downstream of the first polynucleotide and upstream of the second polynucleotide.
- the third polynucleotide is a non-coding intron.
- the third polynucleotide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:17 (gtaagtaacatccactctgttctagtgccatgctgagattgtacag).
- the construct comprises a polynucleotide sequence with a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:18 or SEQ ID NO:19.
- the nucleic acid construct further comprises a heterologous promoter, and wherein said promoter, the first polynucleotide, and the second polynucleotide, and optionally the third polynucleotide, are operably linked.
- the promoter is a P3 promoter or a P3-based promoter, preferably the heterologous promoter is a tandem promoter comprising the P3 promoter or is a tandem promoter derived from the P3 promoter.
- the promoter is operably linked to an mRNA stabilizer region; preferably the mRNA stabilizer region is the cry 11 IA mRNA stabilizer region.
- the signal peptide is a naturally occurring signal peptide, or a functional fragment or functional variant of a naturally occurring signal peptide.
- the signal peptide is from a glycosidase (EC 3.2.1).
- the signal peptide is obtained from a mannanase polypeptide (EC 3.2.1.78).
- the signal peptide is obtained from a b-transglycosidase polypeptide (EC 2.4.1.-).
- the signal peptide is obtained from a chitin b-1 ,3/1 ,6-glucanosyltransferase polypeptide (EC 2.4.1 .-) polypeptide.
- the signal peptide is obtained from an endo-b-1 ,3-glucanase polypeptide or laminarinase polypeptide (EC 3.2.1.39). In one embodiment, the signal peptide is obtained from a polypeptide, such as a mannanase, transglycosidase, glycosyltransferase, laminarinase, or glucanase, expressed by a filamentous fungal host cell.
- a polypeptide such as a mannanase, transglycosidase, glycosyltransferase, laminarinase, or glucanase
- the signal peptide is obtained from a polypeptide expressed by a Aspergillus host cell, such as an Aspergillus luchuensis.
- the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:1 or SEQ ID NO:3; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1 , SEQ ID NO:3, SEQ ID NO:44, SEQ ID NO:46, or SEQ ID NO:48;.
- the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:1 ; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1.
- the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:3; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:3.
- the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:44; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:44.
- the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:46; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:46.
- the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:48; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:48.
- the signal peptide is obtained from a glycosidase expressed by an Aspergillus species selected from the group consisting of Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus luchuensis, or Aspergillus oryzae.
- an Aspergillus species selected from the group consisting of Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus luchuensis, or Aspergillus oryzae.
- the e signal peptide is obtained from a glycosidase expressed by Aspergillus luchuensis.
- the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:2.
- the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:2.
- the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:4.
- the wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:4.
- the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:45.
- the wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:45.
- the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:47.
- the wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:47.
- the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:49.
- the wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:49.
- the signal peptide consists of the amino acid sequence of SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49 with or without its C-terminal alanine, or a peptide fragment thereof that retains the ability to direct the polypeptide into or across a cell membrane.
- the N- and/or C-terminal end of the signal peptide has been extended by addition of one or more amino acids.
- the polynucleotide encoding the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:5; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:5.
- the alpha-lactalbumin polypeptide is a bovine alpha-lactalbumin polypeptide.
- the alpha-lactalbumin polypeptide is a human alpha-lactalbumin polypeptide.
- the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO:6.
- the alpha-lactalbumin polypeptide comprises, consists essentially of, or consists of the mature polypeptide of SEQ ID NO:6.
- the N- and/or C-terminal end of the alpha-lactalbumin polypeptide has been extended by addition of one or more amino acids.
- the polynucleotide encoding the polypeptide having phytase activity has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:11 ; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1 1.
- the polypeptide having phytase activity is a bacterial polypeptide or variant thereof.
- polypeptide having phytase activity is EC 3.1.3.26.
- the polypeptide having phytase activity has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO:12.
- the having phytase activity comprises, consists essentially of, or consists of the mature polypeptide of SEQ ID NO:12.
- the N- and/or C-terminal end of the polypeptide having phytase activity has been extended by addition of one or more amino acids. It is expected that the invention will be just as effective when employing a signal peptide that is highly similar to the signal peptide disclosed in SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, and SEQ ID NO: 49, encoded by SEQ ID NO:1 ,SEQ ID NO:3, SEQ ID NO:44, SEQ ID NO: 46, and SEQ ID NO:48, respectively.
- One or more non-essential amino acids may, for example, be altered.
- Non- essential amino acids in a signal peptide can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, 1989, Science 244: 1081 -1085). In the latter technique, single alanine mutations are introduced at every residue in the molecule, and the resultant molecules are tested for signal peptide activity to identify amino acid residues that are critical to the activity of the molecule and residues that are non-essential. See also, Hilton et al., 1996, J. Biol. Chem. 271 : 4699-4708. The identity of essential and non-essential amino acids can also be inferred from an alignment with one or more related signal peptide.
- Single or multiple amino acid substitutions, deletions, and/or insertions can be made and tested using known methods of mutagenesis, recombination, and/or shuffling, followed by a relevant screening procedure, such as those disclosed by Reidhaar-Olson and Sauer, 1988, Science 241 : 53-57; Bowie and Sauer, 1989, Proc. Natl. Acad. Sci. USA 86: 2152-2156; WO 95/17413; or WO 95/22625.
- Other methods that can be used include error-prone PCR, phage display (e.g. Lowman et al., 1991 , Biochemistry 30: 10832-10837; U.S. Patent No. 5,223,409; WO 92/06204), and region-directed mutagenesis (Derbyshire et al., 1986, Gene 46: 145; Ner ef a/., 1988, DNA 7: 127).
- Mutagenesis/shuffling methods can be combined with high-throughput, automated screening methods to detect activity of cloned, mutagenized polypeptides expressed by host cells (Ness et al., 1999, Nature Biotechnology 17: 893-896). Mutagenized DNA molecules that encode active polypeptides can be recovered from the host cells and rapidly sequenced using standard methods in the art. These methods allow the rapid determination of the importance of individual amino acid residues in a polypeptide.
- the signal peptide is a variant (/.e., functional variant) or fragment (/.e., functional fragment) of the signal peptide of SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49.
- the number of alterations in the signal peptide variant of the present invention is 1- 10, e.g., 1-5, such as 1 , 2, 3, 4, or 5 alterations. Alterations includes substitutions, insertions, and/or deletions at one or more (e.g. , several) positions compared to the parent.
- a substitution means replacement of the amino acid occupying a position with a different amino acid;
- a deletion means removal of the amino acid occupying a position; and an insertion means adding an amino acid adjacent to and immediately following the amino acid occupying a position.
- the signal peptide is a variant of the mature polypeptide of SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49comprising 1-10 alterations, e.g., 1-5, such as 1 , 2, 3, 4, or 5 alterations, compared to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49 respectively.
- the first and second polynucleotide are operably linked in translational fusion.
- the term “operably linked in translation fusion” means that the signal peptide encoded by the first polynucleotide and the polypeptide encoded by the second polynucleotide are encoded in frame and translated together as a single polypeptide.
- the signal peptide is removed to provide the mature phytase polypeptide or the mature ALAB polypeptide.
- the signal peptide is not removed, or only removed partly to provide the mature ALAB polypeptide orthe mature polypeptide having phytase activity and comprising at least a fragment of the signal peptide.
- the first and second polynucleotide may be manipulated in a variety of ways to provide for expression of a variant. Manipulation of the polynucleotide priorto its insertion into a nucleic acid construct or expression vector may be desirable or necessary depending on the construct or vector. The techniques for modifying polynucleotides utilizing recombinant DNA methods are well known in the art.
- nucleic acid constructs of the invention may be operably linked to one or more further control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
- the control sequence may be a promoter, a polynucleotide that is recognized by a host cell for expression of a polynucleotide encoding a polypeptide of the present invention.
- the promoter contains transcriptional control sequences that mediate the expression of the polypeptide.
- the promoter may be any polynucleotide that shows transcriptional activity in the host cell including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
- the nucleic acid construct further comprises a heterologous promoter, and wherein said promoter, the first polynucleotide, and the second polynucleotide are operably linked.
- the promoter is orientated upstream of the first polynucleotide.
- the promoter is a heterologous promoter.
- the promoter is a tandem promoter. More preferably, the promoter is a P3 promoter or a P3-based promoter.
- promoters for directing transcription of the polynucleotide of the present invention in a filamentous fungal host cell are promoters obtained from Aspergillus, Fusarium, Rhizomucor and Trichoderma cells, such as the promoters described in Mukherjee et al., 2013, “Trichoderma-. Biology and Applications”, and by Schmoll and Dattenbock, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
- the promoter is a promoter, such as a P3 promoter, operably linked to an mRNA stabilizer region.
- the mRNA stabilizer region is the cry 11 IA mRNA stabilizer region.
- control sequence may also be an mRNA stabilizer region downstream of a promoter and upstream of the coding sequence of a gene which increases expression of the gene.
- mRNA stabilizer regions are obtained from a Bacillus thuringiensis crylllA gene (WO 94/25612) and a Bacillus subtilis SP82 gene (Hue et al., 1995, Journal of Bacteriology 177: 3465-3471).
- mRNA stabilizer regions for fungal cells are described in Geisberg et al., 2014, Cell 156(4): 812-824, and in Morozov et al., 2006, Eukaryotic Cell 5(11): 1838-1846.
- the control sequence may also be a transcription terminator, which is recognized by a host cell to terminate transcription.
- the terminator is operably linked to the 3’-terminus of the polynucleotide encoding the polypeptide. Any terminator that is functional in the host cell may be used in the present invention.
- Preferred terminators for filamentous fungal host cells may be obtained from Aspergillus or Trichoderma species, such as obtained from the genes for Aspergillus niger glucoamylase, Trichoderma reesei beta-glucosidase, Trichoderma reesei cellobiohydrolase I, and Trichoderma reesei endoglucanase I, such as the terminators described in Mukherjee et al., 2013, “Trichoderma-. Biology and Applications”, and by Schmoll and Dattenbock, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
- Preferred terminators for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase.
- Other useful terminators for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488.
- the control sequence may also be a propeptide coding sequence that encodes a propeptide positioned at the N-terminus of a polypeptide.
- the resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases).
- a propolypeptide is generally inactive and can be converted to an active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide.
- the propeptide coding sequence may be obtained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), Myceliophthora thermophila laccase (WO 95/33836), Rhizomucor miehei aspartic proteinase, and Saccharomyces cerevisiae alpha-factor.
- the propeptide sequence is positioned next to the N-terminus of a polypeptide and the signal peptide sequence is positioned next to the N-terminus of the propeptide sequence.
- the polypeptide may comprise only a part of the signal peptide sequence and/or only a part of the propeptide sequence.
- the final or isolated polypeptide may comprise a mixture of mature polypeptides and polypeptides which comprise, either partly or in full length, a propeptide sequence and/or a signal peptide sequence.
- regulatory sequences that regulate expression of the polypeptide relative to the growth of the host cell.
- regulatory sequences are those that cause expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound.
- yeast the ADH2 system or GAL1 system may be used.
- filamentous fungi the Aspergillus niger glucoamylase promoter, Aspergillus oryzae TAKA alpha-amylase promoter, and Aspergillus oryzae glucoamylase promoter, Trichoderma reesei cellobiohydrolase I promoter, and Trichoderma reesei cellobiohydrolase II promoter may be used.
- regulatory sequences are those that allow for gene amplification.
- these regulatory sequences include the dihydrofolate reductase gene that is amplified in the presence of methotrexate, and the metallothionein genes that are amplified with heavy metals.
- the control sequence may also be a leader, a non-translated region of an mRNA that is important for translation by the host cell.
- the leader is operably linked to the 5’-terminus of the polynucleotide encoding the polypeptide. Any leader that is functional in the host cell may be used.
- Preferred leaders for filamentous fungal host cells may be obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.
- Suitable leaders for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP).
- ENO-1 Saccharomyces cerevisiae enolase
- Saccharomyces cerevisiae 3-phosphoglycerate kinase Saccharomyces cerevisiae alpha-factor
- Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase ADH2/GAP
- the control sequence may also be a transcription factor, a polynucleotide encoding a polynucleotide-specific DNA-binding polypeptide that controls the rate of the transcription of genetic information from DNA to mRNA by binding to a specific polynucleotide sequence.
- the transcription factor may function alone and/or together with one or more other polypeptides or transcription factors in a complex by promoting or blocking the recruitment of RNA polymerase.
- Transcription factors are characterized by comprising at least one DNA-binding domain which often attaches to a specific DNA sequence adjacent to the genetic elements which are regulated by the transcription factor.
- the transcription factor may regulate the expression of a protein of interest either directly, i.e.
- transcription factor for prokaryotic host cells are described in Seshasayee et al., Subcell Biochem 2011 ; 52:7-23, as well in Balleza et al., FEMS Microbiol Rev 2009, 33(1 ): 133-151 .
- the control sequence may also be a polyadenylation sequence, a sequence operably linked to the 3’-terminus of the polynucleotide which, when transcribed, is recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence that is functional in the host cell may be used.
- Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes for Aspergillus nidulans anthranilate synthase, Aspergillus niger glucoamylase, Aspergillus niger alphaglucosidase, Aspergillus oryzae TAKA amylase, and Fusarium oxysporum trypsin-like protease.
- the present invention also relates to recombinant expression vectors comprising a nucleic acid construct according to the first aspect.
- the expression vectors comprise a polynucleotide of the present invention, a promoter, and transcriptional and translational stop signals.
- the various nucleotide and control sequences may be joined together to produce a recombinant expression vector that may include one or more convenient restriction sites to allow for insertion or substitution of the polynucleotide encoding the polypeptide at such sites.
- the polynucleotide may be expressed by inserting the polynucleotide or a nucleic acid construct comprising the polynucleotide into an appropriate vector for expression.
- the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
- the recombinant expression vector may be any vector (e.g., a plasmid or virus) that can be conveniently subjected to recombinant DNA procedures and can bring about expression of the polynucleotide.
- the choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced.
- the vector may be a linear or closed circular plasmid.
- the vector may be an autonomously replicating vector, i.e., a vector that exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a mini-chromosome, or an artificial chromosome.
- the vector may contain any means for assuring self-replication.
- the vector may be one that, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated.
- a single vector or plasmid or two or more vectors or plasmids that together contain the total DNA to be introduced into the genome of the host cell, or a transposon may be used.
- the vector preferably contains one or more selectable markers that permit easy selection of transformed, transfected, transduced, or the like cells.
- a selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
- the vector preferably contains an element(s) that permits integration of the vector into the host cell's genome or autonomous replication of the vector in the cell independent of the genome.
- the vector may rely on the polynucleotide’s sequence encoding the polypeptide or any other element of the vector for integration into the genome by homologous recombination, such as homology-directed repair (HDR), or non-homologous recombination, such as non- homologous end-joining (NHEJ).
- homologous recombination such as homology-directed repair (HDR), or non-homologous recombination, such as non- homologous end-joining (NHEJ).
- the vector may further comprise an origin of replication enabling the vector to replicate autonomously in the host cell in question.
- the origin of replication may be any plasmid replicator mediating autonomous replication that functions in a cell.
- the term “origin of replication” or “plasmid replicator” means a polynucleotide that enables a plasmid or vector to replicate in vivo.
- More than one copy of a polynucleotide of the present invention may be inserted into a host cell to increase production of a polypeptide. For example, 2 or 3 or 4 or 5 or more copies are inserted into a host cell.
- An increase in the copy number of the polynucleotide can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker gene with the polynucleotide where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the polynucleotide, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
- the invention relates to fungal host cells comprising in its genome: a) a nucleic acid construct according to the first aspect; and/or b) an expression vector according to the second aspect.
- a construct or vector comprising a polynucleotide is introduced into a host cell so that the construct or vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier.
- the choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source.
- the polypeptide encoded by the introduced polynucleotide can be native or heterologous to the recombinant host cell.
- at least one of the one or more control sequences can be heterologous to the polynucleotide encoding the polypeptide.
- the recombinant host cell may comprise a single copy, or at least two copies, e.g. three, four, five or more copies of the polynucleotide of the present invention.
- the host cell comprises two or more copies of the nucleic acid construct and/or the expression vector.
- the host cell may be a fungal cell.
- “Fungi” as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota as well as the Oomycota and all mitosporic fungi (as defined by Hawksworth et al., In, Ainsworth and Bisby’s Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK).
- Fungal cells may be transformed by a process involving protoplast-mediated transformation, Agrobacterium-mediated transformation, electroporation, biolistic method and shock-wave-mediated transformation as reviewed by Li et al., 2017, Microbial Cell Factories 16: 168 and procedures described in EP 238023, Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81 : 1470-1474, Christensen et al., 1988, Bio/Technology 6: 1419-1422, and Lubertozzi and Keasling, 2009, Biotechn. Advances 27 : 53-75.
- any method known in the art for introducing DNA into a fungal host cell can be used, and the DNA can be introduced as linearized or as circular polynucleotide.
- the fungal host cell may be a yeast cell.
- yeast as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). For purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, Passmore, and Davenport, editors, Soc. App. Bacteriol. Symposium Series No. 9, 1980).
- the yeast host cell may be a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
- the yeast host cell is a Pichia or Komagataella cell, e.g., a Pichia pastoris cell ( Komagataella phaffii).
- the fungal host cell may be a filamentous fungal cell.
- “Filamentous fungi” include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra).
- the filamentous fungi are generally characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides.
- Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic.
- vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
- the filamentous fungal host cell may be an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell.
- the filamentous fungal host cell is an Aspergillus, Trichoderma or Fusarium cell. In a further preferred embodiment, the filamentous fungal host cell is an Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, or Fusarium venenatum cell.
- the filamentous fungal host cell may be an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zona
- the host cell is an Aspergillus cell.
- the host cell is an Aspergillus niger cell.
- the host cell is an Aspergillus oryzae cell.
- the host cell is isolated.
- the host cell comprises at least two copies of the nucleic acid construct and/or the expression vector, such as two copies, three copies, four copies or more than four copies.
- the host cell is purified.
- the present invention also relates methods of producing an alpha-lactalbumin (ALAB) polypeptide, the method comprising: a) cultivating a host cell according to the third aspect under conditions conducive for production of the ALAB polypeptide; and optionally b) recovering the ALAB polypeptide.
- ALAB alpha-lactalbumin
- the present invention also relates to methods of producing a polypeptide having phytase activity, the method comprising: a) cultivating a host cell according to the third aspect under conditions conducive for production of the polypeptide having phytase activity; and optionally b) recovering the polypeptide having phytase activity.
- the host cell is cultivated in a nutrient medium suitable for production of the polypeptide using methods known in the art.
- the cell may be cultivated by shake flask cultivation, or small-scale or large- scale fermentation (including continuous, batch, fed-batch, or solid-state, and/or microcarrier-based fermentations) in laboratory or industrial fermentors in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated.
- suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
- the polypeptide may be detected using methods known in the art that are specific for the polypeptide, including, but not limited to, the use of specific antibodies, formation of an enzyme product, disappearance of an enzyme substrate, or an assay determining the relative or specific activity of the polypeptide.
- the polypeptide may be recovered from the medium using methods known in the art, including, but not limited to, collection, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation.
- a whole fermentation broth comprising the polypeptide is recovered.
- a cell- free fermentation broth comprising the polypeptide is recovered.
- polypeptide may be purified by a variety of procedures known in the art to obtain substantially pure polypeptides and/or polypeptide fragments (see, e.g., Wingfield, 2015, Current Protocols in Protein Science; 80(1): 6.1.1-6.1.35; Labrou, 2014, Protein Downstream Processing, 1129: 3-10).
- the polypeptide having phytase activity is not recovered.
- the polypeptide having phytase activity is not recovered, but rather a host cell of the present invention expressing the polypeptide having phytase activity is used as a source of the variant.
- the invention in a sixth aspect, relates to a fusion polypeptide, comprising a) a signal peptide having at least 60% sequence identity to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49, and b) an alpha-lactalbumin polypeptide having at least 60% sequence identity to SEQ ID NO: 6, or a polypeptide having phytase activity having at least 60% sequence identity to SEQ ID NO:8.
- the signal peptide is located upstream of the alpha-lactalbumin polypeptide or upstream of the polypeptide having phytase activity.
- the signal peptide is located at the N-terminal end of the alpha-lactalbumin polypeptide or at the N-terminal end of the polypeptide having phytase activity.
- the alpha-lactalbumin polypeptide is selected from the group consisting of:
- polypeptide having phytase activity is selected from the group consisting of:
- the present invention also relates to a fermentation broth formulation or a cell composition comprising a polypeptide having phytase activity.
- the fermentation broth product further comprises additional ingredients used in the fermentation process, such as, for example, cells (including, the host cells containing the nucleic acid constructs of the present invention which are used to produce the polypeptide having phytase activity), cell debris, biomass, fermentation media and/or fermentation products.
- the composition is a cell-killed whole broth containing organic acid(s), killed cells and/or cell debris, and culture medium.
- fermentation broth refers to a preparation produced by cellular fermentation that undergoes no or minimal recovery and/or purification.
- fermentation broths are produced when microbial cultures are grown to saturation, incubated under carbon-limiting conditions to allow protein synthesis (e.g., expression of enzymes by host cells) and secretion into cell culture medium.
- the fermentation broth can contain unfractionated or fractionated contents of the fermentation materials derived at the end of the fermentation.
- the fermentation broth is unfractionated and comprises the spent culture medium and cell debris present afterthe microbial cells (e.g., filamentous fungal cells) are removed, e.g., by centrifugation.
- the fermentation broth contains spent cell culture medium, extracellular enzymes, and viable and/or nonviable microbial cells.
- the fermentation broth formulation or the cell composition comprises a first organic acid component comprising at least one 1 -5 carbon organic acid and/or a salt thereof and a second organic acid component comprising at least one 6 or more carbon organic acid and/or a salt thereof.
- the first organic acid component is acetic acid, formic acid, propionic acid, a salt thereof, or a mixture of two or more of the foregoing and the second organic acid component is benzoic acid, cyclohexanecarboxylic acid, 4-methylvaleric acid, phenylacetic acid, a salt thereof, or a mixture of two or more of the foregoing.
- the composition contains an organic acid(s), and optionally further contains killed cells and/or cell debris.
- the killed cells and/or cell debris are removed from a cell- killed whole broth to provide a composition that is free of these components.
- the fermentation broth formulation or cell composition may further comprise a preservative and/or anti-microbial (e.g., bacteriostatic) agent, including, but not limited to, sorbitol, sodium chloride, potassium sorbate, and others known in the art.
- a preservative and/or anti-microbial agent including, but not limited to, sorbitol, sodium chloride, potassium sorbate, and others known in the art.
- the cell-killed whole broth or cell composition may contain the unfractionated contents of the fermentation materials derived at the end of the fermentation.
- the cell-killed whole broth or cell composition contains the spent culture medium and cell debris present after the microbial cells (e.g., filamentous fungal cells) are grown to saturation, incubated under carbon-limiting conditions to allow protein synthesis.
- the cell-killed whole broth or cell composition contains the spent cell culture medium, extracellular enzymes, and killed filamentous fungal cells.
- the microbial cells present in the cell-killed whole broth or composition can be permeabilized and/or lysed using methods known in the art.
- a whole broth or cell composition as described herein is typically a liquid, but may contain insoluble components, such as killed cells, cell debris, culture media components, and/or insoluble enzyme(s). In some embodiments, insoluble components may be removed to provide a clarified liquid composition.
- the whole broth formulations and cell compositions of the present invention may be produced by a method described in WO 90/15861 or WO 2010/096673.
- Example 1 Signal peptide library for expression of phytase
- the phytase expression vectors comprising signal peptide variants JSP002, 010, 011 & 031 were prepared as follows.
- the phytase expression plasmid is composed of phytase coding sequence (SEQ ID NO: 11) and the marker expression cassettes, and E. coli vector fragment.
- SEQ ID NO: 11 phytase coding sequence
- E. coli vector fragment E. coli vector fragment.
- the first fragment signal peptide fragment
- partial E. coli vector fragment and partial phytase expression cassette including promotor, signal peptide sequences and first 21 bases of 5’ end of mature phytase.
- Second fragment (mature fragment) contain partial phytase expression cassette including mature peptide sequence and terminator, marker expression cassette and the partial E. coli derived fragments.
- Two fragments have about 20 bp of overlapping sequences at their 5’ and 3’ ends and can be assembled by Gibson Assembly Kit. Obtained plasmid will harbor different signal peptides for phytase expression. For the amplification of mature fragment, overlap PCR was performed to amplify a fragment coding for each signal fragment and phytase mature fragment. It was inserted between the promoter and terminator of A. niger expression plasmid. The plasmid DNAs prepared introduced into A. niger hosts strains (host: C3085 and/or C5553). All strains comprise identical phytase gene copy number
- SP GH13 with SEQ ID NO:25 is derived from A. niger endo-1 ,4-alpha amylase GH13.
- SP cutinase with SEQ ID NO:27 is derived from Humicola insolens cutinase.
- SP GH16 with SEQ ID NO:4 is derived from A. luchuensis endo-b-1 ,3-glucanase GH16 (SEQ ID NO:23).
- SP GH26 with SEQ ID NO:2 is derived from A. luchuensis endo-1 ,4-beta-mannanase GH26 (SEQ ID NO:21).
- Transformants constructed as in example 1 were fermented in 96-well multi-titer-plate (MTP) containing % YPG Ac, and 1% SBP at 30C for 3 days.
- the yield is detected by measuring phytase activity in culture supernatants using artificial substrate (described under section “pNP assay”).
- pNP assay As can be seen from Table 3, increased phytase activities were measured using SP GH26 and SP GH16, compared to phytase activities using SP cutinase.
- SP GH26 showed 187% yield increase compared to SP cutinase
- SP GH16 showed 12% yield increase compared to SP cutinase.
- GH16 signal peptide was compared to SP GH13 by lab-tank fermented under the current standard conditions.
- the strains have genetically the same background, but integrated gene copy number of phytase signal variants are different as indicated in Table 4. Results are shown in Table 4, where JSP010 signal peptide GH16 showed significantly higher yield compared to SP GH13, i.e. 2.3-fold increase. The yield is measured by phytase activity using phytase as substrate (FYT(B) assay).
- Signal peptide variants (shown in table 5) for expression of bovine alpha-lactalbumin (ALAB, SEQ ID NO:6) was constructed by 2 transformation steps.
- the first transformation (host C2552) was done to integrate an expression cassette of bovine alpha-lactalbumin, but an intron of an amylase containing PAM sequence is inserted in place of signal sequence.
- second transformation was performed.
- the target signal sequence fragments were amplified by PCR and integrated in between the promoter and mature sequence of bovine alpha-lactalbumin by CRISPR system. Obtained strains were bovine alpha-lactalbumin expressing strains under six different signal peptides.
- SP GH72 with SEQ ID NO:29 is derived from A. niger beta-1 ,3-glucanosyltransferase GH72 with GPI-anchor.
- SP GH16 with SEQ ID NO:4 is derived from A. luchuensis endo-b-1 ,3-glucanase GH16 (SEQ ID NO:23).
- SP GH26 with SEQ ID NO:2 is derived from A. luchuensis endo-1 ,4-beta-mannanase GH26 (SEQ ID NO:21).
- SP pepesin A with SEQ ID NO:47.
- Example 5 Increased yield of alpha-lactalbumin using SP GH26
- Transformants JSP002 and JSP031 constructed as in example 4 were fermented by MTP or by buffled shake flasks (SF). The culture broth was centrifuged (3,500 x g, 15 min) and the supernatant was used for yield evaluation by MALDI-TOF MS semi-quantification method.
- the expression plasmids were prepared by overlap PCR of signal and mature alphalactalbumin (ALAB) fragments. Obtained strains were fermented in lab-tanks under the current standard conditions and yields were compared by band intensities on SDS-PAGE.
- Signal peptide variants for expression of bovine ALAB was constructed as described in Example 4.
- the culture broth was centrifuged (3,500 x g, 15 min) and the supernatant was used for yield evaluation by SDS-PAGE.
- Culture supernatant was deglycosylated with EndoH and loaded on to SDS-PAGE gel.
- Intensity of bands corresponding to ALAB were compared to the intensity of ALAB fused to the cutinase SP (JSP004) used as control.
- the ALAB yield of the different SP variants relative to the ALAB yield using the cutinase SP is shown in table 8. All of the tested SP variants show increased ALAB yield relative to the SP cutinase. As shown in table 8, JSP017 and JSP019 resulted in an 342% and 472% ALAB yield, respectively.
- Aspergillus niger strains expressing ALAB fused to different signal peptides were cultivated in microbioreactors, each with a cultivation volume of 250 ml. The strains were constructed as described in Example 4, and grown in duplicates. Average ALAB yield was measured by size exclusion chromatography and is shown in Table 9, normalized to the JSP002 GH 26 SP with SEQ ID NO:2. As shown in table 9, ALAB yield is increased by 53 % for the JSP017 signal peptide, relative to the JSP002 signal peptide.
- E.coli DH5a (Toyobo) is used for plasmid construction and amplification. Amplified plasmids are recovered with Qiagen Plasmid Kit (Qiagen). Ligation is done with either Rapid DNA Dephos & Ligation Kit (Roche) or Gibson assembly kit (NEB) according to the manufactory instructions. Polymerase Chain Reaction (PCR) is carried out with KOD-Plus system (TOYOBO) or PrimeSTAR MAX DNA Polymerase (Takara). QIAquickTM Gel Extraction Kit (Qiagen) is used for the purification of PCR fragments and extraction of DNA fragment from agarose gel.
- Qiagen Plasmid Kit Qiagen Plasmid Kit
- Ligation is done with either Rapid DNA Dephos & Ligation Kit (Roche) or Gibson assembly kit (NEB) according to the manufactory instructions.
- Polymerase Chain Reaction (PCR) is carried out with KOD-Plus system (TOYOBO) or PrimeSTAR MA
- Enzymes for DNA manipulations e.g. restriction endonucleases, ligases etc. are obtainable from New England Biolabs, Inc. and were used according to the manufacturer’s instructions.
- SEQ ID NO:11 coding sequence
- SEQ ID NO:12 amino acid sequence
- the expression host strain Aspergillus niger C2552 was isolated by Novozymes and is a derivative of Aspergillus niger NN049184 which was isolated from soil described in example 14 in WQ2012/160093.
- C3085, C5553, C6242 are strains which can produce the glucoamylase (1 ,4-alpha-D-glucan glucohydrolase, EC 3.2.1 .3) from Gloeophyllum sepiarium (Gs AMG).
- COVE trace metals solution was composed of 0.04 g of NaB4O7 «10H2Q, 0.4 g of CuSO4 «5H2O, 1 .2 g of FeSO4 «7H2O, 0.7 g of MnSO4 «H2O, 0.8 g of Na2MoQ2 «2H20, 10 g of ZnSO4 «7H2O, and deionized water to 1 liter.
- 50X COVE salts solution was composed of 26 g of KCI, 26 g of MgSO4 «7H2O, 76 g of KH2PO4, 50 ml of COVE trace metals solution, and deionized water to 1 liter.
- COVE medium was composed of 342.3 g of sucrose, 20 ml of 50X COVE salts solution, 10 ml of 1 M acetamide, 10 ml of 1 .5 M CsCI2, 25 g of Noble agar, and deionized water to 1 liter.
- COVEII plus 5-Fluorocytosine top agarose was composed of 34 g of sucrose, 20 ml of 50X COVE salts solution, 10 ml of 1 M acetamide, 2 ml of 5 g/L 5-Fluorocytosine, 10 g of low melt agarose, and deionized water to 1 liter.
- COVE-N-GlyX plates were composed of 218 g of xylitol, 10 g of glycerol, 2.02 g of KNO3, 50 ml of COVE salts solution, 25 g of Noble agar, and deionized water to 1 liter.
- STC buffer was composed of 0.8 M sorbitol, 25 mM Tris pH 8, and 25 mM CaCI2.
- STPC buffer was composed of 40% PEG 4000 in STC buffer.
- LB medium was composed of 10 g of tryptone, 5 g of yeast extract, 5 g of sodium chloride, and deionized water to 1 liter.
- LB plus ampicillin plates were composed of 10 g of tryptone, 5 g of yeast extract, 5 g of sodium chloride, 15 g of Bacto agar, ampicillin at 100 pg per ml, and deionized water to 1 liter.
- YPG medium was composed of 10 g of yeast extract, 20 g of Bacto peptone, 20 g of glucose, and deionized water to 1 liter.
- SOC medium was composed of 20 g of tryptone, 5 g of yeast extract, 0.5 g of NaCI, 10 ml of 250 mM KCI, and deionized water to 1 liter.
- TAE buffer was composed of 4.84 g of Tris Base, 1.14 ml of Glacial acetic acid, 2 ml of 0.5 M EDTA pH 8.0, and deionized water to 1 liter.
- 1/4YPG Ac,1 % SBP was composed of 5.0 g/L glucose, 2.5 g/L yeast extract, 5 g/L peptone, 10g/L soy bean powder, 5ml/L 2M sodium acetate buffer, pH4.5
- MSG is composed of 72 g Glycerol, 92 g Soybean powder (pH 6.0), water to 1 litre.
- MU-1 glu is composed 260 g of glucose, 3 g of MgSO4 7H2O, 5 g of KH2PO4, 6 g of K2SO4, amyloglycosidase trace metal solution (pH 4.5), water to 1 liter.
- Amyloglycosidase trace metal solution is 13.9 g/L of FeSO4 7H2O, 13.56 g/L of MnSO4 5H2O, 6.8 g/L of ZnCI2, 2.5 g/L of CuSO4-5H2O, 0.24 g/L of NiCI2-6H2O, 3 g/L of Citric acid H2O.
- Transformation of Aspergillus species can be achieved using the general methods for yeast transformation.
- Aspergillus niger host strain was inoculated to 100 ml of YPG medium supplemented with 10 mM uridine and incubated for 16 h at 32°C at 80 rpm. Pellets were collected and washed with 0.6 M KCI, and resuspended 20 ml 0.6 M KCI containing a commercial beta-glucanase product (GLUCANEXTM, Novozymes A/S, Bagsvaerd, Denmark) at a final concentration of 20 mg per ml. The suspension was incubated at 32°C at 80 rpm until protoplasts were formed, and then washed twice with STC buffer.
- GLUCANEXTM commercial beta-glucanase product
- the protoplasts were counted with a hematometer and resuspended and adjusted in an 8:2:0.1 solution of STC:STPC:DMSO to a final concentration of 2.5x107 protoplasts/ml. Approximately 4 pg of plasmid DNA was added to 100 pl of the protoplast suspension, mixed gently, and incubated on ice for 30 minutes. One ml of SPTC was added and the protoplast suspension was incubated for 20 minutes at 37°C. After the addition of 10 ml of 50°C Cove top agarose, the reaction was poured onto Cove agar plates and the plates were incubated at 30°C for 3 days. COVEII plus 5-Fluorocytosine top agarose was overlayed to the transformation plates for counter-selection of the strains.
- PCR Polymerase Chain Reaction
- KOD plus reaction mix is shown in table 10.
- KOD plus PCR cycle is shown in table 11 .
- PrimeSTAR reaction mix is shown in table 12, and corresponding PCR cycle is shown in table 13.
- Spores of the selected transformants were inoculated in 0.45 mL of 1/4YPG Ac,1 % SBP in 96 well MTP and cultivated at 30°C for 3 days at 900 rpm.
- Spores are inoculated to 100 ml of MSG media in buffled flask and fermented on a rotary shaking table at 220 rpm, 30°C.
- Ten-milli liters of cultured broth was transferred tol OOml MU1 -glu with 4ml 50% urea in 500 ml baffled flasks and further fermented for 3 to 5 days.
- Fermentation was done as fed-batch fermentation (H. Pedersen 2000, Appl Microbiol Biotechnol, 53: 272-277). Selected strains were pre-cultured in liquid media then grown mycelia were transferred to the tanks for further cultivation of protein production. Cultivation was done at pH 4 to 7, 30 to 34°C, for 6 ⁇ 8 days with the feeding of glucose and ammonium without over-dosing. Culture supernatant after centrifugation was used for yield evaluation. pNP assay
- Phytase activity was measured as FYT(B) (Phytase (Braakii) Units), relative to an enzyme standard of a declared strength.
- the samples and standard phytase reacts with sodium phytase (phytic acid dodeca sodium salt C e H e O24P6Nai2) and releases inorganic phosphate.
- Catalytic reaction parameters and conditions are shown in table 14. This phosphate is determined spectrophometically from a yellow complex formed by an acidic complex reagent containing molybdate/vanadate. The yellow complex is measured spectrophotometrically at a wavelength of 405 nm. The rate of phosphate release can be observed by Konelab (Thermo Fisher Scientific). Colorimetric reaction parameters and conditions are shown in Table 15. Table 16 shows reaction buffers and reagent compositions.
- the activity of the enzyme samples is determined relative to the standard curve.
- V Volume of the measuring flask used in mL
- Diluted culture supernatant was mixed with an internal standard protein (e.g EndoH) and MS spectrum was taken.
- the ratio of signal intensities of target (bovine alpha-lactalbumin) and reference (internal standard) was recorded as rough expression level of target.
- Standard curve was prepared using series of diluted purified bovine alpha-lactalbumin.
- a nucleic acid construct comprising: a first polynucleotide encoding a signal peptide having a sequence identity of at least 80% to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49; and a) a second polynucleotide encoding an alpha-lactalbumin (ALAB) polypeptide having a sequence identity of at least 70% to the polypeptide sequence of SEQ ID NO:6; or b) a second polynucleotide encoding a polypeptide having phytase activity; wherein the first polynucleotide and the second polynucleotide are operably linked in translational fusion.
- ALAB alpha-lactalbumin
- nucleic acid construct according to any of embodiment 1 or 2, wherein the signal peptide is a naturally occurring signal peptide, or a functional fragment or functional variant of a naturally occurring signal peptide.
- nucleic acid construct according to any of embodiments 1 to 3, wherein the signal peptide is from a filamentous fungal glycosidase.
- nucleic acid construct according to any of embodiments 1 to 4, further comprising a third polynucleotide downstream of the first polynucleotide and upstream of the second polynucleotide.
- nucleic acid construct according to any previous embodiment, wherein the third polynucleotide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:17 (gtaagtaacatccactctgttctagtgccatgctgagattgtacag).
- nucleic acid construct comprising a polynucleotide sequence with a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:18 or SEQ ID NO:19.
- nucleic acid construct according to any previous embodiment, wherein the nucleic acid construct further comprises a heterologous promoter, and wherein said promoter, the first polynucleotide, the second polynucleotide, and optionally the third polynucleotide, are operably linked.
- the promoter is a P3 promoter or a P3-based promoter
- the heterologous promoter is a tandem promoter comprising the P3 promoter or is a tandem promoter derived from the P3 promoter.
- nucleic acid construct according to any previous embodiment, wherein the promoter is operably linked to an mRNA stabilizer region; preferably the mRNA stabilizer region is the crylllA mRNA stabilizer region.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is a naturally occurring signal peptide, or a functional fragment or functional variant of a naturally occurring signal peptide.
- the signal peptide is obtained from a mannanase polypeptide (EC 3.2.1.78).
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a b-transglycosidase polypeptide (EC 2.4.1.-).
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a chitin b-1 ,3/1 ,6-glucanosyltransferase polypeptide (EC 2.4.1 .-) polypeptide.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from an endo-b-1 ,3-glucanase polypeptide or laminarinase polypeptide (EC 3.2.1 .39).
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a polypeptide, such as a mannanase, transglycosidase, glycosyltransferase, laminarinase, or glucanase, expressed by a filamentous fungal host cell.
- a polypeptide such as a mannanase, transglycosidase, glycosyltransferase, laminarinase, or glucanase
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a polypeptide expressed by a Aspergillus host cell, such as an Aspergillus luchuensis.
- nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:1 ,SEQ ID NO:3, SEQ ID NO:44, SEQ ID NO:46, or SEQ ID NO:48; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1 ,SEQ ID NO:3, SEQ ID NO:44, SEQ ID NO:46, or SEQ ID NO:48.
- nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:1 ; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1.
- nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:3; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:3.
- nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:44; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:44.
- nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:46; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:46.
- nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:48; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:48.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a glycosidase expressed by an Aspergillus species selected from the group consisting of Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus luchuensis, or Aspergillus oryzae.
- Aspergillus species selected from the group consisting of Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus luchuensis, or Aspergillus oryzae.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a glycosidase expressed by Aspergillus luchuensis.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:2.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:2.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:45.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:45.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:47.
- signal peptide comprises, consists essentially of, or consists of SEQ ID NO:47.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:49.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:49.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:4.
- nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:4.
- nucleic acid construct according to any preceding embodiments, wherein the signal peptide consists of the amino acid sequence of SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49 with or without its C-terminal alanine, or a peptide fragment thereof that retains the ability to direct the polypeptide into or across a cell membrane.
- nucleic acid construct according to any of the preceding embodiments, wherein the N- and/or C- terminal end of the signal peptide has been extended by addition of one or more amino acids.
- nucleic acid construct according to any of embodiments 1 to 25, wherein the signal peptide is a fragment of the signal peptides of any of embodiments 1 to 25.
- polynucleotide encoding the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:5; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:5.
- nucleic acid construct according to any of the preceding embodiments, wherein the alphalactalbumin polypeptide is a bovine alpha-lactalbumin.
- nucleic acid construct according to any of the preceding embodiments, wherein the alphalactalbumin is a human alpha-lactalbumin.
- nucleic acid construct according to any preceding embodiments, wherein the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO:6.
- the alpha-lactalbumin polypeptide comprises, consists essentially of, or consists of the mature polypeptide of SEQ ID NO:6.
- nucleic acid construct according to any of the preceding embodiments, wherein the N- and/or C- terminal end of the alpha-lactalbumin polypeptide has been extended by addition of one or more amino acids.
- nucleic acid construct according to any of the preceding embodiments, wherein polynucleotide encoding the polypeptide having phytase activity has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO: 11 ; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:11 .
- nucleic acid construct according to any of the preceding embodiments, wherein the polypeptide having phytase activity is a bacterial polypeptide or variant thereof.
- nucleic acid construct according to any of the preceding embodiments, wherein the polypeptide having phytase activity is EC 3.1.3.26.
- nucleic acid construct according to any preceding embodiments, wherein the polypeptide having phytase activity has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO: 12.
- nucleic acid construct according to embodiment 36, wherein the polypeptide having phytase activity comprises, consists essentially of, or consists of the mature polypeptide of SEQ ID NO: 12.
- nucleic acid construct according to any of the preceding embodiments, wherein the N- and/or C- terminal end of the polypeptide having phytase activity has been extended by addition of one or more amino acids.
- nucleic acid construct according to any of embodiments 1 to 38, wherein the signal peptide is a fragment of the signal peptides of any of embodiments 1 to 38.
- a fungal host cell comprising in its genome: a) a nucleic acid construct according to any of embodiments 1 to 39; and/or b) an expression vector according to embodiment 40.
- the host cell of embodiment 41 wherein the fungal host cell is a filmentous fungal host cell, e.g., an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell, in particular, an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus
- a method of producing a alpha-lactalbuming (ALAB) polypeptide comprising: a) cultivating a host cell according to any of embodiments 41 to 45 under conditions conducive for production of the ALAB polypeptide; and optionally b) recovering the ALAB polypeptide.
- ALAB alpha-lactalbuming
- a method of producing a polypeptide having phytase activity comprising: a) cultivating a host cell according to any of embodiments 41 to 45 under conditions conducive for production of the polypeptide having phytase activity; and optionally b) recovering the polypeptide having phytase activity.
- a fusion polypeptide comprising a) a signal peptide having at least 60% sequence identity to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49, and b) an alpha-lactalbumin polypeptide having at least 60% sequence identity to SEQ ID NO: 6, or a polypeptide having phytase activity having at least 60% sequence identity to SEQ ID NO:8.
- fusion polypeptide according to any of embodiments 48 to 50, wherein the alpha-lactalbumin polypeptide is selected from the group consisting of:
- polypeptide having phytase activity is selected from the group consisting of:
Abstract
The present invention relates to nucleic acid constructs comprising a first polynucleotide encoding a signal peptide, e.g., from a fungal glycosidase, and a second polynucleotide encoding an alpha-lactalbumin (ALAB) polypeptide; expression vectors and host cells comprising said nucleic acid constructs; methods for producing ALAB polypeptides; and fusion proteins comprising an ALAB polypeptide and a signal peptide.
Description
FUNGAL SIGNAL PEPTIDES
Reference to a Sequence Listing
This application contains a Sequence Listing in computer readable form, which is incorporated herein by reference.
Background of the Invention
Field of the Invention
The present invention relates to nucleic acid constructs comprising a first polynucleotide encoding a signal peptide, e.g., from a fungal glycosidase, and a second polynucleotide encoding an alphalactalbumin (ALAB) polypeptide; expression vectors and host cells comprising said nucleic acid constructs; methods for producing ALAB polypeptides; and fusion proteins comprising an ALAB polypeptide and a signal peptide.
Description of the Related Art
Product development in industrial biotechnology includes a continuous challenge to increase recombinant protein yields at large scale to reduce costs. Two major approaches have been used for this purpose in the last decades. The first one is based on classical mutagenesis and screening. Here, the specific genetic modification is not predefined, and the main requirement is a screening assay that is sensitive to detect increments in yield. High-throughput screening enables large numbers of mutants to be screened in search forthe desired phenotype, i.e., higher recombinant protein yields. The second approach includes numerous strategies ranging from the use of stronger promoters and multi-copy strains to ensure high expression of the gene of interest to the use of codon-optimized gene sequences to aid translation. However, high-level production of a given protein may in turn trigger several bottlenecks in the cellular machinery for secretion of the enzyme of interest into the medium, emphasizing the need for further optimization strategies.
Signal peptides (SPs) are short amino acid sequences present in the amino terminus of many newly synthesized polypeptides that target these into or across cellular membranes, thereby aiding maturation and secretion. The amino acid sequence of the SP influences secretion efficiency and thereby the yield of the polypeptide manufacturing process. Bioinformatic tools such as SignalP and SignalP5 can predict SPs from amino acid sequences, but most cannot distinguish between various types of SPs (Armenteros et al., Nat. Biotechnol. 37: 420-423, 2019). Moreover, a large degree of redundancy in the amino acid sequence of SPs makes it difficult to predict the efficiency of any given SP for production of recombinant proteins at industrial scale. There are no tools to predict the efficiency with which a given SP directs the secretion of a target protein of interest (POI) (Owji et al., Eur. J. Cell Biol. 2018, 97, 422-441). In fact, finding an efficient SP to secrete a POI is currently still based on trial and error. It is established that the SP-POI match plays a crucial role in determining secretion efficiency (Peng, C. et al., Front. Bioeng. Biotechnol. 2019, 7, 139) whereas the underlying fundamental parameters remain unknown.
Hence, SP selection is an important step for manufacturing of recombinant proteins, but the optimal combination of signal peptide and mature protein is very context dependent and not easy to predict.
Alpha-lactalbumin (ALAB) is a principal protein of milk. ALAB forms the regulatory subunit of the lactose synthase (LS) heterodimer and beta 1 ,4-galactosyltransferase (beta4Gal-T1) forms the catalytic component. Together, these proteins enable LS to produce lactose by transfering galactose moieties to glucose. As a monomer, alpha-lactalbumin strongly binds calcium and zinc ions and may possess bactericidal or antitumor activity. A folding variant of alpha-lactalbumin, called HAMLET, likely induces apoptosis in tumor and immature cells.
Recombinant ALAB has the potential to improve the nutritional value in food, beverages, and feed. Recombinant ALAB expression has been reported previously in different organisms including transgenic goats (Yuan YG et al, J Anal Methods Chem. 2014; 2014:281031. doi: 10.1155/2014/281031). Recombinantly produced phytase is widely used as a feed supplement to effectively improve phosphorous utilization and reduce fecal phosphorous excretion in animals as poultry and pig. Thus, for both polypeptides ALAB and phytase, there is a growing need for access thereto in the respective industries.
Although there are expression systems available, there is a need for increasing yields during recombinant production. One major challenge is the fact that ALAB and phytase both have several disulfide bonds which makes correct expression, folding and secretion challenging. Thus, in order to satisfy the growing demand for recombinant ALAB and phytase it is necessary to provide recombinant expression systems with increased ALAB or phytase yields.
Summary of the Invention
The present invention is based on the surprising and inventive finding that expression of difficult- to-express proteins (alpha-lactalbumin and phytase) with a fungal signal peptide provides increased yield when expressed in fungal host cells.
Using the signal peptides of the invention an improved yield of the ALAB product compared to expression of the same ALAB product with other signal peptides was observed, e.g., using the JSP017 SP with SEQ ID NO: 45 the ALAB yield can be increased by 53% relative to JSP 002 (Table 9).
Using the signal peptides of the invention an improved yield of the phytase compared to expression of the same phytase with other signal peptides was observed, i.e., an 2.3-fold increase in phytase expression (Table 4).
Notably, the increased expression was achieved using several, different fermentation protocols at different scales, including microtiter plates (MTP), shake flasks (SF), and laboratory fermentation tanks.
In a first aspect, the present invention relates a nucleic acid construct comprising: a first polynucleotide encoding a signal peptide having a sequence identity of at least 80% to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49; and
a second polynucleotide encoding an alpha-lactalbumin (ALAB) polypeptide having a sequence identity of at least 70% to the polypeptide sequence of SEQ ID NO:6; wherein the first polynucleotide and the second polynucleotide are operably linked in translational fusion.
In a second aspect, the present invention relates to expression vectors comprising a nucleic acid construct of the first aspect.
In a third aspect, the present invention relates to a fungal host cell comprising in its genome: a) a nucleic acid construct according to the first aspect; and/or b) an expression vector according to the second aspect.
In a fourth aspect, the present invention relates to a method of producing a alpha-lactalbumin (ALAB) polypeptide, the method comprising: a) cultivating a host cell according to the third aspect under conditions conducive for production of the ALAB polypeptide; and optionally b) recovering the ALAB polypeptide.
In a fifth aspect, the present invention relates to a method of producing a polypeptide having phytase activity, the method comprising: a) cultivating a host cell according to the third aspect under conditions conducive for production of the polypeptide having phytase activity; and optionally b) recovering the polypeptide having phytase activity.
In a sixth aspect, the present invention relates to a fusion polypeptide, comprising: a signal peptide having at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, sequence to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49, and
(i) an alpha-lactalbumin polypeptide having at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, sequence identity to SEQ ID NO: 6, or
(ii) a polypeptide having phytase activity having at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, sequence identity to SEQ ID NO:8.
SEQUENCE OVERVIEW
SEQ ID NO:1 is the GH26 (JSP 002) signal peptide coding sequence
SEQ ID NO:2 is the GH26 (JSP 002) signal peptide (MFAKLSLLSLLFSSAALG).
SEQ ID NO:3 is the GH16 signal peptide coding sequence.
SEQ ID NO:4 is the GH16 signal peptide (MRLPLVSSTVALLSSASLVAA).
SEQ ID NO:5 is the ALAB coding sequence, (without SP)
SEQ ID NO:6 is the ALAB amino acid sequence, (without SP)
SEQ ID NO:7 is the ALAB coding sequence, (with SP GH26)
SEQ ID NO:8 is the ALAB amino acid sequence (with SP GH26)
SEQ ID NO:9 is the ALAB coding sequence, (with SP GH16)
SEQ ID NQ:10 is the ALAB amino acid sequence (with SP GH16)
SEQ ID NO:11 is the phytase coding sequence, (without SP)
SEQ ID NO: 12 is the phytase amino acid sequence, (without SP)
SEQ ID NO:13 is the phytase coding sequence, (with SP GH26)
SEQ ID NO: 14 is the phytase amino acid sequence (with SP GH26)
SEQ ID NO:15 is the phytase coding sequence, (with SP GH16)
SEQ ID NO:16 is the phytase amino acid sequence (with SP GH16)
SEQ ID NO:17 is the intron between SP GH16/ SP GH26 and the ALAB coding sequence
SEQ ID NO:18 coding sequence GH16 SP-intron-ALAB
SEQ ID NO:19 coding sequence GH26 SP-intron-ALAB
SEQ ID NO:20 is the GH26 polypeptide coding sequence from Aspergillus luchuensis
SEQ ID NO:21 is the GH26 polypeptide from A. luchuensis
SEQ ID NO:22 is the GH16 polypeptide coding sequence from Aspergillus luchuensis
SEQ ID NO:23 is the GH16 polypeptide from A. luchuensis
SEQ ID NO:24 is the reference GH13 signal peptide coding sequence
SEQ ID NO:25 is the reference GH13 signal peptide
SEQ ID NO:26 is the reference cutinase (JSP004) signal peptide coding sequence
SEQ ID NO:27 is the reference cutinase (JSP004) signal peptide
SEQ ID NO:28 is the reference GH72 signal peptide coding sequence
SEQ ID NO:29 is the reference GH72 signal peptide
SEQ ID NO:30 is the SP GH16-ALAB expression cassette (promoter-SPGH16-intron-ALAB- terminator)
SEQ ID NO: 31 is the SP GH26-ALAB expression cassette (promoter-SPGH26-intron-ALAB- terminator)
SEQ ID NO:32 is the SP GH72-ALAB expression cassette (promoter-SPGH72-intron-ALAB- terminator)
SEQ ID NO:33 is the SP GH16-phytase expression cassette (promoter-SPGH16-phytase- terminator)
SEQ ID NO: 34 is HA442 primer
SEQ ID NO: 35 is HA451 primer
SEQ ID NO: 36 is HA283 primer
SEQ ID NO: 37 is HA444 primer
SEQ ID NO: 38 is HA450 primer
SEQ ID NO: 39 is HA445 primer
SEQ ID NO: 40 is HA442 primer
SEQ ID NO: 41 is HA233 primer
SEQ ID NO: 42 is HA268 primer
SEQ ID NO: 43 is HA489 primer
SEQ ID NO: 44 is the LYA1_4 (JSP017, Aspergillus luchuensis) signal peptide coding sequence
SEQ ID NO: 45 is the LYA1_4 (JSP017, Aspergillus luchuensis) signal peptide (MKYAAALTAVAALAARAAA)
SEQ ID NO: 46 is the pepesin A (JSP019, Aspergillus niger) signal peptide coding sequence
SEQ ID NO: 47 is the pepesin A (JSP019, Aspergillus niger) signal peptide (MVVFSKTAALVLGLSSAVSA)
SEQ ID NO: 48 is the GH28_9 endo-1 ,4-alpha-polygalacturonase (JSP008, Aspergillus luchuensis) signal peptide coding sequence
SEQ ID NO: 49 is the GH28_9 endo-1 ,4-alpha-polygalacturonase (JSP008, Aspergillus luchuensis) signal peptide (MHFLQNAFVAATMGAALAAA)
Definitions
In accordance with this detailed description, the following definitions apply. Note that the singular forms "a," "an," and "the" include plural references unless the context clearly dictates otherwise.
Unless defined otherwise or clearly indicated by context, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
Alpha-lactalbumin: The term “alpha-lactalbumin”, “a-lactalbumin” “a-LA”, “a-LAB”, “ALAB”, or “LALBA” means a polypeptide which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer. ALAB forms the regulatory subunit of the lactose synthase (LS) heterodimer and beta 1 ,4- galactosyltransferase (beta4Gal-T1) forms the catalytic component. Together, these proteins enable LS to produce lactose by transfering galactose moieties to glucose. Non-limiting examples for ALAB polypeptides are bovine ALAB and human ALAB. ALAB quantification can be carried out as described under the Examples with section “Semi-quantification by MALDI-TOF MS” and “Size exclusion chromatography”. cDNA: The term "cDNA" means a DNA molecule that can be prepared by reverse transcription from a mature, spliced, mRNA molecule obtained from a eukaryotic or prokaryotic cell. cDNA lacks intron sequences that may be present in the corresponding genomic DNA. The initial, primary RNA transcript is a precursor to mRNA that is processed through a series of steps, including splicing, before appearing as mature spliced mRNA.
Coding sequence: The term “coding sequence” means a polynucleotide, which directly specifies the amino acid sequence of a polypeptide. The boundaries of the coding sequence are generally determined by an open reading frame, which begins with a start codon, such as ATG, GTG, or TTG, and ends with a stop codon, such as TAA, TAG, or TGA. The coding sequence may be a genomic DNA, cDNA, synthetic DNA, or a combination thereof.
Control sequences: The term “control sequences” means nucleic acid sequences involved in regulation of expression of a polynucleotide in a specific organism or in vitro. Each control sequence may be native (/.e., from the same gene) or heterologous (/.e., from a different gene) to the polynucleotide encoding the polypeptide, and native or heterologous to each other. Such control sequences include, but are not limited to leader, polyadenylation, prepropeptide, propeptide, signal peptide, promoter, terminator, enhancer, and transcription or translation initiator and terminator sequences. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the polynucleotide encoding a polypeptide.
Expression: The term “expression” means any step involved in the production of a polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
Expression vector: An "expression vector" refers to a linear or circular DNA construct comprising a DNA sequence encoding a polypeptide, which coding sequence is operably linked to a suitable control sequence capable of effecting expression of the DNA in a suitable host. Such control sequences may include a promoter to effect transcription, an optional operator sequence to control transcription, a sequence encoding suitable ribosome binding sites on the mRNA, enhancers and sequences which control termination of transcription and translation.
Extension: The term “extension” means an addition of one or more amino acids to the amino and/or carboxyl terminus of a polypeptide, wherein the “extended” polypeptide has phytase activity or wherein the extended polypeptide is an ALAB polypeptide which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer. Persons skilled in the art will know that a polypeptide having a given amino acid sequence and enzymatic activity may be produced with one or a few additional amino acids at
the N- and/or C-terminus, and that such a polypeptide can have essentially the same enzyme activity. Such extended polypeptides are intended to be encompassed by the present invention.
Fragment: The term “fragment” as used in the context of a polypeptide means a polypeptide having one or more amino acids absent from its amino and/or carboxyl terminus, wherein the fragment has phytase activity, or wherein the fragment is an ALAB fragment which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer. The fragment may be produced naturally during expression and/or purification of the polypeptide, or may be the result of expression of a modified nucleotide sequence expressing the fragment or of targeted removal of amino acids from the amino and/or carboxy terminus.
Fungal glycosidase: The term “fungal glycosidase” means any glycosidase (EC 3.2.1) that hydrolyses O- and S-glycosyl compounds and which is encoded with an N-terminal signal peptide and natively expressed by a fungal species. The term “fungal glycosidase” also includes, but is not limited to, fungal mannanases (EC 3.2.1.78) and fungal glucanases (EC 3.2.1.39). Other non-limiting examples are alpha-amylase (EC 3.2.1.1), beta-amylase (EC 3.2.1.2) and lysozyme (EC 3.2.1.17).
Fusion polypeptide: The term “fusion polypeptide” is a polypeptide in which one polypeptide is fused at the N-terminus and/or the C-terminus of a polypeptide of the present invention. A fusion polypeptide is produced by fusing a polynucleotide encoding another polypeptide to a polynucleotide of the present invention, or by fusing two or more polynucleotides of the present invention together. Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fusion polypeptide is under control of the same promoter(s) and terminator. Fusion polypeptides may also be constructed using intein technology in which fusion polypeptides are created post-translationally (Cooper et al., 1993, EMBO J. 12: 2575-2583; Dawson et al., 1994, Science 266: 776-779). A fusion polypeptide can further comprise a cleavage site between the two polypeptides. Upon secretion of the fusion protein, the site is cleaved releasing the two polypeptides. Examples of cleavage sites include, but are not limited to, the sites disclosed in Martin et al., 2003, J. Ind. Microbiol. Biotechnol. 3: 568-576; Svetina et al., 2000, J. Biotechnol. 76: 245-251 ; Rasmussen- Wilson et al., 1997, Appl. Environ. Microbiol. 63: 3488-3493; Ward et al., 1995, Biotechnology 13: 498-503; and Contreras etal., 1991 , Biotechnology 9: 378-381 ; Eaton etal., 1986, Biochemistry 25: 505-512; Collins- Racie et al., 1995, Biotechnology 13: 982-987; Carter et al., 1989, Proteins: Structure, Function, and Genetics 6: 240-248; and Stevens, 2003, Drug Discovery World 4: 35-48.
Glucanase : The term "glucanase” means a glucanase comprising a signal peptide at its N-terminal end, the glucanase having endo-b-1 ,3-glucanase activity (EC 3.2.1.39). A non limiting example for a glucanase is the Aspergillus luchuensis glucanase. The signal peptide sequence derived of the A. luchuensis glucanase is termed “GH16” or “GH16 SP” and is represented by SEQ ID NO:4, derived from the A. luchuensis glucanase polypeptide with SEQ ID NO: 23.
Heterologous: The term "heterologous" means, with respect to a host cell, that a polypeptide or nucleic acid does not naturally occur in the host cell. The term "heterologous" means, with respect to a polypeptide or nucleic acid, that a control sequence, e.g., promoter, of a polypeptide or nucleic acid is not naturally associated with the polypeptide or nucleic acid, i.e., the control sequence is from a gene other than the gene encoding the mature polypeptide.
Host Strain or Host Cell: A "host strain" or "host cell" is an organism into which an expression vector, phage, virus, or other DNA construct, including a polynucleotide encoding a polypeptide of interest (e.g., an amylase) has been introduced. Exemplary host strains are microorganism cells (e.g., bacteria, filamentous fungi, and yeast) capable of expressing the polypeptide of interest and/or fermenting saccharides. The term "host cell" includes protoplasts created from cells.
Introduced: The term "introduced" in the context of inserting a nucleic acid sequence into a cell, means "transfection", "transformation" or "transduction," as known in the art.
Isolated: The term “isolated” means a polypeptide, nucleic acid, cell, or other specified material or component that has been separated from at least one other material or component, including but not limited to, other proteins, nucleic acids, cells, etc. An isolated polypeptide, nucleic acid, cell or other material is thus in a form that does not occur in nature. An isolated polypeptide includes, but is not limited to, a culture broth containing the secreted polypeptide expressed in a host cell.
Mannanase : The term "mannanase” means a mannanase comprising a signal peptide at its N- terminal end, the mannanase having endo-1 ,4-beta-mannanase activity (EC 3.2.1.78). A non limiting example for a manannase is the Aspergillus luchuensis mannanase. The signal peptide sequence derived of the Aspergillus luchuensis mannanase is termed “GH26” or “GH26 SP” and is represented by SEQ ID NO:2, derived from the A. luchuensis mannanase polypeptide with SEQ ID NO: 21 .
Mature polypeptide: The term “mature polypeptide” means a polypeptide in its mature form following translation and any post-translational modifications such as N-terminal processing (e.g. removal of signal peptide), C-terminal truncation, glycosylation, phosphorylation, etc. It is known in the art that a host cell may produce a mixture of two of more different mature polypeptides (/.e., with a different C-terminal and/or N-terminal amino acid) expressed by the same polynucleotide. It is also known in the art that different host cells process polypeptides differently, and thus, one host cell expressing a polynucleotide may produce a different mature polypeptide (e.g. having a different C-terminal and/or N-terminal amino acid) as compared to another host cell expressing the same polynucleotide. Mature polypeptides of the invention may therefore have slight differences at the N- and/or C-terminal due to such differentiated expression by the host cell. A mature polypeptide having one or more amino acids absent from the N- and/or C-terminal may be considered to be a “fragment” of the full-length polypeptide.
In one aspect the mature polypeptide is amino acids 1 to 123 of SEQ ID NO:6. In some aspects, the mature polypeptide is amino acids 19 to 141 of SEQ ID NO:8 and amino acids 1 to 18 of SEQ ID NO:8 are a signal peptide.
In some aspects, the mature polypeptide is amino acids 22 to 144 of SEQ ID NQ:10 and amino acids 1 to 21 of SEQ ID NQ:10 are a signal peptide.
In one aspect the mature polypeptide is amino acids 1 to 411 of SEQ ID NO:12. In some aspects, the mature polypeptide is amino acids 19 to 429 of SEQ ID NO:14 and amino acids 1 to 18 of SEQ ID NO:14 are a signal peptide.
In some aspects, the mature polypeptide is amino acids 22 to 432 of SEQ ID NO:16 and amino acids 1 to 21 of SEQ ID NO:16 are a signal peptide.
Mature polypeptide coding sequence: The term “mature polypeptide coding sequence” means a polynucleotide that encodes a mature polypeptide having phytase activity, or an ALAB polypeptide which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer.
In one aspect the mature polypeptide coding sequence is nucleotides 1 to 369 of SEQ ID NO:5. In one aspect, the mature polypeptide coding sequence is nucleotides 55 to 423 of SEQ ID NO: 7 and nucleotides 1 to 54 of SEQ ID NO: 7 encode a signal peptide. In one aspect, the mature polypeptide coding sequence is nucleotides 64 to 432 of SEQ ID NO: 9 and nucleotides 1 to 63 of SEQ ID NO: 9 encode a signal peptide.
In one aspect, the mature polypeptide coding sequence is nucleotides 110 to 478 of SEQ ID NO:
18, and nucleotides 1 to 63 of SEQ ID NO:18 encode a signal peptide, and nucleotides 64 to 109 of SEQ ID NO:18 is an intron.
In one aspect, the mature polypeptide coding sequence is nucleotides 101 to 469 of SEQ ID NO:
19, and nucleotides 1 to 54 of SEQ ID NO: 19 encode a signal peptide, and nucleotides 55 to 100 of SEQ ID NO:19 is an intron. In one aspect the mature polypeptide coding sequence is nucleotides 1 to 1233 of SEQ ID NO:11. In one aspect, the mature polypeptide coding sequence is nucleotides 55 to 1287 of SEQ ID NO: 13 and nucleotides 1 to 54 of SEQ ID NO: 13 encode a signal. In one aspect, the mature polypeptide coding sequence is nucleotides 64 to 1296 of SEQ ID NO: 15 and nucleotides 1 to 63 of SEQ ID NO: 15 encode a signal peptide.
Native: The term "native" means a nucleic acid or polypeptide naturally occurring in a host cell.
Nucleic acid: The term "nucleic acid" encompasses DNA, RNA, heteroduplexes, and synthetic molecules capable of encoding a polypeptide. Nucleic acids may be single stranded or double stranded, and may be chemical modifications. The terms "nucleic acid" and "polynucleotide" are used interchangeably. Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and the present compositions and methods encompass nucleotide sequences that encode a particular amino acid sequence. Unless otherwise indicated, nucleic acid sequences are presented in 5'-to-3' orientation.
Nucleic acid construct: The term "nucleic acid construct" means a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or is modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature or which is synthetic, and which comprises one or more control sequences operably linked to the nucleic acid sequence.
Obtained polypeptide/peptide/polynucleotide: The term “obtained” or “derived” when used in reference to a polynucleotide sequence, ALAB sequence, polypeptide sequence, mannanase sequence, glucanase sequence, phytase sequence, variant sequence or signal peptide sequence, means that the molecule originally has been isolated from the given source and that the molecule can either be utilized in its native sequence or that the molecule is modified by methods known to the skilled person.
Operably linked: The term "operably linked" means that specified components are in a relationship (including but not limited to juxtaposition) permitting them to function in an intended manner. For example, a regulatory sequence is operably linked to a coding sequence such that expression of the coding sequence is under control of the regulatory sequence.
Parent: The term “parent” means a polypeptide functioning as a signal peptide, or a polypeptide having phytase activity, or an ALAB polypeptide which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer, to which an alteration is made to produce variants of the present invention. The parent may be a naturally occurring (wild-type) polypeptide or a variant or fragment thereof.
Phytase: In the present context, a preferred Phytase according to the invention is classified as belonging to the EC 3.1 .3.26 group. The EC numbers referto Enzyme Nomenclature 1992 from NC-IUBMB, Academic Press, San Diego, California, including supplements 1 -5 published in Eur. J. Biochem. 1994, 223, 1 -5; Eur. J. Biochem. 1995, 232, 1 -6; Eur. J. Biochem. 1996, 237, 1 -5; Eur. J. Biochem. 1997, 250, 1 -6; and Eur. J. Biochem. 1999, 264, 610-650; respectively. The nomenclature is regularly supplemented and updated; see e.g. the World Wide Web at http://www.chem.qmw.ac.uk/iubmb/enzyme/index.html. A non-limiting example for a phytase is shown in SEQ ID NO:12.
Phytase activity: For the purpose of the present invention, phytase activity is determined by the libertation of inorganic phosphate from Na-phytate solution, wherein one phytase activity unit is the amount of enzyme which liberates 1 pmol inorganic phosphate per min from a 0.0051 M Na-phytate solution in 0.25 M Na-acetate, pH 5.5 and at 37° C (Engelen, A. J., et al., 1994, "Simple and rapid determination of phytase activity", J. AOAC Int. 77:760-764). Examples of activity unit names are: FYT, FTU and U. Phytase activity may be determined using the assay as described in Example 1 ("Determination of phytase activity"). In one aspect, the polypeptides of the present invention have at least 20%, e.g., at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 100% of the phytase activity of SEQ ID NO:12. Detailed descriptions of how to detect phytase activity is given in the Examples under “pNP assay” and “FYT(B) assay”.
Specific activity is measured on highly purified samples (an SDS poly acryl amide gel should show the presence of only one component). The enzyme protein concentration may be determined by amino acid analysis, and the phytase activity in the units of FYT. Specific activity is a characteristic of the specific phytase variant in question, and it is calculated as the phytase activity measured in FYT units per mg phytase enzyme protein.
Recombinant: The term "recombinant" is used in its conventional meaning to refer to the manipulation, e.g., cutting and rejoining, of nucleic acid sequences to form constellations different from those found in nature. The term recombinant refers to a cell, nucleic acid, polypeptide or vector that has been modified from its native state. Thus, for example, recombinant cells express genes that are not found within the native (non-recombinant) form of the cell, or express native genes at different levels or under different conditions than found in nature. The term “recombinant” is synonymous with “genetically modified” and “transgenic”.
Sequence identity: The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter “sequence identity”.
For purposes of the present invention, the sequence identity between two amino acid sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably version 6.6.0 or later. The parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix. In order for the Needle program to report the longest identity, the -nobrief option must be specified in the command line. The output of Needle labeled “longest identity” is calculated as follows:
(Identical Residues x 100)/(Length of Alignment - Total Number of Gaps in Alignment)
For purposes of the present invention, the sequence identity between two polynucleotide sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, supra) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 6.6.0 or later. The parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NUC4.4) substitution matrix. In order for the Needle program to report the longest identity, the nobrief option must be specified in the command line. The output of Needle labeled “longest identity” is calculated as follows:
(Identical Deoxyribonucleotides x 100)/(Length of Alignment - Total Number of Gaps in Alignment)
Signal Peptide: A "signal peptide" is a sequence of amino acids attached to the N-terminal portion of a protein, which facilitates the secretion of the protein outside the cell. The mature form of an extracellular protein lacks the signal peptide, which is cleaved off during the secretion process.
Subsequence: The term “subsequence” means a polynucleotide having one or more nucleotides absent from the 5' and/or 3' end of a mature polypeptide coding sequence; wherein the subsequence encodes a fragment having phytase activity, or wherein the subsequence forms a functional regulatory subunit of the lactose synthase (LS) heterodimer.
Variant: The term “variant” means a polypeptide having phytase activity, or a polypeptide forming a functional regulatory subunit of the lactose synthase (LS) heterodimer, comprising a man-made mutation, i.e., a substitution, insertion (including extension), and/or deletion (e.g., truncation), at one or more positions. A substitution means replacement of the amino acid occupying a position with a different amino acid; a deletion means removal of the amino acid occupying a position; and an insertion means adding 1 -5 amino acids (e.g., 1-3 amino acids, in particular, 1 amino acid) adjacent to and immediately following the amino acid occupying a position.
Wild-type: The term "wild-type" in reference to an amino acid sequence or nucleic acid sequence means that the amino acid sequence or nucleic acid sequence is a native or naturally-occurring sequence. As used herein, the term "naturally-occurring" refers to anything (e.g., proteins, amino acids, or nucleic acid sequences) that is found in nature. Conversely, the term "non-naturally occurring" refers to anything that is not found in nature (e.g., recombinant nucleic acids and protein sequences produced in the laboratory or modification of the wild-type sequence).
Detailed Description of the Invention
The present invention is based on the surprising and inventive finding that expression of difficult- to-express proteins (alpha-lactalbumin and phytase) with signal peptides obtained from fungal polypeptides provides increased yield when expressed in fungal host cells.
Use of the signal peptides of the invention, an improved yield of both ALAB and phytase is achieved.
Notably, the increased expression was achieved using several, different fermentation protocols at different scales, including microtiter plates (MTP), shake flasks (SF), micro-bioreactors, and laboratory fermentation tanks.
Polynucleotides
The present invention also relates to polynucleotides encoding a polypeptide of the present invention, as described herein.
The polynucleotide may be a genomic DNA, a cDNA, a synthetic DNA, a synthetic RNA, a mRNA, or a combination thereof. The polynucleotide may be cloned from a strain of Aspergillus, or a related organism and thus, for example, may be a polynucleotide sequence encoding a variant of the polypeptide of the invention.
In an embodiment, the polynucleotide is a subsequence encoding a fragment having phytase activity or a fragment which forms a functional regulatory subunit of the lactose synthase (LS) heterodimer of the present invention.
In one embodiment the polynucleotide encoding the signal peptide of the present invention is isolated from an Aspergillus cell, such as an Aspergillus luchuensis cell.
The polynucleotide may also be mutated by introduction of nucleotide substitutions that do not result in a change in the amino acid sequence of the polypeptide, but which correspond to the codon usage of the host organism intended for production of the enzyme, or by introduction of nucleotide substitutions that may give rise to a different amino acid sequence. For a general description of nucleotide substitution, see, e.g., Ford et al., 1991 , Protein Expression and Purification 2: 95-107.
In an aspect, the polynucleotide is isolated.
In another aspect, the polynucleotide is purified.
Nucleic Acid Constructs
The present invention also relates to nucleic acid constructs comprising a polynucleotide of the present invention, wherein the polynucleotide is operably linked to one or more control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
In a first aspect, the invention relates to a nucleic acid construct comprising: a first polynucleotide encoding a signal peptide having a sequence identity of at least 80% to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49; and a) a second polynucleotide encoding an alpha-lactalbumin (ALAB) polypeptide having a sequence identity of at least 70% to the polypeptide sequence of SEQ ID NO:6; or b) a second polynucleotide encoding a polypeptide having phytase activity; wherein the first polynucleotide and the second polynucleotide are operably linked in translational fusion.
In one embodiment, the second polynucleotide is located downstream from the first polynucleotide.
In one embodiment, the signal peptide is a naturally occurring signal peptide, or a functional fragment or functional variant of a naturally occurring signal peptide.
In one embodiment, the signal peptide is from a filamentous fungal glycosidase.
In one embodiment, the construct is further comprising a third polynucleotide downstream of the first polynucleotide and upstream of the second polynucleotide.
In one embodiment, the third polynucleotide is a non-coding intron.
In one embodiment, the third polynucleotide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:17 (gtaagtaacatccactctgttctagtgccatgctgagattgtacag).
In one embodiment, the construct comprises a polynucleotide sequence with a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:18 or SEQ ID NO:19.
In one embodiment, the nucleic acid construct further comprises a heterologous promoter, and wherein said promoter, the first polynucleotide, and the second polynucleotide, and optionally the third polynucleotide, are operably linked.
In another embodiment, the promoter is a P3 promoter or a P3-based promoter, preferably the heterologous promoter is a tandem promoter comprising the P3 promoter or is a tandem promoter derived from the P3 promoter.
In one embodiment, the promoter is operably linked to an mRNA stabilizer region; preferably the mRNA stabilizer region is the cry 11 IA mRNA stabilizer region.
In one embodiment, the signal peptide is a naturally occurring signal peptide, or a functional fragment or functional variant of a naturally occurring signal peptide.
In one embodiment, the signal peptide is from a glycosidase (EC 3.2.1).
In one embodiment, the signal peptide is obtained from a mannanase polypeptide (EC 3.2.1.78).
In one embodiment, the signal peptide is obtained from a b-transglycosidase polypeptide (EC 2.4.1.-).
In one embodiment, the the signal peptide is obtained from a chitin b-1 ,3/1 ,6-glucanosyltransferase polypeptide (EC 2.4.1 .-) polypeptide.
In one embodiment, the signal peptide is obtained from an endo-b-1 ,3-glucanase polypeptide or laminarinase polypeptide (EC 3.2.1.39).
In one embodiment, the signal peptide is obtained from a polypeptide, such as a mannanase, transglycosidase, glycosyltransferase, laminarinase, or glucanase, expressed by a filamentous fungal host cell.
In one embodiment, the signal peptide is obtained from a polypeptide expressed by a Aspergillus host cell, such as an Aspergillus luchuensis.
In one embodiment, the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:1 or SEQ ID NO:3; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1 , SEQ ID NO:3, SEQ ID NO:44, SEQ ID NO:46, or SEQ ID NO:48;.
In one embodiment, the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:1 ; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1.
In one embodiment, the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:3; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:3.
In one embodiment, the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:44; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:44.
In one embodiment, the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:46; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:46.
In one embodiment, the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:48; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:48.
In one embodiment, the the signal peptide is obtained from a glycosidase expressed by an Aspergillus species selected from the group consisting of Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus luchuensis, or Aspergillus oryzae.
In one embodiment, the e signal peptide is obtained from a glycosidase expressed by Aspergillus luchuensis.
In one embodiment, the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:2.
In one embodiment, the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:2.
In one embodiment, the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:4.
In one embodiment, the wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:4.
In one embodiment, the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:45.
In one embodiment, the wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:45.
In one embodiment, the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:47.
In one embodiment, the wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:47.
In one embodiment, the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:49.
In one embodiment, the wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:49.
In one embodiment, the signal peptide consists of the amino acid sequence of SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49 with or without its C-terminal alanine, or a peptide fragment thereof that retains the ability to direct the polypeptide into or across a cell membrane.
In one embodiment, the N- and/or C-terminal end of the signal peptide has been extended by addition of one or more amino acids.
In one embodiment, the polynucleotide encoding the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:5; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:5.
In one embodiment, the the alpha-lactalbumin polypeptide is a bovine alpha-lactalbumin polypeptide.
In one embodiment, the alpha-lactalbumin polypeptide is a human alpha-lactalbumin polypeptide.
In one embodiment, the the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO:6.
In one embodiment, the alpha-lactalbumin polypeptide comprises, consists essentially of, or consists of the mature polypeptide of SEQ ID NO:6.
In one embodiment, the the N- and/or C-terminal end of the alpha-lactalbumin polypeptide has been extended by addition of one or more amino acids.
In one embodiment, the polynucleotide encoding the polypeptide having phytase activity has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:11 ; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1 1.
In one embodiment, the polypeptide having phytase activity is a bacterial polypeptide or variant thereof.
In one embodiment, the polypeptide having phytase activity is EC 3.1.3.26.
In one embodiment, the polypeptide having phytase activity has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO:12.
In one embodiment, the having phytase activity comprises, consists essentially of, or consists of the mature polypeptide of SEQ ID NO:12.
In one embodiment, the N- and/or C-terminal end of the polypeptide having phytase activity has been extended by addition of one or more amino acids.
It is expected that the invention will be just as effective when employing a signal peptide that is highly similar to the signal peptide disclosed in SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, and SEQ ID NO: 49, encoded by SEQ ID NO:1 ,SEQ ID NO:3, SEQ ID NO:44, SEQ ID NO: 46, and SEQ ID NO:48, respectively. One or more non-essential amino acids may, for example, be altered. Non- essential amino acids in a signal peptide can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, 1989, Science 244: 1081 -1085). In the latter technique, single alanine mutations are introduced at every residue in the molecule, and the resultant molecules are tested for signal peptide activity to identify amino acid residues that are critical to the activity of the molecule and residues that are non-essential. See also, Hilton et al., 1996, J. Biol. Chem. 271 : 4699-4708. The identity of essential and non-essential amino acids can also be inferred from an alignment with one or more related signal peptide.
Single or multiple amino acid substitutions, deletions, and/or insertions can be made and tested using known methods of mutagenesis, recombination, and/or shuffling, followed by a relevant screening procedure, such as those disclosed by Reidhaar-Olson and Sauer, 1988, Science 241 : 53-57; Bowie and Sauer, 1989, Proc. Natl. Acad. Sci. USA 86: 2152-2156; WO 95/17413; or WO 95/22625. Other methods that can be used include error-prone PCR, phage display (e.g. Lowman et al., 1991 , Biochemistry 30: 10832-10837; U.S. Patent No. 5,223,409; WO 92/06204), and region-directed mutagenesis (Derbyshire et al., 1986, Gene 46: 145; Ner ef a/., 1988, DNA 7: 127).
Mutagenesis/shuffling methods can be combined with high-throughput, automated screening methods to detect activity of cloned, mutagenized polypeptides expressed by host cells (Ness et al., 1999, Nature Biotechnology 17: 893-896). Mutagenized DNA molecules that encode active polypeptides can be recovered from the host cells and rapidly sequenced using standard methods in the art. These methods allow the rapid determination of the importance of individual amino acid residues in a polypeptide.
In one aspect, the signal peptide is a variant (/.e., functional variant) or fragment (/.e., functional fragment) of the signal peptide of SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49. In one aspect, the number of alterations in the signal peptide variant of the present invention is 1- 10, e.g., 1-5, such as 1 , 2, 3, 4, or 5 alterations. Alterations includes substitutions, insertions, and/or deletions at one or more (e.g. , several) positions compared to the parent. A substitution means replacement of the amino acid occupying a position with a different amino acid; a deletion means removal of the amino acid occupying a position; and an insertion means adding an amino acid adjacent to and immediately following the amino acid occupying a position.
In a preferred embodiment, the signal peptide is a variant of the mature polypeptide of SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49comprising 1-10 alterations, e.g., 1-5, such as 1 , 2, 3, 4, or 5 alterations, compared to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49 respectively.
The first and second polynucleotide are operably linked in translational fusion. In the context of the present invention, the term “operably linked in translation fusion” means that the signal peptide encoded by the first polynucleotide and the polypeptide encoded by the second polynucleotide are encoded in frame and translated together as a single polypeptide. Preferably, following translation, the signal peptide is
removed to provide the mature phytase polypeptide or the mature ALAB polypeptide. Alternatively, the signal peptide is not removed, or only removed partly to provide the mature ALAB polypeptide orthe mature polypeptide having phytase activity and comprising at least a fragment of the signal peptide.
The first and second polynucleotide may be manipulated in a variety of ways to provide for expression of a variant. Manipulation of the polynucleotide priorto its insertion into a nucleic acid construct or expression vector may be desirable or necessary depending on the construct or vector. The techniques for modifying polynucleotides utilizing recombinant DNA methods are well known in the art.
Besides a signal peptide, the nucleic acid constructs of the invention may be operably linked to one or more further control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
Promoters
The control sequence may be a promoter, a polynucleotide that is recognized by a host cell for expression of a polynucleotide encoding a polypeptide of the present invention. The promoter contains transcriptional control sequences that mediate the expression of the polypeptide. The promoter may be any polynucleotide that shows transcriptional activity in the host cell including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
In one embodiment, the nucleic acid construct further comprises a heterologous promoter, and wherein said promoter, the first polynucleotide, and the second polynucleotide are operably linked. The promoter is orientated upstream of the first polynucleotide.
In an embodiment, the promoter is a heterologous promoter. Preferably, the promoter is a tandem promoter. More preferably, the promoter is a P3 promoter or a P3-based promoter.
Further suitable promoters for directing transcription of the polynucleotide of the present invention in a filamentous fungal host cell are promoters obtained from Aspergillus, Fusarium, Rhizomucor and Trichoderma cells, such as the promoters described in Mukherjee et al., 2013, “Trichoderma-. Biology and Applications”, and by Schmoll and Dattenbock, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
For expression in a yeast host, examples of useful promoters are described by Smolke et al., 2018, “Synthetic Biology: Parts, Devices and Applications” (Chapter 6: Constitutive and Regulated Promoters in Yeast: How to Design and Make Use of Promoters in S. cerevisiae), and by Schmoll and Dattenbock, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
In one embodiment, the promoter is a promoter, such as a P3 promoter, operably linked to an mRNA stabilizer region. Preferably, the mRNA stabilizer region is the cry 11 IA mRNA stabilizer region. mRNA Stabilizers
The control sequence may also be an mRNA stabilizer region downstream of a promoter and upstream of the coding sequence of a gene which increases expression of the gene.
Examples of suitable mRNA stabilizer regions are obtained from a Bacillus thuringiensis crylllA gene (WO 94/25612) and a Bacillus subtilis SP82 gene (Hue et al., 1995, Journal of Bacteriology 177:
3465-3471). Examples of mRNA stabilizer regions for fungal cells are described in Geisberg et al., 2014, Cell 156(4): 812-824, and in Morozov et al., 2006, Eukaryotic Cell 5(11): 1838-1846.
Terminators
The control sequence may also be a transcription terminator, which is recognized by a host cell to terminate transcription. The terminator is operably linked to the 3’-terminus of the polynucleotide encoding the polypeptide. Any terminator that is functional in the host cell may be used in the present invention.
Preferred terminators for filamentous fungal host cells may be obtained from Aspergillus or Trichoderma species, such as obtained from the genes for Aspergillus niger glucoamylase, Trichoderma reesei beta-glucosidase, Trichoderma reesei cellobiohydrolase I, and Trichoderma reesei endoglucanase I, such as the terminators described in Mukherjee et al., 2013, “Trichoderma-. Biology and Applications”, and by Schmoll and Dattenbock, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
Preferred terminators for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488.
The control sequence may also be a propeptide coding sequence that encodes a propeptide positioned at the N-terminus of a polypeptide. The resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is generally inactive and can be converted to an active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide. The propeptide coding sequence may be obtained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), Myceliophthora thermophila laccase (WO 95/33836), Rhizomucor miehei aspartic proteinase, and Saccharomyces cerevisiae alpha-factor.
Where both signal peptide and propeptide sequences are present, the propeptide sequence is positioned next to the N-terminus of a polypeptide and the signal peptide sequence is positioned next to the N-terminus of the propeptide sequence. Additionally or alternatively, when both signal peptide and propeptide sequences are present, the polypeptide may comprise only a part of the signal peptide sequence and/or only a part of the propeptide sequence. Alternatively, the final or isolated polypeptide may comprise a mixture of mature polypeptides and polypeptides which comprise, either partly or in full length, a propeptide sequence and/or a signal peptide sequence.
It may also be desirable to add regulatory sequences that regulate expression of the polypeptide relative to the growth of the host cell. Examples of regulatory sequences are those that cause expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the Aspergillus niger glucoamylase promoter, Aspergillus oryzae TAKA alpha-amylase promoter, and Aspergillus oryzae glucoamylase promoter, Trichoderma reesei cellobiohydrolase I promoter, and
Trichoderma reesei cellobiohydrolase II promoter may be used. Other examples of regulatory sequences are those that allow for gene amplification. In fungal systems, these regulatory sequences include the dihydrofolate reductase gene that is amplified in the presence of methotrexate, and the metallothionein genes that are amplified with heavy metals.
Leader Sequences
The control sequence may also be a leader, a non-translated region of an mRNA that is important for translation by the host cell. The leader is operably linked to the 5’-terminus of the polynucleotide encoding the polypeptide. Any leader that is functional in the host cell may be used.
Preferred leaders for filamentous fungal host cells may be obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.
Suitable leaders for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP).
Transcription Factors
The control sequence may also be a transcription factor, a polynucleotide encoding a polynucleotide-specific DNA-binding polypeptide that controls the rate of the transcription of genetic information from DNA to mRNA by binding to a specific polynucleotide sequence. The transcription factor may function alone and/or together with one or more other polypeptides or transcription factors in a complex by promoting or blocking the recruitment of RNA polymerase. Transcription factors are characterized by comprising at least one DNA-binding domain which often attaches to a specific DNA sequence adjacent to the genetic elements which are regulated by the transcription factor. The transcription factor may regulate the expression of a protein of interest either directly, i.e. by activating the transcription of the gene encoding the protein of interest by binding to its promoter, or indirectly, i.e. by activating the transcription of a further transcription factor which regulates the transcription of the gene encoding the protein of interest, such as by binding to the promoter of the further transcription factor. Suitable transcription factors for prokaryotic host cells are described in Seshasayee et al., Subcell Biochem 2011 ; 52:7-23, as well in Balleza et al., FEMS Microbiol Rev 2009, 33(1 ): 133-151 .
Polyadenylation Sequences
The control sequence may also be a polyadenylation sequence, a sequence operably linked to the 3’-terminus of the polynucleotide which, when transcribed, is recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence that is functional in the host cell may be used.
Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes for Aspergillus nidulans anthranilate synthase, Aspergillus niger glucoamylase, Aspergillus niger alphaglucosidase, Aspergillus oryzae TAKA amylase, and Fusarium oxysporum trypsin-like protease.
Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman, 1995, Mol. Cellular Biol. 15: 5983-5990.
Expression Vectors
In a second aspect, the present invention also relates to recombinant expression vectors comprising a nucleic acid construct according to the first aspect. The expression vectors comprise a polynucleotide of the present invention, a promoter, and transcriptional and translational stop signals. The various nucleotide and control sequences may be joined together to produce a recombinant expression vector that may include one or more convenient restriction sites to allow for insertion or substitution of the polynucleotide encoding the polypeptide at such sites. Alternatively, the polynucleotide may be expressed by inserting the polynucleotide or a nucleic acid construct comprising the polynucleotide into an appropriate vector for expression. In creating the expression vector, the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
The recombinant expression vector may be any vector (e.g., a plasmid or virus) that can be conveniently subjected to recombinant DNA procedures and can bring about expression of the polynucleotide. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vector may be a linear or closed circular plasmid.
The vector may be an autonomously replicating vector, i.e., a vector that exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a mini-chromosome, or an artificial chromosome. The vector may contain any means for assuring self-replication. Alternatively, the vector may be one that, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids that together contain the total DNA to be introduced into the genome of the host cell, or a transposon, may be used.
The vector preferably contains one or more selectable markers that permit easy selection of transformed, transfected, transduced, or the like cells. A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
The vector preferably contains an element(s) that permits integration of the vector into the host cell's genome or autonomous replication of the vector in the cell independent of the genome.
For integration into the host cell genome, the vector may rely on the polynucleotide’s sequence encoding the polypeptide or any other element of the vector for integration into the genome by homologous recombination, such as homology-directed repair (HDR), or non-homologous recombination, such as non- homologous end-joining (NHEJ).
For autonomous replication, the vector may further comprise an origin of replication enabling the vector to replicate autonomously in the host cell in question. The origin of replication may be any plasmid replicator mediating autonomous replication that functions in a cell. The term “origin of replication” or “plasmid replicator” means a polynucleotide that enables a plasmid or vector to replicate in vivo.
More than one copy of a polynucleotide of the present invention may be inserted into a host cell to increase production of a polypeptide. For example, 2 or 3 or 4 or 5 or more copies are inserted into a host cell. An increase in the copy number of the polynucleotide can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker
gene with the polynucleotide where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the polynucleotide, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
Host Cells
In a third aspect, the invention relates to fungal host cells comprising in its genome: a) a nucleic acid construct according to the first aspect; and/or b) an expression vector according to the second aspect.
A construct or vector comprising a polynucleotide is introduced into a host cell so that the construct or vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier. The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source. The polypeptide encoded by the introduced polynucleotide can be native or heterologous to the recombinant host cell. Also, at least one of the one or more control sequences can be heterologous to the polynucleotide encoding the polypeptide. The recombinant host cell may comprise a single copy, or at least two copies, e.g. three, four, five or more copies of the polynucleotide of the present invention.
In one embodiment, the host cell comprises two or more copies of the nucleic acid construct and/or the expression vector.
The host cell may be a fungal cell. “Fungi” as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota as well as the Oomycota and all mitosporic fungi (as defined by Hawksworth et al., In, Ainsworth and Bisby’s Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK).
Fungal cells may be transformed by a process involving protoplast-mediated transformation, Agrobacterium-mediated transformation, electroporation, biolistic method and shock-wave-mediated transformation as reviewed by Li et al., 2017, Microbial Cell Factories 16: 168 and procedures described in EP 238023, Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81 : 1470-1474, Christensen et al., 1988, Bio/Technology 6: 1419-1422, and Lubertozzi and Keasling, 2009, Biotechn. Advances 27 : 53-75. However, any method known in the art for introducing DNA into a fungal host cell can be used, and the DNA can be introduced as linearized or as circular polynucleotide.
The fungal host cell may be a yeast cell. “Yeast” as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). For purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, Passmore, and Davenport, editors, Soc. App. Bacteriol. Symposium Series No. 9, 1980).
The yeast host cell may be a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell. In a preferred embodiment, the yeast host cell is a Pichia or Komagataella cell, e.g., a Pichia pastoris cell (Komagataella phaffii).
The fungal host cell may be a filamentous fungal cell. “Filamentous fungi” include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra). The filamentous fungi are generally characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
The filamentous fungal host cell may be an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell. In a preferred embodiment, the filamentous fungal host cell is an Aspergillus, Trichoderma or Fusarium cell. In a further preferred embodiment, the filamentous fungal host cell is an Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, or Fusarium venenatum cell.
For example, the filamentous fungal host cell may be an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Talaromyces emersonii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
In one embodiment the host cell is an Aspergillus cell.
In another embodiment, the host cell is an Aspergillus niger cell.
In one embodiment, the host cell is an Aspergillus oryzae cell.
In an aspect, the host cell is isolated.
In one embodiment, the host cell comprises at least two copies of the nucleic acid construct and/or the expression vector, such as two copies, three copies, four copies or more than four copies.
In another aspect, the host cell is purified.
Methods of Production
In a fourth aspect, the present invention also relates methods of producing an alpha-lactalbumin (ALAB) polypeptide, the method comprising:
a) cultivating a host cell according to the third aspect under conditions conducive for production of the ALAB polypeptide; and optionally b) recovering the ALAB polypeptide.
In a fifth aspect, the present invention also relates to methods of producing a polypeptide having phytase activity, the method comprising: a) cultivating a host cell according to the third aspect under conditions conducive for production of the polypeptide having phytase activity; and optionally b) recovering the polypeptide having phytase activity.
The host cell is cultivated in a nutrient medium suitable for production of the polypeptide using methods known in the art. For example, the cell may be cultivated by shake flask cultivation, or small-scale or large- scale fermentation (including continuous, batch, fed-batch, or solid-state, and/or microcarrier-based fermentations) in laboratory or industrial fermentors in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
The polypeptide may be detected using methods known in the art that are specific for the polypeptide, including, but not limited to, the use of specific antibodies, formation of an enzyme product, disappearance of an enzyme substrate, or an assay determining the relative or specific activity of the polypeptide.
The polypeptide may be recovered from the medium using methods known in the art, including, but not limited to, collection, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation. In one aspect, a whole fermentation broth comprising the polypeptide is recovered. In another aspect, a cell- free fermentation broth comprising the polypeptide is recovered.
The polypeptide may be purified by a variety of procedures known in the art to obtain substantially pure polypeptides and/or polypeptide fragments (see, e.g., Wingfield, 2015, Current Protocols in Protein Science; 80(1): 6.1.1-6.1.35; Labrou, 2014, Protein Downstream Processing, 1129: 3-10).
In an alternative aspect, the polypeptide having phytase activity is not recovered. In one aspect the polypeptide having phytase activity is not recovered, but rather a host cell of the present invention expressing the polypeptide having phytase activity is used as a source of the variant.
Fusion Polypeptide
In a sixth aspect, the invention relates to a fusion polypeptide, comprising a) a signal peptide having at least 60% sequence identity to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49, and b) an alpha-lactalbumin polypeptide having at least 60% sequence identity to SEQ ID NO: 6, or a polypeptide having phytase activity having at least 60% sequence identity to SEQ ID NO:8.
In one embodiment, the signal peptide is located upstream of the alpha-lactalbumin polypeptide or upstream of the polypeptide having phytase activity.
In one embodiment, the signal peptide is located at the N-terminal end of the alpha-lactalbumin polypeptide or at the N-terminal end of the polypeptide having phytase activity.
In a particular embodiment, the alpha-lactalbumin polypeptide is selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 8;
(b) a polypeptide having at least 60% sequence identity to SEQ ID NO: 10;
(c) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 7 or SEQ ID NO: 9;
(d) a polypeptide derived from SEQ ID NO: 8, SEQ ID NO:10, SEQ ID NO:18 or SEQ ID NO:19, a mature polypeptide of SEQ ID NO: 8, SEQ ID NO:10, SEQ ID NO:18 or SEQ ID NO:19, having 1 -30 alterations, e.g., substitutions, deletions and/or insertions at one or more positions, e.g., 1 or 2 or 3 or 4 or 5 or 6 or 7 or 8 or 9 or 10 or 11 or 12 or 13 or 14 or 15 or 16 or 17 or 18 or 19 or 20 or 21 or 22 or 23 or 24 or 25 or 26 or 27 or 28 or 29 or 30 alterations, in particular substitutions;
(e) a polypeptide derived from the polypeptide of (a), (b), (c), or (d) wherein the N- and/or C- terminal end has been extended by addition of one or more amino acids;
(f) a fragment of the polypeptide of (a), (b), (c), (d), or (e), and
(g) the fragment of (f) wherein the polypeptide has 1 - 10 deletions at the N-terminal end.
In another embodiment, the polypeptide having phytase activity is selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 14;
(b) a polypeptide having at least 60% sequence identity to SEQ ID NO: 16;
(c) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 13 or SEQ ID NO: 15;
(d) a polypeptide derived from SEQ ID NO: 14 or SEQ ID NO:16, a mature polypeptide of SEQ ID NO:14 or SEQ ID NO:16, having 1 -30 alterations, e.g., substitutions, deletions and/or insertions at one or more positions, e.g. , 1 or 2 or 3 or 4 or 5 or 6 or 7 or 8 or 9 or 10 or 11 or 12 or 13 or 14 or 15 or 16 or 17 or 18 or 19 or 20 or 21 or 22 or 23 or 24 or 25 or 26 or 27 or 28 or 29 or 30 alterations, in particular substitutions;
(e) a polypeptide derived from the polypeptide of (a), (b), (c), or (d) wherein the N- and/or C- terminal end has been extended by addition of one or more amino acids;
(f) a fragment of the polypeptide of (a), (b), (c), (d), or (e), and
(g) the fragment of (f) wherein the polypeptide has 1 - 10 deletions at the N-terminal end.
Fermentation Broth Formulations or Cell Compositions
The present invention also relates to a fermentation broth formulation or a cell composition comprising a polypeptide having phytase activity. The fermentation broth product further comprises additional ingredients used in the fermentation process, such as, for example, cells (including, the host cells containing the nucleic acid constructs of the present invention which are used to produce the polypeptide having phytase activity), cell debris, biomass, fermentation media and/or fermentation products. In some embodiments, the composition is a cell-killed whole broth containing organic acid(s), killed cells and/or cell debris, and culture medium.
The term "fermentation broth" as used herein refers to a preparation produced by cellular fermentation that undergoes no or minimal recovery and/or purification. For example, fermentation broths are produced when microbial cultures are grown to saturation, incubated under carbon-limiting conditions to allow protein synthesis (e.g., expression of enzymes by host cells) and secretion into cell culture medium. The fermentation broth can contain unfractionated or fractionated contents of the fermentation materials derived at the end of the fermentation. Typically, the fermentation broth is unfractionated and comprises the spent culture medium and cell debris present afterthe microbial cells (e.g., filamentous fungal cells) are removed, e.g., by centrifugation. In some embodiments, the fermentation broth contains spent cell culture medium, extracellular enzymes, and viable and/or nonviable microbial cells.
In some embodiments, the fermentation broth formulation or the cell composition comprises a first organic acid component comprising at least one 1 -5 carbon organic acid and/or a salt thereof and a second organic acid component comprising at least one 6 or more carbon organic acid and/or a salt thereof. In some embodiments, the first organic acid component is acetic acid, formic acid, propionic acid, a salt thereof, or a mixture of two or more of the foregoing and the second organic acid component is benzoic acid, cyclohexanecarboxylic acid, 4-methylvaleric acid, phenylacetic acid, a salt thereof, or a mixture of two or more of the foregoing.
In one aspect, the composition contains an organic acid(s), and optionally further contains killed cells and/or cell debris. In some embodiments, the killed cells and/or cell debris are removed from a cell- killed whole broth to provide a composition that is free of these components.
The fermentation broth formulation or cell composition may further comprise a preservative and/or anti-microbial (e.g., bacteriostatic) agent, including, but not limited to, sorbitol, sodium chloride, potassium sorbate, and others known in the art.
The cell-killed whole broth or cell composition may contain the unfractionated contents of the fermentation materials derived at the end of the fermentation. Typically, the cell-killed whole broth or cell composition contains the spent culture medium and cell debris present after the microbial cells (e.g., filamentous fungal cells) are grown to saturation, incubated under carbon-limiting conditions to allow protein synthesis. In some embodiments, the cell-killed whole broth or cell composition contains the spent cell culture medium, extracellular enzymes, and killed filamentous fungal cells. In some embodiments, the microbial cells present in the cell-killed whole broth or composition can be permeabilized and/or lysed using methods known in the art.
A whole broth or cell composition as described herein is typically a liquid, but may contain insoluble components, such as killed cells, cell debris, culture media components, and/or insoluble enzyme(s). In some embodiments, insoluble components may be removed to provide a clarified liquid composition.
The whole broth formulations and cell compositions of the present invention may be produced by a method described in WO 90/15861 or WO 2010/096673.
The present invention is further described by the following examples that should not be construed as limiting the scope of the invention.
Examples
Example 1 : Signal peptide library for expression of phytase
The phytase expression vectors comprising signal peptide variants JSP002, 010, 011 & 031 were prepared as follows.
The phytase expression plasmid is composed of phytase coding sequence (SEQ ID NO: 11) and the marker expression cassettes, and E. coli vector fragment. To prepare the expression plasmid for each signal peptide variant, two fragments were amplified. The first fragment (signal peptide fragment) with partial E. coli vector fragment and partial phytase expression cassette including promotor, signal peptide sequences and first 21 bases of 5’ end of mature phytase. Second fragment (mature fragment) contain partial phytase expression cassette including mature peptide sequence and terminator, marker expression cassette and the partial E. coli derived fragments. Two fragments have about 20 bp of overlapping sequences at their 5’ and 3’ ends and can be assembled by Gibson Assembly Kit. Obtained plasmid will harbor different signal peptides for phytase expression. For the amplification of mature fragment, overlap PCR was performed to amplify a fragment coding for each signal fragment and phytase mature fragment. It was inserted between the promoter and terminator of A. niger expression plasmid. The plasmid DNAs prepared introduced into A. niger hosts strains (host: C3085 and/or C5553). All strains comprise identical phytase gene copy number
SP GH13 with SEQ ID NO:25 is derived from A. niger endo-1 ,4-alpha amylase GH13.
SP cutinase with SEQ ID NO:27 is derived from Humicola insolens cutinase.
SP GH16 with SEQ ID NO:4 is derived from A. luchuensis endo-b-1 ,3-glucanase GH16 (SEQ ID NO:23).
SP GH26 with SEQ ID NO:2 is derived from A. luchuensis endo-1 ,4-beta-mannanase GH26 (SEQ ID NO:21).
Table 2. Primers
Example 2: SP GH16 and SP GH26 show increased phytase yields during MTP cultivation
Transformants constructed as in example 1 were fermented in 96-well multi-titer-plate (MTP) containing % YPG Ac, and 1% SBP at 30C for 3 days. The yield is detected by measuring phytase activity in culture supernatants using artificial substrate (described under section “pNP assay”). As can be seen from Table 3, increased phytase activities were measured using SP GH26 and SP GH16, compared to phytase activities using SP cutinase. In detail, SP GH26 showed 187% yield increase compared to SP cutinase, and SP GH16 showed 12% yield increase compared to SP cutinase.
Table 3. Relative phytase activities of signal variants in C3085 host
Example 3: SP GH16 with increased phytase yield during lab tank fermentation
GH16 signal peptide was compared to SP GH13 by lab-tank fermented under the current standard conditions. The strains have genetically the same background, but integrated gene copy number of phytase signal variants are different as indicated in Table 4. Results are shown in Table 4, where JSP010 signal peptide GH16 showed significantly higher yield compared to SP GH13, i.e. 2.3-fold increase. The yield is measured by phytase activity using phytase as substrate (FYT(B) assay).
Example 4: Signal peptide library for expression of bovine alpha-lactalbumin (ALAB)
Signal peptide variants (shown in table 5) for expression of bovine alpha-lactalbumin (ALAB, SEQ ID NO:6) was constructed by 2 transformation steps. The first transformation (host C2552) was done to integrate an expression cassette of bovine alpha-lactalbumin, but an intron of an amylase containing PAM sequence is inserted in place of signal sequence. To the obtained strain, second transformation was performed. The target signal sequence fragments were amplified by PCR and integrated in between the promoter and mature sequence of bovine alpha-lactalbumin by CRISPR system. Obtained strains were bovine alpha-lactalbumin expressing strains under six different signal peptides.
SP GH72 with SEQ ID NO:29 is derived from A. niger beta-1 ,3-glucanosyltransferase GH72 with GPI-anchor.
SP GH16 with SEQ ID NO:4 is derived from A. luchuensis endo-b-1 ,3-glucanase GH16 (SEQ ID NO:23).
SP GH26 with SEQ ID NO:2 is derived from A. luchuensis endo-1 ,4-beta-mannanase GH26 (SEQ ID NO:21).
SP LYA1_4 with SEQ ID NO:45.
SP pepesin A with SEQ ID NO:47.
SP GH28_9 with SEQ ID NO:49.
Example 5: Increased yield of alpha-lactalbumin using SP GH26
Transformants JSP002 and JSP031 constructed as in example 4 were fermented by MTP or by buffled shake flasks (SF). The culture broth was centrifuged (3,500 x g, 15 min) and the supernatant was used for yield evaluation by MALDI-TOF MS semi-quantification method.
Semi-quantification by MALDI-TOF MS
Yield of bovine alpha-lactalbumin was briefly measured by the intensity of MS spectrum relative to the reference signal. Results are summarized in Table 6. Accordingly, using SP GH26, ALAB expression increased by 91 % compared to ALAB expression with SP GH72 (MTP). In shake flask cultivation, ALAB expression with SP GH26 was increased ca. 2.6-fold compared to ALAB expression with SP GH72.
Table 6. Relative ALAB yield using different signal sequences and fermentation protocols (MTP & SF).
Example 6: SP GH16 and SP GH26 result in similar ALAB yield in lab tank fermentation
The JSP002 (SP GH26) signal peptide which showed superior expression compared to JSP031 SP in MTP and SF format of example 5, was compared to the JSP010 (SP GH16) signal peptide for the expression of ALAB. Therefore, ALAB expressing strains were re-constructed with A. niger host C6216 comprising three ALAB encoding gene copies, each with either SP GH26 or SP GH16 located upstream of the ALAB gene. The expression plasmids were prepared by overlap PCR of signal and mature alphalactalbumin (ALAB) fragments. Obtained strains were fermented in lab-tanks under the current standard conditions and yields were compared by band intensities on SDS-PAGE. As shown in Table 7, no significant ALAB yield difference was observed between two signal variants SP GH16 and SP GH26. Thus, both signal variants have significant yield increases compared to JSP031 from example 5, in which example SP GH26 showed 191 % increased ALAB yield compared to JSP031 with SP from GH72.
Table 7. Relative ALAB yields using different signal sequence in different fermentation protocols (SF & lab tank).
Example 7: Increased ALAB yield in MTP cultivation
Additional signal peptide variants for expression of bovine ALAB (SEQ ID NO: 6) was constructed as described in Example 4. Signal peptide variants used in Example 7 are: JSP004 (control, cutinase SP with SEQ ID NO: 27), JSP008 (GH28_9 SP with SEQ ID NO: 49), JSP017 (LYA1_4 SP with SEQ ID NO: 45), JSP002 (SP GH 26, SEQ ID NO:2), JSP031 (SP GH72, SEQ ID NO: 29), and JSP019 (pepesin A = pepA SP with SEQ ID NO: 47). Constructed strains were cultivated in MTP (multi-titer plates). The culture broth was centrifuged (3,500 x g, 15 min) and the supernatant was used for yield evaluation by SDS-PAGE. Culture supernatant was deglycosylated with EndoH and loaded on to SDS-PAGE gel. Intensity of bands corresponding to ALAB were compared to the intensity of ALAB fused to the cutinase SP (JSP004) used as control. The ALAB yield of the different SP variants relative to the ALAB yield using the cutinase SP is shown in table 8. All of the tested SP variants show increased ALAB yield relative to the SP cutinase. As shown in table 8, JSP017 and JSP019 resulted in an 342% and 472% ALAB yield, respectively.
Example 8: Increased ALAB yield in micro-bioreactors
Aspergillus niger strains expressing ALAB fused to different signal peptides were cultivated in microbioreactors, each with a cultivation volume of 250 ml. The strains were constructed as described in Example 4, and grown in duplicates. Average ALAB yield was measured by size exclusion chromatography and is shown in Table 9, normalized to the JSP002 GH 26 SP with SEQ ID NO:2. As shown in table 9, ALAB yield is increased by 53 % for the JSP017 signal peptide, relative to the JSP002 signal peptide.
Unless otherwise stated, DNA manipulation and transformation were performed using standard methods of molecular biology as described in Sambrook et al. (1989) Molecular cloning: A laboratory manual, Cold Spring Harbor lab., Cold Spring Harbor, NY; Ausubel, F. M. et al. (eds.) "Current protocols in Molecular Biology", John Wiley and Sons, 1995; Harwood, C. R., and Cut-ting, S. M. (eds.) "Molecular Biological Methods for Bacillus". John Wiley and Sons, 1990.
Purchased material (E.coli and kits)
E.coli DH5a (Toyobo) is used for plasmid construction and amplification. Amplified plasmids are recovered with Qiagen Plasmid Kit (Qiagen). Ligation is done with either Rapid DNA Dephos & Ligation Kit (Roche) or Gibson assembly kit (NEB) according to the manufactory instructions. Polymerase Chain Reaction (PCR) is carried out with KOD-Plus system (TOYOBO) or PrimeSTAR MAX DNA Polymerase (Takara). QIAquickTM Gel Extraction Kit (Qiagen) is used for the purification of PCR fragments and extraction of DNA fragment from agarose gel.
Enzymes
Enzymes for DNA manipulations (e.g. restriction endonucleases, ligases etc.) are obtainable from New England Biolabs, Inc. and were used according to the manufacturer’s instructions.
Plasmids
The sequence for phytase harboring the Phytase from Citrobacterium braakii is described as SEQ ID NO:11 (coding sequence) and SEQ ID NO:12 (Amino acid sequence).
Microbial strains
The expression host strain Aspergillus niger C2552 was isolated by Novozymes and is a derivative of Aspergillus niger NN049184 which was isolated from soil described in example 14 in WQ2012/160093. C3085, C5553, C6242 are strains which can produce the glucoamylase (1 ,4-alpha-D-glucan glucohydrolase, EC 3.2.1 .3) from Gloeophyllum sepiarium (Gs AMG).
Medium
COVE trace metals solution was composed of 0.04 g of NaB4O7«10H2Q, 0.4 g of CuSO4«5H2O, 1 .2 g of FeSO4«7H2O, 0.7 g of MnSO4«H2O, 0.8 g of Na2MoQ2«2H20, 10 g of ZnSO4«7H2O, and deionized water to 1 liter.
50X COVE salts solution was composed of 26 g of KCI, 26 g of MgSO4«7H2O, 76 g of KH2PO4, 50 ml of COVE trace metals solution, and deionized water to 1 liter.
COVE medium was composed of 342.3 g of sucrose, 20 ml of 50X COVE salts solution, 10 ml of 1 M acetamide, 10 ml of 1 .5 M CsCI2, 25 g of Noble agar, and deionized water to 1 liter.
COVEII plus 5-Fluorocytosine top agarose was composed of 34 g of sucrose, 20 ml of 50X COVE salts solution, 10 ml of 1 M acetamide, 2 ml of 5 g/L 5-Fluorocytosine, 10 g of low melt agarose, and deionized water to 1 liter.
COVE-N-GlyX plates were composed of 218 g of xylitol, 10 g of glycerol, 2.02 g of KNO3, 50 ml of COVE salts solution, 25 g of Noble agar, and deionized water to 1 liter.
STC buffer was composed of 0.8 M sorbitol, 25 mM Tris pH 8, and 25 mM CaCI2.
STPC buffer was composed of 40% PEG 4000 in STC buffer.
LB medium was composed of 10 g of tryptone, 5 g of yeast extract, 5 g of sodium chloride, and deionized water to 1 liter.
LB plus ampicillin plates were composed of 10 g of tryptone, 5 g of yeast extract, 5 g of sodium chloride, 15 g of Bacto agar, ampicillin at 100 pg per ml, and deionized water to 1 liter.
YPG medium was composed of 10 g of yeast extract, 20 g of Bacto peptone, 20 g of glucose, and deionized water to 1 liter.
SOC medium was composed of 20 g of tryptone, 5 g of yeast extract, 0.5 g of NaCI, 10 ml of 250 mM KCI, and deionized water to 1 liter.
TAE buffer was composed of 4.84 g of Tris Base, 1.14 ml of Glacial acetic acid, 2 ml of 0.5 M EDTA pH 8.0, and deionized water to 1 liter.
1/4YPG Ac,1 % SBP was composed of 5.0 g/L glucose, 2.5 g/L yeast extract, 5 g/L peptone, 10g/L soy bean powder, 5ml/L 2M sodium acetate buffer, pH4.5
MSG is composed of 72 g Glycerol, 92 g Soybean powder (pH 6.0), water to 1 litre.
MU-1 glu is composed 260 g of glucose, 3 g of MgSO4 7H2O, 5 g of KH2PO4, 6 g of K2SO4, amyloglycosidase trace metal solution (pH 4.5), water to 1 liter.
Amyloglycosidase trace metal solution is 13.9 g/L of FeSO4 7H2O, 13.56 g/L of MnSO4 5H2O, 6.8 g/L of ZnCI2, 2.5 g/L of CuSO4-5H2O, 0.24 g/L of NiCI2-6H2O, 3 g/L of Citric acid H2O.
Transformation of Aspergillus niger
Transformation of Aspergillus species can be achieved using the general methods for yeast transformation.
Aspergillus niger host strain was inoculated to 100 ml of YPG medium supplemented with 10 mM uridine and incubated for 16 h at 32°C at 80 rpm. Pellets were collected and washed with 0.6 M KCI, and resuspended 20 ml 0.6 M KCI containing a commercial beta-glucanase product (GLUCANEX™,
Novozymes A/S, Bagsvaerd, Denmark) at a final concentration of 20 mg per ml. The suspension was incubated at 32°C at 80 rpm until protoplasts were formed, and then washed twice with STC buffer. The protoplasts were counted with a hematometer and resuspended and adjusted in an 8:2:0.1 solution of STC:STPC:DMSO to a final concentration of 2.5x107 protoplasts/ml. Approximately 4 pg of plasmid DNA was added to 100 pl of the protoplast suspension, mixed gently, and incubated on ice for 30 minutes. One ml of SPTC was added and the protoplast suspension was incubated for 20 minutes at 37°C. After the addition of 10 ml of 50°C Cove top agarose, the reaction was poured onto Cove agar plates and the plates were incubated at 30°C for 3 days. COVEII plus 5-Fluorocytosine top agarose was overlayed to the transformation plates for counter-selection of the strains.
PCR amplifications
Polymerase Chain Reaction (PCR) was carried out with KOD plus neo [ToYoBo] or PrimeSTAR Max DNA polymerase [TaKaRa], KOD plus reaction mix is shown in table 10. KOD plus PCR cycle is shown in table 11 . PrimeSTAR reaction mix is shown in table 12, and corresponding PCR cycle is shown in table 13.
MTP fermentation
Spores of the selected transformants were inoculated in 0.45 mL of 1/4YPG Ac,1 % SBP in 96 well MTP and cultivated at 30°C for 3 days at 900 rpm.
Shake flask (SF) fermentation
Spores are inoculated to 100 ml of MSG media in buffled flask and fermented on a rotary shaking table at 220 rpm, 30°C. Ten-milli liters of cultured broth was transferred tol OOml MU1 -glu with 4ml 50% urea in 500 ml baffled flasks and further fermented for 3 to 5 days.
Lab tank fermentation
Fermentation was done as fed-batch fermentation (H. Pedersen 2000, Appl Microbiol Biotechnol, 53: 272-277). Selected strains were pre-cultured in liquid media then grown mycelia were transferred to the tanks for further cultivation of protein production. Cultivation was done at pH 4 to 7, 30 to 34°C, for 6~8 days with the feeding of glucose and ammonium without over-dosing. Culture supernatant after centrifugation was used for yield evaluation. pNP assay
Dilute cultured sup and purified standard samples to appropriate concentrations by 100 mM Sodium acetate buffer pH5.5. Start enzymatic reaction by mixing 10 uL of diluted samples and 100 uL of substrate (10mM p-nitrophenyl-phosphate, disodium in 100 mM Sodium acetate pH5.5) in 96-well plate. After incubation at room temperature for 18 min, add 80 uL of stop solution (0.5M NA2CO3, pH>1 1), hold for 2 min and measure absorbance at 405nm. Enzyme activity was calculated from standard curve and indicated as relative activity to the reference.
FYT(B) assay
Phytase activity was measured as FYT(B) (Phytase (Braakii) Units), relative to an enzyme standard of a declared strength.
The samples and standard phytase reacts with sodium phytase (phytic acid dodeca sodium salt CeHeO24P6Nai2) and releases inorganic phosphate. Catalytic reaction parameters and conditions are shown in table 14. This phosphate is determined spectrophometically from a yellow complex formed by an acidic complex reagent containing molybdate/vanadate. The yellow complex is measured spectrophotometrically at a wavelength of 405 nm. The rate of phosphate release can be observed by Konelab (Thermo Fisher Scientific). Colorimetric reaction parameters and conditions are shown in Table 15. Table 16 shows reaction buffers and reagent compositions.
The activity of the enzyme samples is determined relative to the standard curve.
Calculation was conducted as follows:
Activity FYT(B)/g = (S x V x F)/ W
S = Reading from the standard curve in FYT(B)/ml
V = Volume of the measuring flask used in mL
F = Dilution factor for second dilution
W = Weight of sample in g
Semi-quantification by MALDI-TOF MS
Diluted culture supernatant was mixed with an internal standard protein (e.g EndoH) and MS spectrum was taken. The ratio of signal intensities of target (bovine alpha-lactalbumin) and reference (internal standard) was recorded as rough expression level of target. Standard curve was prepared using series of diluted purified bovine alpha-lactalbumin.
Size exclusion chromatography (SEC)
Quantification of ALAB was carried out by size exclusion chromatography. SEC protocol is modified from Pinho et al. Journal of Dairy Research, 79(2) 2012.
The invention described and claimed herein is not to be limited in scope by the specific aspects herein disclosed, since these aspects are intended as illustrations of several aspects of the invention. Any equivalent aspects are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims. In the case of conflict, the present disclosure including definitions will control.
LIST OF EMBODIMENTS
The invention is further defined by the following numbered embodiments:
[1] A nucleic acid construct comprising: a first polynucleotide encoding a signal peptide having a sequence identity of at least 80% to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49; and a) a second polynucleotide encoding an alpha-lactalbumin (ALAB) polypeptide having a sequence identity of at least 70% to the polypeptide sequence of SEQ ID NO:6; or b) a second polynucleotide encoding a polypeptide having phytase activity;
wherein the first polynucleotide and the second polynucleotide are operably linked in translational fusion.
[2] The nucleic acid construct according to embodiment 1 , wherein the second polynucleotide is located downstream from the first polynucleotide.
[3] The nucleic acid construct according to any of embodiment 1 or 2, wherein the signal peptide is a naturally occurring signal peptide, or a functional fragment or functional variant of a naturally occurring signal peptide.
[4] The nucleic acid construct according to any of embodiments 1 to 3, wherein the signal peptide is from a filamentous fungal glycosidase.
[4a] The nucleic acid construct according to any of embodiments 1 to 4, further comprising a third polynucleotide downstream of the first polynucleotide and upstream of the second polynucleotide.
[4b] The nucleic acid construct according to any previous embodiment, wherein the third polynucleotide is a non-coding intron.
[4c] The nucleic acid construct according to any previous embodiment, wherein the third polynucleotide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:17 (gtaagtaacatccactctgttctagtgccatgctgagattgtacag).
[4d] The nucleic acid construct according to any previous embodiment, comprising a polynucleotide sequence with a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:18 or SEQ ID NO:19.
[5] The nucleic acid construct according to any previous embodiment, wherein the nucleic acid construct further comprises a heterologous promoter, and wherein said promoter, the first polynucleotide, the second polynucleotide, and optionally the third polynucleotide, are operably linked.
[6] The nucleic acid construct according to any previous embodiment, wherein the promoter is a P3 promoter or a P3-based promoter, preferably the heterologous promoter is a tandem promoter comprising the P3 promoter or is a tandem promoter derived from the P3 promoter.
[7] The nucleic acid construct according to any previous embodiment, wherein the promoter is operably linked to an mRNA stabilizer region; preferably the mRNA stabilizer region is the crylllA mRNA stabilizer region.
[8] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is a naturally occurring signal peptide, or a functional fragment or functional variant of a naturally occurring signal peptide.
[9] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is from a glycosidase (EC 3.2.1).
[10] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a mannanase polypeptide (EC 3.2.1.78).
[11] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a b-transglycosidase polypeptide (EC 2.4.1.-).
[12] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a chitin b-1 ,3/1 ,6-glucanosyltransferase polypeptide (EC 2.4.1 .-) polypeptide.
[13] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from an endo-b-1 ,3-glucanase polypeptide or laminarinase polypeptide (EC 3.2.1 .39).
[14] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a polypeptide, such as a mannanase, transglycosidase, glycosyltransferase, laminarinase, or glucanase, expressed by a filamentous fungal host cell.
[15] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a polypeptide expressed by a Aspergillus host cell, such as an Aspergillus luchuensis.
[16] The nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:1 ,SEQ ID NO:3, SEQ ID NO:44, SEQ ID NO:46, or SEQ ID NO:48; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1 ,SEQ ID NO:3, SEQ ID NO:44, SEQ ID NO:46, or SEQ ID NO:48.
[17] The nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:1 ; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:1.
[18a] The nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:3; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:3.
[18b] The nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:44; most
preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:44.
[18c] The nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:46; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:46.
[18d] The nucleic acid construct according to any of the preceding embodiments, wherein the first polynucleotide encoding the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:48; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:48.
[19] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a glycosidase expressed by an Aspergillus species selected from the group consisting of Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus luchuensis, or Aspergillus oryzae.
[20] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide is obtained from a glycosidase expressed by Aspergillus luchuensis.
[21] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:2.
[22a] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:2.
[22b] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:45.
[22c] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:45.
[22d] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:47.
[22e] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:47.
[22f] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:49.
[22g] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:49.
[23] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:4.
[24] The nucleic acid construct according to any of the preceding embodiments, wherein the signal peptide comprises, consists essentially of, or consists of SEQ ID NO:4.
[24a] The nucleic acid construct according to any preceding embodiments, wherein the signal peptide consists of the amino acid sequence of SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49 with or without its C-terminal alanine, or a peptide fragment thereof that retains the ability to direct the polypeptide into or across a cell membrane.
[25] The nucleic acid construct according to any of the preceding embodiments, wherein the N- and/or C- terminal end of the signal peptide has been extended by addition of one or more amino acids.
[26] The nucleic acid construct according to any of embodiments 1 to 25, wherein the signal peptide is a fragment of the signal peptides of any of embodiments 1 to 25.
[27] The nucleic acid construct according to any of the preceding embodiments, wherein polynucleotide encoding the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO:5; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:5.
[28] The nucleic acid construct according to any of the preceding embodiments, wherein the alphalactalbumin polypeptide is a bovine alpha-lactalbumin.
[29] The nucleic acid construct according to any of the preceding embodiments, wherein the alphalactalbumin is a human alpha-lactalbumin.
[30] The nucleic acid construct according to any preceding embodiments, wherein the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO:6.
[31] The nucleic acid construct according to embodiment 30, wherein the alpha-lactalbumin polypeptide comprises, consists essentially of, or consists of the mature polypeptide of SEQ ID NO:6.
[32] The nucleic acid construct according to any of the preceding embodiments, wherein the N- and/or C- terminal end of the alpha-lactalbumin polypeptide has been extended by addition of one or more amino acids.
[33] The nucleic acid construct according to any of the preceding embodiments, wherein polynucleotide encoding the polypeptide having phytase activity has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide coding sequence of SEQ ID NO: 11 ; most preferably the polynucleotide comprises, consists essentially of, or consists of the mature polypeptide coding sequence of SEQ ID NO:11 .
[34] The nucleic acid construct according to any of the preceding embodiments, wherein the polypeptide having phytase activity is a bacterial polypeptide or variant thereof.
[35] The nucleic acid construct according to any of the preceding embodiments, wherein the polypeptide having phytase activity is EC 3.1.3.26.
[36] The nucleic acid construct according to any preceding embodiments, wherein the polypeptide having phytase activity has a sequence identity of at least 80%, e.g. at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO: 12.
[37] The nucleic acid construct according to embodiment 36, wherein the polypeptide having phytase activity comprises, consists essentially of, or consists of the mature polypeptide of SEQ ID NO: 12.
[38] The nucleic acid construct according to any of the preceding embodiments, wherein the N- and/or C- terminal end of the polypeptide having phytase activity has been extended by addition of one or more amino acids.
[39] The nucleic acid construct according to any of embodiments 1 to 38, wherein the signal peptide is a fragment of the signal peptides of any of embodiments 1 to 38.
[40] An expression vector comprising a nucleic acid construct according to any of embodiments 1 to 39.
[41] A fungal host cell comprising in its genome: a) a nucleic acid construct according to any of embodiments 1 to 39; and/or b) an expression vector according to embodiment 40.
[42] The host cell of embodiment 41 , wherein the fungal host cell is a filmentous fungal host cell, e.g., an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell, in particular, an
Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucormiehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Talaromyces emersonii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
[43] The host cell of any of embodiments 41 to 42, wherein the host cell is an Aspergillus cell.
[44] The host cell of any of embodiments 41 to 43, wherein the host cell is an Aspergillus niger or Aspergillus oryzae host cell.
[45] The host cell of any of embodiments 41 to 44, wherein the host cell comprises at least two copies of the nucleic acid construct and/or the expression vector, such as two copies, three copies, four copies or more than four copies.
[46] A method of producing a alpha-lactalbuming (ALAB) polypeptide, the method comprising: a) cultivating a host cell according to any of embodiments 41 to 45 under conditions conducive for production of the ALAB polypeptide; and optionally b) recovering the ALAB polypeptide.
[47] A method of producing a polypeptide having phytase activity, the method comprising: a) cultivating a host cell according to any of embodiments 41 to 45 under conditions conducive for production of the polypeptide having phytase activity; and optionally b) recovering the polypeptide having phytase activity.
[48] A fusion polypeptide, comprising a) a signal peptide having at least 60% sequence identity to SEQ ID NO:45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49, and b) an alpha-lactalbumin polypeptide having at least 60% sequence identity to SEQ ID NO: 6, or a polypeptide having phytase activity having at least 60% sequence identity to SEQ ID NO:8.
[49] The fusion polypeptide according to embodiment 48, wherein the signal peptide is located upstream of the alpha-lactalbumin polypeptide or upstream of the polypeptide having phytase activity.
[50] The fusion polypeptide according to any of embodiments 48 to 49, wherein the signal peptide is located at the N-terminal end of the alpha-lactalbumin polypeptide or at the N-terminal end of the polypeptide having phytase activity.
[51] The fusion polypeptide according to any of embodiments 48 to 50, wherein the alpha-lactalbumin polypeptide is selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 8;
(b) a polypeptide having at least 60% sequence identity to SEQ ID NO: 10;
(c) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 7 or SEQ ID NO: 9;
(d) a polypeptide derived from SEQ ID NO: 8, SEQ ID NO:10, SEQ ID NO:18 or SEQ ID NO:19, a mature polypeptide of SEQ ID NO: 8, SEQ ID NO:10, SEQ ID NO:18 or SEQ ID NO:19, having 1-30 alterations, e.g., substitutions, deletions and/or insertions at one or more positions, e.g., 1 or 2 or 3 or 4 or 5 or 6 or 7 or 8 or 9 or 10 or 11 or 12 or 13 or 14 or 15 or 16 or 17 or 18 or 19 or 20 or 21 or 22 or 23 or 24 or 25 or 26 or 27 or 28 or 29 or 30 alterations, in particular substitutions;
(e) a polypeptide derived from the polypeptide of (a), (b), (c), or (d) wherein the N- and/or C- terminal end has been extended by addition of one or more amino acids;
(f) a fragment of the polypeptide of (a), (b), (c), (d), or (e), and
(g) the fragment of (f) wherein the polypeptide has 1 - 10 deletions at the N-terminal end.
[52] The fusion polypeptide according to any of embodiments 48 to 50, wherein the polypeptide having phytase activity is selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 14;
(b) a polypeptide having at least 60% sequence identity to SEQ ID NO: 16;
(c) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 13 or SEQ ID NO: 15;
(d) a polypeptide derived from SEQ ID NO: 14 or SEQ ID NO:16, a mature polypeptide of SEQ ID NO:14 or SEQ ID NO:16, having 1-30 alterations, e.g., substitutions, deletions and/or insertions at one or more positions, e.g. , 1 or 2 or 3 or 4 or 5 or 6 or 7 or 8 or 9 or 10 or 11 or 12 or 13 or 14 or 15 or 16 or 17 or 18 or 19 or 20 or 21 or 22 or 23 or 24 or 25 or 26 or 27 or 28 or 29 or 30 alterations, in particular substitutions;
(e) a polypeptide derived from the polypeptide of (a), (b), (c), or (d) wherein the N- and/or C- terminal end has been extended by addition of one or more amino acids;
(f) a fragment of the polypeptide of (a), (b), (c), (d), or (e), and
(g) the fragment of (f) wherein the polypeptide has 1 - 10 deletions at the N-terminal end.
Claims
1 . A nucleic acid construct comprising: a first polynucleotide encoding a signal peptide having a sequence identity of at least 80% to SEQ ID NO: 45, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49; and a second polynucleotide encoding an alpha-lactalbumin (ALAB) polypeptide having a sequence identity of at least 70% to the polypeptide sequence of SEQ ID NO:6; wherein the first polynucleotide and the second polynucleotide are operably linked in translational fusion.
2. The nucleic acid construct according to claim 1 , wherein the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:45.
3. The nucleic acid construct according to claim 1 , wherein the signal peptide has a sequence identity of at least 85%, e.g. at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to SEQ ID NO:2.
4. The nucleic acid construct according to any preceding claims, wherein the signal peptide consists ofthe amino acid sequence of SEQ ID NO:2 or SEQ ID NO:45 with orwithout its C-terminal alanine, or a peptide fragment thereof that retains the ability to direct the polypeptide into or across a cell membrane.
5. The nucleic acid construct according to any preceding claims, wherein the alpha-lactalbumin polypeptide has a sequence identity of at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, to the mature polypeptide of SEQ ID NO:6.
6. An expression vector comprising a nucleic acid construct according to any of claims 1 to 5.
7. A fungal host cell comprising in its genome: a) a nucleic acid construct according to any of claims 1 to 5; and/or b) an expression vector according to claim 6.
8. A method of producing an alpha-lactalbumin (ALAB) polypeptide, the method comprising: a) cultivating a host cell according to claim 7 under conditions conducive for production ofthe ALAB polypeptide; and optionally b) recovering the ALAB polypeptide.
A fusion polypeptide, comprising: a signal peptide having at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, sequence identity to SEQ ID NO: 45,
SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO: 47, or SEQ ID NO: 49, and an alpha-lactalbumin polypeptide having at least 60%, e.g. at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, sequence identity to SEQ ID NO: 6.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP22195778 | 2022-09-15 | ||
EP22195778.0 | 2022-09-15 | ||
EP22202556.1 | 2022-10-19 | ||
EP22202556 | 2022-10-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024056643A1 true WO2024056643A1 (en) | 2024-03-21 |
Family
ID=88060568
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2023/074982 WO2024056643A1 (en) | 2022-09-15 | 2023-09-12 | Fungal signal peptides |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024056643A1 (en) |
Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0238023A2 (en) | 1986-03-17 | 1987-09-23 | Novo Nordisk A/S | Process for the production of protein products in Aspergillus oryzae and a promoter for use in Aspergillus |
WO1990015861A1 (en) | 1989-06-13 | 1990-12-27 | Genencor International, Inc. | A method for killing cells without cell lysis |
WO1992006204A1 (en) | 1990-09-28 | 1992-04-16 | Ixsys, Inc. | Surface expression libraries of heteromeric receptors |
US5223409A (en) | 1988-09-02 | 1993-06-29 | Protein Engineering Corp. | Directed evolution of novel binding proteins |
WO1994025612A2 (en) | 1993-05-05 | 1994-11-10 | Institut Pasteur | Nucleotide sequences for the control of the expression of dna sequences in a cellular host |
WO1995017413A1 (en) | 1993-12-21 | 1995-06-29 | Evotec Biosystems Gmbh | Process for the evolutive design and synthesis of functional polymers based on designer elements and codes |
WO1995022625A1 (en) | 1994-02-17 | 1995-08-24 | Affymax Technologies N.V. | Dna mutagenesis by random fragmentation and reassembly |
WO1995033836A1 (en) | 1994-06-03 | 1995-12-14 | Novo Nordisk Biotech, Inc. | Phosphonyldipeptides useful in the treatment of cardiovascular diseases |
WO1996014413A1 (en) * | 1994-11-02 | 1996-05-17 | Agennix, Inc. | Expression of processed recombinant lactoferrin and lactoferrin polypeptide fragments from a fusion product in aspergillus |
WO2010096673A1 (en) | 2009-02-20 | 2010-08-26 | Danisco Us Inc. | Fermentation broth formulations |
WO2012160093A1 (en) | 2011-05-23 | 2012-11-29 | Novozymes A/S | Simultaneous site-specific integrations of multiple gene-copies in filamentous fungi |
WO2015048332A2 (en) * | 2013-09-25 | 2015-04-02 | Pronutria, Inc. | Secreted nutritive polypeptides and formulations thereof, and methods of production and use thereof |
WO2020219596A1 (en) * | 2019-04-22 | 2020-10-29 | Perfect Day, Inc. | Recombinant milk proteins and compositions comprising the same |
WO2021013464A1 (en) * | 2019-07-25 | 2021-01-28 | Novozymes A/S | Filamentous fungal expression system |
CN114774392A (en) * | 2022-04-12 | 2022-07-22 | 昆明学院 | Mannase and application thereof |
-
2023
- 2023-09-12 WO PCT/EP2023/074982 patent/WO2024056643A1/en unknown
Patent Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0238023A2 (en) | 1986-03-17 | 1987-09-23 | Novo Nordisk A/S | Process for the production of protein products in Aspergillus oryzae and a promoter for use in Aspergillus |
US5223409A (en) | 1988-09-02 | 1993-06-29 | Protein Engineering Corp. | Directed evolution of novel binding proteins |
WO1990015861A1 (en) | 1989-06-13 | 1990-12-27 | Genencor International, Inc. | A method for killing cells without cell lysis |
WO1992006204A1 (en) | 1990-09-28 | 1992-04-16 | Ixsys, Inc. | Surface expression libraries of heteromeric receptors |
WO1994025612A2 (en) | 1993-05-05 | 1994-11-10 | Institut Pasteur | Nucleotide sequences for the control of the expression of dna sequences in a cellular host |
WO1995017413A1 (en) | 1993-12-21 | 1995-06-29 | Evotec Biosystems Gmbh | Process for the evolutive design and synthesis of functional polymers based on designer elements and codes |
WO1995022625A1 (en) | 1994-02-17 | 1995-08-24 | Affymax Technologies N.V. | Dna mutagenesis by random fragmentation and reassembly |
WO1995033836A1 (en) | 1994-06-03 | 1995-12-14 | Novo Nordisk Biotech, Inc. | Phosphonyldipeptides useful in the treatment of cardiovascular diseases |
WO1996014413A1 (en) * | 1994-11-02 | 1996-05-17 | Agennix, Inc. | Expression of processed recombinant lactoferrin and lactoferrin polypeptide fragments from a fusion product in aspergillus |
WO2010096673A1 (en) | 2009-02-20 | 2010-08-26 | Danisco Us Inc. | Fermentation broth formulations |
WO2012160093A1 (en) | 2011-05-23 | 2012-11-29 | Novozymes A/S | Simultaneous site-specific integrations of multiple gene-copies in filamentous fungi |
WO2015048332A2 (en) * | 2013-09-25 | 2015-04-02 | Pronutria, Inc. | Secreted nutritive polypeptides and formulations thereof, and methods of production and use thereof |
WO2020219596A1 (en) * | 2019-04-22 | 2020-10-29 | Perfect Day, Inc. | Recombinant milk proteins and compositions comprising the same |
WO2021013464A1 (en) * | 2019-07-25 | 2021-01-28 | Novozymes A/S | Filamentous fungal expression system |
CN114774392A (en) * | 2022-04-12 | 2022-07-22 | 昆明学院 | Mannase and application thereof |
Non-Patent Citations (56)
Title |
---|
"Molecular Biological Methods for Bacillus", 1990, JOHN WILEY AND SONS |
"Soc. App. Bacteriol. Symposium Series", 1980, article "Biology and Activities of Yeast" |
ARMENTEROS ET AL., NAT. BIOTECHNOL., vol. 37, 2019, pages 420 - 423 |
BALLEZA ET AL., FEMS MICROBIOL REV, vol. 33, no. 1, 2009, pages 133 - 151 |
BOWIESAUER, PROC. NATL. ACAD. SCI. USA, vol. 86, 1989, pages 2152 - 2156 |
CARTER ET AL., PROTEINS: STRUCTURE, FUNCTION, AND GENETICS, vol. 6, 1989, pages 240 - 248 |
CHRISTENSEN ET AL., BIO/TECHNOLOGY, vol. 6, 1988, pages 1419 - 1422 |
COLLINS-RACIE ET AL., BIOTECHNOLOGY, vol. 13, 1995, pages 982 - 987 |
CONTRERAS ET AL., BIOTECHNOLOGY, vol. 9, 1991, pages 378 - 381 |
COOPER ET AL., EMBO J., vol. 12, 1993, pages 2575 - 2583 |
CUNNINGHAMWELLS, SCIENCE, vol. 244, 1989, pages 1081 - 1085 |
DATABASE Geneseq [online] 29 September 2022 (2022-09-29), "Aspergillus niger mannase ABL-03475 protein, SEQ ID:2.", retrieved from EBI accession no. GSP:BLN52566 Database accession no. BLN52566 * |
DATABASE USPTO Proteins [online] 10 February 2020 (2020-02-10), "Sequence 4765 from patent US 10463711.", XP093104377, retrieved from EBI accession no. USPOP:QHY33248 Database accession no. QHY33248 * |
DAWSON ET AL., SCIENCE, vol. 266, 1994, pages 776 - 779 |
DERBYSHIRE ET AL., GENE, vol. 46, 1986, pages 145 |
EATON ET AL., BIOCHEMISTRY, vol. 25, 1986, pages 505 - 512 |
ENGELEN, A. J. ET AL.: "Simple and rapid determination of phytase activity", J. AOAC INT., vol. 77, 1994, pages 760 - 764 |
EUR. J. BIOCHEM., vol. 223, 1994, pages 1 - 5 |
EUR. J. BIOCHEM., vol. 232, 1995, pages 1 - 6 |
EUR. J. BIOCHEM., vol. 237, 1996, pages 1 - 5 |
EUR. J. BIOCHEM., vol. 250, 1997, pages 1 - 6 |
EUR. J. BIOCHEM., vol. 264, 1999, pages 610 - 650 |
FORD ET AL., PROTEIN EXPRESSION AND PURIFICATION, vol. 2, 1991, pages 95 - 107 |
GEISBERG ET AL., CELL, vol. 156, no. 4, pages 812 - 824 |
GUOSHERMAN, MOL. CELLULAR BIOL., vol. 15, 1995, pages 5983 - 5990 |
H. PEDERSEN, APPL MICROBIOL BIOTECHNOL, vol. 53, 2000, pages 272 - 277 |
HAWKSWORTH ET AL.: "In, Ainsworth and Bisby's Dictionary of The Fungi", 1995, CAB INTERNATIONAL, UNIVERSITY PRESS |
HILTON ET AL., J. BIOL. CHEM., vol. 271, 1996, pages 4699 - 4708 |
HUE ET AL., JOURNAL OF BACTERIOLOGY, vol. 177, 1995, pages 3465 - 3471 |
LABROU, PROTEIN DOWNSTREAM PROCESSING, vol. 1129, 2014, pages 3 - 10 |
LI ET AL., MICROBIAL CELL FACTORIES, vol. 16, 2017, pages 168 |
LOWMAN ET AL., BIOCHEMISTRY, vol. 30, 1991, pages 10832 - 10837 |
LUBERTOZZIKEASLING, BIOTECHN. ADVANCES, vol. 27, 2009, pages 53 - 75 |
MADHAVAN ARAVIND ET AL: "Signal peptides from filamentous fungi efficiently mediate the secretion of recombinant proteins inKluyveromyces lactis", BIOCHEMICAL ENGINEERING JOURNAL, ELSEVIER, AMSTERDAM, NL, vol. 102, 12 March 2015 (2015-03-12), pages 31 - 37, XP029252972, ISSN: 1369-703X, DOI: 10.1016/J.BEJ.2015.03.008 * |
MARTIN ET AL., J. IND. MICROBIOL. BIOTECHNOL., vol. 3, 2003, pages 568 - 576 |
MOROZOV ET AL., EUKARYOTIC CELL, vol. 5, no. 11, 2006, pages 1838 - 1846 |
MUKHERJEE ET AL., TRICHODERMA: BIOLOGY AND APPLICATIONS, 2013 |
NEEDLEMANWUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 - 453 |
NER ET AL., DNA, vol. 7, 1988, pages 127 |
NESS ET AL., NATURE BIOTECHNOLOGY, vol. 17, 1999, pages 893 - 896 |
OWJI ET AL., EUR. J. CELL BIOL., vol. 97, 2018, pages 422 - 441 |
PENG, C. ET AL., FRONT. BIOENG. BIOTECHNOL., vol. 7, 2019, pages 139 |
PINHO ET AL., JOURNAL OF DAIRY RESEARCH, vol. 79, no. 2, 2012 |
RASMUSSEN-WILSON ET AL., APPL. ENVIRON. MICROBIOL., vol. 63, 1997, pages 3488 - 3493 |
REIDHAAR-OLSONSAUER, SCIENCE, vol. 241, 1988, pages 53 - 57 |
RICE ET AL.: "EMBOSS: The European Molecular Biology Open Software Suite", TRENDS GENET., vol. 16, 2000, pages 276 - 277, XP004200114, DOI: 10.1016/S0168-9525(00)02024-2 |
ROMANOS ET AL., YEAST, vol. 8, 1992, pages 423 - 488 |
SAMBROOK ET AL.: "Molecular cloning: A laboratory manual", 1989, COLD SPRING HARBOR LAB. |
SCHMOLLDATTENBOCK: "Gene Expression Systems in Fungi: Advancements and Applications", FUNGAL BIOLOGY, 2016 |
SESHASAYEE ET AL., SUBCELL BIOCHEM, vol. 52, 2011, pages 7 - 23 |
SMOLKE ET AL.: "Synthetic Biology: Parts, Devices and Applications", CONSTITUTIVE AND REGULATED PROMOTERS IN YEAST: HOW TO DESIGN AND MAKE USE OF PROMOTERS IN S. CEREVISIAE, 2018 |
STEVENS, DRUG DISCOVERY WORLD, vol. 4, 2003, pages 35 - 48 |
SVETINA ET AL., J. BIOTECHNOL., vol. 76, 2000, pages 245 - 251 |
WINGFIELD, CURRENT PROTOCOLS IN PROTEIN SCIENCE, vol. 80, no. 1, 2015 |
YELTON ET AL., PROC. NATL. ACAD. SCI. USA, vol. 81, 1984, pages 1470 - 1474 |
YUAN YG ET AL., J ANAL METHODS CHEM. 2014, 2014, pages 281031 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2044203B1 (en) | Methods for increasing expression of genes in a fungal cell | |
US20160304905A1 (en) | Fungal Gene Library By Double Split-Marker Integration | |
US10619144B2 (en) | Regulated PepC expression | |
US11046736B2 (en) | Filamentous fungal host | |
US9701970B2 (en) | Promoters for expressing genes in a fungal cell | |
CN108603181B (en) | Phytase and use thereof | |
WO2009071530A1 (en) | Fungal pepc inhibitor | |
US8802412B2 (en) | Polynucleotides having promoter activity | |
US20220267783A1 (en) | Filamentous fungal expression system | |
EP3898985B1 (en) | Tandem protein expression | |
WO2024056643A1 (en) | Fungal signal peptides | |
US11667922B2 (en) | Fungal chaperone proteins | |
US20220025422A1 (en) | Improved Filamentous Fungal Host Cells | |
US8735563B2 (en) | Polynucleotides having promoter activity | |
US9657300B2 (en) | Polynucleotides having leader sequence function | |
WO2022090555A1 (en) | Leader peptides and polynucleotides encoding the same | |
US20230407273A1 (en) | Glycosyltransferase variants for improved protein production | |
WO2017211803A1 (en) | Co-expression of heterologous polypeptides to increase yield |