WO2007118836A1 - Bi-directional selection markers with improved activity - Google Patents
Bi-directional selection markers with improved activity Download PDFInfo
- Publication number
- WO2007118836A1 WO2007118836A1 PCT/EP2007/053549 EP2007053549W WO2007118836A1 WO 2007118836 A1 WO2007118836 A1 WO 2007118836A1 EP 2007053549 W EP2007053549 W EP 2007053549W WO 2007118836 A1 WO2007118836 A1 WO 2007118836A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- polypeptide
- sequence
- gene
- interest
- seq
- Prior art date
Links
- 230000000694 effects Effects 0.000 title claims abstract description 13
- 230000001976 improved effect Effects 0.000 title description 4
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 85
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 67
- 229920001184 polypeptide Polymers 0.000 claims abstract description 65
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 65
- 239000003550 marker Substances 0.000 claims abstract description 37
- FVTWJXMFYOXOKK-UHFFFAOYSA-N 2-fluoroacetamide Chemical compound NC(=O)CF FVTWJXMFYOXOKK-UHFFFAOYSA-N 0.000 claims abstract description 24
- 238000000034 method Methods 0.000 claims abstract description 24
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 18
- 108010048241 acetamidase Proteins 0.000 claims abstract description 15
- 238000004519 manufacturing process Methods 0.000 claims abstract description 14
- 150000001413 amino acids Chemical class 0.000 claims abstract description 13
- 150000001875 compounds Chemical class 0.000 claims abstract description 7
- DLFVBJFMPXGRIB-UHFFFAOYSA-N Acetamide Chemical compound CC(N)=O DLFVBJFMPXGRIB-UHFFFAOYSA-N 0.000 claims description 64
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 34
- 241000228150 Penicillium chrysogenum Species 0.000 claims description 32
- 108020004414 DNA Proteins 0.000 claims description 30
- 108091033319 polynucleotide Proteins 0.000 claims description 27
- 102000040430 polynucleotide Human genes 0.000 claims description 27
- 239000002157 polynucleotide Substances 0.000 claims description 27
- 230000014509 gene expression Effects 0.000 claims description 20
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 claims description 18
- 230000002538 fungal effect Effects 0.000 claims description 15
- 238000012217 deletion Methods 0.000 claims description 13
- 230000037430 deletion Effects 0.000 claims description 13
- 229910052757 nitrogen Inorganic materials 0.000 claims description 9
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 6
- 229910052799 carbon Inorganic materials 0.000 claims description 6
- 230000001131 transforming effect Effects 0.000 claims description 4
- 230000008569 process Effects 0.000 abstract description 2
- 238000010276 construction Methods 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 49
- 108700023418 Amidases Proteins 0.000 description 27
- 150000007523 nucleic acids Chemical group 0.000 description 26
- 101150069003 amdS gene Proteins 0.000 description 19
- 241000233866 Fungi Species 0.000 description 18
- 239000012634 fragment Substances 0.000 description 17
- 239000013598 vector Substances 0.000 description 17
- 241000351920 Aspergillus nidulans Species 0.000 description 15
- 102000005922 amidase Human genes 0.000 description 13
- 230000009466 transformation Effects 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- 235000001014 amino acid Nutrition 0.000 description 10
- 239000013604 expression vector Substances 0.000 description 10
- 230000012010 growth Effects 0.000 description 10
- 108020004999 messenger RNA Proteins 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 229940024606 amino acid Drugs 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 238000003752 polymerase chain reaction Methods 0.000 description 8
- 235000018102 proteins Nutrition 0.000 description 8
- 102000004169 proteins and genes Human genes 0.000 description 8
- 238000006467 substitution reaction Methods 0.000 description 8
- 108091005804 Peptidases Proteins 0.000 description 7
- 230000010354 integration Effects 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 240000006439 Aspergillus oryzae Species 0.000 description 6
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 6
- 239000004365 Protease Substances 0.000 description 6
- 101150046337 amdA gene Proteins 0.000 description 6
- 239000013599 cloning vector Substances 0.000 description 6
- 238000009396 hybridization Methods 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 108010065511 Amylases Proteins 0.000 description 5
- 241000228245 Aspergillus niger Species 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 241000228143 Penicillium Species 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 239000012895 dilution Substances 0.000 description 5
- 238000010790 dilution Methods 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- 239000002773 nucleotide Substances 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 239000004382 Amylase Substances 0.000 description 4
- 102000013142 Amylases Human genes 0.000 description 4
- 241000228212 Aspergillus Species 0.000 description 4
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 4
- 235000019418 amylase Nutrition 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000008488 polyadenylation Effects 0.000 description 4
- 102000004092 Amidohydrolases Human genes 0.000 description 3
- 108090000531 Amidohydrolases Proteins 0.000 description 3
- 241000123346 Chrysosporium Species 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 241000223259 Trichoderma Species 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 239000002207 metabolite Substances 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 244000005700 microbiome Species 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 241001019659 Acremonium <Plectosphaerellaceae> Species 0.000 description 2
- 108010037870 Anthranilate Synthase Proteins 0.000 description 2
- 101000690713 Aspergillus niger Alpha-glucosidase Proteins 0.000 description 2
- 101000757144 Aspergillus niger Glucoamylase Proteins 0.000 description 2
- 241000223651 Aureobasidium Species 0.000 description 2
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 241000223221 Fusarium oxysporum Species 0.000 description 2
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 2
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 2
- 102100022624 Glucoamylase Human genes 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 241000223198 Humicola Species 0.000 description 2
- 102100027612 Kallikrein-11 Human genes 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 241001344133 Magnaporthe Species 0.000 description 2
- 241000235395 Mucor Species 0.000 description 2
- 241000226677 Myceliophthora Species 0.000 description 2
- 241000233892 Neocallimastix Species 0.000 description 2
- 241000221960 Neurospora Species 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241001236817 Paecilomyces <Clavicipitaceae> Species 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 241000235379 Piromyces Species 0.000 description 2
- 241000222480 Schizophyllum Species 0.000 description 2
- 241000228341 Talaromyces Species 0.000 description 2
- 241000228178 Thermoascus Species 0.000 description 2
- 241001494489 Thielavia Species 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 241001149964 Tolypocladium Species 0.000 description 2
- 101710152431 Trypsin-like protease Proteins 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000003797 essential amino acid Substances 0.000 description 2
- 235000020776 essential amino acid Nutrition 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 101150093025 pepA gene Proteins 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 238000010187 selection method Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 230000028070 sporulation Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 230000004906 unfolded protein response Effects 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 101710081721 Alpha-amylase A Proteins 0.000 description 1
- 101710081719 Alpha-amylase B Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241001513093 Aspergillus awamori Species 0.000 description 1
- 241001225321 Aspergillus fumigatus Species 0.000 description 1
- 101100216041 Aspergillus niger GLAA gene Proteins 0.000 description 1
- 241000131386 Aspergillus sojae Species 0.000 description 1
- 101100317631 Aspergillus tubingensis xynA gene Proteins 0.000 description 1
- 101100277447 Bacillus subtilis (strain 168) degQ gene Proteins 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 101150092493 CREA gene Proteins 0.000 description 1
- 102100035882 Catalase Human genes 0.000 description 1
- 108010053835 Catalase Proteins 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 1
- 241001674013 Chrysosporium lucknowense Species 0.000 description 1
- CNZIQHGDUXRUJS-UHFFFAOYSA-N Cyclopiazonic acid Natural products CC(=C/1C(=O)C2C3C(Cc4cccc5[nH]cc3c45)C(C)(C)N2C1=O)O CNZIQHGDUXRUJS-UHFFFAOYSA-N 0.000 description 1
- NBSCHQHZLSJFNQ-GASJEMHNSA-N D-Glucose 6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@H]1O NBSCHQHZLSJFNQ-GASJEMHNSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 101100317179 Dictyostelium discoideum vps26 gene Proteins 0.000 description 1
- 101100273894 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) celB gene Proteins 0.000 description 1
- 101100407639 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) prtB gene Proteins 0.000 description 1
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 1
- 108010046276 FLP recombinase Proteins 0.000 description 1
- 241000223218 Fusarium Species 0.000 description 1
- 101150108358 GLAA gene Proteins 0.000 description 1
- VFRROHXSMXFLSN-UHFFFAOYSA-N Glc6P Natural products OP(=O)(O)OCC(O)C(O)C(O)C(O)C=O VFRROHXSMXFLSN-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- 108010015776 Glucose oxidase Proteins 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102100024295 Maltase-glucoamylase Human genes 0.000 description 1
- 241000233654 Oomycetes Species 0.000 description 1
- 101000847770 Penicillium chrysogenum N-(5'-phosphoribosyl)anthranilate isomerase Proteins 0.000 description 1
- 102000002508 Peptide Elongation Factors Human genes 0.000 description 1
- 108010068204 Peptide Elongation Factors Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 108010039203 Tripeptidyl-Peptidase 1 Proteins 0.000 description 1
- 102100034197 Tripeptidyl-peptidase 1 Human genes 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 229940117913 acrylamide Drugs 0.000 description 1
- CSGFFYNMTALICU-ZWNOBZJWSA-N adipyl-7-aminodesacetoxycephalosporanic acid Natural products CC1=C(N2[C@H](SC1)[C@H](NC(=O)CCCCC(O)=O)C2=O)C(O)=O CSGFFYNMTALICU-ZWNOBZJWSA-N 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010028144 alpha-Glucosidases Proteins 0.000 description 1
- CNZIQHGDUXRUJS-DQYPLSBCSA-N alpha-cyclopiazonic acid Natural products CC(O)=C1C(=O)[C@@H]2[C@@H]3[C@@H](Cc4cccc5[nH]cc3c45)C(C)(C)N2C1=O CNZIQHGDUXRUJS-DQYPLSBCSA-N 0.000 description 1
- 101150078331 ama-1 gene Proteins 0.000 description 1
- 101150009288 amyB gene Proteins 0.000 description 1
- 101150075954 apeB gene Proteins 0.000 description 1
- -1 areA Proteins 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 150000001508 asparagines Chemical class 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 229940091771 aspergillus fumigatus Drugs 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- ZKSIPEYIAHUPNM-ZEQRLZLVSA-N butobendine Chemical compound C([C@H](CC)N(C)CCN(C)[C@@H](CC)COC(=O)C=1C=C(OC)C(OC)=C(OC)C=1)OC(=O)C1=CC(OC)=C(OC)C(OC)=C1 ZKSIPEYIAHUPNM-ZEQRLZLVSA-N 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 101150072516 cbhA gene Proteins 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 101150038498 eglB gene Proteins 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 101150073906 gpdA gene Proteins 0.000 description 1
- 101150095733 gpsA gene Proteins 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000126 in silico method Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- BEJNERDRQOWKJM-UHFFFAOYSA-N kojic acid Chemical compound OCC1=CC(=O)C(O)=CO1 BEJNERDRQOWKJM-UHFFFAOYSA-N 0.000 description 1
- 229960004705 kojic acid Drugs 0.000 description 1
- WZNJWVWKTVETCG-UHFFFAOYSA-N kojic acid Natural products OC(=O)C(N)CN1C=CC(=O)C(O)=C1 WZNJWVWKTVETCG-UHFFFAOYSA-N 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- OYKBQNOPCSXWBL-SNAWJCMRSA-N n-hydroxy-3-[(e)-3-(hydroxyamino)-3-oxoprop-1-enyl]benzamide Chemical compound ONC(=O)\C=C\C1=CC=CC(C(=O)NO)=C1 OYKBQNOPCSXWBL-SNAWJCMRSA-N 0.000 description 1
- 230000001937 non-anti-biotic effect Effects 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- 101150100654 pacC gene Proteins 0.000 description 1
- 101150035909 pepB gene Proteins 0.000 description 1
- 101150094986 pepC gene Proteins 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 101150023641 ppc gene Proteins 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 229930010796 primary metabolite Natural products 0.000 description 1
- 101150115781 prtT gene Proteins 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 229930000044 secondary metabolite Natural products 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000012289 standard assay Methods 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 108091008023 transcriptional regulators Proteins 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 108010039189 tripeptidyl-peptidase 2 Proteins 0.000 description 1
- 101150016309 trpC gene Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 101150077833 xlnA gene Proteins 0.000 description 1
- 101150021205 xlnB gene Proteins 0.000 description 1
- 101150068226 xlnC gene Proteins 0.000 description 1
- 101150011516 xlnD gene Proteins 0.000 description 1
- 101150034760 xlnR gene Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
- C12N9/80—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5) acting on amide bonds in linear amides (3.5.1)
Definitions
- the present invention relates to new amdS genes for use in an improved method for the transformation of filamentous fungi for obtaining selection marker gene free recombinant strains.
- the transformed microorganism would contain only the desired modifications at the gene(s)-of-interest and as little as possible or remnants of other DNA fragments used during the transformation experiment or during cloning.
- this selectable marker is a so-called bi-directional marker, meaning that besides the positive selection for its presence (forward selection) also a reverse selection selecting for the absence of the gene can be applied, using fluoroacetamide (see Figure 1 ). This is successfully applied in species from the genera Aspergillus, Penicillium, Saccharomyces and Trichoderma.
- the present invention surprisingly shows that the reverse selection, i.e. the deletion of an amidase marker gene from microbial strains, works with 100% efficiency when using the amidase encoding genes according to the invention. As a result, sequential modification of industrial production strains is feasible with high efficiency and throughput.
- the present invention provides a polypeptide displaying acetamidase activity and providing a reverse selection on fluoroacetamide. Furthermore, the present invention provides a polynucleotide encoding the above polypeptide and the use of said polynucleotide.
- the present invention provides a polypeptide selected from the group consisting of a polypeptide having an amino acid sequence according to SEQ ID NO 3, a polypeptide having an amino acid sequence according to SEQ ID NO 6, a polypeptide having an amino acid that is substantially homologous to the sequence of SEQ ID NO 3 and a polypeptide having an amino acid that is substantially homologous to the sequence of SEQ ID NO 6, the polypeptide displaying acetamidase activity and that providing a reverse selection on fluoroacetamide with an efficiency of at least 95%, preferably at least 96%, more preferably at least 97%, more preferably at least 98%, more preferably at least 99%, most preferably 100%.
- the present invention further provides in a second aspect a polynucleotide encoding a polypeptide of the first aspect.
- a specific DNA sequence is provided encoding the polypeptide of SEQ ID NO 3, i.e. SEQ ID NO 1 or 2, or encoding the polypeptide of SEQ ID NO 6, i.e. SEQ ID NO 4 or 5.
- SEQ ID NO 7 is a variant coding sequence derived from the genomic DNA of SEQ ID NO 4, due to an event of alternative splicing
- SEQ ID NO 8 is the protein encoded by the alternative coding sequence.
- both polypeptides i.e. SEQ ID NO 6 and SEQ ID NO 8 are identical up to amino acid 365.
- an efficiency of at least 95% means that at least 95% of the strains resulting from the reverse selection on fluoroacetamide has the amidase gene deleted from the genome. This is typically observed upon further analyzing isolated colonies obtained after selection on standard fluoroacetamide plates containing 32 m M fluoroacetamide, 5 mM urea (as N-source) and 1.1 % glucose (see WO 9706261 ).
- a polypeptide having an amino acid sequence that is "substantially homologous" to the sequence of SEQ ID NO 3 and/or 6 is defined as a polypeptide having an amino acid sequence possessing a degree of identity to the specified amino acid sequence of at least 75%, preferably at least 80%, more preferably at least 85%, still more preferably at least 90%, still preferably at least 95%, still more preferably at least 96%, still more preferably at least 97%, still more preferably at least 98% and most preferably at least 99%, the substantially homologous peptide displaying acetamidase activity and providing a reverse selection on fluoroacetamide with an efficiency of at least 95%.
- a substantially homologous polypeptide may encompass polymorphisms that may exist in cells from different populations or within a population due to natural allelic or intra-strain variation.
- a substantially homologous polypeptide may further be derived from a fungus other than the fungus where the specified amino acid and/or DNA sequence originates from, or may be encoded by an artificially designed and synthesized DNA sequence.
- DNA sequences related to the specified DNA sequences and obtained by degeneration of the genetic code are also part of the invention.
- Homologues may also encompass biologically active fragments of the full-length sequence, for example the polypeptide with an amino acid sequence of SEQ ID NO 8.
- the degree of identity between two amino acid sequences refers to the percentage of amino acids that are identical between the two sequences.
- the degree of identity is determined using the BLAST algorithm, which is described in Altschul, et al., J. MoI. Biol. 215: 403-410 (1990).
- Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/).
- the BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment.
- Substantially homologous polypeptides may contain only conservative substitutions of one or more amino acids of the specified amino acid sequences or substitutions, insertions or deletions of non-essential amino acids.
- a non-essential amino acid is a residue that can be altered in one of these sequences without substantially altering the biological function.
- guidance concerning how to make pheno- typically silent amino acid substitutions is provided in Bowie, J. U. et al., Science 247:1306-1310 (1990) wherein the authors indicate that there are two main approaches for studying the tolerance of an amino acid sequence to change. The first method relies on the process of evolution, in which mutations are either accepted or rejected by natural selection.
- the second approach uses genetic engineering to introduce amino acid changes at specific positions of a cloned gene and selects or screens to identify sequences that maintain functionality.
- proteins are surprisingly tolerant of amino acid substitutions.
- the authors further indicate which changes are likely to be permissive at a certain position of the protein. For example, most buried amino acid residues require non-polar side chains, whereas few features of surface side chains are generally conserved. Other such phenotypically silent substitutions are described in Bowie et al, and the references cited therein.
- aspartic acid glutamic acid
- uncharged polar side chains e.g., glycine, asparagines, glutamine, serine, threonine, tyrosine, cysteine
- non-polar side chains e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan
- ⁇ -branched side chains e.g., threonine, valine, isoleucine
- aromatic side chains e.g., tyrosine, phenylalanine tryptophan, histidine
- a gene-of-interest in filamentous fungi goes via co-transformation; i.e. a mixture of DNA fragments is incubated with the competent cells.
- This mixture contains at least a gene-fragment encoding a selectable marker (in this case acetamidase).
- a selectable marker in this case acetamidase
- Transformants are first selected by means of the integration of the selection marker, and are subsequently screened by molecular techniques (i.e. colony PCR or Southern blotting) for the presence of one or more GOI.
- the polynucleotide or nucleic acid sequence of the invention may be an isolated polynucleotide of genomic, cDNA, RNA, semi-synthetic, synthetic origin, or any combinations thereof.
- isolated polynucleotide or nucleic acid sequence refers to a polynucleotide or nucleic acid sequence which is essentially free of other nucleic acid sequences, e. g., at least about 20% pure, preferably at least about 40% pure, more preferably at least about 60% pure, even more preferably at least about 80% pure, most preferably at least about 90% pure as determined by agarose electrophoresis.
- an isolated nucleic acid sequence can be obtained by standard cloning procedures used in genetic engineering to relocate the nucleic acid sequence from its natural location to a different site where it will be reproduced.
- polypeptides according to the invention and the encoding nucleic acid sequences may be obtained from any eukaryotic cell, preferably from a fungus, more preferably from a filamentous fungus.
- "Filamentous fungi” include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra).
- Filamentous fungal strains include, but are not limited to, strains of Acremonium, Aspergillus, Aureobasidium, Chrysosporium, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, and Trichoderma.
- nucleic acid sequence encoding a polypeptide of the present invention is obtained from a strain of Penicillium chrysogenum.
- DNA sequences of the invention may be obtained by hybridization.
- Nucleic acid molecules corresponding to variants (e.g. natural allelic variants) and homologues of the DNA of the invention can be isolated based on their homology to the nucleic acids disclosed herein using these nucleic acids or a suitable fragment thereof, as a hybridization probe according to standard hybridization techniques, preferably under highly stringent hybridization conditions. Alternatively, one could apply in silico screening through the available genome databases.
- the nucleic acid sequence may be isolated by e.g. screening a genomic or cDNA library of the microorganism in question. Once a nucleic acid sequence encoding a polypeptide having an activity according to the invention has been detected with e.g. a probe derived from SEQ ID NO 2 or 5, the sequence may be isolated or cloned by utilizing techniques which are known to those of ordinary skill in the art (see, e.g., J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning, A Laboratory Manual, 2d edition, Cold Spring Harbor, New York).
- the cloning of the nucleic acid sequences of the present invention from such (genomic) DNA can also be effected, e.g. by using methods based on polymerase chain reaction (PCR) or antibody screening of expression libraries to detect cloned DNA fragments with shared structural features (See, e.g., lnnis et al., 1990, PCR: A Guide to Methods and Application, Academic Press, New York.).
- PCR polymerase chain reaction
- sequence information as provided herein should not be so narrowly construed as to require inclusion of erroneously identified bases.
- the specific sequences disclosed herein can be readily used to isolate the complete gene from filamentous fungi, in particular Penicillium chrysogenum, which in turn can easily be subjected to further sequence analyses thereby identifying sequencing errors.
- all nucleotide sequences determined by sequencing a DNA molecule herein were determined using an automated DNA sequencer and all amino acid sequences of polypeptides encoded by DNA molecules determined herein were predicted by translation of a DNA sequence determined as above. Therefore, as is known in the art for any DNA sequence determined by this automated approach, any nucleotide sequence determined herein may contain some errors.
- Nucleotide sequences determined by automation are typically at least about 90% identical, more typically at least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the sequenced DNA molecule.
- the actual sequence can be more precisely determined by other approaches including manual DNA sequencing methods well known in the art.
- a single insertion or deletion in a determined nucleotide sequence compared to the actual sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted amino acid sequence encoded by a determined nucleotide sequence will be completely different from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the point of such an insertion or deletion.
- the person skilled in the art is capable of identifying such erroneously identified bases and knows how to correct for such errors.
- the present invention discloses a strain comprising the polynucleotide mentioned above in the second aspect.
- Said strain may be any eukaryotic cell, preferably a fungus, more preferably a filamentous fungus.
- Filamentous fungal strains include, but are not limited to, strains of Acremonium, Aspergillus, Aureobasidium, Chrysosporium, Cryptococcus, Filibasidium, Fu sari urn, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, and Trichoderma.
- the present invention discloses the use of a polynucleotide of the first aspect as a selection marker for selecting transformed host strains.
- the advantage of the selection marker of the present invention is that it can be easily deleted from the transformed host organism. The deletion of the selection marker is based on dominant selection.
- the choice of a host cell in the methods of the present invention will to a large extent depend upon the source of the nucleic acid sequence (gene) of interest encoding a polypeptide.
- the host cell is a eukaryotic cell, more preferably a fungus, most preferably a filamentous fungus.
- the filamentous fungal host cell is a cell of a species cited as species from which the polynucleotide of the first aspect may be obtained, examples of which are, but are not limited to, Aspergillus species (i.e.
- the host cell may be a wild type filamentous fungus host cell or a variant, a mutant or a genetically modified filamentous fungus host cell.
- modified filamentous fungal host cells include e.g.
- host cells with reduced protease levels such as the protease deficient strains as Aspergillus oryzae JaL 125 (described in WO 97/35956 or EP 429490); the tripeptidyl-aminopeptidases-deficient Aspergillus niger strain as disclosed in WO 96/14404, or host cells with reduced production of the protease transcriptional activator (prtT; as described in WO 01/68864 and US 2004/0191864); host strains like the Aspergillus oryzae BECh2, wherein three TAKA amylase genes, two protease genes, as well as the ability to form the metabolites cyclopiazonic acid and kojic acid have been inactivated (BECh2 is described in WO 00/39322); filamentous fungal host cells comprising an elevated unfolded protein response (UPR) compared to the wild type cell to enhance production abilities of a polypeptide of interest (described in US 2004/0186070, US
- Nucleic acid constructs may contain a gene of interest and the polynucleotide of the invention (selection marker), each operably linked to one or more control sequences, which direct the expression of the encoded polypeptide in a suitable expression host.
- the nucleic acid constructs may be on one DNA fragment, or, preferably, on separate fragments. Expression will be understood to include any step involved in the production of the polypeptide and may include transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
- nucleic acid construct is synonymous with the term expression vector or cassette when the nucleic acid construct contains all the control sequences required for expression of a coding sequence in a particular host organism.
- control sequences is defined herein to include all components, which are necessary or advantageous for the expression of a polypeptide.
- Each control sequence may be native or foreign to the nucleic acid sequence encoding the polypeptide.
- Such control sequences may include, but are not limited to, a promoter, a leader, optimal translation initiation sequences (as described in Kozak, 1991 , J. Biol. Chem. 266:19867- 19870 and WO 2006/077258), a secretion signal sequence, a pro-peptide sequence, a polyadenylation sequence, a transcription terminator.
- the control sequences include a promoter, and transcriptional and translational stop signals.
- operably linked is defined herein as a configuration in which a control sequence is appropriately placed at a position relative to the coding sequence of the DNA sequence such that the control sequence directs the production of a polypeptide.
- the control sequence may include an appropriate promoter sequence containing transcriptional control sequences.
- the promoter may be any nucleic acid sequence, which shows transcription regulatory activity in the cell including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extra cellular or intracellular polypeptides.
- the promoter may be either homologous or heterologous to the cell or to the polypeptide.
- the promoter may be derived from a gene, which is highly expressed (defined herein as the mRNA concentration with at least 0.5% (w/w) of the total cellular mRNA). In another preferred embodiment, the promoter may be derived from a gene, which is medium expressed (defined herein as the mRNA concentration with at least 0.01 % until 0.5% (w/w) of the total cellular mRNA). In another preferred embodiment, the promoter may be derived from a gene, which is low expressed (defined herein as the mRNA concentration lower than 0.01 % (w/w) of the total cellular mRNA).
- Micro Array data is used to select genes, and thus promoters of those genes, that have a certain transcriptional level and regulation. In this way one can adapt the gene expression cassettes optimally to the conditions it should function in.
- DNA fragments in front of the polynucleotides of this invention.
- acetamide plate assay one can easily screen for active promoters, as these should facilitate growth on acetamide as the sole nitrogen source.
- DNA fragments can be derived from many sources, i.e. different species, PCR amplified, synthetically and the like.
- the control sequence may also include a suitable transcription terminator sequence, a sequence recognized by a filamentous fungal cell to terminate transcription.
- the terminator sequence is operably linked to the 3' terminus of the nucleic acid sequence encoding the polypeptide. Any terminator, which is functional in the cell, may be used in the present invention.
- Preferred terminators for filamentous fungal cells are obtained from the genes encoding Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Aspergillus niger alpha- glucosidase, trpC gene and Fusarium oxysporum trypsin-like protease.
- the control sequence may also include a suitable leader sequence, a non- translated region of a mRNA, which is important for translation by the filamentous fungal cell.
- the leader sequence is operably linked to the 5' terminus of the nucleic acid sequence encoding the polypeptide. Any leader sequence, which is functional in the cell, may be used in the present invention.
- Preferred leaders for filamentous fungal cells are obtained from the genes encoding Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase and Aspergillus niger glaA.
- Preferred polyadenylation sequences for filamentous fungal cells are obtained from the genes encoding Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Fusarium oxysporum trypsin-like protease and Aspergillus niger alpha- glucosidase.
- the control sequence may also include a signal peptide-encoding region, which codes for an amino acid sequence linked to the amino terminus of the polypeptide, which can direct the encoded polypeptide into the cell's secretory pathway.
- the 5'end of the coding sequence of the nucleic acid sequence may inherently contain a signal peptide-coding region naturally linked in translation reading frame with the segment of the coding region, which encodes the secreted polypeptide.
- the 5'end of the coding sequence may contain a signal peptide-coding region, which is foreign to the coding sequence.
- the foreign signal peptide-coding region may be required where the coding sequence does not normally contain a signal peptide- coding region.
- the foreign signal peptide-coding region may simply replace the natural signal peptide-coding region in order to obtain enhanced secretion of the polypeptide.
- the nucleic acid construct may be an expression vector.
- the expression vector may be any vector (e.g. a plasmid or virus), which can be conveniently subjected to recombinant DNA procedures and can bring about the expression of the nucleic acid sequence encoding the polypeptide.
- the choice of the vector will typically depend on the compatibility of the vector with the cell into which the vector is to be introduced.
- the vectors may be linear or closed circular plasmids.
- the vector may be an autonomously replicating vector, i.e. a vector, which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g. a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome.
- An autonomously maintained cloning vector for a filamentous fungus may comprise the AMA1 -sequence (see e.g. Aleksenko and Clutterbuck (1997), Fungal Genet. Biol. 21 : 373-397).
- the vector may be one which, when introduced into the cell, is integrated into the genome and replicated together with the chromosome (s) into which it has been integrated.
- the integrative cloning vector may integrate at random or at a predetermined target locus in the chromosomes of the host cell.
- the integrative cloning vector comprises a DNA fragment, which is homologous to a DNA sequence in a predetermined target locus in the genome of host cell for targeting the integration of the cloning vector to this predetermined locus.
- the cloning vector is preferably linearized prior to transformation of the host cell.
- Linearization is preferably performed such that at least one but preferably either end of the cloning vector is flanked by sequences homologous to the target locus.
- the length of the homologous sequences flanking the target locus is preferably at least at least 0.1 kb, even preferably at least 0.2kb, more preferably at least 0.5 kb, even more preferably at least 1 kb, most preferably at least 2 kb.
- the vector system may be a single vector or plasmid or two or more vectors or plasmids, which together contain the total DNA to be introduced into the genome of the host cell.
- the DNA constructs may be used on an episomal vector. However in the present invention the constructs are preferably integrated in the genome of the host strain.
- the obtained host cell may be used for producing a compound of interest, for example a primary or secondary metabolite or a polypeptide.
- a gene of interest may thus encompass a gene encoding a polypeptide involved in a metabolic pathway or may be a gene encoding a polypeptide of interest to be produced.
- the present description further uses the term 'introduction' of one or more gene(s)-of-interest (abbreviated as GOI) or fragments thereof. With this is meant an insertion, duplication, deletion, or substitution of a GOI. In general all these alterations can be performed using the polynucleotide of the present invention as a bi-directional selection marker, applying a forward and reverse selection protocol.
- GOI gene(s)-of-interest
- amidase genes disclosed by the present invention not only can be used in the forward selection on acetamide, but perform also very effectively in fluoroacetamide reverse selection procedures to obtain selection marker gene free strains. Due to the fact that a final recombinant strain does not contain the selection marker, the procedure of the present invention can be repeated, so that several alterations suggested above can be combined in one recombinant strain. Surprisingly, it was found that the deletion of an amidase marker gene according to the invention from microbial strains works with a 100% efficiency.
- the efficiency of deleting the amidase gene(s) through selection on fluoroacetamide can be increased by flanking the amidase gene by DNA repeats, enabling efficient recombination and subsequent loss of the amidase gene and one of the direct repeats.
- the amidase genes can be flanked by so-called lox or frt sequences, which upon addition of recombinase enzymes (ere or flp recombinase, respectively), also enables efficient recombination and subsequent loss of the amidase gene and one of the direct repeats.
- other features can be used to remove the selection marker genes, i.e. restriction enzyme recognition sites.
- the invention provides a method for obtaining selection marker gene free recombinant strains comprising the following steps:
- the present invention further shows that this selection marker gene can be deleted from the chromosomes of the transformed organisms without leaving a trace of DNA used for cloning.
- sequences used for integration as mentioned under (i) may be surrounded by endogenous DNA fragments homologous to DNA sequences in the host genome. If such a sequence is not present the DNA nevertheless may integrate into the genome. This does not influence the possibility of deletion of the selection marker gene.
- sequences of (i) may be surrounded by recognition sites for restriction or recombination enzymes to facilitate efficient integration in step (i) and/or efficient deletion in step (iii).
- step (iv) using a reverse selected clone for the production of the compound of interest.
- step (Ni) can be omitted and the acetamidase containing clone can be used for production of the compound of interest.
- the vector used for integration of the amidase gene also contains a gene of interest.
- the invention thus further enables the introduction of desired foreign or homologous genes or DNA elements in the host organisms of choice using the amidase gene as a selection marker gene.
- the amidase gene is deleted.
- the amidase and the desired genes or DNA elements are introduced site-specifically, where after the amidase gene is deleted.
- the invention discloses selection marker free organisms containing (site-specifically) introduced genes, which can be used for a new round of transformation.
- the invention is used for repeated introduction of multiple copies of various genes or a DNA element at a predetermined genomic locus.
- the method as disclosed by the invention can be used to transform a host cell with a DNA sequence of interest affecting expression of a GOI.
- the promoter of a GOI may be mutated or exchanged for a different promoter, thereby altering the regulation and thus expression level of the GOI.
- the disclosed invention can be used to introduce additional copies of certain genes. This is particularly useful when altering the regulation of these GOI via exchange of the homologous promoter for a strong promoter is not an option.
- the application of the polypeptides of the present invention can be improved by deleting one or more of the endogenous amidase encoding genes from the genome of the host strain. Although these might be non-functional or non-transcribed genes, deleting them from the genome prevents that they could mutate into active amidases and interfere with the polypeptides from the present invention.
- Figure 2 is a representation of plasmid pHELY-A1 , the expression vector for Aspergillus nidulans amdS.
- Figure 3 is a representation of plasmid pGATWAn, a Penicillium chrysogenum expression vector.
- Figure 4 is a representation of plasmid pGPcamdScA, the expression vector for Penicillium chrysogenum amdS (sequence described in EP0758020A2).
- Figure 5 is a representation of plasmid pGPcamdAcA, the expression vector for Penicillium chrysogenum amdA.
- Figure 6 is a representation of plasmid pGPcamdBcA, the expression vector for Penicillium chrysogenum amdB.
- Figure 7 shows the acetamide (left plate) versus fluoroacetamide (right plate) plate selection.
- A transformant with Penicillium chrysogenum amidase amdA gene;
- B transformant with Penicillium chrysogenum amidase amdB gene;
- C transformant with Aspergillus nidulans amdS gene.
- pHELY-A1 (described in WO 04106347) was used as expression construct for testing the Aspergillus nidulans amdS gene. Two ⁇ g of vector was transformed to Penicillium chrysogenum. Transformants were selected on media with acetamide as the sole nitrogen source. To secure obtaining stable transformants, first round positives were colony purified on fresh acetamide plates and subsequently transferred to o non-selective, rich media (YEPD) to induce sporulation. Afterwards all colonies were again tested on acetamide media. Spores of stable amdS transformants were used to compare growth on acetamide and fluoroacetamide media.
- YEPD o non-selective, rich media
- WO 9706261 discloses this gene sequence with low homology to the Aspergillus nidulans amdS gene sequence to be present in the genome of Penicillium chrysogenum, we have demonstrated that this gene sequence is not a functional equivalent of the Aspergillus nidulans amdS gene and does not enable Penicillium chrysogenum to grow on acetamide as the sole N-source.
- Amidase encoding open reading frames amdA and amdB were identified by inspection of the Penicillium chrysogenum genome sequence of the Wisconsin54-1255 strain by those skilled in the art. They were PCR amplified from Penicillium chrysogenum DNA using Herculase ® Hotstart DNA Polymerase (Stratagene). SEQ ID NO 1 1 and 12 were used to amplify the ORF encoding SEQ ID NO 3; SEQ ID NO 13 and 14 were used to amplify the ORF encoding SEQ ID NO 6. The amplified amidase genes were cloned in pCR ® -Blunt II-TOPO ® (Invitrogen) and sequence verified before further processing.
- the ORF of amdB was amplified from cDNA and after sequencing found to retain one of the predicted introns. Still, this clone was able to give acetamide positive transformants and therefore would be functionally active as an acetamidase.
- a rtPCR was performed on several transformants. Cells from acetamide plates (inducing conditions) were thereto resuspended in water and RNA was isolated using the StrataPrep ® Total RNA Microprep kit (Stratagene). rtPCR was performed with the SuperscriptTM III One-Step RT-PCR System (Invitrogen). The results indicate that in all transformants both mRNA variants are present (see Figure 8).
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The present invention discloses a polypeptide selected from the group consisting of: a polypeptide having an amino acid sequence according to SEQ ID NO 3, a polypeptide having an amino acid sequence according to SEQ ID NO 6, a polypeptide having an amino acid that is substantially homologous to the sequence of SEQ ID NO 3 and a polypeptide having an amino acid that is substantially homologous to the sequence of SEQ ID NO 6, the polypeptide displaying acetamidase activity and providing a reverse selection on fluoroacetamide with an efficiency of at least 95%, preferably at least 96%, more preferably at least 97%, more preferably at least 98%, more preferably at least 99%, most preferably 100%. The gene encoding the polypeptide of the invention is used as an efficient bi-directional selection marker in the construction of selection marker free strains, in particular for processes for the production of a compound of interest.
Description
BI-DIRECTIONAL SELECTION MARKERS WITH IMPROVED ACTIVITY
Field of the invention
The present invention relates to new amdS genes for use in an improved method for the transformation of filamentous fungi for obtaining selection marker gene free recombinant strains.
Background of the invention
In the field of microbial production of compounds of interest, there is in general a growing desire to use recombinant microorganisms containing as little as possible of foreign DNA. Ideally the transformed microorganism would contain only the desired modifications at the gene(s)-of-interest and as little as possible or remnants of other DNA fragments used during the transformation experiment or during cloning.
Patent applications EP 635,574 and WO 9706261 reveal genes encoding phenotypically selectable markers, which can be removed once the gene(s)-of-interest are stably maintained in the organism. The examples in these patent applications are genes encoding acetamidases, which upon expression in a transformed cell enable the cell to grow on media with acetamide as the sole carbon and/or nitrogen source. This selectable marker has several advantages: it is a dominant but non-antibiotic marker, and it can be used even in fungi with an endogenous amdS gene, like Aspergillus nidulans (Tilburn et al., 1983, Gene 26: 205-221 ). Moreover, this selectable marker is a so-called bi-directional marker, meaning that besides the positive selection for its presence (forward selection) also a reverse selection selecting for the absence of the gene can be applied, using fluoroacetamide (see Figure 1 ). This is successfully applied in species from the genera Aspergillus, Penicillium, Saccharomyces and Trichoderma.
The amdS gene of Aspergillus nidulans is successfully used as a selection marker for different fungal species, for instance in Aspergillus niger and Penicillium chrysogenum. In addition, homologous amdS genes were described, obtained from Aspergillus niger for transformation of Aspergillus niger and obtained from Penicillium chrysogenum for transformation of Penicillium chrysogenum (WO 9706261 ). However,
these amdS genes still have a major drawback. The forward selection, e.g. selection for the presence of the selectable marker gene, works, although for one of the genes the present invention will show that this is not the case. The problems become apparent when the reverse selection is applied, e.g. selection for the absence of the selectable marker gene. The most widely used (acet)amidase expression cassette, PgpdA-amdS from Aspergillus nidulans can give rise to isolation of false negatives, e.g. isolates which suggest by phenotype that they are devoid of the selection marker as a result of the negative selection protocol, but actually have the selection marker gene or fragments thereof stably maintained in the genome. For some reason they escape the selection pressure. This can be a burden in strain improvement programs where repeated transformations have to be performed, especially in those cases where a high throughput is required. Therefore, a bi-directional selection marker gene that functions 100% correctly in both directions, e.g. the forward and the reverse selection, is not available but is highly desirable.
The present invention surprisingly shows that the reverse selection, i.e. the deletion of an amidase marker gene from microbial strains, works with 100% efficiency when using the amidase encoding genes according to the invention. As a result, sequential modification of industrial production strains is feasible with high efficiency and throughput.
Summary of the invention
The present invention provides a polypeptide displaying acetamidase activity and providing a reverse selection on fluoroacetamide. Furthermore, the present invention provides a polynucleotide encoding the above polypeptide and the use of said polynucleotide.
Detailed description of the invention
In a first aspect, the present invention provides a polypeptide selected from the group consisting of a polypeptide having an amino acid sequence according to SEQ ID NO 3, a polypeptide having an amino acid sequence according to SEQ ID NO 6,
a polypeptide having an amino acid that is substantially homologous to the sequence of SEQ ID NO 3 and a polypeptide having an amino acid that is substantially homologous to the sequence of SEQ ID NO 6, the polypeptide displaying acetamidase activity and that providing a reverse selection on fluoroacetamide with an efficiency of at least 95%, preferably at least 96%, more preferably at least 97%, more preferably at least 98%, more preferably at least 99%, most preferably 100%.
The present invention further provides in a second aspect a polynucleotide encoding a polypeptide of the first aspect. In particular, a specific DNA sequence is provided encoding the polypeptide of SEQ ID NO 3, i.e. SEQ ID NO 1 or 2, or encoding the polypeptide of SEQ ID NO 6, i.e. SEQ ID NO 4 or 5. SEQ ID NO 7 is a variant coding sequence derived from the genomic DNA of SEQ ID NO 4, due to an event of alternative splicing, and SEQ ID NO 8 is the protein encoded by the alternative coding sequence. On protein level both polypeptides (i.e. SEQ ID NO 6 and SEQ ID NO 8) are identical up to amino acid 365.
As part of the present invention it is demonstrated that the fluoroacetamide screening using the currently available acetamidase genes is not functioning properly; i.e. screening gives rise to false negative mutants. The amidase genes described by the present invention do not have this problem. The activity of the novel polypeptides encoded by these genes can therefore be characterized as follows:
They enable a forward selection, i.e. provide growth on acetamide as the sole nitrogen source,
They enable an efficient reverse selection, i.e. provide resistance to fluoroacetamide with an efficiency of at least 95%, as mentioned above.
In the context of the invention, an efficiency of at least 95% means that at least 95% of the strains resulting from the reverse selection on fluoroacetamide has the amidase gene deleted from the genome. This is typically observed upon further analyzing isolated colonies obtained after selection on standard fluoroacetamide plates containing 32 m M fluoroacetamide, 5 mM urea (as N-source) and 1.1 % glucose (see WO 9706261 ).
A polypeptide having an amino acid sequence that is "substantially homologous" to the sequence of SEQ ID NO 3 and/or 6 is defined as a polypeptide having an amino acid sequence possessing a degree of identity to the specified amino acid sequence of at least 75%, preferably at least 80%, more preferably at least 85%, still more preferably
at least 90%, still preferably at least 95%, still more preferably at least 96%, still more preferably at least 97%, still more preferably at least 98% and most preferably at least 99%, the substantially homologous peptide displaying acetamidase activity and providing a reverse selection on fluoroacetamide with an efficiency of at least 95%. A substantially homologous polypeptide may encompass polymorphisms that may exist in cells from different populations or within a population due to natural allelic or intra-strain variation. A substantially homologous polypeptide may further be derived from a fungus other than the fungus where the specified amino acid and/or DNA sequence originates from, or may be encoded by an artificially designed and synthesized DNA sequence. DNA sequences related to the specified DNA sequences and obtained by degeneration of the genetic code are also part of the invention. Homologues may also encompass biologically active fragments of the full-length sequence, for example the polypeptide with an amino acid sequence of SEQ ID NO 8.
For the purpose of the present invention, the degree of identity between two amino acid sequences refers to the percentage of amino acids that are identical between the two sequences. The degree of identity is determined using the BLAST algorithm, which is described in Altschul, et al., J. MoI. Biol. 215: 403-410 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLAST program uses as defaults a word length (W) of 11 , the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89: 10915 (1989)) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
Substantially homologous polypeptides may contain only conservative substitutions of one or more amino acids of the specified amino acid sequences or substitutions, insertions or deletions of non-essential amino acids. Accordingly, a non-essential amino acid is a residue that can be altered in one of these sequences without substantially altering the biological function. For example, guidance concerning how to make pheno- typically silent amino acid substitutions is provided in Bowie, J. U. et al., Science 247:1306-1310 (1990) wherein the authors indicate that there are two main approaches for studying the tolerance of an amino acid sequence to change. The first method relies on the process of evolution, in which mutations are either accepted or rejected by natural selection. The second approach uses genetic engineering to introduce amino acid changes at specific positions of a cloned gene and selects or screens to identify
sequences that maintain functionality. As the authors state, these studies have revealed that proteins are surprisingly tolerant of amino acid substitutions. The authors further indicate which changes are likely to be permissive at a certain position of the protein. For example, most buried amino acid residues require non-polar side chains, whereas few features of surface side chains are generally conserved. Other such phenotypically silent substitutions are described in Bowie et al, and the references cited therein.
The term "conservative substitution" is intended to mean that a substitution in which the amino acid residue is replaced with an amino acid residue having a similar side chain. These families are known in the art and include amino acids with basic side chains (e.g. lysine, arginine and histidine), acidic side chains (e.g. aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagines, glutamine, serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), β-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine tryptophan, histidine).
In one embodiment of the invention, a deteriorated polypeptide of the invention is provided. A "deteriorated polypeptide" of the invention is an acetamidase protein wherein at least the acetamidase activity in the forward selection is decreased. The standard acetamide forward selection provides for a standard assay to measure their activity and thus deteriorated proteins may easily be selected. Deteriorated polypeptides of the invention thus are characterized by the fact that they show less activity in the forward selection (i.e. on acetamide), but they still retain at least 95% efficiency in the reverse selection (i.e. on fluoroacetamide). Such deteriorated enzymes are particular useful if one wants to screen for increased copy numbers after transformation. Introduction of a gene-of-interest (GOI) in filamentous fungi goes via co-transformation; i.e. a mixture of DNA fragments is incubated with the competent cells. This mixture contains at least a gene-fragment encoding a selectable marker (in this case acetamidase). Furthermore, it contains at least one GOI, but this might be extended to several GOI, or even to hundreds of GOI. Transformants are first selected by means of the integration of the selection marker, and are subsequently screened by molecular techniques (i.e. colony PCR or Southern blotting) for the presence of one or more GOI. Depending on the strain, DNA mixture and transformation conditions, 1-50%, or even more than 50%, of the acetamidase positive transformants contain also the GOI. If a high gene copy number of the GOI is desired, one can apply these deteriorated polypeptides of the invention to
screen and select for strains with high gene copy numbers of the GOI. As additional copies of a deteriorated amidase will lead to more enzyme molecules and thus more enzyme activity, additional gene copies will lead to better growing isolates that can be selected from a primary transformant. In many of these cases the GOI will also be present in multiple copies. These isolates can thus be screened for additional copies of the GOI too.
Deteriorated polypeptides of the invention may be obtained by randomly introducing mutations along all or part of the amidase coding sequence, such as by saturation mutagenesis, and the resulting mutants can be expressed recombinantly and screened for biological activity. They can also be isolated via error prone PCR or can be produced synthetically.
Alternatively, high copy numbers of a GOI may be obtained using the non- deteriorated amidases as described in SEQ ID NO 3 or 6 or 8, and decreasing the concentration of acetamide or using an N-source having a low bioavailability to the transformant, like acryl amide in agar plates, and then selecting the best growing colonies as the ones with additional gene copies. These isolates can than be screened for additional copies of the GOI too.
The polynucleotide or nucleic acid sequence of the invention may be an isolated polynucleotide of genomic, cDNA, RNA, semi-synthetic, synthetic origin, or any combinations thereof. The term "isolated polynucleotide or nucleic acid sequence" as used herein refers to a polynucleotide or nucleic acid sequence which is essentially free of other nucleic acid sequences, e. g., at least about 20% pure, preferably at least about 40% pure, more preferably at least about 60% pure, even more preferably at least about 80% pure, most preferably at least about 90% pure as determined by agarose electrophoresis. For example, an isolated nucleic acid sequence can be obtained by standard cloning procedures used in genetic engineering to relocate the nucleic acid sequence from its natural location to a different site where it will be reproduced.
The polypeptides according to the invention and the encoding nucleic acid sequences may be obtained from any eukaryotic cell, preferably from a fungus, more preferably from a filamentous fungus. "Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra). Filamentous fungal strains include, but are not limited to, strains of Acremonium, Aspergillus, Aureobasidium, Chrysosporium, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora,
Paecilomyces, Penicillium, Piromyces, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, and Trichoderma.
In a more preferred embodiment, the nucleic acid sequence encoding a polypeptide of the present invention is obtained from a strain of Penicillium chrysogenum.
DNA sequences of the invention may be obtained by hybridization. Nucleic acid molecules corresponding to variants (e.g. natural allelic variants) and homologues of the DNA of the invention can be isolated based on their homology to the nucleic acids disclosed herein using these nucleic acids or a suitable fragment thereof, as a hybridization probe according to standard hybridization techniques, preferably under highly stringent hybridization conditions. Alternatively, one could apply in silico screening through the available genome databases.
"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art. For additional details and explanation of stringency of hybridization reactions, see Ausubel et al, Current Protocols in Molecular Biology, Wiley lnterscience Publishers, (1995).
The nucleic acid sequence may be isolated by e.g. screening a genomic or cDNA library of the microorganism in question. Once a nucleic acid sequence encoding a polypeptide having an activity according to the invention has been detected with e.g. a probe derived from SEQ ID NO 2 or 5, the sequence may be isolated or cloned by utilizing techniques which are known to those of ordinary skill in the art (see, e.g., J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, Molecular Cloning, A Laboratory Manual, 2d edition, Cold Spring Harbor, New York).
The cloning of the nucleic acid sequences of the present invention from such (genomic) DNA can also be effected, e.g. by using methods based on polymerase chain reaction (PCR) or antibody screening of expression libraries to detect cloned DNA fragments with shared structural features (See, e.g., lnnis et al., 1990, PCR: A Guide to Methods and Application, Academic Press, New York.).
The sequence information as provided herein should not be so narrowly construed as to require inclusion of erroneously identified bases. The specific sequences disclosed herein can be readily used to isolate the complete gene from filamentous fungi, in particular Penicillium chrysogenum, which in turn can easily be subjected to further sequence analyses thereby identifying sequencing errors.
Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA molecule herein were determined using an automated DNA sequencer and all amino acid sequences of polypeptides encoded by DNA molecules determined herein were predicted by translation of a DNA sequence determined as above. Therefore, as is known in the art for any DNA sequence determined by this automated approach, any nucleotide sequence determined herein may contain some errors. Nucleotide sequences determined by automation are typically at least about 90% identical, more typically at least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the sequenced DNA molecule. The actual sequence can be more precisely determined by other approaches including manual DNA sequencing methods well known in the art. As is also known in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted amino acid sequence encoded by a determined nucleotide sequence will be completely different from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the point of such an insertion or deletion. The person skilled in the art is capable of identifying such erroneously identified bases and knows how to correct for such errors.
In a third aspect, the present invention discloses a strain comprising the polynucleotide mentioned above in the second aspect. Said strain may be any eukaryotic cell, preferably a fungus, more preferably a filamentous fungus. Filamentous fungal strains include, but are not limited to, strains of Acremonium, Aspergillus, Aureobasidium, Chrysosporium, Cryptococcus, Filibasidium, Fu sari urn, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, and Trichoderma.
In a fourth aspect, the present invention discloses the use of a polynucleotide of the first aspect as a selection marker for selecting transformed host strains. The advantage of the selection marker of the present invention is that it can be easily deleted from the transformed host organism. The deletion of the selection marker is based on dominant selection.
The choice of a host cell in the methods of the present invention will to a large extent depend upon the source of the nucleic acid sequence (gene) of interest encoding
a polypeptide. Preferably, the host cell is a eukaryotic cell, more preferably a fungus, most preferably a filamentous fungus. In a preferred embodiment, the filamentous fungal host cell is a cell of a species cited as species from which the polynucleotide of the first aspect may be obtained, examples of which are, but are not limited to, Aspergillus species (i.e. Aspergillus awamori, Aspergillus fumigatus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae or Aspergillus sojae), Chrysosporium species (i.e. Chrysosporium lucknowense) or Penicillium species (i.e. Penicillium chrysogenum). The host cell may be a wild type filamentous fungus host cell or a variant, a mutant or a genetically modified filamentous fungus host cell. Such modified filamentous fungal host cells include e.g. host cells with reduced protease levels, such as the protease deficient strains as Aspergillus oryzae JaL 125 (described in WO 97/35956 or EP 429490); the tripeptidyl-aminopeptidases-deficient Aspergillus niger strain as disclosed in WO 96/14404, or host cells with reduced production of the protease transcriptional activator (prtT; as described in WO 01/68864 and US 2004/0191864); host strains like the Aspergillus oryzae BECh2, wherein three TAKA amylase genes, two protease genes, as well as the ability to form the metabolites cyclopiazonic acid and kojic acid have been inactivated (BECh2 is described in WO 00/39322); filamentous fungal host cells comprising an elevated unfolded protein response (UPR) compared to the wild type cell to enhance production abilities of a polypeptide of interest (described in US 2004/0186070, US 2001/0034045, WO 01/72783 and WO 2005/123763); host cells with an oxalate deficient phenotype (described in WO 2004/070022); host cells with a reduced expression of an abundant endogenous polypeptide such as a glucoamylase, neutral alpha-amylase A, neutral alpha-amylase B, alpha-1 , 6-transglucosidase, proteases, cellobiohydrolase and/or oxalic acid hydrolase (as may be obtained by genetic modification according to the techniques described in US 2004/0191864); host cells with an increased efficiency of homologous recombination (having deficient hdfA or hdfB gene as described in WO 2005/095624); Penicillium host cells producing adipyl-7- aminodesacetoxycephalosporanic acid and derivatives thereof; and host cells having any possible combination of these modifications.
Nucleic acid constructs, e.g. expression constructs, may contain a gene of interest and the polynucleotide of the invention (selection marker), each operably linked to one or more control sequences, which direct the expression of the encoded polypeptide in a suitable expression host. The nucleic acid constructs may be on one DNA fragment, or, preferably, on separate fragments. Expression will be understood to
include any step involved in the production of the polypeptide and may include transcription, post-transcriptional modification, translation, post-translational modification, and secretion. The term nucleic acid construct is synonymous with the term expression vector or cassette when the nucleic acid construct contains all the control sequences required for expression of a coding sequence in a particular host organism. The term "control sequences" is defined herein to include all components, which are necessary or advantageous for the expression of a polypeptide. Each control sequence may be native or foreign to the nucleic acid sequence encoding the polypeptide. Such control sequences may include, but are not limited to, a promoter, a leader, optimal translation initiation sequences (as described in Kozak, 1991 , J. Biol. Chem. 266:19867- 19870 and WO 2006/077258), a secretion signal sequence, a pro-peptide sequence, a polyadenylation sequence, a transcription terminator. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The term "operably linked" is defined herein as a configuration in which a control sequence is appropriately placed at a position relative to the coding sequence of the DNA sequence such that the control sequence directs the production of a polypeptide.
The control sequence may include an appropriate promoter sequence containing transcriptional control sequences. The promoter may be any nucleic acid sequence, which shows transcription regulatory activity in the cell including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extra cellular or intracellular polypeptides. The promoter may be either homologous or heterologous to the cell or to the polypeptide.
Preferred promoters for filamentous fungal cells are known in the art and can be, for example, the glucose-6-phosphate dehyrogenase gpdA promoters, protease promoters such as pepA, pepB, pepC, the glucoamylase glaA promoters, amylase amyf\, amyB promoters, the catalase caiR or catA promoters, glucose oxidase goxC promoter, beta-galactosidase lack promoter, alpha-glucosidase aglA promoter, translation elongation factor tefA promoter, xylanase promoters such as xlnA, xlnB, xlnC, xlnD, cellulase promoters such as eg/A, eglB, cbhA, promoters of transcriptional regulators such as areA, creA, xlnR, pacC, prfl, etc or any other, and can be found among others at the NCBI website (http://www.ncbi.nlm.nih.gov/entrez/).
In a preferred embodiment, the promoter may be derived from a gene, which is highly expressed (defined herein as the mRNA concentration with at least 0.5% (w/w) of the total cellular mRNA). In another preferred embodiment, the promoter may be derived
from a gene, which is medium expressed (defined herein as the mRNA concentration with at least 0.01 % until 0.5% (w/w) of the total cellular mRNA). In another preferred embodiment, the promoter may be derived from a gene, which is low expressed (defined herein as the mRNA concentration lower than 0.01 % (w/w) of the total cellular mRNA).
In an even more preferred embodiment, Micro Array data is used to select genes, and thus promoters of those genes, that have a certain transcriptional level and regulation. In this way one can adapt the gene expression cassettes optimally to the conditions it should function in.
Alternatively, one could clone random DNA fragments in front of the polynucleotides of this invention. Using the acetamide plate assay one can easily screen for active promoters, as these should facilitate growth on acetamide as the sole nitrogen source. These DNA fragments can be derived from many sources, i.e. different species, PCR amplified, synthetically and the like.
The control sequence may also include a suitable transcription terminator sequence, a sequence recognized by a filamentous fungal cell to terminate transcription. The terminator sequence is operably linked to the 3' terminus of the nucleic acid sequence encoding the polypeptide. Any terminator, which is functional in the cell, may be used in the present invention. Preferred terminators for filamentous fungal cells are obtained from the genes encoding Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Aspergillus niger alpha- glucosidase, trpC gene and Fusarium oxysporum trypsin-like protease.
The control sequence may also include a suitable leader sequence, a non- translated region of a mRNA, which is important for translation by the filamentous fungal cell. The leader sequence is operably linked to the 5' terminus of the nucleic acid sequence encoding the polypeptide. Any leader sequence, which is functional in the cell, may be used in the present invention. Preferred leaders for filamentous fungal cells are obtained from the genes encoding Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase and Aspergillus niger glaA.
The control sequence may also include a polyadenylation sequence, a sequence which is operably linked to the 3' terminus of the nucleic acid sequence and which, when transcribed, is recognized by the filamentous fungal cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence, which is functional in the cell, may be used in the present invention. Preferred polyadenylation sequences for filamentous fungal cells are obtained from the genes encoding Aspergillus
oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Fusarium oxysporum trypsin-like protease and Aspergillus niger alpha- glucosidase.
For a polypeptide to be secreted, the control sequence may also include a signal peptide-encoding region, which codes for an amino acid sequence linked to the amino terminus of the polypeptide, which can direct the encoded polypeptide into the cell's secretory pathway. The 5'end of the coding sequence of the nucleic acid sequence may inherently contain a signal peptide-coding region naturally linked in translation reading frame with the segment of the coding region, which encodes the secreted polypeptide. Alternatively, the 5'end of the coding sequence may contain a signal peptide-coding region, which is foreign to the coding sequence. The foreign signal peptide-coding region may be required where the coding sequence does not normally contain a signal peptide- coding region. Alternatively, the foreign signal peptide-coding region may simply replace the natural signal peptide-coding region in order to obtain enhanced secretion of the polypeptide.
The nucleic acid construct may be an expression vector. The expression vector may be any vector (e.g. a plasmid or virus), which can be conveniently subjected to recombinant DNA procedures and can bring about the expression of the nucleic acid sequence encoding the polypeptide. The choice of the vector will typically depend on the compatibility of the vector with the cell into which the vector is to be introduced. The vectors may be linear or closed circular plasmids.
The vector may be an autonomously replicating vector, i.e. a vector, which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g. a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome. An autonomously maintained cloning vector for a filamentous fungus may comprise the AMA1 -sequence (see e.g. Aleksenko and Clutterbuck (1997), Fungal Genet. Biol. 21 : 373-397).
Alternatively, the vector may be one which, when introduced into the cell, is integrated into the genome and replicated together with the chromosome (s) into which it has been integrated. The integrative cloning vector may integrate at random or at a predetermined target locus in the chromosomes of the host cell. In a preferred embodiment of the invention, the integrative cloning vector comprises a DNA fragment, which is homologous to a DNA sequence in a predetermined target locus in the genome of host cell for targeting the integration of the cloning vector to this predetermined locus. In order to
promote targeted integration, the cloning vector is preferably linearized prior to transformation of the host cell. Linearization is preferably performed such that at least one but preferably either end of the cloning vector is flanked by sequences homologous to the target locus. The length of the homologous sequences flanking the target locus is preferably at least at least 0.1 kb, even preferably at least 0.2kb, more preferably at least 0.5 kb, even more preferably at least 1 kb, most preferably at least 2 kb.
The vector system may be a single vector or plasmid or two or more vectors or plasmids, which together contain the total DNA to be introduced into the genome of the host cell.
The DNA constructs may be used on an episomal vector. However in the present invention the constructs are preferably integrated in the genome of the host strain.
The obtained host cell may be used for producing a compound of interest, for example a primary or secondary metabolite or a polypeptide. A gene of interest may thus encompass a gene encoding a polypeptide involved in a metabolic pathway or may be a gene encoding a polypeptide of interest to be produced.
The present description further uses the term 'introduction' of one or more gene(s)-of-interest (abbreviated as GOI) or fragments thereof. With this is meant an insertion, duplication, deletion, or substitution of a GOI. In general all these alterations can be performed using the polynucleotide of the present invention as a bi-directional selection marker, applying a forward and reverse selection protocol.
The amidase genes disclosed by the present invention not only can be used in the forward selection on acetamide, but perform also very effectively in fluoroacetamide reverse selection procedures to obtain selection marker gene free strains. Due to the fact that a final recombinant strain does not contain the selection marker, the procedure of the present invention can be repeated, so that several alterations suggested above can be combined in one recombinant strain. Surprisingly, it was found that the deletion of an amidase marker gene according to the invention from microbial strains works with a 100% efficiency.
The efficiency of deleting the amidase gene(s) through selection on fluoroacetamide can be increased by flanking the amidase gene by DNA repeats, enabling efficient recombination and subsequent loss of the amidase gene and one of the direct repeats. Alternatively, the amidase genes can be flanked by so-called lox or frt sequences, which upon addition of recombinase enzymes (ere or flp recombinase, respectively), also enables efficient recombination and subsequent loss of the amidase
gene and one of the direct repeats. Alternatively, other features can be used to remove the selection marker genes, i.e. restriction enzyme recognition sites.
The invention provides a method for obtaining selection marker gene free recombinant strains comprising the following steps:
(i) transforming a host cell of interest with a polynucleotide comprising a gene of interest (GOI) and/or a polynucleotide comprising a DNA sequence affecting expression of a GOI and with a polynucleotide comprising the selection marker gene according to the invention,
(ii) selecting clones of transformed cells for their capacity to grow on acetamide as the sole nitrogen and/or carbon source, (iii) effectuating deletion of the selection marker gene from transfected clones by reverse selection on fluoroacetamide.
The present invention further shows that this selection marker gene can be deleted from the chromosomes of the transformed organisms without leaving a trace of DNA used for cloning.
If site-specific (or better locus specific) integration is desired, the sequences used for integration as mentioned under (i) may be surrounded by endogenous DNA fragments homologous to DNA sequences in the host genome. If such a sequence is not present the DNA nevertheless may integrate into the genome. This does not influence the possibility of deletion of the selection marker gene. Alternatively, the sequences of (i) may be surrounded by recognition sites for restriction or recombination enzymes to facilitate efficient integration in step (i) and/or efficient deletion in step (iii).
The present invention further discloses a method for producing a compound of interest comprising:
(i) transforming a host cell of interest with a polynucleotide comprising a gene of interest and/or with a polynucleotide comprising a DNA sequence affecting expression of a gene of interest, and with the polynucleotide comprising the selection marker gene according to the invention,
(ii) selecting clones of transformed cells for their capacity to grow on acetamide as the sole nitrogen and/or carbon source, (iii) effectuating deletion of the selection marker gene from transformed clones by reverse selection on fluoroacetamide,
(iv) using a reverse selected clone for the production of the compound of interest.
Alternatively, step (Ni) can be omitted and the acetamidase containing clone can be used for production of the compound of interest.
The dominant selection and reverse selection method described above can be employed in the development of production strains in various ways: To introduce a new GOI
The vector used for integration of the amidase gene also contains a gene of interest. The invention thus further enables the introduction of desired foreign or homologous genes or DNA elements in the host organisms of choice using the amidase gene as a selection marker gene. Subsequently, the amidase gene is deleted. Preferably the amidase and the desired genes or DNA elements are introduced site-specifically, where after the amidase gene is deleted. To introduce multiple GOI sequentially (at predetermined loci) Specifically, the invention discloses selection marker free organisms containing (site-specifically) introduced genes, which can be used for a new round of transformation. The invention is used for repeated introduction of multiple copies of various genes or a DNA element at a predetermined genomic locus. To modify transcription levels of a GOI
The method as disclosed by the invention can be used to transform a host cell with a DNA sequence of interest affecting expression of a GOI. For instance, the promoter of a GOI may be mutated or exchanged for a different promoter, thereby altering the regulation and thus expression level of the GOI. Alternatively, one can modify the transcript level of a GOI by introducing an expression construct via co-transformation (see above) which mediates RNA inhibition. To delete a GOI
The disclosed method can also be used to remove a complete or a certain part of a gene. This is of interest when certain genes coding for proteins that negatively influence production levels of desired proteins or metabolites, again without leaving a marker gene in the genome. Examples are proteases in enzyme production, transcription regulators and competing pathways in metabolite production processes and the like. To amplify the copy numbers of GOI
The disclosed invention can be used to introduce additional copies of certain genes. This is particularly useful when altering the regulation of these GOI via exchange of the homologous promoter for a strong promoter is not an option.
To introduce site specific mutations in GOI, which alter the kinetics of the protein(s) produced
In a fifth aspect, the application of the polypeptides of the present invention can be improved by deleting one or more of the endogenous amidase encoding genes from the genome of the host strain. Although these might be non-functional or non-transcribed genes, deleting them from the genome prevents that they could mutate into active amidases and interfere with the polypeptides from the present invention.
Legend to the figures
Figure 1 is a representation of the forward and reverse amidase reactions.
Figure 2 is a representation of plasmid pHELY-A1 , the expression vector for Aspergillus nidulans amdS.
Figure 3 is a representation of plasmid pGATWAn, a Penicillium chrysogenum expression vector.
Figure 4 is a representation of plasmid pGPcamdScA, the expression vector for Penicillium chrysogenum amdS (sequence described in EP0758020A2).
Figure 5 is a representation of plasmid pGPcamdAcA, the expression vector for Penicillium chrysogenum amdA.
Figure 6 is a representation of plasmid pGPcamdBcA, the expression vector for Penicillium chrysogenum amdB.
Figure 7 shows the acetamide (left plate) versus fluoroacetamide (right plate) plate selection. A=transformant with Penicillium chrysogenum amidase amdA gene; B=transformant with Penicillium chrysogenum amidase amdB gene; C=transformant with Aspergillus nidulans amdS gene.
Figure 8 shows the rtPCR on amdB transformants. Panel A is a schematic representation of the 3' part of the gene and the oligonucleotides used. Panel B is the gel-electrophoresis of the amplified fragments (L=kb ladder; 1 =transformant 1 ; 2=transformant 2; 3=transformant 3; 4=transformant 4; 5=control, untransformed strain; a=rtPCR with SEQ ID NO 15 and 16; b=rtPCR with SEQ ID NO 15 and 17).
EXAMPLES
General Methods
In the examples standard molecular techniques have been applied as described in literature (Sambrook et al., 1989, Molecular cloning: a laboratory manual", CSHL press, 5 Cold Spring Harbour, NY), unless stated otherwise.
Example 1
Wild type Penicillium chrysogenum cells do not use acetamide as sole N-source Penicillium chrysogenum strain Wisconsin 54-1255 (ATCC 28089) was tested for growth o on acetamide plates, prepared as described previously (Cantwell CA, Beckmann RJ, Dotzlaf JE, Fisher DL, Skatrud PL, Yeh WK, Queener SW (1990) Curr Genet. 17:213- 21.) and several growth stages of cells were plated out: spores, mycelium and protoplasts. None of these gave significant growth on acetamide plates, demonstrating that wild type Penicillium chrysogenum is not able to use and grow on acetamide.
5
Example 2 Transformation of Penicillium chrysogenum with Aspergillus nidulans amdS and subsequent reverse selection Techniques involved in the transfer of DNA to protoplasts of Penicillium chrysogenum o are well known in the art and are described in many references, including Finkelstein and Ball (eds.), Biotechnology of filamentous fungi, technology and products, Butter- worth-Heinemann (1992); Bennett and Lasure (eds.) More Gene Manipulations in fungi, Academic Press (1991 ); Turner, in: Pϋhler (ed), Biotechnology, 2nd completely revised edition, VHC (1992). The Ca-PEG mediated protoplast transformation is used as descri-
5 bed in EP 635,574. pHELY-A1 (described in WO 04106347) was used as expression construct for testing the Aspergillus nidulans amdS gene. Two μg of vector was transformed to Penicillium chrysogenum. Transformants were selected on media with acetamide as the sole nitrogen source. To secure obtaining stable transformants, first round positives were colony purified on fresh acetamide plates and subsequently transferred to o non-selective, rich media (YEPD) to induce sporulation. Afterwards all colonies were again tested on acetamide media. Spores of stable amdS transformants were used to compare growth on acetamide and fluoroacetamide media. For this a spore solution was made in 0.9 mM NaCI and several dilutions were spotted on both media. Although, acetamide positive, and thus amdS positive, in all dilutions there is also growth in the
undiluted suspension on fluoroacetamide plates (see Figure 7), demonstrating that the reverse selection using the Aspergillus nidulans amdS is not tight.
Example 3
A known Penicillium chrysogenum amdS \s not a functional acetamidase An alleged Penicillium chrysogenum amdS gene (WO 9706261 ) was PCR amplified from Penicillium chrysogenum DNA using Herculase® Hotstart DNA Polymerase (Stratagene). The oligonucleotides of SEQ ID NO 9 and 10 were used to amplify the ORF. The amplified gene was cloned in pCR®-Blunt II-TOPO® (Invitrogen) and sequence verified before further processing. After digestion with Ase\ en Sbft, the ORF fragment was cloned in pGATWAn (see Figure 3) digested with Nde\ en Λ/s/'l, resulting in the PcamdS expression vector pGPcamdScA (Figure 4). Vector pGPcamdScA (2 μg) was transformed to Penicillium chrysogenum and plated out on media with acetamide as the sole carbon source. No colonies were obtained, demonstrating that the Penicillium chrysogenum amdS gene is not functional. Although WO 9706261 discloses this gene sequence with low homology to the Aspergillus nidulans amdS gene sequence to be present in the genome of Penicillium chrysogenum, we have demonstrated that this gene sequence is not a functional equivalent of the Aspergillus nidulans amdS gene and does not enable Penicillium chrysogenum to grow on acetamide as the sole N-source.
Example 4
Amidases amdA and amdB of Penicillium chrysogenum as efficient bi-directional selection marker genes
Cloning of genes
Amidase encoding open reading frames amdA and amdB were identified by inspection of the Penicillium chrysogenum genome sequence of the Wisconsin54-1255 strain by those skilled in the art. They were PCR amplified from Penicillium chrysogenum DNA using Herculase® Hotstart DNA Polymerase (Stratagene). SEQ ID NO 1 1 and 12 were used to amplify the ORF encoding SEQ ID NO 3; SEQ ID NO 13 and 14 were used to amplify the ORF encoding SEQ ID NO 6. The amplified amidase genes were cloned in pCR®-Blunt II-TOPO® (Invitrogen) and sequence verified before further processing. After digestion with Ase\ en Sbft, fragments were cloned in pGATWAn digested with Nde\ en Λ/s/1, resulting in two new expression vectors pGPcamdAcA and pGPcamdBcA, respectively (see Figures 5 and 6).
Acetamide selection
Two μg of both vectors was transformed to Penicillium chrysogenum. Transformants were selected on media with acetamide as the sole nitrogen source. To make sure that stable transformants were obtained, first round positives were colony purified on fresh acetamide plates and subsequently transferred to non-selective, rich media (YEPD) to induce sporulation. Afterwards all colonies were again tested on acetamide media. As stable transformants were obtained, we concluded that both amdA and amdB could function as an acetamidase.
Fluoroacetamide selection
Spores of the stable amidase transformants were used to compare growth on acetamide and fluoroacetamide media with the Aspergillus nidulans amdS (see example 2). For this a spore solution was made in physiological salt (0.9 mM NaCI) and several dilutions
were spotted on both media. As demonstrated in Figure 7, both Penicillium chrysogenum amidase genes are supporting growth on acetamide in all dilutions and show no growth in all dilutions on fluoroacetamide plates, demonstrating that the reverse selection using the Penicillium chrysogenum amdA and amdB genes is very tight.
Alternative splicing of Penicillium chrysogenum amdB
The ORF of amdB was amplified from cDNA and after sequencing found to retain one of the predicted introns. Still, this clone was able to give acetamide positive transformants and therefore would be functionally active as an acetamidase. To assess which form of the mRNA was active in the transformants a rtPCR was performed on several transformants. Cells from acetamide plates (inducing conditions) were thereto resuspended in water and RNA was isolated using the StrataPrep® Total RNA Microprep kit (Stratagene). rtPCR was performed with the Superscript™ III One-Step RT-PCR System (Invitrogen). The results indicate that in all transformants both mRNA variants are present (see Figure 8).
Claims
1. A polypeptide selected from the group consisting of a polypeptide having an amino acid sequence according to SEQ ID NO 3, a polypeptide having an amino acid
5 sequence according to SEQ ID NO 6, a polypeptide having an amino acid that is substantially homologous to the sequence of SEQ ID NO 3 and a polypeptide having an amino acid that is substantially homologous to the sequence of SEQ ID NO 6, the polypeptide displaying acetamidase activity and providing a reverse selection on fluoroacetamide with an efficiency of at least 95%, preferably at least 96%, more o preferably at least 97%, more preferably at least 98%, more preferably at least 99%, most preferably 100%.
2. A polynucleotide comprising a DNA sequence encoding the polypeptide of claim 1.
5
3. The polynucleotide of claim 2 that is SEQ ID NO 1 , 2, 4 or 5.
4. Strain comprising the polynucleotide of claim 2 or 3 wherein said strain is not Penicillium chrysogenum. o
5. Use of the polynucleotide of claim 2 or 3 as a bi-directional selection marker.
6. A method for the preparation of a strain expressing a gene of interest comprising: (i) transforming a host cell of interest with a gene of interest and/or with a DNA
5 sequence affecting expression of a gene of interest, and with the polynucleotide of claim 2 or 3,
(ii) selecting clones of transformed cells for their capacity to grow on acetamide as the sole nitrogen and/or carbon source.
o
7. Method according to claim 6 further comprising deletion of the polynucleotide of claims 2 or 3 from transformed clones by reverse selection on fluoroacetamide.
8. A method for producing a compound of interest comprising: (i) transforming a host cell of interest with a gene of interest and/or with a DNA sequence affecting expression of a gene of interest, and with the polynucleotide of claims 2 or 3,
(ii) selecting clones of transformed cells for their capacity to grow on acetamide as the sole nitrogen and/or carbon source.
9. Method according to claim 8 further comprising deletion of the polynucleotide of claims 2 or 3 from transformed clones by reverse selection on fluoroacetamide
10. A method according to any one of claims 5-8 wherein said host cell of interest is a fungal cell.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/226,282 US7951568B2 (en) | 2006-04-13 | 2007-04-12 | Bi-directional selection markers with improved activity |
EP07728017A EP2007893A1 (en) | 2006-04-13 | 2007-04-12 | Bi-directional selection markers with improved activity |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP06112618 | 2006-04-13 | ||
EP06112618.1 | 2006-04-13 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007118836A1 true WO2007118836A1 (en) | 2007-10-25 |
Family
ID=38462024
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2007/053549 WO2007118836A1 (en) | 2006-04-13 | 2007-04-12 | Bi-directional selection markers with improved activity |
Country Status (3)
Country | Link |
---|---|
US (1) | US7951568B2 (en) |
EP (1) | EP2007893A1 (en) |
WO (1) | WO2007118836A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2123772A1 (en) | 2008-04-29 | 2009-11-25 | DSM IP Assets B.V. | Beta-lactam antibiotic producing strains |
EP2392649A2 (en) | 2008-08-05 | 2011-12-07 | DSM IP Assets B.V. | Adipoyl-7-ADCA producing strains |
WO2024118881A1 (en) | 2022-12-01 | 2024-06-06 | Genencor International Bv | Iterative muliplex genome engineering in microbial cells using a bidirectional selection marker system |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997006261A2 (en) * | 1995-08-03 | 1997-02-20 | Gist-Brocades B.V. | THE USE OF HOMOLOGOUS amdS GENES AS SELECTABLE MARKERS |
WO2006040358A2 (en) * | 2004-10-15 | 2006-04-20 | Dsm Ip Assets B.V. | Homologous amds genes as selectable marker |
-
2007
- 2007-04-12 WO PCT/EP2007/053549 patent/WO2007118836A1/en active Application Filing
- 2007-04-12 EP EP07728017A patent/EP2007893A1/en not_active Withdrawn
- 2007-04-12 US US12/226,282 patent/US7951568B2/en not_active Expired - Fee Related
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997006261A2 (en) * | 1995-08-03 | 1997-02-20 | Gist-Brocades B.V. | THE USE OF HOMOLOGOUS amdS GENES AS SELECTABLE MARKERS |
WO2006040358A2 (en) * | 2004-10-15 | 2006-04-20 | Dsm Ip Assets B.V. | Homologous amds genes as selectable marker |
Non-Patent Citations (3)
Title |
---|
CORRICK C M ET AL: "THE NUCLEOTIDE SEQUENCE OF THE AMDS GENE OF ASPERGILLUS NIDULANS AN THE MOLECULAR CHARACTERIZATION OF 5' MUTATIONS", GENE, ELSEVIER, AMSTERDAM, NL, vol. 53, 1987, pages 63 - 71, XP002906320, ISSN: 0378-1119 * |
DATABASE UniProt 7 March 2006 (2006-03-07), MACHIDA M ET AL: "Amidases", XP002450282, Database accession no. Q2PIS7 * |
DEBETS A J M ET AL: "GENETIC ANALYSIS OF AMDS TRANSFORMANTS OF ASPERGILLUS NIGER AND THEIR USE IN CHROMOSOME MAPPING", MOLECULAR AND GENERAL GENETICS, SPRINGER VERLAG, BERLIN,, DE, vol. 222, no. 2/3, 1 July 1990 (1990-07-01), pages 284 - 290, XP000609458, ISSN: 0026-8925 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2123772A1 (en) | 2008-04-29 | 2009-11-25 | DSM IP Assets B.V. | Beta-lactam antibiotic producing strains |
EP2392649A2 (en) | 2008-08-05 | 2011-12-07 | DSM IP Assets B.V. | Adipoyl-7-ADCA producing strains |
WO2024118881A1 (en) | 2022-12-01 | 2024-06-06 | Genencor International Bv | Iterative muliplex genome engineering in microbial cells using a bidirectional selection marker system |
Also Published As
Publication number | Publication date |
---|---|
US7951568B2 (en) | 2011-05-31 |
EP2007893A1 (en) | 2008-12-31 |
US20090246826A1 (en) | 2009-10-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1756145B1 (en) | Production of polypeptides by improved secretion | |
US8642290B2 (en) | Fungal transcriptional activators useful in methods for producing a polypeptide | |
EP1266011B1 (en) | Fungal transcriptional activator useful in methods for producing polypeptides | |
US20080227148A1 (en) | Aspergillus Promotors for Expressing a Gene in a Fungal Cell | |
DK2683732T3 (en) | Vector-host-system | |
AU2005293516B2 (en) | Homologous amdS genes as selectable marker | |
JP2007532124A (en) | Fungal promoter for gene expression in fungal cells | |
US20170313997A1 (en) | Filamentous Fungal Double-Mutant Host Cells | |
JP2021521821A (en) | Filamentous strain containing phenotype with reduced viscosity | |
US20130189733A1 (en) | Production of secreted proteins by filamentous fungi | |
US7951568B2 (en) | Bi-directional selection markers with improved activity | |
DK2981546T3 (en) | Filamentous fungal cell with inactivated component of the selective autophagy signaling pathway and method of use thereof | |
JP4756125B2 (en) | Novel mutants and selectable markers of Aspergillus oryzae | |
JP2018533932A (en) | New episomal plasmid vector | |
DK2646558T3 (en) | Promoters for expression of genes in a fungal cell | |
JP2000116392A (en) | Novel vector | |
JPH10506787A (en) | Gene encoding a signal recognition particle of Aspergillus niger | |
JP2006075123A (en) | Promoter dna originated from candida utilis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07728017 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007728017 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 12226282 Country of ref document: US |