WO2010144283A1 - Bacillus strain for increased protein production - Google Patents

Bacillus strain for increased protein production Download PDF

Info

Publication number
WO2010144283A1
WO2010144283A1 PCT/US2010/037040 US2010037040W WO2010144283A1 WO 2010144283 A1 WO2010144283 A1 WO 2010144283A1 US 2010037040 W US2010037040 W US 2010037040W WO 2010144283 A1 WO2010144283 A1 WO 2010144283A1
Authority
WO
WIPO (PCT)
Prior art keywords
host cell
bacillus
gene
protein
interest
Prior art date
Application number
PCT/US2010/037040
Other languages
French (fr)
Inventor
Cristina Bongiorni
Original Assignee
Danisco Us Inc.
Ferrari, Eugenio
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Danisco Us Inc., Ferrari, Eugenio filed Critical Danisco Us Inc.
Priority to CN201080026055.XA priority Critical patent/CN102803290B/en
Priority to EP10721097.3A priority patent/EP2440573B1/en
Priority to DK10721097.3T priority patent/DK2440573T3/en
Publication of WO2010144283A1 publication Critical patent/WO2010144283A1/en
Priority to US13/622,968 priority patent/US8476042B2/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/32Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1205Phosphotransferases with an alcohol group as acceptor (2.7.1), e.g. protein kinases

Definitions

  • the present invention provides host cells that have been genetically manipulated to have an enhanced capacity to produce proteins of interest.
  • the invention relates to modified Bacillus sp. host cells that have at least one inactivated phr and/or rap gene.
  • the enhanced production of proteins of interest by the modified Bacillus sp. host cells is further increased in modified Bacillus sp. host cells that overexpress YmaH.
  • exogenous polypeptides of interest for expression and production of large quantities of the desired polypeptides.
  • the methods are used to produce amounts of polypeptide over what would be produced naturally by the originating organism. Indeed, expression of exogenous nucleic acid sequences, as well as over-expression of endogenous sequences have been extensively used in modern biotechnology.
  • the present invention provides host cells that have been genetically manipulated to have an enhanced capacity to produce proteins of interest.
  • the invention relates to modified Bacillus sp. host cells that have at least one inactivated phr and/or rap
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, 5 and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host
  • the inactivated rap gene is the rapA gene.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene, (e.g., rapA gene), and a recombinant nucleic acid for
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell.
  • the at least one inactivated phr gene is chosen from phr A, phrE, phrC, phrF, phrG, phrl, and phrK.
  • the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene.
  • the protein of interest is an 3
  • protease e.g., a subtilisin
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a protein of interest at a level that is 5 greater than that produced by the unmodified precursor host cell.
  • the at least one inactivated phr gene is chosen from phr A, phrE, phrC, phrF, phrG, phrl au ⁇ phrK.
  • the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene.
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • a protease e.g., a subtilisin.
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene, and a recombinant nucleic acid for producing a protein
  • the inactivated rap gene is the rapA gene
  • the at least one inactivated phr gene is chosen from phr A, phrE, phrC, phrF, phrG, phrl and phrK.
  • the inactivated phr gene is the inactivated phrA gene
  • the inactivated phr gene is the phrE gene.
  • the protein of interest is an
  • protease e.g., a subtilisin
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene (e.g., rapA gene), and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the at least one inactivated phr gene is chosen from phrA, phrE, phrC, phrF, phrG, phrl and phrK.
  • the inactivated phr gene is the inactivated phrA or phrE gene.
  • a protease e.g., a subtilisin
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has an inactivated phrA gene and an inactivated phrE gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • a protease e.g., a subtilisin
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has an inactivated phrA gene and an inactivated phrE gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell.
  • the recombinant nucleic acid comprises a promoter that is operably linked to the 5 polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has an inactivated phrA gene, an
  • the invention provides a modified Bacillus sp. host cell
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that over expresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and
  • a protease e.g., a subtilisin
  • the invention provides a modified Bacillus sp. host cell 5
  • the inactivated rap gene is the rapA gene.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene (e.g., rapA gene), and a recombinant nucleic acid for producing a protein of interest at a level that is greater than
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell
  • the at least one inactivated phr gene is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK.
  • the inactivated phr gene is the inactivated phr A
  • the inactivated phr gene is the phrE gene.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a protein
  • the at least one inactivated phr gene is chosen from phr A, phrE, phrC, phrF, phrG, phrl, and phrK.
  • the inactivated phr gene is the inactivated phr A gene, while, other embodiments the inactivated phr gene is the phrE gene.
  • the recombinant nucleic acid comprises a promoter that is operably linked to the
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has
  • the inactivated rap gene is the rapA gene
  • the at least one inactivated phr gene is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK.
  • the inactivated phr gene is the inactivated phrA gene
  • the inactivated phr gene is the phrE gene.
  • the protein of 5 interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene (e.g., rapA gene), and a recombinant nucleic acid for producing a protein of interest at a level that is greater than
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the at least one inactivated phr gene is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK.
  • the inactivated phr gene is the inactivated phrA or phrE gene.
  • 15 protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • a protease e.g., a subtilisin
  • the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has an inactivated phrA gene and an inactivated phrE gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has an inactivated phrA gene and an inactivated phrE gene, and a recombinant nucleic acid
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
  • the promoter is the wild-type or mutant aprE promoter.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has an inactivated phrA gene, an inactivated phrE gene, an inactivated rapA gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell.
  • the protein of interest is an enzyme
  • a protease e.g., a subtilisin
  • the invention provides a modified Bacillus sp. host cell 7
  • the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell an
  • inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of at least one indigenous phr and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said precursor host cell.
  • the method further comprises recovering the protein of interest.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the host cell comprises a mutation in at least one gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR.
  • the host cell comprises a deg(Hy)32 mutation.
  • the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of at least one indigenous phr and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein
  • the at least one indigenous phr gene that is inactivated is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK.
  • the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene.
  • the method further comprises recovering the protein of interest.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the host cell comprises a mutation in at least one gene chosen from degil, degQ, degS, sco4, spollE, degQ and degR.
  • the host cell comprises a
  • the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of the indigenous phrA and phrE genes and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable 5 conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said precursor host cell.
  • the method further comprises recovering the protein of interest.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the host cell comprises a mutation in at least one gene chosen from degU,
  • the host cell comprises a deg(Hy)32 mutation.
  • the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell an inactivating DNA construct comprising an inactivating polynucleotide that results in the
  • the method further comprises recovering the protein of interest.
  • the protein of interest is an enzyme
  • the host cell comprises a mutation in at least one gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR.
  • the host cell comprises a deg(Hy)32 mutation.
  • the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell that
  • an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of at least one indigenous phr and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said
  • the method further comprises recovering the protein of interest.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the host cell comprises a mutation in at least one gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR.
  • the host cell comprises a deg(Hy)32 mutation. Overexpression of YmaH is achieved by introducing
  • a SigH construct e.g., SEQ ID NO:23
  • a SigA construct e.g., SEQ ID NOS:26 and 31
  • a SigA promoter operably linked to a polynucleotide encoding YmaH.
  • the invention provides a method for producing a protein of 5 interest in a host cell that comprises introducing into a precursor Bacillus sp.
  • an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of at least one indigenous phr and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein production of a protein of interest is greater in said
  • the at least one indigenous phr gene that is inactivated is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK.
  • the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a
  • the method further comprises recovering the protein of interest.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the host cell comprises a mutation in at least one gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR.
  • the host cell comprises a deg(Hy)32 mutation. Overexpression of YmaH is achieved by introducing
  • SigH construct e.g., SEQ ID NO:23
  • SigA construct e.g., SEQ ID NOS:26 and 31
  • SigA promoter operably linked to a polynucleotide encoding YmaH.
  • the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell that overexpresses YmaH, an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of the indigenous phrA and phrE genes and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified
  • the method further comprises recovering the protein of interest.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
  • the host cell comprises a mutation in at least one
  • the host cell comprises a deg(Hy)32 mutation.
  • Overexpression of YmaH is achieved by 10
  • a SigH construct e.g., SEQ ID NO:23
  • a SigH promoter operably linked to a polynucleotide encoding a YmaH protein.
  • overexpression of YmaH is achieved by introducing into either the precursor or the modified host cell a SigA construct (e.g., SEQ ID NOS:26 and 5 31 ), comprising a SigA promoter operably linked to a polynucleotide encoding YmaH.
  • the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell that overexpresses YmaH, an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of the indigenous phrA and rap genes to
  • the method further comprises recovering the protein of interest.
  • the protein of interest is an enzyme, and preferably, a protease (e.g., a
  • the host cell comprises a mutation in at least one gene chosen from degil, degQ, degS, sco4, spollE, degQ and degR.
  • the host cell comprises a deg(Hy)32 mutation.
  • Overexpression of YmaH is achieved by introducing into either the precursor or the modified host cell a SigH construct (e.g., SEQ ID NO:23), comprising a SigH promoter operably linked to a polynucleotide encoding a YmaH protein.
  • overexpression of YmaH is achieved by introducing into either the precursor or the modified host cell a SigA construct (e.g., SEQ ID NOS:26 and 31 ), comprising a SigA promoter operably linked to a polynucleotide encoding YmaH. protein.
  • a SigA construct e.g., SEQ ID NOS:26 and 31
  • the Bacillus sp. host cell of the embodiments described is a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus
  • Bacillus sp. host cell of the embodiments described is a Bacillus subtilis host cell.
  • the present invention provides isolated host cells, as well as cells in culture.
  • the present invention provides a host cell comprising a rap operon comprising at least one inactivated phr and/or at least one inactivated rap gene.
  • the host cell overexpresses YmaH.
  • the host cell further comprises a recombinant nucleic acid.
  • the host cell further comprises a polynucleotide sequence encoding a protein of interest.
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence encoding a protein of interest.
  • the promoter is the wild-type or a mutant aprE promoter.
  • the host cell is a Bacillus sp. host cell.
  • the Bacillus sp. host cell is Bacillus subtilis.
  • the host cell produces the protein of interest at a level that is greater than 5 that produced by a host cell that does not comprise at least one inactivated phr and/or rap gene.
  • the protein of interest is an enzyme.
  • the enzyme is a protease.
  • the at least one inactivated rap gene is the rapk gene.
  • the at least one inactivated phr gene is selected from phr A, phrE, phrC, phrF, phrG, phrl, and
  • the at least one inactivated phrgeue is phrA
  • the at least one inactivated phr gene is phrE.
  • the host cell comprises at least one inactivated phr gene and at least one inactivated rap gene.
  • the inactivated rap gene is the rapA gene.
  • the 15 gene e.g., rapA
  • at least one inactivated phr gene selected from phr A, phrE, phrC, phrF, phrG, phrl, and phrK.
  • the at least one inactivated phr gene is phr A, while in some alternative embodiments, the at least one inactivated phr gene is phrE.
  • the host cell comprises an inactivated phrA gene, an inactivated phrE gene, an inactivated rapA gene, and a recombinant nucleic acid
  • the protein of interest is an enzyme.
  • the enzyme is a protease.
  • the host cell is a Bacillus sp. host cell.
  • the Bacillus sp. host cell is a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus,
  • Bacillus licheniformis Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus,
  • Bacillus subtilis or Bacillus thuringiensis cell.
  • Bacillus sp. host cell is a Bacillus subtilis host cell.
  • the present invention also provides methods for producing at least one protein of interest comprising providing a precursor host cell and an inactivating nucleotide construct
  • the protein of interest is encoded by a recombinant nucleic acid
  • the protein of interest is encoded by a recombinant nucleic acid present in the modified host 12
  • the protein of interest is encoded by a recombinant nucleic acid present in the precursor host cell and/or the modified host cell.
  • the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence encoding the protein of 5 interest.
  • the protein of interest is a wild-type protein of interest.
  • the precursor host cell naturally produces the protein of interest.
  • the production of the protein of interest by the modified host cell is greater than the production of the protein of interest by the
  • the methods further comprise the step of recovering the protein of interest.
  • the protein of interest is an enzyme.
  • the enzyme is a protease.
  • the modified host cell comprises a mutation in at least one gene
  • the host cell comprises a deg(Hy)32 mutation.
  • the at least one indigenous phr gene that is inactivated is chosen from phrA, phrE, phrC, phrF, phrG, phrl,an ⁇ phrK.
  • the inactivating polynucleotide inactivates
  • the indigenous phrA and phrE genes and/or rap gene are the indigenous phrA and phrE genes and/or rap gene.
  • the at least one indigenous phr gene is phrA, while in some alternative embodiments, the at least one indigenous phr gene is phrE.
  • the indigenous rap gene is inactivated.
  • the indigenous rap gene is rapA.
  • the precursor or modified host cell overexpresses YmaH.
  • the overexpression of YmaH is achieved by introducing a SigH construct into the precursor or the modified host cell.
  • the SigH construct comprises SEQ ID NO:23, comprising a SigH promoter operably linked to a
  • the overexpression of YmaH is achieved by introducing a SigA construct into the precursor or said modified host cell.
  • the SigA construct comprises SEQ ID NO:26 and/or 31 , comprising a SigA promoter operably linked to a polynucleotide encoding YmaH.
  • the host cell is a Bacillus sp. host cell.
  • the Bacillus sp. host cell is a Bacillus alkalophilus, 13
  • Bacillus amyloliquefaciens Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megate ⁇ um, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis cell.
  • the Bacillus 5 sp. host cell is a Bacillus subtilis cell.
  • Figure 1 illustrates the arrangement of phr and rap genes in the Bacillus subtilis
  • Figure 2 schematically illustrates features common to the inactivation cassettes used to delete phr genes in Bacillus subtilis.
  • Figure 3 shows the production of the AprE protease in the modified Bacillus subtilis strains that comprise a deletion of the phr A, phrE, phrC, phrF, phrG, phrl, and
  • Figure 4 is a graph showing the production of AprE in the control Bacillus subtilis parent strain BG2942 (diamonds) and in the modified Bacillus subtilis strains CB2-1 (squares) and CB2-2 (triangles), which respectively contain the deletion of the phrA and the phrE gene.
  • Figure 5 is a graph showing the production of the protease FNA in the parent B. subtilis strain CF471 (diamonds), and in the modified Bacillus subtilis strains CB3-48 (squares) and CB3-47 (triangles), which respectively contain the deletion of the phrE and the phrA gene.
  • Figure 6 is a graph showing a combined effect of deleting phrA and phrE gene on
  • Figure 7 illustrates the location of primers used for generating polynucleotide constructs used to overexpress YmaH in Bacillus subtilis.
  • Panels B-E show the position of the primers used to generate construct SigH (panel B), and SigA constructs SigA1 (panel C), SigA2 (panel D) and SigA3 (panel E) relative to the Bacillus chromosomal
  • Primer pairs P4 - P5 and P6 - P7 are fusion primers, which comprise a "tail'Of base pairs at their 5' end that are homologous to the sequence being directly amplified, and are complementary to each other.
  • the complemetary tails of the fusion primers allow fusion of the amplified Sigma A promoter DNA to the amplified YmaH-encoding DNA to 5 obtain chimeric polynucleotides containing the Sigma A promoter sequence adjacent to the YmaH-encoding sequence while deleting most, or all, of the miaA coding sequence.
  • Figure 8 shows the polynucleotide sequence of a portion of the Bacillus subtilis genome that comprises the sequence defining a sigA promoter to the end of the sequence encoding the YmaH protein (SEQ ID NO:101 ). This sequence is diagrammed
  • Figure 9 shows a map of the plasmid pBS19-ymaH sigH.
  • FIG. 15 [051]
  • Panel A shows a graph of the proteolytic activity of subtilisin produced by Bacillus control host cells (42pBS) and by Bacillus subtilis host cells that overexpress ymaH (42SigA1 and 42SigH).
  • Panel B shows the subtilisin activity produced by Bacillus control host cells (41 pBS) and by Bacillus subtilis host cells that overexpress ymaH (41 SigH). The proteolytic activity was measured as the increase in absorbance at
  • the level of enzymatic activity is indicative of the effect of overexpressing ymaH on the production of subtilisin by Bacillus host cells.
  • Figure 1 1 shows the level of production of subtilisin by Bacillus subtilis control host cells 42pBS19 and by Bacillus host cells 42SigH and 42SigA1 , which overexpress ymaH.
  • Figure 12 is a graph showing a synergistic effect of phr deletion and YamH over- expression (using multicopy plasmid pBS19-ymaH sigH) on AprE expression.
  • the effect of overexpression of YmaH is shown in the Bacillus subtilis strain named YmaH (squares), and in the modified strains CB2-1 1 (triangles) and CB2-12 (crosses), which respectively contain a deletion of the phrA and phrE gene, and is compared to the
  • Figure 13 schematically illustrates the DNA construct used to delete the rapA gene.
  • Figure 14 shows the level of production of subtilisin FNA by Bacillus subtilis control host cells CF471 (filled diamond), the modified Bacillus subtilis cells CB3-47 (filled diamond).
  • JS1 121 (open triangle) comprising an inactivated rapA gene and an inactivated phrA 15
  • Figure 15 shows the effect of deleting the phrH gene (filled square, panel A) or the rapH gene (filled square, panel B) genes on the production of AprE by Bacillus subtilis. 5
  • the present invention provides host cells that have been genetically manipulated to have an enhanced capacity to produce proteins of interest.
  • the invention relates to modified Bacillus sp. host cells that have at least one inactivated phr and/or rap
  • compositions and methods of the present invention are not limited to serine
  • proteases 15 proteases. Indeed, the present invention finds use in improving the production of various classes of enzymes as well as proteases (e.g., amylases, cellulases, oxidases, oxidoreductases, cutinases, mannanases, pectinases, amylases, lipases, etc). Indeed, it is not intended that the present invention be limited to any particular enzyme nor class of enzyme.
  • proteases e.g., amylases, cellulases, oxidases, oxidoreductases, cutinases, mannanases, pectinases, amylases, lipases, etc. Indeed, it is not intended that the present invention be limited to any particular enzyme nor class of enzyme.
  • nucleic acids 5 are written left to right in 5' to 3' orientation and amino acid sequences are written left to right in amino to carboxy orientation, respectively.
  • Every maximum numerical limitation given throughout this specification includes every lower numerical limitation, as if such lower numerical limitations were expressly written herein. Every minimum numerical limitation given throughout this specification will include every higher numerical limitation, as if such higher numerical limitations were expressly written herein. Every numerical range given throughout this specification will include every
  • a "modified host cell” is a recombinant host cell that contains at least one inactivated phr and/or a rap gene.
  • a modified host cell is derived from a
  • precursor host cell which can be a wild-type or a recombinant precursor host cell comprising a phr gene that is not inactivated.
  • recombinant host cell refers to a cell that has been modified by the introduction of at least one recombinant/heterologous nucleic acid.
  • recombinant host cells express genes that are not found in identical form within the parent
  • recombinant polynucleotide and “recombinant polypeptide” respectively refer to a polynucleotide and a polypeptide that do not naturally 5 occur in a host cell.
  • a recombinant polynucleotide or polypeptide molecule may contain two or more naturally-occurring sequences that are linked together in a way that does not occur naturally.
  • “Recombination, "recombining,” or generating a “recombined” or “recombinant” nucleic acid is generally the assembly of two or more nucleic acid fragments wherein the assembly gives rise to a chimeric gene.
  • the term'Yecombinant when used in reference to a cell means a cell that has been modified by the introduction of a heterologous nucleic acid sequence or that the cell is derived from a cell so modified.
  • recombinant cells express genes that are not found in identical form within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under
  • an "analogous sequence” is a primary biological sequence, such as the amino-acid sequence or the nucleotides of DNA sequences wherein the function of the protein or encoded protein is essentially the same as that designated for Phr, Rap and YmaH proteins recited herein. Additionally, analogous proteins have at least about 60%,
  • PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, pairwise alignments. It can also plot a tree showing the clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive alignment method
  • Useful PILEUP parameters including a default gap weight of 3.00, a default gap length weight of 0.10, and weighted end gaps.
  • Another example of a useful algorithm is the BLAST algorithm, described by Altschul et al., (Altschul et al., J. MoI. Biol., 215:403-
  • a particularly useful BLAST program is the WU-BLAST-2 program (See, Altschul et al., 18
  • WU-BLAST-2 uses several search parameters, most of which are set to the default values.
  • the HSP S and HSP S2 parameters are dynamic values and are established by the program 5 itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched. However, the values may be adjusted to increase sensitivity.
  • a % amino acid sequence identity value is determined by the number of matching identical residues divided by the total number of residues of the "longer" sequence in the aligned region. The "longer"
  • polynucleotide or to a polypeptide sequence is defined as the percentage of nucleotide or amino acid residues in a candidate sequence that are identical with the nucleotide or amino acid residues of a starting sequence (i.e., the sequence of interest).
  • the percent identity shared by polynucleotide or polypeptide sequences is determined by direct comparison of the sequence information between the molecules by aligning the
  • the alignment includes the introduction of gaps in the sequences to be aligned.
  • sequences which contain either more or fewer nucleotides or amino acids than those of the candidate polynucleotide or polypeptide sequences it is understood that the percentage of homology will be determined based on the number of
  • homologous nucleotides or amino acids in relation to the total number of nucleotides or amino acids.
  • homology refers to sequence similarity or identity, with identity being preferred. This homology is determined using standard techniques known in the art (See e.g., Smith and Waterman, Adv. Appl. Math., 2:482 [1981 ]; Needleman and Wunsch, J. MoI. Biol., 48:443 [1970]; Pearson and Lipman, Proc. Natl. Acad. Sci. USA
  • heterologous refers to elements that are not normally associated with each other. For example, if a host cell produces a heterologous protein,
  • a promoter that is operably linked to a heterologous coding sequence is a promoter that is 19
  • a "protein of interest,” or “polypeptide of interest,” refers to a protein that is expressed/produced by a host cell. Generally, proteins of interest are desirable proteins that have commercial significance. The protein of interest may be 5 either homologous or heterologous to the host.
  • the protein of interest is a secreted polypeptide, particularly an enzyme, including but not limited to amylolytic enzymes, proteolytic enzymes, cellulytic enzymes, oxidoreductase enzymes and plant wall degrading enzymes. In further embodiments, these enzyme include, but are not limited to amylases, proteases, xylanases, lipases, laccases, phenol oxidases,
  • the expressed polypeptide is a hormone, cytokine, growth factor, receptor, vaccine, antibody, or the like. While it is not intended that the present invention be limited to any particular protein/polypeptide, in
  • the expressed protein of interest is a protease.
  • proteolytic activity refers to a protein or peptide exhibiting the ability to hydrolyze peptides or substrates having peptide linkages.
  • Many well known procedures exist for measuring proteolytic activity Kalisz, "Microbial Proteinases,” In: Fiechter (ed.), Advances in Biochemical Engineering/Biotechnology,
  • proteolytic activity may be ascertained by comparative assays which analyze the respective protease's ability to hydrolyze a commercial substrate.
  • Exemplary substrates useful in the analysis of protease or proteolytic activity include, but are not limited to di-methyl casein (Sigma C-9801 ), bovine collagen (Sigma C-9879), bovine elastin (Sigma E-1625), and bovine keratin (ICN Biomedical 9021 1 1 ).
  • the pNA assay (See e.g., Del Mar et al., Anal. Biochem., 99:316-320 [1979]) also finds use in determining the active enzyme concentration for fractions collected during gradient elution. This assay measures the rate at which p-nitroaniline is released as the enzyme
  • sAAPF-pNA succinyl-alanine-alanine-proline- phenylalanine-p-nitroanilide
  • subtilisin refers to a protease belonging to the group of 20
  • subtilisins which initiate the nucleophilic attack on the peptide bond through a serine residue at the active site (serine endopeptidase).
  • Subtilisins are secreted in large amounts from many Bacillus species.
  • FNA which is subtilisin BPN' containing the Y217L subtitution, is a subtilisin obatained from Bacillus amyloliquefaciens, 5 and AprE is the subtilisin obtained from Bacillus subtilis.
  • deletion of a gene refers to deletion of the entire coding sequence, deletion of part of the coding sequence, or deletion of the coding sequence including flanking regions.
  • the deletion may be partial as long as the sequences left in the chromosome provides the desired loss of the biological activity of the gene.
  • flanking regions of the coding sequence may include from about 1 bp to about 500 bp at the 5' and 3' ends.
  • the flanking region may be larger than 500 bp but will preferably not include other genes in the region which may be inactivated or deleted according to the invention. The end result is that the deleted gene is effectively non-functional.
  • a "deletion" is defined as a change in either nucleotide or amino acid sequence in
  • deletion of a phr gene provides enhanced expression of a protein of interest (e.g., a protease). [078] In some embodiments, deletion of one or more of genes selected from the group
  • a "corresponding unmodified Bacillus strain" or "parent” or “precursor” Bacillus sp. host cell is the originating host strain from which the indigenous chromosomal region (e.g., phrA and/or phrE gene), is inactivated and from which the indigenous chromosomal region (e.g., phrA and/or phrE gene), is inactivated and from which the indigenous chromosomal region (e.g., phrA and/or phrE gene), is inactivated and from which the indigenous chromosomal region (e.g., phrA and/or phrE gene), is inactivated and from which the indigenous chromosomal region (e.g., phrA and/or phrE gene), is inactivated and from which the indigenous chromosomal region (e.g., phrA and/or phrE gene), is inactivated and from
  • a polypeptide is "overexpressed" in a recombinant host cell if the polypeptide is expressed in the recombinant cell at a higher level that the level at which it is expressed in the precursor cell.
  • polynucleotide or protein refers to a polynucleotide or protein that occurs naturally in a host cell.
  • polypeptide refers to a compound made up of amino acid residues linked by peptide bonds.
  • protein as used herein, may be synonymous with the term “polypeptide” or may refer, in addition, to a complex of two or
  • chimeric polypeptide and “fusion polypeptide” are used interchangeably to refer to a protein that comprises at least two separate and distinct regions that may or may not originate from the same protein.
  • a signal peptide linked to the protein of interest wherein the signal peptide is not normally 5 associated with the protein of interest would be termed a chimeric polypeptide or chimeric protein.
  • a "signal sequence” is a sequence of amino acids present at the N-terminal portion of a protein which facilitates the secretion of the mature form of the protein outside the cell.
  • the definition of a signal sequence is a functional one.
  • a "prosequence” is an amino acid sequence between the signal sequence and mature protease that is necessary for the secretion of the protease. Cleavage of the pro sequence results in a mature active protease.
  • signal sequence refers to any sequence of nucleotides and/or amino acids which may participate in the secretion of the mature or precursor forms of the protein.
  • This definition of signal sequence is a functional one, meant to include all those amino acid sequences encoded by the N-terminal portion of the protein gene, which participate in the effectuation of the secretion of protein. They are
  • the signal sequence may be endogenous or exogenous.
  • the signal sequence may be that normally associated with the protein (e.g., protease), or may be from a gene encoding another secreted protein.
  • One exemplary exogenous signal sequence comprises the first seven amino acid residues of the signal sequence
  • aprE promoter refers to the polynucleotide promoter sequence that naturally drives the expression of subtilisin in B. subtilis (Ferrari et ai, J Bacteriol. 170:289-295 [1988]).
  • an aprE promoter herein refers to
  • the aprE promoter includes the nucleotide sequences necessary for the transcriptional regulation exerted by DegU, ScoC, AbrB and any other regulator of such promoter, and/or the aprE transcriptional leader (Hambraeus et ai, Microbiology 148:1795-1803 [2002]). In some alternative embodiments, the aprE promoter does not include all of the nucleotide
  • an "inactivated gene” is a locus of a genome that, prior to its inactivation, was capable of producing a protein (i.e., capable of being transcribed into an RNA that could be translated to produce a full length polypeptide).
  • a gene encoding a polypeptide is inactivated when it not transcribed and translated into a full length protein 5 that has biological activity (e.g., catalytic activity, in the case of an enzyme).
  • a gene may be inactivated by altering a sequence required for its transcription, for example by altering a sequence required for RNA processing (e.g., poly-A tail addition), by altering a sequence required for translation, or by altering the amino acid sequence of the encoded polypeptide (e.g., by a nucleotide substitution, etc).
  • inactivated genes include
  • a gene may also be inactivated by altering or deleting the sequence of the adjacent gene in an operon.
  • a gene may also be inactivated using antisense or any other method that abolishes expression of that gene.
  • nucleic acid encompasses DNA, RNA, whether single stranded or double stranded, and encompasses chemically modified DNA or RNA.
  • nucleic acid and polynucleotide are used interchangeably herein.
  • activation includes any method that prevents the functional expression of one or more of the phr genes (phrA, phrC, ph ⁇ , phrf, ph ⁇ , and phrK),
  • an altered/recombinant Bacillus strain comprises inactivation of one or more genes that results preferably in stable and non-reverting
  • inactivation is achieved by deletion.
  • the gene is deleted by homologous recombination.
  • an inactivating DNA construct comprising an incoming sequence having a selective marker flanked on each side by a homology box is used.
  • the homology box comprises nucleotide sequences homologous
  • the altered/recombinant cell is a Bacillus sp. host cell that comprises two inactivated genes (e.g., phrA andphrE). In other embodiments, the
  • Bacillus sp. host cell comprises three inactivated genes, four inactivated genes, five inactivated genes, six inactivated genes, or more. Thus, it is not intended that the number 23
  • inactivated genes be limited to any particular number of genes.
  • the inactivated genes are contiguous to each another, while in other embodiments, they are located in separate regions of the Bacillus chromosome.
  • an inactivated chromosomal gene has a necessary function under certain conditions, but the 5 gene is not necessary for Bacillus strain viability under laboratory conditions.
  • Preferred laboratory conditions include but are not limited to conditions such as growth in a fermenter, in a shake flask, plated media, etc., suitable for the growth of the microorganism.
  • polynucleotide and “deletion cassette” are used interchangeably to refer to a DNA construct comprising a non-functional sequence that may be inserted into a gene to disrupt the function of the gene.
  • the inactivating DNA construct comprises a sequence encoding a selective marker.
  • the inactivating DNA construct may also include two homology boxes.
  • expression cassette and "expression vector” refer to nucleic acid constructs generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a target cell.
  • the recombinant expression cassette can be incorporated into a plasm id, chromosome, mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment.
  • the recombinant expression cassette can be incorporated into a plasm id, chromosome, mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment.
  • the terms "expression cassette” and "expression vector” refer to nucleic acid constructs generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a target cell.
  • the recombinant expression cassette can be incorporated into a plasm id, chromosome, mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment
  • 20 recombinant expression cassette portion of an expression vector includes, among other sequences, a nucleic acid sequence to be transcribed and a promoter.
  • expression vectors have the ability to incorporate and express heterologous DNA fragments in a host cell.
  • Many prokaryotic and eukaryotic expression vectors are commercially available. Selection of appropriate expression vectors is within
  • DNA construct and transforming DNA are used interchangeably to refer to DNA used to introduce sequences into a host cell or organism.
  • the DNA may be generated in vitro by PCR or any other suitable technique(s) known to those in the art.
  • the DNA construct comprises a sequence of interest (e.g., as an incoming sequence).
  • the sequence is operably linked to additional elements such as control elements (e.g., promoters, etc.).
  • the DNA construct may further comprise a selectable marker. It may
  • the transforming DNA comprises other non-homologous sequences, added 24
  • the ends of the incoming sequence are closed such that the transforming DNA forms a closed circle.
  • the transforming sequences may be wild-type, mutant or modified.
  • the DNA construct comprises sequences homologous to the host cell chromosome. In 5 other embodiments, the DNA construct comprises non-homologous sequences.
  • DNA construct is assembled in vitro it may be used to: 1 ) insert heterologous sequences into a desired target sequence of a host cell, and/or 2) mutagenize a region of the host cell chromosome (i.e., replace an endogenous sequence with a heterologous sequence), 3) delete target genes; and/or introduce a replicating plasmid into the host.
  • heterologous DNA sequence refers to a DNA sequence that does not naturally occur in a host cell.
  • a heterologous DNA sequence is a chimeric DNA sequence that is comprised of parts of different genes, including regulatory elements.
  • heterologous protein refers to a protein or polypeptide
  • homologous protein refers to a protein or polypeptide native or naturally occurring in a cell.
  • YmaH protein is interchangeably used with “Hfq protein” and refers to a
  • YmaH protein herein refers to a wild-type YmaH protein and variants thereof, including orthologs.
  • vector refers to a polynucleotide designed to introduce nucleic acids into one or more host cells.
  • vectors are preferred embodiments, vectors
  • the term is intended to encompass, but is not limited to cloning vectors, expression vectors, shuttle vectors, plasmids, phage particles, cassettes, and the like.
  • an "expression vector” as used herein refers to a DNA construct comprising a protein-coding region that is operably linked to a suitable control sequence capable of
  • control sequences include a promoter to effect transcription, an optional operator sequence to control transcription to produce mRNA, a sequence encoding suitable ribosome binding sites on the mRNA, and enhancers and sequences which control termination of transcription and translation.
  • promoter refers to a regulatory sequence that initiates transcription of a downstream nucleic acid. 25
  • operably linked refers to an arrangement of elements that allows them to be functionally related.
  • a promoter is operably linked to a coding sequence if it controls the transcription of the sequence.
  • derived encompasses the terms “originated from,” 5 “obtained,” or “obtainable from,” and “isolated from”.
  • non-pathogenic organism is an organism that is not pathogenic to humans and/or other animals.
  • recovered refers to a protein, cell, nucleic acid or amino acid that is removed from at least one component with
  • the term "introduced” refers to any method suitable for transferring the nucleic acid sequence into the cell. Such methods for introduction include but are not limited to protoplast fusion, transfection, transformation, conjugation, and transduction (See e.g., Ferrari et al.,
  • the terms “transformed” and “stably transformed” refers to a cell that has a non-native (heterologous) polynucleotide sequence integrated into its genome or as an episomal plasmid that is maintained for at least two generations.
  • expression refers to the process by which a polypeptide is produced based on the nucleic acid sequence of a gene.
  • the process includes both transcription and translation.
  • selectable marker-encoding nucleotide sequence refers to a nucleotide sequence, which is capable of expression in the host cells and where
  • 25 expression of the selectable marker confers to cells containing the expressed gene the ability to grow in the presence of a corresponding selective agent or lack of an essential nutrient.
  • selectable marker and “selective marker” refer to a nucleic acid (e.g., a gene) capable of expression in host cell, which allows for ease of
  • selectable markers include but are not limited to antimicrobials.
  • selectable marker refers to genes that provide an indication that a host cell has taken up an incoming DNA of interest or some other reaction has occurred.
  • selectable markers are genes that confer antimicrobial resistance or a metabolic advantage on the host cell to allow cells containing
  • a "residing selectable marker” is one that is located 26
  • the marker is an antimicrobial resistant marker (e.g., amp R ; phleo R ; 5 spec R ; kan R ; ery R ; tet R ; cmp R ; and neo R (See e.g., Guerot-Fleury, Gene, 167:335-337,
  • markers useful in accordance with the invention include, but are not limited to auxotrophic markers, such as tryptophan; and detection markers, such as ⁇ - galactosidase.
  • culturing refers to growing a population of microbial cells under suitable conditions in a liquid or solid medium. In some embodiments, culturing refers to fermentative recombinant production of an exogenous protein of interest or other desired end products (typically in a vessel or reactor). [0112] As used herein, the term "production" when used in reference to a protein of
  • interest encompasses the processes of transcription, and translation, and when needed, the processes of secretion and maturation, which creates the active from of the protein.
  • proteins that aer secreted into the extracellular medium e.g., proteases
  • the level of protein production is assessed as the amount of active protein secreted into the extracellular medium.
  • Bacillus sp refers to all of the species within the genus
  • Bacillus which are Gram-positive bacteria classified as members of the Family Bacillaceae, Order Bacillales, Class Bacilli.
  • the genus “Bacillus” includes all species within the genus “Bacillus,” as known to those of skill in the art, including but not limited to Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans,
  • Bacillus clausii Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus,
  • Bacillus licheniformis Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thu ⁇ ngiensis. It is recognized that the genus Bacillus continues to undergo taxonomical reorganization. Thus, it is intended that the genus include species that have been reclassified, including but not limited to such organisms as
  • stearothermophilus which is now named "Geobacillus stearothermophilus.”
  • the production of resistant endospores in the presence of oxygen is considered the defining feature of the genus Bacillus, although this characteristic also applies to the recently named Alicyclobacillus, Amphibacillus, Aneurinibacillus, Anoxybacillus, Brevibacillus, Filobacillus, Gracilibacillus, Halobacillus, Paenibacillus, Salibacillus, Thermobacillus,
  • Phr peptides contain an amino-terminal signal peptide and are exported as pro-peptides, most likely via the Sec
  • Phr pentapeptides After re-import by cells in the culture via the oligopeptide permease (Opp) system, Phr peptides specifically inhibit the activity of their cognate Rap phosphatase (Solomon, Genes and Development 1996 10, 2014-2024;
  • Phr peptides act as quorum 28
  • Rap protein and the Phr peptide that inhibits the Rap protein are encoded on a single operon. There are eight rap operons transcribed with their cognate phr genes, and three other rap coding genes in the B. subtilis genome (Kunst, Nature 1997 390, 249-256). The 5 rap/phr signaling systems of Bacillus subtilis are reviewed in Pottathil (Front Biosci. 2003
  • the present invention provides modified Bacillus sp.host cells that are genetically manipulated to have an enhanced capacity to produce proteins of interest.
  • the present invention relates to modified Bacillus sp. cells that contain a genome
  • the modified Bacillus sp. cells contain a genome comprising at least one rap operon that comprises an inactivated phr gene and an inactivated rap gene. Inactivation of the phr and/or rap gene enhances the production of a protein of interest by the modified Bacillus sp. cell when compared to the production of the same protein by the modified Bacillus sp. cell.
  • the modified Bacillus sp. cell comprises at least one inactivated phr and/or rap gene and a polynucleotide that encodes a protein of interest.
  • the polynucleotide that encodes the protein of interest is a wild-type polynucleotide.
  • the polynucleotide that encodes the protein of interest is a recombinant polynucleotide.
  • a Bacillus sp. rap operon modified in the subject cell may have at least about 70% , at least about 80%, at least about 90%, at least about 95%, at least about 97% or at least about 98% sequence identity to the
  • a rap operon sequence deposited in NCBI's Genbank database 25 sequence of a rap operon sequence deposited in NCBI's Genbank database; b) may hybridize under stringent conditions to a rap operon sequence deposited in NCBI's Genbank database; or c) may encode a polypeptide that has at least about 70% sequence identity (e.g., at least about 80%, at least about 90%, at least about 93%, at least about 95%, at least about 97% or at least about 98% sequence identity) to a Rap or
  • Exemplary phr protein and nucleotide sequences deposited in NCBI's Genbank database include those annotated in Genbank accession no. NC_000964.2; GID: 50812173 ( ⁇ . subtilis), Genbank accession no. NC_009848.1 ; GID: 157690798 (Bacillus pumilus), Genbank accession no. NC_006270.3; GID: 1631 19169 (Bacillus licheniformis) and Genbank accession no.
  • Rap proteins may be identified as containing a so-called tetratricopeptide repeat domain, a pfam domain 29
  • Genbank accessions are incorporated by reference in their entirety, including the nucleic acid and protein sequences therein and the annotation of those sequences, as of 5 the earliest filing date of this patent application.
  • the well-known Bacillus subtilis strain 168 finds use in the present invention. Indeed, the genome of this strain has been well-characterized (See, Kunststoff et al., Nature 390:249-256 [1997]; and Henner et al., Microbiol. Rev., 44:57-82 [1980]). The genome is comprised of one 4215 kb chromosome. While the coordinates
  • a modified Bacillus sp. cell comprises a single inactivated phr gene (e.g., a rapA operon containing an inactive phrA gene, a rapC operon containing an inactive phrC gene; a rapE operon containing an inactive phrE gene, a rapF operon
  • a single inactivated phr gene e.g., a rapA operon containing an inactive phrA gene, a rapC operon containing an inactive phrC gene; a rapE operon containing an inactive phrE gene, a rapF operon
  • the modified Bacillus sp. cell comprises an inactivated phr A gene (e.g., a rapA operon containing an inactive phrA gene).
  • inactivation results from the deletion of the entire endogenous DNA sequence that
  • the entire endogenous DNA sequence of the Bacillus subtilis phrA gene is deleted using the inactivating DNA deletion construct of SEQ ID NO:17.
  • Bacillus subtilis 168 the DNA sequence that encodes the phrA protein MKSKWMSGLL LVAVGFSFTQ VMVHAGETAN TEGKTFHIAA RNQT; SEQ ID NO:42 (Swiss-Prot:Q00829) is
  • inactivation of the phrA gene is by insertion of a selectable marker that interrupts the phrA gene.
  • the modified Bacillus sp. cell comprises an inactivated phrE gene (e.g., a rapE operon containing an inactive phrE gene).
  • inactivation results from the deletion of the entire endogenous DNA sequence that
  • the entire endogenous DNA sequence of the Bacillus 30 is identical to the entire endogenous DNA sequence of the Bacillus 30.
  • subtilis phrE gene is deleted using the inactivating DNA deletion construct of SEQ ID NO:18.
  • Bacillus subtilis 168 the DNA sequence that encodes the phrE protein MKSKLFISLS AVLIGLAFFG SMYNGEMKEA SRNVTLAPTH EFLV; SEQ ID NO:44 (Swiss-Prot:032025) is
  • inactivation of the phrE gene results from the deletion of a fragment of the phrE gene that prevents the functional expression of the PhrE protein.
  • the phrE gene is located at about 2659557-2659691 bp of the B. subtilis 168 chromosome (Accession no.NC_000964).
  • inactivation of the phrE gene is by insertion of a selectable marker that interrupts the phrE gene.
  • the phrA and the phrE genes are deleted from the Bacillus subtilis chromosome using the phrA and the phrE deletion constructs set forth in SEQ ID NOS:17 and 18, respectively.
  • the modified Bacillus sp. cell comprises at least two inactivated phr genes (e.g., two rap operons each containing an inactivated phr gene), at least three inactivated phr genes (e.g., three rap operons each containing an inactivated phr gene) at least four inactivated phr genes (e.g., four rap operons each containing an inactivated phr gene), at least five inactivated phrgeues (e.g., five rap operons each
  • a subject host cell may contain both a) a rapA operon
  • inactivation results from the deletion of the entire endogenous DNA sequences that encode the PhrA and the PhrE proteins, respectively.
  • inactivation of the phrA and phrE gene results from the deletion of a fragment of the phrA and the phrE gene that prevents the functional expression of the PhrA and the PhrE
  • a segment of the phrA gene is deleted, and a segment of the phrE gene is deleted from the chromosome.
  • the inactivation of the phrA and the phrE genes results from the deletion of the entire endogenous DNA sequence that encodes the PhrA and the deletion of a DNA sequence that encodes a fragment of the PhrE protein.
  • 35 and the phrE genes results from the deletion of the entire endogenous DNA sequence that encodes the PhrE and the deletion of a DNA sequence that encodes a fragment of 31
  • Fragments of phr genes include a range of about 1 % to about 99% of the indigenous chromosomal region encoding the phrA and/or phrE proteins. In other embodiments, fragments include a range of about 5% to 95% of the indigenous chromosomal region. In yet additional embodiments, fragments comprise 5 at least about 99%, about 98%, about 97%, about 96%, about 95%, about 94%, about
  • inactivation of the phrA and/or phrE genes is achieved by
  • an inactivating DNA construct comprising a selectable marker flanked on each side by a homology box is used.
  • the homology box comprises nucleotide sequences homologous to nucleic acids flanking regions of the chromosomal phr gene.
  • the DNA construct aligns with the homologous sequences of the Bacillus host
  • the inactivating DNA construct is assembled in vitro, followed by direct cloning of the construct into a competent Bacillus host, such that the DNA construct becomes integrated into the Bacillus chromosome.
  • PCR fusion and/or ligation can be employed to assemble a DNA construct in vitro.
  • DNA construct is a non-plasmid construct, while in other embodiments it is incorporated into a vector (e.g., a plasm id).
  • the inactivating DNA construct comprises a selectable marker flanked on the 5' and 3' ends with a fragment of the gene sequence.
  • the DNA construct comprising the selectable marker and gene, gene
  • the inactivating DNA construct comprises the selectable marker located in the promoter region of the gene. In other embodiments, the inactivating DNA construct comprises the selectable marker located 3' to the promoter region of gene. In yet other
  • the inactivating DNA construct comprises the selectable marker located in the coding region of the gene.
  • the inactivating DNA construct comprises a selectable marker flanked by a homology box on both ends.
  • the inactivating DNA construct includes a sequence that interrupts the transcription and/or translation of the coding sequence.
  • DNA construct includes restriction sites engineered at the upstream and downstream ends of the construct.
  • inactivation of the phrA and/or phrE gene is by insertion of a selectable marker that interrupts the phrA and/or phrE gene in a single crossover event.
  • the selectable marker is located within the gene coding sequence 5 or on a part of the plasmid separate from the gene. The vector is integrated into the
  • Bacillus chromosome Bacillus chromosome, and the gene is inactivated by the insertion of the vector in the coding sequence.
  • a modified Bacillus sp. cell comprises inactivation of one or more phr genes that results preferably in stable and non-reverting inactivation.
  • Methods of mutating genes are well known in the art and include but are not limited to site-directed mutation, generation of random mutations, and gapped-duplex approaches (See e.g., U.S. Pat. No. 4,760,025; Moring et ai, Biotech.
  • the inactivating DNA construct is incorporated into a vector or used without the presence of plasmid DNA, it is used to transform microorganisms. It is contemplated that any suitable method for transformation will find use with the present invention. In some embodiments, at least one copy of the inactivating DNA construct is
  • one or more inactivating DNA constructs of the invention are used to transform host cells.
  • one inactivating DNA construct may be used to inactivate a phrA gene and another construct may be used to inactivate a phrE gene.
  • additional combinations are contemplated and provided by the present invention.
  • the phrA and/or phrE gene is deleted in a precursor recombinant Bacillus subtilis strain in which one or more genes encoding an endogenous protease have been deleted.
  • the Bacillus sp. host cell comprises two or more inactivated protease genes.
  • the Bacillus host cell contains two inactivated protease genes (See e.g., U.S. Patent 5,387,521 ) while in other
  • the Bacillus host cell contains 5 inactivated protease genes: nprE, aprE, epr, ispA, and bpr genes (See e.g., US20050202535). Since the sequence of the entire B. subtilis genome is publicly available and annotated (See e.g., Moszer, FEBS Lett., 430:28-36 [1998]), the proteases of B. subtilis have been identified and reviewed in detail (See e.g., He et ai., Res. Microbiol., 142:797-803 [1991 ]). In addition, gene disruption
  • the modified Bacillus sp. host cell comprises an inactivated phr gene and an inactivated rap gene.
  • a cell comprises a single rap operon that contains an inactivated phr gene and an inactivated rap gene (e.g., a rapA operon containing an inactive phrA gene and an inactivated rapA gene, a rapC operon containing an inactive phrC gene and an inactivated rap gene (e.g., a rapA operon containing an inactive phrA gene and an inactivated rapA gene, a rapC operon containing an inactive phrC gene and an inactivated rap gene (e.g., a rapA operon containing an inactive phrA gene and an inactivated rapA gene, a rapC operon containing an inactive phrC gene and an inactivated rap gene (e.g., a rapA operon containing an inactive phrA gene and an inactivated rapA gene, a rapC operon containing an
  • the modified Bacillus sp. cell comprises at least two rap operons each
  • inactivation results from the deletion of the entire endogenous DNA sequences that encode the Phr and the Rap proteins.
  • the entire endogenous DNA sequence of the Bacillus subtilis phrA gene is deleted using the inactivating DNA deletion construct of SEQ ID NO: 1
  • SEQ ID NO:42 (Swiss-Prot:Q00829) is:
  • inactivation of the phrA gene results from the deletion of a fragment of the phrA gene that prevents the functional expression of the PhrA protein.
  • the phrA gene is located at about 1316305-1316439 bp of the B. subtilis 168 chromosome (Accession no.NC_000964). According to one embodiment, inactivation of the phrA gene
  • inactivation of the rapA gene results from the inactivation of the rapA gene by introducing a selectable marker comprising a terminator sequence in the rapA gene thereby preventing the functional expression of the rapA and phrA protein
  • inactivation of the rapA gene is by insertion of a selectable marker that
  • the modified Bacillus sp. cell comprising the rap operon
  • the inactive phr gene may contain an active or inactive rap gene. If the rap gene is active, it may have a wild-type sequence (e.g., may be endogenous to the cell) or may be modified such that it is functionally equivalent to the wild type protein of the same species.
  • the modified Bacillus sp. host cell comprises an inactivated
  • the modified Bacillus sp. cell comprises a single rap operon that contains an inactivated rap gene (e.g., a rapA operon containing an inactive an inactivated rapA gene, a rapB operon containing an inactive an inactivated rapB gene, a rapC operon containing an inactivated rapC gene, a rapD operon containing an inactive an inactivated rapD gene, a rapE operon containing an inactivated rapE gene, a rapF
  • an inactivated rap gene e.g., a rapA operon containing an inactive an inactivated rapA gene, a rapB operon containing an inactive an inactivated rapB gene, a rapC operon containing an inactivated rapC gene, a rapD operon containing an inactive an inactivated rapD gene, a rapE operon containing an inactivated rapE gene, a
  • the modified Bacillus sp. cell comprises at least two rap operons each containing an inactivated rap gene. In some embodiments, inactivation results from the deletion of the entire endogenous DNA sequences that encode the Rap proteins. 5 [0148] The modified Bacillus sp. cell is derived from a precursor host cell of a Bacillus sp.
  • strain including Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis strains.
  • Bacillus alkalophilus Bacillus amyloliquefaciens
  • Bacillus brevis Bacillus circulans
  • Bacillus clausii Bacillus coagulans
  • Bacillus firmus Bacillus lautus
  • Bacillus lentus Bacillus licheniformis
  • Bacillus megaterium Bacillus pumilus
  • Bacillus stearothermophilus Bacillus subtilis
  • Bacillus subtilis or Bacillus thuringiensis strains.
  • the modified Bacillus sp. cell is derived from an alkalophilic Bacillus sp. cell. Numerous alkalophilic Bacillus sp. are known (See e.g., U.S. Pat. No. 5,217,878; and Aunstrup et al., Proc IV IFS: Ferment. Technol. Today, 299-305 [1972]).
  • the Bacillus sp. precursor host cell is an industrial Bacillus sp. host cell. Examples of industrial Bacillus sp. host cells include, but are not limited to
  • Bacillus sp. host cell is selected from the group consisting of Bacillus lentus, Bacillus brevis, Bacillus stearothermophilus, Bacillus alkalophilus, Bacillus coagulans, Bacillus circulans, Bacillus pumilus, Bacillus thuringiensis, Bacillus clausii, and Bacillus megaterium, as well as other organisms within
  • Bacillus subtilis is used.
  • U.S. Pat. Nos. 5,264,366 and 4,760,025 describe various Bacillus host strains that find use in the present invention, although other suitable strains (e.g., industrial strains) are contemplated for use in the present invention.
  • An industrial strain may be a non-recombinant strain of a Bacillus sp., a mutant of a naturally occurring strain, or a recombinant strain.
  • the host strain is a recombinant host strain wherein a recombinant polynucleotide encoding a polypeptide of interest has been introduced into the host.
  • the polypeptide of interest is an enzyme (e.g., a protease).
  • a further preferred host strain is a Bacillus
  • subtilis host strain and in particular a recombinant Bacillus subtilis host strain.
  • Bacillus subtilis strains are known, including but not limited to 1 A6 (ATCC 39085), 168 (1 A01 ), SB19, W23, Ts85, B637, PB1753 through PB1758, PB3360, JH642, 1 A243 (ATCC 39,087), ATCC 21332, ATCC 6051 , MM 13, DE100 (ATCC 39,094), GX4931 , PBT 1 10, and PEP 21 1 strain ⁇ See e.g., Hoch et al, Genetics, 73:215-228 [1973]; U.S. Pat.
  • B. subtilis as an expression host is further described by Palva et al. and others (See, Palva et al., Gene 36
  • Industrial protease producing Bacillus sp. hostceWs provide particularly preferred host cells. In some preferred embodiments, use of these host cells in the present 5 invention enhances protease production.
  • Two general types of proteases are typically secreted by Bacillus sp., namely neutral (or "metalloproteases") and alkaline (or "serine”) proteases.
  • Serine proteases are enzymes which catalyze the hydrolysis of peptide bonds in which there is an essential serine residue at the active site. Serine proteases have molecular weights in the 25,000 to 30,000 range (See, Priest, Bacteriol. Rev., 41 :71 1 -753
  • Subtilisin is a preferred serine protease that is produced by the modified Bacillus sp. host cells of the present invention.
  • Bacillus subtilisins A wide variety of Bacillus subtilisins have been identified and sequenced, for example, GG36, subtilisin 168, subtilisin BPN', subtilisin Carlsberg, subtilisin DY, subtilisin 147 and subtilisin 309 (See e.g., EP 414279 B; WO 89/06279; and Stahl et al., J. Bacteriol., 159:81 1 -818 [1984]).
  • GG36 subtilisin 168
  • subtilisin BPN' subtilisin Carlsberg
  • subtilisin DY subtilisin 147
  • subtilisin 309 See e.g., EP 414279 B; WO 89/06279; and Stahl et al., J. Bacteriol., 159:81 1 -818 [1984]).
  • the Bacillus host strains produce mutant (e.g., variant) proteases.
  • a preferred Bacillus sp. host is a Bacillus sp. that includes a mutation or deletion in at least one of the following genes, degU, degS, degR and degQ.
  • the mutation is in a degU geue, and more preferably the mutation is degU(Hy)32 (See e.g., Msadek et al., J. Bacteriol., 172:824-834 [1990]; and Olmos et al., MoI. Gen. Genet., 253:562-567 [1997]).
  • the host cell is a Bacillus subtilis host cell that carries a degU32(Hy) mutation.
  • the Bacillus sp. host cell comprises a mutation or deletion in scoC4, (See e.g., Caldwell et al.,
  • oppA or other genes of the opp operon See e.g., Perego et al., MoI. Microbiol., 5:173-185 [1991 ]). Indeed, it is contemplated that any mutation in the opp operon that causes the same phenotype as a mutation in the oppA gene will find use in some embodiments of the modified Bacillus sp. cell of the present invention. In some
  • these mutations occur alone, while in other embodiments, combinations of mutations are present.
  • a Bacillus sp. host cell that already includes a mutation to one or more of the above-mentioned genes.
  • a modified Bacillus sp. cell of the invention is further engineered to include mutation of one or more of the above-mentioned genes. 5 Proteins of Interest
  • the invention provides modified Bacillus sp. cells that are used to produce proteins of interest at a level that is greater than that produced by the unmodified precursor host cells.
  • proteins of interest are desirable proteins that have commercial significance.
  • the protein of interest may be either homologous or
  • the protein of interest is a secreted polypeptide, particularly an enzyme, including but not limited to amylolytic enzymes, proteolytic enzymes, cellulytic enzymes, oxidoreductase enzymes and plant wall degrading enzymes.
  • these enzyme include, but are not limited to amylases, proteases, xylanases, lipases, laccases, phenol oxidases, oxidases, cutinases,
  • the expressed polypeptide is a hormone, cytokine, growth factor, receptor, vaccine, antibody, or the like. While it is not intended that the present invention be limited to any particular protein/polypeptide, in some most
  • the expressed protein of interest is a protease.
  • the host cell contains a recombinant expression cassette that comprises a polynucleotide sequence encoding a protein of interest (i.e., an expression cassette for production of a protein that is not native to the host cell).
  • the host cell comprises a recombinant nucleic acid
  • the recombinant nucleic acid is integrated into the genome of the host cell, while in other embodiments, the recombinant nucleic acid is present in a vector that replicates
  • the polynucleotide encoding the protein of interest is codon optimized for expression of the protein in the Bacillus sp. host cell. While any promoter may be employed in a subject expression cassette, promoters that are regulated by the rap/phr systems (e.g., the aprE and nprE promoters) may be employed in some embodiments.
  • the protein of interest may be, for example, an enzyme (e.g., a so-called "industrial enzyme"), or a protein having therapeutic activity such an antibody.
  • the protein of interest is a subtilisin, where the term "subtilisin” refers to a serine endopeptidase of the S8 family of peptidases.
  • Subtilisin protein has an activity described as EC 3.4.21 .62 (previously EC 3.4.4.16), according to IUMBM enzyme nomenclature.
  • the activity of exemplary subtilisin proteins is generally 5 described in Philipp et al, (MoI. Cell. Biochem. 1983 51 : 5-32), Siezen (Protein ScL, 1997
  • a subtilisin has an amino acid sequence that is found in a wild-type genome (i.e., the subtilisin is a naturally-occurring subtilisin), while in other
  • the subtilisin is a variant of a naturally-occurring subtilisin.
  • the variant subtilisin comprises an amino acid sequence that is at least about 80%, at least about 90%, at least about 95% or at least about 98% identical to a subtilisin encoded by a wild-type genome.
  • Exemplary subtilisins include, but are not limited to: ALCANASE® (Novozymes), FNATM (Genencor), SAVINASE® (Novozymes)
  • subtilisin includes, but is not limited to subtilisin BPN', subtilisin Carlsberg, subtilisin DY, subtilisin 147, or subtilisin 309 (See e.g., WO89/06279;
  • subtilisins and other proteases that find use in the present invention include but are not limited to those described in WO 99/20770; WO 99/20726; WO 99/20769; WO 89/06279; RE 34,606; U.S. Patent No. 4,914,031 ; U.S. Patent No. 4,980,288; U.S. Patent No. 5,208,158; U.S. Patent No. 5,310,675; U.S. Patent No. 5,336,61 1 ; U.S. Patent No. 5,399,283; U.S. Patent No.
  • the expression of the protein of interest in a host cell is
  • aprE promoter of the aprE gene from which the B. subtilis subtilisin is naturally transcribed The aprE gene is transcribed by sigma A ( ⁇ A ) factor and its expression is highly controlled by several regulators, such as: DegU/DegS, AbrB, Hpr and SinR (VaIIe and Ferrari (1989) In: Smith I, Slepecky RA, Setlow P (eds) Regulation of Procaryotic Development. American Society for Microbiology. Washington, DC pp 131 -
  • the host cell comprises an aprE promoter that is the wild-type aprE promoter tgggtctactaaaatattattccatctattacaataaattcacaga (SEQ ID NO:39; U.S. Patent Application Publication No. 20030148461 ). 5 [0157] In other embodiments, the expression of a protein of interest by a host cell is driven by mutant of the B. subtilis aprE promoters.
  • the invention provides for a Bacillus host cell that contains a mutant aprE promoter operably linked to a polynucleotide sequence that encodes a protein of interest.
  • the invention encompasses host cells that express a protein of interest from a mutant aprE promoter.
  • mutant aprE promoter having the sequence:
  • tgggtc ttgaca aatattattccatctat tacaat aaattcacaga (SEQ ID NO:40), [0159] which is described in U.S. Patent Application Publication No. 20030148461 .
  • Any one of the proteins of interest recited herein e.g., Bacillus subtilisins
  • Bacillus subtilisins can be transcribed from an aprE promoter.
  • the invention provides for a modified
  • Bacillus host cell that is capable of expressing a protein of interest from an aprE promoter.
  • the modified host cell is a modified B. subtilis host cell capable of expressing a protease driven by an aprE promoter.
  • the aprE promoter includes the aprE promoter regulatory elements and/or the aprE transcriptional leader, while in other embodiments, the aprE promoter does not include the aprE
  • the invention also encompasses compositions and methods for expressing a protein of interest by a host cell, wherein the expression is driven by any promoter suitable for driving the transcription of the gene of interest as long as the promoter comprises the transcriptional leader sequence of the aprE gene.
  • any promoter suitable for driving the transcription of the gene of interest as long as the promoter comprises the transcriptional leader sequence of the aprE gene.
  • suitable promoters and terminators for use in Bacillus host cells include: the promoters and terminators of npr (neutral protease; i.e., NprE promoter), amy ( ⁇ - amylase) and ⁇ -lactamase genes, as well as the B. subtilis levansucrase gene (sacB), B. licheniformis alpha-amylase gene (amyL), B. stearothermophilus maltogenic amylase gene (amyM), B. amyloliquefaciens alpha-amylase gene (amyQ), B. licheniformis
  • penicillinase gene penP
  • B. subtilis xylA and xylB genes the promoters and terminators described in WO 93/10249, WO 98/07846, and WO 99/43835.
  • the modified host cell may produce a protein of interest that is a recombinant carbohydrase, such as a liquefying and saccharifying ⁇ -amylase, an alkaline ⁇ -amylase, a ⁇ -amylase, a cellulase; a dextranase, an ⁇ -glucosidase, an ⁇ -
  • carbohydrase such as a liquefying and saccharifying ⁇ -amylase, an alkaline ⁇ -amylase, a ⁇ -amylase, a cellulase; a dextranase, an ⁇ -glucosidase, an ⁇ -
  • protease an alkali protease, bromelain, ficin, a neutral protease, papain, pepsin, a peptidase, rennet, rennin, chymosin, thermolysin, an aspartic proteinase, or trypsin; a lipase or esterase, such as a triglyceridase, a phospholipase, a pregastric esterase, a phosphatase, a phytase, an amidase, an iminoacylase, a glutaminase, a lysozyme, or a 5 penicillin acylase; an isomerase such as glucose isomerase; an oxidoreductases (e.g., an amino acid oxidase), a catalase, a chloroperoxidase, a glucose oxidase, a hydroxysteroid dehydrogenase or a peroxid
  • the protein may be a therapeutic protein.
  • suitable target therapeutic proteins which may be produced using a subject cell include:
  • erythropoietin cytokines such as interferon- ⁇ , interferon- ⁇ , interferon ⁇ , interferon-o, and granulocyte-CSF, GM-CSF, coagulation factors such as factor VIII, factor IX, and human protein C, antithrombin III, thrombin, soluble IgE receptor ⁇ -chain, IgG, IgG fragments, IgG fusions, IgM, IgA, interleukins, urokinase, chymase, and urea trypsin resume inhibitor, IGF-binding protein, epidermal growth factor, growth hormone- releasing factor, annexin V
  • fusion protein 20 fusion protein, angiostatin, vascular endothelial growth factor-2, myeloid progenitor inhibitory factor-1 , osteoprotegerin, ⁇ -1 -antitrypsin, ⁇ -feto proteins, DNase II, kringle 3 of human plasminogen, glucocerebrosidase, TNF binding protein 1 , follicle stimulating hormone, cytotoxic T lymphocyte associated antigen 4-lg, transmembrane activator and calcium modulator and cyclophilin ligand, soluble TNF receptor Fc fusion, glucagon like
  • the cell may be engineered so that the protein produced by the cell may be secreted from the cell into culture media.
  • the cell may further contain a recombinant nucleic acid encoding a fusion polypeptide containing a signal sequence, a protease cleavage site and the protein.
  • the signal may be any suitable signal sequence.
  • 30 sequence may be one that is naturally associated with the polypeptide to be expressed.
  • the signal sequence may be any sequence of amino acids that is capable of directing the fusion protein into the secretory pathway of the Bacillus host cell.
  • signal sequences that may be employed include the signal sequences of proteins that are secreted from wild-type Bacillus cells. Such signal sequences include the signal
  • exemplary signal sequences include, but are not limited to, the signal sequences 41
  • Bacillus species including, but not limited to Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus 5 lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megate ⁇ um, Bacillus pumilus,
  • the signal sequence is encoded by the aprE gene of B. subtilis (as described in Appl. Microbiol. Biotechnol. 2003 62:369-73). Further signal peptides are described by Simonen and Palva (Microbiological Reviews 1993 57: 109-137), and other
  • the invention also provides methods for producing a protein of interest in a modified Bacillus sp. host cell, which comprises at least one inactivated phr gene (e.g., an inactivated phrA and/or a phrE gene), or an inactivated phr and an inactivated rap gene by culturing a modified cell that is capable of producing a protein of interest and
  • a modified Bacillus sp. host cell which comprises at least one inactivated phr gene (e.g., an inactivated phrA and/or a phrE gene), or an inactivated phr and an inactivated rap gene by culturing a modified cell that is capable of producing a protein of interest and
  • the methods provide for the production of any one protein of interest described above.
  • the protein of interest produced by the method of the invention is a protease (e.g., a subtilisin).
  • Production of a protein of interest by a modified Bacillus sp. cell is greater than that obtained from a corresponding unmodified precursor host cell.
  • the improved level of protease production by a modified Bacillus sp. cell is further enhanced in the modified cell by overexpressing ymati, as described below.
  • Modified Bacillus sp. Host Cells That Overexpress YmaH [0166] In the embodiments described above, the modified Bacillus sp. cells, which
  • the enhanced level of production of a protein of interest by the modified Bacillus sp. cells is further increased by altering the modified cell to overexpress the RNA-binding protein
  • the invention provides for a modified Bacillus sp. cell that comprises at least one inactivated phr gene (e.g., an inactivated phrA and/or phrE gene), a polynucleotide that encodes a protein of interest (e.g., a protease), and a heterologous polynucleotide that encodes a YmaH protein.
  • the modified Bacillus sp. cell comprises at least one inactivated phr gene (e.g. an inactivated
  • rap gene 35 phrA and/or phrE gene), and/or an inactivated rap gene, a polynucleotide that encodes a protein of interest (e.g., a protease), and a heterologous polynucleotide that encodes a 42
  • the modified Bacillus sp. cell comprises a polynucleotide expression construct comprising a YmaH promoter that is operably linked to a polynucleotide sequence that encodes a YmaH protein.
  • the Bacillus subtilis YmaH also 5 known as HFQ BACSU is an RNA-binding protein, is a member of the Hfq family of RNA- binding proteins (Sauter et ai, Nucleic Acid Res 31 :4091 -4098, [2003]).
  • the YmaH protein is encoded in Bacillus subtilis by the ymaH gene, which is an ortholog of the hfq gene of E. coli. (Silvaggi et al ., J Bacteriol. 187(19): 6641-6650, [2005]).
  • YmaH is an abundant and ubiquitous RNA-binding protein that functions as a pleiotrophic regulator of
  • RNA metabolism in prokaryotes is required for stabilization of some transcripts and degradation of others.
  • YmaH binds preferentially to unstructured A/U-rich RNA sequences and is similar to the eukaryotic Sm proteins in both sequence and structure.
  • YmaH is also known to bind small RNA molecules called ri bo regulators that modulate the stability or translation efficiency of RNA transcripts.
  • the naturally-occurring YmaH protein from Bacillus subtilis is a 73 amino acid protein:
  • MKPINIQDQFLNQIRKENTYVTVFLLNGFQLRGQVKGFDNFTVLLESEGKQQLIYK HAISTFAPQKNVQLELE (Swiss-Prot:P3756; SEQ ID NO:45) [0170] that is encoded by a 219 (222 including the stop codon) base pair polynucleotide
  • the modified Bacillus sp. cell of the invention further comprises a heterologous polynucleotide sequence that encodes ymaH.
  • the ymaH protein is encoded by the naturally-occurring polynucleotide sequence found in the genome of the wild-type Bacillus subtilis strain 168 (SEQ ID NO: 1]
  • the modified Bacillus sp. cell of the invention comprises a heterologous polynucleotide sequence that encodes variants of the naturally occurring ymaH.
  • Variant YamH proteins include proteins derived from the wild-type protein by deletion (i.e., truncation), addition, or substitution of one or more amino acids at one or more sites in the native protein. Methods for such deletions, additions and substitutions
  • amino acid sequence variants of the polypeptide can be prepared by mutations in the cloned DNA sequence encoding the native protein of interest. Methods for mutagenesis and nucleotide sequence alterations are well known in the art (See e.g., Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488 492; Kunkel et al. (1987) Methods Enzymol. 154:367 382; U.S. Pat. No. 4,873,192; and the
  • the Bacillus sp. cell comprises a polynucleotide encoding a YmaH 5 protein comprising a nucleotide sequence having at least about 70% sequence identity, at least about 75% sequence identity, at least about 80% sequence identity, at least about 85% sequence identity, at least about 90% sequence identity, at least about 92% sequence identity, at least about 95% sequence identity, at least about 97% sequence identity, at least about 98% sequence identity, or at least about 99% sequence identity to
  • the modified Bacillus sp. cell comprises polynucleotide constructs that comprise ymaH coding sequences that are analogous to the ymaH coding sequence of Bacillus subtilis strain 168.
  • the genome of this strain which is contained in one 4215 kb genome, has been well-characterized (See, Kunststoff et ai, Nature 390:249-
  • the YmaH-encoding polynucleotide constructs encode a YmaH protein that shares at least about 65% amino acid sequence identity, at least about 70% amino acid sequence identity, at least about 75% amino acid sequence identity, at least about 80% amino acid sequence identity, at least about 85% amino acid sequence identity, at least about 90%
  • amino acid sequence identity with the amino acid sequence of the wild-type form of the YmaH protein and that has comparable or improved ability to enhance the production of a protein
  • the modified Bacillus sp. cell comprises YmaH-encoding polynucleotide constructs comprising polynucleotide sequences that are homologous, orthologous or paralogous to genes of the wild-type
  • the modified Bacillus sp. cell of the invention also encompasses polynucleotide constructs that comprise coding sequences encoding YmaH proteins that are related by being structurally and/or functionally similar.
  • these proteins are derived from a different genus and/or species, including differences between classes of organisms (e.g., a bacterial protein and a fungal protein). 44
  • these proteins are derived from a different genus and/or species.
  • related proteins are provided from the same species. Indeed, it is not intended that the present invention be limited to related proteins from any particular source(s).
  • the term "related proteins” encompasses tertiary 5 structural homologs and primary sequence homologs (e.g., the YmaH of the present invention).
  • the present invention encompasses such homologues including but not limited to such YmaH proteins as the YmaH of E.
  • HFQ ECOLI Shighella flexneri
  • HFQ SHIFL Salmonella typhimurium
  • HFQ SALTY Salmonella typhimurium
  • HFQ_YEREN Yersinia enterocolitica
  • HFQ_YERPE Erwinia carotovora
  • HFQ_HAEIN Haemophilus influenzae
  • HFQ_PASMU Pasteurella multocida
  • Vibrio cholerae HFQ VIBCH
  • Pseudomonas aeruginosa HFQ PSEAE
  • HFQ XANAC Pseudomonas aeruginosa
  • HFQ XANCP Xanthomonas campestris
  • GSQ XYLFA XyIeIIa fastidiosa
  • GSQ XYLFA Neisseria meningitidis
  • HFQ RALSO Neisseria meningitidis
  • HFQ RALSO Agrobacterium tumefaciens
  • HFQ_AZOCA Caulobacter crescentus
  • HFQ_CAUCR Caulobacter crescentus
  • Aquifex melitensis HFQ AQU AE
  • Thermotoga maritime HFQ THEMA
  • Clostridium acetobutylicum HFQ_CLOAB
  • Clostridium perfringens HFQ_CLOPE
  • Bacillus halodurans HFQ BACHD
  • variant proteins comprise variant YmaH proteins.
  • variant proteins differ from a parent protein and one another by a small number of amino acid residues. The number of differing amino acid residues may
  • related proteins and particularly variant proteins comprise at least about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 97%, about 98%, or about 99%
  • ymaH proteins that are capable of further enhancing the production of a protein of interest by a modified Bacillus sp. cell will find use.
  • Overexpression of ymaH in the modified Bacillus sp. cell of the invention can be achieved by various means including enhancing the transcription and/or translation of the 5 YmaH encoding polynucleotide.
  • overexpression of ymaH can be achieved by increasing the number of polynucleotide sequences that encode ymaH in a host cell, and/or by increasing the binding strength of a ymaH promoter to enhance the activity of the cognate RNA polymerase.
  • overexpression of ymaH can be achieved by enhancing the translational activity by
  • the modified Bacillus sp. cells of the invention comprise a
  • polynucleotide construct that comprises a polynucleotide sequence encoding ymaH operably linked to a ymaH promoter.
  • the transcription of ymaH may be naturally driven by two promoters: a SigA promoter that is present upstream of miaA coding region, and the SigH promoter that is immediately upstream of the ymaH coding region in the miaA operon of B. subtilis.
  • a ymaH promoter can be any promoter that drives the expression of
  • 20 yamH (e.g., a SigA and/or a SigH promoter), and may be any nucleic acid sequence which shows transcriptional activity in the host cell of choice and includes mutant, truncated and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
  • the promoter sequence may be native or foreign to the host cell.
  • the modified Bacillus sp. cells of the invention comprise a polynucleotide construct that comprises a polynucleotide sequence that encodes YmaH operably linked to a SigH promoter (e.g., SEQ ID NO:23, as shown below).
  • SEQ ID NO:23 also exemplifies a polynucleotide construct that comprises a YmaH coding sequence that is naturally contiguous with a SigH promoter:
  • the modified Bacillus sp. cells of the invention comprise a polynucleotide construct that comprises a polynucleotide sequence that encodes YmaH operably linked to a SigA promoter ⁇ e.g., SEQ ID NO:26 (SigA1 ) and SEQ ID NO:31 (SigA2 construct)).
  • SEQ ID NOs:26 and 31 exemplify embodiments wherein the ymaH 5 coding sequence is contiguous with a SigA promoter sequence to provide a chimeric polynucleotide construct.
  • chimeric polynucleotide constructs thus comprise a promoter sequence that in nature is not contiguous with the ymaH coding sequence.
  • SEQ ID NOS:26 and 31 exemplify chimeric constructs that comprise a SigA promoter that is operably linked to a polynucleotide
  • the Bacillus sp. cells of invention comprise a polynucleotide construct that comprise a polynucleotide sequence that encodes YmaH and a SigA and a SigH promoter (e.g., SEQ ID NO: 22, as shown below).
  • a polynucleotide construct that comprise a polynucleotide sequence that encodes YmaH and a SigA and a SigH promoter (e.g., SEQ ID NO: 22, as shown below).
  • Suitable promoters for directing the expression of the ymaH gene in are the SigA and the SigH promoters from the B. subtilis operon that encompasses the
  • the invention provides a polynucleotide sequence defining a SigA promoter (SEQ ID NO:47, as shown below).
  • SEQ ID NO:47 tcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaaggatgtttaccgat gcaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaagagagtgatacata (SEQ ID NO:47)
  • the invention provides a polynucleotide sequence defining a SigH promoter (SEQ ID NO:48, as shown below).
  • Streptomyces coelicolor agarase gene (dagA), the promoter of the Bacillus lentus alkaline protease gene (aprH), the promoter of the Bacillus licheniformis alkaline protease gene (subtilisin Carlsberg gene), the promoter of the Bacillus subtilis levansucrase gene (sacB), the promoter of the Bacillus subtilis alpha-amylase gene (amyE), the promoter of the
  • Bacillus licheniformis alpha-amylase gene (amyL), the promoter of the Bacillus stearothermophilus maltogenic amylase gene (amyM), and the promoter of the Bacillus amtyloliquefacietis alpha-amylase gene (amyQ).
  • promoters that can be used for expressing the ymaH gene include Sigma H promoters that are recognized by ⁇ H factors including spoOA, spoOF, spoVG and citG (See, Helmann, J. D. and C. P. Moran.
  • a consensus SigA and/or SigH promoter finds use in the present invention.
  • the construction of a consensus promoter may be accomplished by site-directed mutagenesis to create a promoter which conforms more perfectly to the 5 established consensus sequences for the "-10" and “-35" regions of the "sigma A-type" promoters for Bacillus subtilis (Voskuil et al., MoI Microbiol 17: 271 279 [1995]).
  • a consensus promoter is created by site-directed mutagenesis to create a promoter which conforms more perfectly to the established consensus sequences for the "-10" and “-35” regions of the vegetative "sigma H-type” promoters for Bacillus subtilis
  • the consensus promoter may be obtained from any promoter which can function in a Bacillus host cell.
  • the SigA promoter which encompasses SEQ ID NO:47 is defined by a polynucleotide sequence that is naturally present upstream of the miaA
  • SigH promoter which encompasses SEQ ID NO: 48, is defined by the polynucleotide sequence that is naturally present upstream of the yam/-/ coding region (SEQ ID NO:46, shown below).
  • the SigA/SigH constructs encompass promoter sequences 5 that have been mutated to increase the activity of the promoter when compared to the activity of the corresponding wild-type promoter resulting in the overexpression of the YmaH protein.
  • variants of the sequences that define the SigA and SigH promoters find use in the YmaH-expression constructs.
  • the promoter is the B. subtilis sigH promoter, while in other embodiments the promoter is the B. subtilis sigA promoter. In further embodiments, the sigH and the sigA promoters serve to effect the overexpression of YmaH protein.
  • the SigA/SigH polynucleotide constructs of the invention also comprise the requisite ribosome binding site to ensure optimal translation of the
  • the polynucleotide construct comprises the ribosome bind site (RBS) sequence of the miaA gene (aagagag; SEQ ID NO:50), while in other embodiments, polynucleotide construct comprises the RBS sequence of the ymati gene (ggagg; SEQ ID NO:51 ). In yet other embodiments, the polynucleotide construct comprises the ribosome binding site sequences of the miaA and the ymati genes.
  • RBS ribosome bind site sequence of the miaA gene
  • the invention provides constructs having the promoter and ribosome binding site sequences upstream of the ymati coding sequence.
  • the invention is not limited to the ribosome binding site sequences disclosed herein, as it also encompasses any suitable ribosome binding site sequences that have been mutated to increase the level of expression of the ymati gene. Methods for obtaining mutated ribosome binding
  • the invention provides methods for producing a protein of 50
  • the protein of interest produced by the method of the invention is a protease (e.g., a subtilisin).
  • the method of the invention comprises inactivating at least one phr gene by introducing an inactivating DNA construct into a Bacillus sp. host cell to
  • Precursor host cells include precursor host cells of Bacillus sp. strains as described above, including Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans,
  • the precursor host cell is a Bacillus subtilis host cell.
  • the precursor host cells are recombinant cells comprising a recombinant polynucleotide that encodes a polypeptide of interest, as described above.
  • the polypeptide of interest is an enzyme (e.g., a protease, such as a subtilisin).
  • a protease such as a subtilisin.
  • the method of inactivating at least one phr gene (e.g. phrA and/or phrE) in a precursor Bacillus sp. host cell generates a modified Bacillus sp. cell that produces a polypeptide of interest at a level that is greater than that achieved by the corresponding unmodified precursor host cell.
  • the method comprises inactivating a phrA gene by introducing into the precursor Bacillus sp. host cell an inactivating DNA construct that deletes the indigenous phrA gene.
  • the inactivating DNA construct of SEQ ID NO:17 is introduced to delete the indigenous phrA gene by homologous recombination.
  • the method comprises inactivating a phrE gene by introducing into the
  • Bacillus sp. host cell an inactivating DNA construct that deletes the indigenous phrE gene.
  • the inactivating DNA construct of SEQ ID NO:18 is introduced to delete the indigenous phrE gene by homologous recombination.
  • both the phrA and phrE genes are inactivated using the inactivating constructs of SEQ ID NOs:17 and 18 .
  • the method of the invention is similarly used to
  • 35 inactivate other phr genes including phrC, phrF, phrG, phrti, phrl, and phrK and/or the rap genes including rapB, rapC, rapD, rapE, rapF, rapG, rapH, rapl, rapJ and rapK. 51
  • inactivation of the phrA gene is by insertion of a selectable marker that interrupts the phrA gene.
  • inactivation of the phrA gene results from the inactivation of the rapA gene by introducing a selectable marker comprising a terminator sequence in the rapA gene thereby preventing the functional 5 expression of the rapA and phrA protein
  • inactivation of the rapA gene is by insertion of a selectable marker that interrupts the rapA gene.
  • the production of a protein of interest by a modified Bacillus sp. cell is further enhanced from the expression of one or more copies of a YmaH-encoding polynucleotide comprised in an expression construct that is present on a
  • any one of the YmaH-encoding polynucleotide constructs described above e.g., SigA; SigA1 , SigA2, SigA3 or SigH constructs, are used to transform the modified Bacillus sp. cells.
  • the YmaH-encoding polynucleotide that is present on a replicating plasmid is introduced into a precursor host cell prior to the precursor host cell being modified to
  • the invention provides for modified Bacillus sp. cell comprising a vector comprising an expression construct comprising a YmaH-encoding polynucleotide operably linked to a YmaH promoter that is incorporated into the vector.
  • overexpression of YmaH is achieved by introducing a SigH expression construct that
  • a YmaH-encoding polynucleotide operably linked to a SigH promoter e.g., the expression construct of SEQ ID NO:23.
  • overexpression of YmaH is achieved by introducing a SigA expression construct that comprises a YmaH-encoding polynucleotide operably linked to a SigA promoter.
  • SigA constructs include the SigA1 expression construct of SEQ ID NO:26, the SigA2 expression construct of SEQ
  • the vector is a multicopy/replicating plasmid vector which forms an extrachromosomal self-replicating genetic element that overexpresses YmaH in the modified cell.
  • the vector is a plasmid vector, which carries a selectable marker gene that allows for ease of selecting the host cells that contain the plasmid.
  • Vectors that replicate autonomously in a host cell include vectors that comprise an origin of replication, which enables the vector to replicate autonomously in the Bacillus cell. 52
  • bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. coli, and pUB1 10, pC194, pE194, pTA1060, and pAM ⁇ i permitting replication in Bacillus.
  • the origin of replication may be one having a mutation to make its function temperature- 5 sensitive in the Bacillus cell (See e.g., Ehrlich, Proceedings of the National Academy of
  • a polynucleotide encoding the YmaH protein is introduced into a modified cell via an expression vector capable of replicating within the host cell. Suitable replicating and
  • the overexpression of a YmaH polypeptide results from the expression of at least one copy of a YmaH-encoding polynucleotide that is integrated into
  • the vector when the vector is introduced into the host cell, it is integrated into the genome and replicated together with the genome into which it has integrated.
  • Multiple copies of the YmaH gene can be integrated at several positions in the genome of the host cell.
  • an amplifiable expression cassette carrying a sequence encoding YmaH and a selectable marker e.g., an
  • antimicrobial resistance marker such as a gene coding chloramphenicol acetyl transferase
  • the invention provides a polynucleotide construct that is
  • the polynucleotide constructs of the invention that are incorporated into an integrating vector are targeted to chromosomal sequences of the Bacillus sp. host cell to create modified host cells that comprise stable tandem integrations of multiple vector copies.
  • the polynucleotide construct that is incorporated into the integration vector typically comprises a selectable
  • the invention provides methods for producing a protein of interest in a modified Bacillus cell by culturing the modified cell that is capable of producing a protein of interest and growing the cell under suitable growth conditions for expressing the protein of interest.
  • the host cells and modified host cells of the present 5 invention are cultured in conventional nutrient media.
  • the suitable specific culture conditions such as temperature, pH and the like are known to those skilled in the art. Additional preferred culture conditions are well known to those of skill in the art and are described in various reference publications.
  • the protein of interest produced by the modified host cell is
  • the protein of interest produced by the host cell is secreted into the extracellular space (i.e., the culture medium).
  • the protein of interest can be recovered from the intracellular milieu of the cell in which it is expressed by lysing the host cell and recovering the protein of interest by methods known in the art.
  • the protein of interest can be recovered from the intracellular milieu of the cell in which it is expressed by lysing the host cell and recovering the protein of interest by methods known in the art.
  • modified host cells are cultured under conditions suitable for the expression and recovery of the protein of interest from the cell culture.
  • the protein of interest produced by a modified host cell overexpressing ymaH according to the present invention is secreted into the culture media.
  • the protein of interest e.g., a protease
  • the cells is recovered from the culture medium by conventional procedures,
  • any method suitable for recovering the protease(s) of the present invention finds use in the present invention. Indeed, it is not limited to separating the host cells from the medium by centrifugation or filtration, precipitating the proteinaceous components of the supernatant or filtrate by means of a salt (e.g., ammonium sulfate), chromatographic purification (e.g., ion exchange, gel filtration, affinity, etc.).
  • a salt e.g., ammonium sulfate
  • chromatographic purification e.g., ion exchange, gel filtration, affinity, etc.
  • heterologous or homologous polynucleotide sequences encoding the proteins of interest join the heterologous or homologous polynucleotide sequences encoding the proteins of interest to nucleotide sequence encoding a polypeptide domain which facilitates purification of soluble proteins (Kroll DJ et al., DNA Cell Biol 12:441 -53 [1993]). Such purification facilitating domains
  • metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals (Porath, Protein Expr Purif 3:263- 281 [1992]), protein A domains that allow purification on immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification system (Immunex Corp, Seattle WA).
  • metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals (Porath, Protein Expr Purif 3:263- 281 [1992])
  • protein A domains that allow purification on immobilized immunoglobulin
  • FLAGS extension/affinity purification system Immunex Corp, Seattle WA.
  • cleavable linker sequence such as Factor XA or
  • the transformed host cells of the present invention are cultured in a suitable nutrient medium under conditions permitting the expression of a protein of interest (e.g., a protease), after which the resulting protease is recovered from the culture.
  • a protein of interest e.g., a protease
  • the medium used to culture the cells comprises any conventional medium 5 suitable for growing the host cells, such as minimal or complex media containing appropriate supplements. Suitable media are available from commercial suppliers or may be prepared according to published recipes (e.g., in catalogues of the American Type Culture Collection).
  • the host cells are cultured under batch, fed- batch or continuous fermentation conditions. Classical batch fermentation methods use a
  • the culture medium is made prior to the beginning of the fermentation run, the medium is inoculated with the desired organism(s), and fermentation occurs without the subsequent addition of any components to the medium.
  • the pH and oxygen content, but not the carbon source content, of the growth medium are altered during batch methods.
  • the metabolites and cell biomass of the batch are altered during batch methods.
  • a variation on the standard batch system is the "fed-batch fermentation" system.
  • nutrients e.g., a carbon source, nitrogen source, O 2 , and typically, other nutrients
  • Fed- batch systems are useful when catabolite repression is apt to inhibit the metabolism of the cells and where it is desirable to have limited amounts of nutrients in the medium.
  • Continuous fermentation generally maintains the cultures at a constant high density where cells are primarily in log phase growth.
  • Continuous fermentation allows for the modulation of one factor or any number of factors that affect cell growth and/or end product concentration. For example, in some
  • a limiting nutrient such as the carbon source or nitrogen source is maintained at a fixed rate and all other parameters are allowed to moderate.
  • a limiting nutrient such as the carbon source or nitrogen source is maintained at a fixed rate and all other parameters are allowed to moderate.
  • YmaH overexpression of YmaH in a Bacillus host cell results in an increase in the production of a protein of interest above the level obtained in the corresponding modified
  • the invention provides modified Bacillus host cells that overexpress YmaH.
  • the recombinant Bacillus host cell is a cell that was altered to produce greater levels of a protease than the unaltered parent/precursor Bacillus cell when grown under the same conditions.
  • the present invention also encompasses methods for producing a protein of interest in a modified cell that overexpresses YmaH in less time than that required by the precursor host cell.
  • the modified host cells of the invention are capable of producing a protein of interest at a greater level and at an earlier time than the corresponding unmodified precursor host cell.
  • a protein of interest e.g., a protease
  • the modified host produces a protein of interest in about 1/5 th , about 1/4 th , about 1/3 rd , or about Vz of the time it takes the precursor host cell to attain its maximum level of
  • the phr genes: phrA, phrE, phrC, phrF, phrG, phrH, phrl and phrK were deleted in the Bacillus subtilis strain BG2942 ( ⁇ nprE, degU(Hy)32, amyE::[PxylRA-comK eryR]), and AprE protease expression in the resulting modified Bacillus subtilis strains was determined using an AAPF assay. Deletion of phr genes was performed by inserting a spectinomycin selectable marker flanked by the lox site in the phr locus of the Bacillus
  • the PaprE-FNA expression construct comprises a polynucleotide sequence encoding the FNA protease operably linked to the aprE promoter of Bacillus subtilis.
  • FNA PURAFECT
  • PRIME [Genencor] is subtilisin BPN' from B. amyloliquefaciens that has the
  • the upstream region of the phrA gene containing the rapA sequence was amplified with the primers CB2 008-007 (SEQ ID NO:1 ) and CB2 008-009 (SEQ ID NO:3) and fused to the spectinomycin cassette, flanked by the loxP
  • oligos CB2 008-010 SEQ ID NO:5
  • CB2008-008 SEQ ID NO:2
  • fused to the PCR product containing the rapA sequences and the spectinomycin cassette To create the phrC deletion cassette, the upstream region of the phrC gene containing the rapC sequence was amplified with the primers CB2 008-015 and CB2 008- 5 016 and fused to the spectinomycin cassette, flanked by the loxP sequence, and amplified with the oligos CB2 008-016R and CB2 008-017R.
  • the downstream region of the phrC gene was amplified with the oligos CB2 008-017 and CB2008-018 and fused to the PCR product containing the rapC sequences and the spectinomycin cassette. [0232] To create the phrE deletion cassette, the upstream region of the phrE gene
  • the upstream region of the phrFgeue containing the rapFsequence was amplified with the primers CB2008-022 and CB2008- 023 and fused to the spectinomycin cassette amplified with the oligos CB2008-023R and CB2008-024R.
  • the downstream region of the phrFgeue was amplified with the oligos
  • the upstream region of the phrG gene containing the rapG sequence was amplified with the primers CB2008-026 and CB2008- 027R and fused to the spectinomycin cassette amplified with the oligos CB2008-027 and
  • CB2008-032R The downstream region of the phrl gene was amplified with the oligos CB2008-032 and CB2008-033 and fused to the purified PCR product containing the rapl sequence and the spectinomycin cassette.
  • the upstream region of the phrK gene 5 containing the rap/Csequence was amplified with the primers CB2008-034 and CB2008-
  • nucleotide sequence of the phrA deletion construct is: attcgttattgcaggtaattatgatgatatgcagtatccagaaagagcattgccccacttagaactggctttagatcttgcaaagaaaga
  • nucleotide sequence of the phrC deletion construct is: [0245] tcactaatggaattccggcaccagcttatgctggattatcttgagccgttagagaaattaaatatcgaagaccagcc 25 aagcctgtctgaattatcaagaaacattgacagcaaccaggcagatctcaaagggctgctcgactattacgtgaattttttcgc gggatgtatgaatttgataagcgggaatttattttctgcattacatactataaacaggcggagaaaagctctctttgtcgcag accatattgaacgggctgaattcttttaaaatcgcggaagcttattattatatgaagcaacgtattt
  • nucleotide sequence of the phrF deletion construct is: [0248] agtttcggcacaacctaatgcttgagtaccttgaaccgttagaaaaaatgaggattgaggaacagccgagactgt ctgatctgcttgagattgataaaaaacaggctcgtttaactggtctgcttgagtactatttttaacttcttcagaggcatgtacga gctggaccagcgggaatatctgtcggctattaaatttttcaaaaaggccgaaagcaagctgatattcgttaaggatcggatag 30 agaaagctgagtttttctttaagatgtcttattactatatgaaaca
  • nucleotide sequence of the phrl deletion construct is:
  • the nucleotide sequence of the phrK ⁇ e ⁇ e ⁇ on construct is: [0256] gatgaaatggaagaagatcaagaagttcttgcgtattatagtctattagaagaaagacataaaatgttgctgcattct tcacg agg ag agcctttacaaagcacacctattttactg aag acaatcaaaacttcataacaaaaacaaatg ataaattag aatacaacttttatttttgaagcaatgtacgaggcatacaacaaaaactatgatcgagcaattaacctatatggattagctga 5 gaaaaaagcttgcagaaattccagatgaaattgaagcagctgaatttactctaaagtcttacttatattaagtc
  • the assay measured the production 30 of protease as the increase in absorbance at 405 nm/min resulting from the hydrolysis and release of p-nitroanaline (Estell et al., J Biol Chem., 260:6518-6521 (1985)).
  • the measurements were made using the Sofmax Pro software, and the specified conditions were set as: Type: Kinetic; Reduction: Vmax Points (Read best 15/28 points); Lm 1 : 405 nm; Time: 5 minutes; and Interval: 1 1 Seconds.
  • Twenty microliters of each of the B. subtilis supematants 35 were diluted in 10Oul of Tris Buffer, containing 10 mM Tris + 0.005% TWEEN ⁇ -80, pH 8.6; and 73
  • CB2-5, CB2-7 and CB2-8 Bacillus sp. cells when compared to the production in the unmodified parent strain BG2942 (diamonds).
  • the inactivation constructs cassettes of phrA and phrE (SEQ ID NOS:17 and 18, respectively) were introduced into the Bacillus subtilis strain CF471 .
  • the CF471 strain is the BG3594 strain described above (degU(Hy)32, oppA, LspollE, AaprE, AnprE) and that
  • PaprE-FNA expression construct (SEQ ID NO:19), which encodes for the protease FNA (SEQ ID NO:20).
  • the resulting modified strains CB3-47 (BG3594 phrA::spcR, aprE:[PaprE-FNA, caf ⁇ ), and CB3-48 (BG3594 phrE::spcR, aprE:[PaprE-FNA, caf ⁇ ) were grown in autoclaved suitable growth medium for 50 hours. Samples of the cell culture were centrifuged, and the production of protease was quantified as a function of
  • FIG. 6 shows a graph of protease expression in the double phr deleted strains (CB4-68: BG3594 phrA, phrE, aprE:[PaprE-FNA, caf ⁇ ; 10 triangles, CB4-69: BG3594 phrA, phrE aprE:[PaprE-FNA, caf ⁇ ; crosses) compared to the phrA deleted strains (CB4-46: BG3594 phrA, aprE:[PaprE-FNA, cat ⁇ ; diamonds, CB4-48: BG3594 phrE, aprE:[ PaprE- FN A, caf ⁇ ; squares).
  • strains carrying both deletions of the phrA and phrE genes i.e., strains CB4-
  • Polynucleotide constructs SigH, SigA1 , SigA2, and SigA3 were generated to overexpress YmaH in host cells of Bacillus subtilis.
  • PCR primers were designed to be homologous to the Bacillus subtilis genome ( Figure 7A) and to contain a 6 base pair restriction enzyme site located 6 base pairs from the 5' end of the primer. Primers were designed to engineer unique restriction sites at the
  • KNVQLELE SEQ ID NO:45 (Swiss-Prot:P3756).
  • Sigma H promoter is naturally located within the polynucleotide sequence encoding the miaA gene, close to the 3' end of the gene, and immediately upstream of the ymaH gene.
  • the entire Sigma H promoter and adjacent ymaH coding sequence was amplified by PCR using the forward primer P1 : 5 ggcaccgaattcgacgtggtttcgcaacaaatgcag (SEQ ID NO:24; position 987 to 101 1 of SEQ ID NO:22), with an EcoRI restriction site added at the 5' end, and a reverse primer P2: ggcaccggatccctcataaaaaaaagaccgtgccttgg (SEQ ID NO:25, at position 1472 to 1496 of SEQ ID NO:22), with and added BamhW restriction site (Figure 7B).
  • the SigA1 construct ( Figure 7C; SEQ ID NO:26) gcgccgaattctcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaaggatgtttaccgat gcaaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaagagagtgatacatattgaat
  • ccggtgcc 20 ccggtgcc (SEQ ID NO:26) was generated using two sets of primers.
  • Reverse primer P4 and forward primer P5 are fusion primers that were designed to contain tails that are complementary to each other but that are not homologous to the sequence that is being amplified to eliminate the intervening miaA coding sequence.
  • the two fragments were annealed, and the resulting SigA1 construct contained the SigA promoter (SEQ ID NO:47)
  • the pBS19 plasmid can replicate in E.
  • SigA promoter was amplified using forward primer P3 (SEQ ID NO:27) and reverse fusion 79
  • catacagtttcgattaaagttcgagcactctcttttataaatctccccca (SEQ ID NO:33) [0276] located from bp 125 to bp 149 on the SEQ ID NO:22.
  • the second fragment containing the DNA sequence encoding the YmaH protein was amplified using the 5 forward fusion primer P6:
  • tgggggagatttataaaagagagtgctcgaactttaatcgaaactgtatg (SEQ ID NO:32) located from bp 1090 to bp 1 1 14 on the SEQ ID NO:22 and the reverse primer P2 (SEQ ID NO:25).
  • the two fragments were annealed, and the resulting SigA2 construct contained the SigA promoter, the ribosome binding site GGAGG; SEQ ID NO:51 ) and the 10 transcription start site of the ymati gene.
  • the invention also encompasses a fourth SigA construct (SigA3; SEQ ID NO:22; Figure 7E), which is generated by amplifying the miaA ymati region of the Bacillus chromosomal DNA that includes a SigA promoter, the region encoding the MiaA protein, the a YmaH promoter and the region encoding the YmaH protein.
  • SigA3 SEQ ID NO:22; Figure 7E
  • the SigA3 construct was generated using forward primer P8 gcgcgcgaattcagggaaattgtcggcaatgagccgctcggc (SEQ ID NO:34) and reverse primer P9 gcgcgccatggctgattcgtctcagttctgcttcactttca (SEQ ID NO:35).
  • SEQ ID NO:34 places an EcoRI restriction site at the 5' end of the fragment, while SEQ ID NO:35 places a Ncol site at the 3' end.
  • PCR reactions were performed in 50 ul volume conatining 1 -2 ul DNA or from a colony resuspension, 5 ul of 1 OX Pfu Ultra buffer (Stratagene), 1 uL of 1 OmM dNTP blend 5 (Roche), 0.5 uL of 0.2uM primers, 1 ul Pfu Ultra High Fidelity Polymerase, and the volume adjusted with water to have a total volume of 50 ul.
  • the PCR conditions were: 95 0 C for 2 min, 30 cycles of 95O for 30 sec, 62 0 C for 30 sec, 72°C for 1 min, followed by 1 cycle of 72O fOr 10 min.
  • the obtained PCR fragments were gel purified using Qiagen Gel Purification Kit
  • Fusion constructs were obtained by annealing 0.25ul aliquots of purified PCR fragments that were mixed together and added into fresh PCR mix following the above recipe using primers P3 and P2.
  • the total volume of the PCR mixture was 50 ⁇ l.
  • the PCR conditions were the same as above adjusting the annealing temperature according to the
  • SigH, SigA1 , and SigA2 constructs were ligated into pBS19 plasmids that had been digested with EcoRI and BamhW to generate SigA and SigH expression vectors that were used to transform host cells as described in Example 4.
  • the transformation mixture was plated on LB+1 .6% skim milk+5 ug/ml cmp plates.
  • Plasmid DNA was prepared from the E. coli cultures, and a portion of the plasmid DNA preparation was sequenced (Sequetech). Automated sequence analysis was performed using Phrep, Phrap, Consed,
  • the expression vectors containing the SigH (SEQ ID NO:23) and SigA1 (SEQ ID NO:26) and SigA2 (SEQ ID NO:31 ) constructs were named pBS19 ymaH-H and pBS19 ymaH-A1 and pBS19 ymaH-A2 were transformed into B. subtilis strains BG2941 and BG2942 as follows. Two microliters of the plasmid DNA 5 carrying the appropriate constructs were used to transform 10O ⁇ l of B. substilis cells BG
  • BG2941 ( ⁇ nprE, amyE::PxylRA-comK-phleoR) and BG2942 ⁇ nprE, degU(Hy)32, amyE::PxylRA-comK-eryR).
  • the BG2941 and BG2942 transformants carrying the SigH constructs were named 41 SigH and 42SigH, respectively; and the BG2941 and BG2942 transformants carrying the SigA1 constructs were named 41 SigA1 and 42SigA1 ,
  • BG2941 and BG2942 host cells were also transformed with a control
  • BG2941 and BG2942 host cells carry the deletion of the nprE gene, which abolishes most of the non- aprE background proteolytic activity, thus facilitating the measurement of the alkaline protease (AprE) produced.
  • the BG2941 and BG2942 host cells also carry the cassette
  • Casein assay - The effect of overexpressing YmaH on the production of endogenous AprE subtilisin protease by Bacillus host cells was determined first by a qualitative assay that compares the size of the halos produced by the colonies grown on agar plates containing casein in the form of skim milk. As protease enzyme is secreted by
  • the Bacillus cells it digests the casein in the skim milk, and forms regions of clearing, or halos around the growing colony. Host cells which have an inactive protease will exhibit little or no halo around the colonies. Thus, the size of the halo provides a qualitative assessment of the amount of protease that is produced by the secreting colony (Wells, T. A. et al. Nucleic Acids Res., 1 1 , 791 1 -7925: [1983]).
  • BG2941 and BG2942 Bacillus subtilis host cells transformed with SigH or SigA1 expression vectors were plated onto LB agar plates containing 1 .6% skim milk and 5ppm 83
  • the largest halos were produced by the 42SigH host cells.
  • the 42SigH cells are
  • BG2942 Bacillus subtilis host cells that carry the degU(Hy)32 mutation and the SigH construct that enables the overexpression of YmaH protein.
  • the size of the halos of the 42SigH cells evidences that overexpressing ymaH further enhances the production of subtilisin in host cells that already produce levels of the enzyme that are
  • 42SigH cells produce halos that are bigger than those produced by the 42pBS19 cells, which carry the degU(Hy) mutation but do not carry a construct that enables overexpression of ymaH, but which in turn produce halos that are bigger than the halos produced by the 41 pBS19 cells, which are BG2941 Bacillus subtilis host cells that do not carry the degU(Hy)32 mutation and do
  • proteolytic activity of the secreted protease was determined as the rate of hydrolysis of the substrate succinyl -L-Ala-L-Ala-L-Pro-L-Phe-p-nitroanalide (AAPF from Sigma Chemical Co).
  • the assay measured the level of production of protease as the
  • Figures 10A and 10B show that overexpressing YmaH in Bacillus host cells, whether in presence (42SigA and 42SigH; Figure 10A) or absence (41 SigH; Figure 10B) of the degU(Hy) mutation, enhances the production of the AprE subtilisin by several fold
  • FIG. 10A shows that 42sigH cells produce almost as much subtilisin at 20 hours of growth as the parent control cells produce at 48 hours.
  • Figure 10B shows that 41 SigH cells
  • FIG. 1 1 shows that cells that the expression of YmaH when driven by the SigH promoter (42SigH) results in the production of subtilisin that is greater than that produced by cells in which YmaH expression is driven by the Sigma A promoter (42SigA).
  • Figure 1 1 also shows that overexpression of YmaH whether driven by the SigH
  • SigA promoter results in enhanced production of AprE subtilisin as early as after only one hour of cell growth.
  • SigH which comprises the ymaH gene operably linked to its native promoter (SigH promoter)
  • SigH promoter was amplified by PCR using the primers ymaH 1 F 30 EcoRI (P1 ; SEQ ID NO:24) and ymaH 3'R BamHI (P2; SEQ ID NO:25) and cloned in the multicopy plasmid pBS19 using EcoRI and BamHI restriction sites to generate plasmid pBS19 ymaH sigH (SEQ ID NO:37).
  • strain BG2942 deleted for the phrA (CB2-1 ) and the strain BG2942 deleted for the phrE gene (CB 2-2) were each transformed with the multicopy plasm id pBS19 ymaH sigH (SEQ ID NO:37) to generate strains CB2-1 1 (BG2942 phrA:spc, pBSW ymaH sigH) 25 and CB2-12 (BG2942 phrEispc, pBS19 ymati sigH), respectively, and tested for the expression of aprE.
  • BG2942 cells that do not carry a deletion of either the phrA or the phrE gene were transformed with the pBS19 ymaH sigH plasmid to generate the control strain 42SigH (BG2942 pBS19 ymaH sigH). All BG2942 derived strains (42SigH, CB2-1 1 and CB2-12) were grown for nine hours in 2X SNB media and the supernatants were 30 utilized for assaying the activity of AprE using the AAPF assay.
  • Figure 12 shows the effect of overexpressing YmaH on the production of protease by strains carrying the deletion of either the phrA or phrE gene.
  • the strains carrying the multicopy plasmid pBS19 ymaH sigH i.e., 42SigH
  • CB2-1 1 and CB2-12 showed a higher protease expression when compared to the BG2942 strain that was transformed 35 only with a control pBS19 plasmid (42pBS19).
  • the results show that overexpression of YmaH in the 42SigH strain (BG2942 pBS19 ymaH sigH) (squares) 87
  • YmaH enhances the production of a protein of interest (e.g., a subtilisin)
  • a protein of interest e.g., a subtilisin
  • the deletion cassette of the rapA/phrA operon is diagramed in Figure 13, and the polynucleotide sequence is: tggagggagtcagaccgcgtctttgggaaaaagcaagcggaaagtgaccgtgtttacggatggagatggagggacttca agagagcaggaagccattgtcagagaggttcagcggagtcaagtcatcatgaatccgctattgaaaaaagagatatacag

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

The present invention provides host cells that have been genetically manipulated to have an enhanced capacity to produce proteins of interest. In particular, the invention relates to modified Bacillus sp. Host cells that have at least one inactivated phr gene. The enhanced production of proteins of interest by the modified Bacillus sp. Host cells is further increased in modified Bacillus sp. Host cells that overexpress YmaH. Methods for producing proteins of interest in the modified host cells are also provided

Description

BACILLUS STRAIN FOR INCREASED PROTEIN PRODUCTION
5 CROSS REFERENCE TO RELATED APPLICATION
[01] This application claims the benefit of U.S. Provisional Application No. 61/186,321 , filed on June 1 1 , 2009, which is hereby incorporated by reference in its entirety.
10 [02] The present invention provides host cells that have been genetically manipulated to have an enhanced capacity to produce proteins of interest. In particular, the invention relates to modified Bacillus sp. host cells that have at least one inactivated phr and/or rap gene. The enhanced production of proteins of interest by the modified Bacillus sp. host cells is further increased in modified Bacillus sp. host cells that overexpress YmaH.
15 Methods for producing proteins of interest in the modified host cells are also provided.
BACKGROUND
[03] Expression and recombinant production of exogenous polypeptides is a widely used technique. It is well known that cells can be transformed with nucleic acids encoding
20 exogenous polypeptides of interest for expression and production of large quantities of the desired polypeptides. In some applications, the methods are used to produce amounts of polypeptide over what would be produced naturally by the originating organism. Indeed, expression of exogenous nucleic acid sequences, as well as over-expression of endogenous sequences have been extensively used in modern biotechnology.
25 [04] In spite of the implementation of various approaches for increasing protease yield, including screening for hyper-producing strains, cloning and over-expressing proteases, improving fed-batch and chemostat fermentations, and optimizing fermentation technologies, there remains a need for additional means for enhancing the production of proteases.
30
SUMMARY OF THE INVENTION
[05] The present invention provides host cells that have been genetically manipulated to have an enhanced capacity to produce proteins of interest. In particular, the invention relates to modified Bacillus sp. host cells that have at least one inactivated phr and/or rap
35 gene. The enhanced production of proteins of interest by the modified Bacillus sp. host cells is further increased in modified Bacillus sp. host cells that overexpress YmaH. Methods for producing proteins of interest in the modified host cells are also provided. [06] In one embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, 5 and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[07] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr
10 gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest. Preferably the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and preferably, a protease (e.g., a
15 subtilisin).
[08] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host
20 cell. Preferably, the inactivated rap gene is the rapA gene. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[09] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene, (e.g., rapA gene), and a recombinant nucleic acid for
25 producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest. Preferably, the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
30 [010] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The at least one inactivated phr gene is chosen from phr A, phrE, phrC, phrF, phrG, phrl, and phrK. In
35 some embodiments, the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene. The protein of interest is an 3
enzyme, and preferably, a protease (e.g., a subtilisin)
[011] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a protein of interest at a level that is 5 greater than that produced by the unmodified precursor host cell. The at least one inactivated phr gene is chosen from phr A, phrE, phrC, phrF, phrG, phrl auά phrK. In some embodiments, the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes
10 the protein of interest. Preferably the promoter is the wild-type or mutant aprE promoter.
The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). [012] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene, and a recombinant nucleic acid for producing a protein
15 of interest at a level that is greater than that produced by the unmodified precursor host cell. Preferably, the inactivated rap gene is the rapA gene, and the at least one inactivated phr gene is chosen from phr A, phrE, phrC, phrF, phrG, phrl and phrK. In some embodiments, the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene. The protein of interest is an
20 enzyme, and preferably, a protease (e.g., a subtilisin).
[013] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene (e.g., rapA gene), and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the
25 unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest. Preferably, the promoter is the wild-type or mutant aprE promoter. . The at least one inactivated phr gene is chosen from phrA, phrE, phrC, phrF, phrG, phrl and phrK. Preferably, the inactivated phr gene is the inactivated phrA or phrE gene. The protein of
30 interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[014] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has an inactivated phrA gene and an inactivated phrE gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell.
35 The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[015] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has an inactivated phrA gene and an inactivated phrE gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the 5 polynucleotide sequence that encodes the protein of interest. Preferably the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[016] In another embodiment, the invention provides a modified Bacillus sp. host cell that comprises a genome comprising a rap operon that has an inactivated phrA gene, an
10 inactivated phrE gene, an inactivated rapA gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). [017] In another embodiment, the invention provides a modified Bacillus sp. host cell
15 that comprises a genome comprising a rap operon that has an inactivated phrA gene, an inactivated phrE gene, an inactivated rapA gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest.
20 Preferably the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[018] In another embodiment, the invention provides a modified Bacillus sp. host cell that over expresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a
25 protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[019] In another embodiment, the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has
30 at least one inactivated phr gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest. Preferably the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and
35 preferably, a protease (e.g., a subtilisin).
[020] In another embodiment, the invention provides a modified Bacillus sp. host cell 5
that overexpresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. Preferably, the inactivated rap gene is the rapA gene. 5 The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[021] In another embodiment, the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene (e.g., rapA gene), and a recombinant nucleic acid for producing a protein of interest at a level that is greater than
10 that produced by the unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest. Preferably, the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). [022] In another embodiment, the invention provides a modified Bacillus sp. host cell
15 that overexpresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The at least one inactivated phr gene is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK. In some embodiments, the inactivated phr gene is the inactivated phr A
20 gene, while, other embodiments the inactivated phr gene is the phrE gene. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). [023] In another embodiment, the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and a recombinant nucleic acid for producing a protein
25 of interest at a level that is greater than that produced by the unmodified precursor host cell. The at least one inactivated phr gene is chosen from phr A, phrE, phrC, phrF, phrG, phrl, and phrK. In some embodiments, the inactivated phr gene is the inactivated phr A gene, while, other embodiments the inactivated phr gene is the phrE gene. The recombinant nucleic acid comprises a promoter that is operably linked to the
30 polynucleotide sequence that encodes the protein of interest. Preferably the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[024] In another embodiment, the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has
35 at least one inactivated phr gene, and an inactivated rap gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. Preferably, the inactivated rap gene is the rapA gene, and the at least one inactivated phr gene is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK. In some embodiments, the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene. The protein of 5 interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[025] In another embodiment, the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has at least one inactivated phr gene, and an inactivated rap gene (e.g., rapA gene), and a recombinant nucleic acid for producing a protein of interest at a level that is greater than
10 that produced by the unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest. Preferably, the promoter is the wild-type or mutant aprE promoter. . The at least one inactivated phr gene is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK. Preferably, the inactivated phr gene is the inactivated phrA or phrE gene. The
15 protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[026] In another embodiment, the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has an inactivated phrA gene and an inactivated phrE gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the
20 unmodified precursor host cell. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
[027] In another embodiment, the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has an inactivated phrA gene and an inactivated phrE gene, and a recombinant nucleic acid
25 for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest. Preferably the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin).
30 [028] In another embodiment, the invention provides a modified Bacillus sp. host cell that overexpresses YmaH and that comprises a genome comprising a rap operon that has an inactivated phrA gene, an inactivated phrE gene, an inactivated rapA gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The protein of interest is an enzyme,
35 and preferably, a protease (e.g., a subtilisin).
[029] In another embodiment, the invention provides a modified Bacillus sp. host cell 7
that overexpresses YmaH and that comprises a genome comprising a rap operon that has an inactivated phrA gene, an inactivated phrE gene, an inactivated rapA gene, and a recombinant nucleic acid for producing a protein of interest at a level that is greater than that produced by the unmodified precursor host cell. The recombinant nucleic acid 5 comprises a promoter that is operably linked to the polynucleotide sequence that encodes the protein of interest. Preferably the promoter is the wild-type or mutant aprE promoter. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). [030] In another embodiment, the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell an
10 inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of at least one indigenous phr and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said precursor host cell. In some
15 embodiments, the method further comprises recovering the protein of interest. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). In some embodiments, the host cell comprises a mutation in at least one gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR. Preferably, the host cell comprises a deg(Hy)32 mutation.
20 [031] In another embodiment, the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of at least one indigenous phr and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein
25 production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said precursor host cell. The at least one indigenous phr gene that is inactivated is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK. In some embodiments, the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene. The protein of
30 interest is an enzyme, and preferably, a protease (e.g., a subtilisin). In some embodiments, the method further comprises recovering the protein of interest. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). In some embodiments, the host cell comprises a mutation in at least one gene chosen from degil, degQ, degS, sco4, spollE, degQ and degR. Preferably, the host cell comprises a
35 deg(Hy)32 mutation.
[032] In another embodiment, the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of the indigenous phrA and phrE genes and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable 5 conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said precursor host cell. In some embodiments, the method further comprises recovering the protein of interest. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). In some embodiments, the host cell comprises a mutation in at least one gene chosen from degU,
10 degQ, degS, sco4, spollE, degQ and degR. Preferably, the host cell comprises a deg(Hy)32 mutation.
[033] In another embodiment, the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell an inactivating DNA construct comprising an inactivating polynucleotide that results in the
15 inactivation of the indigenous phrA and rap genes to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said precursor host cell. In some embodiments, the method further comprises recovering the protein of interest. The protein of interest is an enzyme,
20 and preferably, a protease (e.g., a subtilisin). In some embodiments, the host cell comprises a mutation in at least one gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR. Preferably, the host cell comprises a deg(Hy)32 mutation. [034] In another embodiment, the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell that
25 overexpresses YmaH, an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of at least one indigenous phr and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said
30 precursor host cell. In some embodiments, the method further comprises recovering the protein of interest. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). In some embodiments, the host cell comprises a mutation in at least one gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR. Preferably, the host cell comprises a deg(Hy)32 mutation. Overexpression of YmaH is achieved by introducing
35 into either the precursor or the modified host cell a SigH construct (e.g., SEQ ID NO:23), comprising a SigH promoter operably linked to a polynucleotide encoding a YmaH protein. Alternatively, overexpression of YmaH is achieved by introducing into either the precursor or the modified host cell a SigA construct (e.g., SEQ ID NOS:26 and 31 ), comprising a SigA promoter operably linked to a polynucleotide encoding YmaH. [035] In another embodiment, the invention provides a method for producing a protein of 5 interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell that overexpresses YmaH, an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of at least one indigenous phr and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein production of a protein of interest is greater in said
10 modified host cell when compared to the production of said protein of interest in said precursor host cell. The at least one indigenous phr gene that is inactivated is chosen from phrA, phrE, phrC, phrF, phrG, phrl, and phrK. In some embodiments, the inactivated phr gene is the inactivated phrA gene, while, other embodiments the inactivated phr gene is the phrE gene. The protein of interest is an enzyme, and preferably, a protease (e.g., a
15 subtilisin). In some embodiments, the method further comprises recovering the protein of interest. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). In some embodiments, the host cell comprises a mutation in at least one gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR. Preferably, the host cell comprises a deg(Hy)32 mutation. Overexpression of YmaH is achieved by introducing
20 into either the precursor or the modified host cell a SigH construct (e.g., SEQ ID NO:23), comprising a SigH promoter operably linked to a polynucleotide encoding a YmaH protein. Alternatively, overexpression of YmaH is achieved by introducing into either the precursor or the modified host cell a SigA construct (e.g., SEQ ID NOS:26 and 31 ), comprising a SigA promoter operably linked to a polynucleotide encoding YmaH.
25 [036] In another embodiment, the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell that overexpresses YmaH, an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of the indigenous phrA and phrE genes and/or rap gene to generate a modified Bacillus sp. host cell; and growing said modified
30 host cell under suitable conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said precursor host cell. In some embodiments, the method further comprises recovering the protein of interest. The protein of interest is an enzyme, and preferably, a protease (e.g., a subtilisin). In some embodiments, the host cell comprises a mutation in at least one
35 gene chosen from degU, degQ, degS, sco4, spollE, degQ and degR. Preferably, the host cell comprises a deg(Hy)32 mutation. Overexpression of YmaH is achieved by 10
introducing into either the precursor or the modified host cell a SigH construct (e.g., SEQ ID NO:23), comprising a SigH promoter operably linked to a polynucleotide encoding a YmaH protein. Alternatively, overexpression of YmaH is achieved by introducing into either the precursor or the modified host cell a SigA construct (e.g., SEQ ID NOS:26 and 5 31 ), comprising a SigA promoter operably linked to a polynucleotide encoding YmaH.
[037] In another embodiment, the invention provides a method for producing a protein of interest in a host cell that comprises introducing into a precursor Bacillus sp. host cell that overexpresses YmaH, an inactivating DNA construct comprising an inactivating polynucleotide that results in the inactivation of the indigenous phrA and rap genes to
10 generate a modified Bacillus sp. host cell; and growing said modified host cell under suitable conditions, wherein production of a protein of interest is greater in said modified host cell when compared to the production of said protein of interest in said precursor host cell. In some embodiments, the method further comprises recovering the protein of interest. The protein of interest is an enzyme, and preferably, a protease (e.g., a
15 subtilisin). In some embodiments, the host cell comprises a mutation in at least one gene chosen from degil, degQ, degS, sco4, spollE, degQ and degR. Preferably, the host cell comprises a deg(Hy)32 mutation. Overexpression of YmaH is achieved by introducing into either the precursor or the modified host cell a SigH construct (e.g., SEQ ID NO:23), comprising a SigH promoter operably linked to a polynucleotide encoding a YmaH protein.
20 Alternatively, overexpression of YmaH is achieved by introducing into either the precursor or the modified host cell a SigA construct (e.g., SEQ ID NOS:26 and 31 ), comprising a SigA promoter operably linked to a polynucleotide encoding YmaH. protein. [038] The Bacillus sp. host cell of the embodiments described is a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus
25 coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis cell. Preferably, the Bacillus sp. host cell of the embodiments described is a Bacillus subtilis host cell. In each of the present embodiments provided herein, the present invention provides isolated host cells, as well as cells in culture.
30 [039] The present invention provides a host cell comprising a rap operon comprising at least one inactivated phr and/or at least one inactivated rap gene. In some embodiments, the host cell overexpresses YmaH. In some further embodiments, the host cell further comprises a recombinant nucleic acid. In still some further embodiments, the host cell further comprises a polynucleotide sequence encoding a protein of interest. In some
35 additional embodiments, the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence encoding a protein of interest. In some 11
further embodiments, the promoter is the wild-type or a mutant aprE promoter. In some additional embodiments, the host cell is a Bacillus sp. host cell. In still some further embodiments, the Bacillus sp. host cell is Bacillus subtilis. In some additional embodiments, the host cell produces the protein of interest at a level that is greater than 5 that produced by a host cell that does not comprise at least one inactivated phr and/or rap gene. In some further embodiments, the protein of interest is an enzyme. In some additional embodiments, the enzyme is a protease. In still some additional embodiments, the at least one inactivated rap gene is the rapk gene. In some further embodiments, the at least one inactivated phr gene is selected from phr A, phrE, phrC, phrF, phrG, phrl, and
10 phrK. In some embodiments, the at least one inactivated phrgeue is phrA, while in some alternative embodiments, the at least one inactivated phr gene is phrE. In still some further embodiments, the host cell comprises at least one inactivated phr gene and at least one inactivated rap gene. In some further embodiments, the inactivated rap gene is the rapA gene. In still some further embodiments, there is at least one inactivated rap
15 gene (e.g., rapA) and at least one inactivated phr gene selected from phr A, phrE, phrC, phrF, phrG, phrl, and phrK. In some embodiments, the at least one inactivated phr gene is phr A, while in some alternative embodiments, the at least one inactivated phr gene is phrE. In still some further embodiments, the host cell comprises an inactivated phrA gene, an inactivated phrE gene, an inactivated rapA gene, and a recombinant nucleic acid
20 encoding a protein of interest. In some embodiments, the protein of interest is an enzyme.
In still some further embodiments, the enzyme is a protease. In some embodiments, the host cell is a Bacillus sp. host cell. In some further embodiments, the Bacillus sp. host cell is a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus,
25 Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus,
Bacillus subtilis, or Bacillus thuringiensis cell. In some additional embodiments, the Bacillus sp. host cell is a Bacillus subtilis host cell.
[040] The present invention also provides methods for producing at least one protein of interest comprising providing a precursor host cell and an inactivating nucleotide construct
30 comprising an inactivating polynucleotide that inactivates at least one indigenous phr and/or rap gene; introducing said inactivating nucleotide construct into said precursor host cell to generate a modified host cell; and growing the modified host cell under suitable conditions for producing of the at least one protein of interest. In some embodiments of the present methods, the protein of interest is encoded by a recombinant nucleic acid
35 present in the precursor host cell. In some embodiments of the present methods, the protein of interest is encoded by a recombinant nucleic acid present in the modified host 12
cell. In some embodiments of the present methods, the protein of interest is encoded by a recombinant nucleic acid present in the precursor host cell and/or the modified host cell. In some embodiments of the present methods, the recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence encoding the protein of 5 interest. In some additional embodiments of the present methods, the protein of interest is a wild-type protein of interest. In still some additional embodiments of the present methods, the precursor host cell naturally produces the protein of interest. In some further embodiments of the present methods, the production of the protein of interest by the modified host cell is greater than the production of the protein of interest by the
10 precursor host cell. In some embodiments of the present methods, the methods further comprise the step of recovering the protein of interest. In some embodiments of the present methods, the the protein of interest is an enzyme. In some further embodiments of the present methods, the enzyme is a protease. In still some further embodiments of the present methods, the modified host cell comprises a mutation in at least one gene
15 chosen from degil, degQ, degS, sco4, spollE, degQ and degR. In some embodiments of the present methods, the host cell comprises a deg(Hy)32 mutation. In some further embodiments of the present methods, the at least one indigenous phr gene that is inactivated is chosen from phrA, phrE, phrC, phrF, phrG, phrl,anό phrK. In still some further embodiments of the present methods, the inactivating polynucleotide inactivates
20 the indigenous phrA and phrE genes and/or rap gene. In some embodiments of the present methods, the at least one indigenous phr gene is phrA, while in some alternative embodiments, the at least one indigenous phr gene is phrE. In still some additional embodiments of the present methods, the indigenous rap gene is inactivated. In some further embodiments of the present methods, the indigenous rap gene is rapA. In some
25 additional embodiments of the present methods, the precursor or modified host cell overexpresses YmaH. In some embodiments of the present methods, the overexpression of YmaH is achieved by introducing a SigH construct into the precursor or the modified host cell. In some further embodiments of the present methods, the SigH construct comprises SEQ ID NO:23, comprising a SigH promoter operably linked to a
30 polynucleotide encoding YmaH protein. In some additional embodiments of the present methods, the overexpression of YmaH is achieved by introducing a SigA construct into the precursor or said modified host cell. In still some further embodiments of the present methods, the SigA construct comprises SEQ ID NO:26 and/or 31 , comprising a SigA promoter operably linked to a polynucleotide encoding YmaH. In some embodiments of
35 the present methods, the host cell is a Bacillus sp. host cell. In some further embodiments of the present methods, the Bacillus sp. host cell is a Bacillus alkalophilus, 13
Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megateήum, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis cell. In some additional embodiments of the present methods, the Bacillus 5 sp. host cell is a Bacillus subtilis cell.
BRIEF DESCRIPTION OF THE DRAWINGS
[041] Certain aspects of the following detailed description are best understood when
10 read in conjunction with the accompanying drawings. It is emphasized that, according to common practice, the various features of the drawings are not to-scale. On the contrary, the dimensions of the various features are arbitrarily expanded or reduced for clarity. Included in the drawings are the following figures: [042] Figure 1 illustrates the arrangement of phr and rap genes in the Bacillus subtilis
15 rap operons.
[043] Figure 2 schematically illustrates features common to the inactivation cassettes used to delete phr genes in Bacillus subtilis.
[044] Figure 3 shows the production of the AprE protease in the modified Bacillus subtilis strains that comprise a deletion of the phr A, phrE, phrC, phrF, phrG, phrl, and
20 phrK.
[045] Figure 4 is a graph showing the production of AprE in the control Bacillus subtilis parent strain BG2942 (diamonds) and in the modified Bacillus subtilis strains CB2-1 (squares) and CB2-2 (triangles), which respectively contain the deletion of the phrA and the phrE gene.
25 [046] Figure 5 is a graph showing the production of the protease FNA in the parent B. subtilis strain CF471 (diamonds), and in the modified Bacillus subtilis strains CB3-48 (squares) and CB3-47 (triangles), which respectively contain the deletion of the phrE and the phrA gene. [047] Figure 6 is a graph showing a combined effect of deleting phrA and phrE gene on
30 protease production in Bacillus subtilis.
[048] Figure 7 illustrates the location of primers used for generating polynucleotide constructs used to overexpress YmaH in Bacillus subtilis. Panels B-E show the position of the primers used to generate construct SigH (panel B), and SigA constructs SigA1 (panel C), SigA2 (panel D) and SigA3 (panel E) relative to the Bacillus chromosomal
35 sequence of the miaA operon of Bacillus subtilis (base pairs 1865428-1867019 of the
Bacillus subtilis strain 168; NCBI accession number NC000964), which is illustrated in 14
Panel A. Primer pairs P4 - P5 and P6 - P7 are fusion primers, which comprise a "tail'Of base pairs at their 5' end that are homologous to the sequence being directly amplified, and are complementary to each other. The complemetary tails of the fusion primers allow fusion of the amplified Sigma A promoter DNA to the amplified YmaH-encoding DNA to 5 obtain chimeric polynucleotides containing the Sigma A promoter sequence adjacent to the YmaH-encoding sequence while deleting most, or all, of the miaA coding sequence. [049] Figure 8 shows the polynucleotide sequence of a portion of the Bacillus subtilis genome that comprises the sequence defining a sigA promoter to the end of the sequence encoding the YmaH protein (SEQ ID NO:101 ). This sequence is diagrammed
10 in Figure 7, panel A. The beginning of the sequence encoding the miaA protein is indicated and the entire miaA coding sequence shown in bold letters; the beginning of sequence encoding the YmaH protein is indicated and the entire ymaH coding sequence shown in underlined bold letters. [050] Figure 9 shows a map of the plasmid pBS19-ymaH sigH.
15 [051] Figure 10 (A-B) Panel A shows a graph of the proteolytic activity of subtilisin produced by Bacillus control host cells (42pBS) and by Bacillus subtilis host cells that overexpress ymaH (42SigA1 and 42SigH). Panel B shows the subtilisin activity produced by Bacillus control host cells (41 pBS) and by Bacillus subtilis host cells that overexpress ymaH (41 SigH). The proteolytic activity was measured as the increase in absorbance at
20 405 nm due to the hydrolysis and release of p-nitroanaline. The level of enzymatic activity is indicative of the effect of overexpressing ymaH on the production of subtilisin by Bacillus host cells.
[052] Figure 1 1 shows the level of production of subtilisin by Bacillus subtilis control host cells 42pBS19 and by Bacillus host cells 42SigH and 42SigA1 , which overexpress ymaH.
25 [053] Figure 12 is a graph showing a synergistic effect of phr deletion and YamH over- expression (using multicopy plasmid pBS19-ymaH sigH) on AprE expression. The effect of overexpression of YmaH is shown in the Bacillus subtilis strain named YmaH (squares), and in the modified strains CB2-1 1 (triangles) and CB2-12 (crosses), which respectively contain a deletion of the phrA and phrE gene, and is compared to the
30 production of AprE in the control strain 42pBS19.
[054] Figure 13 schematically illustrates the DNA construct used to delete the rapA gene.
[055] Figure 14 shows the level of production of subtilisin FNA by Bacillus subtilis control host cells CF471 (filled diamond), the modified Bacillus subtilis cells CB3-47 (filled
35 square) comprising an inactivated phrA gene, and the modified Bacillus subtilis cells
JS1 121 (open triangle) comprising an inactivated rapA gene and an inactivated phrA 15
gene.
[056] Figure 15 (A-B) shows the effect of deleting the phrH gene (filled square, panel A) or the rapH gene (filled square, panel B) genes on the production of AprE by Bacillus subtilis. 5
DESCRIPTION OF THE INVENTION
[057] The present invention provides host cells that have been genetically manipulated to have an enhanced capacity to produce proteins of interest. In particular, the invention relates to modified Bacillus sp. host cells that have at least one inactivated phr and/or rap
10 gene. The enhanced production of proteins of interest by the modified Bacillus sp. host cells is further increased in modified Bacillus sp. host cells that overexpress YmaH. Methods for producing proteins of interest in the modified host cells are also provided. [058] Although described herein in regard to exemplary serine proteases (e.g., FNA and AprE), the compositions and methods of the present invention are not limited to serine
15 proteases. Indeed, the present invention finds use in improving the production of various classes of enzymes as well as proteases (e.g., amylases, cellulases, oxidases, oxidoreductases, cutinases, mannanases, pectinases, amylases, lipases, etc). Indeed, it is not intended that the present invention be limited to any particular enzyme nor class of enzyme.
20 [059] Unless otherwise indicated, the practice of the present invention involves conventional techniques commonly used in molecular biology, microbiology, protein purification, protein engineering, protein and DNA sequencing, and recombinant DNA fields, which are within the skill of the art. Such techniques are known to those of skill in the art and are described in numerous standard texts and reference works. All patents,
25 patent applications, articles and publications mentioned herein, both supra and infra, are hereby expressly incorporated herein by reference.
[060] Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Various scientific dictionaries that include the terms
30 included herein are well known and available to those in the art. Although any methods and materials similar or equivalent to those described herein find use in the practice or testing of the present invention, some preferred methods and materials are described. Accordingly, the terms defined immediately below are more fully described by reference to the Specification as a whole. It is to be understood that this invention is not limited to the
35 particular methodology, protocols, and 16
[061] reagents described, as these may vary, depending upon the context they are used by those of skill in the art.
[062] As used herein, the singular terms "a", "an," and "the" include the plural reference unless the context clearly indicates otherwise. Unless otherwise indicated, nucleic acids 5 are written left to right in 5' to 3' orientation and amino acid sequences are written left to right in amino to carboxy orientation, respectively.
[063] All patents, patent applications, and other publications, including all sequences disclosed within these references, referred to herein are expressly incorporated by reference, to the same extent as if each individual publication, patent or patent application
10 was specifically and individually indicated to be incorporated by reference. All documents cited are, in relevant part, incorporated herein by reference. However, the citation of any document is not to be construed as an admission that it is prior art with respect to the present invention. [064] Numeric ranges are inclusive of the numbers defining the range. It is intended that
15 every maximum numerical limitation given throughout this specification includes every lower numerical limitation, as if such lower numerical limitations were expressly written herein. Every minimum numerical limitation given throughout this specification will include every higher numerical limitation, as if such higher numerical limitations were expressly written herein. Every numerical range given throughout this specification will include every
20 narrower numerical range that falls within such broader numerical range, as if such narrower numerical ranges were all expressly written herein. [065] The headings provided herein are not limitations of the various aspects or embodiments of the invention which can be had by reference to the Specification as a whole. Accordingly, as indicated above, the terms defined immediately below are more
25 fully defined by reference to the specification as a whole.
Definitions
[066] As used herein, a "modified host cell" is a recombinant host cell that contains at least one inactivated phr and/or a rap gene. A modified host cell is derived from a
30 precursor host cell, which can be a wild-type or a recombinant precursor host cell comprising a phr gene that is not inactivated.
[067] As used herein, "recombinant host cell" refers to a cell that has been modified by the introduction of at least one recombinant/heterologous nucleic acid. Thus, for example, recombinant host cells express genes that are not found in identical form within the parent
35 form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all as a result of deliberate human intervention. 17
[068] As used herein "precursor host cell" is used interchangeably with "parent host cell" to refer to a host cell that is genetically altered to generate a modified host cell. [069] As used herein, the term "recombinant polynucleotide" and "recombinant polypeptide" respectively refer to a polynucleotide and a polypeptide that do not naturally 5 occur in a host cell. A recombinant polynucleotide or polypeptide molecule may contain two or more naturally-occurring sequences that are linked together in a way that does not occur naturally. "Recombination, "recombining," or generating a "recombined" or "recombinant" nucleic acid is generally the assembly of two or more nucleic acid fragments wherein the assembly gives rise to a chimeric gene.
10 [070] As used herein, the term'Yecombinant" when used in reference to a cell means a cell that has been modified by the introduction of a heterologous nucleic acid sequence or that the cell is derived from a cell so modified. Thus, for example, recombinant cells express genes that are not found in identical form within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under
15 expressed or not expressed at all as a result of deliberate human intervention.
[071] As used herein, an "analogous sequence" is a primary biological sequence, such as the amino-acid sequence or the nucleotides of DNA sequences wherein the function of the protein or encoded protein is essentially the same as that designated for Phr, Rap and YmaH proteins recited herein. Additionally, analogous proteins have at least about 60%,
20 at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 97%, at least about 98%, at least about 99% or about 100% sequence identity to the sequence of variants of Phr, Rap and YmaH proteins recited herein. Analogous sequences are determined by known methods of sequence alignment. A commonly used alignment method is BLAST,
25 although as indicated above and below, there are other methods that also find use in aligning sequences. One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, pairwise alignments. It can also plot a tree showing the clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive alignment method
30 of Feng and Doolittle (Feng and Doolittle, J. MoI. Evol., 35:351 -360 [1987]). The method is similar to that described by Higgins and Sharp (Higgins and Sharp, CABIOS 5:151 -153 [1989]). Useful PILEUP parameters including a default gap weight of 3.00, a default gap length weight of 0.10, and weighted end gaps. Another example of a useful algorithm is the BLAST algorithm, described by Altschul et al., (Altschul et al., J. MoI. Biol., 215:403-
35 410, [1990]; and Karlin et al., Proc. Natl. Acad. Sci. USA 90:5873-5787 [1993]). A particularly useful BLAST program is the WU-BLAST-2 program (See, Altschul et al., 18
Meth. Enzymol., 266:460-480 [1996]). WU-BLAST-2 uses several search parameters, most of which are set to the default values. The adjustable parameters are set with the following values: overlap span =1 , overlap fraction = 0.125, word threshold (T) = 1 1 . The HSP S and HSP S2 parameters are dynamic values and are established by the program 5 itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched. However, the values may be adjusted to increase sensitivity. A % amino acid sequence identity value is determined by the number of matching identical residues divided by the total number of residues of the "longer" sequence in the aligned region. The "longer"
10 sequence is the one having the most actual residues in the aligned region (gaps introduced by WU-Blast-2 to maximize the alignment score are ignored). A preferred method utilizes the BLASTN module of WU-BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively. [072] As used herein, "percent (%) sequence identity" or "percent homology" when used
15 in reference to a polynucleotide or to a polypeptide sequence is defined as the percentage of nucleotide or amino acid residues in a candidate sequence that are identical with the nucleotide or amino acid residues of a starting sequence (i.e., the sequence of interest). The percent identity shared by polynucleotide or polypeptide sequences is determined by direct comparison of the sequence information between the molecules by aligning the
20 sequences and determining the identity by methods known in the art. In some embodiments, the alignment includes the introduction of gaps in the sequences to be aligned. In addition, for sequences which contain either more or fewer nucleotides or amino acids than those of the candidate polynucleotide or polypeptide sequences, it is understood that the percentage of homology will be determined based on the number of
25 homologous nucleotides or amino acids in relation to the total number of nucleotides or amino acids. As used herein "homology" refers to sequence similarity or identity, with identity being preferred. This homology is determined using standard techniques known in the art (See e.g., Smith and Waterman, Adv. Appl. Math., 2:482 [1981 ]; Needleman and Wunsch, J. MoI. Biol., 48:443 [1970]; Pearson and Lipman, Proc. Natl. Acad. Sci. USA
30 85:2444 [1988]; programs such as GAP, BESTFIT, FASTA, and TFASTA in the
Wisconsin Genetics Software Package (Genetics Computer Group, Madison, Wl); and Devereux et al., Nucl. Acid Res., 12:387-395 [1984]).
[073] As used herein, the term "heterologous" refers to elements that are not normally associated with each other. For example, if a host cell produces a heterologous protein,
35 that protein is a protein that is not normally produced by that host cell. Likewise, a promoter that is operably linked to a heterologous coding sequence is a promoter that is 19
operably linked to a coding sequence that it is not a wild-type sequence. [074] As used herein, a "protein of interest," or "polypeptide of interest," refers to a protein that is expressed/produced by a host cell. Generally, proteins of interest are desirable proteins that have commercial significance. The protein of interest may be 5 either homologous or heterologous to the host. In some embodiments, the protein of interest is a secreted polypeptide, particularly an enzyme, including but not limited to amylolytic enzymes, proteolytic enzymes, cellulytic enzymes, oxidoreductase enzymes and plant wall degrading enzymes. In further embodiments, these enzyme include, but are not limited to amylases, proteases, xylanases, lipases, laccases, phenol oxidases,
10 oxidases, cutinases, cellulases, hemicellulases, esterases, peroxidases, catalases, glucose oxidases, phytases, pectinases, glucosidases, isomerases, transferases, galactosidases and chitinases. In still further embodiments, the expressed polypeptide is a hormone, cytokine, growth factor, receptor, vaccine, antibody, or the like. While it is not intended that the present invention be limited to any particular protein/polypeptide, in
15 some most preferred embodiments, the expressed protein of interest is a protease.
[075] As used herein, the terms "protease," and "proteolytic activity" refer to a protein or peptide exhibiting the ability to hydrolyze peptides or substrates having peptide linkages. Many well known procedures exist for measuring proteolytic activity (Kalisz, "Microbial Proteinases," In: Fiechter (ed.), Advances in Biochemical Engineering/Biotechnology,
20 [1988]). For example, proteolytic activity may be ascertained by comparative assays which analyze the respective protease's ability to hydrolyze a commercial substrate. Exemplary substrates useful in the analysis of protease or proteolytic activity, include, but are not limited to di-methyl casein (Sigma C-9801 ), bovine collagen (Sigma C-9879), bovine elastin (Sigma E-1625), and bovine keratin (ICN Biomedical 9021 1 1 ). Colorimetric
25 assays utilizing these substrates are well known in the art (See e.g., WO 99/3401 1 ; and
U.S. Pat. No. 6,376,450, both of which are incorporated herein by reference). The pNA assay (See e.g., Del Mar et al., Anal. Biochem., 99:316-320 [1979]) also finds use in determining the active enzyme concentration for fractions collected during gradient elution. This assay measures the rate at which p-nitroaniline is released as the enzyme
30 hydrolyzes the soluble synthetic substrate, succinyl-alanine-alanine-proline- phenylalanine-p-nitroanilide (sAAPF-pNA). The rate of production of yellow color from the hydrolysis reaction is measured at 410 nm on a spectrophotometer and is proportional to the active enzyme concentration. In addition, absorbance measurements at 280 nm can be used to determine the total protein concentration. The active enzyme/total-protein ratio
35 gives the enzyme purity.
[076] As used herein, the term "subtilisin" refers to a protease belonging to the group of 20
serine proteases which initiate the nucleophilic attack on the peptide bond through a serine residue at the active site (serine endopeptidase). Subtilisins are secreted in large amounts from many Bacillus species. For example, FNA, which is subtilisin BPN' containing the Y217L subtitution, is a subtilisin obatained from Bacillus amyloliquefaciens, 5 and AprE is the subtilisin obtained from Bacillus subtilis.
[077] As used herein, "deletion" of a gene refers to deletion of the entire coding sequence, deletion of part of the coding sequence, or deletion of the coding sequence including flanking regions. The deletion may be partial as long as the sequences left in the chromosome provides the desired loss of the biological activity of the gene. The
10 flanking regions of the coding sequence may include from about 1 bp to about 500 bp at the 5' and 3' ends. The flanking region may be larger than 500 bp but will preferably not include other genes in the region which may be inactivated or deleted according to the invention. The end result is that the deleted gene is effectively non-functional. In simple terms, a "deletion" is defined as a change in either nucleotide or amino acid sequence in
15 which one or more nucleotides or amino acid residues, respectively, have been removed
(i.e., are absent). Thus, a "deletion mutant" has fewer nucleotides or amino acids than the respective parent host cell. In some embodiments, deletion of a phr gene provides enhanced expression of a protein of interest (e.g., a protease). [078] In some embodiments, deletion of one or more of genes selected from the group
20 consisting of phrA, phrC, phήΞ, phrf, phή, and phrK, provides an improved strain for the enhanced production of a protease.
[079] As used herein, a "corresponding unmodified Bacillus strain" or "parent" or "precursor" Bacillus sp. host cell is the originating host strain from which the indigenous chromosomal region (e.g., phrA and/or phrE gene), is inactivated and from which the
25 altered/recombinant strain is derived.
[080] A polypeptide is "overexpressed" in a recombinant host cell if the polypeptide is expressed in the recombinant cell at a higher level that the level at which it is expressed in the precursor cell. [081] As used herein, the term "homologous," when used in reference to a
30 polynucleotide or protein, refers to a polynucleotide or protein that occurs naturally in a host cell.
[082] The term "polypeptide," as used herein, refers to a compound made up of amino acid residues linked by peptide bonds. The term "protein" as used herein, may be synonymous with the term "polypeptide" or may refer, in addition, to a complex of two or
35 more polypeptides. Thus, the terms "protein," "peptide," and "polypeptide" are used interchangeably. 21
[083] As used herein, the terms "chimeric polypeptide" and "fusion polypeptide" are used interchangeably to refer to a protein that comprises at least two separate and distinct regions that may or may not originate from the same protein. For example, a signal peptide linked to the protein of interest wherein the signal peptide is not normally 5 associated with the protein of interest would be termed a chimeric polypeptide or chimeric protein.
[084] As used herein, a "signal sequence" is a sequence of amino acids present at the N-terminal portion of a protein which facilitates the secretion of the mature form of the protein outside the cell. The definition of a signal sequence is a functional one. The
10 mature form of the extracellular protein lacks the signal sequence which is cleaved off during the secretion process.
[085] A "prosequence" is an amino acid sequence between the signal sequence and mature protease that is necessary for the secretion of the protease. Cleavage of the pro sequence results in a mature active protease.
15 [086] The term "signal sequence" or "signal peptide" refers to any sequence of nucleotides and/or amino acids which may participate in the secretion of the mature or precursor forms of the protein. This definition of signal sequence is a functional one, meant to include all those amino acid sequences encoded by the N-terminal portion of the protein gene, which participate in the effectuation of the secretion of protein. They are
20 often, but not universally, bound to the N-terminal portion of a protein or to the N-terminal portion of a precursor protein. The signal sequence may be endogenous or exogenous. The signal sequence may be that normally associated with the protein (e.g., protease), or may be from a gene encoding another secreted protein. One exemplary exogenous signal sequence comprises the first seven amino acid residues of the signal sequence
25 from Bacillus subtilis subtilisin fused to the remainder of the signal sequence of the subtilisin from Bacillus lentus (ATCC 21536).
[087] The term "aprE promoter" herein refers to the polynucleotide promoter sequence that naturally drives the expression of subtilisin in B. subtilis (Ferrari et ai, J Bacteriol. 170:289-295 [1988]). In the context of aprE promoter, "an aprE promoter" herein refers to
30 a wild-type aprE promoter and mutants thereof. In some embodiments, the aprE promoter includes the nucleotide sequences necessary for the transcriptional regulation exerted by DegU, ScoC, AbrB and any other regulator of such promoter, and/or the aprE transcriptional leader (Hambraeus et ai, Microbiology 148:1795-1803 [2002]). In some alternative embodiments, the aprE promoter does not include all of the nucleotide
35 sequences necessary for the transcriptional regulation exerted by DegU, ScoC, AbrB and other regulators, and/or does not include the aprE transcriptional leader sequence. 22
[088] As used herein, an "inactivated gene" is a locus of a genome that, prior to its inactivation, was capable of producing a protein (i.e., capable of being transcribed into an RNA that could be translated to produce a full length polypeptide). A gene encoding a polypeptide is inactivated when it not transcribed and translated into a full length protein 5 that has biological activity (e.g., catalytic activity, in the case of an enzyme). A gene may be inactivated by altering a sequence required for its transcription, for example by altering a sequence required for RNA processing (e.g., poly-A tail addition), by altering a sequence required for translation, or by altering the amino acid sequence of the encoded polypeptide (e.g., by a nucleotide substitution, etc). Examples of inactivated genes include
10 but are not limited to a deleted gene, a gene containing a deleted region, a gene containing a rearranged region, a gene having an inactivating point mutation or frameshift, and a gene containing an insertion. A gene may also be inactivated by altering or deleting the sequence of the adjacent gene in an operon. In addition, a gene may also be inactivated using antisense or any other method that abolishes expression of that gene.
15 [089] As used herein, the term "nucleic acid" encompasses DNA, RNA, whether single stranded or double stranded, and encompasses chemically modified DNA or RNA. The terms "nucleic acid" and "polynucleotide" are used interchangeably herein. [090] The term "inactivation" includes any method that prevents the functional expression of one or more of the phr genes (phrA, phrC, phήΞ, phrf, phή, and phrK),
20 wherein the gene or gene product (i.e., the encoded Phr protein), is unable to exert its known function. Inactivation occurs via any suitable means, including deletions, substitutions (e.g., mutations), interruptions, and/or insertions in the nucleic acid gene sequence. In some embodiments, an altered/recombinant Bacillus strain comprises inactivation of one or more genes that results preferably in stable and non-reverting
25 inactivation. In some embodiments, inactivation is achieved by deletion. In some preferred embodiments, the gene is deleted by homologous recombination. For example, in some embodiments when phrA is the gene to be deleted, an inactivating DNA construct comprising an incoming sequence having a selective marker flanked on each side by a homology box is used. The homology box comprises nucleotide sequences homologous
30 to nucleic acids flanking regions of the chromosomal phrA gene. The inactivating DNA construct aligns with the homologous sequences of the Bacillus host chromosome and in a double crossover event the phrA gene is excised out of the host chromosome. [091] In certain embodiments, the altered/recombinant cell is a Bacillus sp. host cell that comprises two inactivated genes (e.g., phrA andphrE). In other embodiments, the
35 Bacillus sp. host cell comprises three inactivated genes, four inactivated genes, five inactivated genes, six inactivated genes, or more. Thus, it is not intended that the number 23
of inactivated genes be limited to any particular number of genes. In some embodiments, the inactivated genes are contiguous to each another, while in other embodiments, they are located in separate regions of the Bacillus chromosome. In some embodiments, an inactivated chromosomal gene has a necessary function under certain conditions, but the 5 gene is not necessary for Bacillus strain viability under laboratory conditions. Preferred laboratory conditions include but are not limited to conditions such as growth in a fermenter, in a shake flask, plated media, etc., suitable for the growth of the microorganism. [092] As used herein, the terms "inactivating DNA construct", "inactivating
10 polynucleotide" and "deletion cassette" are used interchangeably to refer to a DNA construct comprising a non-functional sequence that may be inserted into a gene to disrupt the function of the gene. In some embodiments, the inactivating DNA construct comprises a sequence encoding a selective marker. The inactivating DNA construct may also include two homology boxes.
15 [093] As used herein, the terms "expression cassette" and "expression vector" refer to nucleic acid constructs generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a target cell. The recombinant expression cassette can be incorporated into a plasm id, chromosome, mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the
20 recombinant expression cassette portion of an expression vector includes, among other sequences, a nucleic acid sequence to be transcribed and a promoter. In preferred embodiments, expression vectors have the ability to incorporate and express heterologous DNA fragments in a host cell. Many prokaryotic and eukaryotic expression vectors are commercially available. Selection of appropriate expression vectors is within
25 the knowledge of those of skill in the art. The term "expression cassette" is used interchangeably herein with "DNA construct," and their grammatical equivalents. Selection of appropriate expression vectors is within the knowledge of those of skill in the art. [094] As used herein, the terms "DNA construct" and "transforming DNA" are used interchangeably to refer to DNA used to introduce sequences into a host cell or organism.
30 The DNA may be generated in vitro by PCR or any other suitable technique(s) known to those in the art. In particularly preferred embodiments, the DNA construct comprises a sequence of interest (e.g., as an incoming sequence). In some embodiments, the sequence is operably linked to additional elements such as control elements (e.g., promoters, etc.). The DNA construct may further comprise a selectable marker. It may
35 further comprise an incoming sequence flanked by homology boxes. In a further embodiment, the transforming DNA comprises other non-homologous sequences, added 24
to the ends (e.g., stuffer sequences or flanks). In some embodiments, the ends of the incoming sequence are closed such that the transforming DNA forms a closed circle. The transforming sequences may be wild-type, mutant or modified. In some embodiments, the DNA construct comprises sequences homologous to the host cell chromosome. In 5 other embodiments, the DNA construct comprises non-homologous sequences. Once the
DNA construct is assembled in vitro it may be used to: 1 ) insert heterologous sequences into a desired target sequence of a host cell, and/or 2) mutagenize a region of the host cell chromosome (i.e., replace an endogenous sequence with a heterologous sequence), 3) delete target genes; and/or introduce a replicating plasmid into the host.
10 [095] As used herein, the term "heterologous DNA sequence" refers to a DNA sequence that does not naturally occur in a host cell. In some embodiments, a heterologous DNA sequence is a chimeric DNA sequence that is comprised of parts of different genes, including regulatory elements. [096] As used herein, the term "heterologous protein" refers to a protein or polypeptide
15 that does not naturally occur in the host cell (i.e., it is encoded by a heterologous sequence).
[097] As used herein, "homologous protein" refers to a protein or polypeptide native or naturally occurring in a cell. [098] The term "YmaH protein" is interchangeably used with "Hfq protein" and refers to a
20 protein that enhances the expression of a protein of interest. In the context of YmaH, "a
YmaH protein" herein refers to a wild-type YmaH protein and variants thereof, including orthologs.
[099] As used herein, the term "vector" refers to a polynucleotide designed to introduce nucleic acids into one or more host cells. In preferred embodiments, vectors
25 autonomously replicate in different host cells. The term is intended to encompass, but is not limited to cloning vectors, expression vectors, shuttle vectors, plasmids, phage particles, cassettes, and the like.
[0100] An "expression vector" as used herein refers to a DNA construct comprising a protein-coding region that is operably linked to a suitable control sequence capable of
30 effecting expression of the protein in a suitable host cell. In some embodiments, such control sequences include a promoter to effect transcription, an optional operator sequence to control transcription to produce mRNA, a sequence encoding suitable ribosome binding sites on the mRNA, and enhancers and sequences which control termination of transcription and translation.
35 [0101] As used herein, the term "promoter" refers to a regulatory sequence that initiates transcription of a downstream nucleic acid. 25
[0102] As used herein, the term "operably linked" refers to an arrangement of elements that allows them to be functionally related. For example, a promoter is operably linked to a coding sequence if it controls the transcription of the sequence. [0103] As used herein, the term "derived" encompasses the terms "originated from," 5 "obtained," or "obtainable from," and "isolated from".
[0104] As used herein, a "non-pathogenic" organism is an organism that is not pathogenic to humans and/or other animals.
[0105] The terms "recovered," "isolated," and "separated," as used herein refer to a protein, cell, nucleic acid or amino acid that is removed from at least one component with
10 which it is naturally associated.
[0106] As used herein in the context of introducing a nucleic acid sequence into a cell, the term "introduced" refers to any method suitable for transferring the nucleic acid sequence into the cell. Such methods for introduction include but are not limited to protoplast fusion, transfection, transformation, conjugation, and transduction (See e.g., Ferrari et al.,
15 "Genetics, " in Hardwood et al, (eds.), Bacillus, Plenum Publishing Corp., pages 57-72,
[1989]).
[0107] As used herein, the terms "transformed" and "stably transformed" refers to a cell that has a non-native (heterologous) polynucleotide sequence integrated into its genome or as an episomal plasmid that is maintained for at least two generations.
20 [0108] As used herein, the term "expression" refers to the process by which a polypeptide is produced based on the nucleic acid sequence of a gene. The process includes both transcription and translation.
[0109] As used herein, the term "selectable marker-encoding nucleotide sequence" refers to a nucleotide sequence, which is capable of expression in the host cells and where
25 expression of the selectable marker confers to cells containing the expressed gene the ability to grow in the presence of a corresponding selective agent or lack of an essential nutrient.
[0110] As used herein, the terms "selectable marker" and "selective marker" refer to a nucleic acid (e.g., a gene) capable of expression in host cell, which allows for ease of
30 selection of those hosts containing the vector. Examples of such selectable markers include but are not limited to antimicrobials. Thus, the term "selectable marker" refers to genes that provide an indication that a host cell has taken up an incoming DNA of interest or some other reaction has occurred. Typically, selectable markers are genes that confer antimicrobial resistance or a metabolic advantage on the host cell to allow cells containing
35 the exogenous DNA to be distinguished from cells that have not received any exogenous sequence during the transformation. A "residing selectable marker" is one that is located 26
on the chromosome of the microorganism to be transformed. A residing selectable marker encodes a gene that is different from the selectable marker on the transforming DNA construct. Selective markers are well known to those of skill in the art. As indicated above, preferably the marker is an antimicrobial resistant marker (e.g., ampR; phleoR; 5 specR ; kanR; eryR; tetR; cmpR; and neoR (See e.g., Guerot-Fleury, Gene, 167:335-337,
1995; Palmeros et al., Gene 247:255-264, 2000; and Trieu-Cuot et al., Gene, 23:331 -341 , 1983). Other markers useful in accordance with the invention include, but are not limited to auxotrophic markers, such as tryptophan; and detection markers, such as β- galactosidase.
10 [0111] As used herein, "culturing" refers to growing a population of microbial cells under suitable conditions in a liquid or solid medium. In some embodiments, culturing refers to fermentative recombinant production of an exogenous protein of interest or other desired end products (typically in a vessel or reactor). [0112] As used herein, the term "production" when used in reference to a protein of
15 interest encompasses the processes of transcription, and translation, and when needed, the processes of secretion and maturation, which creates the active from of the protein. For proteins that aer secreted into the extracellular medium (e.g., proteases), the level of protein production is assessed as the amount of active protein secreted into the extracellular medium.
20 [0113] As used herein, " Bacillus sp." refers to all of the species within the genus
"Bacillus," which are Gram-positive bacteria classified as members of the Family Bacillaceae, Order Bacillales, Class Bacilli. The genus "Bacillus" includes all species within the genus "Bacillus," as known to those of skill in the art, including but not limited to Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans,
25 Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus,
Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuήngiensis. It is recognized that the genus Bacillus continues to undergo taxonomical reorganization. Thus, it is intended that the genus include species that have been reclassified, including but not limited to such organisms as
30 B. stearothermophilus, which is now named "Geobacillus stearothermophilus." The production of resistant endospores in the presence of oxygen is considered the defining feature of the genus Bacillus, although this characteristic also applies to the recently named Alicyclobacillus, Amphibacillus, Aneurinibacillus, Anoxybacillus, Brevibacillus, Filobacillus, Gracilibacillus, Halobacillus, Paenibacillus, Salibacillus, Thermobacillus,
35 Ureibacillus, and Virgibacillus.
[0114] Other definitions of terms may appear throughout the Specification. 27
[0115] Before the exemplary embodiments are described in more detail, it is to be understood that the present invention is not limited to particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be 5 limiting.
[0116] Before the exemplary embodiments are described in more detail, it is to be understood that the present invention is not limited to particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be
10 limiting.
[0117]
[0118] Modified Host Cells
[0119] Bacillus sp. cells make use of two-component signal transduction systems, each containing a sensor kinase and a response regulator, to sense and respond to a wide
15 variety of extracellular stimuli. Known two-component systems are involved in various processes, such as competence development (Dubnau, Microbiological Reviews 1991 55, 395-424), protein secretion (Kunst, Research in Microbiology 1994 145, 393-402; Darmon, Journal of Bacteriology 2002 184, 5661 -5671 ), synthesis of peptide antibiotics and bacteriocins (Marahiel Molecular Microbiology 1993 7, 631-636; Stein, Molecular
20 Microbiology 2002 44, 403-416) and sporulation (Grossman, Annual Reviews Genetics
1995 29, 477-508). These regulatory systems are governed by intracellular response regulators aspartyl phosphatases (Raps), and their antagonistic phosphatase regulators (Phrs). The Raps dephosphorylate response regulators, which alter gene expression thereby produce cellular responses. The Phr peptides serve as cell density-signaling
25 molecules and inhibit the Rap phosphatases (Perego, Procedings of the National
Acadamy of Science USA 1997 94, 8612-8617; Perego, M. Trends in Microbiology 1998 6, 366-370; Perego, Cell 1994 79, 1047-1055).
[0120] While the Rap phosphatases remain in the cytoplasm, Phr peptides contain an amino-terminal signal peptide and are exported as pro-peptides, most likely via the Sec
30 pathway (Perego, Molecular Microbiology 1996 19, 1 151-1 170; Tjalsma. Microbiological and Molecular Biology Reviews 2000 64, 515-547). Further, extracellular processing results in active Phr pentapeptides. After re-import by cells in the culture via the oligopeptide permease (Opp) system, Phr peptides specifically inhibit the activity of their cognate Rap phosphatase (Solomon, Genes and Development 1996 10, 2014-2024;
35 Perego, Procedings of the National Acadamy of Science USA 1997 94, 8612-8617;
Perego, Trends in Microbiology 1998 6, 366-370). The Phr peptides act as quorum 28
sensors in that they initiate cellular responses in response to changes in cell density. A Rap protein and the Phr peptide that inhibits the Rap protein are encoded on a single operon. There are eight rap operons transcribed with their cognate phr genes, and three other rap coding genes in the B. subtilis genome (Kunst, Nature 1997 390, 249-256). The 5 rap/phr signaling systems of Bacillus subtilis are reviewed in Pottathil (Front Biosci. 2003
8:d32-45) and Perego (Peptides 2001 22:1541 -7).
[0121] The present invention provides modified Bacillus sp.host cells that are genetically manipulated to have an enhanced capacity to produce proteins of interest. In particular, the present invention relates to modified Bacillus sp. cells that contain a genome
10 comprising at least one rap operon that comprises an inactivated phr gene. In some embodiments, the modified Bacillus sp. cells contain a genome comprising at least one rap operon that comprises an inactivated phr gene and an inactivated rap gene. Inactivation of the phr and/or rap gene enhances the production of a protein of interest by the modified Bacillus sp. cell when compared to the production of the same protein by the
15 unmodified precursor Bacillus sp. cell. Thus, the modified Bacillus sp. cell comprises at least one inactivated phr and/or rap gene and a polynucleotide that encodes a protein of interest. In some embodiments, the polynucleotide that encodes the protein of interest is a wild-type polynucleotide. In other embodiments, the polynucleotide that encodes the protein of interest is a recombinant polynucleotide.
20 [0122] The DNA sequences of several Bacillus sp. rap operons and the Rap and Phr proteins encoded by the operons have been determined and deposited into NCBI's Genbank database. In certain embodiments, a Bacillus sp. rap operon modified in the subject cell: a) may have at least about 70% , at least about 80%, at least about 90%, at least about 95%, at least about 97% or at least about 98% sequence identity to the
25 sequence of a rap operon sequence deposited in NCBI's Genbank database; b) may hybridize under stringent conditions to a rap operon sequence deposited in NCBI's Genbank database; or c) may encode a polypeptide that has at least about 70% sequence identity (e.g., at least about 80%, at least about 90%, at least about 93%, at least about 95%, at least about 97% or at least about 98% sequence identity) to a Rap or
30 Phr sequence deposited in NCBI's Genbank database. Exemplary phr protein and nucleotide sequences deposited in NCBI's Genbank database include those annotated in Genbank accession no. NC_000964.2; GID: 50812173 (β. subtilis), Genbank accession no. NC_009848.1 ; GID: 157690798 (Bacillus pumilus), Genbank accession no. NC_006270.3; GID: 1631 19169 (Bacillus licheniformis) and Genbank accession no.
35 NC_005957.1 ; GID 49476684 (Bacillus thuringiensis) among others. Rap proteins may be identified as containing a so-called tetratricopeptide repeat domain, a pfam domain 29
that typically contains 34 amino acids and contains the following amino acid sequence [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE]. The above Genbank accessions are incorporated by reference in their entirety, including the nucleic acid and protein sequences therein and the annotation of those sequences, as of 5 the earliest filing date of this patent application.
[0123] In some embodiments, the well-known Bacillus subtilis strain 168 finds use in the present invention. Indeed, the genome of this strain has been well-characterized (See, Kunst et al., Nature 390:249-256 [1997]; and Henner et al., Microbiol. Rev., 44:57-82 [1980]). The genome is comprised of one 4215 kb chromosome. While the coordinates
10 used herein refer to the 168 strain, the invention encompasses analogous sequences from Bacillus strains other than Bacillus subtilis 168.
[0124] In one embodiment, a modified Bacillus sp. cell comprises a single inactivated phr gene (e.g., a rapA operon containing an inactive phrA gene, a rapC operon containing an inactive phrC gene; a rapE operon containing an inactive phrE gene, a rapF operon
15 containing an inactive phrFgeue, a rap/ operon containing an inactive phrl gene, or a rapK operon containing an inactive phrK gene).
[0125] In one embodiment, the modified Bacillus sp. cell comprises an inactivated phr A gene (e.g., a rapA operon containing an inactive phrA gene). In some embodiments, inactivation results from the deletion of the entire endogenous DNA sequence that
20 encodes the PhrA protein. In some embodiments, the entire endogenous DNA sequence of the Bacillus subtilis phrA gene is deleted using the inactivating DNA deletion construct of SEQ ID NO:17. In Bacillus subtilis 168, the DNA sequence that encodes the phrA protein MKSKWMSGLL LVAVGFSFTQ VMVHAGETAN TEGKTFHIAA RNQT; SEQ ID NO:42 (Swiss-Prot:Q00829) is
25 [0126] atgaaatctaaatggatgtcaggtttgttgctcgttgcggtcgggttcagctttactcaggtgatggttcatgcaggtga aacagcaaacacagaagggaaaacatttcatattgcggcacgcaatcaaaca; SEQ ID NO:41 (NP_389126). Alternatively, inactivation of the phrA gene results from the deletion of a fragment of the phrA gene that prevents the functional expression of the PhrA protein. The phrA gene is located at about 1316305-1316439 bp of the B. subtilis 168
30 chromosome (Accession no.NC_000964). According to one embodiment, inactivation of the phrA gene is by insertion of a selectable marker that interrupts the phrA gene. [0127] In another embodiment, the modified Bacillus sp. cell comprises an inactivated phrE gene (e.g., a rapE operon containing an inactive phrE gene). In some embodiments, inactivation results from the deletion of the entire endogenous DNA sequence that
35 encodes the PhrE protein.
[0128] In some embodiments, the entire endogenous DNA sequence of the Bacillus 30
subtilis phrE gene is deleted using the inactivating DNA deletion construct of SEQ ID NO:18. In Bacillus subtilis 168, the DNA sequence that encodes the phrE protein MKSKLFISLS AVLIGLAFFG SMYNGEMKEA SRNVTLAPTH EFLV; SEQ ID NO:44 (Swiss-Prot:032025) is
5 atgaaatctaaattgtttatcagtttatccgccgttttaattggacttgcctttttcggatctatgtataatggcgaaatgaaggaagc atcccggaatgtaactctcgcacctactcatgaattccttgtt; SEQ ID NO:43 (NP 390461 ). Alternatively, inactivation of the phrE gene results from the deletion of a fragment of the phrE gene that prevents the functional expression of the PhrE protein. The phrE gene is located at about 2659557-2659691 bp of the B. subtilis 168 chromosome (Accession no.NC_000964).
10 According to one embodiment, inactivation of the phrE gene is by insertion of a selectable marker that interrupts the phrE gene.
[0129] In yet other embodiments, the phrA and the phrE genes are deleted from the Bacillus subtilis chromosome using the phrA and the phrE deletion constructs set forth in SEQ ID NOS:17 and 18, respectively.
15 [0130] In some other embodiments, the modified Bacillus sp. cell comprises at least two inactivated phr genes (e.g., two rap operons each containing an inactivated phr gene), at least three inactivated phr genes (e.g., three rap operons each containing an inactivated phr gene) at least four inactivated phr genes (e.g., four rap operons each containing an inactivated phr gene), at least five inactivated phrgeues (e.g., five rap operons each
20 containing an inactivated phr gene), at least six inactivated phr genes (e.g., six rap operons each containing an inactivated phr gene), at least seven inactivated phr genes, (e.g., seven rap operons each containing an inactivated phr gene), or at least eight inactivated phr genes (e.g., eight rap operons each containing an inactivated phr gene). In one exemplary embodiment, a subject host cell may contain both a) a rapA operon
25 containing an inactive phrA gene and b) a rapE operon containing an inactive phrE gene.
In some embodiments, inactivation results from the deletion of the entire endogenous DNA sequences that encode the PhrA and the PhrE proteins, respectively. Alternatively, inactivation of the phrA and phrE gene results from the deletion of a fragment of the phrA and the phrE gene that prevents the functional expression of the PhrA and the PhrE
30 proteins, respectively. Thus, in some embodiments, a segment of the phrA gene is deleted, and a segment of the phrE gene is deleted from the chromosome. Similarly, the inactivation of the phrA and the phrE genes results from the deletion of the entire endogenous DNA sequence that encodes the PhrA and the deletion of a DNA sequence that encodes a fragment of the PhrE protein. Alternatively, the inactivation of the phrA
35 and the phrE genes results from the deletion of the entire endogenous DNA sequence that encodes the PhrE and the deletion of a DNA sequence that encodes a fragment of 31
the PhrA protein. Fragments of phr genes (e.g. phrA and/or phrE), include a range of about 1 % to about 99% of the indigenous chromosomal region encoding the phrA and/or phrE proteins. In other embodiments, fragments include a range of about 5% to 95% of the indigenous chromosomal region. In yet additional embodiments, fragments comprise 5 at least about 99%, about 98%, about 97%, about 96%, about 95%, about 94%, about
93%, about 92%, about 90%, about 88%, about 85%, about 80%, about 75%, about 70%, about 65%, about 50%, about 40%, about 30%, about 25%, about 20% and about 10% of the indigenous chromosomal region. [0131] In some embodiments, inactivation of the phrA and/or phrE genes is achieved by
10 deletion resulting from homologous recombination. For example, in some embodiments when phr is the gene to be deleted, an inactivating DNA construct comprising a selectable marker flanked on each side by a homology box is used. The homology box comprises nucleotide sequences homologous to nucleic acids flanking regions of the chromosomal phr gene. The DNA construct aligns with the homologous sequences of the Bacillus host
15 chromosome and in a double crossover event the phr gene is excised out of the host chromosome. The inactivating DNA construct is assembled in vitro, followed by direct cloning of the construct into a competent Bacillus host, such that the DNA construct becomes integrated into the Bacillus chromosome. For example, PCR fusion and/or ligation can be employed to assemble a DNA construct in vitro. In some embodiments, the
20 DNA construct is a non-plasmid construct, while in other embodiments it is incorporated into a vector (e.g., a plasm id).
[0132] In other embodiments, the inactivating DNA construct comprises a selectable marker flanked on the 5' and 3' ends with a fragment of the gene sequence. In some embodiments, when the DNA construct comprising the selectable marker and gene, gene
25 fragment or homologous sequence thereto is transformed into a host cell, the location of the selectable marker renders the gene non-functional for its intended purpose. In some embodiments, the inactivating DNA construct comprises the selectable marker located in the promoter region of the gene. In other embodiments, the inactivating DNA construct comprises the selectable marker located 3' to the promoter region of gene. In yet other
30 embodiments, the inactivating DNA construct comprises the selectable marker located in the coding region of the gene. In further embodiments, the inactivating DNA construct comprises a selectable marker flanked by a homology box on both ends. In still further embodiments, the inactivating DNA construct includes a sequence that interrupts the transcription and/or translation of the coding sequence. In yet additional embodiments, the
35 DNA construct includes restriction sites engineered at the upstream and downstream ends of the construct. 32
[0133]
[0134] In another embodiment, inactivation of the phrA and/or phrE gene is by insertion of a selectable marker that interrupts the phrA and/or phrE gene in a single crossover event. In some embodiments, the selectable marker is located within the gene coding sequence 5 or on a part of the plasmid separate from the gene. The vector is integrated into the
Bacillus chromosome, and the gene is inactivated by the insertion of the vector in the coding sequence.
[0135] Other suitable means for inactivating a phr gene include introducing mutations that result in amino acid substitutions, and truncations that accompany a corresponding loss in
10 the biological activity of the phr protein. In some embodiments, a modified Bacillus sp. cell comprises inactivation of one or more phr genes that results preferably in stable and non-reverting inactivation. Methods of mutating genes are well known in the art and include but are not limited to site-directed mutation, generation of random mutations, and gapped-duplex approaches (See e.g., U.S. Pat. No. 4,760,025; Moring et ai, Biotech.
15 2:646 [1984]; and Kramer et ai, Nucleic Acids Res., 12:9441 [1984]).
[0136] Whether the inactivating DNA construct is incorporated into a vector or used without the presence of plasmid DNA, it is used to transform microorganisms. It is contemplated that any suitable method for transformation will find use with the present invention. In some embodiments, at least one copy of the inactivating DNA construct is
20 integrated into the host Bacillus chromosome. In some embodiments, one or more inactivating DNA constructs of the invention are used to transform host cells. For example, one inactivating DNA construct may be used to inactivate a phrA gene and another construct may be used to inactivate a phrE gene. Of course, additional combinations are contemplated and provided by the present invention.
25 [0137] In some embodiments, the phrA and/or phrE gene is deleted in a precursor recombinant Bacillus subtilis strain in which one or more genes encoding an endogenous protease have been deleted. In some embodiments, the Bacillus sp. host cell comprises two or more inactivated protease genes. In some embodiments, the Bacillus host cell contains two inactivated protease genes (See e.g., U.S. Patent 5,387,521 ) while in other
30 embodiments, the Bacillus host cell contains 5 inactivated protease genes: nprE, aprE, epr, ispA, and bpr genes (See e.g., US20050202535). Since the sequence of the entire B. subtilis genome is publicly available and annotated (See e.g., Moszer, FEBS Lett., 430:28-36 [1998]), the proteases of B. subtilis have been identified and reviewed in detail (See e.g., He et ai., Res. Microbiol., 142:797-803 [1991 ]). In addition, gene disruption
35 methods for Bacillus cells are generally well known in the art (See e.g., Lee et ai., Appl.
Environ. Microbiol., 66: 476-480 [2000]; Ye et ai., Proc. Intematl. Symp. Rec. Adv. 33
Bioindustry, Seoul, Korea: The Korean Society for Applied
[0138] Microbiology, pp. 160-169 [1996]; Wu et al., J. Bacteriol., 173:4952-4958 [1991 ]; and Sloma et al., J. Bacteriol. ,173:6889-6895 [1991 ]). Thus, the construction of such strains is well within the ability of one of skill in the art. 5 [0139] As indicated above, in some embodiments, the modified Bacillus sp. host cell comprises an inactivated phr gene and an inactivated rap gene. In one embodiment, the modified Bacillus sp. cell comprises a single rap operon that contains an inactivated phr gene and an inactivated rap gene (e.g., a rapA operon containing an inactive phrA gene and an inactivated rapA gene, a rapC operon containing an inactive phrC gene and an
10 inactivated rapC gene; a rapE operon containing an inactive phrE gene and an inactivated rapE gene, a rapF operon containing an inactive phrF gene and an inactivated rapFgene, a rap/ operon containing an inactive phrl gene and an inactivated rapl gene, or a rapK operon containing an inactive phrK gene and an inactivated rapK gene). In other embodiments, the modified Bacillus sp. cell comprises at least two rap operons each
15 containing an inactivated phr gene and an inactivated rap gene. In some embodiments, inactivation results from the deletion of the entire endogenous DNA sequences that encode the Phr and the Rap proteins.
[0140] In some embodiments, the entire endogenous DNA sequence of the Bacillus subtilis phrA gene is deleted using the inactivating DNA deletion construct of SEQ ID
20 NO:17. In Bacillus subtilis 168, the DNA sequence that encodes the PhrA protein
MKSKWMSGLL LVAVGFSFTQ VMVHAGETAN TEGKTFH IAA RNQT; SEQ ID NO:42 (Swiss-Prot:Q00829) is:
[0141 ] atgaaatctaaatggatgtcaggtttgttgctcgttgcggtcgggttcagctttactcaggtgatggttcatgcaggtga aacagcaaacacagaagggaaaacatttcatattgcggcacgcaatcaaaca; SEQ ID NO:41
25 (NP_389126).
[0142] Alternatively, inactivation of the phrA gene results from the deletion of a fragment of the phrA gene that prevents the functional expression of the PhrA protein. The phrA gene is located at about 1316305-1316439 bp of the B. subtilis 168 chromosome (Accession no.NC_000964). According to one embodiment, inactivation of the phrA gene
30 is by insertion of a selectable marker that interrupts the phrA gene. Alternatively, inactivation of the phrA gene results from the inactivation of the rapA gene by introducing a selectable marker comprising a terminator sequence in the rapA gene thereby preventing the functional expression of the rapA and phrA protein According to one embodiment, inactivation of the rapA gene is by insertion of a selectable marker that
35 interrupts the rapA gene.
[0143] In one embodiment, the endogenous DNA sequence of the Bacillus subtilis rapA 34
gene is deleted using the inactivating DNA deletion construct of SEQ ID NO:52. In Bacillus subtilis 168, the DNA sequence that encodes the rapA protein:
[0144] MRMKQTIPSSYVGLKINEWYTHIRQFHVAEAERVKLEVEREIEDMEEDQDLLLYY SLMEFRHRVMLDYIKPFGEDTSQLEFSELLEDIEGNQYKLTGLLEYYFNFFRGMYEFKQK 5 MFVSAMMYYKRAEKNLALVSDDIEKAEFAFKMAEIFYNLKQTYVSMSYAVQALETYQMY
ETYTVRRIQCEFVIAGNYDDMQYPERALPHLELALDLAKKEGNPRLISSALYNLGNCYEK MGELQKAAEYFGKSVSICKSEKFDNLPHSIYSLTQVLYKQKNDAEAQKKYREGLEIARQY SDELFVELFQFLHALYGKNIDTESVSHTFQFLEEHMLYPYIEELAHDAAQFYIENGQPEKA
LSFYEKMVHAQKQIQRGDCLYEI; SEQ ID NO:54 (Swiss-Prot: Q00828) is
10 [0145] ttgaggatgaagcagacgattccgtcctcttatgtcgggcttaaaattaatgaatggtatactcatatccggcagttcc acgtcgctgaagccgaacgggtcaagctcgaagtagaaagagaaattgaggatatggaagaagaccaagatttgctgctg tattattctttaatggagttcaggcaccgtgtcatgctggattacattaagccttttggagaggacacgtcgcagctagagttttca gaattgttagaagacatcgaagggaatcagtacaagctgacagggcttctcgaatattactttaatttttttcgaggaatgtatga atttaagcagaagatgtttgtcagtgccatgatgtattataaacgggcagaaaagaatcttgccctcgtctcggatgatattgag
15 aaagcagagtttgcttttaaaatggctgagattttttacaatttaaaacaaacctatgtttcgatgagctacgccgttcaggcatta gaaacataccaaatgtatgaaacgtacaccgtccgcagaatccaatgtgaattcgttattgcaggtaattatgatgatatgcag tatccagaaagagcattgccccacttagaactggctttagatcttgcaaagaaagaaggcaatccccgcctgatcagttctgc cctatataatctcggaaactgctatgagaaaatgggtgaactgcaaaaggcagccgaatactttgggaaatctgtttctatttgc aagtcggaaaagttcgataatcttccgcattctatctactctttaacacaagttctgtataaacaaaaaaatgacgccgaagcg
20 caaaaaaagtatcgtgaaggattggaaatcgcccgtcaatacagtgatgaattatttgtggagctttttcaatttttacatgcgtta tacggaaaaaacattgacacagaatcagtctcacacacctttcaatttcttgaagaacatatgctgtatccttatattgaagagct ggcgcatgatgctgcccaattctatatagaaaacggacagcccgaaaaagcactttcattttatgagaaaatggtgcacgca caaaaacaaatccagagaggagattgtttatatgaaatc; SEQ ID NO:53 (NP 389125). [0146] In certain embodiments, the modified Bacillus sp. cell comprising the rap operon
25 containing the inactive phr gene may contain an active or inactive rap gene. If the rap gene is active, it may have a wild-type sequence (e.g., may be endogenous to the cell) or may be modified such that it is functionally equivalent to the wild type protein of the same species.
[0147] In some embodiments, the modified Bacillus sp. host cell comprises an inactivated
30 rap gene. In one embodiment, the modified Bacillus sp. cell comprises a single rap operon that contains an inactivated rap gene (e.g., a rapA operon containing an inactive an inactivated rapA gene, a rapB operon containing an inactive an inactivated rapB gene, a rapC operon containing an inactivated rapC gene, a rapD operon containing an inactive an inactivated rapD gene, a rapE operon containing an inactivated rapE gene, a rapF
35 operon containing an inactivated rapFgene, a rapG operon containing an inactivated rapG, a rap/ operon containing an inactivated rapl gene, a rapJ operon containing an 35
inactivated rapJ gene,, or a rapK operon containing an inactivated rapKgene). In other embodiments, the modified Bacillus sp. cell comprises at least two rap operons each containing an inactivated rap gene. In some embodiments, inactivation results from the deletion of the entire endogenous DNA sequences that encode the Rap proteins. 5 [0148] The modified Bacillus sp. cell is derived from a precursor host cell of a Bacillus sp. strain including Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis strains. In some
10 embodiments, the modified Bacillus sp. cell is derived from an alkalophilic Bacillus sp. cell. Numerous alkalophilic Bacillus sp. are known (See e.g., U.S. Pat. No. 5,217,878; and Aunstrup et al., Proc IV IFS: Ferment. Technol. Today, 299-305 [1972]). In some particular embodiments, the Bacillus sp. precursor host cell is an industrial Bacillus sp. host cell. Examples of industrial Bacillus sp. host cells include, but are not limited to
15 Bacillus licheniformis, Bacillus lentus, Bacillus subtilis, and Bacillus amyloliquefaciens host cells. In additional embodiments, the Bacillus sp. host cell is selected from the group consisting of Bacillus lentus, Bacillus brevis, Bacillus stearothermophilus, Bacillus alkalophilus, Bacillus coagulans, Bacillus circulans, Bacillus pumilus, Bacillus thuringiensis, Bacillus clausii, and Bacillus megaterium, as well as other organisms within
20 the genus Bacillus, as discussed above. In some particularly preferred embodiments,
Bacillus subtilis is used. For example, U.S. Pat. Nos. 5,264,366 and 4,760,025 (RE 34,606) describe various Bacillus host strains that find use in the present invention, although other suitable strains (e.g., industrial strains) are contemplated for use in the present invention.
25 [0149] An industrial strain may be a non-recombinant strain of a Bacillus sp., a mutant of a naturally occurring strain, or a recombinant strain. Preferably, the host strain is a recombinant host strain wherein a recombinant polynucleotide encoding a polypeptide of interest has been introduced into the host. In some embodiments, the polypeptide of interest is an enzyme (e.g., a protease). A further preferred host strain is a Bacillus
30 subtilis host strain, and in particular a recombinant Bacillus subtilis host strain. Numerous
Bacillus subtilis strains are known, including but not limited to 1 A6 (ATCC 39085), 168 (1 A01 ), SB19, W23, Ts85, B637, PB1753 through PB1758, PB3360, JH642, 1 A243 (ATCC 39,087), ATCC 21332, ATCC 6051 , MM 13, DE100 (ATCC 39,094), GX4931 , PBT 1 10, and PEP 21 1 strain {See e.g., Hoch et al, Genetics, 73:215-228 [1973]; U.S. Pat.
35 No. 4,450,235; U.S. Pat. No. 4,302,544; and EP 0134048). The use of B. subtilis as an expression host is further described by Palva et al. and others (See, Palva et al., Gene 36
19:81 -87 [1982]; See also, Fahnestock and Fischer, J. Bacteriol., 165:796-804 [1986]; and Wang et al., Gene 69:39-47 [1988]).
[0150] Industrial protease producing Bacillus sp. hostceWs provide particularly preferred host cells. In some preferred embodiments, use of these host cells in the present 5 invention enhances protease production. Two general types of proteases are typically secreted by Bacillus sp., namely neutral (or "metalloproteases") and alkaline (or "serine") proteases. Serine proteases are enzymes which catalyze the hydrolysis of peptide bonds in which there is an essential serine residue at the active site. Serine proteases have molecular weights in the 25,000 to 30,000 range (See, Priest, Bacteriol. Rev., 41 :71 1 -753
10 [1977]). Subtilisin is a preferred serine protease that is produced by the modified Bacillus sp. host cells of the present invention. A wide variety of Bacillus subtilisins have been identified and sequenced, for example, GG36, subtilisin 168, subtilisin BPN', subtilisin Carlsberg, subtilisin DY, subtilisin 147 and subtilisin 309 (See e.g., EP 414279 B; WO 89/06279; and Stahl et al., J. Bacteriol., 159:81 1 -818 [1984]). In some embodiments of
15 the present invention, the Bacillus host strains produce mutant (e.g., variant) proteases.
Numerous references provide examples of variant proteases and reference (See e.g., WO 99/20770; WO 99/20726; WO 99/20769; WO 89/06279; RE 34,606; U.S. Pat. No. 4,914,031 ; U.S. Pat. No. 4,980,288; U.S. Pat. No. 5,208,158; U.S. Pat. No. 5,310,675; U.S. Pat. No. 5,336,61 1 ; U.S. Pat. No. 5,399,283; U.S. Pat. No. 5,441 ,882; U.S. Pat. No.
20 5,482,849; U.S. Pat. No. 5,631 ,217; U.S. Pat. No. 5,665,587; U.S. Pat. No. 5,700,676;
U.S. Pat. No. 5,741 ,694; U.S. Pat. No. 5,858,757; U.S. Pat. No. 5,880,080; U.S. Pat. No. 6,197,567; and U.S. Pat. No. 6,218,165).
[0151] In another embodiment, a preferred Bacillus sp. host is a Bacillus sp. that includes a mutation or deletion in at least one of the following genes, degU, degS, degR and degQ.
25 Preferably the mutation is in a degU geue, and more preferably the mutation is degU(Hy)32 (See e.g., Msadek et al., J. Bacteriol., 172:824-834 [1990]; and Olmos et al., MoI. Gen. Genet., 253:562-567 [1997]). In one embodiment, the host cell is a Bacillus subtilis host cell that carries a degU32(Hy) mutation. In a further embodiment, the Bacillus sp. host cell comprises a mutation or deletion in scoC4, (See e.g., Caldwell et al.,
30 J. Bacteriol., 183:7329-7340 [2001 ]); spollE (See e.g., Arigoni et al., MoI. Microbiol.,
31 :1407-1415 [1999]); oppA or other genes of the opp operon (See e.g., Perego et al., MoI. Microbiol., 5:173-185 [1991 ]). Indeed, it is contemplated that any mutation in the opp operon that causes the same phenotype as a mutation in the oppA gene will find use in some embodiments of the modified Bacillus sp. cell of the present invention. In some
35 embodiments, these mutations occur alone, while in other embodiments, combinations of mutations are present. In some embodiments, a modified Bacillus sp. cell of the invention 37
is derived from a Bacillus sp. host cell that already includes a mutation to one or more of the above-mentioned genes. In alternate embodiments, a modified Bacillus sp. cell of the invention is further engineered to include mutation of one or more of the above-mentioned genes. 5 Proteins of Interest
[0152] The invention provides modified Bacillus sp. cells that are used to produce proteins of interest at a level that is greater than that produced by the unmodified precursor host cells. Generally, proteins of interest are desirable proteins that have commercial significance. The protein of interest may be either homologous or
10 heterologous to the host. In some embodiments, the protein of interest is a secreted polypeptide, particularly an enzyme, including but not limited to amylolytic enzymes, proteolytic enzymes, cellulytic enzymes, oxidoreductase enzymes and plant wall degrading enzymes. In further embodiments, these enzyme include, but are not limited to amylases, proteases, xylanases, lipases, laccases, phenol oxidases, oxidases, cutinases,
15 cellulases, hemicellulases, esterases, peroxidases, catalases, glucose oxidases, phytases, pectinases, glucosidases, isomerases, transferases, galactosidases and chitinases. In still further embodiments, the expressed polypeptide is a hormone, cytokine, growth factor, receptor, vaccine, antibody, or the like. While it is not intended that the present invention be limited to any particular protein/polypeptide, in some most
20 preferred embodiments, the expressed protein of interest is a protease.
[0153] As noted above, in certain embodiments the host cell contains a recombinant expression cassette that comprises a polynucleotide sequence encoding a protein of interest (i.e., an expression cassette for production of a protein that is not native to the host cell). In some embodiments, the host cell comprises a recombinant nucleic acid
25 comprising an expression cassette (i.e., a promoter, a polynucleotide encoding the protein of interest, and a transcriptional terminator), wherein the expression cassette is sufficient for the production of the protein by the Bacillus sp. host cell. In some embodiments, the recombinant nucleic acid is integrated into the genome of the host cell, while in other embodiments, the recombinant nucleic acid is present in a vector that replicates
30 autonomously from the genome. In some embodiments, the polynucleotide encoding the protein of interest is codon optimized for expression of the protein in the Bacillus sp. host cell. While any promoter may be employed in a subject expression cassette, promoters that are regulated by the rap/phr systems (e.g., the aprE and nprE promoters) may be employed in some embodiments.
35 [0154] In one embodiment, the protein of interest may be, for example, an enzyme (e.g., a so-called "industrial enzyme"), or a protein having therapeutic activity such an antibody. 38
In one particular embodiment, the protein of interest is a subtilisin, where the term "subtilisin" refers to a serine endopeptidase of the S8 family of peptidases. Subtilisin protein has an activity described as EC 3.4.21 .62 (previously EC 3.4.4.16), according to IUMBM enzyme nomenclature. The activity of exemplary subtilisin proteins is generally 5 described in Philipp et al, (MoI. Cell. Biochem. 1983 51 : 5-32), Siezen (Protein ScL, 1997
6:501 -523); Bryan (Biochim. Biophys. Acta, 2000 1543:203-222); Maurer, 2004 Curr. Op, Biotechnol., 2004 15:330-334); and Gupta, Appl. Microbiol. Biotechnol., 2002 59:15-32). [0155] In some embodiments, a subtilisin has an amino acid sequence that is found in a wild-type genome (i.e., the subtilisin is a naturally-occurring subtilisin), while in other
10 embodiments, the subtilisin is a variant of a naturally-occurring subtilisin. In some embodiments, the variant subtilisin comprises an amino acid sequence that is at least about 80%, at least about 90%, at least about 95% or at least about 98% identical to a subtilisin encoded by a wild-type genome. Exemplary subtilisins include, but are not limited to: ALCANASE® (Novozymes), FNA™ (Genencor), SAVINASE® (Novozymes)
15 PURAFECT™ (Genencor), KAP™ (Kao), EVERLASE™ (Novozymes), PURAFECT
OxP™ (Genencor), FN4™ (Genencor), BLAP S™ (Henkel), BLAP X™ (Henkel), ESPERASE® (Novozymes), KANNASE™ (Novozymes) and PROPERASE™ (Genencor). In yet additional embodiments, the subtilisin includes, but is not limited to subtilisin BPN', subtilisin Carlsberg, subtilisin DY, subtilisin 147, or subtilisin 309 (See e.g., WO89/06279;
20 and Stahl et al., J. Bacteriol., 159:81 1-818 [1984]). Additional subtilisins and other proteases that find use in the present invention include but are not limited to those described in WO 99/20770; WO 99/20726; WO 99/20769; WO 89/06279; RE 34,606; U.S. Patent No. 4,914,031 ; U.S. Patent No. 4,980,288; U.S. Patent No. 5,208,158; U.S. Patent No. 5,310,675; U.S. Patent No. 5,336,61 1 ; U.S. Patent No. 5,399,283; U.S. Patent No.
25 5,441 ,882; U.S. Patent No. 5,482,849; U.S. Patent No. 5,631 ,217; U.S. Patent No.
5,665,587; U.S. Patent No. 5,700,676; U.S. Patent No. 5,741 ,694; U.S. Patent No. 5,858,757; U.S. Patent No. 5,880,080; U.S. Patent No. 6,197,567; and U.S. Patent No. 6,218,165. [0156] In some embodiments, the expression of the protein of interest in a host cell is
30 driven by the aprE promoter of the aprE gene from which the B. subtilis subtilisin is naturally transcribed. The aprE gene is transcribed by sigma A (σA) factor and its expression is highly controlled by several regulators, such as: DegU/DegS, AbrB, Hpr and SinR (VaIIe and Ferrari (1989) In: Smith I, Slepecky RA, Setlow P (eds) Regulation of Procaryotic Development. American Society for Microbiology. Washington, DC pp 131 -
35 146), and aprE Sigma A promoter has been identified tgggtcttgacaaatattattccatctattacaataaattcacaga (SEQ ID NO:38; US 20030148461 ; 39
Helman et al., 1995, Nucleic Acid Research, Vol. 24, pp. 2351 -2360). In some embodiments, the host cell comprises an aprE promoter that is the wild-type aprE promoter tgggtctactaaaatattattccatctattacaataaattcacaga (SEQ ID NO:39; U.S. Patent Application Publication No. 20030148461 ). 5 [0157] In other embodiments, the expression of a protein of interest by a host cell is driven by mutant of the B. subtilis aprE promoters. In some embodiments, the invention provides for a Bacillus host cell that contains a mutant aprE promoter operably linked to a polynucleotide sequence that encodes a protein of interest. Thus, the invention encompasses host cells that express a protein of interest from a mutant aprE promoter.
10 An example of a mutant aprE promoter is the mutant aprE promoter having the sequence:
[0158] tgggtc ttgaca aatattattccatctat tacaat aaattcacaga (SEQ ID NO:40), [0159] which is described in U.S. Patent Application Publication No. 20030148461 . Any one of the proteins of interest recited herein (e.g., Bacillus subtilisins) can be transcribed from an aprE promoter. In some embodiments, the invention provides for a modified
15 Bacillus host cell that is capable of expressing a protein of interest from an aprE promoter.
In some embodiments, the modified host cell is a modified B. subtilis host cell capable of expressing a protease driven by an aprE promoter. In some embodiments, the aprE promoter includes the aprE promoter regulatory elements and/or the aprE transcriptional leader, while in other embodiments, the aprE promoter does not include the aprE
20 promoter regulatory elements and/or the aprE transcriptional leader.
[0160] In addition to the aprE promoter, the invention also encompasses compositions and methods for expressing a protein of interest by a host cell, wherein the expression is driven by any promoter suitable for driving the transcription of the gene of interest as long as the promoter comprises the transcriptional leader sequence of the aprE gene. Other
25 suitable promoters and terminators for use in Bacillus host cells are known and include: the promoters and terminators of npr (neutral protease; i.e., NprE promoter), amy (α- amylase) and α-lactamase genes, as well as the B. subtilis levansucrase gene (sacB), B. licheniformis alpha-amylase gene (amyL), B. stearothermophilus maltogenic amylase gene (amyM), B. amyloliquefaciens alpha-amylase gene (amyQ), B. licheniformis
30 penicillinase gene (penP), B. subtilis xylA and xylB genes, the promoters and terminators described in WO 93/10249, WO 98/07846, and WO 99/43835.
[0161] In other embodiments, the modified host cell may produce a protein of interest that is a recombinant carbohydrase, such as a liquefying and saccharifying α-amylase, an alkaline α-amylase, a α-amylase, a cellulase; a dextranase, an α-glucosidase, an α-
35 galactosidase, a glucoamylase, a hemicellulase, a pentosanase, a xylanase, an invertase, a lactase, a naringanase, a pectinase or a pullulanase; a protease such as an acid 40
protease, an alkali protease, bromelain, ficin, a neutral protease, papain, pepsin, a peptidase, rennet, rennin, chymosin, thermolysin, an aspartic proteinase, or trypsin; a lipase or esterase, such as a triglyceridase, a phospholipase, a pregastric esterase, a phosphatase, a phytase, an amidase, an iminoacylase, a glutaminase, a lysozyme, or a 5 penicillin acylase; an isomerase such as glucose isomerase; an oxidoreductases (e.g., an amino acid oxidase), a catalase, a chloroperoxidase, a glucose oxidase, a hydroxysteroid dehydrogenase or a peroxidase; a lyase such as a acetolactate decarboxylase, a aspartic β-decarboxylase, a fumarase or a histadase; a transferase such as cyclodextrin glycosyltranferase; or a ligase, for example. In particular embodiments, the protein may
10 be an aminopeptidase, a carboxypeptidase, a chitinase, a cutinase, a deoxyribonuclease, an α-galactosidase, a β-galactosidase, a β-glucosidase, a laccase, a mannosidase, a mutanase, a pectinolytic enzyme, a polyphenoloxidase, ribonuclease or transglutaminase. [0162] In particular embodiments, the protein may be a therapeutic protein. Examples of suitable target therapeutic proteins which may be produced using a subject cell include:
15 erythropoietin, cytokines such as interferon-α, interferon-β, interferon^, interferon-o, and granulocyte-CSF, GM-CSF, coagulation factors such as factor VIII, factor IX, and human protein C, antithrombin III, thrombin, soluble IgE receptor α-chain, IgG, IgG fragments, IgG fusions, IgM, IgA, interleukins, urokinase, chymase, and urea trypsin resume inhibitor, IGF-binding protein, epidermal growth factor, growth hormone- releasing factor, annexin V
20 fusion protein, angiostatin, vascular endothelial growth factor-2, myeloid progenitor inhibitory factor-1 , osteoprotegerin, α-1 -antitrypsin, α-feto proteins, DNase II, kringle 3 of human plasminogen, glucocerebrosidase, TNF binding protein 1 , follicle stimulating hormone, cytotoxic T lymphocyte associated antigen 4-lg, transmembrane activator and calcium modulator and cyclophilin ligand, soluble TNF receptor Fc fusion, glucagon like
25 protein 1 and IL-2 receptor agonist. Monoclonal antibodies may also be made.
[0163] In certain embodiments, the cell may be engineered so that the protein produced by the cell may be secreted from the cell into culture media. As such, the cell may further contain a recombinant nucleic acid encoding a fusion polypeptide containing a signal sequence, a protease cleavage site and the protein. In some embodiments, the signal
30 sequence may be one that is naturally associated with the polypeptide to be expressed.
The signal sequence may be any sequence of amino acids that is capable of directing the fusion protein into the secretory pathway of the Bacillus host cell. In certain cases, signal sequences that may be employed include the signal sequences of proteins that are secreted from wild-type Bacillus cells. Such signal sequences include the signal
35 sequences encoded by α-amylase, protease (e.g., aprE or subtilisin E), or β-lactamase genes. Exemplary signal sequences include, but are not limited to, the signal sequences 41
encoded by an α-amylase gene, a subtilisin gene, a β-lactamase gene, a neutral protease gene (e.g., nprT, nprS, nprM), or a prsA gene from any suitable Bacillus species, including, but not limited to Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus 5 lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megateήum, Bacillus pumilus,
Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis. In one embodiment, the signal sequence is encoded by the aprE gene of B. subtilis (as described in Appl. Microbiol. Biotechnol. 2003 62:369-73). Further signal peptides are described by Simonen and Palva (Microbiological Reviews 1993 57: 109-137), and other
10 references.
[0164] The invention also provides methods for producing a protein of interest in a modified Bacillus sp. host cell, which comprises at least one inactivated phr gene (e.g., an inactivated phrA and/or a phrE gene), or an inactivated phr and an inactivated rap gene by culturing a modified cell that is capable of producing a protein of interest and
15 growing the cell under suitable growth conditions for expressing the protein of interest.
The methods provide for the production of any one protein of interest described above. In preferred embodiments, the protein of interest produced by the method of the invention is a protease (e.g., a subtilisin). Production of a protein of interest by a modified Bacillus sp. cell is greater than that obtained from a corresponding unmodified precursor host cell.
20 In some embodiments, the improved level of protease production by a modified Bacillus sp. cell is further enhanced in the modified cell by overexpressing ymati, as described below.
[0165] Modified Bacillus sp. Host Cells That Overexpress YmaH [0166] In the embodiments described above, the modified Bacillus sp. cells, which
25 comprise at least one inactivated phr gene and/or an inactivated rap gene, have an enhanced capacity to produce a protein of interest at a level that is greater than that reached by an unmodified precursor cell. In further embodiments described below, the enhanced level of production of a protein of interest by the modified Bacillus sp. cells is further increased by altering the modified cell to overexpress the RNA-binding protein
30 ymati. Thus, in one embodiment, the invention provides for a modified Bacillus sp. cell that comprises at least one inactivated phr gene (e.g., an inactivated phrA and/or phrE gene), a polynucleotide that encodes a protein of interest (e.g., a protease), and a heterologous polynucleotide that encodes a YmaH protein. In another embodiment, the modified Bacillus sp. cell comprises at least one inactivated phr gene (e.g. an inactivated
35 phrA and/or phrE gene), and/or an inactivated rap gene, a polynucleotide that encodes a protein of interest (e.g., a protease), and a heterologous polynucleotide that encodes a 42
YmaH protein.
[0167] In some embodiments, the modified Bacillus sp. cell comprises a polynucleotide expression construct comprising a YmaH promoter that is operably linked to a polynucleotide sequence that encodes a YmaH protein. The Bacillus subtilis YmaH, also 5 known as HFQ BACSU is an RNA-binding protein, is a member of the Hfq family of RNA- binding proteins (Sauter et ai, Nucleic Acid Res 31 :4091 -4098, [2003]). The YmaH protein is encoded in Bacillus subtilis by the ymaH gene, which is an ortholog of the hfq gene of E. coli. (Silvaggi et al ., J Bacteriol. 187(19): 6641-6650, [2005]). YmaH is an abundant and ubiquitous RNA-binding protein that functions as a pleiotrophic regulator of
10 RNA metabolism in prokaryotes, and is required for stabilization of some transcripts and degradation of others. YmaH binds preferentially to unstructured A/U-rich RNA sequences and is similar to the eukaryotic Sm proteins in both sequence and structure. YmaH is also known to bind small RNA molecules called ri bo regulators that modulate the stability or translation efficiency of RNA transcripts.
15 [0168] The naturally-occurring YmaH protein from Bacillus subtilis is a 73 amino acid protein:
[0169] MKPINIQDQFLNQIRKENTYVTVFLLNGFQLRGQVKGFDNFTVLLESEGKQQLIYK HAISTFAPQKNVQLELE (Swiss-Prot:P3756; SEQ ID NO:45) [0170] that is encoded by a 219 (222 including the stop codon) base pair polynucleotide
20 (EMBL Primary Accession Number Z991 13; SEQ ID NO:46).
[0171] Thus, in some embodiments, the modified Bacillus sp. cell of the invention further comprises a heterologous polynucleotide sequence that encodes ymaH. In one embodiment, the ymaH protein is encoded by the naturally-occurring polynucleotide sequence found in the genome of the wild-type Bacillus subtilis strain 168 (SEQ ID
25 NO:45). In some embodiments, the modified Bacillus sp. cell of the invention comprises a heterologous polynucleotide sequence that encodes variants of the naturally occurring ymaH. Variant YamH proteins include proteins derived from the wild-type protein by deletion (i.e., truncation), addition, or substitution of one or more amino acids at one or more sites in the native protein. Methods for such deletions, additions and substitutions
30 are generally known in the art. For example, amino acid sequence variants of the polypeptide can be prepared by mutations in the cloned DNA sequence encoding the native protein of interest. Methods for mutagenesis and nucleotide sequence alterations are well known in the art (See e.g., Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488 492; Kunkel et al. (1987) Methods Enzymol. 154:367 382; U.S. Pat. No. 4,873,192; and the
35 references cited therein; herein incorporated by reference. In constructing variants of the proteins of interest, modifications to the nucleotide sequences encoding the variants will 43
be made such that variants continue to possess the desired activity. As will be understood by the skilled artisan, due to the degeneracy of the genetic code, a variety of modified polynucleotides encode a YmaH protein. In some other embodiments of the present invention, the Bacillus sp. cell comprises a polynucleotide encoding a YmaH 5 protein comprising a nucleotide sequence having at least about 70% sequence identity, at least about 75% sequence identity, at least about 80% sequence identity, at least about 85% sequence identity, at least about 90% sequence identity, at least about 92% sequence identity, at least about 95% sequence identity, at least about 97% sequence identity, at least about 98% sequence identity, or at least about 99% sequence identity to
10 the polynucleotide sequence of SEQ ID NO:46.
[0172] In other embodiments, the modified Bacillus sp. cell comprises polynucleotide constructs that comprise ymaH coding sequences that are analogous to the ymaH coding sequence of Bacillus subtilis strain 168. The genome of this strain, which is contained in one 4215 kb genome, has been well-characterized (See, Kunst et ai, Nature 390:249-
15 256 [1997]; and Henner et ai, Microbiol. Rev, 44:57-82 [1980]). In some embodiments, the YmaH-encoding polynucleotide constructs encode a YmaH protein that shares at least about 65% amino acid sequence identity, at least about 70% amino acid sequence identity, at least about 75% amino acid sequence identity, at least about 80% amino acid sequence identity, at least about 85% amino acid sequence identity, at least about 90%
20 amino acid sequence identity, at least about 92% amino acid sequence identity, at least about 95% amino acid sequence identity, at least about 97% amino acid sequence identity, at least about 98% amino acid sequence identity, and at least about 99% amino acid sequence identity with the amino acid sequence of the wild-type form of the YmaH protein and that has comparable or improved ability to enhance the production of a protein
25 of interest in a host cell when compared to the wild-type polypeptide (SEQ ID NO:45), and that retains the ability to enhance the expression of a protein of interest in a Bacillus sp. {e.g., Bacillus subtilis) host cell. In yet other embodiments, the modified Bacillus sp. cell comprises YmaH-encoding polynucleotide constructs comprising polynucleotide sequences that are homologous, orthologous or paralogous to genes of the wild-type
30 Bacillus sequence of SEQ ID NO:46 and that retain the ability to enhance the production of a protein of interest.
[0173] In other embodiments, the modified Bacillus sp. cell of the invention also encompasses polynucleotide constructs that comprise coding sequences encoding YmaH proteins that are related by being structurally and/or functionally similar. In some
35 embodiments, these proteins are derived from a different genus and/or species, including differences between classes of organisms (e.g., a bacterial protein and a fungal protein). 44
In some embodiments, these proteins are derived from a different genus and/or species. In additional embodiments, related proteins are provided from the same species. Indeed, it is not intended that the present invention be limited to related proteins from any particular source(s). In addition, the term "related proteins" encompasses tertiary 5 structural homologs and primary sequence homologs (e.g., the YmaH of the present invention). For example, the present invention encompasses such homologues including but not limited to such YmaH proteins as the YmaH of E. coli, (HFQ ECOLI), Shighella flexneri (HFQ SHIFL), Salmonella typhimurium (HFQ SALTY), Yersinia enterocolitica (HFQ_YEREN), Yersinia pestis (HFQ_YERPE), Erwinia carotovora (H FQ_E RWCA),
10 Haemophilus influenzae (HFQ_HAEIN), Pasteurella multocida (HFQ_PASMU), Vibrio cholerae (HFQ VIBCH), Pseudomonas aeruginosa (HFQ PSEAE), Xanthomonas axonopodis (HFQ XANAC), Xanthomonas campestris (HFQ XANCP), XyIeIIa fastidiosa (GSQ XYLFA), Neisseria meningitidis (HFQ NEIMA), Ralstonia solanacearum (HFQ RALSO), Agrobacterium tumefaciens (HFQ AGRTS), Brucella melitensis
15 (HFQ_BRUME), Rhizobium loti (HFQ_RHILO), Azorhizobium caulinodans
(HFQ_AZOCA), Caulobacter crescentus (HFQ_CAUCR), Aquifex melitensis (HFQ AQU AE), Thermotoga maritime (HFQ THEMA), Clostridium acetobutylicum (HFQ_CLOAB), Clostridium perfringens (HFQ_CLOPE), Bacillus halodurans (HFQ BACHD), Bacillus subtilis (HFQ BACSU), Thermoanaerobacter tengcongensis
20 (HFQJTHETN), S. aureaus (Q99UG9), and M. jannasci (Q58830) (Sauter et al, Nucleic
Acids Res. 31 :4091 -4098 [2003]).
[0174] Related (and derivative) proteins comprise variant YmaH proteins. In some preferred embodiments, variant proteins differ from a parent protein and one another by a small number of amino acid residues. The number of differing amino acid residues may
25 be one or more, preferably about 1 , 2, 3, 4, 5, 10, 15, 20, 30, 40, 50, or more amino acid residues. In some preferred embodiments, the number of different amino acids between variants is between about 1 and about 10. In some particularly preferred embodiments, related proteins and particularly variant proteins comprise at least about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 97%, about 98%, or about 99%
30 amino acid sequence identity. Several methods are known in the art that are suitable for generating variants of the YmaH proteins of the present invention, including but not limited to site-saturation mutagenesis, scanning mutagenesis, insertional mutagenesis, random mutagenesis, site-directed mutagenesis, and directed-evolution, as well as various other recombinatorial approaches.
35 [0175] Characterization of wild-type and mutant proteins is accomplished via any means suitable and is preferably based on the assessment of properties of interest. For 45
example, it is contemplated that YmaH proteins that are capable of further enhancing the production of a protein of interest by a modified Bacillus sp. cell will find use. [0176] Overexpression of ymaH in the modified Bacillus sp. cell of the invention can be achieved by various means including enhancing the transcription and/or translation of the 5 YmaH encoding polynucleotide. For example, at the transcriptional level, overexpression of ymaH can be achieved by increasing the number of polynucleotide sequences that encode ymaH in a host cell, and/or by increasing the binding strength of a ymaH promoter to enhance the activity of the cognate RNA polymerase. At the translational level, overexpression of ymaH can be achieved by enhancing the translational activity by
10 mutating the ribosome binding site (RBS) to increase the affinity of ribosomes for the
RBS. One skilled in the art will recognize that overexpression of ymaH can be effected by increasing the number of copies of the ymaH gene alone or in combination with other possible modifications made to the ymaH gene to achieve the overexpression of YmaH. [0177] In one embodiment, the modified Bacillus sp. cells of the invention comprise a
15 polynucleotide construct that comprises a polynucleotide sequence encoding ymaH operably linked to a ymaH promoter. The transcription of ymaH may be naturally driven by two promoters: a SigA promoter that is present upstream of miaA coding region, and the SigH promoter that is immediately upstream of the ymaH coding region in the miaA operon of B. subtilis. A ymaH promoter can be any promoter that drives the expression of
20 yamH (e.g., a SigA and/or a SigH promoter), and may be any nucleic acid sequence which shows transcriptional activity in the host cell of choice and includes mutant, truncated and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell. The promoter sequence may be native or foreign to the host cell.
25 [0178] In one embodiment, the modified Bacillus sp. cells of the invention comprise a polynucleotide construct that comprises a polynucleotide sequence that encodes YmaH operably linked to a SigH promoter (e.g., SEQ ID NO:23, as shown below). SEQ ID NO:23 also exemplifies a polynucleotide construct that comprises a YmaH coding sequence that is naturally contiguous with a SigH promoter:
30 [0179] ggcaccgaattcgacgtggtttcgcaacaaaatgcaggtcacatggttcgatatgacaccgcctgttgatatggag ctgaaaaaaaaggaaattttcacacatatagcaggaaaactcgaactttaatcgaaactgtatgatatagagaatcaagga ggacgaaacatgaaaccgattaatattcaggatcagtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgct gaacggctttcagttgcggggccaggtgaaaggctttgataactttaccgtattgttggaatcggaaggtaagcagcagcttat atataaacatgcgatctcaacgtttgcgccgcaaaaaaacgtccagcttgaactcgaatagatcaaaaaatgccatgtcaag
35 acatgaggaaaggctgtcgggggttcccggcggccatttttaacatgaatccacttttgctccaagctttttgtgtaagctgacca tgccaaggcacggtctttttttatgagggatccggagcc (SEQ ID NO:23). 46
[0180] In another embodiment, the modified Bacillus sp. cells of the invention comprise a polynucleotide construct that comprises a polynucleotide sequence that encodes YmaH operably linked to a SigA promoter {e.g., SEQ ID NO:26 (SigA1 ) and SEQ ID NO:31 (SigA2 construct)). SEQ ID NOs:26 and 31 exemplify embodiments wherein the ymaH 5 coding sequence is contiguous with a SigA promoter sequence to provide a chimeric polynucleotide construct. In some preferred embodiments, chimeric polynucleotide constructs thus comprise a promoter sequence that in nature is not contiguous with the ymaH coding sequence. For example, SEQ ID NOS:26 and 31 exemplify chimeric constructs that comprise a SigA promoter that is operably linked to a polynucleotide
10 sequence encoding YmaH, as shown below:
[0181 ] gcgccgaattctcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaag gatgtttaccgatgcaaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaaga gagtgatacatattgaataatacgaagcagccccacacatatagcaggaaaactcgaactttaatcgaaactgtatgatata gagaatcaaggaggacgaaacatgaaaccgattaatattcaggatcagtttttgaatcaaatccggaaagaaaatacgtat
15 gtcactgtttttttgctgaacggctttcagttgcggggccaggtgaaaggctttgataactttaccgtattgttggaatcggaaggta agcagcagcttatatataaacatgcgatctcaacgtttgcgccgcaaaaaaacgtccagcttgaactcgaatagatcaaaaa atgccatgtcaagacatgaggaaaggctgtcgggggttcccggcggccatttttaacatgaatccacttttgctccaagctttttg tgtaagctgaccatgccaaggcacggtctttttttatgagggatccggtgcc (SEQ ID NO:26) [0182] gcgccgaattctcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaag
20 gatgtttaccgatgcaaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaaga gagtgctcgaactttaatcgaaactgtatgatatagagaatcaaggaggacgaaacatgaaaccgattaatattcaggatca gtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgctgaacggctttcagttgcggggccaggtgaaaggctt tgataactttaccgtattgttggaatcggaaggtaagcagcagcttatatataaacatgcgatctcaacgtttgcgccgcaaaa aaacgtccagcttgaactcgaatagatcaaaaaatgccatgtcaagacatgaggaaaggctgtcgggggttcccggcggc
25 catttttaacatgaatccacttttgctccaagctttttgtgtaagctgaccatgccaaggcacggtctttttttatgagggatccggtg cc (SEQ ID NO:31 )
[0183] In yet another embodiment, the Bacillus sp. cells of invention comprise a polynucleotide construct that comprise a polynucleotide sequence that encodes YmaH and a SigA and a SigH promoter (e.g., SEQ ID NO: 22, as shown below).
30 [0184] tcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaaggatgtttaccgat gcaaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaagagagtgatacata ttgaataatacgaagcagcccgttgtcattttagtcggaccgacggcagtggggaaaaccaatttaagtattcagctagccaa atccttaaacgcggaaattatcagcggagattcgatgcagatttataaagggatggatattggaacagctaaaattaccgaac aggagatggagggagtgccccatcatctgattgacattttagatccccaagactctttctctactgccgattatcaaagcttagta
35 agaaataaaatcagcgagattgcaaatagaggaaagcttccgatgattgacggcggtacagggctttatatacaatctgagc tttacgattatacatttacggaagaggcaaatgatcccgtgtttcgagagagcatgcaaatggctgctgagcgggaaggcgct 47
gactttcttcatgccaaacttgctgcagcagatcccgaggcagcagctgcgattcatccgaataatacaagaagagtcattcg cgcactggaaattttacatacgtccggaaaaacgatgtcccagcatttgaaggaacaaaaacgagaacttctgtacaatgca gtgttaattggcctgacaatggatagagacacgctttacgaaagaattaatcagcgggtcgatttgatgatgcagtcaggcctt cttccggaagtgaaacgcttatacgacaagaacgtgagagactgtcaatcaatacaggcgataggctataaagagctgtat 5 gcatattttgacggttttgtgacactttccgatgctgtcgaacagctaaagcagaactcgaggcggtatgcgaaacgccagctg acgtggtttcgcaacaaaatgcaggtcacatggttcgatatgacaccgcctgttgatatggagctgaaaaaaaaggaaatttt cacacatatagcaggaaaactcgaactttaatcgaaactgtatgatatagagaatcaaggaggacgaaacatgaaaccga ttaatattcaggatcagtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgctgaacggctttcagttgcgggg ccaggtgaaaggctttgataactttaccgtattgttggaatcggaaggtaagcagcagcttatatataaacatgcgatctcaac
10 gtttgcgccgcaaaaaaacgtccagcttgaactcgaatagatcaaaaaatgccatgtcaagacatgaggaaaggctgtcg ggggttcccggcggccatttttaacatgaatccacttttgctccaagctttttgtgtaagctgaccatgccaaggcacggtcttttttt atgag (SEQ ID NO:22)
[0185] Examples of suitable promoters for directing the expression of the ymaH gene in are the SigA and the SigH promoters from the B. subtilis operon that encompasses the
15 gene encoding miaA. For example, in one embodiment, the invention provides a polynucleotide sequence defining a SigA promoter (SEQ ID NO:47, as shown below). [0186] tcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaaggatgtttaccgat gcaaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaagagagtgatacata (SEQ ID NO:47)
20 [0187] In another embodiment, the invention provides a polynucleotide sequence defining a SigH promoter (SEQ ID NO:48, as shown below).
[0188] aaaggaaattttcacacatatagcaggaaaactcgaactttaatcgaaactgtatgatatagagaatcaaggagg acgaaac (SEQ ID NO:48) [0189] Other examples of promoters that can be used for expressing the ymaH gene
25 include Sigma A promoters that are recognized by σAfactor including the promoter of the
Streptomyces coelicolor agarase gene (dagA), the promoter of the Bacillus lentus alkaline protease gene (aprH), the promoter of the Bacillus licheniformis alkaline protease gene (subtilisin Carlsberg gene), the promoter of the Bacillus subtilis levansucrase gene (sacB), the promoter of the Bacillus subtilis alpha-amylase gene (amyE), the promoter of the
30 Bacillus licheniformis alpha-amylase gene (amyL), the promoter of the Bacillus stearothermophilus maltogenic amylase gene (amyM), and the promoter of the Bacillus amtyloliquefacietis alpha-amylase gene (amyQ). Examples of promoters that can be used for expressing the ymaH gene include Sigma H promoters that are recognized by σH factors including spoOA, spoOF, spoVG and citG (See, Helmann, J. D. and C. P. Moran.
35 2002. RNA polymerase and sigma factors, pp289-312 In A. L. Sonenshein, J. A. Hoch and R. Losick (ed), Bacillus subtilis and its closest relatives: from genes to cells. 48
American Society for Microbiology, Washington, D. C).
[0190] In some embodiments, a consensus SigA and/or SigH promoter finds use in the present invention. The construction of a consensus promoter may be accomplished by site-directed mutagenesis to create a promoter which conforms more perfectly to the 5 established consensus sequences for the "-10" and "-35" regions of the "sigma A-type" promoters for Bacillus subtilis (Voskuil et al., MoI Microbiol 17: 271 279 [1995]). In other embodiments, a consensus promoter is created by site-directed mutagenesis to create a promoter which conforms more perfectly to the established consensus sequences for the "-10" and "-35" regions of the vegetative "sigma H-type" promoters for Bacillus subtilis
10 (See, Helman and Moran in Bacillus subtilis and its closest relatives, Ch.21 , pg 289-312;
Sonenshein et al (2002 ASM Press, Wshington, D. C.) The consensus sequence for the "- 35" region for the sigma A-type promoter is TTGaca and for the "-10" region is tgnTATaat, and the consensus sequence for the "-35" region for the sigma H-type promoter is RnAGGAwWW and for the "-10" region is RnnGAAT. Capital letters indicate highly
15 conserved positions; lower case letters indicate less conserved positions; abbreviation R can be A or G, and W can be A or T. The consensus promoter may be obtained from any promoter which can function in a Bacillus host cell.
[0191] In some embodiments, the SigA promoter, which encompasses SEQ ID NO:47 is defined by a polynucleotide sequence that is naturally present upstream of the miaA
20 coding sequence (NP_389615; SEQ ID NO:49, shown below), while the SigH promoter, which encompasses SEQ ID NO: 48, is defined by the polynucleotide sequence that is naturally present upstream of the yam/-/ coding region (SEQ ID NO:46, shown below). [0192] ttgaataatacgaagcagcccgttgtcattttagtcggaccgacggcagtggggaaaaccaatttaagtattcagct agccaaatccttaaacgcggaaattatcagcggagattcgatgcagatttataaagggatggatattggaacagctaaaatt
25 accgaacaggagatggagggagtgccccatcatctgattgacattttagatccccaagactctttctctactgccgattatcaa agcttagtaagaaataaaatcagcgagattgcaaatagaggaaagcttccgatgattgacggcggtacagggctttatatac aatctgagctttacgattatacatttacggaagaggcaaatgatcccgtgtttcgagagagcatgcaaatggctgctgagcgg gaaggcgctgactttcttcatgccaaacttgctgcagcagatcccgaggcagcagctgcgattcatccgaataatacaagaa gagtcattcgcgcactggaaattttacatacgtccggaaaaacgatgtcccagcatttgaaggaacaaaaacgagaacttct
30 gtacaatgcagtgttaattggcctgacaatggatagagacacgctttacgaaagaattaatcagcgggtcgatttgatgatgca gtcaggccttcttccggaagtgaaacgcttatacgacaagaacgtgagagactgtcaatcaatacaggcgataggctataa agagctgtatgcatattttgacggttttgtgacactttccgatgctgtcgaacagctaaagcagaactcgaggcggtatgcgaa acgccagctgacgtggtttcgcaacaaaatgcaggtcacatggttcgatatgacaccgcctgttgatatggagctgaaaaaa aaggaaattttcacacatatagcaggaaaactcgaactttaa (SEQ ID NO:49)
35
[0193] atgaaaccgattaatattcaggatcagtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgctgaa 49
cggctttcagttgcggggccaggtgaaaggctttgataactttaccgtattgttggaatcggaaggtaagcagcagcttatatat aaacatgcgatctcaacgtttgcgccgcaaaaaaacgtccagcttgaactcgaatag (NP 389616; SEQ ID NO:46)
[0194] In some embodiments, the SigA/SigH constructs encompass promoter sequences 5 that have been mutated to increase the activity of the promoter when compared to the activity of the corresponding wild-type promoter resulting in the overexpression of the YmaH protein. Thus, it is understood that variants of the sequences that define the SigA and SigH promoters find use in the YmaH-expression constructs. Methods for creating promoter variants in Bacillus sp. are well known in the art (See e.g., Helmann et al.,
10 2002. RNA polymerase and sigma factors, pp289-312 In A. L. Sonenshein, J. A. Hoch and R. Losick (ed), Bacillus subtilis and its closest relatives: from genes to cells. American Society for Microbiology, Washington, D. C.) It is not intended that the present invention be limited to any particular promoter, as any suitable promoter known to those skilled in the art finds use with the present invention. Nonetheless, in some embodiments,
15 the promoter is the B. subtilis sigH promoter, while in other embodiments the promoter is the B. subtilis sigA promoter. In further embodiments, the sigH and the sigA promoters serve to effect the overexpression of YmaH protein.
[0195] In some embodiments, the SigA/SigH polynucleotide constructs of the invention also comprise the requisite ribosome binding site to ensure optimal translation of the
20 ymati RNA transcript. In some embodiments, the polynucleotide construct comprises the ribosome bind site (RBS) sequence of the miaA gene (aagagag; SEQ ID NO:50), while in other embodiments, polynucleotide construct comprises the RBS sequence of the ymati gene (ggagg; SEQ ID NO:51 ). In yet other embodiments, the polynucleotide construct comprises the ribosome binding site sequences of the miaA and the ymati genes. In
25 some embodiments, the invention provides constructs having the promoter and ribosome binding site sequences upstream of the ymati coding sequence. The invention is not limited to the ribosome binding site sequences disclosed herein, as it also encompasses any suitable ribosome binding site sequences that have been mutated to increase the level of expression of the ymati gene. Methods for obtaining mutated ribosome binding
30 sequences that increase the expression of a gene in Bacillus are known in the art. For example, Band and Henner successfully increased the level of expression of Interferon in B. subtilis by modifying the RBS to obtain a tighter base-pairing to the 16S rRNA (Band, L. and D. J. Henner, DNA 3:17-21 [1984]).
35 [0196] Production of a Protein of Interest in a Modified Cell
[0197] In some embodiments, the invention provides methods for producing a protein of 50
interest in a modified Bacillus sp. host cell, which comprises at least one inactivated phr gene (e.g., an inactivated phrA and/or a phrE gene), or an inactivated phr and/or rap gene by culturing a modified cell that is capable of producing a protein of interest and growing the cell under suitable growth conditions for expressing the protein of interest. 5 The methods provide for the production of any one protein of interest described above. In some embodiments, the protein of interest produced by the method of the invention is a protease (e.g., a subtilisin).
[0198] In one embodiment, the method of the invention comprises inactivating at least one phr gene by introducing an inactivating DNA construct into a Bacillus sp. host cell to
10 generate a modified Bacillus sp. host cell, and growing the modified cell under suitable conditions to produce a protein of interest at a level that is greater than that produced by the unmodified or precursor Bacillus host cell. Precursor host cells include precursor host cells of Bacillus sp. strains as described above, including Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans,
15 Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megateήum, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis strains. In some embodiments, the precursor host cell is a Bacillus subtilis host cell. Preferably, the precursor host cells are recombinant cells comprising a recombinant polynucleotide that encodes a polypeptide of interest, as described above.
20 In some embodiments, the polypeptide of interest is an enzyme (e.g., a protease, such as a subtilisin). The method of inactivating at least one phr gene (e.g. phrA and/or phrE) in a precursor Bacillus sp. host cell, generates a modified Bacillus sp. cell that produces a polypeptide of interest at a level that is greater than that achieved by the corresponding unmodified precursor host cell.
25 [0199] In one embodiment, the method comprises inactivating a phrA gene by introducing into the precursor Bacillus sp. host cell an inactivating DNA construct that deletes the indigenous phrA gene. For example the inactivating DNA construct of SEQ ID NO:17 is introduced to delete the indigenous phrA gene by homologous recombination. In another embodiment, the method comprises inactivating a phrE gene by introducing into the
30 precursor Bacillus sp. host cell an inactivating DNA construct that deletes the indigenous phrE gene. For example the inactivating DNA construct of SEQ ID NO:18 is introduced to delete the indigenous phrE gene by homologous recombination. In yet another embodiment, both the phrA and phrE genes are inactivated using the inactivating constructs of SEQ ID NOs:17 and 18 . The method of the invention is similarly used to
35 inactivate other phr genes including phrC, phrF, phrG, phrti, phrl, and phrK and/or the rap genes including rapB, rapC, rapD, rapE, rapF, rapG, rapH, rapl, rapJ and rapK. 51
[0200] According to one embodiment, inactivation of the phrA gene is by insertion of a selectable marker that interrupts the phrA gene. Alternatively, inactivation of the phrA gene results from the inactivation of the rapA gene by introducing a selectable marker comprising a terminator sequence in the rapA gene thereby preventing the functional 5 expression of the rapA and phrA protein According to one embodiment, inactivation of the rapA gene is by insertion of a selectable marker that interrupts the rapA gene. [0201] Methods for inactivating phr and/or rap genes are exemplified in the experimental section below. [0202] Production of a protein of interest (e.g., a protease), by a modified Bacillus sp. cell
10 comprising at least one einactivated phr gene and/or rap gene as described above, is greater than that obtained from a corresponding unmodified precursor cell. [0203] In some embodiments, the production of a protein of interest by a modified Bacillus sp. cell is further enhanced from the expression of one or more copies of a YmaH-encoding polynucleotide comprised in an expression construct that is present on a
15 multicopy/replicating plasmid that has been introduced into the modified cell. Any one of the YmaH-encoding polynucleotide constructs described above (e.g., SigA; SigA1 , SigA2, SigA3) or SigH constructs, are used to transform the modified Bacillus sp. cells. In some embodiments, the YmaH-encoding polynucleotide that is present on a replicating plasmid is introduced into a precursor host cell prior to the precursor host cell being modified to
20 contain a deletion in at least one phr and/or rap gene. Thus, in some embodiments, the invention provides for modified Bacillus sp. cell comprising a vector comprising an expression construct comprising a YmaH-encoding polynucleotide operably linked to a YmaH promoter that is incorporated into the vector. In some embodiments, overexpression of YmaH is achieved by introducing a SigH expression construct that
25 comprises a YmaH-encoding polynucleotide operably linked to a SigH promoter (e.g., the expression construct of SEQ ID NO:23). In embodiments, overexpression of YmaH is achieved by introducing a SigA expression construct that comprises a YmaH-encoding polynucleotide operably linked to a SigA promoter. Examples of SigA constructs include the SigA1 expression construct of SEQ ID NO:26, the SigA2 expression construct of SEQ
30 ID NO:31 , and the SigA3 construct of SEQ ID NO:22.
[0204] In some embodiments, the vector is a multicopy/replicating plasmid vector which forms an extrachromosomal self-replicating genetic element that overexpresses YmaH in the modified cell. Typically, the vector is a plasmid vector, which carries a selectable marker gene that allows for ease of selecting the host cells that contain the plasmid.
35 Vectors that replicate autonomously in a host cell include vectors that comprise an origin of replication, which enables the vector to replicate autonomously in the Bacillus cell. 52
Examples of bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. coli, and pUB1 10, pC194, pE194, pTA1060, and pAMβi permitting replication in Bacillus. The origin of replication may be one having a mutation to make its function temperature- 5 sensitive in the Bacillus cell (See e.g., Ehrlich, Proceedings of the National Academy of
Sciences USA 75:1433 [1978]).
[0205] As indicated above, in some embodiments of the present invention, a polynucleotide encoding the YmaH protein is introduced into a modified cell via an expression vector capable of replicating within the host cell. Suitable replicating and
10 integrating plasmids for Bacillus known in the art (See e.g., Harwood and Cutting (eds),
Molecular Biological Methods for Bacillus, John Wiley & Sons, [1990], in particular, chapter 3; suitable replicating plasmids for B. subtilis include those listed on page 92). [0206] In some embodiments, the overexpression of a YmaH polypeptide results from the expression of at least one copy of a YmaH-encoding polynucleotide that is integrated into
15 the genome of the host cell. Thus, in some embodiments, when the vector is introduced into the host cell, it is integrated into the genome and replicated together with the genome into which it has integrated. Multiple copies of the YmaH gene can be integrated at several positions in the genome of the host cell. Alternatively, an amplifiable expression cassette carrying a sequence encoding YmaH and a selectable marker (e.g., an
20 antimicrobial resistance marker, such as a gene coding chloramphenicol acetyl transferase) can be integrated in the genome via a single cross-over event and then amplified by challenging the transformed host cell with increasing concentrations of the appropriate antimicrobial ( e.g., chloramphenicol). [0207] In other embodiments, the invention provides a polynucleotide construct that is
25 incorporated into an integrating vector. In some embodiments, the polynucleotide constructs of the invention that are incorporated into an integrating vector are targeted to chromosomal sequences of the Bacillus sp. host cell to create modified host cells that comprise stable tandem integrations of multiple vector copies. The polynucleotide construct that is incorporated into the integration vector typically comprises a selectable
30 marker gene that provides the cell with resistance to an antimicrobial agent and allows for the amplification of the integrated ymaH construct. Tandem integration into a single site as well as single-copy and two-site integration may occur. Whether the polynucleotide construct is incorporated into a vector or used without the presence of plasmid DNA, it is used to transform modified cells using any suitable method known in the art.
35
[0208] Culturing Methods 53
[0209] The invention provides methods for producing a protein of interest in a modified Bacillus cell by culturing the modified cell that is capable of producing a protein of interest and growing the cell under suitable growth conditions for expressing the protein of interest. In some embodiments, the host cells and modified host cells of the present 5 invention are cultured in conventional nutrient media. The suitable specific culture conditions, such as temperature, pH and the like are known to those skilled in the art. Additional preferred culture conditions are well known to those of skill in the art and are described in various reference publications. [0210] In some embodiments, the protein of interest produced by the modified host cell is
10 confined to the intracellular milieu of the host cell, while in other embodiments the protein of interest produced by the host cell is secreted into the extracellular space (i.e., the culture medium). Thus, in some embodiments, the protein of interest can be recovered from the intracellular milieu of the cell in which it is expressed by lysing the host cell and recovering the protein of interest by methods known in the art. In other embodiments,
15 modified host cells are cultured under conditions suitable for the expression and recovery of the protein of interest from the cell culture. The protein of interest produced by a modified host cell overexpressing ymaH according to the present invention is secreted into the culture media. In some embodiments, the protein of interest (e.g., a protease), produced by the cells is recovered from the culture medium by conventional procedures,
20 including, but not limited to separating the host cells from the medium by centrifugation or filtration, precipitating the proteinaceous components of the supernatant or filtrate by means of a salt (e.g., ammonium sulfate), chromatographic purification (e.g., ion exchange, gel filtration, affinity, etc.). Thus, any method suitable for recovering the protease(s) of the present invention finds use in the present invention. Indeed, it is not
25 intended that the present invention be limited to any particular purification method.
[0211] In some embodiments, other recombinant constructions join the heterologous or homologous polynucleotide sequences encoding the proteins of interest to nucleotide sequence encoding a polypeptide domain which facilitates purification of soluble proteins (Kroll DJ et al., DNA Cell Biol 12:441 -53 [1993]). Such purification facilitating domains
30 include, but are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals (Porath, Protein Expr Purif 3:263- 281 [1992]), protein A domains that allow purification on immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification system (Immunex Corp, Seattle WA). The inclusion of a cleavable linker sequence such as Factor XA or
35 enterokinase (Invitrogen, San Diego CA) between the purification domain and the heterologous protein also find use to facilitate purification. 54
[0212] In some embodiments, the transformed host cells of the present invention are cultured in a suitable nutrient medium under conditions permitting the expression of a protein of interest (e.g., a protease), after which the resulting protease is recovered from the culture. The medium used to culture the cells comprises any conventional medium 5 suitable for growing the host cells, such as minimal or complex media containing appropriate supplements. Suitable media are available from commercial suppliers or may be prepared according to published recipes (e.g., in catalogues of the American Type Culture Collection). In some embodiments, the host cells are cultured under batch, fed- batch or continuous fermentation conditions. Classical batch fermentation methods use a
10 closed system, wherein the culture medium is made prior to the beginning of the fermentation run, the medium is inoculated with the desired organism(s), and fermentation occurs without the subsequent addition of any components to the medium. In certain cases, the pH and oxygen content, but not the carbon source content, of the growth medium are altered during batch methods. The metabolites and cell biomass of the batch
15 system change constantly up to the time the fermentation is stopped. In a batch system, cells usually progress through a static lag phase to a high growth log phase and finally to a stationary phase where growth rate is diminished or halted. If untreated, cells in the stationary phase eventually die. In general terms, the cells in log phase produce most protein.
20 [0213] A variation on the standard batch system is the "fed-batch fermentation" system. In this system, nutrients (e.g., a carbon source, nitrogen source, O2, and typically, other nutrients) are only added when their concentration in culture falls below a threshold. Fed- batch systems are useful when catabolite repression is apt to inhibit the metabolism of the cells and where it is desirable to have limited amounts of nutrients in the medium.
25 Measurement of the actual nutrient concentration in fed-batch systems is estimated on the basis of the changes of measurable factors such as pH, dissolved oxygen and the partial pressure of waste gases such as CO2. Batch and fed-batch fermentations are common and well known in the art. [0214] Continuous fermentation is an open system where a defined culture medium is
30 added continuously to a bioreactor and an equal amount of conditioned medium is removed simultaneously for processing. Continuous fermentation generally maintains the cultures at a constant high density where cells are primarily in log phase growth. [0215] Continuous fermentation allows for the modulation of one factor or any number of factors that affect cell growth and/or end product concentration. For example, in some
35 embodiments, a limiting nutrient such as the carbon source or nitrogen source is maintained at a fixed rate and all other parameters are allowed to moderate. In other 55
systems, a number of factors affecting growth are altered continuously while the cell concentration, measured by media turbidity, is kept constant. Continuous systems strive to maintain steady state growth conditions. Thus, cell loss due to medium being drawn off may be balanced against the cell growth rate in the fermentation. Methods of modulating 5 nutrients and growth factors for continuous fermentation processes as well as techniques for maximizing the rate of product formation are known to those of skill in the art and find use in the production of a protein of interest (e.g., a protease) according to the methods of the invention. [0216] As indicated above, the modified Bacillus sp. of the invention produce proteins of
10 interest at a level that is greater than that obtained from the corresponding unmodified precursor Bacillus sp. cells. The enhanced level of protein production by the modified cells is further increased by overexpressing YmaH. In some embodiments of the present invention, overexpression of YmaH in a Bacillus host cell results in an increase in the production of a protein of interest above the level obtained in the corresponding modified
15 precursor Bacillus sp. cell that does not overexpress YmaH. In some embodiments, the invention provides modified Bacillus host cells that overexpress YmaH. In some embodiments the recombinant Bacillus host cell is a cell that was altered to produce greater levels of a protease than the unaltered parent/precursor Bacillus cell when grown under the same conditions.
20 [0217] The present invention also encompasses methods for producing a protein of interest in a modified cell that overexpresses YmaH in less time than that required by the precursor host cell. For example, the modified host cells of the invention are capable of producing a protein of interest at a greater level and at an earlier time than the corresponding unmodified precursor host cell. Thus, in some embodiments, the invention
25 provides for methods of producing a protein of interest (e.g., a protease), at a level that is greater than that produced by the parent host cell and in about 1/6th of the time it takes the precursor host cell to attain its maximum level of expression. In other embodiments, the modified host produces a protein of interest in about 1/5th, about 1/4th, about 1/3rd, or about Vz of the time it takes the precursor host cell to attain its maximum level of
30 expression.
[0218] Measurement of Production/Activity [0219] EXPERIMENTAL
[0220] The following examples provide those of ordinary skill in the art with a complete 35 disclosure and description of how to make and use the present invention, and are not intended to limit the scope of what the inventors regard as their invention nor are they 56
intended to represent that the experiments below are all or the only experiments performed. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperature, etc.) but some experimental errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is 5 weight average molecular weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric.
[0221] In the experimental disclosure which follows, the following abbreviations apply: 0C (degrees Centigrade); rpm (revolutions per minute); H2O (water); aa (amino acid); bp (base pair); kb (kilobase pair); kD (kilodaltons); gm (grams); μg and ug (micrograms); mg
10 (milligrams); ng (nanograms); μl and ul (microliters); ml (milliliters); mm (millimeters); nm
(nanometers); μm and urn (micrometer); M (molar); mM (millimolar); μM and uM (micromolar); U (units); V (volts); MW (molecular weight); sec (seconds); min(s) (minute/minutes); h(s) and hr(s) (hour/hours); OD280 (optical density at 280 nm); OD405 (optical density at 405 nm); OD600 (optical density at 600 nm); PAGE (polyacrylamide gel
15 electrophoresis); LAS (lauryl sodium sulfonate); SDS (sodium dodecyl sulfate); and Tris
(tris(hydroxymethyl)aminomethane).
[0222] EXAMPLE 1 [0223] phr Gene Deletions
20 [0224] The phr genes: phrA, phrE, phrC, phrF, phrG, phrH, phrl and phrK were deleted in the Bacillus subtilis strain BG2942 (ΔnprE, degU(Hy)32, amyE::[PxylRA-comK eryR]), and AprE protease expression in the resulting modified Bacillus subtilis strains was determined using an AAPF assay. Deletion of phr genes was performed by inserting a spectinomycin selectable marker flanked by the lox site in the phr locus of the Bacillus
25 chromosome, while leaving the upstream rap gene and the downstream genes intact. The inactivation cassette used to delete the phr genes is illustrated in Figure 2. The deletion of phrA and phrE genes was also performed in the Bacillus subtilis strain BG3594 (degU(Hy)32, oppA, ΔspollE, ΔaprE, ΔnprE), which carries the amplifiable expression construct PaprE-FNA (nucleotide sequence of aprE promoter-FNA: SEQ ID NO:19) for
30 expressing FNA.
[0225] gaattcctccattttcttctgctatcaaaataacagactcgtgattttccaaacgagctttcaaaaaagcctctgcccctt gcaaatcggatgcctgtctataaaattcccgatattggcttaaacagcggcgcaatggcggccgcatctgatgtctttgcttggc gaatgttcatcttatttcttcctccctctcaataattttttcattctatcccttttctgtaaagtttatttttcagaatacttttatcatcatgcttt gaaaaaatatcacgataatatccattgttctcacggaagcacacgcaggtcatttgaacgaattttttcgacaggaatttgccg
35 ggactcaggagcatttaacctaaaaaagcatgacatttcagcataatgaacatttactcatgtctattttcgttcttttctgtatgaa aatagttatttcgagtctctacggaaatagcgagagatgatatacctaaatagagataaaatcatctcaaaaaaatgggtcta 57
ctaaaatattattccatctattacaataaattcacagaatagtcttttaagtaagtctactctgaatttttttaaaaggagagggtaa agagtgagaagcaaaaaattgtggatcagtttgctgtttgctttagcgttaatctttacgatggcgttcggcagcacatcctctgc ccaggcggcagggaaatcaaacggggaaaagaaatatattgtcgggtttaaacagacaatgagcacgatgagcgccgct aagaagaaagatgtcatttctgaaaaaggcgggaaagtgcaaaagcaattcaaatatgtagacgcagcttcagctacatta 5 aacgaaaaagctgtaaaagaattgaaaaaagacccgagcgtcgcttacgttgaagaagatcacgtagcacatgcgtacg cgcagtccgtgccttacggcgtatcacaaattaaagcccctgctctgcactctcaaggctacactggatcaaatgttaaagtag cggttatcgacagcggtatcgattcttctcatcctgatttaaaggtagcaggcggagccagcatggttccttctgaaacaaatcc tttccaagacaacaactctcacggaactcacgttgccggcacagttgcggctcttaataactcaatcggtgtattaggcgttgcg ccaagcgcatcactttacgctgtaaaagttctcggtgctgacggttccggccaatacagctggatcattaacggaatcgagtg
10 ggcgatcgcaaacaatatggacgttattaacatgagcctcggcggaccttctggttctgctgctttaaaagcggcagttgataa agccgttgcatccggcgtcgtagtcgttgcggcagccggtaacgaaggcacttccggcagctcaagcacagtgggctaccc tggtaaatacccttctgtcattgcagtaggcgctgttgacagcagcaaccaaagagcatctttctcaagcgtaggacctgagct tgatgtcatggcacctggcgtatctatccaaagcacgcttcctggaaacaaatacggcgcgttgaacggtacatcaatggcat ctccgcacgttgccggagcggctgctttgattctttctaagcacccgaactggacaaacactcaagtccgcagcagtttagaa
15 aacaccactacaaaacttggtgattctttctactatggaaaagggctgatcaacgtacaggcggcagctcagtaa (SEQ
ID NO:19).
[0226] The PaprE-FNA expression construct comprises a polynucleotide sequence encoding the FNA protease operably linked to the aprE promoter of Bacillus subtilis. FNA (PURAFECT
20 [0227] PRIME [Genencor]) is subtilisin BPN' from B. amyloliquefaciens that has the
Y217N substitution (SEQ ID NO:20)
[0228] VRSKKLWISLLFALALIFTMAFGSTSSAQAAGKSNGEKKYIVGFKQTMSTMSAAK KKDVISEKGGKVQKQFKYVDAASATLNEKAVKELKKDPSVAYVEEDHVAHAYAQSVPYG VSQIKAPALHSQGYTGSNVKVAVIDSGIDSSHPDLKVAGGASMVPSETNPFQDNNSHGT 25 HVAGTVAALNNSIGVLGVAPSASLYAVKVLGADGSGQYSWIINGIEWAIANNMDVINMSL
GGPSGSAALKAAVDKAVASGVVVVAAAGNEGTSGSSSTVGYPGKYPSVIAVGAVDSSN QRASFSSVGPELDVMAPGVSIQSTLPGNKYGALNGTSMASPHVAGAAALILSKHPNWTN TQVRSSLENTTTKLGDSFYYGKGLINVQAAAQ (SEQ ID NO:20).
[0229] A more detailed description of the construction of these strains is set forth below.
30 The sequences of the primers used for creating the constructs used to delete the phr genes are provided in Table 1 .
[0230] For the phrA deletion cassette the upstream region of the phrA gene containing the rapA sequence was amplified with the primers CB2 008-007 (SEQ ID NO:1 ) and CB2 008-009 (SEQ ID NO:3) and fused to the spectinomycin cassette, flanked by the loxP
35 sequence, and amplified with the oligos CB2 008-009R (SEQ ID NO:4) and CB2 008-
01 OR (SEQ ID NO:6). The downstream region of the phrA gene was amplified with the 58
oligos CB2 008-010 (SEQ ID NO:5) and CB2008-008 (SEQ ID NO:2) and fused to the PCR product containing the rapA sequences and the spectinomycin cassette. [0231] To create the phrC deletion cassette, the upstream region of the phrC gene containing the rapC sequence was amplified with the primers CB2 008-015 and CB2 008- 5 016 and fused to the spectinomycin cassette, flanked by the loxP sequence, and amplified with the oligos CB2 008-016R and CB2 008-017R. The downstream region of the phrC gene was amplified with the oligos CB2 008-017 and CB2008-018 and fused to the PCR product containing the rapC sequences and the spectinomycin cassette. [0232] To create the phrE deletion cassette, the upstream region of the phrE gene
10 containing the rapE sequence was amplified with the primers CB2008-019A (SEQ ID
NO:7) and CB2008-019B (SEQ ID NO:9) and fused to the spectinomycin cassette amplified with the oligos CB2008-019R (SEQ ID NO:10) and CB2008-020R (SEQ ID NO:12). The downstream region of the phrE gene was amplified with the oligos CB2008- 020 (SEQ ID NO:1 1 ). and CB2008-021 (SEQ ID NO:8). and fused to the purified PCR
15 product containing the partial rapE sequence and the spectinomycin cassette.
[0233] To create the pfrrFdeletion cassette, the upstream region of the phrFgeue containing the rapFsequence was amplified with the primers CB2008-022 and CB2008- 023 and fused to the spectinomycin cassette amplified with the oligos CB2008-023R and CB2008-024R. The downstream region of the phrFgeue was amplified with the oligos
20 CB2008-024 and CB2008-025 and fused to the purified PCR product containing the rapF sequence and the spectinomycin cassette.
[0234] To create the phrG deletion cassette, the upstream region of the phrG gene containing the rapG sequence was amplified with the primers CB2008-026 and CB2008- 027R and fused to the spectinomycin cassette amplified with the oligos CB2008-027 and
25 CB2008-028R. The downstream region of the phrG gene was amplified with the oligos
CB2008-028 and CB2008-029 and fused to the purified PCR product containing the rapG sequence and the spectinomycin cassette.
[0235] To create the phrH deletion cassette, the upstream region of the phrti gene containing the rap/-/ sequence was amplified with the primers CB2008-01 1 and CB2008-
30 012 and fused to the spectinomycin cassette amplified with the oligos CB2008-012R and
CB2008-013R. The downstream region of the phrH gene was amplified with the oligos CB2008-013 and CB2008-014 and fused to the purified PCR product containing the rapH sequence and the spectinomycin cassette. [0236] To create the phrl deletion cassette, the upstream region of the phrl gene
35 containing the rap/ sequence was amplified with the primers CB2008-030 and CB2008-
031 and fused to the spectinomycin cassette amplified with the oligos CB2008-031 R and 59
CB2008-032R. The downstream region of the phrl gene was amplified with the oligos CB2008-032 and CB2008-033 and fused to the purified PCR product containing the rapl sequence and the spectinomycin cassette.
[0237] To create the phrKόe\eϊ\on cassette, the upstream region of the phrK gene 5 containing the rap/Csequence was amplified with the primers CB2008-034 and CB2008-
035 and fused to the spectinomycin cassette amplified with the oligos CB2008-035R and CB2008-036R. The downstream region of the phrKgene was amplified with the oligos CB2008-036 and CB2008-037 and fused to the purified PCR product containing the rapK sequence and the spectinomycin cassette.
10 [0238] Two loxP sites were introduced on both sides of the spectinomycin selectable marker to facilitate the removal of the antibiotic resistance. The final PCR products were purified and transformed into Bacillus subtilis BG2942 (ΔnprE, degU(Hy)32, amyE::[PxylRA-comK-eryR]).
[0239] Once the DNA constructs were stably integrated, via double cross-over, into the
15 chromosome of a competent Bacillus subtilis BG2942 strain, the deletions were confirmed by PCR analysis. The phrA region was amplified with the primers CB2008-041 (SEQ ID NO:13) and CB2008-042 (SEQ ID NO:14) and the phrE region was amplified with the primers CB2008-051 (SEQ ID NO:15) and CB2008-052 (SEQ ID NO:16). The resulting PCR products were sequenced to confirm the absence of PCR errors and the insertion of
20 the antibiotic marker in the target phr gene.
[0240] The transformation of Bacillus subtilis BG2942, which carries an inducible ComK construct in the amyE site, was performed as described in the patent application published as US2002182734. [0241] The BG2942 derived strains carrying the phrA or phrE deletion were then
25 transformed with a plasmid expressing the Cre recombinase. This was a necessary step to eliminate the spectynomycin antibiotic marker by site-specific recombination. [0242] The sequences and the descriptions of the primers used in the above experimental procedure are set are in Table 1 .
30 TABLE 1
Figure imgf000061_0001
60
Figure imgf000062_0001
61
Figure imgf000063_0001
62
Figure imgf000064_0001
63
Figure imgf000065_0001
[0243] The nucleotide sequence of the phrA deletion construct is: attcgttattgcaggtaattatgatgatatgcagtatccagaaagagcattgccccacttagaactggctttagatcttgcaaagaaaga
5 aggcaatccccgcctgatcagttctgccctatataatctcggaaactgctatgagaaaatgggtgaactgcaaaaggcagccgaata ctttgggaaatctgtttctatttgcaagtcggaaaagttcgataatcttccgcattctatctactctttaacacaagttctgtataaacaaaaa aatgacgccgaagcgcaaaaaaagtatcgtgaaggattggaaatcgcccgtcaatacagtgatgaattatttgtggagctttttcaattt ttacatgcgttatacggaaaaaacattgacacagaatcagtctcacacacctttcaatttcttgaagaacatatgctgtatccttatattga agagctggcgcatgatgctgcccaattctatatagaaaacggacagcccgaaaaagcactttcattttatgagaaaatggtgcacgc
10 acaaaaacaaatccagagaggagattgtttatatgaaatctaaatggatgtcaggtttgttgctcgttgcggtcgggggcctaggatgc atatggcggccgcataacttcgtatagcatacattatacgaagttatctagacatatgcaagggtttattgttttctaaaatctgattaccaa 64
ttagaatgaatatttcccaaatattaaataataaaacaaaaaaattgaaaaaagtgtttccaccattttttcaatttttttataatttttttaatct gttatttaaatagtttatagttaaatttacattttcattagtccattcaatattctctccaagataactacgaactgctaacaaaattctctcccta tgttctaatggagaagattcagccactgcatttcccgcaatatcttttggtatgattttacccgtgtccatagttaaaatcatacggcataaa gttaatatagagttggtttcatcatcctgataattatctattaattcctctgacgaatccataatggctcttctcacatcagaaaatggaatat 5 caggtagtaattcctctaagtcataatttccgtatattcttttattttttcgttttgcttggtaaagcattatggttaaatctgaatttaattccttctga ggaatgtatccttgttcataaagctcttgtaaccattctccataaataaattcttgtttgggaggatgattccacggtaccatttcttgctgaat aataattgttaattcaatatatcgtaagttgcttttatctcctattttttttgaaataggtctaattttttgtataagtatttctttactttgatctgtcaat ggttcagatacgacgactaaaaagtcaagatcactatttggttttagtccactctcaactcctgatccaaacatgtaagtaccaataagg ttattttttaaatgtttccgaagtatttttttcactttattaatttgttcgtatgtattcaaatatatcctcctcactattttgattagtacctattttatatcc 10 atagttgttaattaaataaacttaatttagtttatttatagatttcattggcttctaaattttttatctagataacttcgtatagcatacattatacga agttatggatccagcttatcgataccgtcgctcggatccactagtatgcataaaaaaagacccttaggggtcttttttatttcttcagcttcc attcttttatcgtcagctcagaagatccacttgccaccagcggatccgcatggccgatttccgctgcctcttccagtgaatctgcttcgatg acatacgctccgcctgtggcgtcgctgaatggcccaaacatttttaaacgtttttctgcctgtaaacgatccagaaattcatagtgcccag ccacatgctcctgattaaatttctccgttctcattgtcagcattaaatatggtatacatattcagaccctccgtgaacttcagtttaacacattt 15 atccatattacggtgatagatgatatgagcttttcgtcctacgaatgccacctatttatgaaaaaagaaaaggagagatgataggtgag cattccagtaaagaaaaatttggtttctgaggcgaaatacgcgttgaagtgtcctaatgcaatgtccgctgaatacattaccattcacaa cacggcaaacgatgcatcagcggccaatgaaatcagctatatgatcgggaacacaagctcgacaagctttcattttgcggtcgatga tcaagaggtgattcaaggtctgccgcttaaccgaaacgcttggcacactggtgacggcacaaacggtccgggaaaccgcaaatca atcggtgttgagatttgctacagcaaatcgggaggcccgaagtatgaggcagctgaagccttggcgatttcatttgttgcacagctgttg 20 aaggagcgcggctggggcatcgatcgggtgagaaagcatcaggactggagcggaaagtattgcccgcaccgcattttatcagag gggcgctgggatcaagtgaaggcggcgattgaaaaggaattaaacgggggcgtatcagcgaaaaaagctgcagtctcttcttcgg cgtctgaatatcatgtaaaaaaaggtgacacactgtcagggattgccgcatcacacggggcc (SEQ ID NO:17). [0244] The nucleotide sequence of the phrC deletion construct is: [0245] tcactaatggaattccggcaccagcttatgctggattatcttgagccgttagagaaattaaatatcgaagaccagcc 25 aagcctgtctgaattatcaagaaacattgacagcaaccaggcagatctcaaagggctgctcgactattacgtgaatttttttcgc gggatgtatgaatttgataagcgggaatttatttctgccattacatactataaacaggcggagaaaaagctctcctttgtcgcag accatattgaacgggctgaattctattttaaaatcgcggaagcttattattatatgaagcaaacgtatttttcattgattaatataaa aaacgcctatgaaatttacgtggagcaggaaacctataatgtgagaatcattcagtgccatttcgtcttcggggtcaacctgat ggatgaaagaaatttcgaacaagccgcacgccatttcaaattggcgctcaacatggcccaagcagaacaaaaagcccag 30 ctggttggaagagcatactacaatctcgggttatgctattacaatcaagaccttctagaccctgccattgattactttgaaaaagc ggtctccacatttgaaagcagcaggatcgtcaattctctcccgcaagcctattttttaatcaccctgatttattataaacagggaa aacatgataaagcttcggaatatcacaagcggggctatgaatatgctaaagaaacagacgatgcagactatgccgtaaaat tcgagtttttgcaatccctatatctggatcagcccaatgaagaaggaatcgaacgatgtttccagtacttaaaaaataaaaatat gtacgctgatatagaggatttagccctagaagtagcaaaatattactatgaacagaaatggtttaaactgtctgcttcctactttct 35 acaagttgaagaggcaagaaaacaaatacaaaggagtgaaggtttgtatgaaattgaaatctaagttgtttgttatttgtttggc cgcagccgcgatggcctaggatgcatatggcggccgcataacttcgtatagcatacattatacgaagttatctagacatatgc 65
aagggtttattgttttctaaaatctgattaccaattagaatgaatatttcccaaatattaaataataaaacaaaaaaattgaaaaa agtgtttccaccattttttcaatttttttataatttttttaatctgttatttaaatagtttatagttaaatttacattttcattagtccattcaatatt ctctccaagataactacgaactgctaacaaaattctctccctatgttctaatggagaagattcagccactgcatttcccgcaatat cttttggtatgattttacccgtgtccatagttaaaatcatacggcataaagttaatatagagttggtttcatcatcctgataattatctat 5 taattcctctgacgaatccataatggctcttctcacatcagaaaatggaatatcaggtagtaattcctctaagtcataatttccgtat attcttttattttttcgttttgcttggtaaagcattatggttaaatctgaatttaattccttctgaggaatgtatccttgttcataaagctcttgt aaccattctccataaataaattcttgtttgggaggatgattccacggtaccatttcttgctgaataataattgttaattcaatatatcgt aagttgcttttatctcctattttttttgaaataggtctaattttttgtataagtatttctttactttgatctgtcaatggttcagatacgacgact aaaaagtcaagatcactatttggttttagtccactctcaactcctgatccaaacatgtaagtaccaataaggttattttttaaatgttt 10 ccgaagtatttttttcactttattaatttgttcgtatgtattcaaatatatcctcctcactattttgattagtacctattttatatccatagttgtt aattaaataaacttaatttagtttatttatagatttcattggcttctaaattttttatctagataacttcgtatagcatacattatacgaagt tatggatccagcttatcgataccgtcgagaacaagccccttctcattagcgagaaggggtttttcttttcaaaaaaacaccgca agacatagtcttgcggtgccgccttcatggagattacgtttatttagtagcctcctacaaatgcagttcccacaatgatcaagag gataaataacacaacaatcaaagcgaaagaagttccgtaacctgacattttgtgcacctccttgcgagattgcttcagcaaat 15 gctgcaaaactgtggcggacagggtcccgcagagacggtcagcagcttagaagccgccaacaaacgcagtccctacgat aattaatagaataaacaatacaacgattaaagcgaaagaactgatgccgccgtaaccgccgccgttagagtatcctgacat aaggtttcacctccctatgaaggatactataagatatgctgaaccgatccatttggcagggataatagtggacaagagaaaa aatgaagaattcggctatatgaaggtgatataaaaaaatagcgggcgctgccgcccgctatttatgtacgattaagagatcag cacgcccgcgaaaaattcctggtataacgcttgaacggcttttctttcttcggcttcttttacgccaaacatcatgctcacttcaga 20 agacccctgattgatcatttcgatattcacctgtgcctctgataatgctttggcggctcttgccgttgtaccgacattgtggcgcatc gcttcccctacaaccataatcagggcgagatgatgctcgacgatgacttcatcggcatgcaaatcctcttcgatccgtttgatga cgctgcgttcagtggcggcatccatttgcccctgccgtaaaatgattgtcatgtcatcgattcccgatggaacatgctcatacgtc aaaccatgctcctccaggatttgaagggctctgcggccaaaaccgatttctctgttcatgagatacttgctgatataaatgctgc (SEQ ID NO:95) 25 [0246] The nucleotide sequence of the phrE deletion construct is: ttttttctgttcagacataatggattttgatttggtgtaggcgttagcaagctcatgcgctaaaaaggtttcttctatgtaggcatctgataagtt ggcatcttctaaaaaaccaggaatactcgttaagaaagaaattttcatctcaattaaatcaatccattggtcagcaatgccagcttgatct tcataaaatgatttaatgttattagcgcctttgcctgaaaactcgctatcatctaaatctgcaacagctttgaacgctttctttaatttgaccatt ttactttttaaatctttgtattcctgtgctcgcttttcagcctcggtgagcaaggttttggcttcaaatactttcatgatcatatcctttcatttaatcg 30 tcataacaaaatattaccatggaagaatgatgaaactaactgttatgtggatcaaatggtggaaatgaatcattcgatctgtgtcatttta cctatttgttaatcctttcaatgaaaggggactttccaattgtaacatcgccatcatgaaaaaattcgataacgtagccagattcactaaa cataaaagtatccgatccaacggcagttacatcatcaattacgtttaatgcatgctcaagactggtttttaatgctggctgttctccgtaac cccaaagaataataatgttcctatctttaaaatggtgtttagctagccaatcgtaaatctcttcctcgtaatctatagattgatgacaacaa acttcttcccacttgattcgtccccaagatgtaagggaaaactgtttgaaagcagttcataatattttgccgttaattcttctgataagatttctt 35 tgtttttccctagagcttctaagcattcatcaaataagtccaaaatgttcacctcaaaagctttaagtatgatagattttttcagtattagaaat aagaaaaagccgttatgaaacggctaaagggaatcagaactaatgttgtttatcgaatcgacggtatatcgaaaggggaatgcatgt 66
atgaaatctaaattgtttatcagtttatccgccgttttaattggacttgcgaaaggcgaattccagcacactggcggccgttactagtggat ccgagctcggatccataacttcgtataatgtatgctatacgaagttatctagataaaaaatttagaagccaatgaaatctataaataaac taaattaagtttatttaattaacaactatggatataaaataggtactaatcaaaatagtgaggaggatatatttgaatacatacgaacaa attaataaagtgaaaaaaatacttcggaaacatttaaaaaataaccttattggtacttacatgtttggatcaggagttgagagtggacta 5 aaaccaaatagtgatcttgactttttagtcgtcgtatctgaaccattgacagatcaaagtaaagaaatacttatacaaaaaattagacct atttcaaaaaaaataggagataaaagcaacttacgatatattgaattaacaattattattcagcaagaaatggtaccgtggaatcatcc tcccaaacaagaatttatttatggagaatggttacaagagctttatgaacaaggatacattcctcagaaggaattaaattcagatttaac cataatgctttaccaagcaaaacg aaaaaataaaag aatatacgg aaattatg acttag agg aattactacctg atattccattttctg a tgtgagaagagccattatggattcgtcagaggaattaatagataattatcaggatgatgaaaccaactctatattaactttatgccgtatg 10 attttaactatggacacgggtaaaatcataccaaaagatattgcgggaaatgcagtggctgaatcttctccattagaacatagggaga gaattttgttagcagttcgtagttatcttggagagaatattgaatggactaatgaaaatgtaaatttaactataaactatttaaataacagat taaaaaaattataaaaaaattgaaaaaatggtggaaacacttttttcaatttttttgttttattatttaatatttgggaaatattcattctaattggt aatcagattttagaaaacaataaacccttgcatatgtctagataacttcgtataatgtatgctatacgaagttatgcggccgccatatgca tcctaggccgcaagtccaattaaaacggcggataaactgataaacaatttagatttcatacatgcattcccctttcgatatttgcttttgag 15 catataccatcttcttgaaacagatgatactatcctctattttcccattataatcgaaaaggttgcctcctaacaatgccagctcttccagat aagggtatcctttgccgttctctaaacgagaaaaaatgttgagaagtttaggtgtatcgccatttcttatataaagaacgtctaatgcttca aataagttcataaatagttcgtctttaaaatctacagcacttctgattcctttgcggaagcaatccattgcttgtccttttttgccttgtttaaaat aaatcaacgctaggtcatgataagcttgcggaagtacgtcagagttaatttttctgtattgaaccaaggcttgttcgatgtaacgagcag ccttatttaagttgtccattttgtgatagcaattgccgagattgaaaaacgcagtggcatagatatgagtatttttacttttaagcagctcggc 20 accttttaaagcttcttgaaggtgggggagagctttttcatgattttcaaggtcatcgtagttaccggcaatgacaaaatggcactgaata cgacgaacagagtaaagctcgtgtttcttataaatgttgtatgaaagctcagcgtaatgcatcgaaatgtgtgtcattttcatatgataata gacttcagacagtttaaaataaaactcagctttttcaatcttgtcggagattgtaggaattttgcgttcagcttttttgtaatatgtaatggctctt gtgtattcaccgtttctaaactcatacatcccgcggaagaagttataataatatgcccgcatattgtctaattttttcttatggccctcaatttta tttaaatattctgaaagttccattcggttttcatcagatggcagcgtgtattccaacataattttatggcgaaaggccattagttgataataaa 25 taagcaagtcttgatcttcttccataacctc (SEQ ID NO:18).
[0247] The nucleotide sequence of the phrF deletion construct is: [0248] agtttcggcacaacctaatgcttgagtaccttgaaccgttagaaaaaatgaggattgaggaacagccgagactgt ctgatctgctgcttgagattgataaaaaacaggctcgtttaactggtctgcttgagtactattttaacttcttcagaggcatgtacga gctggaccagcgggaatatctgtcggctattaaatttttcaaaaaggccgaaagcaagctgatattcgttaaggatcggatag 30 agaaagctgagtttttctttaagatgtctgaatcttattactatatgaaacaaacgtatttttcaatggactatgcacggcaagcata tgaaatatacaaagaacatgaagcttataatataagattgctgcagtgtcattctttatttgccaccaattttttagatttaaaacag tatgaggatgccatctcacattttcaaaaagcttattctatggcagaagctgaaaagcagccccaattaatggggagaactttg tacaatatcgggctttgtaaaaacagccaaagccaatatgaggatgccataccttatttcaaaagagcaatagctgtttttgaa gaatcaaatattcttccttccttacctcaagcgtattttttaattacacagatccattataaattaggaaaaatagataaagctcatg 35 aatatcatagtaagggaatggcttattcacaaaaggccggagatgtaatatatttatcagagtttgaatttttgaaatctttatactt atcaggcccggatgaagaagcaattcaaggattttttgattttctcgaaagtaaaatgttgtatgctgatcttgaagatttcgctatt 67
gatgtggcaaaatattatcatgaacgtaaaaattttcaaaaagcttctgcttattttttgaaggtggaacaagtaaggcaacttatt caaggaggagtgagtttgtatgaaattgaagtctaaactattactggcctaggatgcatatggcggccgcataacttcgtatag catacattatacgaagttatctagacatatgcaagggtttattgttttctaaaatctgattaccaattagaatgaatatttcccaaata ttaaataataaaacaaaaaaattgaaaaaagtgtttccaccattttttcaatttttttataatttttttaatctgttatttaaatagtttata 5 gttaaatttacattttcattagtccattcaatattctctccaagataactacgaactgctaacaaaattctctccctatgttctaatgga gaagattcagccactgcatttcccgcaatatcttttggtatgattttacccgtgtccatagttaaaatcatacggcataaagttaata tagagttggtttcatcatcctgataattatctattaattcctctgacgaatccataatggctcttctcacatcagaaaatggaatatca ggtagtaattcctctaagtcataatttccgtatattcttttattttttcgttttgcttggtaaagcattatggttaaatctgaatttaattccttc tgaggaatgtatccttgttcataaagctcttgtaaccattctccataaataaattcttgtttgggaggatgattccacggtaccatttc
10 ttgctgaataataattgttaattcaatatatcgtaagttgcttttatctcctattttttttgaaataggtctaattttttgtataagtatttcttta ctttgatctgtcaatggttcagatacgacgactaaaaagtcaagatcactatttggttttagtccactctcaactcctgatccaaac atgtaagtaccaataaggttattttttaaatgtttccgaagtatttttttcactttattaatttgttcgtatgtattcaaatatatcctcctcac tattttgattagtacctattttatatccatagttgttaattaaataaacttaatttagtttatttatagatttcattggcttctaaattttttatcta gataacttcgtatagcatacattatacgaagttatggatccagcttatcgataccgtcgaccgccgtccatcggcggttttttcgtc
15 ccctctttaccaaagtctcccaatccatgctatgatcttttcaataatcttgaagagagtggaaatgcagcatgtctctaaaaagt gtgagaacccactttactcaatggaatcgagaaaatgatgtgacggagttcgaaacgtcgagtgcgacagttgaacaggca gctgagacaatcggcgtaagcctgtctagaatcgccaagtccctgtccttcagaggggaaggagatcaggtgattctgattgt ggcagccggcgatgccaagatcgacaacaaaaagtccaggcaaacatttggctttaaagcaagaatgctctctcctaatga ggtgctggagcagacaggccatgaaattggaggagtttgcccatttggattggctcatgatcctgaggtttatcttgatgtatcgc
20 tgaaacggtttcagactgttttccccgcatgcggcagcagaaactccgctattgaattaacaccgaaagaattatccgaattttc tttctcaaaagtgtggattgatgtttgtaaagactgggaataaaaaaacatccagacatcgtctggatgtttacttatttcacaaac ccaagcagcatttcacggatgattttgctggctgtgtttgccgtttgctctgagtggtcgtataccggcgcgacttccactaaatca gcgccctttacgtttacctctgaacgcgcaatttcatggaccgatgcaagcagttctttagacgtgatgccgccggcgtcaacc gttcctgtacccggtgcgtgtgcagggtctaatacgtcaatgtcaattgtgacataaaccggacggcccgccagcttcggaag
25 cacctctttcagcggttcaagcacttcaaattttgagatgtgcatgccgttttccttcgcccattcaaactcttctttcatgccggaac ggattccgaatgaatacacattgtgcggtccga (SEQ ID NO:96) [0249] The nucleotide sequence of the phrG deletion construct is: [0250] agaggatcaggaggtgcttgcctacttctccttattggaactgcgccacaaggttttgcttcacgaggcgagaggac agggctttcagcatgaggagccttctcatatgaatgctacgtctgacatgctgaaatattacttttttctgtttgaaggcatgtatga
30 ggcctataaaaataattatgacattgccattgggctgtataaagatgcagagcagtatctcgacaacattcccgatccgattga aaaagccgaatttcacctgaaggtcggtaagctctattataagctgggacaaaatattgtgtccctcaatcatacacggcaag cagtcaaaacattcagagaagagacagattataaaaagaagctggcttcagccctgattaccatgtcaggcaattttacaga gatgagccagtttgaagaagctgaggcttatttggacgaagcaattcggatcacgagtgaattagaggatcatttttttgaagc ccagcttttgcataacttcggccttctacatgcgcaaagcggcaaatcagaagaagcggtttcgaaattagaggaggctctac
35 agaacgatgagtatgcccgctccgcctattattatcattctgcctacttgctgatacgagagctgtttaagatcaaaaagaaaga acaggccttatcttattaccaagacgtgaaggaaaaattgactgctgagccgaatagaatatgtgaggcaaaaatagacattt 68
tatatgccatttatgcagaagggggtcatgcggaaacgtttcacttatgcaaacaacatatggatgacttgttgtccgagaaag agtatgacagtgtaagagaactttccattttggctggcgaacggtatagggaacttgagctttacaaagaagctgcccactttttt tatgaagcattacagattgaagaactgattaaacgaacggaggttatataaatgaaaagatggcctaggatgcatatggcgg ccgcataacttcgtatagcatacattatacgaagttatctagacatatgcaagggtttattgttttctaaaatctgattaccaattag 5 aatgaatatttcccaaatattaaataataaaacaaaaaaattgaaaaaagtgtttccaccattttttcaatttttttataatttttttaat ctgttatttaaatagtttatagttaaatttacattttcattagtccattcaatattctctccaagataactacgaactgctaacaaaattc tctccctatgttctaatggagaagattcagccactgcatttcccgcaatatcttttggtatgattttacccgtgtccatagttaaaatc atacggcataaagttaatatagagttggtttcatcatcctgataattatctattaattcctctgacgaatccataatggctcttctcac atcagaaaatggaatatcaggtagtaattcctctaagtcataatttccgtatattcttttattttttcgttttgcttggtaaagcattatgg
10 ttaaatctgaatttaattccttctgaggaatgtatccttgttcataaagctcttgtaaccattctccataaataaattcttgtttgggagg atgattccacggtaccatttcttgctgaataataattgttaattcaatatatcgtaagttgcttttatctcctattttttttgaaataggtcta attttttgtataagtatttctttactttgatctgtcaatggttcagatacgacgactaaaaagtcaagatcactatttggttttagtccact ctcaactcctgatccaaacatgtaagtaccaataaggttattttttaaatgtttccgaagtatttttttcactttattaatttgttcgtatgt attcaaatatatcctcctcactattttgattagtacctattttatatccatagttgttaattaaataaacttaatttagtttatttatagatttc
15 attggcttctaaattttttatctagataacttcgtatagcatacattatacgaagttatggatccagcttatcgataccgtcgaatgaa aaacccccgcgggatgcgggggttcaatttaacgaaagaatcctaaaacggtttgtagttttaggattctttcatcttttcagcgt gattgaaaacccttgaagtctaggaagaacgagcattggagcgcagcgaatgtttggaattcgtgagcaccgaagcgcag gcctgacaacgaatgcgagggtttgtcgacacgctgaaaacccgcgggtgcgggggttttcttattacagcagcttcttcccta acagggattctacgagctctactgctgttttgcccgttttgtttttgtgatcaaggatcgggttaacctcaacgaattcggctgaggt
20 aatgatgcctgcgtcatacagcatttccatagccaaatggctctcccggtagctgatgccgccgacgacaggggttccgaca cccggtgcgtcgttcggatcaagtccgtccagatcaaggctcagatggacgccatcacatgctgataaataatcaagggtttc ttcaatgacctttgtcatgccaagacgatcgatttcgtgcattgtgtacaccttcatgccgctttccttaatgtacttgcgctccccttc atcaagtgaccgggcgccaatgatgacgacgttttccggtttgattttaggcgcgtagccttcaaggttaaccagtgactcgtgg ccaatgcctaggctgaccgcgagcggcatgccgtgaatattgcccgatggtgaagtttcaagtgtattcaaatcgccgtgcgc
25 gtcataccagatgacgccgagattatcgtaatgcttcgctgtgcctgcaagcgtgccgatcgcaatactgtggtcaccgccca ggacaagcgggaattttttctcttcaatgactttgttgaccttttgcgcgagtttttcatttcccgccaaaacggaattcaggtttttca gttcctcgtcatttttgattttttcgcgattgatcggaatgtcaccgaga (SEQ ID NO:97) [0251] The nucleotide sequence of the phrti deletion construct is: [0252] ggagggaagccgttgagtcaagccataccgtcttcgcgtgttggtgttaagattaatgaatggtataaaatgattcgc
30 cagttcagtgttccggatgctgagattctgaaagcggaggttgagcaggacattcagcagatggaagaagatcaggatttact gatctattattctctgatgtgttttcggcaccagctgatgcttgattatttggagccgggaaaaacatacgggaatcgccctacagt gacagagcttcttgaaacgatcgagacccctcagaaaaaactcacaggtcttttgaaatactactctttgtttttccgcggcatgt atgaatttgaccaaaaagaatatgtggaagcgatcggatattatcgcgaggcggagaaagaactgccgtttgtgtcagatga tattgagaaagcggaattccattttaaagtggcagaagcgtattatcacatgaagcaaacccatgtgtcgatgtatcatattcttc
35 aagccttggacatttatcaaaaccatcctctatacagcattagaacgatacaaagcttgtttgtgatcgccggcaactatgatga tttcaaacattatgataaagcgctcccgcatttagaggcggcgctggaattggcaatggacattcaaaatgacaggtttatcgc 69
catttctctattgaacattgcaaacagctatgacagatcaggagacgatcagatggctgtagaacatttccaaaaagcggcg aaagtaagcagagagaaagtgcctgatctgcttccgaaagtcttgtttggattaagctggacattatgtaaagcgggccaaac acagaaggcgtttcagttcatagaggaaggattagaccatatcacagcacgttctcacaaattttataaagaattgtttctgttctt gcaggccgtgtacaaggagactgttgatgaacgaaaaattcatgatcttttaagctatttcgaaaaaaagaacctgcacgctt 5 acattgaagcatgtgcccggagtgctgccgctgtttttgaaagcagctgtcactttgaacaagcagctgcgttttatcggaaagt gctgaaagcccaagaagatattctaaaagggagagtgtttatatgcctattaagaaaaaaagtgatgaggcctaggatgcat atggcggccgcataacttcgtatagcatacattatacgaagttatctagacatatgcaagggtttattgttttctaaaatctgattac caattag aatg aatatttcccaaatattaaataataaaacaaaaaaattg aaaaaag tg tttccaccattttttcaatttttttataat ttttttaatctgttatttaaatagtttatagttaaatttacattttcattagtccattcaatattctctccaagataactacgaactgctaac
10 aaaattctctccctatgttctaatggagaagattcagccactgcatttcccgcaatatcttttggtatgattttacccgtgtccatagtt aaaatcatacggcataaagttaatatagagttggtttcatcatcctgataattatctattaattcctctgacgaatccataatggctc ttctcacatcagaaaatggaatatcaggtagtaattcctctaagtcataatttccgtatattcttttattttttcgttttgcttggtaaagc attatggttaaatctgaatttaattccttctgaggaatgtatccttgttcataaagctcttgtaaccattctccataaataaattcttgttt gggaggatgattccacggtaccatttcttgctgaataataattgttaattcaatatatcgtaagttgcttttatctcctattttttttgaaat
15 aggtctaattttttgtataagtatttctttactttgatctgtcaatggttcagatacgacgactaaaaagtcaagatcactatttggtttt agtccactctcaactcctgatccaaacatgtaagtaccaataaggttattttttaaatgtttccgaagtatttttttcactttattaatttg ttcgtatgtattcaaatatatcctcctcactattttgattagtacctattttatatccatagttgttaattaaataaacttaatttagtttattta tagatttcattggcttctaaattttttatctagataacttcgtatagcatacattatacgaagttatggatccagcttatcgataccgtc gaggctttttcttgctttacggaagacggttccattttccacatcgcggcattccttctatttctaacgcaagacactcgaaacaac
20 caaaccatttgaggtataatggataaagtgaataacagtatttagattgatatatatgaaagagagtggaacatcatgggccgt aagtggaacaatattaaagagaagaaggcgtctaaggacgcaaatacgagtcggatttatgcgaagtttggccgtgagattt atgtggcggcgaaacagggcgagcctgatccggaatccaaccaggcgctgaaggttgtgcttgaacgtgcgaagacttac agcgtgccgaaaaacatcattgaacgtgcgatcgagaaggcgaagggcggagcggaagagaattacgatgagcttcgtt atgagggcttcgggccgaacggatcaatgattatcgttgatgcgctgacgaataatgtaaaccgtacggcgccggaagtgc
25 gtgcggcgttcgggaaaaacggcggaaacatgggtgtgagcggatctgttgcttacatgtttgacgcgacggctgtaatcgtg gtggaaggcaaaacggctgacgaagcgcttgaaatcctgatggaagcggatgttgatgtacgtgacattttagaagaggat gacagcgcgatcgtgtatgccgagcctgatcaattccatgcggtgcaagaggcgtttaaaaacgcgggtgtcgaggaattta cagtagcggagctgcaaatgcttgcgcaaagtgaagtaacgcttccggatgatgcaaaggaacagtttgaaaaattgattga tgcattagaagatttggaagatgttcagcaggtatatcataacgttgatttaggtgagtaaggagtgagcaggctgttatggcct
30 gctttttttgtcccggaaattgttttagctgtatgtaggcggccgcctatacgatctataagatattctcatactctggactgtaaccta tgtgaaggagagagtaaatatgactgatacaagacatatgtatggcggacctggttttggtcattatcagggctttggtattggc cacccgggctatggcatgcaaagcacaggctatccgggctatggcatgtatggaggccacccgggctatggcatgcaagg ctacccagatcacggcatacatggaggagtcggcggctatccgggatatggtgggtacggcggttacccaagcggcggct atggaggctctccgggaactggaagctatccgagcatgcaccatgaaaatgatggc (SEQ ID NO:98)
35 [0253] The nucleotide sequence of the phrl deletion construct is:
[0254] gaattgttaaacatggaagaaaatcaagatgccctgttatattatcaactattagaatttagacatgagataatgctg 70
agttatatgaaatctaaggaaatagaagatctcaataatgcttatgagactataaaagaaattgagaagcaagggcaattaa ctggcatgttggaatactatttttacttttttaagggtatgtacgagtttaggcgtaaagaattaatttcagcgataagtgcttatcga atagctgaatcaaagttgtcagaagttgaggatgaaatagagaaagcagagttttttttcaaagtgtcctatgtatattattatatg aaacaaacatacttctccatgaattatgcaaatcgtgcactcaaaatatttagagagtatgaagaatatgctgtccagactgtg 5 cgttgtcaatttattgtagcaggaaacttgatcgattcattggaatatgaaagagccttggaacaatttttgaagtctttggaaattt ccaaggaaagtaacatagagcatttaattgcaatgtcacatatgaatattgggatttgttatgatgaattgaaagaatataaga aggcttcacaacatttaattttagcgttagaaatttttgaaaaatcaaaacatagtttcttaacaaagactttattcactctaacctat gtagaagcaaaacaacaaaattataatgttgctttgatatactttaggaaagggcgatttattgccgataaaagtgatgataag gaatactcagcgaaattcaaaatattagagggattatttttttctgatggtgagactcaattaataaagaatgcattttcatatctgg
10 cttcgagaaaaatgtttgctgatgttgaaaatttttcgattgaagtcgctgattattttcatgaacaaggaaatttaatgctctctaat gaatattatcgtatgagtattgaagcaagacgaaaaattaaaaaaggggagattattgatgaaaatcagccggattctattgg cagcagtgattttaagtagtgtattggcctaggatgcatatggcggccgcataacttcgtatagcatacattatacgaagttatct agacatatgcaagggtttattgttttctaaaatctgattaccaattagaatgaatatttcccaaatattaaataataaaacaaaaa aattgaaaaaagtgtttccaccattttttcaatttttttataatttttttaatctgttatttaaatagtttatagttaaatttacattttcattagtc
15 cattcaatattctctccaagataactacgaactgctaacaaaattctctccctatgttctaatggagaagattcagccactgcattt cccgcaatatcttttggtatgattttacccgtgtccatagttaaaatcatacggcataaagttaatatagagttggtttcatcatcctg ataattatctattaattcctctgacgaatccataatggctcttctcacatcagaaaatggaatatcaggtagtaattcctctaagtca taatttccgtatattcttttattttttcgttttgcttggtaaagcattatggttaaatctgaatttaattccttctgaggaatgtatccttgttcat aaagctcttgtaaccattctccataaataaattcttgtttgggaggatgattccacggtaccatttcttgctgaataataattgttaatt
20 caatatatcgtaagttgcttttatctcctattttttttgaaataggtctaattttttgtataagtatttctttactttgatctgtcaatggttcag atacgacgactaaaaagtcaagatcactatttggttttagtccactctcaactcctgatccaaacatgtaagtaccaataaggtt attttttaaatgtttccgaagtatttttttcactttattaatttgttcgtatgtattcaaatatatcctcctcactattttgattagtacctattttat atccatagttgttaattaaataaacttaatttagtttatttatagatttcattggcttctaaattttttatctagataacttcgtatagcatac attatacgaagttatggatccagcttatcgataccgtcgacttagataattggaaaagaggaaaaaagcttaatcttttttcgaa
25 ggttaagctttttcttttatttataaaaagtgaactaactatcagaaagaaattatattaaattttatttttttgtttaaaaagtagattata taaaggcaagctaggtgggggaaaatatgtttaaaaaagaaaaagtcacagaatacatttggactatactaataccaacaa tcatcacttttatcattagttgggttgggtcttattacaatggtacttcgacagttagtattggacaacctacaaaagtttccggtcag tatatcacgccaataaatataagtccctatcatgatattaaggaattaagaataacttttccgcaaaaactagatgtaaaacaa attagttcaaatgagcctataaatgtaaaatcagataagaacaatataggagttgaaagtaattccacttttgagattgcgaaa
30 atcgttgaaaataatagcgttcagttgctaattacaacacaaaaaaagttaaacgataaggaaattagaattgataaaaatgg aaataacatttctgtaaattatgaatctcagattgttaatcctgcaaaaaaacaattaatcaatcttataattacgtcatctatttatttt ataatgcttaatatactagcattgattatgaacaaaagatgggataagtattatgcaaaaatgaaaaatgaaatcaaagaattt gaggataatgcaaaagatcttgataaaaaatcaaagaagaaaagcgaggaattatcggagctgcgaaagaccttgaacc aagcgtttgaggaaactgataggataaaatatcatgagaagaaaaaacaaatcctcctcttagctaagttaaacgattataa
35 aaaag aactaaccttttgg ag aaatacaataag aaaag ttctttatg aacttcctg atgg ag ataaaaaagcag ataaacta atagggacag (SEQ ID NO:99) 71
[0255] The nucleotide sequence of the phrK όe\eϊ\on construct is: [0256] gatgaaatggaagaagatcaagaagttcttgcgtattatagtctattagaagaaagacataaaatgttgctgcattct tcacg agg ag agcctttacaaaagcacacctattttactg aag acaatcaaaacttcataacaaaaacaaatg ataaattag aatacaacttttatttatttgaagcaatgtacgaggcatacaacaaaaactatgatcgagcaattaacctatatggattagctga 5 gaaaaagcttgcagaaattccagatgaaattgaagcagctgaattttactctaaagtctcttacttatatactcttgttaaacaaa gcattgtggcacaacattatataaaaaatgcaatttcaatatataagcgacaccctgattataaatgcaaactagctacatcaa caatgattgcagctgcaaactatgctgatatgaaacgatttgaggaagcagaacaatattacttagaagcaattgatattgca aaagaaacaaaagatgaatttttaaaagctcaattatttcacaatcttagtatcgtttattctgattggaacaaacctgataaatg cattgaatctcttgaaaaagcaataggaaatgaatcttggttacattcgatttattatataaattctttattcatgatgattaaagaac
10 tctttaaaattgacgaaaaaatgaaagccattaatttttacaataaagcacaggaaagactcatattaatggagaataaagtat acgaagccaaaatcagcatcctgtataacctttattgtggggaattaaaaaataatttcaataattgtattagtaatattgagttttt aaaacagcaaaatgaacttgaaagtgtagatgaattgtcctacatagctgcaaaaaggtttgaatcaataggtgcttttgaag aagcaacgagctttttcaatgcgaaaatttgggctgaacagaaaatgaatcaggtggagggaatcttatgaaaaaacttgtg ctttgcgtatctattttagctgtgattttaatcgacggtatcgataagctggatccataacttcgtataatgtatgctatacgaagttat
15 ctagataaaaaatttagaagccaatgaaatctataaataaactaaattaagtttatttaattaacaactatggatataaaatagg tactaatcaaaatag tg agg ag g atatatttg aatacatacg aacaaattaataaag tg aaaaaaatacttcgg aaacattta aaaaataaccttattggtacttacatgtttggatcaggagttgagagtggactaaaaccaaatagtgatcttgactttttagtcgtc gtatctgaaccattgacagatcaaagtaaagaaatacttatacaaaaaattagacctatttcaaaaaaaataggagataaaa gcaacttacgatatattgaattaacaattattattcagcaagaaatggtaccgtggaatcatcctcccaaacaagaatttatttat
20 ggagaatggttacaagagctttatgaacaaggatacattcctcagaaggaattaaattcagatttaaccataatgctttaccaa gcaaaacg aaaaaataaaag aatatacgg aaattatg acttag agg aattactacctg atattccattttctg atg tg ag aag agccattatggattcgtcagaggaattaatagataattatcaggatgatgaaaccaactctatattaactttatgccgtatgatttt aactatggacacgggtaaaatcataccaaaagatattgcgggaaatgcagtggctgaatcttctccattagaacatagggag agaattttgttagcagttcgtagttatcttggagagaatattgaatggactaatgaaaatgtaaatttaactataaactatttaaata
25 acagattaaaaaaattataaaaaaattgaaaaaatggtggaaacacttttttcaatttttttgttttattatttaatatttgggaaatatt cattctaattggtaatcagattttagaaaacaataaacccttgcatatgtctagataacttcgtataatgtatgctatacgaagttat gcggccgccatatgcatcctaggccaaaaggttgattaattaatttagccctactcaaacatttgagtgggcttttattttatgattt atgtccaccggtcagccctgctctgtggagcgcagtacctgcaaacgtaactgagatacttctcactgttttttgcccgagtaaa acttattaaagaacatcaagcaacacttataaatatccatcgtgatatttgtgggaaaatcaattgttttggatcgatgaaaacca
30 ccgccaagctcatctttactgtatccaattcctagacttattgttcgaccaactttattatatgtacgtgcccttcttgcgacttcctcac aaatctccaagagcacagctttaatctcttctctctttgtataatccctaaacaaaatctgactcttaccaaaactaatctgcccct gcatcaatggagctcctatttcagataaatcaattccgtgagcatgatagtacaactggtttcccattattccgaacttcttttcaag cagctctaaaggaaatttagctaactgacctacagttgatatacccattcgattcagatttctttccatcctccctcctatcccccac attttagacaaaggtcgaactttccagagtctatttggcacatcttcatatctccaacgtgcaataccactctttgttttcttactctcc
35 aggtcaagtgcaagcttactaagcaacatattgtcaccaattccaactgtgcacatcaaaccaaattctctccacatgctgcttt ggattgctttggccatttcttcaggattctcttttcctgcatctaaaaaagattaatcaattgaatacgtgtggacacatttttcagga 72
acaaatctgtaaaacagctttgtaatctcagttgaaactctgatgaaaagcttcatttgtggatttacaatgtatattcttggatcttc aggtatctcaaatagtctcgat (SEQ ID NO:100)
5 EXAMPLE 2
Protease Expression in Bacillus sp. Cells
[0257] BG2942 precursor host cells {ΔnprE, degU(Hy)32, amyE::[PxylRA-comK eryR] and the derived modified strains BG2942phrA::spc (CB2-1 ), BG2942phrE::spc (CB2-2), BG2942 phrCispc (CB2-3), BG2942 phrF:spc (CB2-4), BG2942 phrG:spc (CB2-5), 10 BG2942 phrl:spc (CB207) and BG2942 phrK:spc (CB2-8) were streaked onto Luria-
Bertani medium-1 .6% skim milk plates for overnight growth at 370C. For each strain, single colonies were then inoculated into 10 ml of Luria-Bertani medium and grown overnight at 300C. The pre-cultures were used to inoculate 25ml of freshly prepared 2χ SNB medium in a 250-ml flask. This medium contained the following (per liter): 16 g of Difco 15 nutrient broth, 50 ml of 10% maltrin M150, and 40 ml of 25χ SNB salts (25χ salts contain
[per liter] 3.7 g of CaCI2 2H2O, 9.6 mg of FeSO4 7H2O, 6 mg of MnCI2 4H2O, 25.0 g of KCI, and 3.26 g of MgSO4 7H2O). The strains were grown for nine hours and samples were taken at hourly intervals. The supematants were tested for AprE expression and activity. 20 [0258] Each of the Bacillus subtilis cultures was assayed for the production of the native subtlisin AprE (Swiss-Prot:P37562):
MRSKKLWISLLFALTLIFTMAFSNMSVQAAGKSSTEKKYIVGFKQTMSAMSSAKKKDVISEKGG KVQKQFKYVNAAAATLDEKAVKELKKDPSVAYVEEDHIAHEYAQSVPYGISQIKAPALHSQGYT GSNVKVAVIDSGIDSSHPDLNVRGGASFVPSETNPYQDGSSHGTHVAGTIAALNNSIGVLGVS 25 PSASLYAVKVLDSTGSGQYSWIINGIEWAISNNMDVINMSLGGPTGSTALKTVVDKAVSSGIVV AAAAGNEGSSGSTSTVGYPAKYPSTIAVGAVNSSNQRASFSSAGSELDVMAPGVSIQSTLPG GTYGAYNGTSMATPHVAGAAALILSKHPTWTNAQVRDRLESTATYLGNSFYYGKGLINVQAAA Q (SEQ ID NO:21 ) The enzyme produced was assayed for activity against the substrate, succinyl -L-Ala-L-Ala-L-Pro-L-Phe-p-nitroanalide (AAPF). The assay measured the production 30 of protease as the increase in absorbance at 405 nm/min resulting from the hydrolysis and release of p-nitroanaline (Estell et al., J Biol Chem., 260:6518-6521 (1985)). The measurements were made using the Sofmax Pro software, and the specified conditions were set as: Type: Kinetic; Reduction: Vmax Points (Read best 15/28 points); Lm 1 : 405 nm; Time: 5 minutes; and Interval: 1 1 Seconds. Twenty microliters of each of the B. subtilis supematants 35 were diluted in 10Oul of Tris Buffer, containing 10 mM Tris + 0.005% TWEENΘ-80, pH 8.6; and 73
25ul of 100 mg/ml AAPF. Assays were done in microtiter plates and the Softmax Pro Software was used.
[0259] The relative amounts and the activities of the AprE protease produced by the unmodified precursor strain BG2942 and from each of the modified strains CB2-1 , 2-2, 2- 5 3, 2-4, 2-5, 2-7, and 2-8 were determined and graphed as a function of absorbance
(A405nm) as shown in Figure 3. The results for the BG2942 derived strains (CB2-1 and CB2-2) carrying the deletion of the phrA and phrE genes, respectively, are also shown in Figure 4.
[0260] The data shown in Figures 3 and 4 show that that the deletion of phrA, phrE, phrC, 10 phrG, phr I and phrK increases AprE expression in the modified CB2-1 , CB2-2, CB2-3,
CB2-5, CB2-7 and CB2-8 Bacillus sp. cells when compared to the production in the unmodified parent strain BG2942 (diamonds).
EXAMPLE 3
15 Protease Expression in Bacillus sp. Cells Containing a Deletion of the phrA or the phrE gene
[0261] The inactivation constructs cassettes of phrA and phrE (SEQ ID NOS:17 and 18, respectively) were introduced into the Bacillus subtilis strain CF471 . The CF471 strain is the BG3594 strain described above (degU(Hy)32, oppA, LspollE, AaprE, AnprE) and that
20 further comprises the PaprE-FNA expression construct (SEQ ID NO:19), which encodes for the protease FNA (SEQ ID NO:20). The resulting modified strains CB3-47 (BG3594 phrA::spcR, aprE:[PaprE-FNA, caf\), and CB3-48 (BG3594 phrE::spcR, aprE:[PaprE-FNA, caf\) were grown in autoclaved suitable growth medium for 50 hours. Samples of the cell culture were centrifuged, and the production of protease was quantified as a function of
25 the activity of the secreted FNA protease present in the supernatants according to the
AAPF assay described above.
[0262] The results are graphed in Figure 5, and they show that the modified cells carrying deletions of the phrA (triangles) and the phrE (squares) genes produce PaprE dependent FNA protease expression at a greater level than that produced by the unmodified parent
30 strain CF471 (BG3594, aprE::[PaprE-FNA]; diamonds), which does not contain a deletion of either phrA and/or phrE.
[0263] Therefore, deleting phrA and phrE in a Bacillus sp. cell (e.g., a Bacillus subtilis cell), enhances the level of production of the protease FNA. 74
EXAMPLE 4
Protease Expression in Bacillus sp. Cells
Containing Deletions of phrA and phrE Genes
[0264] The spectinomycin cassette associated with the deletion of phrA was removed 5 through the lox recombination system in strain CF471 (BG3594, aprE::[PaprE-FNA]).
The resulting strain was transformed with the construct carrying the deletion of the phrE gene. After the antibiotic resistance cassette was removed the strain was tested for PaprE dependent protease expression. Figure 6 shows a graph of protease expression in the double phr deleted strains (CB4-68: BG3594 phrA, phrE, aprE:[PaprE-FNA, caf\; 10 triangles, CB4-69: BG3594 phrA, phrE aprE:[PaprE-FNA, caf\; crosses) compared to the phrA deleted strains (CB4-46: BG3594 phrA, aprE:[PaprE-FNA, cat\; diamonds, CB4-48: BG3594 phrE, aprE:[ PaprE- FN A, caf\; squares). The BG3594 derived strains carrying the deletion of the two phrA and phrE genes were grown in suitable growth medium for 50 hours and the supematants were tested in an AAPF assay.
15 [0265] The strains carrying both deletions of the phrA and phrE genes (i.e., strains CB4-
68 and CB4-69 showed an increase in FNA production when compared to the production by the strains CB4-46 and CB4-48, which both carried the deletion of only the phrA gene). [0266] Therefore, deleting the phrA and the phrE genes from a Bacillus sp. cell (e.g., Bacillus subtilis) enhances the level of production of FNA when compared to the level of 20 production by the Bacillus subtilis cells that were modified to contain the deletion of only the phrA gene.
EXAMPLE 5
Overexpression of YmaH: Generation of SigA and SigH Polynucleotide Constructs
25 [0267] Polynucleotide constructs SigH, SigA1 , SigA2, and SigA3 were generated to overexpress YmaH in host cells of Bacillus subtilis.
[0268] PCR primers were designed to be homologous to the Bacillus subtilis genome (Figure 7A) and to contain a 6 base pair restriction enzyme site located 6 base pairs from the 5' end of the primer. Primers were designed to engineer unique restriction sites at the
30 upstream and downstream ends of the construct. The primary source of genome sequence (Kunst et al, Nature 390:249-256 [1997]), gene localization, and start and stop codon information was obtained from the NCBI Database: Completed Bacillus subtilis subsp. subtilis str. 168, or from the SubtiList World Wide Web Server known to those in the art (Moser, 1. 1998. FEBS Lett. 430(1 -2):28-36). The sequence considered is
35 reported as SEQ ID NO:22 with coordinates 1865428-1867019 in the NCBI database,
ACC No NC000964 is shown in Figure 7A. 75
[0269] tcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaaggatgtttaccgat gcaaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaagagagtgatacata ttgaataatacgaagcagcccgttgtcattttagtcggaccgacggcagtggggaaaaccaatttaagtattcagctagccaa atccttaaacgcggaaattatcagcggagattcgatgcagatttataaagggatggatattggaacagctaaaattaccgaac 5 aggagatggagggagtgccccatcatctgattgacattttagatccccaagactctttctctactgccgattatcaaagcttagta agaaataaaatcagcgagattgcaaatagaggaaagcttccgatgattgacggcggtacagggctttatatacaatctgagc tttacgattatacatttacggaagaggcaaatgatcccgtgtttcgagagagcatgcaaatggctgctgagcgggaaggcgct gactttcttcatgccaaacttgctgcagcagatcccgaggcagcagctgcgattcatccgaataatacaagaagagtcattcg cgcactggaaattttacatacgtccggaaaaacgatgtcccagcatttgaaggaacaaaaacgagaacttctgtacaatgca 10 gtgttaattggcctgacaatggatagagacacgctttacgaaagaattaatcagcgggtcgatttgatgatgcagtcaggcctt cttccggaagtgaaacgcttatacgacaagaacgtgagagactgtcaatcaatacaggcgataggctataaagagctgtat gcatattttgacggttttgtgacactttccgatgctgtcgaacagctaaagcagaactcgaggcggtatgcgaaacgccagctg acgtggtttcgcaacaaaatgcaggtcacatggttcgatatgacaccgcctgttgatatggagctgaaaaaaaaggaaatttt cacacatatagcaggaaaactcgaactttaatcgaaactgtatgatatagagaatcaaggaggacgaaacatgaaaccga 15 ttaatattcaggatcagtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgctgaacggctttcagttgcgggg ccaggtgaaaggctttgataactttaccgtattgttggaatcggaaggtaagcagcagcttatatataaacatgcgatctcaac gtttgcgccgcaaaaaaacgtccagcttgaactcgaatagatcaaaaaatgccatgtcaagacatgaggaaaggctgtcg ggggttcccggcggccatttttaacatgaatccacttttgctccaagctttttgtgtaagctgaccatgccaaggcacggtcttttttt atgag (SEQ ID NO:22). 20
[0270] The SigH construct (Figure 5B; SEQ ID NO:23) ggcaccgaattcgacgtggtttcgcaacaaaatgcaggtcacatggttcgatatgacaccgcctgttgatatggagctgaaaaaaaa ggaaattttcacacatatagcaggaaaactcgaactttaatcgaaactgtatgatatagagaatcaaggaggacgaaacatgaaac cgattaatattcaggatcagtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgctgaacggctttcagttgcggggcc 25 aggtgaaaggctttgataactttaccgtattgttggaatcggaaggtaagcagcagcttatatataaacatgcgatctcaacgtttgcgc cgcaaaaaaacgtccagcttgaactcgaatagatcaaaaaatgccatgtcaagacatgaggaaaggctgtcgggggttcccggcg gccatttttaacatgaatccacttttgctccaagctttttgtgtaagctgaccatgccaaggcacggtctttttttatgagggatccggagcc (SEQ ID NO:23) was generated to comprise the polynucleotide sequence encompassing the Sigma H promoter 30 aaaggaaattttcacacatatagcaggaaaactcgaactttaatcgaaactgtatgatatagagaatcaaggaggacgaaac; SEQ ID NO:48, and the adjacent sequence atgaaaccgattaatattcaggatcagtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgctgaacggctttcagttg cggggccaggtgaaaggctttgataactttaccgtattgttggaatcggaaggtaagcagcagcttatatataaacatgcgatctcaac gtttgcgccgcaaaaaaacgtccagcttgaactcgaatag; SEQ ID NO:46 (NP 389616), encoding the YmaH 35 protein
MKPINIQDQFLNQIRKENTYVTVFLLNGFQLRGQVKGFDNFTVLLESEGKQQLIYKHAISTFAPQ 76
KNVQLELE; SEQ ID NO:45 (Swiss-Prot:P3756). The Sigma H promoter is naturally located within the polynucleotide sequence encoding the miaA gene, close to the 3' end of the gene, and immediately upstream of the ymaH gene. The entire Sigma H promoter and adjacent ymaH coding sequence was amplified by PCR using the forward primer P1 : 5 ggcaccgaattcgacgtggtttcgcaacaaaatgcag (SEQ ID NO:24; position 987 to 101 1 of SEQ ID NO:22), with an EcoRI restriction site added at the 5' end, and a reverse primer P2: ggcaccggatccctcataaaaaaagaccgtgccttgg (SEQ ID NO:25, at position 1472 to 1496 of SEQ ID NO:22), with and added BamhW restriction site (Figure 7B).
[0271] The SigA1 and SigA2 constructs were generated in a three step process by 1 )
10 amplifying individual fragments of Bacillus subtilis chromosomal DNA, 2) purifying and assembling the fragments; and 3) amplifying the assembled product by PCR. [0272] The SigA1 construct (Figure 7C; SEQ ID NO:26) gcgccgaattctcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaaggatgtttaccgat gcaaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaagagagtgatacatattgaat
15 aatacgaagcagccccacacatatagcaggaaaactcgaactttaatcgaaactgtatgatatagagaatcaaggaggacgaaa catgaaaccgattaatattcaggatcagtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgctgaacggctttcagtt gcggggccaggtgaaaggctttgataactttaccgtattgttggaatcggaaggtaagcagcagcttatatataaacatgcgatctcaa cgtttgcgccgcaaaaaaacgtccagcttgaactcgaatagatcaaaaaatgccatgtcaagacatgaggaaaggctgtcggggg ttcccggcggccatttttaacatgaatccacttttgctccaagctttttgtgtaagctgaccatgccaaggcacggtctttttttatgagggat
20 ccggtgcc (SEQ ID NO:26) was generated using two sets of primers. A first set of primers: forward primer P3: gcgccgaattctcataccctgaaaggaaagacaagg (SEQ ID NO:27) located at the 5' end of SEQ ID NO:22; and reverse primer P4: ttcgagttttcctgctatatgtgtggggctgcttcgtattattcaatatg (SEQ ID NO:28) located from bp 153 to bp 177 on the SEQ ID NO:22, was used to amplify a
25 first fragment containing the SigA promoter, Ribosome Binding Site, start codon and the first few codons of the miaA gene ttgaataatacgaagcagcccgttgtcattttagtcggaccgacggcagtggggaaaaccaatttaagtattcagctagccaaatcctt aaacgcggaaattatcagcggagattcgatgcagatttataaagggatggatattggaacagctaaaattaccgaacaggagatgg agggagtgccccatcatctgattgacattttagatccccaagactctttctctactgccgattatcaaagcttagtaagaaataaaatcag
30 cgagattgcaaatagaggaaagcttccgatgattgacggcggtacagggctttatatacaatctgagctttacgattatacatttacgga agaggcaaatgatcccgtgtttcgagagagcatgcaaatggctgctgagcgggaaggcgctgactttcttcatgccaaacttgctgca gcagatcccgaggcagcagctgcgattcatccgaataatacaagaagagtcattcgcgcactggaaattttacatacgtccggaaa aacgatgtcccagcatttgaaggaacaaaaacgagaacttctgtacaatgcagtgttaattggcctgacaatggatagagacacgct ttacgaaagaattaatcagcgggtcgatttgatgatgcagtcaggccttcttccggaagtgaaacgcttatacgacaagaacgtgaga
35 gactgtcaatcaatacaggcgataggctataaagagctgtatgcatattttgacggttttgtgacactttccgatgctgtcgaacagctaa agcagaactcgaggcggtatgcgaaacgccagctgacgtggtttcgcaacaaaatgcaggtcacatggttcgatatgacaccgcct 77
gttgatatggagctgaaaaaaaaggaaattttcacacatatagcaggaaaactcgaactttaa; SEQ ID NO:49. A second set of primers, forward primer P5: catattgaataatacgaagcagccccacacatatagcaggaaaactcgaa (SEQ ID NO:29) located from bp 1071 to bp 1095 on the SEQ ID NO:22 and reverse primer P2 (SEQ ID NO:25), were used to amplify 5 a second fragment containing the DNA sequence encoding the YmaH protein. Reverse primer P4 and forward primer P5 are fusion primers that were designed to contain tails that are complementary to each other but that are not homologous to the sequence that is being amplified to eliminate the intervening miaA coding sequence. The two fragments were annealed, and the resulting SigA1 construct contained the SigA promoter (SEQ ID NO:47)
10 tcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaaggatgtttaccgatgcaaaaaaag ggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaagagagtgatacata; (SEQ ID NO:47), the ribosome binding site aagagag; SEQ ID NO:50, and the transcription start site of the miaA gene. The SigA1 construct was amplified using forward primer P3 (SEQ ID NO:27) and reverse primer P2 (SEQ ID NO:25), which respectively contain an EcoRI and a BamYW
15 restriction site, and ligated into the polylinker of replicating plasmid pBS19. The polynucleotide sequence of pBS19 is shown below (SEQ ID NO:30). The pBS19 plasmid can replicate in E. coli and Bacillus subtilis, and carries the chloamphenicol resistance selection marker gene, gaattcgagctcggtacccggggatcctctagagtcgacctgcaggcatgcaagcttggcgatcctgcctcgcgcgtttcggtgatga cggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaagcggatgccgggagcagacaagcccgtcag
20 ggcgcgtcagcgggtgttggcgggtgtcggggcgcagccatgacccagtcacgtagcgatagcggagtgtatactggcttaactatg cggcatcagagcagattgtactgagagtgcaccatatgcggtgtgaaataccgcacagatgcgtaaggagaaaataccgcatcag gcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatac ggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaagg ccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccga
25 caggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccg cctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtg tgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgcc actggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacg gctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaa
30 caaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttt tctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatctggagctgtaatat aaaaaccttcttcaactaacggggcaggttagtgacattagaaaaccgactgtaaaaagtacagtcggcattatctcatattataaaa gccagtcattaggcctatctgacaattcctgaatagagttcataaacaatcctgcatgataaccatcacaaacagaatgatgtacctgt aaagatagcggtaaatatattgaattacctttattaatgaattttcctgctgtaataatgggtagaaggtaattactattattattgatatttaa
35 gttaaacccagtaaatgaagtccatggaataatagaaagagaaaaagcattttcaggtataggtgttttgggaaacaatttccccgaa ccattatatttctctacatcagaaaggtataaatcataaaactctttgaagtcattctttacaggagtccaaataccagagaatgttttagat 78
acaccatcaaaaattgtataaagtggctctaacttatcccaataacctaactctccgtcgctattgtaaccagttctaaaagctgtatttga gtttatcacccttgtcactaagaaaataaatgcagggtaaaatttatatccttcttgttttatgtttcggtataaaacactaatatcaatttctgt ggttatactaaaagtcgtttgttggttcaaataatgattaaatatctcttttctcttccaattgtctaaatcaattttattaaagttcatttgatatgc ctcctaaatttttatctaaagtgaatttaggaggcttacttgtctgctttcttcattagaatcaatccttttttaaaagtcaatattactgtaacata 5 aatatatattttaaaaatatcccactttatccaattttcgtttgttgaactaatgggtgctttagttgaagaataaaagaccacattaaaaaat gtggtcttttgtgtttttttaaaggatttgagcgtagcgaaaaatccttttctttcttatcttgataataagggtaactattgccggttgtccattcat ggctgaactctgcttcctctgttgacatgacacacatcatctcaatatccgaatagggcccatcagtctgacgaccaagagagccata aacaccaatagccttaacatcatccccatatttatccaatattcgttccttaatttcatgaacaatcttcattctttcttctctagtcattattattg gtccattcactattctcattcccttttcagataattttagatttgcttttctaaataagaatatttggagagcaccgttcttattcagctattaataa
10 ctcgtcttcctaagcatccttcaatccttttaataacaattatagcatctaatcttcaacaaactggcccgtttgttgaactactctttaataaa ataatttttccgttcccaattccacattgcaataatagaaaatccatcttcatcggctttttcgtcatcatctgtatgaatcaaatcgccttcttct gtgtcatcaaggtttaattttttatgtatttcttttaacaaaccaccataggagattaaccttttacggtgtaaaccttcctccaaatcagacaa acgtttcaaattcttttcttcatcatcggtcataaaatccgtatcctttacaggatattttgcagtttcgtcaattgccgattgtatatccgatttat atttatttttcggtcgaatcatttgaacttttacatttggatcatagtctaatttcattgcctttttccaaaattgaatccattgtttttgattcacgtag
15 ttttctgtattcttaaaataagttggttccacacataccaatacatgcatgtgctgattataagaattatctttattatttattgtcacttccgttgc acgcataaaaccaacaagatttttattaatttttttatattgcatcattcggcgaaatccttgagccatatctgacaaactcttatttaattcttc gccatcataaacatttttaactgttaatgtgagaaacaaccaacgaactgttggcttttgtttaataacttcagcaacaaccttttgtgactg aatgccatgtttcattgctctcctccagttgcacattggacaaagcctggatttacaaaaccacactcgatacaactttctttcgcctgtttc acgattttgtttatactctaatatttcagcacaatcttttactctttcagcctttttaaattcaagaatatgcagaagttcaaagtaatcaacatt
20 agcgattttcttttctctccatggtctcacttttccactttttgtcttgtccactaaaacccttgatttttcatctgaataaatgctactattaggaca cataatattaaaagaaacccccatctatttagttatttgtttagtcacttataactttaacagatggggtttttctgtgcaaccaattttaagggt tttcaatactttaaaacacatacataccaacacttcaacgcacctttcagcaactaaaataaaaatgacgttatttctatatgtatcaagat aagaaagaacaagttcaaaaccatcaaaaaaagacaccttttcaggtgctttttttattttataaactcattccctgatctcgacttcgttctt tttttacctctcggttatgagttagttcaaattcgttctttttaggttctaaatcgtgtttttcttggaattgtgctgttttatcctttaccttgtctacaaa
25 ccccttaaaaacgtttttaaaggcttttaagccgtctgtacgttccttaag (SEQ ID NO:30)
[0273] The SigA2 construct (Figure 7C; SEQ ID N0:31 ) gcgccgaattctcataccctgaaaggaaagacaagggaaattgtcggcaatgagccgctcggcaggtagaaggatgtttaccgat gcaaaaaaagggcaaaatggataggtggttgtccatgttgaatgctataatgggggagatttataaaagagagtgctcgaactttaat cgaaactgtatgatatagagaatcaaggaggacgaaacatgaaaccgattaatattcaggatcagtttttgaatcaaatccggaaag
30 aaaatacgtatgtcactgtttttttgctgaacggctttcagttgcggggccaggtgaaaggctttgataactttaccgtattgttggaatcgg aaggtaagcagcagcttatatataaacatgcgatctcaacgtttgcgccgcaaaaaaacgtccagcttgaactcgaatagatcaaaa aatgccatgtcaagacatgaggaaaggctgtcgggggttcccggcggccatttttaacatgaatccacttttgctccaagctttttgtgta agctgaccatgccaaggcacggtctttttttatgagggatccggtgcc (SEQ ID NO:31 )
[0274] was generated according to the method described for the construction of the
35 SigA1 construct using the following primers (Figure 7D). The first fragment containing the
SigA promoter was amplified using forward primer P3 (SEQ ID NO:27) and reverse fusion 79
primer P7:
[0275] catacagtttcgattaaagttcgagcactctcttttataaatctccccca (SEQ ID NO:33) [0276] located from bp 125 to bp 149 on the SEQ ID NO:22. The second fragment containing the DNA sequence encoding the YmaH protein was amplified using the 5 forward fusion primer P6:
[0277] tgggggagatttataaaagagagtgctcgaactttaatcgaaactgtatg (SEQ ID NO:32) located from bp 1090 to bp 1 1 14 on the SEQ ID NO:22 and the reverse primer P2 (SEQ ID NO:25). The two fragments were annealed, and the resulting SigA2 construct contained the SigA promoter, the ribosome binding site GGAGG; SEQ ID NO:51 ) and the 10 transcription start site of the ymati gene.
[0278] The invention also encompasses a fourth SigA construct (SigA3; SEQ ID NO:22; Figure 7E), which is generated by amplifying the miaA ymati region of the Bacillus chromosomal DNA that includes a SigA promoter, the region encoding the MiaA protein, the a YmaH promoter and the region encoding the YmaH protein. 15 [0279] The SigA3 construct was generated using forward primer P8 gcgcgcgaattcagggaaattgtcggcaatgagccgctcggc (SEQ ID NO:34) and reverse primer P9 gcgcgccatggctgattcgtctcagttctgcttcactttca (SEQ ID NO:35). SEQ ID NO:34 places an EcoRI restriction site at the 5' end of the fragment, while SEQ ID NO:35 places a Ncol site at the 3' end. This allows to clone the fragment in the pBN3 vector reported as SEQ ID 20 NO:36, shown below: gacgtctaagaaaccattattatcatgacattaacctataaaaataggcgtatcacgaggccctttcgtcttcaagaattaattctcatgttt gacagcttatcatcgataagcttgcatgcctgcaggtcgactctagaggatccccgggtaccgagctcgaattccttaaggaacgtac agacggcttaaaagcctttaaaaacgtttttaaggggtttgtagacaaggtaaaggataaaacagcacaattccaagaaaaacacg atttagaacctaaaaagaacgaatttgaactaactcataaccgagaggtaaaaaaagaacgaagtcgagatcagggaatgagttt 25 ataaaataaaaaaagcacctgaaaaggtgtctttttttgatggttttgaacttgttctttcttatcttgatacatatagaaataacgtcatttttat tttagttgctgaaaggtgcgttgaagtgttggtatgtatgtgttttaaagtattgaaaacccttaaaattggttgcacagaaaaaccccatct gttaaagttataagtgactaaacaaataactaaatagatgggggtttcttttaatattatgtgtcctaatagtagcatttattcagatgaaaa atcaagggttttagtggacaagacaaaaagtggaaaagtgagaccatggagagaaaagaaaatcgctaatgttgattactttgaact tctgcatattcttgaatttaaaaaggctgaaagagtaaaagattgtgctgaaatattagagtataaacaaaatcgtgaaacaggcgaa 30 agaaagttgtatcgagtgtggttttgtaaatccaggctttgtccaatgtgcaactggaggagagcaatgaaacatggcattcagtcaca aaaggttgttgctgaagttattaaacaaaagccaacagttcgttggttgtttctcacattaacagttaaaaatgtttatgatggcgaagaat taaataagagtttgtcagatatggctcaaggatttcgccgaatgatgcaatataaaaaaattaataaaaatcttgttggttttatgcgtgca acggaagtgacaataaataataaagataattcttataatcagcacatgcatgtattggtatgtgtggaaccaacttattttaagaataca gaaaactacgtgaatcaaaaacaatggattcaattttggaaaaaggcaatgaaattagactatgatccaaatgtaaaagttcaaatg 35 attcgaccgaaaaataaatataaatcggatatacaatcggcaattgacgaaactgcaaaatatcctgtaaaggatacggattttatga ccgatgatgaagaaaagaatttgaaacgtttgtctgatttggaggaaggtttacaccgtaaaaggttaatctcctatggtggtttgttaaa 80
ag aaatacataaaaaattaaaccttg atg acacag aag aaggcg atttg attcatacag atg atg acg aaaaagccg atg aag at ggattttctattattgcaatgtggaattgggaacggaaaaattattttattaaagagtagttcaacaaacgggccagtttgttgaagattag atgctataattgttattaaaaggattgaaggatgcttaggaagacgagttattaatagctgaataagaacggtgctctccaaatattcttat ttagaaaagcaaatctaaaattatctgaaaagggaatgagaatagtgaatggaccaataataatgactagagaagaaagaatgaa 5 gattgttcatgaaattaaggaacgaatattggataaatatggggatgatgttaaggctattggtgtttatggctctcttggtcgtcagactga tgggccctattcggatattgagatgatgtgtgtcatgtcaacagaggaagcagagttcagccatgaatggacaaccggtgagtggaa ggtggaagtgaattttgatagcgaagagattctactagattatgcatctcaggtggaatcagattggccgcttacacatggtcaatttttct ctattttgccgatttatgattcaggtggatacttagagaaagtgtatcaaactgctaaatcggtagaagcccaaacgttccacgatgcga tttgtgcccttatcgtagaagagctgtttgaatatgcaggcaaatggcgtaatattcgtgtgcaaggaccgacaacatttctaccatccttg
10 actgtacaggtagcaatggcaggtgccatgttgattggtctgcatcatcgcatctgttatacgacgagcgcttcggtcttaactgaagca gttaagcaatcagatcttccttcaggttatgaccatctgtgccagttcgtaatgtctggtcaactttccgactctgagaaacttctggaatcg ctagagaatttctggaatgggattcaggagtggacagaacgacacggatatatagtggatgtgtcaaaacgcataccattttgaacg atgacctctaataattgttaatcatgttggttacctgcctcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccgga gacggtcacagcttgtctgtaagcggatgccgggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggc
15 gcagccatgacccagtcacgtagcgatagcggagtgtatactggcttaactatgcggcatcagagcagattgtactgagagtgcacc atatgcggtgtgaaataccgcacagatgcgtaaggagaaaataccgcatcaggcgctcttccgcttcctcgctcactgactcgctgcg ctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaa agaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccc tgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctgg
20 aagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcata gctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgc gccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagca gagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgct ctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgca
25 agcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaact cacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagt atatatgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcct gactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcac cggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccag
30 tctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggtgtc acgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggtt agctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataattctcttactgtc atgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcc cggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactct
35 caaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctg ggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttccttttt 81
caatattattgaagcatttatcagggttattgtctcatgagcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcg cacatttccccgaaaagtgccacct (SEQ ID NO:36)
[0280] All PCR reactions were performed in 50 ul volume conatining 1 -2 ul DNA or from a colony resuspension, 5 ul of 1 OX Pfu Ultra buffer (Stratagene), 1 uL of 1 OmM dNTP blend 5 (Roche), 0.5 uL of 0.2uM primers, 1 ul Pfu Ultra High Fidelity Polymerase, and the volume adjusted with water to have a total volume of 50 ul. The PCR conditions were: 950C for 2 min, 30 cycles of 95O for 30 sec, 620C for 30 sec, 72°C for 1 min, followed by 1 cycle of 72O fOr 10 min. [0281] The obtained PCR fragments were gel purified using Qiagen Gel Purification Kit
10 according to the manufacturer's instructions.
[0282] Fusion constructs were obtained by annealing 0.25ul aliquots of purified PCR fragments that were mixed together and added into fresh PCR mix following the above recipe using primers P3 and P2. The total volume of the PCR mixture was 50 μl. The PCR conditions were the same as above adjusting the annealing temperature according to the
15 Tm of the primers.
[0283] The desired SigH, SigA1 , and SigA2 constructs were ligated into pBS19 plasmids that had been digested with EcoRI and BamhW to generate SigA and SigH expression vectors that were used to transform host cells as described in Example 4. [0284] The transformation mixture was plated on LB+1 .6% skim milk+5 ug/ml cmp plates.
20 The next day, halo-forming colonies were picked and plated for single colonies. The colony purification was performed twice. Five individual clones were analyzed by sequencing of aprE promoter region. All of them had consensus sequence at -35 region of aprE promoter.
25 EXAMPLE 6
Host Cell Transformation and Expression of AprE Protease
[0285] Five microliters of the ligation mixture containing either the SigA1 or SigH constructs were used to transform E. coli Topi 0 cells (Invitrogen) by electroporation. The transformed cells were plated onto LB agar plates containing 5ppm/ml chloramphenicol
30 (Cm), and colonies were allowed to grow overnight at 37C. Individual colonies were picked and transferred to tubes containing 5 ml of LB+ 5ppm/ml Cm. Cultures were grown overnight at 37°C while shaking at 250 rpm. Plasmid DNA was prepared from the E. coli cultures, and a portion of the plasmid DNA preparation was sequenced (Sequetech). Automated sequence analysis was performed using Phrep, Phrap, Consed,
35 Custal W software.
[0286] The plasmid bearing the right construct from each of the expression vectors was 82
used to transform Bacillus subtilis host cells. The expression vectors containing the SigH (SEQ ID NO:23) and SigA1 (SEQ ID NO:26) and SigA2 (SEQ ID NO:31 ) constructs were named pBS19 ymaH-H and pBS19 ymaH-A1 and pBS19 ymaH-A2 were transformed into B. subtilis strains BG2941 and BG2942 as follows. Two microliters of the plasmid DNA 5 carrying the appropriate constructs were used to transform 10Oμl of B. substilis cells BG
2941 (ΔnprE, amyE::PxylRA-comK-phleoR) and BG2942 {ΔnprE, degU(Hy)32, amyE::PxylRA-comK-eryR). The BG2941 and BG2942 transformants carrying the SigH constructs were named 41 SigH and 42SigH, respectively; and the BG2941 and BG2942 transformants carrying the SigA1 constructs were named 41 SigA1 and 42SigA1 ,
10 respectively. Some BG2941 and BG2942 host cells were also transformed with a control
(empty) pBS19 plasmid, and were named 41 pBS19 and 42pBS19. Both BG2941 and BG2942 host cells carry the deletion of the nprE gene, which abolishes most of the non- aprE background proteolytic activity, thus facilitating the measurement of the alkaline protease (AprE) produced. The BG2941 and BG2942 host cells also carry the cassette
15 amy E:\PxylRA-comK-phleoR, which allows to make competent cells by inducing a growing culture with xylose (Hahn et al., MoI Microbiol. 18:755-67 [1995]). The BG2942 host cells also carry a mutation in the degU gene (degU(Hy)32 mutation) , which alone increases the level of subtilisin secreted by the host cells by several fold relative to that secreted by host cells that do not carry the degU(Hy) mutation (Msadek et al. J Bacteriol,
20 172:824-834 [1990]).
[0287] The effect of overexpressing YmaH in Bacillus host cells was determined qualitatively and quantitatively in assays described in Example 7.
EXAMPLE 7
25 Effect of Overexpressing YmaH on the Production of Protease
[0288] Casein assay: - The effect of overexpressing YmaH on the production of endogenous AprE subtilisin protease by Bacillus host cells was determined first by a qualitative assay that compares the size of the halos produced by the colonies grown on agar plates containing casein in the form of skim milk. As protease enzyme is secreted by
30 the Bacillus cells, it digests the casein in the skim milk, and forms regions of clearing, or halos around the growing colony. Host cells which have an inactive protease will exhibit little or no halo around the colonies. Thus, the size of the halo provides a qualitative assessment of the amount of protease that is produced by the secreting colony (Wells, T. A. et al. Nucleic Acids Res., 1 1 , 791 1 -7925: [1983]).
35 [0289] BG2941 and BG2942 Bacillus subtilis host cells transformed with SigH or SigA1 expression vectors were plated onto LB agar plates containing 1 .6% skim milk and 5ppm 83
Cm, and incubated overnight in at 37°C. The following day, colonies from some of the transfomants were single colony isolated on LB agar plates with 5ppm Cm, and the plates were incubated overnight at 37C. Single colony isolates were picked and patched on the same type of plates and incubated again at 37°C overnight.
5 [0290] The largest halos were produced by the 42SigH host cells. The 42SigH cells are
BG2942 Bacillus subtilis host cells that carry the degU(Hy)32 mutation and the SigH construct that enables the overexpression of YmaH protein. In particular, the size of the halos of the 42SigH cells evidences that overexpressing ymaH further enhances the production of subtilisin in host cells that already produce levels of the enzyme that are
10 greater than those produced by wild-type cells. For example, 42SigH cells produce halos that are bigger than those produced by the 42pBS19 cells, which carry the degU(Hy) mutation but do not carry a construct that enables overexpression of ymaH, but which in turn produce halos that are bigger than the halos produced by the 41 pBS19 cells, which are BG2941 Bacillus subtilis host cells that do not carry the degU(Hy)32 mutation and do
15 not carry a construct that enables overexpression of ymaH. The halos produced by the
42SigH cells were also greater than the halos produced by the 41 SigH cells, which do not carry the degU(Hy) mutation but carry the SigH construct to enable overexpression of YmaH. [0291] AAPF assay - The production of subtilisin by transformed Bacillus host cells
20 42SigH, 42SigA1 , 41 SigA2, which overexpress ymaH, and their respective controls
42pBS19, and 41 pBS19 was quantified as a function of the activity of the secreted AprE protease. The proteolytic activity of the secreted protease was determined as the rate of hydrolysis of the substrate succinyl -L-Ala-L-Ala-L-Pro-L-Phe-p-nitroanalide (AAPF from Sigma Chemical Co). The assay measured the level of production of protease as the
25 absorbance at 405 nm/min resulting from the hydrolysis and release of p-nitroanaline
(Estell et al., J Biol Chem., 260:6518-6521 [1985]). The measurements were made using the Sofmax Pro software, and the specified conditions were set as: Type: Kinetic; Reduction: Vmax Points (Read best 15/28 points); Lm1 : 405 nm; Time: 5 minutes; and Interval: 1 1 Seconds.
30 [0292] Liquid cultures of B. subtilis control host cells 41 pBS19 and 42pBS19, and host cells overexpressing YmaH were obtained by inoculating 5ml of LB containing 5pmm of chloramphenicol (Cm) with single colonies of transformed cells 41 SigH and 42SigA1 and 42SigH, and allowing the cells to grow while shaking at 37 C until growth reached mid- logarithmic phase. Each of the cultures was diluted 1 :100 with fresh complex medim
35 containing 5ppm Cm, and allowed to grow at 37° C while shaking at 250 rpm. Samples of the cultures were taken at the times indicated in the figures. The samples were 84
centrifuged and the supernatants were tested for production of subtilisin. [0293] Ten microliters of each of the B. subtilis cultures supernatants were diluted 10OuI of Tris Buffer, containing 10 mM Tris + 0.005% TWEEN®-80, pH 8.6; and 25ul of 100 mg/ml AAPF. The activity of each of the protease was calculated, and the effect of 5 overexpressing YmaH on the production of the protease is shown in Figures 10A-B and
Figure 1 1 .
[0294] Figures 10A and 10B show that overexpressing YmaH in Bacillus host cells, whether in presence (42SigA and 42SigH; Figure 10A) or absence (41 SigH; Figure 10B) of the degU(Hy) mutation, enhances the production of the AprE subtilisin by several fold
10 when compared to the level produces by the respective control cells 41 pBS19 and
42pBS19. In addition, cells that overexpress YmaH produce elevated levels of the AprE subtilisin earlier than cells that do not overexpress YmaH. For example, Figure 10A shows that 42sigH cells produce almost as much subtilisin at 20 hours of growth as the parent control cells produce at 48 hours. Similarly, Figure 10B shows that 41 SigH cells
15 produce more subtilisin at 25 hours than the 41 pBS control cells produce at 48 hours.
The graph shown in Figure 1 1 shows that cells that the expression of YmaH when driven by the SigH promoter (42SigH) results in the production of subtilisin that is greater than that produced by cells in which YmaH expression is driven by the Sigma A promoter (42SigA). Figure 1 1 also shows that overexpression of YmaH whether driven by the SigH
20 or SigA promoter results in enhanced production of AprE subtilisin as early as after only one hour of cell growth.
EXAMPLE 8
Effect of YmaH Overexpression on Protease Expression in Modified Bacillus sp. Host Cells 25 containing an Inactivated phrA or phrE gene
[0295] The effect of overexpressing YmaH protein on the enhanced ability of Bacillus cells that lack phrA or phrE io produce protease was tested.
[0296] The expression construct SigH, which comprises the ymaH gene operably linked to its native promoter (SigH promoter), was amplified by PCR using the primers ymaH 1 F 30 EcoRI (P1 ; SEQ ID NO:24) and ymaH 3'R BamHI (P2; SEQ ID NO:25) and cloned in the multicopy plasmid pBS19 using EcoRI and BamHI restriction sites to generate plasmid pBS19 ymaH sigH (SEQ ID NO:37).
[0297] The sequence of the primers used for the amplification is set forth below:
35
Figure imgf000086_0001
85
Figure imgf000087_0001
[0298] The map of the plasmid pBS19 ymaH sigH is set forth in Figure 9, and the sequence of the plasmid pBS19 ymaH sigH is set forth below: gaattcgacgtggtttcgcaacaaaatgcaggtcacatggttcgatatgacaccgcctgttgatatggagctgaaaaaaaaggaaatt 5 ttcacacatatagcaggaaaactcgaactttaatcgaaactgtatgatatagagaatcaaggaggacgaaacatgaaaccgattaat attcaggatcagtttttgaatcaaatccggaaagaaaatacgtatgtcactgtttttttgctgaacggctttcagttgcggggccaggtgaa aggctttgataactttaccgtattgttggaatcggaaggtaagcagcagcttatatataaacatgcgatctcaacgtttgcgccgcaaaa aaacgtccagcttgaactcgaatagatcaaaaaatgccatgtcaagacatgaggaaaggctgtcgggggttcccggcggccattttt aacatgaatccacttttgctccaagctttttgtgtaagctgaccatgccaaggcacggtctttttttatgaggatcctctagagtcgacctgc
10 aggcatgcaagcttggcgatcctgcctcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcac agcttgtctgtaagcggatgccgggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggcgcagccatg acccagtcacgtagcgatagcggagtgtatactggcttaactatgcggcatcagagcagattgtactgagagtgcaccatatgcggtg tgaaataccgcacagatgcgtaaggagaaaataccgcatcaggcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttc ggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgt
15 gagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagc atcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccct cgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgct gtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccg gtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgagg
20 tatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaag ccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcag attacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaag ggattttggtcatgagattatcaaaaaggatctggagctgtaatataaaaaccttcttcaactaacggggcaggttagtgacattagaa aaccgactgtaaaaagtacagtcggcattatctcatattataaaagccagtcattaggcctatctgacaattcctgaatagagttcataa
25 acaatcctgcatgataaccatcacaaacagaatgatgtacctgtaaagatagcggtaaatatattgaattacctttattaatgaattttcct gctgtaataatgggtagaaggtaattactattattattgatatttaagttaaacccagtaaatgaagtccatggaataatagaaagagaa aaagcattttcaggtataggtgttttgggaaacaatttccccgaaccattatatttctctacatcagaaaggtataaatcataaaactctttg aagtcattctttacaggagtccaaataccagagaatgttttagatacaccatcaaaaattgtataaagtggctctaacttatcccaataa cctaactctccgtcgctattgtaaccagttctaaaagctgtatttgagtttatcacccttgtcactaagaaaataaatgcagggtaaaattta
30 tatccttcttgttttatgtttcggtataaaacactaatatcaatttctgtggttatactaaaagtcgtttgttggttcaaataatgattaaatatctct tttctcttccaattgtctaaatcaattttattaaagttcatttgatatgcctcctaaatttttatctaaagtgaatttaggaggcttacttgtctgcttt cttcattagaatcaatccttttttaaaagtcaatattactgtaacataaatatatattttaaaaatatcccactttatccaattttcgtttgttgaac 86
taatgggtgctttagttgaagaataaaagaccacattaaaaaatgtggtcttttgtgtttttttaaaggatttgagcgtagcgaaaaatccttt tctttcttatcttgataataagggtaactattgccggttgtccattcatggctgaactctgcttcctctgttgacatgacacacatcatctcaat atccgaatagggcccatcagtctgacgaccaagagagccataaacaccaatagccttaacatcatccccatatttatccaatattcgtt ccttaatttcatgaacaatcttcattctttcttctctagtcattattattggtccattcactattctcattcccttttcagataattttagatttgcttttct 5 aaataagaatatttggagagcaccgttcttattcagctattaataactcgtcttcctaagcatccttcaatccttttaataacaattatagcat ctaatcttcaacaaactggcccgtttgttgaactactctttaataaaataatttttccgttcccaattccacattgcaataatagaaaatccat cttcatcggctttttcgtcatcatctgtatgaatcaaatcgccttcttctgtgtcatcaaggtttaattttttatgtatttcttttaacaaaccaccat aggagattaaccttttacggtgtaaaccttcctccaaatcagacaaacgtttcaaattcttttcttcatcatcggtcataaaatccgtatcctt tacaggatattttgcagtttcgtcaattgccgattgtatatccgatttatatttatttttcggtcgaatcatttgaacttttacatttggatcatagtct 10 aatttcattgcctttttccaaaattgaatccattgtttttgattcacgtagttttctgtattcttaaaataagttggttccacacataccaatacatg catgtgctgattataagaattatctttattatttattgtcacttccgttgcacgcataaaaccaacaagatttttattaatttttttatattgcatcatt cggcgaaatccttgagccatatctgacaaactcttatttaattcttcgccatcataaacatttttaactgttaatgtgagaaacaaccaacg aactgttggcttttgtttaataacttcagcaacaaccttttgtgactgaatgccatgtttcattgctctcctccagttgcacattggacaaagc ctggatttacaaaaccacactcgatacaactttctttcgcctgtttcacgattttgtttatactctaatatttcagcacaatcttttactctttcagc 15 ctttttaaattcaagaatatgcagaagttcaaagtaatcaacattagcgattttcttttctctccatggtctcacttttccactttttgtcttgtcca ctaaaacccttgatttttcatctgaataaatgctactattaggacacataatattaaaagaaacccccatctatttagttatttgtttagtcact tataactttaacagatggggtttttctgtgcaaccaattttaagggttttcaatactttaaaacacatacataccaacacttcaacgcaccttt cagcaactaaaataaaaatgacgttatttctatatgtatcaagataagaaagaacaagttcaaaaccatcaaaaaaagacaccttttc aggtgctttttttattttataaactcattccctgatctcgacttcgttctttttttacctctcggttatgagttagttcaaattcgttctttttaggttctaa 20 atcgtgtttttcttggaattgtgctgttttatcctttaccttgtctacaaaccccttaaaaacgtttttaaaggcttttaagccgtctgtacgttcctt aag (SEQ ID NO:37).
[0299] The strain BG2942 deleted for the phrA (CB2-1 ) and the strain BG2942 deleted for the phrE gene (CB 2-2) were each transformed with the multicopy plasm id pBS19 ymaH sigH (SEQ ID NO:37) to generate strains CB2-1 1 (BG2942 phrA:spc, pBSW ymaH sigH) 25 and CB2-12 (BG2942 phrEispc, pBS19 ymati sigH), respectively, and tested for the expression of aprE. BG2942 cells that do not carry a deletion of either the phrA or the phrE gene were transformed with the pBS19 ymaH sigH plasmid to generate the control strain 42SigH (BG2942 pBS19 ymaH sigH). All BG2942 derived strains (42SigH, CB2-1 1 and CB2-12) were grown for nine hours in 2X SNB media and the supernatants were 30 utilized for assaying the activity of AprE using the AAPF assay.
[0300] Figure 12 shows the effect of overexpressing YmaH on the production of protease by strains carrying the deletion of either the phrA or phrE gene. The strains carrying the multicopy plasmid pBS19 ymaH sigH (i.e., 42SigH), CB2-1 1 and CB2-12, showed a higher protease expression when compared to the BG2942 strain that was transformed 35 only with a control pBS19 plasmid (42pBS19). In particular, the results show that overexpression of YmaH in the 42SigH strain (BG2942 pBS19 ymaH sigH) (squares) 87
enhances the production of the AprE protease obtained in the control BG2942 pBS19 (diamonds). In addition, the results also show that deletion of phrE in combination with overexpression of ymaH (CB2-12;crosses) further enhances the production of protease by the BG2942 pBS19 ymati sight strain (42SigH; squares) when compared to the 5 production by the modified Bacillus subtilis strain CB2-2 (BG2942 phrE:spc) or to the
42SigH (BG2942 ymaH sigH) strain.
[0301] Thus, while overexpression of YmaH enhances the production of a protein of interest (e.g., a subtilisin), combining the overexpression of YmaH with the deletion of a phr gene, in particular, the phrE gene, further enhances the production of a protein of
10 interest.
EXAMPLE 9
Protease Expression in Bacillus sp. Cells Containing a Deletion of the rapA/phrA genes [0302] Transcription of the rapA/phrA operon was abolished in Bacillus subtilis strain BG3594 {degU(Hy)32, oppA, ΔspollE, AaprE, AnprE) that carries the PaprE-FNA
15 expression construct to generate strain JS1 121 according to the following.
[0303] The deletion cassette of the rapA/phrA operon is diagramed in Figure 13, and the polynucleotide sequence is: tggagggagtcagaccgcgtctttgggaaaaaagcaagcggaaagtgaccgtgtttacggatggagatggagggacttca agagagcaggaagccattgtcagagaggttcagcggagtcaagtcatcatgaatccgctattgaaaaaagagatatacag
20 atcaattgatcagttttttcatagtgataaatcgttttatcaaacatatgacatcccttacaagcgcggcattctgttatatggacctc ctggaaacggaaagacgacgttagtgaagtcgatcgcaggcagtatcgatgcacctgttgcttattggcaaattactgaattta cgtcgagcgagacaatagaagaagtctttcaggcagcgagacgcctcgctcctgcagttctggtcatcgaggatatagattc gatgccggaagatgtgcggtccttttttctcaatacgctggacggcgcgacatcaaaagaggggctatttctcatcggtacgac aaactatcccgaagagatcgatccaggtttgatgaatcgtgcaggacgatttgaccgtgcctatgaaatcgggcttccggatg
25 aagagctgcggctggaatatatgaaaatgagaggctttggcatctttttgagtgaaggagaaataaaaaacgccgcaaaac ttacagaaggcttttcctttgcacagctgggagaattatatgtatcttcagcccttcaatggcaccaagaagggaatcaccatatt gaaaccatggtgaaagacatgacaggagagcaaagaaaaagccagcggggaagctggatggaaagaaacaaagtc ggttttcactaaaagaaagcacgggtgtttgaaaaacccgtgcttttttgttgcggttagccgaaattcgacaattgcggttattttg cgttcttctttttcttgtaaatatgataaaatatgacatatctcgggtaattcaaaaggggggattaattgaggatgaagcagacg
30 ctcgaggtcgacggtatcgataagctggatccataacttcgtataatgtatgctatacgaagttatctagataaaaaatttagaa gccaatgaaatctataaataaactaaattaagtttatttaattaacaactatggatataaaataggtactaatcaaaatagtgag gaggatatatttgaatacatacgaacaaattaataaagtgaaaaaaatacttcggaaacatttaaaaaataaccttattggtac ttacatgtttggatcaggagttgagagtggactaaaaccaaatagtgatcttgactttttagtcgtcgtatctgaaccattgacaga tcaaagtaaagaaatacttatacaaaaaattagacctatttcaaaaaaaataggagataaaagcaacttacgatatattgaat
35 taacaattattattcagcaagaaatggtaccgtggaatcatcctcccaaacaagaatttatttatggagaatggttacaagagct ttatgaacaaggatacattcctcagaaggaattaaattcagatttaaccataatgctttaccaagcaaaacgaaaaaataaaa gaatatacggaaattatgacttagaggaattactacctgatattccattttctgatgtgagaagagccattatggattcgtcagag gaattaatagataattatcaggatgatgaaaccaactctatattaactttatgccgtatgattttaactatggacacgggtaaaat cataccaaaagatattgcgggaaatgcagtggctgaatcttctccattagaacatagggagagaattttgttagcagttcgtagt tatcttggagagaatattgaatggactaatgaaaatgtaaatttaactataaactatttaaataacagattaaaaaaattataaa 5 aaaattgaaaaaatggtggaaacacttttttcaatttttttgttttattatttaatatttgggaaatattcattctaattggtaatcagatttt agaaaacaataaacccttgcatatgtctagataacttcgtataatgtatgctatacgaagttatgcggccgccacgcacaaaa acaaatccagagaggagattgtttatatgaaatctaaatggatgtcaggtttgttgctcgttgcggtcgggttcagctttactcagg tgatggttcatgcaggtgaaacagcaaacacagaagggaaaacatttcatattgcggcacgcaatcaaacatgatgcataa aaaaagacccttaggggtcttttttatttcttcagcttccattcttttatcgtcagctcagaagatccacttgccaccagcggatccg
10 catggccgatttccgctgcctcttccagtgaatctgcttcgatgacatacgctccgcctgtggcgtcgctgaatggcccaaacatt tttaaacgtttttctgcctgtaaacgatccagaaattcatagtgcccagccacatgctcctgattaaatttctccgttctcattgtcag cattaaatatggtatacatattcagaccctccgtgaacttcagtttaacacatttatccatattacggtgatagatgatatgagctttt cgtcctacgaatgccacctatttatgaaaaaagaaaaggagagatgataggtgagcattccagtaaagaaaaatttggtttct gaggcgaaatacgcgttgaagtgtcctaatgcaatgtccgctgaatacattaccattcacaacacggcaaacgatgcatcag
15 cggccaatgaaatcagctatatgatcgggaacacaagctcgacaagctttcattttgcggtcgatgatcaagaggtgattcaa ggtctgccgcttaaccgaaacgcttggcacactggtgacggcacaaacggtccgggaaaccgcaaatcaatcggtgttga gatttgctacagcaaatcgggaggcccgaagtatgaggcagctgaagccttggcgatttcatttgttgcacagctgttgaagg agcgcggctggggcatcgatcgggtgagaaagcatcaggactggagcggaaagtattgcccgcaccgcattttatcagag gggcgctgggatcaagtgaaggcggcgattgaaaaggaattaaacgggggcgtatcagcgaaaaaagctgcagtctcttc
20 ttcggcgtctgaatatcatgtaaaaaaaggtgacacactgtcagggattgccgcatca; SEQ ID NO:52, was made by PCR amplification of the two partial yjo nucleotide sequences and ligation of the amplified fragments to the lox-Spectinomycin-lox cassette. The partial yjoB gene sequence located upstream the rapA sequence was amplified using the oligos [0304] HindlllUF gcgtgcaagctt ggagggagtcagaccgcgtctttgg; SEQ ID NO:56, and
25 [0305] XhoUR agagga ctcgagcgtctgcttcatcctcaattaatc ; SEQ ID NO:55,
[0306] and the sequence located downstream the rapA gene containing the phrA and yjpA gene sequences was amplified using oligos
[0307] NotlDF ttatgaga gcggccgc cacgcacaaaaacaaatccagagag; SEQ ID NO:57, and BgIIIDR ccccgtagatctcggcaatccctgacagtgtgtcacc; SEQ ID NO:58.
30 [0308] Since the phrA gene is transcribed by the rapA promoter (McQuade et a/. J.
Bacteriology 2001 Aug;183(16):4905-9) both the rapA (NP_389125) and the phrA (NP 389126) sequences are not transcribed in this construct. [0309] Bacillus sp. strains CF471 , CB3-47, JS1 121 , which contain the PaprE-FNA expression cassette, were grown in a suitable growth medium for 50 hours in shake
35 flasks, and the supernatants were sampled at 18, 24, 42, 48 hours and tested in an AAPF assay as described above. [0310] The results (Figure 14) showed that the strain carrying the deletion of phrA (CB3- 47; closed squares), and the strain carrying the deletion of rapA and phrA genes (JS1 121 ; open triangles) exhibit increased FNA expression when compared to the control strain CF471 (closed diamonds).
5 [0311] Therefore, inactivation of the phrA and/or the rapA genes increases the production of the heterologous subtilisin FNA when compared to the production of the same enzyme by the unmodified precursor host cell.
[0312] While particular embodiments of the present invention have been illustrated and described, it will be apparent to those skilled in the art that various other changes and
10 modifications can be made without departing from the spirit and scope of the invention. It is therefore intended to cover in the appended claims all such changes and modifications that are within the scope of this invention.
[0313] All patents and publications mentioned in the specification are indicative of the levels of those skilled in the art to which the invention pertains. All patents and
15 publications are herein incorporated by reference to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference. [0314] Having described the preferred embodiments of the present invention, it will appear to those ordinarily skilled in the art that various modifications may be made to the disclosed embodiments, and that such modifications are intended to be within the scope
20 of the present invention.
[0315] Those of skill in the art readily appreciate that the present invention is well adapted to carry out the objects and obtain the ends and advantages mentioned, as well as those inherent therein. The compositions and methods described herein are representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope
25 of the invention. It is readily apparent to one skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention.
[0316] The invention illustratively described herein suitably may be practiced in the absence of any element or elements, limitation or limitations which is not specifically
30 disclosed herein. The terms and expressions which have been employed are used as terms of description and not of limitation, and there is no intention that in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present
35 invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the concepts herein disclosed may be resorted to by those 90
skilled in the art, and that such modifications and variations are considered to be within the scope of this invention as defined by the appended claims. [0317] The invention has been described broadly and generically herein. Each of the narrower species and subgeneric groupings falling within the generic disclosure also form 5 part of the invention. This includes the generic description of the invention with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein.

Claims

91
[0318] CLAIMS
[0319] What is claimed is:
5 [0320] 1 . A host cell comprising a rap operon comprising at least one inactivated phr and/or at least one inactivated rap gene.
[0321] 2. The host cell of Claim 1 , wherein said host cell overexpresses YmaH.
10 [0322] 3. The host cell of Claims 1 or 2, wherein said host cell further comprises a recombinant nucleic acid.
[0323] 4. The host cell of any of Claims 1 -3, wherein said host cell further comprises a polynucleotide sequence encoding a protein of interest. 15
[0324] 5. The host cell of Claim 4, wherein said recombinant nucleic acid comprises a promoter that is operably linked to said polynucleotide sequence encoding a protein of interest.
20 [0325] 6. The host cell of Claim 5, wherein said promoter is the wild-type or a mutant aprE promoter.
[0326] 7. The host cell of any of Claims 4-6, wherein said host cell produces said protein of interest at a level that is greater than that produced by a host cell that does not 25 comprise at least one inactivated phr and/or rap gene.
[0327] 8. The host cell of Claim 7, wherein said protein of interest is an enzyme.
[0328] 9. The host cell of Claim 8, wherein said enzyme is a protease.
30
[0329] 10. The host cell of any of Claims 1 -8, wherein said at least one inactivated rap gene is the rapk gene.
[0330] 1 1 . The host cell of any of Claims 1 -10, wherein said at least one inactivated 35 phr gene is selected from phr A, phrE, phrC, phrF, phrG, phrl, and phrK. 92
[0331] 12. The host cell of Claim 1 1 , wherein said at least one inactivated phr gene is phr A.
[0332] 13. The host cell of Claim 1 1 , wherein said at least one inactivated phr gene 5 is phrE.
[0333] 14. The host cell of any of Claims 1 to 13, wherein said host cell comprises at least one inactivated phr gene and at least one inactivated rap gene.
10 [0334] 15. The host cell of Claim 14, wherein said inactivated rap gene is the rapA gene.
[0335] 16. The host cell of any of Claims 14-15, wherein said at least one inactivated phr gene is selected from phr A, phrE, phrC, phrF, phrG, phrl, and phrK. 15
[0336] 17. The host cell of Claim 16, wherein said at least one inactivated phr gene is phr A.
[0337] 18. The host cell of Claim 16, wherein said at least one inactivated phr gene 20 is phrE.
[0338] 19. The host cell of any of Claims 1 -18, comprising an inactivated phr A gene, an inactivated phrE gene, an inactivated rapA gene, and a recombinant nucleic acid encoding a protein of interest. 25
[0339] 20. The host cell of Claim 19, wherein said protein of interest is an enzyme.
[0340] 21 . The host cell of Claim 20, wherein said enzyme is a protease.
30 [0341] 22. The host cell of any of Claims 1 to 21 , wherein said host cell is a Bacillus sp. host cell.
[0342] 23. The host cell of Claim 22, wherein said Bacillus sp. host cell is a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus 35 clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus 93
subtilis, or Bacillus thuringiensis cell.
[0343] 24. The host cell of Claim 22 or 23, wherein said Bacillus sp. host cell is a Bacillus subtilis host cell. 5
[0344] 25. A method for producing at least one protein of interest comprising providing a precursor host cell and an inactivating nucleotide construct comprising an inactivating polynucleotide that inactivates at least one indigenous phr and/or rap gene; introducing said inactivating nucleotide construct into said precursor host cell to generate 10 a modified host cell; and growing said modified host cell under suitable conditions for producing of said at least one protein of interest.
[0345] 26. The method of Claim 25, wherein said protein of interest is encoded by a recombinant nucleic acid present in said precursor host cell. 15
[0346] 27. The method of Claim 25, wherein said protein of interest is encoded by a recombinant nucleic acid present in said modified host cell.
[0347] 28. The method of Claim 25, wherein said protein of interest is encoded by a 20 recombinant nucleic acid is present in said precursor host cell and/or said modified host cell.
[0348] 29. The method of any of Claims 26-28, wherein said recombinant nucleic acid comprises a promoter that is operably linked to the polynucleotide sequence 25 encoding said protein of interest.
[0349] 30. The method of any of Claims 25-29, wherein said protein of interest is a wild-type protein of interest.
30 [0350] 31 . The method of any of Claims 25-30, wherein said precursor host cell naturally produces said protein of interest.
[0351] 32. The method of any of Claims 25-31 , wherein production of said protein of interest by said modified host cell is greater than the production of said protein of interest 35 by said precursor host cell. 94
[0352] 33. The method of any of Claims 25-32, further comprising the step of recovering said protein of interest.
[0353] 34. The method of any of Claims 25 to 33, wherein the protein of interest is 5 an enzyme.
[0354] 35. The method of Claim 34, wherein said enzyme is a protease.
[0355] 36. The method of any of Claims 25 to 35, wherein said modified host cell 10 comprises a mutation in at least one gene chosen from degil, degQ, degS, sco4, spollE, degQ and degR.
[0356] 37. The method of Claim 36, wherein said host cell comprises a deg(Hy)32 mutation. 15
[0357] 38. The method of any of Claims 25 to 37, wherein said at least one indigenous phr gene that is inactivated is chosen from phrA, phrE, phrC, phrF, phrG, phrl,anό phrK.
20 [0358] 39. The method of any of Claims 25-38, wherein said inactivating polynucleotide inactivates the indigenous phr A and phrE genes and/or rap gene.
[0359] 40. The method of Claim 39, wherein said at least one indigenous phr gene is phr A. 25
[0360] 41 . The method of Claim 39, wherein said at least one indigenous phr gene is phrE.
[0361] 42. The method of any of Claims 25-41 , wherein said indigenous rap gene is 30 inactivated.
[0362] 43. The method of Claim 42, wherein said indigenous rap gene is rapA.
[0363] 44. The method of any of Claims 25-41 , further comprising an inactivated 35 indigenous rap gene. 95
[0364] 45. The method of any of Claims 25-44, wherein said precursor or modified host cell overexpresses YmaH.
[0365] 46. The method of Claim 45, wherein said overexpression of YmaH is 5 achieved by introducing a SigH construct into said precursor or said modified host cell.
[0366] 47. The method of Claim 46, wherein said SigH construct comprises SEQ ID NO:23, comprising a SigH promoter operably linked to a polynucleotide encoding YmaH protein. 10
[0367] 48. The method of Claim 45, wherein said overexpression of YmaH is achieved by introducing a SigA construct into said precursor or said modified host cell.
[0368] 49. The method of Claim 48, wherein said SigA construct comprises SEQ ID 15 NO:26 and/or 31 , comprising a SigA promoter operably linked to a polynucleotide encoding YmaH.
[0369] 50. The method of any of Claims 25-49, wherein said host cell is a Bacillus sp. host cell.
20
[0370] 51 . The method of Claim 50, wherein said Bacillus sp. host cell is a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus
25 subtilis, or Bacillus thuringiensis cell.
[0371] 52. The method of any of Claims 50-51 , wherein said Bacillus sp. host cell is a Bacillus subtilis cell.
PCT/US2010/037040 2009-06-11 2010-06-02 Bacillus strain for increased protein production WO2010144283A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CN201080026055.XA CN102803290B (en) 2009-06-11 2010-06-02 For increasing the Bacillus strain that protein produces
EP10721097.3A EP2440573B1 (en) 2009-06-11 2010-06-02 Bacillus strain for increased protein production
DK10721097.3T DK2440573T3 (en) 2009-06-11 2010-06-02 Bacillus strain for increased protein production
US13/622,968 US8476042B2 (en) 2009-06-11 2012-09-19 Bacillus strain for increased protein production

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US18632109P 2009-06-11 2009-06-11
US61/186,321 2009-06-11

Related Child Applications (3)

Application Number Title Priority Date Filing Date
US13377529 A-371-Of-International 2010-06-02
US13/622,968 Division US8476042B2 (en) 2009-06-11 2012-09-19 Bacillus strain for increased protein production
US13/622,968 Continuation US8476042B2 (en) 2009-06-11 2012-09-19 Bacillus strain for increased protein production

Publications (1)

Publication Number Publication Date
WO2010144283A1 true WO2010144283A1 (en) 2010-12-16

Family

ID=42562872

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2010/037040 WO2010144283A1 (en) 2009-06-11 2010-06-02 Bacillus strain for increased protein production

Country Status (6)

Country Link
US (2) US8293499B2 (en)
EP (1) EP2440573B1 (en)
CN (1) CN102803290B (en)
AR (1) AR076941A1 (en)
DK (1) DK2440573T3 (en)
WO (1) WO2010144283A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015102814A1 (en) 2013-12-31 2015-07-09 Danisco Us Inc. Enhanced protein expression
WO2016100128A1 (en) 2014-12-19 2016-06-23 Danisco Us Inc Enhanced protein expression
WO2016134213A3 (en) * 2015-02-19 2016-11-03 Danisco Us Inc Enhanced protein expression
WO2017075195A1 (en) 2015-10-30 2017-05-04 Danisco Us Inc Enhanced protein expression and methods thereof
WO2017112733A1 (en) 2015-12-23 2017-06-29 Danisco Us Inc. Enhanced protein production and methods thereof
CN111004815A (en) * 2019-12-18 2020-04-14 江苏省农业科学院 Method for improving yield of surface active element of bacillus subtilis by mutating degU gene
US10683528B2 (en) 2014-12-16 2020-06-16 Danisco Us Inc Enhanced protein expression
EP4095152A2 (en) 2016-03-04 2022-11-30 Danisco US Inc. Engineered ribosomal promoters for protein production in microorganisms

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK2788491T3 (en) * 2011-12-09 2019-04-01 Danisco Us Inc RIBOSOMAL PROMOTERS FROM B. SUBTILIS FOR PROTEIN PRODUCTION IN MICRO-ORGANISMS
KR101619316B1 (en) 2014-05-27 2016-05-11 한국생명공학연구원 Variant Promoter of Biacillus and Method for Preparing Protein Using the Same
DE102014223904A1 (en) * 2014-11-24 2016-05-25 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Capacitor and method of making the same
CN108456669B (en) * 2017-02-20 2023-08-08 上海凯赛生物技术股份有限公司 Ribosome binding site, recombinant expression plasmid, transformant and application thereof
JP7280269B2 (en) * 2018-01-03 2023-05-23 ダニスコ・ユーエス・インク Mutant and genetically modified Bacillus cells for increased protein production and methods thereof
CN109355303B (en) * 2018-11-14 2022-05-06 天津大学 Application of inhibition and/or knockout gene in improving expression quantity of monoclonal antibody
CN113366108A (en) * 2018-11-28 2021-09-07 丹尼斯科美国公司 Novel promoter sequences and methods for their use to enhance production of proteins from Bacillus cells
CN114127256A (en) * 2019-02-20 2022-03-01 巴斯夫欧洲公司 Industrial fermentation process of bacillus by using defined medium and trace element for feeding
EP3927837A1 (en) * 2019-02-20 2021-12-29 Basf Se Industrial fermentation process for bacillus using defined medium and magnesium feed
CN112646746B (en) * 2021-01-04 2022-02-25 济南大学 Bacillus firmus for efficiently degrading donkey hair
CN112941088B (en) * 2021-02-04 2023-06-16 中国农业科学院哈尔滨兽医研究所(中国动物卫生与流行病学中心哈尔滨分中心) Genes related to brucella virulence and application thereof in brucella virulence evaluation and preparation of attenuated brucella
CN115260314B (en) * 2022-05-10 2023-10-24 吉林大学 Recombinant thermophilic neutral protease and application thereof in degradation of protein

Citations (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4302544A (en) 1979-10-15 1981-11-24 University Of Rochester Asporogenous mutant of B. subtilis for use as host component of HV1 system
US4450235A (en) 1982-04-21 1984-05-22 Cpc International Inc. Asporogenic mutant of bacillus subtilis useful as a host in a host-vector system
EP0134048A1 (en) 1983-07-06 1985-03-13 Gist-Brocades N.V. Molecular cloning and expression in industrial microorganism species
US4760025A (en) 1984-05-29 1988-07-26 Genencor, Inc. Modified enzymes and methods for making same
WO1989006279A1 (en) 1988-01-07 1989-07-13 Novo-Nordisk A/S Mutated subtilisin genes
US4873192A (en) 1987-02-17 1989-10-10 The United States Of America As Represented By The Department Of Health And Human Services Process for site specific mutagenesis without phenotypic selection
US4914031A (en) 1987-04-10 1990-04-03 Amgen, Inc. Subtilisin analogs
US4980288A (en) 1986-02-12 1990-12-25 Genex Corporation Subtilisin with increased thermal stability
US5208158A (en) 1990-04-19 1993-05-04 Novo Nordisk A/S Oxidation stable detergent enzymes
WO1993010249A1 (en) 1991-11-14 1993-05-27 Novo Nordisk A/S A bacillus promoter derived from a variant of a bacillus licheniformis x-amylase promoter
US5217878A (en) 1987-02-27 1993-06-08 Gist-Brocades, Nv Molecular cloning and expression of genes encoding proteolytic enzymes
EP0414279B1 (en) 1986-04-03 1993-11-18 Massey Ferguson S.A. Tractor hitch control system
US5264366A (en) 1984-05-29 1993-11-23 Genencor, Inc. Protease deficient bacillus
US5310675A (en) 1983-06-24 1994-05-10 Genencor, Inc. Procaryotic carbonyl hydrolases
US5336611A (en) 1988-02-11 1994-08-09 Gist-Brocades N.V. PB92 serine protease muteins and their use in detergents
US5387521A (en) 1991-04-26 1995-02-07 Genencor International, Inc. Gene expression in bacilli
US5399283A (en) 1986-01-15 1995-03-21 Amgen Inc. Thermally stable and pH stable subtilisin analogs and method for production thereof
US5482849A (en) 1990-12-21 1996-01-09 Novo Nordisk A/S Subtilisin mutants
US5665587A (en) 1989-06-26 1997-09-09 Novo Nordisk A/S Modified subtilisins and detergent compositions containing same
US5700676A (en) 1984-05-29 1997-12-23 Genencor International Inc. Modified subtilisins having amino acid alterations
WO1998007846A1 (en) 1996-08-23 1998-02-26 Peter Ruhdal Jensen Artificial promoter libraries for selected organisms and promoters derived from such libraries
US5858757A (en) 1991-05-01 1999-01-12 Novo Nordisk A/S Stabilized enzymes and detergent compositions
US5880080A (en) 1994-03-31 1999-03-09 Solvay Enzymes Gmbh & Co. Kg Use of alkaline proteases in industrial textile laundering processes
WO1999020726A1 (en) 1997-10-23 1999-04-29 The Procter & Gamble Company Bleaching compositions comprising multiply-substituted protease variants
WO1999034011A2 (en) 1997-12-24 1999-07-08 Genencor International, Inc. Method of assaying for a preferred enzyme and/or detergent
WO1999043835A2 (en) 1998-02-26 1999-09-02 Novo Nordisk Biotech, Inc. Methods for producing a polypeptide in a bacillus cell
US6218165B1 (en) 1998-04-15 2001-04-17 Genencor International, Inc. Mutant proteins having lower allergenic response in humans and methods for constructing, identifying and producing such proteins
US6376450B1 (en) 1998-10-23 2002-04-23 Chanchal Kumar Ghosh Cleaning compositions containing multiply-substituted protease variants
US20020182734A1 (en) 2000-08-11 2002-12-05 Diaz-Torres Maria R. Bacillus transformation, transformants and mutant libraries
US20030148461A1 (en) 2000-01-07 2003-08-07 Fernando Valle Mutant aprE promoter
WO2003070963A2 (en) * 2002-02-15 2003-08-28 Genencor International, Inc. Improved protein expression in bacillus subtilis
US20050202535A1 (en) 2003-11-06 2005-09-15 Katherine Collier Bacterial expression of protease inhibitors and variants thereof
WO2006033668A2 (en) * 2004-04-09 2006-03-30 Genencor International, Inc. Pcka modifications and enhanced protein expression in bacillus
JP2006325586A (en) * 2005-04-28 2006-12-07 Kao Corp Recombinant microorganism
WO2008126929A1 (en) * 2007-04-10 2008-10-23 Kao Corporation Recombinant microorganism
JP2009072154A (en) * 2007-09-21 2009-04-09 Kao Corp New bacillus subtilis mutant strain and method for preparing long-lived bacillus subtilis mutant strain
WO2009094084A1 (en) * 2007-12-21 2009-07-30 Danisco Us Inc., Genencor Division Enhanced protein production in bacillus

Patent Citations (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4302544A (en) 1979-10-15 1981-11-24 University Of Rochester Asporogenous mutant of B. subtilis for use as host component of HV1 system
US4450235A (en) 1982-04-21 1984-05-22 Cpc International Inc. Asporogenic mutant of bacillus subtilis useful as a host in a host-vector system
US5310675A (en) 1983-06-24 1994-05-10 Genencor, Inc. Procaryotic carbonyl hydrolases
EP0134048A1 (en) 1983-07-06 1985-03-13 Gist-Brocades N.V. Molecular cloning and expression in industrial microorganism species
US4760025A (en) 1984-05-29 1988-07-26 Genencor, Inc. Modified enzymes and methods for making same
US5700676A (en) 1984-05-29 1997-12-23 Genencor International Inc. Modified subtilisins having amino acid alterations
US5441882A (en) 1984-05-29 1995-08-15 Genencor, Inc. Method for preparing modified subtilisins
USRE34606E (en) 1984-05-29 1994-05-10 Genencor, Inc. Modified enzymes and methods for making same
US5264366A (en) 1984-05-29 1993-11-23 Genencor, Inc. Protease deficient bacillus
US5399283A (en) 1986-01-15 1995-03-21 Amgen Inc. Thermally stable and pH stable subtilisin analogs and method for production thereof
US4980288A (en) 1986-02-12 1990-12-25 Genex Corporation Subtilisin with increased thermal stability
EP0414279B1 (en) 1986-04-03 1993-11-18 Massey Ferguson S.A. Tractor hitch control system
US4873192A (en) 1987-02-17 1989-10-10 The United States Of America As Represented By The Department Of Health And Human Services Process for site specific mutagenesis without phenotypic selection
US5217878A (en) 1987-02-27 1993-06-08 Gist-Brocades, Nv Molecular cloning and expression of genes encoding proteolytic enzymes
US4914031A (en) 1987-04-10 1990-04-03 Amgen, Inc. Subtilisin analogs
US5741694A (en) 1988-01-07 1998-04-21 Novo Nordisk A/S Useful mutations of bacterial alkaline protease
WO1989006279A1 (en) 1988-01-07 1989-07-13 Novo-Nordisk A/S Mutated subtilisin genes
US5336611A (en) 1988-02-11 1994-08-09 Gist-Brocades N.V. PB92 serine protease muteins and their use in detergents
US5665587A (en) 1989-06-26 1997-09-09 Novo Nordisk A/S Modified subtilisins and detergent compositions containing same
US6197567B1 (en) 1989-06-26 2001-03-06 Novo Nordisk A/S Modified subtilisins and detergent compositions containing same
US5208158A (en) 1990-04-19 1993-05-04 Novo Nordisk A/S Oxidation stable detergent enzymes
US5631217A (en) 1990-12-21 1997-05-20 Novo Nordisk A/S Detergent compositions comprising a modified subtilisin
US5482849A (en) 1990-12-21 1996-01-09 Novo Nordisk A/S Subtilisin mutants
US5387521A (en) 1991-04-26 1995-02-07 Genencor International, Inc. Gene expression in bacilli
US5858757A (en) 1991-05-01 1999-01-12 Novo Nordisk A/S Stabilized enzymes and detergent compositions
WO1993010249A1 (en) 1991-11-14 1993-05-27 Novo Nordisk A/S A bacillus promoter derived from a variant of a bacillus licheniformis x-amylase promoter
US5880080A (en) 1994-03-31 1999-03-09 Solvay Enzymes Gmbh & Co. Kg Use of alkaline proteases in industrial textile laundering processes
WO1998007846A1 (en) 1996-08-23 1998-02-26 Peter Ruhdal Jensen Artificial promoter libraries for selected organisms and promoters derived from such libraries
WO1999020770A2 (en) 1997-10-23 1999-04-29 Genencor International, Inc. Multiply-substituted protease variants
WO1999020726A1 (en) 1997-10-23 1999-04-29 The Procter & Gamble Company Bleaching compositions comprising multiply-substituted protease variants
WO1999020769A2 (en) 1997-10-23 1999-04-29 Genencor International, Inc. Multiply-substituted protease variants
WO1999034011A2 (en) 1997-12-24 1999-07-08 Genencor International, Inc. Method of assaying for a preferred enzyme and/or detergent
WO1999043835A2 (en) 1998-02-26 1999-09-02 Novo Nordisk Biotech, Inc. Methods for producing a polypeptide in a bacillus cell
US6218165B1 (en) 1998-04-15 2001-04-17 Genencor International, Inc. Mutant proteins having lower allergenic response in humans and methods for constructing, identifying and producing such proteins
US6376450B1 (en) 1998-10-23 2002-04-23 Chanchal Kumar Ghosh Cleaning compositions containing multiply-substituted protease variants
US20030148461A1 (en) 2000-01-07 2003-08-07 Fernando Valle Mutant aprE promoter
US20020182734A1 (en) 2000-08-11 2002-12-05 Diaz-Torres Maria R. Bacillus transformation, transformants and mutant libraries
WO2003070963A2 (en) * 2002-02-15 2003-08-28 Genencor International, Inc. Improved protein expression in bacillus subtilis
US20050202535A1 (en) 2003-11-06 2005-09-15 Katherine Collier Bacterial expression of protease inhibitors and variants thereof
WO2006033668A2 (en) * 2004-04-09 2006-03-30 Genencor International, Inc. Pcka modifications and enhanced protein expression in bacillus
JP2006325586A (en) * 2005-04-28 2006-12-07 Kao Corp Recombinant microorganism
WO2008126929A1 (en) * 2007-04-10 2008-10-23 Kao Corporation Recombinant microorganism
JP2009072154A (en) * 2007-09-21 2009-04-09 Kao Corp New bacillus subtilis mutant strain and method for preparing long-lived bacillus subtilis mutant strain
WO2009094084A1 (en) * 2007-12-21 2009-07-30 Danisco Us Inc., Genencor Division Enhanced protein production in bacillus

Non-Patent Citations (79)

* Cited by examiner, † Cited by third party
Title
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 410
ALTSCHUL ET AL., METH. ENZYMOL., vol. 266, 1996, pages 460 - 480
APPL. MICROBIOL. BIOTECHNOL., vol. 62, 2003, pages 369 - 73
ARIGONI ET AL., MOL. MICROBIOL., vol. 31, 1999, pages 1407 - 1415
AUNSTRUP ET AL., PROC IV IFS: FERMENT. TECHNOL. TODAY, 1972, pages 299 - 305
BAND, L.; D. J. HENNER, DNA, vol. 3, 1984, pages 17 - 21
BRYAN, BIOCHIM. BIOPHYS. ACTA, vol. 1543, 2000, pages 203 - 222
CALDWELL ET AL., J. BACTERIOL., vol. 183, 2001, pages 7329 - 7340
DARMON, JOURNAL OF BACTERIOLOGY, vol. 184, 2002, pages 5661 - 5671
DEL MAR ET AL., ANAL. BIOCHEM., vol. 99, 1979, pages 316 - 320
DEVEREUX ET AL., NUCL. ACID RES., vol. 12, 1984, pages 387 - 395
DUBNAU, MICROBIOLOGICAL REVIEWS, vol. 55, 1991, pages 395 - 424
EHRLICH, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES USA, vol. 75, 1978, pages 1433
ESTELL ET AL., J BIOL CHEM., vol. 260, 1985, pages 6518 - 6521
FAHNESTOCK; FISCHER, J. BACTERIOL., vol. 165, 1986, pages 796 - 804
FENG; DOOLITTLE, J. MOL. EVOL., vol. 35, 1987, pages 351 - 360
FERRARI ET AL., J BACTERIOL., vol. 170, 1988, pages 289 - 295
GABALLA AHMED ET AL: "The Bacillus subtilis iron-sparing response is mediated by a Fur-regulated small RNA and three small, basic proteins", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, vol. 105, no. 33, August 2008 (2008-08-01), pages 11927 - 11932, XP002597750, ISSN: 0027-8424 *
GROSSMAN, ANNUAL REVIEWS GENETICS, vol. 29, 1995, pages 477 - 508
GUEROT-FLEURY, GENE, vol. 167, 1995, pages 335 - 337
GUPTA, APPL. MICROBIOL. BIOTECHNOL., vol. 59, 2002, pages 15 - 32
HAHN ET AL., MOL MICROBIOL., vol. 18, 1995, pages 755 - 67
HAMBRAEUS ET AL., MICROBIOLOGY, vol. 148, 2002, pages 1795 - 1803
HE ET AL., RES. MICROBIOL., vol. 142, 1991, pages 797 - 803
HELMAN ET AL., NUCLEIC ACID RESEARCH, vol. 24, 1995, pages 2351 - 2360
HENNER ET AL., MICROBIOL. REV., vol. 44, 1980, pages 57 - 82
HIGGINS; SHARP, CABIOS, vol. 5, 1989, pages 151 - 153
HOCH ET AL., GENETICS, vol. 73, 1973, pages 215 - 228
KARLIN ET AL., PROC. NATL. ACAD. SCI. USA, vol. 90, 1993, pages 5873 - 5787
KRAMER ET AL., NUCLEIC ACIDS RES., vol. 12, 1984, pages 9441
KROLL DJ ET AL., DNA CELL BIOL, vol. 12, 1993, pages 441 - 53
KUNKEL ET AL., METHODS ENZYMOL., vol. 154, 1987, pages 367 382
KUNKEL, PROC. NATL. ACAD. SCI. USA, vol. 82, 1985, pages 488 492
KUNST ET AL., NATURE, vol. 390, 1997, pages 249 - 256
KUNST, NATURE, vol. 390, 1997, pages 249 - 256
KUNST, RESEARCH IN MICROBIOLOGY, vol. 145, 1994, pages 393 - 402
LEE ET AL., APPL. ENVIRON. MICROBIOL., vol. 66, 2000, pages 476 - 480
MARAHIEL, MOLECULAR MICROBIOLOGY, vol. 7, 1993, pages 631 - 636
MAURER, CURR. OP, BIOTECHNOL., 2004, vol. 15, 2004, pages 330 - 334
MCQUADE ET AL., J. BACTERIOLOGY, vol. 183, no. 16, August 2001 (2001-08-01), pages 4905 - 9
MORING ET AL., BIOTECH., vol. 2, 1984, pages 646
MOSER, I., FEBS LETT., vol. 430, no. 1-2, 1998, pages 28 - 36
MOSZER, FEBS LETT., vol. 430, 1998, pages 28 - 36
MSADEK ET AL., J BACTERIOL, vol. 172, 1990, pages 824 - 834
MSADEK ET AL., J. BACTERIOL., vol. 172, 1990, pages 824 - 834
NEEDLEMAN; WUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443
OLMOS ET AL., MOL. GEN. GENET., vol. 253, 1997, pages 562 - 567
PALMEROS ET AL., GENE, vol. 247, 2000, pages 255 - 264
PALVA ET AL., GENE, vol. 19, 1982, pages 81 - 87
PEARSON; LIPMAN, PROC. NATL. ACAD. SCI. USA, vol. 85, 1988, pages 2444
PEREGO ET AL., MOL. MICROBIOL., vol. 5, 1991, pages 173 - 185
PEREGO, CELL, vol. 79, 1994, pages 1047 - 1055
PEREGO, M., TRENDS IN MICROBIOLOGY, vol. 6, 1998, pages 366 - 370
PEREGO, MOLECULAR MICROBIOLOGY, vol. 19, 1996, pages 1151 - 1170
PEREGO, PEPTIDES, vol. 22, 2001, pages 1541 - 7
PEREGO, PROCEDINGS OF THE NATIONAL ACADAMY OF SCIENCE USA, vol. 94, 1997, pages 8612 - 8617
PEREGO, TRENDS IN MICROBIOLOGY, vol. 6, 1998, pages 366 - 370
PHILIPP ET AL., MOL. CELL. BIOCHEM., vol. 51, 1983, pages 5 - 32
PORATH, PROTEIN EXPR PURIF, vol. 3, 1992, pages 263 - 281
POTTATHIL, FRONT BIOSCI., vol. 8, 2003, pages D32 - 45
PRIEST, BACTERIOL. REV., vol. 41, 1977, pages 711 - 753
SAUTER ET AL., NUCLEIC ACID RES, vol. 31, 2003, pages 4091 - 4098
SAUTER ET AL., NUCLEIC ACIDS RES., vol. 31, 2003, pages 4091 - 4098
SIEZEN, PROTEIN SCI., vol. 6, 1997, pages 501 - 523
SILVAGGI ET AL., J BACTERIOL., vol. 187, no. 19, 2005, pages 6641 - 6650
SIMONEN; PALVA, MICROBIOLOGICAL REVIEWS, vol. 57, 1993, pages 109 - 137
SLOMA ET AL., J. BACTERIOL., vol. 173, 1991, pages 6889 - 8895
SMITH; WATERMAN, ADV. APPL. MATH., vol. 2, 1981, pages 482
SOLOMON, GENES AND DEVELOPMENT, vol. 10, 1996, pages 2014 - 2024
STAHL ET AL., J. BACTERIOL., vol. 159, 1984, pages 811 - 818
STEIN, MOLECULAR MICROBIOLOGY, vol. 44, 2002, pages 403 - 416
TJALSMA, MICROBIOLOGICAL AND MOLECULAR BIOLOGY REVIEWS, vol. 64, 2000, pages 515 - 547
TRIEU-CUOT ET AL., GENE, vol. 23, 1983, pages 331 - 341
VOSKUIL ET AL., MOL MICROBIOL, vol. 17, 1995, pages 271 279
WANG ET AL., GENE, vol. 69, 1988, pages 39 - 47
WELLS, T. A. ET AL., NUCLEIC ACIDS RES., vol. 11, 1983, pages 7911 - 7925
WESTERS HELGA ET AL: "Genome engineering reveals large dispensable regions in Bacillus subtilis", MOLECULAR BIOLOGY AND EVOLUTION, THE UNIVERSITY OF CHICAGO PRESS, US LNKD- DOI:10.1093/MOLBEV/MSG219, vol. 20, no. 12, 1 December 2003 (2003-12-01), pages 2076 - 2090, XP002495524, ISSN: 0737-4038 *
WU ET AL., J. BACTERIOL., vol. 173, 1991, pages 4952 - 4958
YE ET AL.: "Proc. Internatl. Symp. Rec. Adv. Bioindustry", MICROBIOLOGY, 1996, pages 160 - 169

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015102814A1 (en) 2013-12-31 2015-07-09 Danisco Us Inc. Enhanced protein expression
US10287591B2 (en) 2013-12-31 2019-05-14 Danisco Us Inc Enhanced protein expression
US10683528B2 (en) 2014-12-16 2020-06-16 Danisco Us Inc Enhanced protein expression
JP2017538433A (en) * 2014-12-19 2017-12-28 ダニスコ・ユーエス・インク Increased protein expression
CN107278230A (en) * 2014-12-19 2017-10-20 丹尼斯科美国公司 Enhanced protein expression
WO2016100128A1 (en) 2014-12-19 2016-06-23 Danisco Us Inc Enhanced protein expression
CN107278230B (en) * 2014-12-19 2021-10-29 丹尼斯科美国公司 Enhanced protein expression
US20180030456A1 (en) * 2015-02-19 2018-02-01 Danisco Us Inc. Enhanced protein expression
JP2018505686A (en) * 2015-02-19 2018-03-01 ダニスコ・ユーエス・インク Increased protein expression
WO2016134213A3 (en) * 2015-02-19 2016-11-03 Danisco Us Inc Enhanced protein expression
WO2017075195A1 (en) 2015-10-30 2017-05-04 Danisco Us Inc Enhanced protein expression and methods thereof
WO2017112733A1 (en) 2015-12-23 2017-06-29 Danisco Us Inc. Enhanced protein production and methods thereof
EP4095152A2 (en) 2016-03-04 2022-11-30 Danisco US Inc. Engineered ribosomal promoters for protein production in microorganisms
CN111004815A (en) * 2019-12-18 2020-04-14 江苏省农业科学院 Method for improving yield of surface active element of bacillus subtilis by mutating degU gene
CN111004815B (en) * 2019-12-18 2023-07-21 江苏省农业科学院 Method for improving yield of bacillus subtilis surfactant by mutating degU gene

Also Published As

Publication number Publication date
CN102803290A (en) 2012-11-28
AR076941A1 (en) 2011-07-20
EP2440573A1 (en) 2012-04-18
US8293499B2 (en) 2012-10-23
US20130029402A1 (en) 2013-01-31
US8476042B2 (en) 2013-07-02
EP2440573B1 (en) 2014-10-15
DK2440573T3 (en) 2015-01-19
CN102803290B (en) 2015-11-25
US20120142076A1 (en) 2012-06-07

Similar Documents

Publication Publication Date Title
EP2440573B1 (en) Bacillus strain for increased protein production
US8512982B2 (en) Enhanced protein production in Bacillus
KR102375732B1 (en) Compositions and methods for increasing protein production in Bacillus licheniformis
JP4680512B2 (en) Increased protein expression in Bacillus
EP2421973B1 (en) Proteases with modified pro regions
US7247450B2 (en) Secretion, transcription and sporulation genes in Bacillus clausii
EP2900689B1 (en) Bacterial mutants with improved transformation efficiency
EP0941349B1 (en) Methods for producing polypeptides in surfactin mutants of bacillus cells
US5958728A (en) Methods for producing polypeptides in mutants of bacillus cells
KR20170093247A (en) Enhanced protein expression
KR20230042088A (en) Alanine racemase single deletion and trans-complementation
JP2007532114A (en) PckA modification and enhanced protein expression in Bacillus
JP2017538426A (en) Enhanced protein expression
EP2257629B1 (en) Method for amplifying locus in bacterial cell
CN113785056A (en) Means and methods for improving protease expression

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201080026055.X

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10721097

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2010721097

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 9280/DELNP/2011

Country of ref document: IN

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 13377529

Country of ref document: US