WO2012068053A1 - Structure cristalline de l'enzyme murg de pseudomonas aeruginosa - Google Patents

Structure cristalline de l'enzyme murg de pseudomonas aeruginosa Download PDF

Info

Publication number
WO2012068053A1
WO2012068053A1 PCT/US2011/060705 US2011060705W WO2012068053A1 WO 2012068053 A1 WO2012068053 A1 WO 2012068053A1 US 2011060705 W US2011060705 W US 2011060705W WO 2012068053 A1 WO2012068053 A1 WO 2012068053A1
Authority
WO
WIPO (PCT)
Prior art keywords
murg
protein
murg protein
udp
binding pocket
Prior art date
Application number
PCT/US2011/060705
Other languages
English (en)
Inventor
Kieron Brown
Neesha Dedi
Sarah Vial
Graham Cheetham
Original Assignee
Vertex Pharmaceuticals Incorporated
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vertex Pharmaceuticals Incorporated filed Critical Vertex Pharmaceuticals Incorporated
Publication of WO2012068053A1 publication Critical patent/WO2012068053A1/fr

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1048Glycosyltransferases (2.4)

Definitions

  • the present invention is in the field of structural biology, especially X- ray structural determination.
  • Pseudomonas aeruginosa (P. a .) is a ubiquitous environmental gram- negative bacterium that is one of the top three causes of opportunistic human infections.
  • P. a. is highly resistant to known antibiotics.
  • One strategy to develop novel and potent antibacterial agents is to target enzymes involved in synthesis of bacterial membranes.
  • the peptidoglycan layer surrounds and protects bacterial cells is essential for survival.
  • enzymes involved in peptidoglycan biosynthesis are attractive targets for the design of new antibiotics.
  • Many are integral membrane proteins or membrane-associated proteins and are thus difficult to express, purify, crystallize and also develop biochemical assays for.
  • MurG is a glycosyltransferase that is anchored to the cytoplasmic surface of the bacterial cell membrane. Functioning in close collaboration with MraY, it catalyses the rate-limiting step of bacterial peptidoglycan biosynthesis. Specifically, MurG catalyzes the transfer of N-acetyl glucosamine (GlcNAc) from UDP- GlcNAc to Lipid I.
  • the GlcNAc-MurNAc product of the MurG reaction is the minimal subunit of the peptidoglycan polymer.
  • the present invention provides for the first time, crystallizable compositions, crystals, and the crystal structures of P.a. MurG protein and a P.a. MurG protein-inhibitor complex.
  • the crystal structure of P.a. MurG potein has allowed the applicants to determine the key structural features of P.a. MurG protein, particularly the shape of its substrate and P.a. MurG protein-binding pockets.
  • the present invention provides molecules or molecular complexes comprising all or parts of the binding pockets, or homologues of the binding pockets that have similar three-dimensional shapes.
  • the present invention further provides crystals of P.a.
  • a ligand, substrate or inhibitor such as UDP-N- acetylglucosamine
  • the present invention provides crystallizable compositions from which crystals of P.a. MurG protein, and complexes of P.a. MurG protein with a chemical entity, such as its inhibitor, ligand, or substrat, may be obtained.
  • the invention provides a data storage medium that comprises the structure coordinates of molecules and molecular complexes that comprise all or part of a binding pocket of P.a. MurG protein.
  • a data storage medium encoded with these data when read and utilized by a computer programmed with appropriate software displays, on a computer screen or similar viewing device, a three-dimensional graphical representation of a molecule or molecular complex comprising such a binding pocket or similarly shaped homologous binding pocket.
  • the invention provides computational methods of using structure coordinates of the P.a. MurG protein and P.a. MurG protein complex to screen for and design compounds, including inhibitory compounds and antibodies that interact with P. a. MurG protein or homologues thereof.
  • the invention provides methods for designing, evaluating and identifying compounds, which bind to the aforementioned binding pocket.
  • such compounds are potential inhibitors of P.a. MurG protein or its homologues.
  • the invention provides a method for determining at least a portion of the three-dimensional structure of molecules or molecular complexes which contain at least some structurally similar features to P.a. MurG. In certain embodiments, this is achieved by using at least some of the structural coordinates obtained from the P.a. MurG protein and P. a. MurG protein complexed with UDP-N- acetylglucosamine.
  • FIG. 1 lists the atomic structure coordinates of X-ray structure of P.a.
  • MurG protein as derived by X-ray diffraction from its crystal.
  • the P.a. MurG protein in the crystal comprises amino acid residues of SEQ ID NO : 1. Certain amino acid residues of SEQ ID NO: 1 are not shown in FIG.1 due to disorder in their atom coordinates.
  • FIG. 2 lists the atomic structure coordinates of X-ray structure of P.a.
  • MurG protein and UDP-N-acetylglucosamine complex as derived by X-ray diffraction from its crystal.
  • the P.a. MurG protein in the crystal comprises amino acid residues of SEQ ID NO: 1. Certain amino acid residues of SEQ ID NO: 1 are not shown in FIG.2 due to disorder in their atom coordinates.
  • FIGs. 1 and 2 The following abbreviations are used in FIGs. 1 and 2:
  • Atom type refers to the identity of the element whose coordinates are measured.
  • the first letter in the column defines the actual type of atom (i.e., C for carbon, N for nitrogen, O for oxygen, etc.).
  • Resid refers to the amino acid residue or molecule identity in the molecular model.
  • B is a thermal factor that measures movement of the atom around its atomic center.
  • FIG. 3 shows a diagram of a system used to carry out the instructions encoded by the storage medium of Figures 4 and 5.
  • FIG. 4 shows a cross section of a magnetic storage medium.
  • FIG. 5 shows a cross section of an optically-readable data storage medium.
  • FIG. 6 shows UDP-N-acetylglucosamine bound in the binding pocket of
  • FIG. 7 shows enzymatic characterization of P. a. MurG. Change in fluorescence with time (slope) observed at various concentrations of enzyme.
  • FIG. 8 shows residues involved in binding UDP-GlcNAc and forming the Lipid I binding pocket.
  • FIG. 9 shows graphs for thermal stabilization of the P. a. MurG:UDP-
  • GlcNAc complex measured by Differential Scanning Fluorimetry Thermal Melt.
  • sociating with refers to a condition of proximity between a chemical entity or compound, or portions thereof, and a binding pocket or binding site on a protein.
  • the association may be non-covalent— wherein the juxtaposition is energetically favored by hydrogen bonding or van der Waals or electrostatic interactions - - or it may be covalent.
  • binding pocket refers to a region of a molecule or molecular complex, that, as a result of its shape and charge, favourably associates with another chemical entity or compound.
  • the term “pocket” includes, but is not limited to, cleft, channel or site.
  • P. a MurG protein may have binding pockets which include, but are not limited to, substrate-binding and antibody-binding pockets.
  • chemical entity refers to chemical compounds, complexes of at least two chemical compounds, and fragments of such compounds or complexes.
  • the chemical entity may be, for example, a ligand, a substrate, a nucleotide triphosphate, a nucleotide diphosphate, phosphate, a nucleotide, an agonist, antagonist, inhibitor, antibody, drug, peptide, protein or compound.
  • Constant substitutions refers to residues that are physically or functionally similar to the corresponding reference residues. That is, a conservative substitution and its reference residue have similar size, shape, electric charge, chemical properties including the ability to form covalent or hydrogen bonds, or the like. Preferred conservative substitutions are those fulfilling the criteria defined for an accepted point mutation in Dayhoff et al, Atlas of Protein Sequence and Structure, 5, pp. 345-352 (1978 & Supp.), which is incorporated herein by reference.
  • substitutions including but not limited to the following groups: (a) valine, glycine; (b) glycine, alanine; (c) valine, isoleucine, leucine; (d) aspartic acid, glutamic acid; (e) asparagine, glutamine; (f) serine, threonine; (g) lysine, arginine, methionine; and (h) phenylalanine, tyrosine.
  • groups including but not limited to the following groups: (a) valine, glycine; (b) glycine, alanine; (c) valine, isoleucine, leucine; (d) aspartic acid, glutamic acid; (e) asparagine, glutamine; (f) serine, threonine; (g) lysine, arginine, methionine; and (h) phenylalanine, tyrosine.
  • corresponding amino acid refers to a particular amino acid or analogue thereof in a P. a MurG protein homologue that corresponds to an amino acid in the P. a MurG protein structure.
  • the corresponding amino acid may be an identical, mutated, chemically modified, conserved, conservatively substituted, functionally equivalent or homologous amino acid when compared to the P. a MurG amino acid to which it corresponds.
  • corresponding amino acids may be identified by superimposing the backbone atoms of the amino acids in P. a MurG protien and P. a MurG protein homologue using well known software applications, such as QUANTA [Molecular Simulations, Inc., San Diego, CA ⁇ 1998,2000].
  • the corresponding amino acids may also be identified using sequence alignment programs such as the "bestfit" program available from the Genetics Computer Group which uses the local homology algorithm described by Smith and Waterman in Adv. Appl. Math., 2, 482 (1981), which is incorporated herein by reference.
  • domain refers to a portion of the P. a MurG protein or homologue that can be separated according to its biological function, for example, catalysis.
  • the domain is usually conserved in sequence or structure when compared to other related proteins.
  • the domain can comprise a binding pocket, or a sequence or structural motif.
  • sub-domain refers to a portion of the domain as defined above in the P. a MurG protein or homologue.
  • the "MurG substrate -binding pocket" of a molecule or molecular complex is defined by the structure coordinates of a certain set of amino acid residues present in the P. a MurG protein structure, as described below.
  • the substrate for the P. a MurG protein is a nucleotide such as UDP, TDP or GDP. This MurG substrate-binding pocket is in the catalytic active site of the MurG protein.
  • the term "part of a MurG substrate-binding pocket" or “part of a MurG- like substrate-binding pocket” refers to less than all of the amino acid residues that define the MurG or MurG-like substrate-binding pocket.
  • the structure coordinates of residues that constitute part of MurG or MurG-like substrate-binding pocket may be specific for defining the chemical environment of the binding pocket, or useful in designing fragments of an inhibitor that may interact with those residues.
  • the portion of residues may be key residues that play a role in substrate binding, or may be residues that are spatially related and define a three-dimensional compartment of the binding pocket.
  • the residues may be contiguous or non-contiguous in primary sequence.
  • part of the MurG or MurG-like substrate-binding pocket is at least five amino acid residues, preferably, R143, F242, 1243, M246 and E267.
  • the amino acids are selected from the group consisting of G 14, HI 5, F 17, N124, R143, L186, S189, F242, 1243, M246, A262, L263, T264 and E267.
  • the "catalytic domain of P. a. MurG protein” includes, for example, the catalytic active site which comprises the catalytic residues, the lipid I binding site and the UDP-GlcNAc binding site.
  • the catalytic domain in the MurG protein comprises residues from 1M to 357G (see SEQ ID NO: 1).
  • the term "part of a P. a. MurG catalytic domain” or “part of a MurG-like catalytic domain” refers to a portion of the MurG or MurG-like catalytic domain.
  • the structure coordinates of residues that constitute part of a MurG or MurG-like catalytic domain may be specific for defining the chemical environment of the domain, or useful in designing fragments of an inhibitor that may interact with those residues.
  • the portion of residues may be key residues that play a role in substrate or ligand binding, or may be residues that are spatially related and define a three- dimensional compartment of the domain.
  • the residues may be contiguous or noncontiguous in primary sequence.
  • part of a MurG catalytic domain can be the active site, the glycine-rich loop, the activation loop, or the catalytic loop.
  • the term "homologue of P. a. MurG” refers to a molecule or molecular complex that is homologous to MurG by three-dimensional structure or sequence.
  • homologues include a MurG protein from other species than P. a., a protein comprising a P. a. MurG-like substrate-binding pocket, a P. a. MurG catalytic domain, or a binding site of P. a. MurG protein.
  • the term "sufficiently homologous to P. a. MurG” refers to a protein that has a sequence homology of at least 35% compared to P. a. MurG protein. In one embodiment, the sequence homology is at least 40%, at least 60%, at least 80%, at least 90% or at least 95%.
  • part of a P. a. MurG protein or "part of a P. a. MurG homologue” refers to a portion of the amino acid residues of a P.a. MurG protein or homologue.
  • part of a P.a. MurG protein or homologue defines the binding pockets, domains, sub-domains, and motifs of the protein or homologue.
  • the structure coordinates of residues that constitute part of a P.a. MurG protein or homologue may be specific for defining the chemical environment of the protein, or useful in designing fragments of an inhibitor that may interact with those residues.
  • the portion of residues may also be residues that are spatially related and define a three-dimensional compartment of a binding pocket, motif or domain.
  • the residues may be contiguous or non-contiguous in primary sequence.
  • the portion of residues may be key residues that play a role in ligand or substrate binding, peptide binding, antibody binding, catalysis, structural stabilization or degradation.
  • P.a. MurG protein complex or "P.a. MurG homologue complex” refers to a molecular complex formed by associating the P.a. MurG protein or P.a. MurG homologue with a chemical entity, for example, a ligand, a substrate, nucleotide triphosphate, an agonist or antagonist, inhibitor, drug or compound.
  • the chemical entity is a nucleotide (e.g., UDP, TDP or GDP) and an inhibitor (e.g., a small organic molecule) for the ATP-binding pocket.
  • the term "motif refers to a portion of the P.a. MurG protein or homologue that defines a structural compartment or carries out a function in the protein, for example, catalysis, structural stabilization, or phosphorylation.
  • the motif may be conserved in sequence, structure and function when compared to other related proteins.
  • the motif can be contiguous in primary sequence or three-dimensional space.
  • the motif can comprise a-helices and/or ⁇ -sheets. Examples of a motif include but are not limited to a binding pocket, active site, phosphorylation lip or activation loop, the glycine-rich phosphate anchor loop, the catalytic loop [See, Xie et ah, Structure, 6, pp. 983-991 (1998); R. Giet and C. Prigent, J. Cell Sci., 112, pp. 3591-3601 (1999)], and the degradation box.
  • root mean square deviation means the square root of the arithmetic mean of the squares of the deviations from the mean. It is a way to express the deviation or variation from a trend or object.
  • the "root mean square deviation” defines the variation in the backbone of a protein from the backbone of P.a. MurG protein, a binding pocket, a motif, a domain, or portion thereof, as defined by the structure coordinates of P. a. MurG protein described herein.
  • the term "soaked” refers to a process in which the crystal is transferred to a solution containing the compound of interest. In certain embodiments, the compound is diffused into the crystal.
  • structure coordinates refers to Cartesian coordinates derived from mathematical equations related to the patterns obtained on diffraction of a monochromatic beam of X-rays by the atoms (scattering centers) of a protein or protein complex in crystal form.
  • the diffraction data are used to calculate an electron density map of the repeating unit of the crystal.
  • the electron density maps are then used to establish the positions of the individual atoms of the molecule or molecular complex. It would be readily apparent to those skilled in the art that all or part of the structure coordinates of FIGs. 1 and 2 may have a RMSD deviation of ⁇ 0.1-0.5 A because of standard error.
  • crystallization solution refers to a solution that promotes crystallization.
  • the solution comprises at least one agent, and may include a buffer, one or more salts, a precipitating agent, one or more detergents, sugars or organic compounds, lanthanide ions, a poly-ionic compound and/or a stabilizer.
  • homologue of .a.MurG or "P.a. MurG homologue” refers to a molecule that is homologous to P.a. MurG by three-dimensional structure or sequence and retains the biological activity of P.a. MurG.
  • homologues include, but are not limited to, P.a. MurG having one or more amino acid residues that are chemically modified, mutated, conservatively substituted, added, deleted or a combination thereof.
  • homologues include P.a. MurG proteins from other species than P.a. and proteins comprising a P.a. MurG-like binding pocket (e.g., sunstrate-binding pocket) or a P.a. MurG-like catalytic domain.
  • the term "homology model” refers to a structural model derived from known three-dimensional structure(s). Generation of the homology model, termed “homology modeling”, can include sequence alignment, residue replacement, residue conformation adjustment through energy minimization, or a combination thereof.
  • the term "three-dimensional structural information” refers to information obtained from the structure coordinates. Structural information generated can include the three-dimensional structure or graphical representation of the structure. Structural information can also be generated when subtracting distances between atoms in the structure coordinates, calculating chemical energies for a P.a. MurG protein or homologues thereof, calculating or minimizing energies for an association of a P.a. MurG protein or homologues thereof to a chemical entity.
  • the three-dimensional structure may be displayed as a graphical representation or used to perform computer modeling or fitting operations. In addition, the structure coordinates themselves may be used to perform computer modeling and fitting operations.
  • the present invention is directed to P.a. MurG crystals.
  • P.a. MurG crystals include crystals of P.a. MurG protein and crystals of P.a. MurG protein complexed with a chemical entity (e.g., ligand, substrate, or inhibitor of MurG protein).
  • the P.a. MurG protein is a single chain polypeptide and comprises a MurG polypeptide comprising amino acids according to SEQ ID NO: l :
  • FIG. 1054 the terms 'crystalline P.a. MurG' and 'P.a.
  • crystal both refer to crystallised P.a. MurG protein and are intended to be interchangeable.
  • the term "crystal” means that the crystal has an X-ray diffraction quality to provide X-ray diffraction data for determination of atomic coordinates to a resolution of 4 angstroms or less.
  • a typical crystal of the present invention provides X- ray diffraction data for determination of atomic coordinates to a resolution of 3 angstroms or less, specifically 2.4 angstroms or less, and more specifically 1.8 angstroms or less.
  • the crystal of the present invention provide X-ray diffraction data for determination of atomic coordinates to a resolution of between 4 angstroms and 1 angstrom, specifically between 3 angstroms and 1 angstrom, more specifically between 3 angstroms and 1.5 angstroms.
  • the substrate of P. a. MurG protein include a nucleotide, such as a UDP, TDP, or GDP.
  • the substrate of MurG protein is UDP-N- acetylglucosamine (UDP-GlcNAc).
  • the P. a. MurG crystals of the invention are in a tetragonal space group.
  • the amino acids of P. a. MurG protein of the P. a. MurG crystals have atomic coordinates according to FIG. 1 or FIG. 2.
  • the amino acids of P. a. MurG protein of the MurG crystals have atomic coordinates according to FIG. 1. In yet another specific embodiment, the amino acids of P. a. MurG protein of the MurG crystals have atomic coordinates according to FIG. 2.
  • the present invention is directed to crystallizable compositions of P. a. MurG protein.
  • the crystallizable compositions comprise a MurG protein concentrate and a reservoir solution.
  • the MurG protein concentrate includes a polypeptide comprising amino acid residues according to SEQ. ID. NO. l .
  • the MurG protein concentrate further includes a chemical entity (e.g., ligand, substrate, or inhibitor of MurG protein) that forms a complex with the polypeptide according to SEQ. ID. NO. l .
  • the polypeptide is in a concentration of 5 mg/ml to 20mg/ml. More typically, the polypeptide is in a concentration of 5 mg/ml to 15 mg/ml or 8 mg/ml to 14 mg/ml. More typically, the polypeptide is in a concentration of 10 mg/ml.
  • the reservoir includes a buffer having a pKa of 6.0 and 8.0.
  • Any suitable buffer known in the art can be employed in the present invention. Typical examples include a HEPES (4-(2-Hydroxyethyl)piperazine-l-ethanesulfonic acid, N-(2- Hydroxyethyl)piperazine-N'-(2-ethanesulfonic acid) buffer and a Tris
  • the buffer has a pKa ranged from 6.5 to 8.0. In another specific embodiment, the buffer has a pKa ranged from 6.5 to 7.5. In yet another specific embodiment, the buffer is a HEPES buffer, specifically HEPES buffer having a pH ranged from 6.8 to 7.5, more specifically HEPES buffer having a pH of 7.0.
  • the reservoir further includes a polyethylene glycol (PEG) as a precipitant.
  • PEG polyethylene glycol
  • a molecular weight of the polyethylene glycol can range from 200 to 20,000 Daltons. Typically, the molecular weight of the polyethylene glycol has a range of 1,000 to 20,000 Daltons. In a specific embodiment, the polyethylene glycol has a molecular weight of 2,500 to 4,000 Daltons. In another specific embodiment, the polyethylene glycol has a molecular weight of 3,000 to 4,000 Daltons. In another specific
  • the polyethylene glycol has a molecular weight of 3,350 Daltons.
  • a concentration of polyethylene glycol can range from 5% to 50% by volume. In one specific embodiment, the concentration of polyethylene glycol can vary from 8%-35% by volume. In another specific embodiment, the concentration of polyethylene glycol can vary from 10%-30% by volume. In yet another specific embodiment, the concentration of polyethylene glycol can vary from 8%-22% by volume. In yet another specific embodiment, the polyethylene glycol has a concentration of 20% by volume.
  • the reservoir can further include one or more other additives .
  • the reservoir further includes an additional precipitant.
  • a typical example of such additional precipitant is ammonium sulphate.
  • the reservoir further includes one or more reducing agents.
  • suitable reducing agents include DTT (tAreo-l,4-Dimercapto-2,3-butanediol) and TCEP (Tris(2-carboxyethyl)phosphine hydrochloride).
  • DTT tAreo-l,4-Dimercapto-2,3-butanediol
  • TCEP Tris(2-carboxyethyl)phosphine hydrochloride
  • the reducing agent is in a concentration of 1 mM to 20 mM. More typically, the reducing agent is in a concentration of 5 mM to 15 mM. Even more typically, the reducing agent is in a concentration of 10 mM.
  • the MurG protein concentrate of the crystallizable compositions further comprise a chemical entity, such as a ligand (e.g., a nucleotide) or substrate of P. a. MurG protein, an analogue thereof, or an inhibitor of P. a. MurG protein (e.g., a small organic compound), which forms a complex with the MurG protein.
  • the crystallizable compositions further comprise UDP-GlcNAc.
  • UDP-GlcNAc is in a molar ratio of 1 :600 to 1 :2,000 P. a. MurG protein to UDP-GlcNAc.
  • UDP-GlcNAc is in a molar ratio of 1 :600 to 1 : 1 ,400 P. a. MerG protein to UDP-GlcNAc. More typically, UDP-GlcNAc is in a molar ratio of 1 : 1 ,000 P. a. MerG protein to UDP-GlcNAc.
  • Such crystallizable compositions can produce crystals of P. a. MurG complex with UDP-GlcNAc.
  • the crystallizable compositions of the invention comprise a polypeptide comprising amino acid residues according to SEQ. ID. NO. l of P. a.
  • MurG protein in a concentration of 5 mg/ml to 15 mg/ml, a HEPES buffer having a pKa of 6.8 to 7.2 (e.g., 7.0), and polyethylene glycol having a molecular weight of 3,350 Daltons in a concentration of 15% to 25% by volume.
  • the crystallizable compositions of the invention comprise a polypeptide comprising amino acid residues according to SEQ. ID. NO. l of P. a.
  • MurG protein in a concentration of 5 mg/ml to 15 mg/ml; a HEPES buffer having a pKa of 6.8 to 7.2 (e.g., 7.0); polyethylene glycol having a molecular weight of 3,350 Daltons in a concentration of 15% to 25% by volume; and UDP-GlcNAc in a molar ratio of 1 :500 to 1 :700 MurG protein to UDP-GlcNAc. More specifically, the molar ratio of MurG protein to UDP- GlcNAc is 1 :600.
  • the crystallizable compositions of the invention comprise a polypeptide comprising amino acid residues according to SEQ. ID. NO.1 of P. a.
  • MurG protein in a concentration of 10 mg/ml in a buffer of 25mM HEPES having a pH of 7.0, NaCl (e.g., in 500 mM), DTT (e.g., 2.5mM), Decyl Maltopyranoside (e.g., 0.11%) by volume), and glycerol (e.g., 10%> by volume).
  • the reservoir of the crystallizable composition include 0.1M HEPES having a pH of 7.0, 20 %(v/v) PEG having a molecular weight of 3,350 Daltons, and 10 mM DTT.
  • the reservoir of the crystallizable composition further includes ammonium sulfate.
  • a typical concentration of ammonium sulfate is in a range of 80 mM to 120 mM, such as 100 mM.
  • the polypeptides of the P. a. MurG crystals and the polypeptides included in the MurG protein concentrate for the crystallizable compositions can further comprise non- . ⁇ . MurG amino acids, such as amino acids from a tag.
  • a P.a. MurG protein is overexpressed with a C-terminal tag, such as his- tag (SEQ ID NO:2: LEHHHHHH), to facilitate its purification using an affinity column.
  • the polypeptides of the MurG crystals and crystallizable compositions of the invention further comprise amino acids of his-tag at the C-terminal.
  • the amino acids of his-tag consist of LEHHHHHH (SEQ ID NO:2). The sequence listing of the amino acids of such MurG crystals and crystallizable
  • compositions of the invention that further comprises amino acids of the his-tag is as follows (SEQ ID NO:3):
  • FIG. present invention further provides methods of forming P.a.
  • MurG crystals e.g., crystals of P.a. MurG protein and crystals of P.a. MurG complexes with a chemical entity.
  • the methods comprise the steps of: a) combining a MurG protein concentrate with a reservoir solution; and b) inducing crystal formation to produce crystals by subjecting the resulting mixture to devices or conditions, which promote crystallization.
  • the methods comprise the steps of : a) preparing a crystallizable composition as described above; and inducing crystal formation to produce crystals by subjecting the resulting mixture to devices or conditions which promote crystallization.
  • the features, including the specific features, of the MurG protein concentrate and reservoir solution are each and independently as described above.
  • the method comprises the steps of: a) combining a polypeptide comprising amino acid residues according to SEQ ID NO: l of P.a. MurG protein with UDP-GlcNAc in a molar ratio of 1 :600 to 1 :2,000 MurG protein to UDP-GlacNac to result in a mixture of MurG protein and UDP-GlcNAc, b) combining the mixture of MurG protein and UDP-GlcNAc with a reservoir solution that includes a buffer having a pKa of 6.5 to 8.0 and a polyethylene glycol in a concentration from 5% to 50% by volume, and c) inducing crystal formation to produce a crystal of MurG protein.
  • the features, including the specific features, of the polypeptide, UDP-GlcNAc, and reservoir solution are each and independently as described above.
  • the UDP- GlcNAc is in a molar ratio of 1 :600 to 1 : 1 ,400 MurG protein to UDP-GlcNAc.
  • the UDP-GlcNAc is in a molar ratio of 1 : 1000 MurG protein to UDP-GlcNAc.
  • the polypeptide further comprise additional amino acids of a tag consisting of LEHHHH (SEQ ID NO:2) at the C-terminal.
  • the methods comprise: a) concentrating a MurG protein solution that includes a MurG polypeptide as described above in a buffer solution; b) mixing the MurG protein solution with UDP-GlcNAc in a 1 :600 - 1 :2,000 molar ratio, such as 1 :600 to 1 : 1 ,400 molar ratio or 1 : 1000 molar ratio, to generate a mixture of the MurG polypeptide and UDP-GlcNAc; c) mixing equal volumes of the mixture of the MurG polypeptide and UDP-GlcNAc with a reservoir solution as described above; and d) inducing crystal formation using hanging drop vapour diffusion.
  • the reservoir solution comprises 0.1M HEPES having pH7.0, 20% (v/v) PEG (polyethylene glycol) having a molecular weight of 3,350 Daltons, and lOmM DTT.
  • the MurG polypeptide is concentrated to lOmg/ml in a buffer that includes 25mM HEPES pH7.0, 500mM NaCl, 2.5mM DTT, 0.11% Decyl Maltopyranoside and 10% glycerol.
  • the MurG crystals are produced using the crystal formation method described in Example 1.
  • the concentrated solutions of MurG protein can be induced to crystallise by several methods including, but not limited to, vapour diffusion, liquid diffusion, batch crystallisation, dialysis or a combination thereof. In a specific embodiment, vapour diffusion is employed.
  • MurG protein become supersaturated and form MurG crystals at a constant temperature by diffusion of solvent(s), in which the MurG protein is not generally soluble, into the MurG protein solutions.
  • vapour diffusion is performed under a controlled temperature in a range of 12°C ⁇ 1 °C to 37°C ⁇ 1 °C, specifically from 16°C ⁇ 1 °C to 24°C ⁇ 1 °C, more specifically at a constant temperature of 20°C ⁇ 1 °C.
  • Devices for promoting crystallization can include but are not limited to the hanging-drop, sitting-drop, dialysis or microtube batch devices.
  • the hanging-drop or sitting-drop methods produce crystals by vapor diffusion.
  • the hanging-drop, sitting-drop, and some adaptations of the microbatch methods [D'Arcy et al, J. Cryst. Growth, 168, pp. 175-180 (1996) and Chayen, J. Appl. Cryst., 30, pp. 198-202 (1997)] produce crystals by vapor diffusion.
  • the hanging drop and sitting drop containing the crystallizable composition is equilibrated in a reservoir containing a higher or lower concentration of the precipitant. As the drop approaches equilibrium with the reservoir, the saturation of protein in the solution leads to the formation of crystals.
  • Microseeding or seeding may be used to obtain larger, or better quality
  • Microseeding involves the use of crystalline particles to provide nucleation under controlled crystallization conditions. Microseeding is used to increase the size and quality of crystals. In this instance, micro -crystals are crushed to yield a stock seed solution. The stock seed solution is diluted in series. Using a needle, glass rod or strand of hair, a small sample from each diluted solution is added to a set of equilibrated drops containing a protein concentration equal to or less than a concentration needed to create crystals without the presence of seeds. The aim is to end up with a single seed crystal that will act to nucleate crystal growth in the drop.
  • the methods further include the steps of producing and purifying P. a. MurG protein.
  • the methods further include the steps of producing and purifying P. a. MurG protein.
  • the methods further include the steps of producing and purifying P. a. MurG protein.
  • characterization step includes a) size-exclusion chromatography and dynamic light scattering, b) mass spectrometry analysis, c) differential scanning fluorescence study of UDP-GlcNAc binding to P. a. MurG protein, and/or d) enzymatic characterization of P. a. MurG protein.
  • the P. a. MurG protein can be produced by any well-known method, including synthetic methods, such as solid phase, liquid phase and combination solid phase/liquid phase syntheses; recombinant DNA methods, including cDNA cloning, optionally combined with site directed mutagenesis; and/or purification of the natural products.
  • the protein is overexpressed from E. coli system.
  • MurG in E. coli cells see the experimental section below. This fining facilitates obtaining sufficient quantities of isolated and/or purified P. a. MurG protein.
  • the expression of P.a. MurG protein in E. coli host cells can be performed, for example by expressing the P.a. MurG gene cloned into pET21b expression vector and transformed into an E. coli host cell.
  • the P.a. MurG protein can be overexpressed with a C-terminal his-tag (LEHHHHHH) which allows the protein to be purified using a His-tag affinity column.
  • the protein is then crystallised as described above and demonstrated in the experimental section.
  • the atomic coordinates can then be determined using X-ray diffraction and methods to those skilled in the art.
  • the present invention also provides crystals of mutants and homo logs of
  • P.a. MurG protein, and fragments thereof and crystals of molecular complexes of mutants and homo logs of P.a. MurG protein, and fragments thereof with a chemical entity such as a nucleotide or substrate, or analogue thereof.
  • the present invention also provides crystallizable composition for, and methods of, making such crystals.
  • Such variations include, but are not limited to, adjusting pH, protein concentration and/or crystallization temperature, changing the identity or concentration of salt and/or precipitant used, using a different method of crystallization, or introducing additives such as detergents ⁇ e.g., TWEEN 20 (monolaurate), LDAO, Brij 30 (4 lauryl ether)), sugars ⁇ e.g., glucose, maltose), organic compounds ⁇ e.g., dioxane, dimethylformamide), lanthanide ions or polyionic compounds that aid in crystallization.
  • detergents ⁇ e.g., TWEEN 20 (monolaurate), LDAO, Brij 30 (4 lauryl ether)
  • sugars ⁇ e.g., glucose, maltose
  • organic compounds ⁇ e.g., dioxane, dimethylformamide
  • lanthanide ions lanthanide ions or polyionic compounds that aid in crystallization.
  • the x-ray structure of the P.a. MurG consist of two ⁇ / ⁇ domains separated by a cleft 20 angstroms deep and 16 angstroms across at its widest point. Individual N and C domains are similar in the presence and absence of substrate, although the C-terminal domain of the Apo structure is significantly disordered without substrate bound (see Fig6). There is a slight change in the relative orientation of the two domains so that in the presence of UDP-GlcNAc, MurG adopts a more closed conformation. The conformational change results mostly from a rigid body domain movement. Binding sites for Lipid I and UDP-GlcNAc are contained within the cleft between the N- and C-terminal domains of MurG.
  • UDP-GlcNAc:MurG complex shows that the UDP-GlcNAc moiety mostly contacts the C-terminal domain.
  • UDP-GlcNAc makes several contacts to these helices as well as to the loops connecting them to the adjacent ⁇ strands of the N- and C-terminal domains.
  • the uracil base is bound in a pocket flanked on one edge by a helix and the strand connecting the - and C-terminal domains.
  • a set of structure coordinates for a molecule or a molecular-complex or a portion thereof is a relative set of points that define a shape in three dimensions.
  • an entirely different set of coordinates could define a similar or identical shape.
  • slight variations in the individual coordinates will have little effect on overall shape. In terms of binding pockets, these variations would not be expected to significantly alter the nature of ligands that could associate with those pockets.
  • the variations in coordinates discussed above may be generated because of mathematical manipulations of the P. a. MurG protein structure coordinates.
  • the structure coordinates set forth in FIGs. 1 and 2 could be manipulated by crystallographic permutations of the structure coordinates, fractionalization of the structure coordinates, integer additions or subtractions to sets of the structure coordinates, inversion of the structure coordinates or any combination of the above.
  • modifications in the crystal structure due to mutations, additions, substitutions, and/or deletions of amino acids, or other changes in any of the components that make up the crystal could also account for variations in structure coordinates. If such variations are within a certain root mean square deviation as compared to the original coordinates, the resulting three-dimensional shape is considered encompassed by this invention.
  • a ligand or substrate that bound to the binding pocket of P. a. MurG protein would also be expected to bind to another binding pocket whose structure coordinates defined a shape that fell within the acceptable root mean square deviation.
  • the procedure used in ProFit to compare structures includes the following steps: 1) load the structures to be compared; 2) specify selected residues of interest; 3) define the atom equivalences in the selected residues; 4) perform a fitting operation on the selected residues; and 5) analyze the results.
  • Each structure in the comparison is identified by a name.
  • One structure is identified as the target (i.e., the fixed structure); all remaining structures are working structures (i.e., moving structures). Since atom equivalency within the above programs is defined by user input, for the purpose of this invention we will define equivalent atoms as protein backbone atoms (N, Ca, C and O) for P.a. MurG protein amino acids and corresponding amino acids in the structures being compared.
  • the corresponding amino acids may be identified by sequence alignment programs such as the "bestfit" program available from the Genetics Computer Group which uses the local homology algorithm described by Smith and Waterman in Advances in Applied Mathematics 2, 482 (1981), which is incorporated herein by reference.
  • a suitable amino acid sequence alignment will require that the proteins being aligned share minimum percentage of identical amino acids.
  • a first protein being aligned with a second protein should share in excess of about 35% identical amino acids with the second protein [Hanks et ah, Science, 241, 42 (1988); Hanks and Quinn, Meth. Enzymol, 200, 38 (1991)].
  • the identification of equivalent residues can also be assisted by secondary structure alignment, for example, aligning the a-helices, ⁇ -sheets in the structure.
  • the program Swiss-Pdb Viewer has its own best fit algorithm that is based on secondary sequence alignment. [ 0089 ] When a rigid fitting method is used, the working structure is translated and rotated to obtain an optimum fit with the target structure. The fitting operation uses an algorithm that computes the optimum translation and rotation to be applied to the moving structure, such that the root mean square difference of the fit over the specified pairs of equivalent atom is an absolute minimum. This number, given in angstroms, is reported by the above programs.
  • the Swiss-Pdb Viewer program sets an RMSD cutoff for eliminating pairs of equivalent atoms that have high RMSD values.
  • an RMSD cutoff value can be used to exclude pairs of equivalent atoms with extreme individual RMSD values.
  • the RMSD cutoff value can be specified by the user.
  • any molecule, molecular complex, binding pocket, motif, domain thereof or portion thereof that is within a root mean square deviation for backbone atoms (N, Ca, C, O) when superimposed on the relevant backbone atoms described by structure coordinates listed in FIGs. 1 and 2 are
  • a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein said data comprises all or part of P.a. MurG binding pocket defined by structure coordinates of P.a. MurG amino acids G14, H15, F17, N124, R143, LI 86, SI 89, F242, 1243, M246, A262, L263, T264 and E267, according to FIG. 1 or FIG. 2; or a molecule or molecular complex comprising all or part of a P.a.
  • MurG-like binding pocket defined by structure coordinates of corresponding amino acids that are identical to said MurG amino acids, wherein the root mean square deviation of the backbone atoms between said corresponding amino acids and said MurG amino acids is not more than 3.0 A ⁇ 0.1 A, 2.5 A ⁇ 0.1 A, 2.0 A ⁇ 0.1 A, 1.5 A ⁇ 0.1 A, or 1.0 A ⁇ 0.1 A; or a molecule or molecular complex comprising all or part of a P.a.
  • MurG-like binding pocket defined by structure coordinates of a set of corresponding amino acids, wherein the root mean square deviation of the backbone atoms between said set of corresponding amino acids and said MurG amino acids is not more than 1.1, 0.9, 0.7 or 0.5 A, and wherein at least one of said corresponding amino acids is not identical to the MurG amino acid to which it corresponds.
  • a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein said data comprises all or part of any molecule or molecular complex discussed in the above paragraphs.
  • a computer comprising:
  • a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein said data comprises all or part of a P.a. MurG substrate-binding pocket defined by structure coordinates of P. a. MurG amino acids R143, F242, 1243, M246 and E267, according to FIG. 1 or FIG. 2; or a molecule or molecular complex comprising all or part of a P. a. MurG-like substrate-binding pocket defined by structure coordinates of corresponding amino acids that are identical to said P.a. MurG amino acids, wherein the root mean square deviation of the backbone atoms between said corresponding amino acids and said P.a.
  • MurG amino acids is not more than 3.0 A ⁇ 0.1 A, 2.5 A ⁇ 0.1 A, 2.0 A ⁇ 0.1 A, 1.5 A ⁇ 0.1 A, or 1.0 A ⁇ 0.1 A; or a molecule or molecular complex comprising all or part of a P.a. MurG-like substrate-binding pocket defined by structure coordinates of a set of corresponding amino acids, wherein the root mean square deviation of the backbone atoms between said set of corresponding amino acids and said P.a. MurG amino acids is not more than 0.5 A ⁇ 0.1 A, and wherein at least one of said corresponding amino acids is not identical to the P.a. MurG amino acid to which it corresponds.
  • a computer comprises a working memory for storing instructions for processing the machine-readable data, a central-processing unit coupled to the working memory and to said machine -readable data storage medium for processing said machine-readable data into the three- dimensional structure.
  • the computer further comprises a display for displaying the three-dimensional structure as a graphical representation.
  • the computer further comprises commercially available software program to display the graphical representation. Examples of software programs include but are not limited to QUANTA® [Molecular Simulations, Inc., San Diego, CA; now part of Accelrys, San Diego, CA], O [Jones et al., Acta Cryst. A, 47, pp. 110-119 (1991)] and RIBBONS [M. Carson, J. Appl. Cryst., 24, pp. 958-961 (1991)], which are incorporated herein by reference.
  • a computer comprises executable code for:
  • the computer further comprises executable code for: (d) controlling a unit for assaying the small molecules determined in step (c) in a protein binding assay.
  • Using structural coordinates may include displaying the coordinates graphically or manipulating the structure coordinates with computational programs.
  • This invention also provides a computer comprising:
  • a machine -readable data storage medium comprising a data storage material encoded with machine-readable data, wherein the data defines any one of the above binding pockets or protein of the molecule or molecular complex;
  • a central processing unit coupled to the working memory and to the machine-readable data storage medium for processing said machine readable data as well as an instruction or set of instructions for generating three-dimensional structural information of said binding pocket or protein;
  • Three-dimensional data generation may be provided by an instruction or set of instructions such as a computer program or commands for generating a three- dimensional structure or graphical representation from structure coordinates, or by subtracting distances between atoms, calculating chemical energies for a P.a. MurG protein or homologues thereof, or for complexes of such P. a.
  • MurG protein or homologues thereof with a chemical entity or calculating or minimizing energies for an association of a P. a. MurG protein or homologues thereof to a chemical entity.
  • the graphical representation can be generated or displayed by commercially available software programs. Examples of software programs include but are not limited to QUANTA® [Molecular Simulations, Inc., San Diego, CA; now part of Accelrys, San Diego, CA], O [Jones et al., Acta Crystallogr. A47, pp. 110-119 (1991)] and RIBBONS [Carson, J. Appl. Crystallogr., 24, pp. 9589-961 (1991)], Coot [Emsley, P. and Cowtan, K. Acta Crystallogr.
  • Information about said binding pocket or information produced by using said binding pocket can be outputted through display terminals, touchscreens, printers, modems, facsimile machines, CD-ROMs or disk drives.
  • the information can be in graphical or alphanumeric form.
  • System 10 includes a computer 11 comprising a central processing unit ("CPU") 20, a working memory 22 which may be, e.g., RAM (random-access memory) or “core” memory, mass storage memory 24 (such as one or more disk drives or CD-ROM drives), one or more cathode -ray tube (“CRT") display terminals 26, one or more keyboards 28, one or more input lines 30, and one or more output lines 40, all of which are interconnected by a conventional bi-directional system bus 50.
  • CPU central processing unit
  • working memory 22 which may be, e.g., RAM (random-access memory) or “core” memory
  • mass storage memory 24 such as one or more disk drives or CD-ROM drives
  • CRT cathode -ray tube
  • Machine -readable data of this invention may be inputted via the use of a modem or modems 32 connected by a telephone line or dedicated data line 34.
  • the input hardware 36 may comprise CD-ROM drives or disk drives 24.
  • keyboard 28 may also be used as an input device.
  • Output hardware 46 coupled to computer 11 by output lines 40, may similarly be implemented by conventional devices.
  • output hardware 46 may include CRT display terminal 26 for displaying a graphical representation of a binding pocket of this invention using a program such as QUANTA® [Molecular Simulations, Inc., San Diego, CA; now part of Accelrys, San Diego, CA] as described herein.
  • Output hardware might also include a printer 42, so that hard copy output may be produced, or a disk drive 24, to store system output for later use.
  • Output hardware may also include a display terminal, a CD or DVD recorder, ZIPTM or JAZTM drive, or other machine-readable data storage device.
  • CPU 20 coordinates the use of the various input and output devices 36, 46, coordinates data accesses from mass storage 24 and accesses to and from working memory 22, and determines the sequence of data processing steps.
  • a number of programs may be used to process the machine -readable data of this invention. Such programs are discussed in reference to the computational methods of drug discovery as described herein. Specific references to components of the hardware system 10 are included as appropriate throughout the following description of the data storage medium.
  • FIG. 4 shows a cross section of a magnetic data storage medium 100 which can be encoded with a machine -readable data that can be carried out by a system such as system 10 of FIG. 3.
  • Medium 100 can be a conventional floppy diskette or hard disk, having a suitable substrate 101, which may be conventional, and a suitable coating 102, which may be conventional, on one or both sides, containing magnetic domains (not visible) whose polarity or orientation can be altered magnetically.
  • Medium 100 may also have an opening (not shown) for receiving the spindle of a disk drive or other data storage device 24.
  • the magnetic domains of coating 102 of medium 100 are polarized or oriented so as to encode in a manner that may be conventional, machine readable data such as that described herein, for execution by a system such as system 10 of FIG. 3.
  • FIG. 5 shows a cross section of an optically-readable data storage medium 110 which also can be encoded with such a machine-readable data, or set of instructions, which can be carried out by a system such as system 10 of FIG. 3.
  • Medium 110 can be a conventional compact disk read only memory (CD-ROM) or a rewritable medium such as a magneto-optical disk that is optically readable and magneto-optically writable.
  • Medium 100 preferably has a suitable substrate 111, which may be
  • a suitable coating 112 which may be conventional, usually of one side of substrate 111.
  • coating 112 is reflective and is impressed with a plurality of pits 113 to encode the machine -readable data.
  • the arrangement of pits is read by reflecting laser light off the surface of coating 112.
  • a protective coating 114 which preferably is substantially transparent, is provided on top of coating 112.
  • coating 112 has no pits 113, but has a plurality of magnetic domains whose polarity or orientation can be changed magnetically when heated above a certain temperature, as by a laser (not shown).
  • the orientation of the domains can be read by measuring the polarization of laser light reflected from coating 112.
  • the arrangement of the domains encodes the data as described above.
  • the data defines the above-mentioned binding pockets by comprising the structure coordinates of said amino acid residues according to FIG. 1 or FIG. 2.
  • the structure coordinates generated for P. a. MurG protein or a homologue thereof one of its binding pockets, motifs, domains, or portion thereof, it is at times necessary to convert them into a three-dimensional shape or to generate three- dimensional structural information from them. This is achieved through the use of commercially or publicly available software that is capable of generating a three- dimensional structure of molecules or portions thereof from a set of structure coordinates.
  • the three-dimensional structure may be displayed as a graphical representation.
  • this invention provides a machine-readable data storage medium comprising a data storage material encoded with machine readable data.
  • a machine programmed with instructions for using said data is capable of generating a three-dimensional structure of any of the molecule or molecular complexes, or binding pockets thereof, that are described herein.
  • this invention also provides a computer for producing a three-dimensional structure of:
  • MurG binding pocket defined by structure coordinates of P. a. MurG amino acids G14, H15, F17, N124, R143, L186, S189, F242, 1243, M246, A262, L263, T264 and E267, according to FIG. 1 or FIG. 2;
  • MurG-like binding pocket defined by structure coordinates of corresponding amino acids that are identical to said P.
  • MurG amino acids wherein the root mean square deviation of the backbone atoms between said corresponding amino acids and said P. a.
  • MurG amino acids is not more than 3.0 A ⁇ 0.1 A, 2.5 A ⁇ 0.1 A, 2.0 A ⁇ 0.1 A, 1.5 A ⁇ 0.1 A, or 1.0 A ⁇ 0.1 A; and/or
  • MurG-like binding pocket defined by structure coordinates of a set of corresponding amino acids, wherein the root mean square deviation of the backbone atoms between said set of corresponding amino acids and said P. a.
  • MurG amino acids is not more than 0.6 A, 0.5 A or 0.4 A, and wherein at least one of said corresponding amino acids is not identical to the P. a. MurG amino acid to which it corresponds,
  • a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein said data comprises all or part of a P. a. MurG binding pocket defined by structure coordinates of P. a. MurG amino acids R143, F242, 1243, M246 and E267, according to FIG. 1 or FIG. 2; all or part of a P.a. MurG -like binding pocket defined by structure coordinates of corresponding amino acids that are identical to said P. a. MurG amino acids, wherein the root mean square deviation of the backbone atoms between said corresponding amino acids and said P. a.
  • MurG amino acids is not more than 3.0 A ⁇ 0.1 A, 2.5 A ⁇ 0.1 A, 2.0 A ⁇ 0.1 A, 1.5 A ⁇ 0.1 A, or 1.0 A ⁇ 0.1 A; or all or part of a P. a.
  • MurG -like binding pocket defined by structure coordinates of a set of corresponding amino acids, wherein the root mean square deviation of the backbone atoms between said set of corresponding amino acids and said P.a.
  • MurG amino acids is not more than 0.6 A, 0.5 A or 0.4 A, and wherein at least one of said corresponding amino acids is not identical to the P.a. MurG amino acid to which it corresponds; and
  • the computer is also for producing the three-dimensional structure of the aforementioned molecules and molecular complexes and comprises the corresponding machine-readable data storage mediums.
  • the three-dimensional structure is displayed as a graphical representation.
  • the structure coordinates of said molecules or molecular complexes are produced by homology modeling of at least a portion of the structure coordinates of FIG. 1 or FIG. 2.
  • Homology modeling can be used to generate structural models of P.a. MurG homologues or other homologous proteins based on the known structure of P.a. MurG. This can be achieved by performing one or more of the following steps: performing sequence alignment between the amino acid sequence of an unknown molecule against the amino acid sequence of P.a. MurG; identifying conserved and variable regions by sequence or structure; generating structure co-ordinates for structurally conserved residues of the unknown structure from those of P.a. MurG;
  • MurG and homologues thereof can be aligned relative to each other, it is possible to construct models of the structures of P. a. MurG homologues, particularly in the regions of the active site, using the P. a. MurG structure.
  • Software programs that are useful in homology modeling include XALIGN [Wishart, D. S. et al, Comput. Appl. BioscL, 10, pp. 687-88 (1994)] and CLUSTAL W Alignment Tool [Higgins D. G. et al., Methods Enzymol, 266, pp. 383-402 (1996)]. See also, U.S. Patent No. 5,884,230. These references are incorporated herein by reference.
  • Homology modeling can be performed using, for example, the computer programs SWISS-MODEL available through Glaxo Wellcome Experimental Research in Geneva, Switzerland; WHATIF available on EMBL servers; Schnare et al, J. Mol. Biol, 256: 701-719 (1996); Blundell et al, Nature 326: 347-352 (1987); Fetrow and Bryant, Bio/Technology 11 :479-484 (1993); Greer, Methods in Enzymology 202: 239-252 (1991); and Johnson et al, Crit. Rev. Biochem. Mol Biol. 29: 1-68 (1994).
  • An example of homology modeling can be found, for example, in Szklarz G.D., Life Sci. 61 : 2507-2520 (1997). These references are incorporated herein by reference.
  • data capable of generating the three dimensional structure of, for example, the above molecules or molecular complexes, or binding pockets thereof can be stored in a machine-readable storage medium, which is capable of displaying three-dimensional structural information or a graphical three-dimensional representation of the structure.
  • the P. a. MurG structure coordinates or the three-dimensional graphical representation generated from these coordinates may be used in conjunction with a computer for a variety of purposes, including drug discovery.
  • the computer is programmed with software to translate those coordinates into the three- dimensional structure of P. a. MurG.
  • the structure encoded by the data may be computationally evaluated for its ability to associate with chemical entities.
  • Chemical entities that associate with P. a. MurG may inhibit or activate P. a. MurG or its homologues, and are potential drug candidates.
  • the structure encoded by the data may be displayed in a graphical three-dimensional representation on a computer screen. This allows visual inspection of the structure, as well as visual inspection of the structure's association with chemical entities.
  • the invention provides a method for designing, selecting and/or optimizing a chemical entity that binds to the molecule or molecular complex comprising the steps of:
  • Three-dimensional structural information in step (a) may be generated by instructions such as a computer program or commands that can generate a three- dimensional structure or graphical representation; subtract distances between atoms; calculate chemical energies for a P. a. MurG protein or homologues thereof, or for a complex of P. a. MurG protein or homologues thereof with a chemical entity; or calculate or minimize energies of an association of P. a. MurG protein or homologues thereof to a chemical entity.
  • a computer program or commands that can generate a three- dimensional structure or graphical representation; subtract distances between atoms; calculate chemical energies for a P. a. MurG protein or homologues thereof, or for a complex of P. a. MurG protein or homologues thereof with a chemical entity; or calculate or minimize energies of an association of P. a. MurG protein or homologues thereof to a chemical entity.
  • the graphical representation can be generated or displayed by commercially available software programs.
  • Examples of software programs include but are not limited to QUANTA® [Molecular Simulations, Inc., San Diego, CA; now part of Accelrys, San Diego, CA], O [Jones et al., Acta Crystallogr. A47, pp. 110-119 (1991)] and RIBBONS [Carson, J. Appl. Crystallogr., 24, pp. 9589-961 (1991)], which are incorporated herein by reference.
  • Certain software programs may imbue this representation with physico-chemical attributes which are known from the chemical composition of the molecule, such as residue charge, hydrophobicity, torsional and rotational degrees of freedom for the residue or segment, etc. Examples of software programs for calculating chemical energies are described below.
  • Another embodiment of the invention provides a method for evaluating the potential of a chemical entity to associate with the molecule or molecular complex as described previously.
  • This method comprises the steps of: a) employing computational means to perform a fitting operation between the chemical entity and the molecule or molecular complex described before; b) analyzing the results of said fitting operation to quantify the association between the chemical entity and the molecule or molecular complex; and, optionally, c) outputting said quantified association to a suitable output hardware, such as a CRT display terminal, a printer, a CD or DVD recorder, ZIPTM or JAZTM drive, a disk drive, or other machine-readable data storage device, as described previously.
  • the method may further comprise generating a three-dimensional structure, graphical representation thereof, or both, of the molecule or molecular complex prior to step a).
  • the method is for evaluating the ability of a chemical entity to associate with the binding pocket of a molecule or molecular complex.
  • the method comprises the steps of:
  • the invention provides a method of using a computer for evaluating the ability of a chemical entity to associate with the molecule or molecular complex, wherein said computer comprises a machine-readable data storage medium comprising a data storage material encoded with said structure coordinates defining said binding pocket and means for generating a three-dimensional graphical representation of the binding pocket, and wherein said method comprises the steps of:
  • the above method may further comprise the steps of:
  • the structure coordinates of the P. a. MurG binding pockets may be utilized in a method for identifying an agonist or antagonist of a molecule comprising a binding pocket of P. a. MurG.
  • the method comprises steps of: a) using a three-dimensional structure of the molecule or molecular complex to design, select or optimize a chemical entity;
  • step a) is performed using a graphical representation of the binding pocket or portion thereof of the molecule or molecular complex.
  • the three-dimensional structure is displayed as a graphical representation.
  • the method comprises the steps of:
  • Obtaining said chemical entity includes synthesizing the chemical entity, obtaining a commercially available product, or isolating the chemical entity.
  • the present invention permits the use of molecular design techniques to identify, select and design chemical entities, including inhibitory compounds, capable of binding to P.a. MurG or P.a. MurG-like binding pockets, motifs and domains.
  • Applicants ' elucidation of binding pockets on P.a. MurG can provide the necessary information for designing new chemical entities and compounds that may interact with P.a. MurG or P.a. MurG-like substrate in whole or in part.
  • the design of chemical entities that bind to or inhibit P.a. MurG binding pockets according to this invention generally involves consideration of two factors. First, the entity must be capable of physically and structurally associating with parts or all of the P.a. MurG binding pockets. Non-covalent molecular interactions important in this association include hydrogen bonding, van der Waals interactions, hydrophobic interactions and electrostatic interactions.
  • the entity must be able to assume a conformation that allows it to associate with the P.a. MurG binding pockets directly. Although certain portions of the entity will not directly participate in these associations, those portions of the entity may still influence the overall conformation of the molecule. This, in turn, may have a significant impact on potency.
  • Such conformational requirements include the overall three-dimensional structure and orientation of the chemical entity in relation to all or a portion of the binding pocket, or the spacing between functional groups of an entity comprising several chemical entities that directly interact with the P.a. MurG or P.a. MurG -like binding pockets.
  • the potential inhibitory or binding effect of a chemical entity on P.a. MurG binding pockets may be analyzed prior to its actual synthesis and testing by the use of computer modeling techniques. If the theoretical structure of the given entity suggests insufficient interaction and association between it and the P.a. MurG binding pockets, testing of the entity is obviated. However, if computer modeling indicates a strong interaction, the compound may then be synthesized and tested for its ability to bind to a P. a. MurG binding pocket. This may be achieved by testing the ability of the molecule to inhibit P. a. MurG protein using the assays described in Example 4. In this manner, synthesis of inoperative compounds may be avoided.
  • a potential inhibitor of a P. a. MurG binding pocket may be
  • One skilled in the art may use one of several methods to screen chemical entities or fragments for their ability to associate with a P. a. MurG binding pocket. This process may begin by visual inspection of, for example, a P. a. MurG binding pocket on the computer screen based on the P. a. MurG structure coordinates in FIG. 1 or FIG.2, or other coordinates which define a similar shape generated from the machine-readable storage medium. Selected fragments or chemical entities may then be positioned in a variety of orientations, or docked, within that binding pocket as defined supra.
  • Docking may be accomplished using software such as QUANTA® [Molecular Simulations, Inc., San Diego, CA; now part of Accelrys, San Diego, CA] and SYBYL® [Tripos Associates, St. Louis, MO], followed by energy minimization and molecular dynamics with standard molecular mechanics force fields, such as CHARMm® [Accelrys, San Diego, CA] and AMBER.
  • QUANTA® Molecular Simulations, Inc., San Diego, CA; now part of Accelrys, San Diego, CA
  • SYBYL® Tripos Associates, St. Louis, MO
  • energy minimization and molecular dynamics with standard molecular mechanics force fields such as CHARMm® [Accelrys, San Diego, CA] and AMBER.
  • Specialized computer programs may also assist in the process of selecting fragments or chemical entities. These include:
  • GRID [P. J. Goodford, "A Computational Procedure for Determining Energetically Favorable Binding Sites on Biologically Important Macromolecules", J. Med. Chem., 28, pp. 849-857 (1985)]. GRID is available from Oxford University, Oxford, UK.
  • MCSS A. Miranker et al, "Functionality Maps of Binding Sites: A Multiple Copy Simultaneous Search Method.” Proteins: Structure, Function and Genetics, 11, pp. 29-34 (1991)]. MCSS is available from Molecular Simulations, San Diego, CA. 3. AUTODOCK [D. S. Goodsell et al, "Automated Docking of Substrates to Proteins by Simulated Annealing", Proteins: Structure, Function, and Genetics, 8, pp. 195-202 (1990)]. AUTODOCK is available from Scripps Research Institute, La Jolla, CA.
  • DOCK [I. D. Kuntz et al, "A Geometric Approach to Macromolecule-Ligand Interactions", J. Mol. Biol, 161, pp. 269-288 (1982)]. DOCK is available from University of California, San Francisco, CA.
  • Useful programs to aid one of skill in the art in building an inhibitor of a P. a. MurG binding pocket in a step-wise fashion, including one fragment or chemical entity at a time include:
  • CAVEAT [P. A. Bartlett et al, "CAVEAT: A Program to Facilitate the Structure-Derived Design of Biologically Active Molecules", in Molecular Recognition in Chemical and Biological Problems", Special Pub., Royal Chem. Soc, 78, pp. 182-196 (1989); G. Lauri and P. A. Bartlett, "CAVEAT: a Program to Facilitate the Design of Organic Molecules", J. Comput. Aided Mol. Des. , 8, pp. 51-66 (1994)].
  • CAVEAT is available from the University of California, Berkeley, CA. 2.
  • 3D Database systems such as ISIS (MDL Information Systems, San Leandro, CA). This area is reviewed in Y. C. Martin, "3D Database Searching in Drug Design", J. Med. Chem., 35, pp. 2145-2154 (1992).
  • HOOK A Program for Finding Novel Molecular Architectures that Satisfy the Chemical and Steric Requirements of a Macromolecule Binding Site", Proteins: Struct., Fund, Genet., 19, pp. 199-221 (1994)]. HOOK is available from Molecular Simulations, San Diego, CA.
  • inhibitory or other P. a. MurG binding compounds may be designed as a whole or "de novo" using either an empty binding pocket or optionally including some portion(s) of a known inhibitor(s).
  • de novo ligand design methods including:
  • LUDI H.-J. Bohm, "The Computer Program LUDI: A New Method for the De Novo Design of Enzyme Inhibitors", J. Comp. Aid. Molec. Design, 6, pp. 61-78 (1992)]. LUDI is available from Molecular Simulations Incorporated, San Diego, CA; now Accelrys, San Diego,CA.
  • LEGEND [Y. Nishibata et al, Tetrahedron, 47, p. 8985 (1991)]. LEGEND is available from Molecular Simulations Incorporated, San Diego, CA; now Acclerys, San Diego, CA.
  • NEWLEAD V. Tschinke and N.C. Cohen, "The NEWLEAD Program: A New Method for the Design of Candidate Structures from Pharmacophoric Hypotheses", J. Med. Chem., 36, 3863-3870 (1993)).
  • the efficiency with which that chemical entity may bind to a P. a. MurG binding pocket may be tested and optimized by computational evaluation.
  • an effective P. a. MurG binding pocket inhibitor must preferably demonstrate a relatively small difference in energy between its bound and free states (i.e., a small deformation energy of binding).
  • the most efficient P.a. MurG binding pocket inhibitors should preferably be designed with a deformation energy of binding of not greater than about 10 kcal/mole, more preferably, not greater than 7 kcal/mole.
  • P.a. MurG binding pocket inhibitors may interact with the binding pocket in more than one conformation that is similar in overall binding energy. In those cases, the deformation energy of binding is taken to be the difference between the energy of the free entity and the average energy of the conformations observed when the inhibitor binds to the protein.
  • An entity designed or selected as binding to a P.a. MurG binding pocket may be further computationally optimized so that in its bound state it would preferably lack repulsive electrostatic interaction with the target enzyme and with the surrounding water molecules.
  • Such non-complementary electrostatic interactions include repulsive charge-charge, dipole-dipole and charge-dipole interactions.
  • Graphics® workstation such as an INDIG02 with "IMPACTTM” graphics.
  • Other hardware systems and software packages will be known to those skilled in the art.
  • Another approach enabled by this invention is the computational screening of small molecule databases for chemical entities or compounds that can bind in whole, or in part, to a .a.MurG binding pocket. In this screening, the quality of fit of such entities to the binding pocket may be judged either by shape complementarity or by estimated interaction energy [E. C. Meng et al., J. Comp. Chem., 13, pp. 505-524 (1992)].
  • Iterative drug design is a method for optimizing associations between a protein and a compound by determining and evaluating the three- dimensional structures of successive sets of protein/compound complexes.
  • the invention provides compounds which associate with a P. a. MurG binding pocket produced or identified by the method set forth above.
  • Iterative drug design is a method for optimizing associations between a protein and a compound by determining and evaluating the three- dimensional structures of successive sets of protein/compound complexes.
  • this invention provides for a method of designing a compound which binds to a catalytic domain of a P. a. MurG protein comprising a P. a. MurG binding pocket, wherein said catalytic domain is characterized by:
  • said method comprising the steps of:
  • the invention provides for a method of identifying a potential inhibitor of a P. a. MurG protein comprising a P. a. MurG binding pocket, wherein said method comprising the steps of:
  • this invention provides for a method of identifying a candidate inhibitor of a molecule or molecular complex comprising a binding pocket or domain selected from the group consisting of:
  • this invention provides for a method for identifying a candidate inhibitor that interacts with a binding site of a P. a. MurG protein or a homologue thereof, comprising the steps of:
  • this invention provides the methods of identifying above, wherein the binding site of said P. a. MurG protein or said homologue thereof determined in step (d) comprises the structure coordinates of P. a. MurG protein according to FIG. 1 or FIG.2 of a set of amino acid residues that are identical to P.a. MurG amino acid residues G14, H15, F17, N124, R143, L186, S189, F242, 1243, M246, A262, L263, T264 and E267 wherein the root mean square deviation from the backbone atoms of said amino acids is not more than 1.0 A.
  • One embodiment of this invention provides for a method of identifying compounds that bind P. a. MurG protein comprising:
  • the 3-D molecular model of P. a. MurG protein is represented by the structure coordinates of P. a. MurG protein according to FIG. 1 or FIG. 2.
  • the 3-D molecular model of P. a. MurG protein comprises the structure coordinates of P. a. MurG protein according to FIG. l or FIG.2.
  • the 3-D molecular model comprises amino acid residues 1-357 of SEQ ID NO: l .
  • Another embodiment of this invention provides for a method of identifying compounds that bind a P. a. MurG protein comprising:
  • step (e) assaying for P. a. MurG protein activity in the presence of the binding candidate compounds identified in step (d);
  • the 3-D molecular model of P. a. MurG protein is represented by the structure coordinates of P. a. MurG protein according to FIG. 1 or FIG.2.
  • the 3-D molecular model of P.a. MurG protein comprises the structure coordinates of P.a. MurG protein according to FIG. l or FIG.2.
  • the 3-D molecular model comprises amino acid residues 1-357 of SEQ ID NO: l .
  • the structure coordinates of P.a. MurG protein according to FIG. 1 or FIG.2 can also be used to aid in obtaining structural information about another crystallized molecule or molecular complex. This may be achieved by any of a number of well-known techniques, including molecular replacement.
  • the machine -readable data storage medium comprises a data storage material encoded with a first set of machine readable data which comprises the Fourier transform of at least a portion of the structure coordinates of P.a. MurG protein according to FIG. 1 or FIG. 2, or homology model thereof, and which, when using a machine programmed with instructions for using said data, can be combined with a second set of machine readable data comprising the X-ray diffraction pattern of a molecule or molecular complex to determine at least a portion of the structure coordinates corresponding to the second set of machine readable data.
  • the invention provides a computer for determining at least a portion of the structure coordinates corresponding to X-ray diffraction data obtained from a molecule or molecular complex, wherein said computer comprises:
  • a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein said data comprises at least a portion of the structural coordinates of P.a. MurG according to FIG. 1 or FIG.2, or homology model thereof;
  • a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein said data comprises X-ray diffraction data obtained from said molecule or molecular complex; and c) instructions for performing a Fourier transform of the machine readable data of (a) and for processing said machine readable data of (b) into structure coordinates.
  • the Fourier transform of at least a portion of the structure coordinates of P.a. MurG protein according to FIG. 1 or FIG. 2, or homology model thereof may be used to determine at least a portion of the structure coordinates of P.a. MurG homologues.
  • this invention provides a method of utilizing molecular replacement to obtain structural information about a molecule or molecular complex whose structure is unknown comprising the steps of:
  • the method is performed using a computer.
  • the molecule is selected from the group consisting of P.a. MurG and P.a. MurG homologues.
  • the molecule is a P.a. MurG molecular complex or homologue thereof.
  • Molecular replacement provides an accurate estimation of the phases for an unknown structure. Phases are a factor in equations used to solve crystal structures that can not be determined directly. Obtaining accurate values for the phases, by methods other than molecular replacement, is a time-consuming process that involves iterative cycles of approximations and refinements and greatly hinders the solution of crystal structures. However, when the crystal structure of a protein containing at least a homologous portion has been solved, the phases from the known structure provide a satisfactory estimate of the phases for the unknown structure.
  • this method involves generating a preliminary model of a molecule or molecular complex whose structure coordinates are unknown, by orienting and positioning the relevant portion of the P.a. MurG protein according to FIG. 1 or FIG. 2, or homology model thereof within the unit cell of the crystal of the unknown molecule or molecular complex so as best to account for the observed X-ray diffraction pattern of the crystal of the molecule or molecular complex whose structure is unknown. Phases can then be calculated from this model and combined with the observed X-ray diffraction pattern amplitudes to generate an electron density map of the structure whose coordinates are unknown.
  • the method of molecular replacement is utilized to obtain structural information about a P.a. MurG homologue.
  • the structure coordinates of P.a. MurG as provided by this invention are particularly useful in solving the structure of P.a. MurG complexes that are bound by ligands, substrates and inhibitors.
  • P.a. MurG mutants are useful in solving the structure of MurG proteins that have amino acid substitutions, additions and/or deletions (referred to collectively as "P.a. MurG mutants", as compared to naturally occurring P.a. MurG).
  • P.a. MurG mutants may optionally be crystallized in co-complex with a chemical entity, such as UDP-N- acetylglucosamine.
  • the crystal structures of a series of such complexes may then be solved by molecular replacement and compared with that of wild-type P. a. MurG protein. Potential sites for modification within the various binding pockets of the enzyme may thus be identified. This information provides an additional tool for determining the most efficient binding interactions, for example, increased hydrophobic interactions, between P. a. MurG protein and a chemical entity.
  • the structure coordinates are also particularly useful in solving the structure of crystals of P. a. MurG protein or P. a. MurG homologues, which is co- complexed with a variety of chemical entities.
  • This approach enables the determination of the optimal sites for interaction between chemical entities, including candidate P. a. MurG inhibitors.
  • high resolution X-ray diffraction data collected from crystals exposed to different types of solvent allows the determination of where each type of solvent molecule resides. Small molecules that bind tightly to those sites can then be designed and synthesized and tested for their P. a. MurG inhibition activity.
  • Example 1 Expression and Purification of P. a.
  • MurG [ 00175 ] Pseudomonas Aeruginosa MurG (M 1 -G357) was cloned into the pET21b vector (Novagen), The c-terminal his-tagged protein was overexpressed using BL21(DE3)pLysS cells (Novagen) in E.coli 2xYT media supplemented with lOOug/ml carbenicillin and 0.34ug/mL chloramphenicol. When an OD 6 oo nm of 0.8 was reached, IPTG was added to a final concentration of ImM. The cell culture was then harvested 3.5 hours post induction and cell pellet stored at -80°C.
  • the MurG cell pellets were thawed at 4°C and resuspended in Buffer 1 : 50mM Tris, pH8.0, 500mM NaCl, lOmM Imidazole, 5mM B-Me, 3% Triton X-100, 10% Glycerol and Protease Inhibitor tablets. (Good protein solubilisation was achieved only in the presence of high salt and triton concentration).
  • Cells were initially disrupted by dounce homogenization and sonicated (3x 30seconds). Resuspended cells were further mechanically disrupted using a Micro fluidizer (Micro fluidics, Newton, MA). The cell debris was removed by centrifugation (24,000rpm, 15min at 4°C) and the supernatant decanted. Protein was purified by Ni/NTA agarose metal affinity chromatography (Novagen).
  • Protein supernatant was incubated for 2 hours at 4°C with pre- equilibrated Nickel-NTA metal affinity resin in Buffer 1.
  • the NiNTA resin was collected by centrifugation (4000g, 4 min) and the non-specifically bound protein was removed by extensively washing with Buffer 2: 50mM Tris, pH 8.0, 500mM NaCl, lOmM Imidazole, 5mM B-Me and 10% Glycerol.
  • MurG protein was eluted using a step imidazole gradient of 5%, 10%, 20% and 30% of 1M Imidazole . Clean MurG eluted at 20% and 30%.
  • MurG was further purified by size-exclusion on a Superdex 200(26/60) column
  • Purified MurG was buffer exchanged using PD10 columns (GE Healthcare) into 25mM Hepes (pH 7.0), 500 mM NaCl, 10% (v/v) glycerol, 2 mM DTT and 2.24mM decyl-b-d maltopyranoside (Anatrace) at 4°C and concentrated to 10 mg/ml for crystallisation.
  • PD10 columns GE Healthcare
  • 500 mM NaCl 10% (v/v) glycerol
  • 2 mM DTT 2.24mM decyl-b-d maltopyranoside
  • Lipid I solvent was evaporated using a stream of nitrogen gas. Lipid I was reconstituted in 50 mM HEPES (pH 7.9), 15 mM KCl, 5 mM MgCl 2 , 0.1% Triton X-100 and 10 % methanol by vortexing for 3 minutes, to give a 1.7 mg/mL (1 mM) solution.
  • Assay MurG activity was tested using a continuous enzyme-coupled UV absorbance assay as described previously (Biochem. (2002) 41 6829-6833; Chem. Bio. Chem. (2003) 4 603-609).
  • Table 1 Summary of data collection and structure refinement
  • the 2.2 A resolution X-ray structure of UDP-GlcNAc bound to P. a MurG has many features in common with the E.coli MurG:UDP-GlcNAc crystal structure previously published (Ha S, Walker D, Shi Y, Walker S (2000), Protein Sci. 9: 1045-52; PDB code: INLM).
  • the overall structure of MurG consists of and N-terminal and a C-terminal domain linked by a hinge region (residues 162-166 and 339-341) and separated by a deep cleft.
  • N- and C-domains have minimal sequence homology they share high structural homology, and both have an ⁇ - ⁇ Rossmann-like fold, which is characteristic of domains that bind nucleotides. Incorporated within these domains, consistent with binding negatively charged phosphates from co-factors, are three
  • Glycine-rich motifs which have the consensus sequence GXGXXG. These G- loops are located at a turn between the carboxyl end of one ⁇ -strand and the amino terminus of the adjacent a-helix.
  • the C-terminal domain of E.coli MurG swings away from the N-terminal domain as a rigid-body by an angle of approximately 15° about the inter-domain hinge. This means the cleft is significantly wider in E.coli and consequently the UDP-GlcNAc substrate, which remains in contact with the C-terminal domain, is displaced by approximately 6A.
  • G- loop2 and G-loop3 are highly mobile and also unstructured in the apo- crystal structure. This degree of disorder and flexibility was not observed for a similar pair of E.coli MurG crystal structures (Ha S, Walker D, Shi Y, Walker S (2000), Protein Sci. 9: 1045-52; PDB's INLM and IFOK). Upon binding of the donor sugar UDP-GlcNAc to P.a. MurG the three G-loops become visible in experimental electron density.
  • MurG associates with the cytoplasmic surface of bacterial membranes and it has been proposed that the membrane association site is comprised of a
  • Residues T 16, HI 9 and Y106 identified in the E.coli enzyme by enzymology as being important for binding Lipid I are present in our crystal structures (residues T12, HI 5 and Y102 in P.a. MurG) and point towards the proposed Lipid I binding site. These residues are invariant in MurG enzymes from all known bacteria. Positioning of residue HI 9 so close to the bound UDP-GlcNAc donor substrate and proposed Lipid I binding site suggests it has a key role in anchoring the lipid acceptor tail.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Peptides Or Proteins (AREA)

Abstract

Les cristaux MurG de P. a. comprennent un polypeptide constitué d'acides aminés selon SEQ ID No: 1. Les procédés de formation de ces cristaux MurG de P. a. font appel à des compositions cristallisables, lesdites compositions cristallisables comprenant un polypeptide selon l'invention, un tampon ayant une valeur pKa de 6,5 à 8,0, et un polyéthylène glycol à une concentration de 5 à 50 % en volume. Des supports de stockage de données renferment les coordonnées structurales des molécules et des complexes moléculaires qui constituent tout ou partie d'une poche de liaison de la protéine MurG de P. a. Les procédés de conception, d'évaluation et d'identification des composés, tels que des inhibiteurs potentiels de la protéine MurG de P. a. ou de ses homologues, qui se lient à la poche de liaison précitée, font appel à au moins une partie des coordonnées atomiques des cristaux MurG de P.a.
PCT/US2011/060705 2010-11-16 2011-11-15 Structure cristalline de l'enzyme murg de pseudomonas aeruginosa WO2012068053A1 (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US41416410P 2010-11-16 2010-11-16
US61/414,164 2010-11-16
US201161488935P 2011-05-23 2011-05-23
US61/488,935 2011-05-23

Publications (1)

Publication Number Publication Date
WO2012068053A1 true WO2012068053A1 (fr) 2012-05-24

Family

ID=45099187

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2011/060705 WO2012068053A1 (fr) 2010-11-16 2011-11-15 Structure cristalline de l'enzyme murg de pseudomonas aeruginosa

Country Status (1)

Country Link
WO (1) WO2012068053A1 (fr)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4886646A (en) 1988-03-23 1989-12-12 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration Hanging drop crystal growth apparatus and method
US5096676A (en) 1989-01-27 1992-03-17 Mcpherson Alexander Crystal growing apparatus
US5130105A (en) 1990-10-23 1992-07-14 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration Protein crystal growth tray assembly
US5221410A (en) 1991-10-09 1993-06-22 Schering Corporation Crystal forming device
US5400741A (en) 1993-05-21 1995-03-28 Medical Foundation Of Buffalo, Inc. Device for growing crystals
US5884230A (en) 1993-04-28 1999-03-16 Immunex Corporation Method and system for protein modeling
WO2001090301A2 (fr) * 2000-05-17 2001-11-29 Princeton University Procedes de preparation de modeles, procedes d'utilisation des modeles de la proteine murg, composes liant, inhibant ou stimulant des proteines murg et des compositions therapeutiques associees
WO2003097789A2 (fr) * 2002-05-21 2003-11-27 Affinium Pharmaceuticals, Inc. Nouveaux polypeptides purifies issus de pseudomonas aeruginosa

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4886646A (en) 1988-03-23 1989-12-12 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration Hanging drop crystal growth apparatus and method
US5096676A (en) 1989-01-27 1992-03-17 Mcpherson Alexander Crystal growing apparatus
US5130105A (en) 1990-10-23 1992-07-14 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration Protein crystal growth tray assembly
US5221410A (en) 1991-10-09 1993-06-22 Schering Corporation Crystal forming device
US5884230A (en) 1993-04-28 1999-03-16 Immunex Corporation Method and system for protein modeling
US5400741A (en) 1993-05-21 1995-03-28 Medical Foundation Of Buffalo, Inc. Device for growing crystals
WO2001090301A2 (fr) * 2000-05-17 2001-11-29 Princeton University Procedes de preparation de modeles, procedes d'utilisation des modeles de la proteine murg, composes liant, inhibant ou stimulant des proteines murg et des compositions therapeutiques associees
WO2003097789A2 (fr) * 2002-05-21 2003-11-27 Affinium Pharmaceuticals, Inc. Nouveaux polypeptides purifies issus de pseudomonas aeruginosa

Non-Patent Citations (56)

* Cited by examiner, † Cited by third party
Title
"Int. Sci. Rev. Ser.", 1972, GORDON & BREACH, article "The Molecular Replacement Method"
"Meth. Enzymol.", vol. 114, 115, 1985, ACADEMIC PRESS
A. C.R. MARTIN: "SciTech Software", UNIVERSITY COLLEGE LONDON
A. MIRANKER ET AL.: "Functionality Maps of Binding Sites: A Multiple Copy Simultaneous Search Method.", PROTEINS: STRUCTURE, FUNCTION AND GENETICS, vol. 11, 1991, pages 29 - 34, XP002920473, DOI: doi:10.1002/prot.340110104
BIOCHEM., vol. 41, 2002, pages 6829 - 6833
BLUNDELL ET AL., NATURE, vol. 326, 1987, pages 347 - 352
BRUNGER ET AL., ACTA CRYSTALLOGR. D. BIOL. CRYSTALLOGR., vol. 54, 1998, pages 905 - 921
CARSON, J. APPL. CRYSTALLOGR., vol. 24, 1991, pages 9589 - 961
CHAYEN, J. APPL. CRYST., vol. 30, 1997, pages 198 - 202
CHEM. BIO. CHEM., vol. 4, 2003, pages 603 - 609
D. S. GOODSELL ET AL.: "Automated Docking of Substrates to Proteins by Simulated Annealing", PROTEINS: STRUCTURE, FUNCTION, AND GENETICS, vol. 8, 1990, pages 195 - 202, XP008010751, DOI: doi:10.1002/prot.340080302
D'ARCY ET AL., J. CRYST. GROWTH, vol. 168, 1996, pages 175 - 180
DATABASE UniProt [online] 1 March 2001 (2001-03-01), "RecName: Full=UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase; EC=2.4.1.227; AltName: Full=Undecaprenyl-PP-MurNAc-pentapeptide-UDPGlcNAc GlcNAc transferase;", XP002671839, retrieved from EBI accession no. UNIPROT:Q9HW01 Database accession no. Q9HW01 *
DAYHOFF ET AL., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, vol. 5, 1978, pages 345 - 352
DELANO, W.L.: "The PyMOL Molecular Graphics System", 2002, DELANO SCIENTIFICTM
E. C. MENG ET AL., J. COMP. CHEM., vol. 13, 1992, pages 505 - 524
E. LATTMAN: "Use of the Rotation and Translation Functions", METH. ENZYMOL., vol. 115, 1985, pages 55 - 77, XP008145055, DOI: doi:10.1016/0076-6879(85)15007-X
EMSLEY, P.; COWTAN, K., ACTA CRYSTALLOGR., vol. D60, December 2004 (2004-12-01), pages 2126 - 2132
FETROW; BRYANT, BIO/TECHNOLOGY, vol. 11, 1993, pages 479 - 484
G. LAURI; P. A. BARTLETT: "CAVEAT: a Program to Facilitate the Design of Organic Molecules", J. COMPUT. AIDED MOL. DES., vol. 8, 1994, pages 51 - 66, XP009082997, DOI: doi:10.1007/BF00124349
GREER, METHODS IN ENZYMOLOGY, vol. 202, 1991, pages 239 - 252
GUEX ET AL., ELECTROPHORESIS, vol. 18, 1997, pages 2714 - 2723
H.-J. BOHM: "The Computer Program LUDI: A New Method for the De Novo Design of Enzyme Inhibitors", J. COMP. AID. MOLEC. DESIGN, vol. 6, 1992, pages 61 - 78, XP000560808, DOI: doi:10.1007/BF00124387
HA S; WALKER D; SHI Y; WALKER S, PROTEIN SCI., vol. 9, 2000, pages 1045 - 52
HANKS ET AL., SCIENCE, vol. 241, 1988, pages 42
HANKS; QUINN, METH. ENZYMOL., vol. 200, 1991, pages 38
HIGGINS D. G. ET AL., METHODS ENZYMOL, vol. 266, 1996, pages 383 - 402
I. D. KUNTZ ET AL.: "A Geometric Approach to Macromolecule-Ligand Interactions", J. MOL. BIOL., vol. 161, 1982, pages 269 - 288, XP024019141, DOI: doi:10.1016/0022-2836(82)90153-X
JOHNSON ET AL., CRIT. REV. BIOCHEM. MOL BIOL., vol. 29, 1994, pages 1 - 68
JONES ET AL., ACTA CRYST. A, vol. 47, 1991, pages 110 - 119
JONES ET AL., ACTA CRYST. SECT. A, vol. 47, 1991, pages 110 - 119
JONES ET AL., ACTA CRYSTALLOGR., vol. A47, 1991, pages 110 - 119
L. M. BALBES ET AL.: "Reviews in Computational Chemistry", vol. 5, 1994, VCH, article "A Perspective of Modem Methods in Computer-Aided Drug Design", pages: 337 - 380
M. A. NAVIA; M. A. MURCKO: "The Use of Structural Information in Drug Design", CURRENT OPINIONS IN STRUCTURAL BIOLOGY, vol. 2, 1992, pages 202 - 210
M. B. EISEN ET AL.: "HOOK: A Program for Finding Novel Molecular Architectures that Satisfy the Chemical and Steric Requirements of a Macromolecule Binding Site", PROTEINS: STRUCT., FUNCT., GENET., vol. 19, 1994, pages 199 - 221
M. CARSON, J. APPL. CRYST., vol. 24, 1991, pages 958 - 961
MIHA KOTNIK ET AL: "Development of novel inhibitors targeting intracellular steps of peptidoglycan biosynthesis.", CURRENT PHARMACEUTICAL DESIGN, vol. 13, no. 22, 1 January 2007 (2007-01-01), pages 2283 - 2309, XP055022212, ISSN: 1381-6128 *
MOHAMMADI T; KARCZMAREK A; CROUVOISIER M; BOUHSS A; MENGIN-LECREULX D; DEN BLAAUWEN T, MOL MICROBIOL., vol. 65, 2007, pages 1106 - 21
N. C. COHEN ET AL.: "Molecular Modeling Software and Methods for Medicinal Chemistry", J. MED. CHEM., vol. 33, 1990, pages 883 - 894, XP002950820, DOI: doi:10.1021/jm00165a001
P. A. BARTLETT ET AL.: "Molecular Recognition in Chemical and Biological Problems", vol. 78, 1989, SPECIAL PUB., ROYAL CHEM. SOC., article "CAVEAT: A Program to Facilitate the Structure-Derived Design of Biologically Active Molecules", pages: 182 - 196
P. J. GOODFORD: "A Computational Procedure for Determining Energetically Favorable Binding Sites on Biologically Important Macromolecules", J. MED. CHEM., vol. 28, 1985, pages 849 - 857, XP002083458, DOI: doi:10.1021/jm00145a002
PAV ET AL., PROTEINS: STRUCTURE, FUNCTION, AND GENETICS, vol. 20, 1994, pages 98 - 102
R. GIET; C. PRIGENT, J. CELL SCI., vol. 112, 1999, pages 3591 - 3601
SCHNARE ET AL., J. MOL. BIOL, vol. 256, 1996, pages 701 - 719
SHA HA ET AL: "The 1.9 Å crystal structure of Escherichia coli MurG, a membrane-associated glycosyltransferase involved in peptidoglycan biosynthesis", PROTEIN SCIENCE, vol. 9, no. 6, 1 June 2000 (2000-06-01), pages 1045 - 1052, XP055022206, ISSN: 0961-8368, DOI: 10.1110/ps.9.6.1045 *
SMITH; WATERMAN, ADV. APPL. MATH., vol. 2, 1981, pages 482
SMITH; WATERMAN, ADVANCES IN APPLIED MATHEMATICS, vol. 2, 1981, pages 482
SZKLARZ G.D., LIFE SCI., vol. 61, 1997, pages 2507 - 2520
THOMAS A. HALGREN; ROBERT B. MURPHY; RICHARD A. FRIESNER; HEGE S. BEARD; LEAH L. FRYE; W. THOMAS POLLARD; JAY L. BANKS: "Glide: A New Approach for Rapid, Accurate Docking and Scoring. 2. Enrichment Factors in Database Screening", J. MED. CHEM., vol. 47, no. 7, 2004, pages 1750 - 1759
V. GILLET ET AL.: "SPROUT: A Program for Structure Generation", J. COMPUT. AIDED MOL. DESIGN, vol. 7, 1993, pages 127 - 153, XP000618588, DOI: doi:10.1007/BF00126441
V. TSCHINKE; N.C. COHEN: "The NEWLEAD Program: A New Method for the Design of Candidate Structures from Pharmacophoric Hypotheses", J. MED. CHEM., vol. 36, 1993, pages 3863 - 3870
W. C. GUIDA: "Software For Structure-Based Drug Design", CURR. OPIN. STRUCT. BIOLOGY, vol. 4, 1994, pages 777 - 781, XP025675913, DOI: doi:10.1016/S0959-440X(94)90179-1
WISHART, D. S. ET AL., COMPUT. APPL. BIOSCI., vol. 10, 1994, pages 687 - 88
XIE ET AL., STRUCTURE, vol. 6, 1998, pages 983 - 991
Y. C. MARTIN: "3D Database Searching in Drug Design", J. MED. CHEM., vol. 35, 1992, pages 2145 - 2154
Y. NISHIBATA ET AL., TETRAHEDRON, vol. 47, 1991, pages 8985

Similar Documents

Publication Publication Date Title
US8712749B2 (en) Crystal structure of human JAK3 kinase domain complex and binding pockets thereof
US8124392B2 (en) Crystal structure of Aurora-2 protein and binding pockets thereof
US7666646B2 (en) Crystal structure of human Pim-1 kinase protein complexes and binding pockets thereof, and uses thereof in drug design
US20090062286A1 (en) Crystal Structure of SMYD3 Protein
US20080312298A1 (en) Methods for Identification of Modulators of Carm1 Methyl Transferase Activity
US20030096303A1 (en) Crystallized P38 complexes
US20070020253A1 (en) Spleen tyrosine kinase catalytic domain:crystal structure and binding pockets thereof
Harjes et al. The crystal structure of human PAPS synthetase 1 reveals asymmetry in substrate binding
US7655446B2 (en) Crystal structure of Rho-kinase I kinase domain complexes and binding pockets thereof
US8002891B2 (en) Crystallization of C-Jun N-Terminal Kinase 3 (JNK3)
US20110171714A1 (en) Crystal structure of tak1-tab1
US20060030016A1 (en) Crystal structure of interleukin-2 tyrosine kinase (ITK) and binding pockets thereof
US7700340B2 (en) Crystal structure of polo-like kinase 3 (PLK3) and binding pockets thereof
WO2012068053A1 (fr) Structure cristalline de l'enzyme murg de pseudomonas aeruginosa
US7494795B2 (en) Crystal structure of FMS-like tyrosine kinase
US7563610B1 (en) Crystalline composition of farsenyl pyrophosphate synthase (IspA)
US20050107298A1 (en) Crystals and structures of c-Abl tyrosine kinase domain
US7444273B1 (en) Crystallization of aurora/LPL1P-related kinase
WO2003060102A2 (fr) Structures cristallines de complexes d'inhibition de la jnk et poches de liaison de ceux-ci

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11791691

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11791691

Country of ref document: EP

Kind code of ref document: A1