WO1999017616A1 - Structure-based rational design of compounds to inhibit papillomavirus infection - Google Patents

Structure-based rational design of compounds to inhibit papillomavirus infection Download PDF

Info

Publication number
WO1999017616A1
WO1999017616A1 PCT/US1998/020991 US9820991W WO9917616A1 WO 1999017616 A1 WO1999017616 A1 WO 1999017616A1 US 9820991 W US9820991 W US 9820991W WO 9917616 A1 WO9917616 A1 WO 9917616A1
Authority
WO
WIPO (PCT)
Prior art keywords
e6bp
candidate compound
compound
hpv
dimensional
Prior art date
Application number
PCT/US1998/020991
Other languages
French (fr)
Inventor
Elliot J. Androphy
James D. Baleja
Jason J. Chen
Original Assignee
New England Medical Center Hospitals, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by New England Medical Center Hospitals, Inc. filed Critical New England Medical Center Hospitals, Inc.
Priority to EP98953264A priority Critical patent/EP0964617A1/en
Priority to JP52222399A priority patent/JP2001513823A/en
Priority to CA002273186A priority patent/CA2273186A1/en
Priority to AU10676/99A priority patent/AU1067699A/en
Publication of WO1999017616A1 publication Critical patent/WO1999017616A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/005Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/12Antivirals
    • A61P31/20Antivirals for DNA viruses
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/001Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof by chemical synthesis
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/569Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
    • G01N33/56983Viruses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2730/00Reverse transcribing DNA viruses
    • C12N2730/00011Details
    • C12N2730/10011Hepadnaviridae
    • C12N2730/10111Orthohepadnavirus, e.g. hepatitis B virus
    • C12N2730/10122New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes

Definitions

  • the invention relates to methods of structure-based rational design of compounds useful for inhibiting infection by papillomavirus.
  • Papillomaviruses have been linked to widespread, serious human diseases, especially carcinomas of the genital and oral mucosa. It is estimated that there are currently somewhere in the neighborhood of tens of millions of women who suffer from human papilloma virus (HPV) infection of the genital tract. Many of these women eventually develop cancer of the cervix. For example, it has been estimated that about twenty percent (20%) of all cancer related deaths in women worldwide are from cancers which are associated with HPV. It has also been estimated that 90% of all cervical cancer is linked to HPV.
  • HPV human papilloma virus
  • Papillomaviruses induce benign, dysplastic and malignant hyperproliferations of skin or mucosal epithelium (see, for example, Mansur and Androphy, (1993) Biochim Biophys Ada 1155:323-345; Pfister (1984) Rev. Physiol. Biochem. Pharmacol. 99:111- 181 ; and Broker et al, (1986) Cancer Cells 4:17-36, for reviews of the molecular, cellular, and clinical aspects of the papillomaviruses). Almost 70 human papillomavirus types have been identified, and different papillomavirus types are known to cause distinct diseases, Pfister, (1987) Adv.
  • HPVs Human papillomaviruses
  • HPV types 1 and 2 cause common warts
  • types 6 and 11 cause warts of the external genitalia, anus and cervix.
  • HPV, types 16, 18, 31 and 33 have been isolated from the majority of cervical cancers with HPV-16 present in about 50 percent of all cervical cancers. These HPVs are referred to as "high risk”. While HPV 6 and 11 are the most common isolates for cervical warts, these infections rarely progress to invasive cancer, and therefore these HPVs are referred to as "low risk”.
  • E6 and E7 have been shown to encode transforming and immortalizing activities.
  • E6-induced tumorigenesis is thought to occur via multiple pathways (see Turek L., (1994) Adv. Virus Res. 44:305; Tommasino M. and Crawford L. (1995) BioEssays 17:509; Lee J.M. and Bernstein A. (1995) Cancer Metas. Rev. 14:149; Scheffner M. et al., (1993) Cell 75:495; and Huibregste J. et al., (1993) Mol. Cell Biol. 13:775).
  • E6AP In the p53-dependent pathway, the E6 protein associates with a human cellular factor, E6AP and in the p53- independent pathway, the E6 protein associates with a human cellular factor, ERC55.
  • the invention features, a method for evaluating a candidate compound for the ability to interact with, e.g., bind, an HPV E6 transforming protein.
  • the method includes: supplying a three-dimensional structure for the E6 binding peptide (E6bp); supplying a three-dimensional structure for the candidate compound; and, optionally, comparing the three-dimensional structure of the candidate compound to the three- dimensional structure of the E6bp, thereby evaluating the candidate compound for the ability to interact with, e.g., bind the HPV E6 transforming protein.
  • similarity in the structure of the candidate compound to the structure of the E6bp is indicative of the ability of the candidate compound to interact with the HPV E6 transforming protein.
  • the invention features, a method of providing or identifying a compound, preferably a compound which has the ability to interact with, e.g., bind, an HPV E6 transforming protein.
  • the method includes: supplying a three-dimensional structure for E6bp; supplying a three-dimensional structure for a candidate compound; optionally comparing the three-dimensional structure of the candidate compound to the three-dimensional structure of the E6bp; and optionally altering the structure or altering the spatial position of the structure of the candidate compound, thereby providing or identifying a compound, which preferably has the ability to interact with the HPV E6 transforming protein.
  • the altered structure of the candidate compound more closely resembles the three-dimensional structure of E6bp, than does the original structure of the candidate compound.
  • the method includes comparing the altered structure of the candidate compound or the identified compound to the three-dimensional structure of the E6bp.
  • the comparison can be performed by defining an atom equivalency in the candidate compound or the identified compound and the E6bp three- dimensional structures and comparing these atom equivalencies.
  • a second or further subsequent alteration is made in the structure or the spatial position of the structure of the candidate compound.
  • the method includes defining an atom equivalency in the candidate compound and the E6bp three-dimensional structures; and performing a fitting operation between the candidate compound and the E6bp three-dimensional structures.
  • the method includes defining an atom equivalency in the candidate compound and the E6bp three-dimensional structures; performing a fitting operation between the candidate compound and the E6bp three-dimensional structures; and analyzing the results of the fitting operation to compare the level of similarity between the candidate compound and the E6bp three-dimensional structures.
  • the atom equivalencies can correspond to protein backbone atoms, e.g., N, C ⁇ , C and O atoms.
  • the fitting operation can be a rigid fitting operation, e.g., the E6bp three-dimensional structure can be kept rigid and the three- dimensional structure of the candidate compound can be translated and rotated to obtain an optimum fit with the rigid target E6bp structure.
  • the comparison between the candidate compound and the E6bp three-dimensional structures can be performed computationally, e.g., by calculating the root mean square deviation of a set of structural coordinates in the candidate compound from a set of structural coordinates in the E6bp, or visually, e.g., by visual inspection of the candidate compound and the E6bp three-dimensional structures, displayed in a graphical format.
  • the candidate compound can have an -helical structure and the alteration can result in a change in the class of the ⁇ -helix comprising the structure of the candidate compound.
  • the ⁇ -helix comprising the structure of the candidate compound can be selected from the group consisting of A, G, and Y ⁇ -helices.
  • the method includes creating a record of one or more of the three-dimensional structures of the candidate compound, the altered candidate compound, the identified compound, and E6bp.
  • the record can be encoded in the form of a machine-readable data storage medium.
  • the three-dimensional structures can be displayed on a machine capable of displaying a graphical three-dimensional representation.
  • the method includes providing the identified compound, e.g., chemically synthesizing the identified compound based on the structure identified using the methods described herein.
  • the method includes assessing the biological activity of the identified compound.
  • the biological activity of the identified compound can be assessed in vitro, e.g., in a GST-E6 binding assay or a two-hybrid assay, or in vivo, e.g., by applying the compound to a cell line (Hela, Caski, Siha) which expresses HPVE6 and examining the growth characteristics of the cells; or by its tumor suppression ability in an animal model for HPV infection.
  • the identified compound can be combined with a carrier suitable for introduction into an animal model, e.g., naturally derived or synthetic polymers, solvents, dispersion media, coatings, antibacterial and antifungal agents and the like.
  • the candidate compound can be altered so as to have a three-dimensional structure that is substantially similar to the three-dimensional structure of E6bp provided in figure 1, such that the candidate compound can bind to the HPV E6 transforming protein or portion thereof.
  • the candidate compound can be a peptide, a peptidomimetic, e.g., an isostere, an "inverso” or a "retro-inverso” peptide and the like, or a non peptide organic or inorganic compound.
  • the identified compound associates with the HPV E6 transforming protein or a portion thereof, such that the ERC55 protein is inhibited from binding to E6. The association may be non-covalent or it may be covalent.
  • the association can be energetically favored by hydrogen bonding or van der Waals or electrostatic interactions.
  • the three-dimensional structures can be supplied as a set of coordinates, defining the three-dimensional structures of the E6bp molecule, the candidate compound, the altered candidate compound and the identified compound or as a graphical three-dimensional representation of the E6bp molecule, the candidate compound, the altered candidate compound and the identified compound.
  • the invention features, a machine-readable data storage medium, including a data storage material encoded with a set of NMR derived coordinates which define the three-dimensional structure of the E6bp molecule.
  • the storage medium can be used in methods of the invention.
  • the invention features, a machine-readable data storage medium, including a data storage material encoded with machine readable data which, when used with a machine programmed with instructions for using the data, is capable of displaying a graphical three-dimensional representation of the E6bp molecule.
  • the storage medium can be used in methods of the invention.
  • the invention features, a method of treating a subject at risk for infection by a HPV.
  • a subject at risk for an HPV induced cancer e.g., cervical cancer
  • the method includes: administering to a subject a therapeutically effective amount of a compound other than an E6bp fragment, wherein the compound has a structure sufficiently duplicative of that of Figure 1, so as to bind to the HPV E6 transforming protein and prevent its interaction with the ERC55 protein, thereby treating a subject at risk for infection by a HPV.
  • HPV E6 protein present in the cell, can be bound to the compound and can, therefore, be unable to bind to ERC55 and induce cellular transformation.
  • the invention features, a compound, other than an E6bp or ERC55 fragment, having a three-dimensional structure substantially similar to the three- dimensional structure of E6bp provided in Figure 1, such that the candidate compound can bind to the HPV E6 transforming protein.
  • the candidate compound has a structure sufficiently duplicative of the three-dimensional structure E6bp provided in Figure 1, such that the candidate compound can bind to the HPV E6 transforming protein, with an affinity which is at least half that of E6bp.
  • the dissociation constant (Kd) for the E6-compound complex is less than 100, 50 or 10 times the Kd of the E6-E6bp complex, and more preferably less than the Kd of the E6-E6bp complex.
  • the candidate compound is more stable (e.g., more resistant to proteolytic degradation) than E6bp or ERC55.
  • the invention features, a composition comprising a compound, other than an E6bp or ERC55 fragment, having a three-dimensional structure sufficiently duplicative of the three-dimensional structure of E6bp provided in Figure 1, such that the compound can bind to the HPV E6 transforming protein, and a carrier macromolecule suitable for the administration of the composition to a subject.
  • the invention features, a method of providing or identifying a compound, preferably a compound which can interact with, e.g., bind E6bp.
  • the method includes: supplying a three-dimensional structure for the E6bp molecule; supplying a three-dimensional structure for the candidate compound; optionally comparing the three-dimensional structure of the candidate compound to the three- dimensional structure of the E6bp; optionally altering the structure or altering the spatial position of the structure of the candidate compound; optionally comparing the altered structure of the candidate compound to the structure of the E6bp, thereby providing or identifying a compound, preferably a compound which can interact with, e.g., bind E6bp.
  • comparing includes performing a fitting operation.
  • the invention features, a method of evaluating the ability of a candidate compound to interact with, e.g., bind an E6bp molecule.
  • the method includes: supplying a three-dimensional structure for the E6bp molecule; supplying a three-dimensional structure for the candidate compound; and performing a fitting operation between the three-dimensional structures of the candidate compound and the E6bp molecule, thereby evaluating the ability of the candidate compound to interact with the E6bp molecule.
  • the method includes analyzing the results of the fitting operation to quantify the association between the candidate compound and the E6bp molecule.
  • the method includes defining an atom equivalency in the candidate compound and the E6bp molecule three-dimensional structures.
  • the atom equivalencies can correspond to protein backbone atoms, e.g., N, C ⁇ , C and O atoms.
  • the fitting operation can be a rigid fitting operation, e.g., the E6bp three-dimensional structure can be kept rigid and the three- dimensional structure of the candidate compound can be translated and rotated to obtain an optimum fit with the rigid target E6bp structure.
  • the fitting operation can be performed computationally, e.g., by calculating the root mean square deviation of a set of structural coordinates in the candidate compound from a set of structural coordinates in the E6bp, or visually, e.g., by visual inspection of the candidate compound and the E6bp three- dimensional structures, displayed in a graphical format.
  • the method includes altering the structure or altering the spatial position of the structure of the candidate compound.
  • the method includes creating a record of one or more of the three-dimensional structures of the candidate compound, the altered candidate compound, the identified compound and the E6bp molecule.
  • the record can be encoded in the form of a machine-readable data storage medium.
  • the three-dimensional structures can be displayed on a machine capable of displaying a graphical three- dimensional representation.
  • the method includes providing the identified compound, e.g., chemically synthesizing the identified compound based on the structure identified using the methods described herein.
  • the method includes assessing the biological activity of the identified compound.
  • the biological activity of the identified compound can be assessed in vitro, e.g., in a GST-E6bp or a GST-ERC55 binding assay or a two-hybrid assay, or in vivo, e.g., by its tumor suppression ability in an animal model for HPV infection.
  • the identified compound can be combined with a carrier suitable for introduction into an animal model, e.g., naturally derived or synthetic polymers, solvents, dispersion media, coatings, antibacterial and antifungal agents and the like.
  • the candidate compound can have an ⁇ -helical structure and the alteration can result in a change in the class of the ⁇ -helix comprising the structure of the candidate compound.
  • the ⁇ -helix comprising the structure of the candidate compound can be selected from the group consisting of A, G, and Y ⁇ -helices.
  • the altered compound can associate with the E6bp molecule with a higher affinity.
  • evaluating includes determining the ability of a compound to interact with, e.g., bind the E6bp molecule. Evaluation can be performed computationally, e.g., by calculating the root mean square deviation of a set of structural coordinates in the candidate compound from a set of structural coordinates in the E6bp, or visually, e.g., by visual inspection of the candidate compound and the E6bp three- dimensional structures, displayed in a graphical format.
  • the candidate compound includes a compound which can be altered so as to have a three-dimensional structure that is suitable for associating with E6bp and, therefore, with ERC55.
  • the candidate compound can be a peptide, a peptidomimetic, e.g., an isostere, an "inverso” or a “retro-inverso” peptide and the like, or a non peptide organic compound.
  • the candidate compound associates with the E6bp, such that the ERC55 protein is inhibited from binding to E6.
  • the association may be non-covalent or it may be covalent.
  • the association can be energetically favored by hydrogen bonding or van der Waals or electrostatic interactions.
  • the three-dimensional structures can be supplied as a set of coordinates, defining the three-dimensional structures of the E6bp molecule, the candidate compound, the altered candidate compound and the identified compound or as a graphical three-dimensional representation of the E6bp molecule, the candidate compound, the altered candidate compound and the identified compound.
  • the invention features, a method of treating a subject at risk for infection by a HPV.
  • a subject at risk for an HPV induced cancer e.g., cervical cancer
  • the method includes: administering to a subject a therapeutically effective amount of a compound, wherein the compound associates with E6bp with an affinity so as to prevent the interaction between the ERC55 and the HPV E6 protein, thereby treating a subject at risk for infection by a HPV.
  • Preferred compounds are provided by the methods described herein.
  • the invention features, a method of modeling the region of the HPV E6 protein which binds ERC55.
  • the method includes: supplying a three- dimensional structure for an E6bp molecule and supplying a structure, which is complementary to the structure of the E6bp molecule, thereby modeling the region of the HPV E6 protein which binds ERC55.
  • the molecular modeling techniques, described herein can be used to construct a structure, which is complementary to the E6bp three-dimensional structure.
  • complementary is meant a structure, which is complementary to one or more of: (a) the shape, (b) the electrostatic properties or (c) the hydrophobicity of the E6bp three- dimensional structure.
  • E6bp may be complementary in shape to a critical portion of E6.
  • something complementary to E6bp mimics the structure of E6.
  • the complementary structure need not be translated into a real molecule, but can be used in the computational or computer based methods described herein, to identify a compound which has the ability to interact with the HPV E6 transforming protein.
  • the method includes creating a record of the three- dimensional structure of the E6bp molecule and its complementary structure.
  • the record of the three-dimensional structure of the E6bp molecule and its complementary structure can be encoded in the form of a machine-readable data storage medium.
  • the three-dimensional structures can be displayed on a machine capable of displaying a graphical three-dimensional representation of a structure.
  • the three-dimensional structure can be supplied as a set of coordinates, defining the three-dimensional structure of the E6bp molecule and its complementary structure or as a graphical three-dimensional representation of the E6bp molecule and its complementary structure.
  • comparing refers to examining a quality, e.g., three- dimensional structure, hydrophobicity, steric bulk, electrostatic properties, bond angles, size or molecular composition of a compound, in order to identify resemblances or differences between two structures.
  • altering the structure refers to altering the intrinsic properties, e.g., three-dimensional structure, hydrophobicity, steric bulk, electrostatic properties, bond angles, size or molecular composition of a compound.
  • the alteration can include an amino acid substitution or the introduction of a non-peptide molecule or bond in the structure of the candidate compound.
  • the non- peptide molecule or bond can comprise a peptidomimetic entity, e.g., a peptidomimetic molecule or bond.
  • altering the spatial position refers to changing the orientation of or translating the structure of the candidate compound, relative to a pre- defined reference, e.g., relative to the structure of the E6bp molecule.
  • the structure of the candidate compound can be rotated, e.g., 30, 60, 90, 120 or 180° relative to the structure of the E6bp molecule.
  • atom equivalencies refers to a set of conserved residues between two structures, defined such that they allow direct comparison of the structures being compared.
  • the atom equivalencies can correspond to protein backbone atoms, e.g., N, C ⁇ , C and O atoms.
  • the term "fitting operation” refers to the process by which, a working structure (i.e. a compound) is translated and rotated to obtain an optimum fit with the target E6bp structure.
  • the fitting operation can use a least squares fitting algorithm that computes the optimum translation and rotation to be applied to the moving compound structure, such that the root mean square difference of the fit over the specified pairs of equivalent atoms is an absolute minimum. This number, given in angstroms, can be reported by a computer software.
  • root mean square deviation refers to the square root of the arithmetic mean of the squares of the deviations from the mean. It is a way to express the deviation or variation from a trend or object.
  • the "root mean square deviation” defines the variation in a set of atom equivalencies of a compound from a set of atom equivalencies of the E6bp molecule, as defined by the structure coordinates of the E6bp molecule described herein.
  • the term “least squares” refers to a method based on the principle that the best estimate of a value is that in which the sum of the squares of the deviations of observed values is a minimum.
  • Figure 1 is a depiction of the three-dimensional structure of E6bp.
  • the protein backbone is shown as a blue ribbon.
  • Hydrophobic amino acid side chains are shown in yellow, and polar side-chains in blue.
  • Calcium ions are shown as spheres.
  • Figure 2 is an amide-amide region of the 2D NOESY illustrating the structure of the E6bp molecule. Intense amide to amide proton contacts, indicative of ⁇ helices, are indicated by residue number. Residues 13 to 16, shown above the diagonal, form an N- terminal helix, whereas residues 22 to 27, shown below the diagonal, for a C-terminal helix.
  • Figure 3 is an illustration of ligand design.
  • the candidate compound phenylarginine was built onto two exposed amino acids residues (glutamate 16 and leucine 19) on the C-terminal ⁇ helix of the E6bp protein.
  • the hydrocarbons of the ligand are indicated by G, whereas those of the protein are indicated by Y.
  • Oxygen atoms are indicated by R, nitrogens by B, and polar hydrogens by W.
  • the compound was designated with the LUDI feature feature of the molecular modeling program INSIGHTII, and the fit to the protein was optimized using the DOCK module.
  • This figure illustrates the method for design of novel inhibitors to papillomavirus, described herein.
  • FIG. 4 is a depiction of various sequences illustrating that the E6 binding domains is a short ⁇ -helical peptide and that the E6 binding region of E6BP is found in other E6-binding proteins. Further illustrated in this Figure are the results from an analysis of the structure of E6bp, based on site-directed mutagenesis.
  • E6-induced tumorigenesis occurs via two pathways.
  • the E6 protein associates with a human cellular factor, E6AP.
  • the E6-E6AP complex directs p53 for rapid degradation via the ubiqui tin-mediated proteolytic pathway (Lee J.M. and Bernstein A. (1995) Cancer Metas. Rev. 14: 149; Scheff er M. et al., (1993) Cell 75:495; and Huibregste J. et al., (1993) Mol. Cell Biol. 13:775).
  • Loss of p53 protein correlates with the loss in its tumor suppressor functions.
  • An 18 amino acid residue peptide fragment, E6ap is the minimal region of E6AP that binds E6 (Huibregste J. et al., (1993) Mol. Cell Biol. 13:4918).
  • E6-induced tumorigenesis is also p53-independent (Storey A. et al., (1995) Oncogene 11:653), and a different target protein, ERC55, has been demonstrated to bind E6 (Chen J.J. et al, (1995) Science 269:529). ERC55 and p53 compete for binding to E6, consistent with alternate roles in tumorigenesis. The mechanism of ERC55 function appears to involve alteration of keratinocyte differentiation (Sherman L. and Schlegel R. (1996) J. Virol. 70:3269; and Reiss M. et al, (1989) Cancer Commun. 1 :75; Chen J.J.
  • E6bp A 25 amino acid segment of ERC55, called E6bp, has been found to be necessary and sufficient for binding to E6.
  • the sequence of E6ap is homologous to E6bp, as shown in the following Table.
  • the inventors have solved the three-dimensional structure of an E6bp molecule, using one-and two-dimensional NMR Spectroscopy. Importantly, this has provided, for the first time, information about the three-dimensional structure of the E6bp molecule. This information is of significant utility in fields such as drug discovery.
  • An understanding of the structure of the E6bp molecule, a 25 amino acid region in the ERC55 protein sufficient for binding to the HPV E6 transforming protein allows the design of drugs which interact with the HPV E6 transforming protein. As a result, this information is useful for designing inhibitors of the E6-ERC55 interaction and therefore, drugs for fighting papillomavirus infection.
  • Candidate compounds can be agents which can be altered so as to have a three- dimensional structure that is substantially similar to the three-dimensional structure of E6bp, provided in Figure 1, such that the agent can bind to the HPV E6 transforming protein or portion thereof.
  • the altered candidate compound can bind to E6 with an affinity which is at least 10, 50, 100, 150, 200 or 500% as strong as the affinity with which E6bp binds to E6.
  • Candidate compounds can also be agents which can be altered so as to have a three-dimensional structure that is suitable for associating with the E6bp molecule.
  • the candidate compound can be a peptide or a peptidomimetic.
  • peptidomimetics include peptidic compounds in which the peptide backbone is substituted with one or more benzodiazepine molecules (see e.g., James, G.L. et al, 1993, Science 260:1937-1942, the contents of which are incorporated herein by reference), peptides in which all L-amino acids are substituted with the corresponding D-amino acids, and "retro-inverso" peptides (see U.S. Patent No. 4,522,752 by Sisto, the contents of which are incorporated herein by reference).
  • mimetic and in particular, peptidomimetic, includes isosteres.
  • isostere includes a chemical structure that can be substituted for a second chemical structure because the steric conformation of the first structure fits a binding site specific for the second structure.
  • the term specifically includes peptide back-bone modifications ⁇ e.g., amide bond mimetics). Such modifications include modifications of the amide nitrogen, the ⁇ -carbon, amide carbonyl, complete replacement of the amide bond, extensions, deletions or backbone cross links.
  • ⁇ [CH 2 S], ⁇ [CH 2 NH], ⁇ [CSNH 2 ], ⁇ [NHCO], ⁇ [COCH 2 ], and ⁇ [(E) or (Z) CH CH].
  • indicates the absence of an amide bond.
  • the structure that replaces the amide group is specified within the brackets.
  • a retro-inverso peptide has a reversed backbone while retaining substantially the original spatial conformation of the side chains, resulting in a retro-inverso isomer with a topology that closely resembles the parent peptide.
  • the candidate compound can also be a non peptide organic compound prepared as described in WO 9504277, the contents of which are incorporated herein by reference, as well as a steroid, a carbohydrate, a lipid and the like.
  • the candidate compound can be selected from a database of three-dimensional structures of known compounds.
  • the three-dimensional structures in the database can be either experimentally determined, e.g., crystal structures from the Cambridge structural database (see Allen et al., J. Chem. Inf. Comput. Sci. 31 : 187-204, 1991, the contents of which are incorporated herein by reference) or computationally generated, e.g., using rule-based programs such as CONCORD (see Pearlman, R. S., Chem. Des. Auto. News, 2:1-7, 1987, the contents of which are incorporated herein by reference).
  • CONCORD see Pearlman, R. S., Chem. Des. Auto. News, 2:1-7, 1987, the contents of which are incorporated herein by reference.
  • the candidate compounds can also be designed de novo; e.g., by piecing together or assembling molecular fragments to create compounds which: (a) have a three- dimensional structure that is substantially similar to the three-dimensional structure of E6bp provided in figure 1 , such that the created compound can bind to the HPV E6 transforming protein or portion thereof or (b) have a three-dimensional structure that is suitable for associating with the E6bp molecule.
  • the GROW algorithm Moon, J.B., et al., Proteins: Struct. Fund. Genet 11 :314-328,1991, the contents of which are incorporated herein by reference
  • the LUDI program B ⁇ hm, H.-J.
  • Machine Readable Storage Medium In order to use the NMR derived structure coordinates for the E6bp peptide, it is preferable to convert them into a three-dimensional representation. This can be achieved through the use of commercially available software which is capable of generating three- dimensional graphical representations of molecules or portions thereof from as set of structure coordinates.
  • the invention allows the use of molecular design techniques to design and evaluate candidate compounds, including inhibitory compounds, e.g., candidate compounds having a three-dimensional structure that is: (a) substantially similar to the three-dimensional structure of E6bp provided in figure 1, such that the candidate compounds can bind to the HPV E6 transforming protein or portion thereof or (b) suitable for associating with the E6bp molecule.
  • inhibitory compounds e.g., candidate compounds having a three-dimensional structure that is: (a) substantially similar to the three-dimensional structure of E6bp provided in figure 1, such that the candidate compounds can bind to the HPV E6 transforming protein or portion thereof or (b) suitable for associating with the E6bp molecule.
  • a potential compound which can bind to the HPV E6 transforming protein or portion thereof can be evaluated by means of a series of steps in which compounds are screened and selected for their similarity to the three-dimensional structure of the E6bp molecule.
  • One skilled in the art can use one of several methods to screen compounds for their similarity to the three-dimensional structure of the E6bp molecule. This process may begin by visual inspection of, for example, the three-dimensional structure of the candidate compound in comparison to the three-dimensional structure of the E6bp molecule on a computer screen, wherein the three-dimensional structure of the E6bp molecule is generated from the machine-readable storage medium.
  • the Molecular Similarity application permits comparisons between different structures, different conformations of the same structure, and different parts of the same structure.
  • the procedure used in Molecular Similarity to compare structure is divided into four steps: 1) loading the structures to be compared; 2) defining the atom equivalences in these structures; 3) performing a fitting operation; and 4) analyzing the results.
  • the E6bp structure can be identified as the target (i.e., the fixed structure); the candidate compound structures can be working structures (i.e., moving structures). Since atom equivalency within QUANTA is defined by user input, equivalent atoms can be defined as protein backbone atoms (N, C ⁇ , C and O) for all conserved residues between the two structures being compared. The process can be aided by color-coding the different parts of the molecules. Rigid fitting operations can be used. When a rigid fitting method is used, the working compound structure is translated and rotated to obtain an optimum fit with the target E6bp structure.
  • the fitting operation uses a least squares fitting algorithm that computes the optimum translation and rotation to be applied to the moving compound structure, such that the root mean square difference of the fit over the specified pairs of equivalent atom is an absolute minimum. This number, given in angstroms, can be reported by QUANTA.
  • Preferred candidate structures are those having a set of structure coordinates with a root mean square deviation of conserved residue backbone atoms (e.g., N, C ⁇ , C, O) of less than 1.5 A when superimposed, using backbone atoms, on the relevant structure coordinates listed in Figure 2 are considered identical. More preferably, the root mean square deviation is less than 1.0 A, and even more preferably, the root mean square deviation is less than 0.5 A.
  • Candidate compounds can also be evaluated for their ability to associate with the E6bp molecule.
  • One skilled in the art may use one of several methods to screen compounds for their ability to associate with the E6bp molecule. This process may begin by visual inspection of, for example, the E6bp molecule on a computer screen based on the NMR derived data shown in Figure 2, and generated from the machine- readable storage medium. Selected compounds may then be positioned in a variety of orientations, or docked, within the E6bp molecule. Docking may be accomplished using software such as Quanta and Sybyl, followed by energy minimization and molecular dynamics with standard molecular mechanics force fields, such as CHARMM and AMBER. Specialized computer programs may also assist in the process of selecting fragments or chemical entities.
  • DOCK is available from University of California, San Francisco, CA.
  • DOCK (I.D. Kuntz et al., J. Mol. Biol. 161, pp. 269-288 (1982), the contents of which are incorporated herein by reference).
  • DOCK is available from University of California, San Francisco, CA. Synthesis of Identified Compounds
  • peptide compounds can be prepared according to standard recombinant DNA techniques using a nucleic acid molecule encoding the peptide.
  • a nucleotide sequence encoding the peptide can be determined using the genetic code and an oligonucleotide molecule having this nucleotide sequence can be synthesized by standard DNA synthesis methods (e.g., using an automated DNA synthesizer).
  • a DNA molecule encoding a peptide compound can be derived from the corresponding natural gene or cDNA (e.g., using the polymerase chain reaction and/or restriction enzyme digestion) according to standard molecular biology techniques.
  • the isolated nucleic acid encoding the peptide is incorporated into a recombinant expression vector.
  • vector refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
  • plasmid refers to a circular double stranded DNA loop into which additional DNA segments may be ligated.
  • viral vector Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome.
  • vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non- episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as "recombinant expression vectors" or simply "expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.
  • plasmid and “vector” may be used interchangeably as the plasmid is the most commonly used form of vector.
  • vectors such other forms of expression vectors, such as viral vectors, which serve equivalent functions may also be used to express a peptide compound.
  • the nucleotide sequence encoding the peptide compound can be operatively linked to one or more regulatory sequences, selected on the basis of the host cells to be used for expression.
  • the term "operably linked” is intended to mean that the sequences encoding the peptide compound are linked to the regulatory sequence(s) in a manner that allows for expression of the peptide compound.
  • the term "regulatory sequence” is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185. Academic Press, San Diego, CA (1990), the content of which are incorporated herein by reference.
  • Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cells, those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences) and those that direct expression in a regulatable manner (e.g., only in the presence of an inducing agent). It will be appreciated by those skilled in the art that the design of the expression vector may depend on such factors as the choice of the host cell to be transformed, the level of expression of peptide compound desired, and the like.
  • the peptide compound expression vectors can be introduced into host cells to thereby produce peptide compounds encoded by nucleic acids.
  • the recombinant expression vectors can be designed for expression of peptide compounds in prokaryotic or eukaryotic cells.
  • peptide compounds can be expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185. Academic Press, San Diego, CA (1990).
  • the recombinant expression vector may be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. Examples of vectors for expression in yeast S.
  • Baculovirus vectors available for expression of peptide compounds in cultured insect cells include the pAc series (Smith et al, (1983) Mol. Cell. Biol.
  • mammalian expression vectors include pCDM8 (Seed, B., (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195).
  • the expression vector's control functions are often provided by viral regulatory elements.
  • commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40.
  • the recombinant expression vector may contain additional nucleotide sequences.
  • the recombinant expression vector may encode a selectable marker gene to identify host cells that have incorporated the vector.
  • selectable marker genes are well known in the art.
  • the recombinant expression vector preferably encodes a signal sequence operatively linked to sequences encoding the amino-terminus of the peptide compound such that upon expression, the peptide compound is synthesized with the signal sequence fused to its amino terminus.
  • This signal sequence directs the peptide compound into the secretory pathway of the cell and is then cleaved, allowing for release of the mature peptide compound (i.e., the peptide compound without the signal sequence) from the host cell.
  • a signal sequence to facilitate secretion of proteins or peptides from mammalian host cells is well known in the art.
  • Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques.
  • transformation and “transfection” are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, electroporation, microinjection and viral-mediated transfection.
  • Suitable methods for transforming or transfecting host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory press (1989)), and other laboratory manuals.
  • a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene encoding the peptide compound.
  • selectable markers include those that confer resistance to drugs, such as G418, hygromycin and methotrexate.
  • Nucleic acid encoding a selectable marker may be introduced into a host cell on the same vector as that encoding the peptide compound or may be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die). Assessment of the Biological Activity of Identified Compounds
  • an assay can be used to assess the ability of a compound to inhibit binding between ERC55 (i.e., E6bp) and the HPV E6 transforming protein.
  • ERC55 i.e., E6bp
  • HPV E6 transforming protein a variety of assays are available and readily apparent to the skilled artisan.
  • the identified compound can be contacted with an isolated and purified ERC55 protein. The mixture of the candidate compound and ERC55 can then be added to a composition containing the E6 protein but which does not contain ERC55.
  • Detection and quantification of labelled E6/ERC55 complexes provides a means for determining the candidate compound's efficacy at inhibiting complex formation between the papillomavirus E6 protein and the ERC55 protein.
  • a control assay can also be performed to provide a baseline for comparison.
  • isolated and purified ERC55 is added to a composition containing the E6 protein, and the formation of an E6/ERC55 complex is quantitated in the absence of the test compound.
  • GST/E6 glutathione-S-transferase/E6
  • fusion proteins can be adsorbed onto glutathione sepharose beads which can then be combined with an 35 S-labeled ERC55 protein and incubated under conditions conducive to complex formation, [e.g., at 4°C in a buffer of 25 mM Tris-HCl (pH 7.2), 50 mM NaCl and 0.2%) NP-40].
  • the beads can be washed to remove any unbound ERC55, and the sepharose bead-bound radiolabel can be determined directly (e.g.
  • the supernatant containing the complexes can be separated by SDS-PAGE gel before detection.
  • ERC55 or E6bp can be used to generate a two-hybrid assay, as described in U.S. Patent No: 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920- 924; and Iwabuchi et al. (1993) Oncogene 8:1693-1696), for subsequently assessing the ability of compounds to disrupt binding of ERC55 or E6bp to E6.
  • the interaction trap assay relies on reconstituting in vivo a fuctional transcriptional activator protein from two separate fusion proteins, one of which comprises the DNA-binding domain of a transcriptional activator fused to an E6 protein.
  • the second fusion protein comprises a transcriptional activation domain (e.g. able to initiate RNA polymerase transcription) fused to ERC55 or E6bp.
  • a transcriptional activation domain e.g. able to initiate RNA polymerase transcription
  • Saccharomyces cerevisiae YPB2 cells can be transformed simultaneously with a plasmid encoding a GAL4db-E6 fusion and with a plasmid encoding the GAL4ad domain fused to ERC55 or E6bp.
  • the strain is transformed such that the GAL4-responsive promoter drives expression of a phenotypic marker.
  • the ability to grow in the absence of histidine depends on the expression of the HIS3 gene.
  • the HIS3 gene is placed under the control of a GAL4-responsive promoter, relief of this auxotrophic phenotype indicates that a functional GAL4 activator has been reconstituted through the interaction of E6 and ERC55 or E6bp.
  • the phenotypic marker (e.g. instead of the HIS3 gene) can be one which provides a negative selection when expressed such that compounds which disrupt E6/ERC55 or E6/E6bp interactions confer positive growth selection to the cells.
  • the E6bp peptide was synthesized using standard protein synthesis methods and purified by HPLC.
  • the recombinant E6 protein is expressed in E. coli, solubilized from inclusion bodies, and refolded using a rapid dilution method.
  • the E6 protein has been expressed and solubilized from HP V- 16 to a concentration of 4mg/mL (0.2 mM).
  • (protonated) peptide is added to 0.2 mM deuterated (2H)E6 following the procedure of Shibata et al., (1995) Arch. Biochem. Biophys. 319:204, the contents of which are inco ⁇ orated herein by reference. This allows observation of only the H NMR resonances of the E6-bound peptides.
  • Example 2 NMR Spectroscopy
  • Peptide samples were prepared at about 2 mM concentration.
  • One- and two- dimensional (ID & 2D) NMR data were collected on a 500 MHz Bruker AMX500 spectrometer.
  • E6bp is titrated with calcium and NMR spectra recorded to determine the effect on the peptide conformation.
  • Low concentration peptide samples (0.2 mM) are titrated with E6 to determine the residues that are affected most by E6. Samples are prepared in both H 2 O and D 2 O solution.
  • TRNOE transferred NOE
  • the spectra exhibit mostly the lineshape and intensity of the unbound peptide, but the NOESY cross peaks represent the bound conformation of the peptide.
  • Spectra are optimized in terms of salt concentration (up to 0.2 M) and temperatures (5°C to 35°C) to identify the conditions that yield superior quality NMR data with narrow line shapes, following the methods described in Baleja J.D., (1996) Techniques in Protein Chemistry VI 131, the contents of which are inco ⁇ orated herein by reference.
  • TOCSY Total correlation spectroscopy
  • NOESY nuclear Overhauser effect spectroscopy
  • NMR resonance strategy is performed as described in Wuthrich et al.
  • TOCSY spectra are analyzed to provide information regarding spin-spin coupled NMR-active nuclei.
  • Different types of amino acids side chains produce distinct TOCSY cross peak patterns.
  • the cross-peak patterns for each amino acid are linked together using the NOESY experiment since NOE sequential connectivities can be identified between residues neighboring in sequence.
  • NOESY spectra show cross peaks between protons that are within 5 A of each other and therefore also show contacts between residues that are sequentially distant but spatially close. Inte ⁇ roton distances are determined from NOESY cross peak intensities.
  • ⁇ and ⁇ -1 torsion angles are obtained by measuring the coupling constants from ID slices of resolution-enhanced 2D data, as described in Wuthrich K., (1986) NMR of Proteins and Nucleic Acids, Wiley, New York; and Szyperski T. et al., (1992) J. Magn. Reson. 99:552, the contents of which are inco ⁇ orated herein by reference.
  • ⁇ NMR resonance assignments were made (see Table II) and a low resolution structure of the calcium-bound E6bp was calculated using approximately 200 inte ⁇ roton distances and 15 ⁇ torsion angles derived from NOESY data and distance geometry and simulated annealing protocols of the INSIGHTII molecular modeling program (see Figure 1).
  • the E6bp peptide dimerizes. From the dispersion present in the NMR spectra, it is estimated that 400 additional inte ⁇ roton distances are obtainable. Structure calculation is repeated with the more extensive data set to determine the high resolution structure of E6bp. The high resolution structure of E6bp both free and bound to E6 protein is also solved. Knowing the conformation for this E6bp in the absence of E6 protein is important for understanding the conformational changes brought about by binding to E6 and the binding surface of the peptide involved in the interaction. TABLE II
  • the inte ⁇ roton distances and torsion angle information are introduced into the distance geometry program, DGII of Insight (Biosym, Inc.).
  • the initial structure calculated using distance geometry and simulated annealing protocols is refined by determining the presence or lack of NOE cross peaks for spatially proximal protons in the calculated structure. Structures are compared to one another to highlight the conformational changes.
  • the E6bp peptide is expected to have major changes in surface properties on binding calcium, such as the formation of a deep hydrophobic cavity essential for target protein recognition (see Ikura M., (1996) Trends Biochem. Sci. 21:14, the contents of which are inco ⁇ orated herein by reference).
  • E6ap and E6bp are compared to one another to test the hypothesis that they are similar.
  • the residues involved in interaction with E6 are identified by chemical shift changes that occur upon titration, as described in Baleja J.D. et al., (1992) Nature 356:450; and Baleja J.D. et al., (1992) Biochemistry 33:3071, the contents of which are inco ⁇ orated herein by reference. Furthermore, differences in conformation observed on binding E6 are determined.
  • E6-interacting protein ERC55
  • EF hands Six EF hands are predicted to occur in its C-terminal domain. Only one of these EF hands, a 25 amino acid segment, E6bp, binds E6 selectively (Chen J.J. et al., (1995) Science 269:529), and with about the same affinity as the full-length ERC55 protein. Structural determination of other EF-hand domains using 'H NMR spectroscopy (Ikura M. (1996) Trends Biochem. Sci. 21:14) suggests the feasibility for NMR study of this peptide.
  • E6bp peptide A small amount of E6bp peptide was synthesized, purified and a 0.7 mM sample was prepared. As predicted, the peptide bound calcium and showed excellent NMR spectral dispersion as shown in Figure 2. The calcium free form of E6bp was studied. Structure Analysis of E6AP
  • E6ap 18 amino acid residue peptide fragment
  • the sequence of E6ap is homologous to E6bp (see Table I), and their solution properties are likely to be similar. Sequence prediction (Wishart et al., (1994) Comp. Biol. Sci. 10:121) has indicated ⁇ helical structure for the homologous region of E6ap-the same region which has been determined to be ⁇ helical in the primary solution structure of E6bp ( Figure 1). E6ap does not have the calcium ligands required for an EF hand, and therefore is unlikely to bind calcium.
  • Example 4 Analysis of The E6bp Structure By Site-Directed Mutagenesis To further map the domain that is important for E6 binding, additional deletion mutants from E6bp were constructed as GST fusions and tested for binding with E6. Deletion of the first alpha helix (20mer) did not affect binding, nor did deletion of two additional amino acid residues from the C-terminal end of E6bp to the distal region of the second alpha helix affect binding.
  • a 13 amino acid peptide containing the second alpha helix as a GST fusion retained the ability to bind to E6, although at reduced efficiency in comparison to intact E6bp. Notably, the major portion of the loop region from the EP-hand motif is deleted in the 13 amino acid peptide.
  • the ability of the second alpha helix to bind E6 demonstrates that the interaction of E6BP with E6 is independent of calcium binding, as the first alpha helix and loop region from the EP- hand motif are both required for calcium binding.
  • Alanine replacement mutations in E6bp were also constructed and used to define the amino acid(s) important for E6 interaction in the alpha helix. Some mutations were also made in the surrounding regions. As expected, mutants VI 9 A, S20A, E22A, R29A, and W30A, which have mutations in the area beyond the alpha helix, bound E6 at wild- type level (see Figure 4). Mutant F18A showed some reduced binding. Notably, phenyalanine is a hydrophobic residue, which may enhance the interaction between E6 and E6bp. This may explain why a leucine to alanine change at amino acid 21 also reduced binding.
  • mutants at the boundary of alpha helix showed modest reduction (approximately 60% of wild-type binding) in their E6 binding ability
  • all other mutants made up of the alpha helical structure showed substantial decrease in E6 binding.
  • a change of leucine to alanine at amino acid 25 totally abolished binding.
  • the ability of mutants F18A and E23A to bind E6 confirmed the notion that the interaction of E6BP with E6 is independent of calcium binding, as both mutations abolished calcium binding.
  • a leucine to proline change at amino acid 25 of E6bp was created and used in a binding experiment. This change (L25P) which is expected to disrupt the alpha helical structure, totally abolished E6bp-binding with E6, indicating that the alpha helix from the conserved motif is indeed important for E6 binding.

Abstract

The invention provides methods of evaluating a compound for the ability to interact with the HPV E6 transforming protein. The invention also provides a machine-readable data storage medium, comprising a data storage material encoded with a set of NMR derived coordinates which define the three-dimensional structure of the E6bp molecule as well as a machine-readable data storage medium, comprising a data storage material encoded with machine readable data which, when used with a machine programmed with instructions for using the data, is capable of displaying a graphical three-dimensional representation of the E6bp molecule. The invention further provides a method of treating a subject at risk for infection by a HPV. A compound having a three-dimensional structure substantially similar to the three-dimensional structure of E6bp and a composition comprising the candidate compound and a carrier macromolecule are also provided. Finally a method of evaluating the ability of a compound to associate with an E6bp molecule is provided.

Description

STRUCTURE-BASED RATIONAL DESIGN OF COMPOUNDS TO INHIBIT
PAPILLOMAVIRUS INFECTION
FIELD OF THE INVENTION
The invention relates to methods of structure-based rational design of compounds useful for inhibiting infection by papillomavirus.
BACKGROUND OF THE INVENTION Papillomaviruses (PV) have been linked to widespread, serious human diseases, especially carcinomas of the genital and oral mucosa. It is estimated that there are currently somewhere in the neighborhood of tens of millions of women who suffer from human papilloma virus (HPV) infection of the genital tract. Many of these women eventually develop cancer of the cervix. For example, it has been estimated that about twenty percent (20%) of all cancer related deaths in women worldwide are from cancers which are associated with HPV. It has also been estimated that 90% of all cervical cancer is linked to HPV.
Papillomaviruses induce benign, dysplastic and malignant hyperproliferations of skin or mucosal epithelium (see, for example, Mansur and Androphy, (1993) Biochim Biophys Ada 1155:323-345; Pfister (1984) Rev. Physiol. Biochem. Pharmacol. 99:111- 181 ; and Broker et al, (1986) Cancer Cells 4:17-36, for reviews of the molecular, cellular, and clinical aspects of the papillomaviruses). Almost 70 human papillomavirus types have been identified, and different papillomavirus types are known to cause distinct diseases, Pfister, (1987) Adv. Cancer Res., 48:113-147, Syrjanen, (1984) Obstet. Gynecol. Survey 39:252-265. Human papillomaviruses (HPVs) are a heterogeneous group of DNA tumor viruses associated with hyperplastic (warts, condylomata), pre- malignant and malignant lesions (carcinomas) of squamous epithelium. For example, HPV types 1 and 2 cause common warts, and types 6 and 11 cause warts of the external genitalia, anus and cervix. HPV, types 16, 18, 31 and 33 have been isolated from the majority of cervical cancers with HPV-16 present in about 50 percent of all cervical cancers. These HPVs are referred to as "high risk". While HPV 6 and 11 are the most common isolates for cervical warts, these infections rarely progress to invasive cancer, and therefore these HPVs are referred to as "low risk".
Studies of viral gene expression in carcinomas suggest the importance of two HPV encoded proteins, E6 and E7, in malignant development and these proteins have been shown to encode transforming and immortalizing activities. E6-induced tumorigenesis is thought to occur via multiple pathways (see Turek L., (1994) Adv. Virus Res. 44:305; Tommasino M. and Crawford L. (1995) BioEssays 17:509; Lee J.M. and Bernstein A. (1995) Cancer Metas. Rev. 14:149; Scheffner M. et al., (1993) Cell 75:495; and Huibregste J. et al., (1993) Mol. Cell Biol. 13:775). In the p53-dependent pathway, the E6 protein associates with a human cellular factor, E6AP and in the p53- independent pathway, the E6 protein associates with a human cellular factor, ERC55.
SUMMARY OF THE INVENTION In general, the invention features, a method for evaluating a candidate compound for the ability to interact with, e.g., bind, an HPV E6 transforming protein. The method includes: supplying a three-dimensional structure for the E6 binding peptide (E6bp); supplying a three-dimensional structure for the candidate compound; and, optionally, comparing the three-dimensional structure of the candidate compound to the three- dimensional structure of the E6bp, thereby evaluating the candidate compound for the ability to interact with, e.g., bind the HPV E6 transforming protein.
In preferred embodiments, similarity in the structure of the candidate compound to the structure of the E6bp is indicative of the ability of the candidate compound to interact with the HPV E6 transforming protein.
In another aspect, the invention features, a method of providing or identifying a compound, preferably a compound which has the ability to interact with, e.g., bind, an HPV E6 transforming protein. The method includes: supplying a three-dimensional structure for E6bp; supplying a three-dimensional structure for a candidate compound; optionally comparing the three-dimensional structure of the candidate compound to the three-dimensional structure of the E6bp; and optionally altering the structure or altering the spatial position of the structure of the candidate compound, thereby providing or identifying a compound, which preferably has the ability to interact with the HPV E6 transforming protein.
In preferred embodiments, the altered structure of the candidate compound more closely resembles the three-dimensional structure of E6bp, than does the original structure of the candidate compound.
In preferred embodiments, the method includes comparing the altered structure of the candidate compound or the identified compound to the three-dimensional structure of the E6bp. Preferably, the comparison can be performed by defining an atom equivalency in the candidate compound or the identified compound and the E6bp three- dimensional structures and comparing these atom equivalencies. In preferred embodiments, a second or further subsequent alteration is made in the structure or the spatial position of the structure of the candidate compound.
In preferred embodiments, the method includes defining an atom equivalency in the candidate compound and the E6bp three-dimensional structures; and performing a fitting operation between the candidate compound and the E6bp three-dimensional structures.
In preferred embodiments, the method includes defining an atom equivalency in the candidate compound and the E6bp three-dimensional structures; performing a fitting operation between the candidate compound and the E6bp three-dimensional structures; and analyzing the results of the fitting operation to compare the level of similarity between the candidate compound and the E6bp three-dimensional structures. For example, the atom equivalencies can correspond to protein backbone atoms, e.g., N, Cα, C and O atoms. In preferred embodiments, the fitting operation can be a rigid fitting operation, e.g., the E6bp three-dimensional structure can be kept rigid and the three- dimensional structure of the candidate compound can be translated and rotated to obtain an optimum fit with the rigid target E6bp structure.
In preferred embodiments, the comparison between the candidate compound and the E6bp three-dimensional structures can be performed computationally, e.g., by calculating the root mean square deviation of a set of structural coordinates in the candidate compound from a set of structural coordinates in the E6bp, or visually, e.g., by visual inspection of the candidate compound and the E6bp three-dimensional structures, displayed in a graphical format.
In preferred embodiments, the candidate compound can have an -helical structure and the alteration can result in a change in the class of the α-helix comprising the structure of the candidate compound. For example, the α-helix comprising the structure of the candidate compound can be selected from the group consisting of A, G, and Y α-helices.
In preferred embodiments, the method includes creating a record of one or more of the three-dimensional structures of the candidate compound, the altered candidate compound, the identified compound, and E6bp. The record can be encoded in the form of a machine-readable data storage medium. The three-dimensional structures can be displayed on a machine capable of displaying a graphical three-dimensional representation.
In preferred embodiments, the method includes providing the identified compound, e.g., chemically synthesizing the identified compound based on the structure identified using the methods described herein. In preferred embodiments, the method includes assessing the biological activity of the identified compound. The biological activity of the identified compound can be assessed in vitro, e.g., in a GST-E6 binding assay or a two-hybrid assay, or in vivo, e.g., by applying the compound to a cell line (Hela, Caski, Siha) which expresses HPVE6 and examining the growth characteristics of the cells; or by its tumor suppression ability in an animal model for HPV infection. In preferred embodiments, the identified compound can be combined with a carrier suitable for introduction into an animal model, e.g., naturally derived or synthetic polymers, solvents, dispersion media, coatings, antibacterial and antifungal agents and the like.
In preferred embodiments, the candidate compound can be altered so as to have a three-dimensional structure that is substantially similar to the three-dimensional structure of E6bp provided in figure 1, such that the candidate compound can bind to the HPV E6 transforming protein or portion thereof. For example, the candidate compound can be a peptide, a peptidomimetic, e.g., an isostere, an "inverso" or a "retro-inverso" peptide and the like, or a non peptide organic or inorganic compound. In preferred embodiments, the identified compound associates with the HPV E6 transforming protein or a portion thereof, such that the ERC55 protein is inhibited from binding to E6. The association may be non-covalent or it may be covalent. The association can be energetically favored by hydrogen bonding or van der Waals or electrostatic interactions. In preferred embodiments, the three-dimensional structures can be supplied as a set of coordinates, defining the three-dimensional structures of the E6bp molecule, the candidate compound, the altered candidate compound and the identified compound or as a graphical three-dimensional representation of the E6bp molecule, the candidate compound, the altered candidate compound and the identified compound. In another aspect, the invention features, a machine-readable data storage medium, including a data storage material encoded with a set of NMR derived coordinates which define the three-dimensional structure of the E6bp molecule. The storage medium can be used in methods of the invention.
In yet another aspect, the invention features, a machine-readable data storage medium, including a data storage material encoded with machine readable data which, when used with a machine programmed with instructions for using the data, is capable of displaying a graphical three-dimensional representation of the E6bp molecule. The storage medium can be used in methods of the invention.
In another aspect, the invention features, a method of treating a subject at risk for infection by a HPV. For example, a subject at risk for an HPV induced cancer, e.g., cervical cancer, can be treated. The method includes: administering to a subject a therapeutically effective amount of a compound other than an E6bp fragment, wherein the compound has a structure sufficiently duplicative of that of Figure 1, so as to bind to the HPV E6 transforming protein and prevent its interaction with the ERC55 protein, thereby treating a subject at risk for infection by a HPV. In preferred embodiments, 50, 60, 70 and more preferably 80, 90 or 100% of the
HPV E6 protein, present in the cell, can be bound to the compound and can, therefore, be unable to bind to ERC55 and induce cellular transformation.
In another aspect, the invention features, a compound, other than an E6bp or ERC55 fragment, having a three-dimensional structure substantially similar to the three- dimensional structure of E6bp provided in Figure 1, such that the candidate compound can bind to the HPV E6 transforming protein.
In preferred embodiments, the candidate compound has a structure sufficiently duplicative of the three-dimensional structure E6bp provided in Figure 1, such that the candidate compound can bind to the HPV E6 transforming protein, with an affinity which is at least half that of E6bp. The dissociation constant (Kd) for the E6-compound complex, is less than 100, 50 or 10 times the Kd of the E6-E6bp complex, and more preferably less than the Kd of the E6-E6bp complex.
In preferred embodiments, the candidate compound is more stable (e.g., more resistant to proteolytic degradation) than E6bp or ERC55. In another aspect, the invention features, a composition comprising a compound, other than an E6bp or ERC55 fragment, having a three-dimensional structure sufficiently duplicative of the three-dimensional structure of E6bp provided in Figure 1, such that the compound can bind to the HPV E6 transforming protein, and a carrier macromolecule suitable for the administration of the composition to a subject. In another aspect, the invention features, a method of providing or identifying a compound, preferably a compound which can interact with, e.g., bind E6bp. The method includes: supplying a three-dimensional structure for the E6bp molecule; supplying a three-dimensional structure for the candidate compound; optionally comparing the three-dimensional structure of the candidate compound to the three- dimensional structure of the E6bp; optionally altering the structure or altering the spatial position of the structure of the candidate compound; optionally comparing the altered structure of the candidate compound to the structure of the E6bp, thereby providing or identifying a compound, preferably a compound which can interact with, e.g., bind E6bp. In preferred embodiments, comparing includes performing a fitting operation. In another aspect, the invention features, a method of evaluating the ability of a candidate compound to interact with, e.g., bind an E6bp molecule. The method includes: supplying a three-dimensional structure for the E6bp molecule; supplying a three-dimensional structure for the candidate compound; and performing a fitting operation between the three-dimensional structures of the candidate compound and the E6bp molecule, thereby evaluating the ability of the candidate compound to interact with the E6bp molecule.
In preferred embodiments, the method includes analyzing the results of the fitting operation to quantify the association between the candidate compound and the E6bp molecule.
In preferred embodiments, the method includes defining an atom equivalency in the candidate compound and the E6bp molecule three-dimensional structures. For example, the atom equivalencies can correspond to protein backbone atoms, e.g., N, Cα, C and O atoms. In preferred embodiments, the fitting operation can be a rigid fitting operation, e.g., the E6bp three-dimensional structure can be kept rigid and the three- dimensional structure of the candidate compound can be translated and rotated to obtain an optimum fit with the rigid target E6bp structure.
In preferred embodiments, the fitting operation can be performed computationally, e.g., by calculating the root mean square deviation of a set of structural coordinates in the candidate compound from a set of structural coordinates in the E6bp, or visually, e.g., by visual inspection of the candidate compound and the E6bp three- dimensional structures, displayed in a graphical format.
In preferred embodiments, the method includes altering the structure or altering the spatial position of the structure of the candidate compound. In preferred embodiments, the method includes creating a record of one or more of the three-dimensional structures of the candidate compound, the altered candidate compound, the identified compound and the E6bp molecule. The record can be encoded in the form of a machine-readable data storage medium. The three-dimensional structures can be displayed on a machine capable of displaying a graphical three- dimensional representation.
In preferred embodiments, the method includes providing the identified compound, e.g., chemically synthesizing the identified compound based on the structure identified using the methods described herein. In preferred embodiments, the method includes assessing the biological activity of the identified compound. The biological activity of the identified compound can be assessed in vitro, e.g., in a GST-E6bp or a GST-ERC55 binding assay or a two-hybrid assay, or in vivo, e.g., by its tumor suppression ability in an animal model for HPV infection. In preferred embodiments, the identified compound can be combined with a carrier suitable for introduction into an animal model, e.g., naturally derived or synthetic polymers, solvents, dispersion media, coatings, antibacterial and antifungal agents and the like. In preferred embodiments, the candidate compound can have an α-helical structure and the alteration can result in a change in the class of the α-helix comprising the structure of the candidate compound. For example, the α-helix comprising the structure of the candidate compound can be selected from the group consisting of A, G, and Y α-helices. In preferred embodiments, the altered compound can associate with the E6bp molecule with a higher affinity.
In preferred embodiments, evaluating includes determining the ability of a compound to interact with, e.g., bind the E6bp molecule. Evaluation can be performed computationally, e.g., by calculating the root mean square deviation of a set of structural coordinates in the candidate compound from a set of structural coordinates in the E6bp, or visually, e.g., by visual inspection of the candidate compound and the E6bp three- dimensional structures, displayed in a graphical format.
In preferred embodiments, the candidate compound includes a compound which can be altered so as to have a three-dimensional structure that is suitable for associating with E6bp and, therefore, with ERC55. For example, the candidate compound can be a peptide, a peptidomimetic, e.g., an isostere, an "inverso" or a "retro-inverso" peptide and the like, or a non peptide organic compound.
In preferred embodiments, the candidate compound associates with the E6bp, such that the ERC55 protein is inhibited from binding to E6. The association may be non-covalent or it may be covalent. The association can be energetically favored by hydrogen bonding or van der Waals or electrostatic interactions.
In preferred embodiments, the three-dimensional structures can be supplied as a set of coordinates, defining the three-dimensional structures of the E6bp molecule, the candidate compound, the altered candidate compound and the identified compound or as a graphical three-dimensional representation of the E6bp molecule, the candidate compound, the altered candidate compound and the identified compound.
In another aspect, the invention features, a method of treating a subject at risk for infection by a HPV. For example, a subject at risk for an HPV induced cancer, e.g., cervical cancer, can be treated. The method includes: administering to a subject a therapeutically effective amount of a compound, wherein the compound associates with E6bp with an affinity so as to prevent the interaction between the ERC55 and the HPV E6 protein, thereby treating a subject at risk for infection by a HPV. Preferred compounds are provided by the methods described herein.
In another aspect, the invention features, a method of modeling the region of the HPV E6 protein which binds ERC55. The method includes: supplying a three- dimensional structure for an E6bp molecule and supplying a structure, which is complementary to the structure of the E6bp molecule, thereby modeling the region of the HPV E6 protein which binds ERC55.
The molecular modeling techniques, described herein can be used to construct a structure, which is complementary to the E6bp three-dimensional structure. By "complementary", is meant a structure, which is complementary to one or more of: (a) the shape, (b) the electrostatic properties or (c) the hydrophobicity of the E6bp three- dimensional structure. While not wishing to be bound by theory, E6bp may be complementary in shape to a critical portion of E6. Thus, something complementary to E6bp mimics the structure of E6. The complementary structure need not be translated into a real molecule, but can be used in the computational or computer based methods described herein, to identify a compound which has the ability to interact with the HPV E6 transforming protein.
In preferred embodiments, the method includes creating a record of the three- dimensional structure of the E6bp molecule and its complementary structure. The record of the three-dimensional structure of the E6bp molecule and its complementary structure can be encoded in the form of a machine-readable data storage medium. The three-dimensional structures can be displayed on a machine capable of displaying a graphical three-dimensional representation of a structure.
In preferred embodiments, the three-dimensional structure can be supplied as a set of coordinates, defining the three-dimensional structure of the E6bp molecule and its complementary structure or as a graphical three-dimensional representation of the E6bp molecule and its complementary structure.
As used herein, the term "comparing" refers to examining a quality, e.g., three- dimensional structure, hydrophobicity, steric bulk, electrostatic properties, bond angles, size or molecular composition of a compound, in order to identify resemblances or differences between two structures.
As used herein, the term "altering the structure" refers to altering the intrinsic properties, e.g., three-dimensional structure, hydrophobicity, steric bulk, electrostatic properties, bond angles, size or molecular composition of a compound. In a peptide molecule, the alteration can include an amino acid substitution or the introduction of a non-peptide molecule or bond in the structure of the candidate compound. The non- peptide molecule or bond can comprise a peptidomimetic entity, e.g., a peptidomimetic molecule or bond.
As used herein, the term "altering the spatial position" refers to changing the orientation of or translating the structure of the candidate compound, relative to a pre- defined reference, e.g., relative to the structure of the E6bp molecule. For example, the structure of the candidate compound can be rotated, e.g., 30, 60, 90, 120 or 180° relative to the structure of the E6bp molecule.
As used herein, the term "atom equivalencies" refers to a set of conserved residues between two structures, defined such that they allow direct comparison of the structures being compared. For example, the atom equivalencies can correspond to protein backbone atoms, e.g., N, Cα, C and O atoms.
As used herein, the term "fitting operation" refers to the process by which, a working structure (i.e. a compound) is translated and rotated to obtain an optimum fit with the target E6bp structure. The fitting operation can use a least squares fitting algorithm that computes the optimum translation and rotation to be applied to the moving compound structure, such that the root mean square difference of the fit over the specified pairs of equivalent atoms is an absolute minimum. This number, given in angstroms, can be reported by a computer software.
As used herein, the term "root mean square deviation" refers to the square root of the arithmetic mean of the squares of the deviations from the mean. It is a way to express the deviation or variation from a trend or object. For purposes of this invention, the "root mean square deviation" defines the variation in a set of atom equivalencies of a compound from a set of atom equivalencies of the E6bp molecule, as defined by the structure coordinates of the E6bp molecule described herein. As used herein, the term "least squares" refers to a method based on the principle that the best estimate of a value is that in which the sum of the squares of the deviations of observed values is a minimum.
The methods of the invention allow rapid and efficient design and evaluation of compounds useful for inhibiting infection by papillomavirus. Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.
DETAILED DESCRIPTION BRIEF DESCRIPTION OF THE DRAWINGS The drawings are first briefly described. Figure 1 is a depiction of the three-dimensional structure of E6bp. The protein backbone is shown as a blue ribbon. Hydrophobic amino acid side chains are shown in yellow, and polar side-chains in blue. Calcium ions are shown as spheres.
Figure 2 is an amide-amide region of the 2D NOESY illustrating the structure of the E6bp molecule. Intense amide to amide proton contacts, indicative of α helices, are indicated by residue number. Residues 13 to 16, shown above the diagonal, form an N- terminal helix, whereas residues 22 to 27, shown below the diagonal, for a C-terminal helix.
Figure 3 is an illustration of ligand design. In this example, the candidate compound phenylarginine was built onto two exposed amino acids residues (glutamate 16 and leucine 19) on the C-terminal α helix of the E6bp protein. The hydrocarbons of the ligand are indicated by G, whereas those of the protein are indicated by Y. Oxygen atoms are indicated by R, nitrogens by B, and polar hydrogens by W. The compound was designated with the LUDI feature feature of the molecular modeling program INSIGHTII, and the fit to the protein was optimized using the DOCK module. This figure illustrates the method for design of novel inhibitors to papillomavirus, described herein.
Figure 4 is a depiction of various sequences illustrating that the E6 binding domains is a short α-helical peptide and that the E6 binding region of E6BP is found in other E6-binding proteins. Further illustrated in this Figure are the results from an analysis of the structure of E6bp, based on site-directed mutagenesis.
E6 Signal Transduction Pathway
E6-induced tumorigenesis occurs via two pathways. In the p53-dependent pathway, the E6 protein associates with a human cellular factor, E6AP. The E6-E6AP complex directs p53 for rapid degradation via the ubiqui tin-mediated proteolytic pathway (Lee J.M. and Bernstein A. (1995) Cancer Metas. Rev. 14: 149; Scheff er M. et al., (1993) Cell 75:495; and Huibregste J. et al., (1993) Mol. Cell Biol. 13:775). Loss of p53 protein correlates with the loss in its tumor suppressor functions. An 18 amino acid residue peptide fragment, E6ap, is the minimal region of E6AP that binds E6 (Huibregste J. et al., (1993) Mol. Cell Biol. 13:4918).
E6-induced tumorigenesis is also p53-independent (Storey A. et al., (1995) Oncogene 11:653), and a different target protein, ERC55, has been demonstrated to bind E6 (Chen J.J. et al, (1995) Science 269:529). ERC55 and p53 compete for binding to E6, consistent with alternate roles in tumorigenesis. The mechanism of ERC55 function appears to involve alteration of keratinocyte differentiation (Sherman L. and Schlegel R. (1996) J. Virol. 70:3269; and Reiss M. et al, (1989) Cancer Commun. 1 :75; Chen J.J. et al, (1995) Science 269:529; and Howley P.M. (1996) Field Virol. Chapter 65, 2nd edition). A 25 amino acid segment of ERC55, called E6bp, has been found to be necessary and sufficient for binding to E6. The sequence of E6ap is homologous to E6bp, as shown in the following Table.
TABLE I
E6ap IPESSELTLQELLGEERR (SEQ I.D. NO: 1)
** I * I * I j * 1
E6bp ALEEHDKNGDGFVSLEEFLGDYRWD (SEQ I.D. NO:2)
The * symbol indicates homologous amino acids, whereas the | symbol denotes identical amino acids. Structural Analysis of E6bp
The inventors have solved the three-dimensional structure of an E6bp molecule, using one-and two-dimensional NMR Spectroscopy. Importantly, this has provided, for the first time, information about the three-dimensional structure of the E6bp molecule. This information is of significant utility in fields such as drug discovery. An understanding of the structure of the E6bp molecule, a 25 amino acid region in the ERC55 protein sufficient for binding to the HPV E6 transforming protein, allows the design of drugs which interact with the HPV E6 transforming protein. As a result, this information is useful for designing inhibitors of the E6-ERC55 interaction and therefore, drugs for fighting papillomavirus infection. Candidate Compounds
Candidate compounds can be agents which can be altered so as to have a three- dimensional structure that is substantially similar to the three-dimensional structure of E6bp, provided in Figure 1, such that the agent can bind to the HPV E6 transforming protein or portion thereof. Preferably, the altered candidate compound can bind to E6 with an affinity which is at least 10, 50, 100, 150, 200 or 500% as strong as the affinity with which E6bp binds to E6. Candidate compounds can also be agents which can be altered so as to have a three-dimensional structure that is suitable for associating with the E6bp molecule.
For example, the candidate compound can be a peptide or a peptidomimetic. Examples of peptidomimetics include peptidic compounds in which the peptide backbone is substituted with one or more benzodiazepine molecules (see e.g., James, G.L. et al, 1993, Science 260:1937-1942, the contents of which are incorporated herein by reference), peptides in which all L-amino acids are substituted with the corresponding D-amino acids, and "retro-inverso" peptides (see U.S. Patent No. 4,522,752 by Sisto, the contents of which are incorporated herein by reference). The term mimetic, and in particular, peptidomimetic, includes isosteres. The term "isostere" as used herein, includes a chemical structure that can be substituted for a second chemical structure because the steric conformation of the first structure fits a binding site specific for the second structure. The term specifically includes peptide back-bone modifications {e.g., amide bond mimetics). Such modifications include modifications of the amide nitrogen, the α-carbon, amide carbonyl, complete replacement of the amide bond, extensions, deletions or backbone cross links. Several peptide backbone modifications are known, including ψ[CH2S], ψ[CH2NH], ψ [CSNH2], ψ[NHCO], ψ[COCH2], and ψ[(E) or (Z) CH=CH]. In the nomenclature used above, ψ indicates the absence of an amide bond. The structure that replaces the amide group is specified within the brackets.
Other possible modifications include an N-alkyl (or aryl) substitution (ψ [CONR]), backbone cross linking to construct lactams and other cyclic structures, substitution of all D-amino acids for all L-amino acids within the candidate compound ("inverso" compounds) or retro-inverso amino acid incorporation (ψ[NHCO]). By "inverso" is meant replacing L-amino acids of a sequence with D-amino acids, and by "retro-inverso" or "enantio-retro" is meant reversing the sequence of the amino acids ("retro") and replacing the L-amino acids with D-amino acids. For example, if the parent peptide is Thr-Ala-Tyr, the retro modified form is Tyr-Ala-Thr, the inverso form is thr-ala-tyr, and the retro-inverso form is tyr-ala-thr (lower case letters refer to D- amino acids). Compared to the parent peptide, a retro-inverso peptide has a reversed backbone while retaining substantially the original spatial conformation of the side chains, resulting in a retro-inverso isomer with a topology that closely resembles the parent peptide. (See Goodman et al. "Perspectives in Peptide Chemistry" pp. 283-294, 1981, and U.S. Patent No. 4,522,752, the contents of which are incorporated herein by reference).
The candidate compound can also be a non peptide organic compound prepared as described in WO 9504277, the contents of which are incorporated herein by reference, as well as a steroid, a carbohydrate, a lipid and the like.
The candidate compound can be selected from a database of three-dimensional structures of known compounds. The three-dimensional structures in the database can be either experimentally determined, e.g., crystal structures from the Cambridge structural database (see Allen et al., J. Chem. Inf. Comput. Sci. 31 : 187-204, 1991, the contents of which are incorporated herein by reference) or computationally generated, e.g., using rule-based programs such as CONCORD (see Pearlman, R. S., Chem. Des. Auto. News, 2:1-7, 1987, the contents of which are incorporated herein by reference). The candidate compounds can also be designed de novo; e.g., by piecing together or assembling molecular fragments to create compounds which: (a) have a three- dimensional structure that is substantially similar to the three-dimensional structure of E6bp provided in figure 1 , such that the created compound can bind to the HPV E6 transforming protein or portion thereof or (b) have a three-dimensional structure that is suitable for associating with the E6bp molecule. For example, the GROW algorithm (Moon, J.B., et al., Proteins: Struct. Fund. Genet 11 :314-328,1991, the contents of which are incorporated herein by reference), or the LUDI program (Bδhm, H.-J. J Comput Aided Mol Design 6:61-78,1992, the contents of which are incorporated herein by reference) can be used. Other useful programs to aid one of skill in the art in assembling molecular components to create compounds include: (1) CAVEAT, described in Bartlett et al., In "Molecular Recognition in Chemical and Biological Problems". Special Pub., Royal Chem. Soc, 78, 182-196,1989, the contents of which are incorporated herein by reference; (2) 3D Database systems such as MACCS-3D (MDL Information Systems, San Leandro, CA). The use of these systems is described in Martin et al., J. Med.
Chem., 35, 2145-2154,1992, the contents of which are incorporated herein by reference; and (3) HOOK (available from Molecular Simulations, Burlington, MA).
Other molecular modeling techniques may also be employed in accordance with this invention. See e.g., Cohen et al., J. Med. Chem., 33, 883-894,1990, Naiva et al, Current Opinions in Structural Biology, 2, 202-210,1992, the contents of which are incorporated herein by reference.
Once a compound has been designed by the above methods, its similarity to the three-dimensional structure of the E6bp peptide may be evaluated. Machine Readable Storage Medium In order to use the NMR derived structure coordinates for the E6bp peptide, it is preferable to convert them into a three-dimensional representation. This can be achieved through the use of commercially available software which is capable of generating three- dimensional graphical representations of molecules or portions thereof from as set of structure coordinates. Evaluation and Design of Candidate Compounds The invention allows the use of molecular design techniques to design and evaluate candidate compounds, including inhibitory compounds, e.g., candidate compounds having a three-dimensional structure that is: (a) substantially similar to the three-dimensional structure of E6bp provided in figure 1, such that the candidate compounds can bind to the HPV E6 transforming protein or portion thereof or (b) suitable for associating with the E6bp molecule.
A potential compound which can bind to the HPV E6 transforming protein or portion thereof can be evaluated by means of a series of steps in which compounds are screened and selected for their similarity to the three-dimensional structure of the E6bp molecule.
One skilled in the art can use one of several methods to screen compounds for their similarity to the three-dimensional structure of the E6bp molecule. This process may begin by visual inspection of, for example, the three-dimensional structure of the candidate compound in comparison to the three-dimensional structure of the E6bp molecule on a computer screen, wherein the three-dimensional structure of the E6bp molecule is generated from the machine-readable storage medium.
Various computational analyses can also be used to determine whether a compound is sufficiently similar to the three-dimensional structure of the E6bp molecule. Such analyses can be carried out in current software applications, such as the Molecular Similarity application of QUANTA (Molecular Simulations Inc., Waltham, MA) version 3.3, and as described in the accompanying User's Guide, Volume 3 pg. 134-135, the contents of which are incorporated herein by reference.
The Molecular Similarity application permits comparisons between different structures, different conformations of the same structure, and different parts of the same structure. The procedure used in Molecular Similarity to compare structure is divided into four steps: 1) loading the structures to be compared; 2) defining the atom equivalences in these structures; 3) performing a fitting operation; and 4) analyzing the results.
Each structure can be identified by a name. The E6bp structure can be identified as the target (i.e., the fixed structure); the candidate compound structures can be working structures (i.e., moving structures). Since atom equivalency within QUANTA is defined by user input, equivalent atoms can be defined as protein backbone atoms (N, Cα, C and O) for all conserved residues between the two structures being compared. The process can be aided by color-coding the different parts of the molecules. Rigid fitting operations can be used. When a rigid fitting method is used, the working compound structure is translated and rotated to obtain an optimum fit with the target E6bp structure. The fitting operation uses a least squares fitting algorithm that computes the optimum translation and rotation to be applied to the moving compound structure, such that the root mean square difference of the fit over the specified pairs of equivalent atom is an absolute minimum. This number, given in angstroms, can be reported by QUANTA.
Preferred candidate structures are those having a set of structure coordinates with a root mean square deviation of conserved residue backbone atoms (e.g., N, Cα, C, O) of less than 1.5 A when superimposed, using backbone atoms, on the relevant structure coordinates listed in Figure 2 are considered identical. More preferably, the root mean square deviation is less than 1.0 A, and even more preferably, the root mean square deviation is less than 0.5 A.
Candidate compounds can also be evaluated for their ability to associate with the E6bp molecule. One skilled in the art may use one of several methods to screen compounds for their ability to associate with the E6bp molecule. This process may begin by visual inspection of, for example, the E6bp molecule on a computer screen based on the NMR derived data shown in Figure 2, and generated from the machine- readable storage medium. Selected compounds may then be positioned in a variety of orientations, or docked, within the E6bp molecule. Docking may be accomplished using software such as Quanta and Sybyl, followed by energy minimization and molecular dynamics with standard molecular mechanics force fields, such as CHARMM and AMBER. Specialized computer programs may also assist in the process of selecting fragments or chemical entities. These include: 1. AUTODOCK (D.S. Goodsell et al., "Automated Docking of Substrates to Proteins by Simulated Annealing" Proteins: Structure. Function, and Genetics. 8, pp. 195-202 (1990)). DOCK is available from University of California, San Francisco, CA. 2. DOCK, (I.D. Kuntz et al., J. Mol. Biol. 161, pp. 269-288 (1982), the contents of which are incorporated herein by reference). DOCK is available from University of California, San Francisco, CA. Synthesis of Identified Compounds
Once a compound has been evaluated by the above methods, it can be prepared by standard techniques known in the art. Peptides can be synthesized using standard techniques such as those described in Bodansky, M. Principles of Peptide Synthesis. Springer Verlag, Berlin (1993) and Grant, G.A (ed.). Synthetic Peptides: A User's Guide. W.H. Freeman and Company, New York (1992). Automated peptide synthesizers are commercially available (e.g., Advanced ChemTech Model 396; Milligen/ Biosearch 9600). Approaches to designing peptide analogs are also known in the art. For example, see Farmer, P.S. in Drug Design (E.J. Aliens, ed.) Academic Press, New York, 1980, vol. 10, pp. 119-143; Ball. J.B. and Alewood, P.F. (1990) J. Mol. Recognition 3:55; Morgan, B.A. and Gainor, J.A. (1989) Ann. Rep. Med. Chem. 24:243; and Freidinger, R.M. (1989) Trends Pharmacol. Sci. 10:270, the contents of all of which are incorporated herein by reference. Alternatively, peptide compounds can be prepared according to standard recombinant DNA techniques using a nucleic acid molecule encoding the peptide. A nucleotide sequence encoding the peptide can be determined using the genetic code and an oligonucleotide molecule having this nucleotide sequence can be synthesized by standard DNA synthesis methods (e.g., using an automated DNA synthesizer). Alternatively, a DNA molecule encoding a peptide compound can be derived from the corresponding natural gene or cDNA (e.g., using the polymerase chain reaction and/or restriction enzyme digestion) according to standard molecular biology techniques.
To facilitate expression of a peptide compound in a host cell by standard recombinant DNA techniques, the isolated nucleic acid encoding the peptide is incorporated into a recombinant expression vector. As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into which additional DNA segments may be ligated. Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non- episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as "recombinant expression vectors" or simply "expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, "plasmid" and "vector" may be used interchangeably as the plasmid is the most commonly used form of vector. However, such other forms of expression vectors, such as viral vectors, which serve equivalent functions may also be used to express a peptide compound.
The nucleotide sequence encoding the peptide compound can be operatively linked to one or more regulatory sequences, selected on the basis of the host cells to be used for expression. The term "operably linked" is intended to mean that the sequences encoding the peptide compound are linked to the regulatory sequence(s) in a manner that allows for expression of the peptide compound. The term "regulatory sequence" is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185. Academic Press, San Diego, CA (1990), the content of which are incorporated herein by reference. Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cells, those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences) and those that direct expression in a regulatable manner (e.g., only in the presence of an inducing agent). It will be appreciated by those skilled in the art that the design of the expression vector may depend on such factors as the choice of the host cell to be transformed, the level of expression of peptide compound desired, and the like. The peptide compound expression vectors can be introduced into host cells to thereby produce peptide compounds encoded by nucleic acids. The recombinant expression vectors can be designed for expression of peptide compounds in prokaryotic or eukaryotic cells. For example, peptide compounds can be expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185. Academic Press, San Diego, CA (1990). Alternatively, the recombinant expression vector may be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. Examples of vectors for expression in yeast S. cerevisiae include pYepSecl (Baldari et al., (1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et al, (1987) Gene 54:113-123), and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovirus vectors available for expression of peptide compounds in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al, (1983) Mol. Cell. Biol. 3:2156-2165) and the pVL series (Lucklow, V.A., and Summers, M.D., (1989) Virology 170:31-39). Examples of mammalian expression vectors include pCDM8 (Seed, B., (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195). When used in mammalian cells, the expression vector's control functions are often provided by viral regulatory elements. For example, commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40.
In addition to the regulatory control sequences discussed above, the recombinant expression vector may contain additional nucleotide sequences. For example, the recombinant expression vector may encode a selectable marker gene to identify host cells that have incorporated the vector. Such selectable marker genes are well known in the art. Moreover, to facilitate secretion of the peptide compound from a host cell, in particular mammalian host cells, the recombinant expression vector preferably encodes a signal sequence operatively linked to sequences encoding the amino-terminus of the peptide compound such that upon expression, the peptide compound is synthesized with the signal sequence fused to its amino terminus. This signal sequence directs the peptide compound into the secretory pathway of the cell and is then cleaved, allowing for release of the mature peptide compound (i.e., the peptide compound without the signal sequence) from the host cell. Use of a signal sequence to facilitate secretion of proteins or peptides from mammalian host cells is well known in the art.
Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, electroporation, microinjection and viral-mediated transfection. Suitable methods for transforming or transfecting host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory press (1989)), and other laboratory manuals.
For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene encoding the peptide compound. Preferred selectable markers include those that confer resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a selectable marker may be introduced into a host cell on the same vector as that encoding the peptide compound or may be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die). Assessment of the Biological Activity of Identified Compounds
Once a compound has been identified and synthesized, its biological activity can be assessed. For example, an assay can be used to assess the ability of a compound to inhibit binding between ERC55 (i.e., E6bp) and the HPV E6 transforming protein. A variety of assays are available and readily apparent to the skilled artisan. For example, in one such biological evaluation assay the identified compound can be contacted with an isolated and purified ERC55 protein. The mixture of the candidate compound and ERC55 can then be added to a composition containing the E6 protein but which does not contain ERC55. Detection and quantification of labelled E6/ERC55 complexes provides a means for determining the candidate compound's efficacy at inhibiting complex formation between the papillomavirus E6 protein and the ERC55 protein. A control assay can also be performed to provide a baseline for comparison. In the control assay, isolated and purified ERC55 is added to a composition containing the E6 protein, and the formation of an E6/ERC55 complex is quantitated in the absence of the test compound.
Complex formation between ERC55 and E6 may be detected by a variety of other methods as well. For example, glutathione-S-transferase/E6 (GST/E6) fusion proteins can be adsorbed onto glutathione sepharose beads which can then be combined with an 35S-labeled ERC55 protein and incubated under conditions conducive to complex formation, [e.g., at 4°C in a buffer of 25 mM Tris-HCl (pH 7.2), 50 mM NaCl and 0.2%) NP-40]. Following incubation, the beads can be washed to remove any unbound ERC55, and the sepharose bead-bound radiolabel can be determined directly (e.g. beads placed in scintilant), or in the superatantant after the E6 ERC55 complexes are dissociated (e.g. by treatment with DTT). The supernatant containing the complexes can be separated by SDS-PAGE gel before detection.
Additionally, ERC55 or E6bp can be used to generate a two-hybrid assay, as described in U.S. Patent No: 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920- 924; and Iwabuchi et al. (1993) Oncogene 8:1693-1696), for subsequently assessing the ability of compounds to disrupt binding of ERC55 or E6bp to E6. The interaction trap assay relies on reconstituting in vivo a fuctional transcriptional activator protein from two separate fusion proteins, one of which comprises the DNA-binding domain of a transcriptional activator fused to an E6 protein. The second fusion protein comprises a transcriptional activation domain (e.g. able to initiate RNA polymerase transcription) fused to ERC55 or E6bp. When the E6 and ERC55 or E6bp proteins interact, the two domains of the transcriptional activator protein are brought into sufficient proximity as to cause transcription of a reporter gene. For example, Saccharomyces cerevisiae YPB2 cells can be transformed simultaneously with a plasmid encoding a GAL4db-E6 fusion and with a plasmid encoding the GAL4ad domain fused to ERC55 or E6bp. Moreover, the strain is transformed such that the GAL4-responsive promoter drives expression of a phenotypic marker. For example, the ability to grow in the absence of histidine depends on the expression of the HIS3 gene. When the HIS3 gene is placed under the control of a GAL4-responsive promoter, relief of this auxotrophic phenotype indicates that a functional GAL4 activator has been reconstituted through the interaction of E6 and ERC55 or E6bp. Thus, a compound able to inhibit an ERC55 or E6bp interaction with E6 will result in yeast cells unable to grow in the absence of histidine. Alternatively, the phenotypic marker (e.g. instead of the HIS3 gene) can be one which provides a negative selection when expressed such that compounds which disrupt E6/ERC55 or E6/E6bp interactions confer positive growth selection to the cells.
EXAMPLES
Example 1 : Sample Preparation
The E6bp peptide was synthesized using standard protein synthesis methods and purified by HPLC. The recombinant E6 protein is expressed in E. coli, solubilized from inclusion bodies, and refolded using a rapid dilution method. The E6 protein has been expressed and solubilized from HP V- 16 to a concentration of 4mg/mL (0.2 mM). For the peptide-E6 complex, (protonated) peptide is added to 0.2 mM deuterated (2H)E6 following the procedure of Shibata et al., (1995) Arch. Biochem. Biophys. 319:204, the contents of which are incoφorated herein by reference. This allows observation of only the H NMR resonances of the E6-bound peptides. Example 2: NMR Spectroscopy
Peptide samples were prepared at about 2 mM concentration. One- and two- dimensional (ID & 2D) NMR data were collected on a 500 MHz Bruker AMX500 spectrometer. E6bp is titrated with calcium and NMR spectra recorded to determine the effect on the peptide conformation. Low concentration peptide samples (0.2 mM) are titrated with E6 to determine the residues that are affected most by E6. Samples are prepared in both H2O and D2O solution.
Since the equilibrium binding constant of E6bp-E6 complex is in the μM range, transferred NOE (TRNOE) experiments, as described in Clore M. and Gronenborn A.M., (1982) J. Magn. Reson. 48:402; ibid. (1983) 53:423; and Sykes B.D., (1994) Cur. Opin. Biotech. 4:392, the contents of which are incoφorated herein by reference, are suitable to determine the conformation of E6-bound E6bp. TRNOE experiments are performed with an approximately 10-fold excess of ligand relative to the protein (2 mM peptide and 0.2 mM E6 protein). In TRNOE experiments, the spectra exhibit mostly the lineshape and intensity of the unbound peptide, but the NOESY cross peaks represent the bound conformation of the peptide. Spectra are optimized in terms of salt concentration (up to 0.2 M) and temperatures (5°C to 35°C) to identify the conditions that yield superior quality NMR data with narrow line shapes, following the methods described in Baleja J.D., (1996) Techniques in Protein Chemistry VI 131, the contents of which are incoφorated herein by reference.
Total correlation spectroscopy (TOCSY), and nuclear Overhauser effect spectroscopy (NOESY) experiments are used for resonance assignment and to obtain conformation data for structure calculations, as described in Bax A. and Davis D.G., (1985) J. Magn. Reson. 65:355; and Wϋthrich K, (1986) NMR of Proteins and Nucleic Acids, Wiley, New York, the contents of which are incoφorated herein by reference. NMR data are processed using the Bruker NMR processing program or FELIX (Biosym, Inc.)
Example 3: NMR Resonance Assignment Strategy
NMR resonance strategy is performed as described in Wuthrich et al. In this method, TOCSY spectra are analyzed to provide information regarding spin-spin coupled NMR-active nuclei. Different types of amino acids side chains produce distinct TOCSY cross peak patterns. The cross-peak patterns for each amino acid are linked together using the NOESY experiment since NOE sequential connectivities can be identified between residues neighboring in sequence. NOESY spectra show cross peaks between protons that are within 5 A of each other and therefore also show contacts between residues that are sequentially distant but spatially close. Inteφroton distances are determined from NOESY cross peak intensities. Φ and χ-1 torsion angles are obtained by measuring the coupling constants from ID slices of resolution-enhanced 2D data, as described in Wuthrich K., (1986) NMR of Proteins and Nucleic Acids, Wiley, New York; and Szyperski T. et al., (1992) J. Magn. Reson. 99:552, the contents of which are incoφorated herein by reference. Η NMR resonance assignments were made (see Table II) and a low resolution structure of the calcium-bound E6bp was calculated using approximately 200 inteφroton distances and 15 Φ torsion angles derived from NOESY data and distance geometry and simulated annealing protocols of the INSIGHTII molecular modeling program (see Figure 1). Consistent with peptide analogs of other EF-hand proteins, the E6bp peptide dimerizes. From the dispersion present in the NMR spectra, it is estimated that 400 additional inteφroton distances are obtainable. Structure calculation is repeated with the more extensive data set to determine the high resolution structure of E6bp. The high resolution structure of E6bp both free and bound to E6 protein is also solved. Knowing the conformation for this E6bp in the absence of E6 protein is important for understanding the conformational changes brought about by binding to E6 and the binding surface of the peptide involved in the interaction. TABLE II
NMR Resonance Assignments for E6 binding peptide
Figure imgf000024_0001
* Tentative assignment. "-" not observable. 1 mM peptide was in a buffer containing 100 mM NaCl, 50 mM calcium chloride, pH 6.0, 10% TFE, 35 deg. C. Structure Calculation and Analysis
The inteφroton distances and torsion angle information (see Table III) are introduced into the distance geometry program, DGII of Insight (Biosym, Inc.). The initial structure calculated using distance geometry and simulated annealing protocols is refined by determining the presence or lack of NOE cross peaks for spatially proximal protons in the calculated structure. Structures are compared to one another to highlight the conformational changes. For example, the E6bp peptide is expected to have major changes in surface properties on binding calcium, such as the formation of a deep hydrophobic cavity essential for target protein recognition (see Ikura M., (1996) Trends Biochem. Sci. 21:14, the contents of which are incoφorated herein by reference). The structures of E6ap and E6bp are compared to one another to test the hypothesis that they are similar. The residues involved in interaction with E6 are identified by chemical shift changes that occur upon titration, as described in Baleja J.D. et al., (1992) Nature 356:450; and Baleja J.D. et al., (1992) Biochemistry 33:3071, the contents of which are incoφorated herein by reference. Furthermore, differences in conformation observed on binding E6 are determined.
The high resolution structures and the identification of key residues for binding E6 are important for the design of small molecule inhibitors that block the oncogenic properties of E6 ( see Fesik S.W., (1993) J. Biomol. NMR 3:261; and Kuntz I.D. et al, (1994) Ace. Chem. Res. 27:117, the contents of which are incoφorated herein by reference). The first focus is on readily synthesized peptide derivatives for in vitro testing. Modifications of a lead compound to optimize interactions are proposed using the BUILDER and LUDI features of the INSIGHTII molecular modeling program (and chemical intuition). Binding to the protein surface uses the DOCK module. An illustration of the approach is given in Figure 3.
TABLE III
STRUCTURE DATA FOR E6 BINDING PEPTIDE
BIOSYM restraint 1 remote prochiral center
Figure imgf000025_0001
Figure imgf000025_0002
Figure imgf000026_0001
Figure imgf000027_0001
Figure imgf000028_0001
3J dihedral torsion angle information (theta and chι-1)
Figure imgf000028_0002
Figure imgf000029_0001
Structure Analysis of ERC55
Sequence analysis of the E6-interacting protein, ERC55, revealed homology to proteins containing the calcium-binding helix-loop-helix motif called an EF hands. Six EF hands are predicted to occur in its C-terminal domain. Only one of these EF hands, a 25 amino acid segment, E6bp, binds E6 selectively (Chen J.J. et al., (1995) Science 269:529), and with about the same affinity as the full-length ERC55 protein. Structural determination of other EF-hand domains using 'H NMR spectroscopy (Ikura M. (1996) Trends Biochem. Sci. 21:14) suggests the feasibility for NMR study of this peptide. A small amount of E6bp peptide was synthesized, purified and a 0.7 mM sample was prepared. As predicted, the peptide bound calcium and showed excellent NMR spectral dispersion as shown in Figure 2. The calcium free form of E6bp was studied. Structure Analysis of E6AP
An 18 amino acid residue peptide fragment, E6ap, is the minimal region of E6AP that binds E6. The sequence of E6ap is homologous to E6bp (see Table I), and their solution properties are likely to be similar. Sequence prediction (Wishart et al., (1994) Comp. Biol. Sci. 10:121) has indicated α helical structure for the homologous region of E6ap-the same region which has been determined to be α helical in the primary solution structure of E6bp (Figure 1). E6ap does not have the calcium ligands required for an EF hand, and therefore is unlikely to bind calcium. The small sizes of the E6bp and E6ap peptides make them attractive for detailed structural studies by solution NMR methods in a timely manner. Example 4: Analysis of The E6bp Structure By Site-Directed Mutagenesis To further map the domain that is important for E6 binding, additional deletion mutants from E6bp were constructed as GST fusions and tested for binding with E6. Deletion of the first alpha helix (20mer) did not affect binding, nor did deletion of two additional amino acid residues from the C-terminal end of E6bp to the distal region of the second alpha helix affect binding. A 13 amino acid peptide containing the second alpha helix as a GST fusion retained the ability to bind to E6, although at reduced efficiency in comparison to intact E6bp. Notably, the major portion of the loop region from the EP-hand motif is deleted in the 13 amino acid peptide. The ability of the second alpha helix to bind E6 demonstrates that the interaction of E6BP with E6 is independent of calcium binding, as the first alpha helix and loop region from the EP- hand motif are both required for calcium binding.
Alanine replacement mutations in E6bp were also constructed and used to define the amino acid(s) important for E6 interaction in the alpha helix. Some mutations were also made in the surrounding regions. As expected, mutants VI 9 A, S20A, E22A, R29A, and W30A, which have mutations in the area beyond the alpha helix, bound E6 at wild- type level (see Figure 4). Mutant F18A showed some reduced binding. Notably, phenyalanine is a hydrophobic residue, which may enhance the interaction between E6 and E6bp. This may explain why a leucine to alanine change at amino acid 21 also reduced binding. While mutants at the boundary of alpha helix (E23 and D28A) showed modest reduction (approximately 60% of wild-type binding) in their E6 binding ability, all other mutants made up of the alpha helical structure showed substantial decrease in E6 binding. Notably, a change of leucine to alanine at amino acid 25 totally abolished binding. The ability of mutants F18A and E23A to bind E6 confirmed the notion that the interaction of E6BP with E6 is independent of calcium binding, as both mutations abolished calcium binding. Finally, a leucine to proline change at amino acid 25 of E6bp was created and used in a binding experiment. This change (L25P) which is expected to disrupt the alpha helical structure, totally abolished E6bp-binding with E6, indicating that the alpha helix from the conserved motif is indeed important for E6 binding.
EQUIVALENTS Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the following claims.

Claims

CLAIMS:
1. A method for evaluating a candidate compound for the ability to interact with the HPV E6 transforming protein, comprising: supplying a three-dimensional structure for E6bp; supplying a three-dimensional structure for said candidate compound; and comparing the three-dimensional structure of said candidate compound to the three-dimensional structure of said E6bp, wherein similarity in the structure of said candidate compound to the structure of said E6bp is indicative of the ability of said candidate compound to interact with said HPV E6 transforming protein, thereby evaluating the ability of said candidate compound to interact with said
HPV E6 transforming protein.
2. A method of providing or identifying a candidate compound which has the ability to interact with the HPV E6 transforming protein, comprising: supplying a three-dimensional structure for E6bp; supplying a three-dimensional structure for said candidate compound; comparing the three-dimensional structure of said candidate compound to the three-dimensional structure of said E6bp; and altering the structure or altering the spatial position of the structure, of said candidate compound, thereby providing or identifying said candidate compound which has the ability to interact with said HPV E6 transforming protein.
3. The method of claim 2, further comprising comparing the altered structure or spatial position of the structure of said candidate compound to the three-dimensional structure of said E6bp.
4. The method of claim 3, further comprising making a second alteration in the structure or the spatial position of the structure, of said candidate compound.
5. The method of claim 2, wherein the method comprises defining an atom equivalency in said candidate compound and said E6bp three-dimensional structures.
6. The method of claim 2, wherein the method further comprises: defining an atom equivalency in said candidate compound and said E6bp three- dimensional structures; and performing a fitting operation between said candidate compound and said E6bp three-dimensional structures.
7. The method of claim 2, wherein said comparing step is performed visually.
8. The method of claim 2, wherein said alteration results in a change in the orientation of the structure of said candidate compound.
9. The method of claim 8, wherein said alteration comprises rotation of said structure.
10. The method of claim 9, wherein the orientation of the structure of said candidate compound is altered relative to the structure of the E6bp.
11. The method of claim 2, wherein the alteration results in a change in a parameter selected from the group consisting of: hydrophobicity, steric bulk, electrostatic properties, size, bond angle, type of bond and amino acid sequence of the candidate compound.
12. The method of claim 2, further comprising creating a record of one or both the three-dimensional structures of said candidate compound and said E6bp.
13. The method of claim 2, wherein the three-dimensional structure of said candidate compound and/or said E6bp is displayed on a machine capable of displaying a graphical three-dimensional representation.
14. The method of claim 2, wherein said candidate compound is selected from a database of three-dimensional structures.
15. The method of claim 14, wherein said three-dimensional structures are determined experimentally.
16. The method of claim 14, wherein said three-dimensional structures are computationally generated.
17. The method of claim 2, wherein said candidate compound is designed de novo.
18. A machine-readable data storage medium, comprising a data storage material encoded with a set of NMR derived coordinates which define the three-dimensional structure of the E6bp molecule.
19. A method of treating a subject at risk for infection by a HPV, comprising administering to a subject a therapeutically effective amount of a compound other than an E6bp fragment, wherein the candidate compound has a structure sufficiently duplicative of the structure of E6bp shown in Figure 1, so as to bind to the HPV E6 transforming protein and prevent its mteraction with the ERC55 protein, thereby treating a subject at risk for infection by a HPV.
20. A compound having a three-dimensional structure substantially similar to the three- dimensional structure of E6bp, such that the candidate compound can bind to the HPV E6 transforming protein, wherein said compound is other than E6bp or ERC55.
21. The compound of claim 20, wherein the structure of the candidate compound is sufficiently similar to the three-dimensional structure the E6bp provided in Figure 1 , such that the candidate compound can bind to the HPV E6 transforming protein, with an affinity which is at least half that of E6bp.
22. A method of evaluating the ability of a candidate compound to associate with an E6bp molecule, comprising: supplying a three-dimensional structure for said E6bp molecule; supplying a three-dimensional structure for said candidate compound; and performing a fitting operation between said three-dimensional structures of said candidate compound and said E6bp molecule, thereby evaluating the ability of said candidate compound to associate with said E6bp molecule.
23. A method of modeling the region of the HPV E6 protein which binds ERC55, comprising: supplying a three-dimensional structure for an E6bp molecule; and supplying a complementary structure to said E6bp molecule, thereby modeling the region of the HPV E6 protein which binds ERC55.
PCT/US1998/020991 1997-10-07 1998-10-06 Structure-based rational design of compounds to inhibit papillomavirus infection WO1999017616A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP98953264A EP0964617A1 (en) 1997-10-07 1998-10-06 Structure-based rational design of compounds to inhibit papillomavirus infection
JP52222399A JP2001513823A (en) 1997-10-07 1998-10-06 Rational design of papillomavirus infection-based compounds based on their structure
CA002273186A CA2273186A1 (en) 1997-10-07 1998-10-06 Structure-based rational design of compounds to inhibit papillomavirus infection
AU10676/99A AU1067699A (en) 1997-10-07 1998-10-06 Structure-based rational design of compounds to inhibit papillomavirus infe ction

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US6129597P 1997-10-07 1997-10-07
US60/061,295 1997-10-07

Publications (1)

Publication Number Publication Date
WO1999017616A1 true WO1999017616A1 (en) 1999-04-15

Family

ID=22034861

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1998/020991 WO1999017616A1 (en) 1997-10-07 1998-10-06 Structure-based rational design of compounds to inhibit papillomavirus infection

Country Status (5)

Country Link
EP (1) EP0964617A1 (en)
JP (1) JP2001513823A (en)
AU (1) AU1067699A (en)
CA (1) CA2273186A1 (en)
WO (1) WO1999017616A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002003919A2 (en) * 2000-07-10 2002-01-17 The Board Of Trustees Of The Leland Stanford Junior University Structural targets in hepatitis c virus ires element
US6677160B1 (en) 1999-09-29 2004-01-13 Pharmacia & Upjohn Company Methods for creating a compound library and identifying lead chemical templates and ligands for target molecules
US6764858B2 (en) 1999-09-29 2004-07-20 Pharmacia & Upjohn Company Methods for creating a compound library
US7144982B2 (en) 2000-01-20 2006-12-05 University Of Minnesota Polypeptides with therapeutic activity and methods of use
WO2013037700A1 (en) 2011-09-12 2013-03-21 Valdospan Gmbh Human papilloma virus (hpv) e7 is a prognostic marker for prostate cancer

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5532348A (en) * 1993-07-30 1996-07-02 United States Of America E6 associated protein and methods of use thereof
US5583000A (en) * 1991-09-03 1996-12-10 The Regents Of The University Of California Protease-binding compounds and methods of use

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5583000A (en) * 1991-09-03 1996-12-10 The Regents Of The University Of California Protease-binding compounds and methods of use
US5532348A (en) * 1993-07-30 1996-07-02 United States Of America E6 associated protein and methods of use thereof

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CHARRETIER E, GUERON M: "APPLICATION DE LA RESONANCE MAGNETIQUE NUCLEAIRE A LA DETERMINATIONDE LA STRUCTURE DES PROTEINES EN SOLUTION", BIOCHEMISTRY AND CELL BIOLOGY. BIOCHIMIE ET BIOLOGIE CELLULAIRE., NRC RESEARCH PRESS, CA, vol. 69, 1 January 1991 (1991-01-01), CA, pages 323 - 335, XP002917013, ISSN: 0829-8211 *
CHEN J J, ET AL.: "INTERACTION OF PAPILLOMAVIRUS E6 ONCOPROTEINS WITH A PUTATIVE CALCIUM-BINDING PROTEIN", SCIENCE, AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE, US, vol. 269, 28 July 1995 (1995-07-28), US, pages 529 - 531, XP002917012, ISSN: 0036-8075, DOI: 10.1126/science.7624774 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6677160B1 (en) 1999-09-29 2004-01-13 Pharmacia & Upjohn Company Methods for creating a compound library and identifying lead chemical templates and ligands for target molecules
US6764858B2 (en) 1999-09-29 2004-07-20 Pharmacia & Upjohn Company Methods for creating a compound library
US7377894B2 (en) 1999-09-29 2008-05-27 Pharmacia & Upjohn Company Methods for creating a compound library and identifying lead chemical templates and ligands for target molecules
US7144982B2 (en) 2000-01-20 2006-12-05 University Of Minnesota Polypeptides with therapeutic activity and methods of use
WO2002003919A2 (en) * 2000-07-10 2002-01-17 The Board Of Trustees Of The Leland Stanford Junior University Structural targets in hepatitis c virus ires element
WO2002003919A3 (en) * 2000-07-10 2002-04-18 Univ Leland Stanford Junior Structural targets in hepatitis c virus ires element
WO2013037700A1 (en) 2011-09-12 2013-03-21 Valdospan Gmbh Human papilloma virus (hpv) e7 is a prognostic marker for prostate cancer

Also Published As

Publication number Publication date
AU1067699A (en) 1999-04-27
EP0964617A1 (en) 1999-12-22
CA2273186A1 (en) 1999-04-15
JP2001513823A (en) 2001-09-04

Similar Documents

Publication Publication Date Title
US8921323B2 (en) Methods and compositions for modulating BCL-2 family polypeptides
Elrod-Erickson et al. High-resolution structures of variant Zif268–DNA complexes: implications for understanding zinc finger–DNA recognition
Colgan et al. Binding of the human immunodeficiency virus type 1 Gag polyprotein to cyclophilin A is mediated by the central region of capsid and requires Gag dimerization
EP2338056B1 (en) Chemical modulators of pro-apoptotic bax and bcl-2 polypeptides
DK1987178T3 (en) Process for the construction and screening of peptide structure libraries
CN110709412A (en) Protein and peptide tags with increased rate of spontaneous isopeptide bond formation and uses thereof
Reimer et al. Local structural changes caused by peptidyl–prolyl cis/trans isomerization in the native state of proteins
US8063019B2 (en) Scaffold polypeptides for heterologous peptide display
Isaac et al. Biochemical basis for distinct roles of the heterochromatin proteins Swi6 and Chp2
Shi et al. Structural basis of copper-efflux-regulator-dependent transcription activation
WO1999017616A1 (en) Structure-based rational design of compounds to inhibit papillomavirus infection
CA2449290A1 (en) Mutants of igf binding proteins and methods of production of antagonists thereof
US20020156585A1 (en) Crystallization and structure determination of Staphylococcus aureus UDP-N-acetylenolpyruvylglucosamine reductase (S. aureus MurB)
JP2003212895A (en) Crystal structure of antibiotic bound to 30s ribosome, and its application
EP2511290A1 (en) Compounds binding to the bacterial beta ring
US20090087445A1 (en) Method for the Redox Potential-Dependent Detection of Target Molecules by Interacting Polypeptides
JP2003510250A (en) Crystallization and structure determination of Staphylococcus aureus elongation factor P
WO2002002622A2 (en) Crystal structure of survivin
US20160041181A1 (en) Nmr assay to screen protein-protein interaction inhibitors
JP4430821B2 (en) Region of papillomavirus E1 helicase involved in E1 oligomerization
US11266616B2 (en) Compositions and methods for inhibiting kinase activity
Sandate et al. Structural insights into viral hijacking of p53 by E6 and E6AP
Pantelejevs et al. A recombinant approach for stapled peptide discovery yields inhibitors of the RAD51 recombinase
EP1452590A1 (en) Stereostructure of dna recombination repair enzyme and utilization thereof
Lu Macromolecular interaction studies on FF1-3 of human CA150 and STAT1-importin α5 using biophysical and biochemical methods

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AU CA JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 10676/99

Country of ref document: AU

ENP Entry into the national phase

Ref document number: 2273186

Country of ref document: CA

ENP Entry into the national phase

Ref country code: JP

Ref document number: 1999 522223

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 1998953264

Country of ref document: EP

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 1998953264

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1998953264

Country of ref document: EP