WO1998054333A2 - Mammalian caax processing enzymes - Google Patents

Mammalian caax processing enzymes Download PDF

Info

Publication number
WO1998054333A2
WO1998054333A2 PCT/US1998/011415 US9811415W WO9854333A2 WO 1998054333 A2 WO1998054333 A2 WO 1998054333A2 US 9811415 W US9811415 W US 9811415W WO 9854333 A2 WO9854333 A2 WO 9854333A2
Authority
WO
WIPO (PCT)
Prior art keywords
seq
dna
protein
nucleic acid
caax
Prior art date
Application number
PCT/US1998/011415
Other languages
French (fr)
Other versions
WO1998054333A3 (en
Inventor
Matthew N. Ashby
Dago G. Dimster-Denk
John W. Philips
Original Assignee
Acacia Biosciences Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Acacia Biosciences Inc. filed Critical Acacia Biosciences Inc.
Priority to AU77216/98A priority Critical patent/AU7721698A/en
Publication of WO1998054333A2 publication Critical patent/WO1998054333A2/en
Publication of WO1998054333A3 publication Critical patent/WO1998054333A3/en
Priority to US09/454,889 priority patent/US6518035B1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/48Hydrolases (3) acting on peptide bonds (3.4)
    • C12N9/50Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
    • C12N9/64Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue
    • C12N9/6421Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue from mammals
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides

Definitions

  • the eukaryotic cell is made up of a wide variety of specialized structures and membrane-enclosed compartments which create special intracellular environments. Many cellular components, such as proteins, must be targeted to particular intracellular locations for proper function.
  • the lipid bilayer membrane that surrounds the cell is one such specialized cellular environment. Cellular components such as proteins are delivered to and must associate in a particular way with the correct membrane surface of the lipid bilayer. A number of mechanisms which the cell has evolved for targeting proteins to and inserting proteins into the lipid bilayer membrane of the cell have now been described.
  • Proteins targeted to the lipid bilayer membrane may be situated within the lipid bilayer membrane (i.e., a portion of the protein is inserted into and across the lipid bilayer) and are typically referred to as "integral membrane proteins". Integral membrane proteins possess intrinsic hydrophobic regions (comprising amino acid regions of relatively high intrinsic or apparent hydrophobicity) which are often inserted into the lipid bilayer as the protein is being synthesized. Alternatively, proteins targeted to the lipid bilayer membrane may be situated at and associate with the membrane surface without being inserted across the membrane. Such proteins are typically referred to as “peripheral membrane proteins”.
  • Peripheral membrane proteins are usually less tightly associated with the lipid bilayer than integral membrane proteins. Peripheral membrane proteins may associate with the lipid bilayer by protein-protein interactions, by hydrophobic interactions between the bilayer and one or more intrinsically hydrophobic regions of the membrane protein, or by the co- or post- translational addition of a lipid group to the membrane protein.
  • the most recently discovered lipid modification involves the covalent attachment of cholesterol biosynthetic intermediates to proteins, and is referred to generically as protein prenylation. See, e.g., W.R. Schafer and J. Rine, Annu. Rev. Genet.. 26, p. 209 (1992); S. Clarke, Annu. Rev. Biochem., 61, p. 355 (1992); each of which is incorporated herein by reference.
  • peripheral membrane proteins processed by the latter mechanism has a terpenoid lipid added to the C-terminal end for such purposes.
  • Proteins belonging to this class (often referred to as prenylated proteins) that possess a C-terminal tetrapeptide CAAX motif (where C is cysteine, A is usually aliphatic, and X can be a variety of different residues) undergo a series of post-translational processing reactions.
  • These reactions include the attachment of either a 15-carbon farnesyl or a 20-carbon geranylgeranyl lipid moiety to the cysteine residue of the CAAX motif (prenylation) , proteolytic removal of the C-terminal three amino acids (AAX) and ethylesterification of the -carboxyl group of the terminal prenylated cysteine residue.
  • Prenylation of CAAX protein substrates is catalyzes by one of two different type I protein prenyl- transferases. If the X position is occupied by leucine or phenylalanine, the cysteine residue of the protein is geranylgeranylated.
  • Prenylated proteins include, for example, many proteins involved in signal transduction such as the Ras oncogenes, fungal pheromones and the y subunits of trimeric guanosine triphosphate (GTP) -binding proteins (G proteins) . They also include a wide range of small GTP- binding proteins that participate in vesicular trafficking. See, e.g, P. J. Casey, Curr. Opin. Cell.
  • the yeast Saccharomyces cerevisiae possesses numerous CAAX-containing proteins including, but not limited to, Raslp, Ras2p, Mfalp, Mfa2p, and Stel ⁇ p.
  • Yeast have two genes encoding prenylation-dependent CAAX endoproteinases, AFC1 and RCEl (Boyartchuk et al., supra) .
  • the Afcl protein (Afclp; encoded by AFC1 ) is a zinc protease that participates in the processing of the yeast mating pheromone a.-factor.
  • the Reel protein contributes to the processing of ⁇ -factor and the yeast Ras protein (Ras2p) .
  • Rcelp is only a minor contributor in vivo to _a-factor processing but appears to be required for Ras2p processing.
  • Yeast have a single gene, STE14, which encodes the membrane-associated, CAAX C-terminal methyltransferase (Stel4p) (Ashby et al . , Yeast, 9, p. 907 (1993) (incorporated herein by reference) ) .
  • Tel4p CAAX C-terminal methyltransferase
  • Ras proteins are attached to the inner surface of the lipid bilayer membrane that surrounds each cell. Ras proteins function as a primary intracellular signaling mechanism for cell proliferation. They respond to external stimuli such as growth factors and hormones which bind to the cell membrane and trigger an intracellular cascade of chemical reactions known generally as "signal transduction". These chemical signals lead to the cell nucleus, where they activate gene transcription and the process of cell division. When the extracellular stimulus stops, the internal signaling pathway is deactivated and the cell returns to its resting state.
  • Ras gene When dominant mutations occur in the Ras gene (turning it into a Ras oncogene) , mutant proteins are produced that lock the signaling pathway in an active state, resulting in uncontrolled cell division. Dominant mutations in the Ras gene are presently the second most common type of mutations in human cancers. They are found in more than 50% of all human cancers including pancreatic cancer (95%), colon cancer (50%), lung cancers (45%) and leukemias (30%) . The functional activity of Ras protein depends on its attachment to the cell membrane, which in turn is mediated by protein prenylation. Farnesylation of the Ras protein in yeast and humans is required for its oncogenic transforming activity.
  • each of these processing enzymes represents a potential target that could be attacked pharmacologically to block the oncogenic action of mutant Ras protein.
  • Protein farnesyl transferases have become an active target for drug development. Inhibitors of protein farnesyl transferases are being developed, for example, as potential therapeutics for Ras-based tumors. See, e.g., F. Tamanoi, Trends Biochem. Sci.. 18, p. 349 (1993); N. E. Kohl et al., J. Cell. Biochem. SUPPI ⁇ . 22, p. 145 (1995). The methyltransferase enzyme will also be an attractive target for drug development.
  • the mammalian counterparts to the yeast CAAX endoproteinase and methyltransferase genes have not been isolated.
  • the mammalian homologs to each of the three yeast CAAX processing enzymes each represent a potential target to block the oncogenic action of mutant Ras protein in tumors, or more generally, to modulate the activity of prenylated peripheral membrane proteins. It would thus be desirable to identify and to isolate the mammalian homologs to the yeast CAAX endoproteinase and methyltransferase genes.
  • the present invention provides mammalian DNA sequences that display a high degree of sequence identity to their counterpart genes in Saccharomyces cerevisiae, RCEl and AFC1 , encoding prenylation-dependent CAAX endoproteinases.
  • cDNA complementary DNA sequences of the human and mouse RCEl functional homologs are provided.
  • Human cDNA sequences encoding proteins having a high degree of amino acid sequence identity to the yeast Afclp protein are also provided.
  • This invention is also directed to recombinant DNA molecules comprising the mammalian DNA sequences, DNA molecules and antisense RNA molecules which hybridize under stringent hybridization conditions to those DNA sequences, hosts transformed with the recombinant DNA molecules and protein expression products produced by culturing the transformed hosts. Antibodies directed against the protein expression products are also provided. Also provided are assays to identify inhibitors of one or more mammalian CAAX processing enzymes and kits for making the above products and performing the above assays. Finally, this invention provides pharmaceutical compositions comprising an inhibitor of a mammalian CAAX processing enzyme, and methods for treating a CAAX-protein mediated disease or disorder in a patient by administering such a pharmaceutical composition.
  • Figure 1A Amino acid sequence alignment of yeast RCE with a human EST.
  • Figure IB Amino acid sequence alignment of yeast AFC1 (NAG2) with a human EST.
  • Figure IC Amino acid sequence alignment of yeast STE14 with a mouse EST.
  • Figure 2A Human hRCE cDNA (SEQ ID NO:l) and predicted protein sequences (SEQ ID NO:2) .
  • FIG. 2B Mouse mRCE cDNA (SEQ ID NO: 3) and predicted protein sequences from the first in-frame ATG codon (SEQ ID NO:4) .
  • FIG. 2C Mouse mRCE cDNA sequence with additional 5' genomic sequence relative to Figure 2B (SEQ ID NO: 5). The first in-frame ATG codon is underlined. The sequence obtained exclusively from the mouse genomic clones is represented from nucleotides 1-152 (SEQ ID NO: 5). See (SEQ ID NO: 6) for the predicted protein sequence.
  • Figure 3. Amino acid sequence alignment of the yeast Rcelp protein (RCEl. PRO), the human hRCE protein (HRCE1.PR0) and the mouse mRCE (MRCEORF.PRO) protein. The alignment utilized the Clustal method with PAM250 residue weight table.
  • FIG. 4 Northern blots of hRCE mRNA expression levels in various human tissues.
  • the source tissue for the poly (A) RNA is indicated along the top, and the size standards (kbp) along the left side of each blot.
  • the hRCE message appears to be approximately 1.4 kbp and was expressed in all tissues examined.
  • FIG. 5 Primer extension analysis of hRCE mRNAs .
  • Human placental and pancreatic poly (A) RNA was hybridized to a hRCE specific primer and extended using reverse transcriptase.
  • the extension products were resolved by polyacrylamide gel electrophoresis, along with the products of a sequencing reaction to serve as size standards.
  • the transcriptional start sites appear to be in the region from -10 to -26 (relative to the valine codon GTG at position 116-118; Figure 2A) .
  • FIG. 6 Yeast halo assays of wild-type strain (W303) (MAT-a; leu2-3,112; ade2-101; his3-ll; trpl-1; ura3-l), a strain lacking the AFCl gene ( ⁇ afcl) , a strain lacking the RCEl gene ( ⁇ rcel), and a strain lacking both AFCl and RCEl ( ⁇ ) are shown in the top four panels.
  • Halo assays of a strain lacking both AFCl and RCEl genes expressing either no human protein ( ⁇ pJPl), hRCE expressed from the yeast MEV1 promoter ( ⁇ pACA7001 ) , or hRCE expressed from the yeast ERG12 promoter ( ⁇ pACA7002) are shown in the lower three panels. Halos are present when hRCE is expressed in yeast indicating that the human protein is functional in yeast (see Example 3) .
  • Figure 7 Human hAFC cDNA and protein sequences.
  • Figure 8 Amino acid sequence alignment between the full length yeast Afclp protein (YAFCl.PRO) and the human hAFC protein (HAFC1.PRO). The alignment utilized the Clustal method with PAM250 residue weight table.
  • Figure 8 Amino acid alignment between the full length yeast Afclp protein (YAFCl.PRO) and the human hAFC protein (HAFC1.PRO). The alignment utilized the Clustal method with PAM250 residue weight table.
  • Figure 9 Amino acid alignment between a region of the human hRCElp protein (SEQ ID NO: 2, amino acid residues 166-233) comprising the histidine-rich sequence, HxHH (SEQ ID NO: 2, amino acid residues 172-175) and nucleic acid sequences in GenBank.
  • the alignment utilized the Clustal method with PAM250 residue weight table.
  • Figure 10 Yeast halo assays performed with the following strains: wild-type strain (W303) (MAT-a; leu2-3,112; ade2- 101; his3-ll; trpl-1; ura3-l); a strain lacking the RCEl gene ( ⁇ rcel); a strain lacking both AFCl and RCEl ( ⁇ afcl,
  • ⁇ rcel a strain transformed with a URA3 + marked, high-copy (2 ⁇ ) plasmid carrying wildtype hRCE expressed from the yeast ERG12 promoter (pAB13 wild-type) ; a strain transformed with a pAB13-derived plasmid in which the histidine at position 172 of hRCE was changed to an alanine (pAB15 H172A) ; a strain transformed with a pAB13- derived plasmid in which the histidine at position 174 of hRCE was changed to an alanine (pABl ⁇ H174A) ; a strain transformed with a pAB13-derived plasmid in which the histidine at position 175 of hRCE was changed to an alanine (pAB17 H175A) .
  • Halos are present when hRCE is expressed in yeast indicating that the human protein is functional in yeast (see Example 3) .
  • Oligonucleotides or "primers” were all purchased from Operon Technologies (Alameda, CA) and consisted of the following sequences: hRCE(lR)-RI (SEQ ID NO: 7) CGG AAT TCT CAA TAA TAT GGT GAA AAT GGG C; hRCE(2R)-RI (SEQ ID NO: 8) CGG AAT TCG GAT GAA GAG GAA AGC AGT G; hRCE(3F)-RI (SEQ ID NO: 9) GCG AAT TCT TCG GTG CCT ACA CTG CTT TC; hRCE(4F)-RI (SEQ ID NO: 10) GCG AAT TCC CAT TTT CAC CAT ATT ATT GAG C; hRCE(5R) (SEQ ID NO: 11) CTA ACA TGG GCA GCA TAC AGG; hRCE(6R) (SEQ ID NO: 12) CTG GTG AAG CGT CGC TTG A
  • hSTE14 IF) -RI (SEQ ID NO:33) GCG AAT TCG AGA ACA TCT TCT GGC CAG, hSTE14 (2R) -RI (SEQ ID NO: 34) GCG AAT TCT GGC CAG AAG ATG TTC TCG; hSTE14(3F) (SEQ ID NO:35) CTA CCA GAT AGC CAT CAG AGC; hSTE14 (4R) (SEQ ID NO:36) GCT CTG ATG GCT ATC TGG TAG.
  • nucleic acid, polynucleotide e.g., an RNA, DNA or a mixed polymer
  • polypeptide is one which is substantially separated from other cellular components that naturally accompany the native sequence or protein in its natural host cell, e.g., ribosomes, polymerases, many other mammalian genome sequences and proteins.
  • the term embraces a nucleic acid sequence or polypeptide that has been removed from its naturally occurring environment, and includes recombinant or cloned DNA isolates and chemically synthesized analogs or analogs that are biologically synthesized by heterologous systems.
  • nucleic acids or “polynucleotides” is also extended to refer to nucleic acids that encode a RCEl or AFCl polypeptide, polypeptide fragment, homologs and variants of those polypeptides, and protein fusions and deletions of any of the above. These nucleic acids comprise a sequence which is either derived from, or substantially similar to a natural CAAX protease-encoding gene or one having substantial homology with a natural CAAX protease-encoding gene or a portion thereof.
  • the polynucleotide compositions of this invention may include both sense and antisense strands of RNA, cDNA, genomic DNA, and synthetic forms and mixed polymers of the above.
  • nucleotide bases may be modified chemically or biochemically or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., polypeptides), intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids, etc.) Also included are synthetic molecules that mimic polynucleotides in their ability to bind to a designated sequence via hydrogen bonding and
  • nucleic acid or fragment thereof indicates that, when optimally aligned with appropriate nucleotide insertions or deletions with another nucleic acid (or its complementary strand) , there is nucleotide sequence identity in at least about 60% of the nucleotide bases, usually at least about 70%, more usually at least about 80%, preferably at least about 90%, and more preferably at least about 95-98% of the nucleotide bases.
  • nucleic acid or fragment thereof hybridizes to another nucleic acid, to a strand of another nucleic acid, or to the complementary strand thereof, under selective hybridization conditions.
  • selective hybridization will occur when there is at least about 55% homology — preferably at least about 65%, more preferably at least about 75%, and most preferably at least about 90% — over a stretch of at least about 14 nucleotides. See, e.g., Kanehisa, 1984.
  • the length of homology comparison, as described, may be over a stretch of at least about nine nucleotides, usually at least about 20 nucleotides, more usually at least about 24 nucleotides, typically at least about 28 nucleotides, more typically at least about 32 nucleotides, and preferably at least about 36 or more nucleotides.
  • Nucleic acid hybridization will be affected by such conditions as salt concentration, temperature, solvents, the base composition of the hybridizing species, length of the complementary regions, and the number of nucleotide base mismatches between the hybridizing nucleic acids, as will be readily appreciated by those skilled in the art .
  • Stringent hybridization and “stringent hybridization wash conditions” in the context of nucleic acid hybridization experiments differ depending on different physical parameters such as temperature, the base composition and concentration of the reactants and the salt concentration.
  • Stringent temperature conditions will generally include temperatures in excess of 30°C, typically in excess of 37°C, and preferably in excess of 45°C.
  • Stringent salt conditions will ordinarily be less than 1000 mM, typically less than 500 mM, and preferably less than 200 mM. However, the combination of parameters is much more important than the measure of any single parameter. See, e.g., Wetmur and Davidson, 1968. Probe sequences may also hybridize specifically to duplex DNA under certain conditions to form triplex or other higher order DNA complexes.
  • stringent hybridization and wash conditions are selected to be about 5°C lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength and pH.
  • T m is the temperature at which 50% of the target sequence hybridizes to a perfectly matched probe.
  • An example of stringent hybridization conditions for hybridization of complementary nucleic acid sequences having more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin per 50mL at 42°C for at least ten hours.
  • An example of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook et al .
  • nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical to those which do hybridize under stringent conditions if the polypeptides which they encode are substantially identical. This occurs, for example, when a nucleic acid is created synthetically or recombinantly using a high codon degeneracy as permitted by the redundancy of the genetic code.
  • nucleic acid or polypeptide sequences refers to the residues in the two sequences which are the same when aligned for maximum correspondence.
  • sequence identity When percentage of sequence identity is used in reference to proteins or peptides, it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions -- where amino acid residues are substituted for other amino acid residues having side chains with similar chemical properties (e.g., charge or hydrophobicity) which do not substantially change the functional properties of the molecule. Where sequences differ in conservative substitutions the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Means for making this adjustment are well known to those of skill in the art (see, e.g., Pearson et al., Methods in Molecular Biolo ⁇ y, 24, pp. 307-31 (1994)).
  • substantially homology when referring to polypeptides, indicate that the polypeptide or protein in question exhibits at least about 30% identity with an entire naturally-occurring protein or a portion thereof, usually at least about 70% identity, and preferably at least about 95% identity.
  • substantially similar function refers to the function of a modified nucleic acid or a modified polypeptide (or protein) with reference to its wild-type nucleic acid or polypeptide counterpart.
  • CAAX protease polypeptide will be substantially homologous to the wild-type CAAX protease polypeptide and will have substantially the same function.
  • the modified polypeptide may have an altered amino acid sequence and/or may contain modified amino acids.
  • a modified CAAX protease polypeptide may have other useful properties, such as an altered (e.g., longer) half-life.
  • the similarity of function (activity) of the modified polypeptide may be substantially the same as the activity of the wild-type CAAX protease polypeptide.
  • the similarity of function (activity) of the modified polypeptide may differ from the activity of the wild-type polypeptide in, for example, substrate specificity and/or enzymatic activity.
  • the modified polypeptide is synthesized using conventional techniques, or is encoded by a modified nucleic acid and produced using conventional techniques.
  • the modified nucleic acid is prepared by conventional techniques. A nucleic acid with a function substantially similar to the wild-type
  • RCEl or AFCl gene function produces the modified protein as described above.
  • sequence analysis software See, e.g., the Sequence Analysis Software Package of the Genetics
  • polypeptide sequences compared for homology will generally be at least about 16 amino acids, usually at least about 20 residues, more usually at least about 24 residues, typically at least about 28 residues, and preferably more than about 35 residues.
  • Constantly modified variations of a particular nucleic acid sequence refers to nucleic acids that encode identical or essentially identical amino acid sequences or DNA sequences where no amino acid sequence is encoded, because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given polypeptide sequence.
  • nucleic acid sequence When a nucleic acid sequence is changed at one or more positions with no corresponding change in the amino acid sequence which it encodes, that mutation is called a "silent mutation".
  • a silent mutation When a nucleic acid sequence is changed at one or more positions with no corresponding change in the amino acid sequence which it encodes, that mutation is called a "silent mutation".
  • every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent mutation or variation.
  • individual substitutions, deletions, additions and the like which alter, add or delete a single amino acid or a small percentage of amino acids (less than 5%, more typically less than 1%) in an encoded sequence are "conservatively modified variations" where the alterations result in the substitution of one amino acid with a chemically similar amino acid.
  • Conservative substitution tables providing functionally similar amino acids are well known in the art. The following six groups each contain amino acids that are conservative substitutions for one another:
  • RCEl (or AFCl) protein or "RCEl (or AFCl) polypeptide” refer to a protein or polypeptide encoded by the RCEl (or AFCl ) locus, and variants or fragments thereof.
  • polypeptide refers to a polymer of amino acids and its equivalent and does not refer to a specific length of the product; thus, peptides, oligopeptides and proteins are included within the definition of a polypeptide. This term also does not exclude modifications of the polypeptide, for example, glycosylations, acetylations, phosphorylations, and the like.
  • polypeptides containing one or more analogs of an amino acid including, e.g., unnatural amino acids
  • polypeptides with substituted linkages as well as other modifications known in the art, both naturally and non-naturally occurring.
  • polypeptides will be at least about 50% homologous to the native CAAX protease sequences, preferably in excess of about 90%, and more preferably at least about 95% homologous.
  • proteins encoded by DNA which hybridize under high or low stringency conditions, to CAAX protease-encoding nucleic acids and closely related polypeptides or proteins retrieved by antisera raised against such protein (s).
  • a polypeptide "fragment”, “portion” or “segment” refers to a stretch of amino acid residues of at least about five to seven contiguous amino acids, often at least about seven to nine contiguous amino acids, typically at least about nine to 13 contiguous amino acids and, most preferably, at least about 20 to 30 or more contiguous amino acids .
  • protein modifications refers to RCEl (or AFCl) polypeptides or fragments thereof that are substantially homologous in primary structural sequence but which include, e.g., in, vivo or in vitro chemical and biochemical modifications or which incorporate unusual amino acids.
  • Such modifications include, for example, acetylation, carboxylation, phosphorylation, glycosylation, ubiquitination, labeling, e.g., with radionuclides, and various enzymatic modifications, as will be readily appreciated by those well skilled in the art.
  • labeling e.g., with radionuclides, and various enzymatic modifications, as will be readily appreciated by those well skilled in the art.
  • a variety of methods for labeling polypeptides and of substituents or labels useful for such purposes are well known in the art, and include radioactive isotopes
  • ligands which bind to labeled antiligands e.g., antibodies
  • labeled antiligands e.g., antibodies
  • fluorophores e.g., fluorophores
  • chemiluminescent agents e.g., enzymes
  • antiligands which can serve as specific binding pair members for a labeled ligand.
  • the choice of label depends on the sensitivity required, ease of conjugation with the primer, stability requirements, and available instrumentation. Methods for labeling polypeptides are well known in the art. See Sambrook et al., 1989 or Ausubel et al., 1992.
  • fusion protein refers to fusion polypeptides comprising RCEl or AFCl polypeptides and fragments.
  • Homologous polypeptide fusions may be between two or more polypeptide sequences or between sequences derived from RCElp or AFClp and a related protein.
  • Heterologous fusions may be constructed which would exhibit a combination of properties or activities of the derivative polypeptides. For example, active site or other domains may be "swapped" between CAAX protease polypeptides and other polypeptides or fragments.
  • Such homologous or heterologous fusion polypeptides may display altered biological characteristics (e.g., altered enzymatic activity, substrate specificity and/or altered cellular localization) .
  • isolated is substantially pure when at least about 60 to 75% of a sample exhibits a single polypeptide sequence.
  • a substantially pure protein will typically comprise about 60 to 90% W/W of a protein sample, more usually about 95%, and preferably will be over 99% pure.
  • Protein purity or homogeneity may be indicated by a number of means well known in the art, such as polyacrylamide gel electrophoresis of a protein sample, followed by visualizing a single polypeptide band upon staining the gel with a stain well known in the art. For certain purposes, higher resolution may be provided by using HPLC or other means well known in the art for purification.
  • a RCEl or AFCl protein is substantially free of naturally associated components when it is separated from the native contaminants that accompany it in its natural state.
  • a polypeptide that is chemically synthesized or synthesized in a cellular system different from the cell from which it naturally originates will be substantially free from its naturally associated components.
  • a protein may also be rendered substantially free of naturally associated components by isolation, using protein purification techniques well known in the art.
  • the present invention provides for biologically active fragments of the polypeptides.
  • Significant biological activities include CAAX substrate binding, protease activity, immunological activity or other (presently unidentified) biological activities characteristic of the CAAX protease polypeptides of this invention.
  • Immunological activities include both immunogenic function in a target immune system, as well as sharing of immunological epitopes for binding, serving as either a competitor or substitute antigen for an epitope of the RCEl or AFCl protein.
  • Expression of recombinant DNA molecules according to this invention may involve post-translational modification of a resultant polypeptide by the host cell.
  • expression might include, among other things, glycosylation, lipidation or phosphorylation of a polypeptide, or cleavage of a signal sequence to produce a "mature" protein.
  • polypeptide expression products of this invention encompass full-length polypeptides and modifications or derivatives thereof, such as glycosylated versions of such polypeptides, mature proteins, polypeptides retaining a signal peptide, truncated polypeptides having comparable biological activity, and the like.
  • epitope refers to an antigenic determinant of a polypeptide, i.e., a region of a polypeptide that provokes an immunological response in a host. This region needs not comprise consecutive amino acids.
  • epitope is also known in the art as "antigenic determinant”.
  • An epitope may comprise as few as three amino acids in a spatial conformation which is unique to the immune system of the host. Generally, an epitope consists of at least five such amino acids, and more usually consists of at least 8-10 such amino acids. Methods for determining the spatial conformation of such amino acids are known in the art.
  • antibody refers to a polypeptide encoded by an immunoglobulin gene, genes, or fragments thereof.
  • the immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant regions, as well as a myriad of immunoglobulin variable regions.
  • Light chains are classified as either kappa or lambda.
  • Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes IgG, IgM, IgA, IgD and IgE, respectively.
  • Antibodies exist for example, as intact immunoglobulins or as a number of well-characterized fragments produced by digestion with various peptidases.
  • trypsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)' 2 , a dimer of Fab which itself is a light chain joined to a V H -C H 1 by a disulfide bond.
  • the F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the hinge region thereby converting the F(ab)'2 dimer to a Fab' monomer.
  • the Fab' monomer is essentially an Fab with part of the hinge region. See Fundamental Immunology, Third Edition, W.E. Paul, ed. Raven Press, N.Y. (1993) (incorporated herein by reference) for a detailed description of antibodies and antibody fragments.
  • antibody includes antibody fragments produced by the modification of whole antibodies or those synthesized de novo.
  • the antibodies of the present invention are optionally derived from libraries of recombinant antibodies in phage or similar vectors (see, e.g., Huse et al., Science, 246, pp. 1275-81 (1989); Ward et al.,
  • the human EST NCBI#482520 (Genbank Accession # N76181) showed 45 percent sequence identity with the product of the AFCl gene (Afclp) over approximately 80 amino acids ( Figure IB) .
  • the murine EST NCBI#636296 (Genbank Accession # AA022288) sequence showed 36 percent identity with the product of the STE14 gene (Stel4p) over 67 amino acids.
  • ⁇ gtll cDNA library made from a 9-week old human fetus.
  • the library (ATCC #77433) was purchased from the American Type Culture Collection (see Example 1) . Briefly, phage were plated onto an E. coli host strain, lifted onto nylon membranes and hybridized under stringent conditions to radio-labeled DNA sequences which were isolated from the human EST (ID#358628) (see above) . Two phage plaques, 13R and 27R, showed a strong hybridization signal to the human EST
  • Phage stocks of the ⁇ gtll clones 13R and 27R were prepared and titered, and DNA was prepared from these two clones (Example 1) .
  • the ⁇ gtll clone 27R contains a human cDNA insert of approximately 1.4 kB.
  • An E. coli strain (LE392) infected with a purified phage stock of ⁇ gtll clone 27R was deposited as strain designation "27R- ⁇ gtll" on May 7, 1997 with the American Type Culture Collection (ATCC) (Rockville, MD) according to the provisions of the Budapest Treaty, and was assigned the ATCC accession number designated: ATCC #209020.
  • ATCC American Type Culture Collection
  • the continuous cDNA and genomic sequences are shown in Figures 2B and 2C.
  • the mouse RCE ⁇ mRCE) cDNA consensus also revealed the presence of a continuous ORF that extended from the 5' terminus to nucleotide 1004 (SEQ ID NO:3; SEQ ID NO:5) (see Figure 2B) .
  • the first ATG codon encountered within the mouse ORF is underlined in Figure 2C.
  • the predicted protein sequence of mRCE (SEQ ID NO: 4; SEQ ID NO: 6) is shown in Figure 2B.
  • the first ATG codon encountered within the human hRCE ORF (SEQ ID NO:l) is well into the coding sequence and is located beyond a region of sequence identity with the yeast Rcelp sequence (see Figure 3) suggesting that translation begins at an upstream site.
  • the 5' terminal "G" residue of the hRCE cDNA sequence (SEQ ID NO:l) (see Figure 2A) , which was fused to linker sequences during the construction of the cDNA clone, may be preceded by an "AT" dinucleotide, forming an in-frame ATG codon for methionine.
  • the corresponding G residue in the mouse mRCE sequence forms an ATG codon which was revealed upon obtaining additional genomic sequence information.
  • ATG codon in mRCE is a functional translational start site, it is likely that the 5' G residue in the human hRCE sequence shown in Figure 2A is also preceded by an AT dinucleotide and the hRCE ORF begins with a methionine residue.
  • translational initiation may begin at a non-ATG codon.
  • GTG codons There are two in-frame GTG codons at positions 116-118 and 122-124 (SEQ ID NO:l) which are known to function as translational start sites on eukaryotic mRNAs . Initiation at either of these codons would encode a protein of very similar size and composition as the yeast counterpart.
  • the Northern blots revealed the presence of a single abundant mRNA species approximately 1400 base-pairs in length.
  • the human hRCE ORF was cloned into one of two expression vectors (pACAl' or pACA5') under the control of the yeast MEV1 promoter and an ATG translational initiation codon, or the yeast ERG12 promoter and the first four codons of the ERG12 coding sequence, respectively.
  • the resulting plasmids, pACA7001 (also referred to as pAB14) and pACA7002 (also referred to as pAB13) represent translational fusions between the MEV1 ORF and the ERG12 ORF, respectively.
  • Both expression vectors contained the yeast PGK1 transcriptional termination sequence.
  • E. coli strains transformed with either pAB13 or pAB14 were deposited as strain designations " " and " ", respectively, on June , 1998 with the American Type Culture Collection (ATCC) (Rockville, MD) according to the provisions of the Budapest Treaty, and were assigned the ATCC accession number designated: ATCC # and ATCC # . All restrictions on the availability to the public of the above ATCC deposits will be irrevocably removed upon the granting of a patent on this application.
  • ATCC American Type Culture Collection
  • the yeast a.-mating factor is a CAAX-containing protein and must undergo C-terminal proteolysis in order to secrete biologically active pheromone.
  • pACA7001 and pACA7002 were functional in yeast by their ability to restore production of a-factor in strain JRY5317 deleted for the two endogenous CAAX proteases. Plasmids pACA7001 and pACA7002 were each transformed into the yeast strain JRY5317, an a.-mating type strain in which both CAAX protease genes have been deleted (Boyartchuk et al., supra) (Example 3) .
  • EST database revealed the presence of several ESTs with significant sequence identity.
  • Genbank Accession numbers for the ESTs are: AA359058, H02497, AA210930, AA295448, Z43273, N76181, T35312, F11310, R54272, AA01886, AA134125, Z45538, H15202, and N30505.
  • the sequences were aligned and found to contain a continuous ORF from the 5' end to nucleotide 721 (SEQ ID NO:37) (see Figure 7).
  • the predicted amino acid sequence (SEQ ID NO: 38) shares extensive sequence identity with the yeast Afclp protein, including the proposed active site sequence HExxH
  • yeast Afclp has a zinc metalloprotease consensus sequence (VxxHExxH) (SEQ ID NO: 50) located at positions 294-301.
  • Afclp in all respects appears to be a zinc-dependent metalloprotease based both upon amino acid sequence analysis as well as enzymatic sensitivity to the zinc chelator, o-phenanthroline (Boyartchuk et al., supra) . Reel has no similar sensitivity to o- phenanthroline nor to the chelator EDTA.
  • the mammalian CAAX protease is either a serine or cysteine protease based upon its sensitivity to a farnesylated peptide aldehyde (Ma, Gilbert and Rando, Biochemistry. 32, pp. 2386-2393 (1993) ) and to the thiol modifying reagents PCMB and NEM (Hrycyna and Clarke, J. Biol. Chem., 267, pp. 10457-464 (1992) ; Ma, Chaudhuri and Rando, Biochemistry, 31, pp. 11772-777 (1992); Akopyan et al., BBRC.198 , pp. 787-794 (1994) ) .
  • IDE1 is a zinc-dependent thiol protease that is a member of a family of such proteases that includes the E. coli protease III (Affholter, J.A., Fried, V.A., Roth, R.A., Science. 242, pp.
  • hydroxamic acid derivatives have found significant utility for the inhibition of zinc proteases such as the matrix etalloproteinase (Levy, D.E. et al., J. Med. Chem. , 41, pp. 199-223 (1998) ) and angiotensin-converting enzyme (Parvathy, S. et al., Biochem. J., 327, pp. 37-43 (1997)).
  • Hydroxamic acid confers it inhibitory effect on zinc metalloproteases through its metal chelating properties.
  • Possible hydroxamic acid hRcelp inhibitors would thus include a prenylated cysteine amino acid linked to a hydroxa ate group at it carboxy terminus.
  • the amino group of the inhibitor would be preferably linked to a small aliphatic group such as methyl, or more preferably to acetyl, or linked to a small number of amino acids, or left unmodified.
  • the prenyl group would be linked to the cysteine residue through the sulfur group of the cysteine and would be composed of a trans, trans, trans- geranylgeranyl lipid, or more preferably, a trans, trans- farnesyl lipid.
  • cells harboring the wild- type hRCE expression plasmid produced significant amounts of biologically active a.-factor producing a visible -halo (pAB13; also referred to as pACA7002 in Figure 6) .
  • cells harboring mutant hRCE expression plasmids in which the histidines at position 172, 174 or 175 were individually replaced by alanine produced significantly less biologically active a-factor, as evidenced by the reduced (pAB16; H174A and pAB17; H175A) or undetectable (pAB15; H172A) halos ( Figure 10) .
  • the present invention provides nucleic acids and recombinant DNA vectors which comprise mammalian RCE and AFC DNA sequences. More specifically, vectors comprising all or portions of the cDNA sequences of the human and mouse RCE and AFC functional homologs described above are provided. The vectors of this invention also include those comprising DNA sequences which hybridize under stringent conditions to the human and mouse RCE and AFC cDNA sequences, and conservatively modified variations thereof.
  • the nucleic acids of this invention include single stranded and double-stranded genomic DNA, cDNA, RNA, or hybrids thereof and may be isolated from biological sources or synthesized chemically or by recombinant DNA methodology.
  • the nucleic acids, recombinant DNA molecules and vectors of this invention may be present in transformed or transfected cells, cell lysates, or in partially purified or substantially pure forms .
  • DNA sequences may be expressed by operatively linking them to an expression control sequence in an appropriate expression vector and employing that expression vector to transform an appropriate unicellular host.
  • operative linking of a DNA sequence of this invention to an expression control sequence includes, if not already part of the DNA sequence, the provision of a translation initiation codon, ATG, in the correct reading frame upstream of the DNA sequence.
  • a wide variety of host/expression vector combinations may be employed in expressing the DNA sequences of this invention.
  • Useful expression vectors may consist of segments of chromosomal, non- chromosomal and synthetic DNA sequences.
  • DNA vector design for transfection into mammalian cells should include appropriate sequences to promote expression of the gene of interest, including: appropriate transcription initiation, termination and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence); sequences that enhance protein stability; and when desired, sequences that enhance protein secretion.
  • appropriate transcription initiation, termination and enhancer sequences such as splicing and polyadenylation signals
  • sequences that stabilize cytoplasmic mRNA sequences that enhance translation efficiency (i.e., Kozak consensus sequence)
  • sequences that enhance protein stability i.e., Kozak consensus sequence
  • sequences that enhance protein secretion i.e., Kozak consensus sequence
  • DNA vectors also include a marker gene and means for amplifying the copy number of the gene of interest.
  • DNA vectors may also comprise stabilizing sequences (e.g., ori- or ARS-like sequences and telomere- like sequences) , or may alternatively be designed to favor directed or non-directed integration into the host cell genome .
  • stabilizing sequences e.g., ori- or ARS-like sequences and telomere- like sequences
  • Useful expression vectors for bacterial hosts include bacterial plasmids, such as those from E. coli , including pBluescript, pGEX-2T, pUC vectors, col El, pCRl, pBR322, pMB9 and their derivatives, wider host range plasmids, such as RP4, phage DNAs, e.g., the numerous derivatives of phage lambda, e.g., NM989, ⁇ GTIO and ⁇ GTll, and other phages, e.g., M13 and Filêtous single stranded phage DNA.
  • Useful expression vectors for yeast cells include the 2 ⁇ plasmid and derivatives thereof.
  • Useful vectors for insect cells include pVL 941.
  • any of a wide variety of expression control sequences sequences that control the expression of a DNA sequence when operatively linked to it — may be used in these vectors to express the DNA sequences of this invention.
  • Such useful expression control sequences include the expression control sequences associated with structural genes of the foregoing expression vectors.
  • useful expression control sequences include, for example, the early and late promoters of SV40 or adenovirus, the lac system, the trp system, the TAC or TRC system, the T3 and T7 promoters, the major operator and promoter regions of phage lambda, the control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast a- mating system, the GAL1 or GAL10 promoters, and other constitutive and inducible promoter sequences known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof.
  • the early and late promoters of SV40 or adenovirus the lac system, the trp system, the TAC or TRC system, the T3 and T7 promoters, the major operator and promoter regions of phage lambda,
  • expression control sequences typically include a promoter, an enhancer derived from immunoglobulin genes, SV40, cytomegalovirus, etc., and a polyadenylation sequence which may include splice donor and acceptor sites.
  • DNA sequences of this invention are cloned in the expression vector lambda ZAP II (Stratagene, La Jolla, CA) , in which expression from the lac promoter may be induced by IPTG.
  • DNA sequences of this invention are inserted in frame into an expression vector that allows high level expression of the corresponding polypeptide as a fusion protein.
  • fusion proteins include but are not limited to fusions to reporter genes, such as the green fluorescent protein, lacZ and luciferase, and fusions to epitope tags such as myc, hemagglutinin (HA) or other related short immunogenic sequences.
  • the DNA sequences of this invention may be linked by the acid labile cleavage site (Asp-Pro) to a leader sequence (such as the "MLE leader”) suitable for promoting expression in E. coli .
  • Full-length and truncated forms of the DNA sequences of this invention may be prepared as fusion constructs.
  • a wide variety of unicellular host cells are useful in expressing the DNA sequences of this invention. These hosts may include well known eukaryotic and prokaryotic hosts, such as strains of E. coli , Pseudomonas, Bacill us, Strepto yces, fungi, yeast, insect cells such as Spodoptera frugiperda (SF9) , animal cells such as CHO and mouse cells, African green monkey cells such as COS 1, COS 7, BSC 1, BSC 40, and BMT 10, and human cells, as well as plant cells in tissue culture.
  • eukaryotic and prokaryotic hosts such as strains of E. coli , Pseudomonas, Bacill us, Strepto yces, fungi, yeast, insect cells such as Spodoptera frugiperda (SF9) , animal cells such as CHO and mouse cells, African green monkey cells such as
  • Unicellular hosts should be selected by consideration of their compatibility with the chosen vector, the toxicity of the product coded for by the DNA sequences of this invention, their secretion characteristics, their ability to fold the polypeptide correctly, their fermentation or culture requirements, and the ease of purification from them of the products coded for by the DNA sequences of this invention.
  • DNA cloning and sequencing methods are well known to those of skill in the art and are described in an assortment of laboratory manuals, including Sambrook et al, supra, 1989; and Current Protocols in Molecular Biology, F.M. Ausubel et al., eds . , Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (1994 Supplement) .
  • Product information from manufacturers of biological, chemical and immunological reagents also provide useful information.
  • the recombinant DNA molecules and more particularly, the expression vectors of this invention may be used to express recombinant RCE and AFC polypeptides in a heterologous host cell.
  • the RCE and AFC polypeptides of this invention may be full-length or less than full-length recombinant polypeptide fragments recombinantly expressed from the DNA sequences according to this invention.
  • Such polypeptides include variants, fragments and uteins having biological activity.
  • the polypeptides of this invention may be soluble, or may be engineered to be membrane- or substrate-bound using techniques (such as cross-linking) well known in the art.
  • polypeptides of this invention may be fused to other molecules, such as genetic, enzymatic or chemical or immunological markers such as epitope tags.
  • Fusion partners include, inter alia, GST, immunoglobulins, bacterial beta-galactosidase, trpE, protein A, ⁇ -lactamase, amylase, maltose binding protein, alcohol dehydrogenase, polyhistidine (for example, six histidine at the amino and/or carboxyl terminus of the polypeptide) , green fluorescent protein (GFP) , yeast ⁇ mating factor, GAL4 transcription activation or DNA binding domain, and luciferase. See, e.g., Godowski et al . , 1988.
  • Fusion proteins will typically be made by either recombinant nucleic acid methods, as described above, or may be chemically synthesized. Techniques for the synthesis of polypeptides are described, for example, in Merrifield, 1963.
  • Transformation and other methods of introducing nucleic acids into a host cell can be accomplished by a variety of methods which are well known in the art.
  • Animal (and yeast) cells may be transfected with a plasmid, a cosmid or the like, infected with a viral vector of this invention for transient or stable expression of RCE or AFC polypeptides.
  • a viral vector of this invention for transient expression in animal cells, investigators have used spheroplast fusion, DEAE dextran, and electroporation.
  • polypeptide gene products of the mammalian RCE or AFC genes of this invention may be used to produce polyclonal or monoclonal antibodies using a variety of techniques well known to those of skill in the art.
  • peptides corresponding to specific regions of the RCE or AFC polypeptides may be synthesized and used to create immunological reagents according to well known methods .
  • Antibodies directed against the mammalian RCE and AFC polypeptides of this invention are immunoglobulin molecules or portions thereof that are immunologically reactive with at least one novel mammalian RCE or AFC polypeptide of the present invention. It should be understood that the antibodies of this invention include antibodies immunologically reactive with fusion proteins and multimeric proteins comprising a novel mammalian RCE or AFC polypeptide.
  • Antibodies directed against a novel mammalian RCE or AFC polypeptide may be generated by immunization of a mammalian host. Such antibodies may be polyclonal or monoclonal. Preferably they are monoclonal. Methods to produce polyclonal and monoclonal antibodies are well known to those of skill in the art. For a review of such methods, see Antibodies, A Laboratory .Manual , supra, and
  • Determination of immunoreactivity with a novel mammalian RCE or AFC polypeptide of this invention may be made by any of several methods well known in the art, including by immunoblot assay and ELISA.
  • M or preferably 10 to 10 M or stronger are typically made by standard procedures as described, e.g., in Harlow and Lane, 1988 or Goding, 1986. Briefly, appropriate animals are selected and the desired immunization protocol followed. After the appropriate period of time, the spleens of such animals are excised and individual spleen cells fused, typically, to immortalized myeloma cells under appropriate selection conditions. Thereafter, the cells are clonally separated and the supernatants of each clone tested for their production of an appropriate antibody specific for the desired region of the antigen.
  • Suitable techniques involve in vitro exposure of lymphocytes to the antigenic polypeptides, or alternatively, to selection of libraries of antibodies in phage or similar vectors. See Huse et al., 1989.
  • the polypeptides and antibodies of the present invention may be used with or without modification. Frequently, polypeptides and antibodies will be labeled by joining, either covalently or non-covalently, a substance which provides for a detectable signal.
  • labels and conjugation techniques are known and are reported extensively in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent agents, chemiluminescent agents, magnetic particles and the like. Patents teaching the use of such labels include U.S.
  • 7 ⁇ n antibody of this invention may also be a hybrid molecule formed from immunoglobulin sequences from different species (e.g., mouse and human) or from portions of immunoglobulin light and heavy chain sequences from the same species. It may be a molecule that has multiple binding specificities, such as a bifunctional antibody prepared by any one of a number of techniques known to those of skill in the art including: the production of hybrid hybridomas; disulfide exchange; chemical cross- linking; addition of peptide linkers between two monoclonal antibodies; the introduction of two sets of immunoglobulin heavy and light chains into a particular cell line; and so forth.
  • the antibodies of this invention may also be human monoclonal antibodies, for example those produced by immortalized human cells, by SCID-hu mice or other non- human animals capable of producing "human” antibodies, or by the expression of cloned human immunoglobulin genes.
  • human monoclonal antibodies for example those produced by immortalized human cells, by SCID-hu mice or other non- human animals capable of producing "human” antibodies, or by the expression of cloned human immunoglobulin genes.
  • This invention is directed to methods for identifying inhibitors of mammalian CAAX processing enzymes by in vitro enzyme inhibition assays using the polypeptide expression products of the mammalian RCE and AFC sequences.
  • this invention provides a heterologous host cell or organism comprising a recombinant DNA molecule (vector) capable of expressing a mammalian gene encoding Rcelp or Afclp.
  • a host cell that overexpresses Rcelp or Afclp is also provided.
  • One preferred host organism is a yeast such as Saccharomyces transformed with a high copy plasmid encoding a RCE or AFC gene.
  • the resulting recombinant organism represents an enriched source for the corresponding mammalian enzyme. Inclusion of such an extract in an in vitro CAAX-protease assay allows the identification of inhibitors that can reduce or eliminate the mammalian CAAX protease activity.
  • One method for isolating a protein inhibitor (or an activator) of a mammalian CAAX protease is the "two- hybrid system” (see, e.g., CT. Chien et al., "The two-hybrid system: a method to identify and clone genes for proteins that interact with a protein of interest", Proc. Natl. Acad. Sci. U.S.A., 88(21), pp. 9578-82 (1991)) (incorporated herein by reference) (Example 6) .
  • This invention is also directed to methods for identifying inhibitors of mammalian CAAX processing enzymes by screening host cells transformed with the DNA sequences of this invention in in vivo biological activity assays, including but not limited to those described herein.
  • One preferred in vivo method to identify inhibitors of Rcelp or Afclp first requires identifying genes whose expression requires the presence of the appropriate protease.
  • a gene expression reporter construct may be created and introduced into cells. Such a cell line or strain will allow indirect monitoring of protease function. Incubation of these cells with select candidate protease inhibitors or a combination of candidate inhibitors such as a chemical library facilitates identification of chemicals that alter protease activity in vivo.
  • Another preferred in vivo method to identify inhibitors of Rcelp or Afclp capitalizes on the extreme sensitivity of co-culture experiments to identify differences in the relative growth rates (or fitness) of organisms.
  • two strains of cells which may possess only a subtle difference are mixed and grown for a number of generations.
  • portions of the co-culture are analyzed to determine the relative proportion of each constituent strain.
  • Very small differences in growth rates between two strains will result in the more fit strain becoming the predominant, if not the exclusive, strain in the co- culture after a number of generations.
  • Co-cultures may be grown and the cells diluted into fresh medium to continually increase the number of cell generations in culture, enabling even very subtle differences in growth rates to be measured.
  • Such co-culture experiments may be used to identify compounds that are competitive inhibitors of Rcelp or Afclp (Example 8) .
  • Competitive inhibition of an enzyme can be overcome by increasing the amount of enzyme. Therefore, a strain that harbors a significant amount of Rcelp (or Afclp) will be less sensitive to the effects of a competitive inhibitor than will be a strain with a normal amount of Rcelp (or Afclp) .
  • a yeast strain is constructed (Strain 1) that harbors a high-copy (2 ⁇ ) plasmid that encodes the RCEl gene.
  • the RCEl gene is driven by a very strong promoter, such as that of the ADHl gene.
  • This RCEl overexpressing strain also possesses, preferably (but not necessarily) on the same high-copy plasmid, a marker gene the presence of which (i.e., the gene or gene product) can be easily detected, such as the green fluorescent protein (GFP) .
  • a second strain (Strain 2) is required for the co-culture experiment and is preferably Strain 1 without the high-copy plasmid carrying the RCEl gene (and optionally comprising a marker gene) .
  • Strain 1 and Strain 2 are grown independently to mid-log phase and then mixed at a selected ratio to create an appropriate co-culture.
  • the mixture will contain a large excess of Strain 2 relative to Strain 1 (e.g., 5,000:1; see Example 8). Because Strain 1 (which harbors the multi-copy plasmid encoding RCEl and GFP) is present in such low abundance, the overall mixture will not exhibit significant fluorescence. During growth of the co-culture, if a situation were encountered where Strain 1 has a growth advantage over Strain 2, the relative abundance of Strain 1 would increase and thereby cause an easily identified increase in fluorescence of the mixture.
  • the co-cultures are aliquoted into separate growing chambers (e.g., the wells of 96-well plates).
  • a selected concentration or range of concentrations from selected putative inhibitors e.g., portions of a chemical library
  • the co-cultures containing added test chemicals are incubated until the cells have divided numerous times. At various points during the growth period, the relative abundance of Strain 1 in the mixture may be assessed by fluorescence emission from the 96-well plates.
  • the cells are diluted into fresh medium and allowed to continue to grow.
  • This regimen of growth followed by dilution into fresh medium may be repeated until the cells have experienced multiple generation doublings (e.g., 2-20).
  • the cells undergo 10 generation doublings. More preferably, the cells undergo 5 generation doublings.
  • the preferred number of generation doublings of the co-culture may be determined as described in Example 8.
  • the concentrations and ratios of Strains 1 and 2 in the beginning co-culture are selected to avoid the need to dilute the cultures, which involves more work and time.
  • Strain 1 and Strain 2 do not express (e.g., are deleted for) the endogenous RAS1 gene and possess the ras2-23 allele in place of RAS2.
  • the ras2-23 allele for example, is temperature sensitive and does not adequately function at 37°C to confer viability to a rasl : :HIS3 strain (H. Mitsuzawa et al., Genetics, 123, pp. 739-748 (1989)).
  • a semi-permissive temperature (34°C) growth of this strain depends upon the function of Rcelp (V.L. Boyartchuk et al., Science, 275, pp. 1796-1800 (1997)). Therefore, the presence of the ras2-23 allele, in the absence of RAS1, sensitizes Strain 1 and Strain 2 such that their growth rates depend upon the function of Rcelp.
  • hRCE SEQ ID NO:l
  • mRCE SEQ ID NO: 3 or SEQ ID NO: 5
  • hAFC SEQ ID NO: 37
  • the sequences comprising hRCE, mRCE or hAFC may also be used to construct fusion proteins in order to address the localization of each of these proteins in vivo.
  • Such fusion proteins may additionally contain a reporter gene, such as the green fluorescent protein, lacZ or luciferase.
  • a reporter gene such as the green fluorescent protein, lacZ or luciferase.
  • sequences may be fused to an epitope tag, such as myc, hemagglutinin (HA) or any other related short sequences.
  • HA hemagglutinin
  • RCE and AFC sequences may be used to construct vectors that express protein products having biological activity (e.g., CAAX processing) in a heterologous organism, preferably, in a unicellular organism.
  • Expression products may be purified by conventional means which are well known to those of skill in the art.
  • Mammalian RCE and AFC DNA sequences or their expression products may be purified and then immobilized to beads or solid support matrices which can then be used to isolate and to identify cellular factors which bind to those DNA or amino acid sequences.
  • hRCE, mRCE or hAFC sequences may be expressed in a heterologous organism to:
  • A) examine many biochemical features of the protein such as catalytic mechanism, Kcat, Km, Vmax, and oligomeric status;
  • B) determine the substrate specificities of the CAAX processing enzyme such as the tolerance of non-aliphatic residues at the Al and A2 position, and permissive and non-permissive substitutions at the X position, lipid specificities attached to the cysteine and amino acid sequence specificities N-terminal of the cysteine; C) create an in vitro assay for the purpose of identifying and characterizing inhibitors of the protease;
  • Yeast which express an activated allele of Ras display a heat shock sensitivity when exposed to an acute treatment at 55°C
  • the endogenous protease gene products may be inactivated (e.g., by mutation of the AFCl and RCEl genes) by standard methods, which causes a suppression of the heat-shock sensitivity.
  • Normal, mutant and altered forms of the hRCE or mRCE cDNAs may be expressed in yeast cells lacking endogenous AFCl and RCEl activities (e.g., the deletion strains described herein) and the yeast cells tested for relative sensitivity to heat-shock.
  • hRCE, mRCE or hAFC sequences may be used to create an antisense vector.
  • the antisense vector may be used to synthesize an RCE or AFC antisense molecule to specifically inactivate gene expression and thereby to inactivate the production of the RCEp or AFCp proteins.
  • Such vectors could be constructed such that it is operably linked and under the control of an inducible, tissue specific, developmental or constitutive promoter.
  • RCE or AFC activity could be selectively turned on or off at desired locations that would therapeutically effect a CAAX processing-mediated or affected diseased state or disorder in a patient.
  • hRCE, mRCE or hAFC sequences of this invention may be used to construct specific ribozymes that are directed to the endogenous RCE or hAFC mRNAs .
  • Methods for constructing various types of ribozymes are published and may be performed by those of skill in the art using the RCE or hAFC sequences described herein.
  • the mouse mRCE cDNA sequences of this invention may be used to construct a genetic knockout in a mouse.
  • the genetically altered mouse may in turn be used as a mammalian model for human diseases and disorders involving proteins processed by the RCE gene product.
  • Such genetically altered mice will thus be useful in determining the functional importance of mRCE in various mammalian disease states and disorders, and to assess the therapeutic value of inhibitors of mRCE or other therapeutic agents on the diseased state in mouse.
  • An example of one such diseased state is the tumorigenesis induced in mice by expressing activated forms of Ras
  • vall2 vall2 protein H-ras or K-ras
  • the antibodies of this invention have a variety of uses. For example, they are useful as reagents to screen for expression of mammalian RCE or AFC polypeptides, either in expression libraries constructed from mammalian DNA sources or from other samples in which the proteins may be present. Moreover, by virtue of their specific binding affinities, the antibodies of this invention are also useful to purify or remove polypeptides from a given sample, and to block or bind to specific epitopes on the polypeptides.
  • Such immunological reagents may be used for a variety of applications, some examples of which are below.
  • the assay may be performed by an immobilization-based protocol such as an immunoblot where the proteins are resolved by SDS-PAGE and transferred to a solid support or by an ELISA assay. Such an approach would reveal steady state levels of the proteins.
  • the synthesis rate and protein stability of the respective proteins could be determined by a pulse-chase experiment comprising an immunoprecipitation step using an antibody of this invention.
  • Deviations from the molecular weight predicted from the amino acid sequence may reveal polypeptide products from alternatively spliced mRNAs or post-translational processing events such as proteolysis, glycosylation, lipidation or phosphorylation.
  • Tissue or cell extracts may be fractionated into cellular components, the proteins resolved by electrophoresis and detected using the immunological reagents of this invention. Association of the protein with a specific cellular organelle may reveal the sub- cellular localization of the protease.
  • antibodies directed against Rcelp or Afclp could be labeled with a fluorescent tag such as fluorescein and incubated with permeabilized cells in culture. Fluorescent microscopy could then be employed to identify the localization of the protein in vivo.
  • Candidate Rcelp- or Afclp-interacting proteins identified genetically or by an assay such as the "two-hybrid screen" could be tested for their ability to interact in vitro.
  • Cells ectopically expressing the candidate protein and the respective protease could be cross-linked and immunoprecipitated with protein- specific antisera. Once precipitated, the complexes could be resolved by SDS-PAGE and immunoblotted with antisera directed against the second member of the putative complexes .
  • All of the above utilities apply to the corresponding use of the AFC sequences, polypeptides and inhibitors and antibodies of those polypeptides according to this invention whether or not specifically referred to above. They also apply to the corresponding use of related sequences which may be identified, for example, by hybridization to the sequences of this invention, and their corresponding expression products.
  • This invention is also directed to methods for identifying inhibitors of mammalian CAAX processing enzymes by screening host cells transformed with the DNA sequences of this invention in in vivo biological activity assays such as those described herein, or by in vitro enzyme inhibition assays using the polypeptide expression products of the mammalian RCE and AFC sequences.
  • Inhibitors of the mammalian CAAX processing enzymes will be useful as therapeutics for treating certain CAAX protein-mediated diseased states or disorders in a patient, for example, for treating cancers or tumors correlated to Ras activation.
  • a small molecule inhibitor of a mammalian Rcelp or Afclp protein is administered to a patient suffering from a CAAX protein-mediated disease or disorder.
  • the inhibitors of mammalian CAAX processing enzymes of this invention may be formulated into pharmaceutical compositions and administered in vivo at an effective dose to treat the particular clinical condition addressed. Determination of a preferred pharmaceutical formulation and a therapeutically efficient dose regiment for a given application is within the skill of the art taking into consideration, for example, the condition and weight of the patient, the extent of desired treatment and the tolerance of the patient for the treatment.
  • Administration of the inhibitors of mammalian CAAX processing enzymes of this invention may be accomplished using any of the conventionally accepted modes of administration of agents which are used to treat, for example, tumors and cancers.
  • compositions of this invention may be in a variety of forms, which may be selected according to the preferred modes of administration. These include, for example, solid, semi- solid and liquid dosage forms such as tablets, pills, powders, liquid solutions or suspensions, suppositories, and injectable and infusible solutions. The preferred form depends on the intended mode of administration and therapeutic application. Modes of administration may include oral, parenteral, subcutaneous, intravenous, intralesional or topical administration.
  • the inhibitors of this invention may, for example, be placed into sterile, isotonic formulations with or without cofactors which stimulate uptake or stability.
  • the formulation is preferably liquid, or may be lyophilized powder.
  • the inhibitors may be diluted with a formulation buffer comprising 5.0 mg/ml citric acid monohydrate, 2.7 mg/ml trisodium citrate, 41 mg/ml mannitol, 1 mg/ml glycine and 1 mg/ml polysorbate 20.
  • This solution can be lyophilized, stored under refrigeration and reconstituted prior to administration with sterile Water-For-Injection (USP) .
  • USP Water-For-Injection
  • compositions also will preferably include conventional pharmaceutically acceptable carriers well known in the art (see for example Remington's Pharmaceutical Sciences, 16th Edition, 1980, Mac Publishing Company) .
  • Such pharmaceutically acceptable carriers may include other medicinal agents, carriers, genetic carriers, adjuvants, excipients, etc., such as human serum albumin or plasma preparations.
  • the compositions are preferably in the form of a unit dose and will usually be administered one or more times a day.
  • the pharmaceutical compositions of this invention may also be administered using microspheres, liposomes, other microparticulate delivery systems or sustained release formulations placed in, near, or otherwise in communication with affected tissues or the bloodstream.
  • sustained release carriers include semipermeable polymer matrices in the form of shaped articles such as suppositories or microcapsules.
  • Implantable or microcapsular sustained release matrices include polylactides (U.S. Patent No. 3,773,319; EP 58,481), copolymers of L-glutamic acid and gamma ethyl-L-glutamate (Sidman et al., Biopolymers, 22, pp. 547-56 (1985)); poly (2-hydroxyethyl-methacrylate) or ethylene vinyl acetate (Langer et al., J. Bio ed. Mater. Res. , 15, pp.
  • Liposomes containing inhibitors of CAAX processing enzymes may be prepared by well-known methods (See, e.g. DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. U.S.A.. 82, pp. 3688-92 (1985); Hwang et al., Proc. Natl. Acad. Sci. U.S.A.. 77, pp. 4030-34 (1980); U.S. Patent Nos. 4,485,045 and 4,544,545). Ordinarily the liposomes are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content is greater than about 30 mol.% cholesterol. The proportion of cholesterol is selected to control the optimal rate of MAG derivative and inhibitor release.
  • the inhibitors of mammalian CAAX processing enzymes of this invention may also be attached to liposomes, which may optionally contain other agents to aid in targeting or administration of the compositions to the desired treatment site. Attachment of inhibitors to liposomes may be accomplished by any known cross-linking agent such as heterobifunctional cross-linking agents that have been widely used to couple toxins or chemotherapeutic agents to antibodies for targeted delivery. Conjugation to liposomes can also be accomplished using the carbohydrate-directed cross-linking reagent 4- (4- maleimidophenyl) butyric acid hydrazide (MPBH) (Duzgunes et al., J. Cell. Biochem. Abst. Suppl . 16E 77 (1992)).
  • MPBH carbohydrate-directed cross-linking reagent 4- (4- maleimidophenyl) butyric acid hydrazide
  • yeast Rcelp protein (ID#358628) with similarity to the yeast Rcelp protein was purchased from Research Genetics and served as a probe.
  • the EST was recovered from the vector by digesting with
  • the hybridization buffer consisted of 5X Denhardt's solution, 5X SSC, 0.1% SDS, and 100 ⁇ g/ml denatured salmon sperm DNA.
  • the filters were washed at high stringency at 65°C in 0.1% SDS and 0.1% SSC and exposed to autoradiography .
  • Two plaques, 13R and 27R, which showed a strong hybridization signal to the human EST (#358628) were purified to homogeneity by two successive rounds of plating and hybridization according to standard procedures . Phage stocks were prepared by overlaying plates of E.
  • LE392 cells fresh overnight culture concentrated in 40% volume in SM
  • the infection was incubated for 30 minutes at 42°C, plated on large LB plates with top agar, and incubated at 42°C overnight.
  • the plates were then overlayed with 15 mis ⁇ diluent and allowed to shake gently for 2 hours at room temperature.
  • the ⁇ diluent was then drawn off and the cells pelleted out.
  • DNA was then prepared following standard procedures (see, e.g., Sambrook et al., Molecular Cloning — A Laboratory Manual [2 nd ed.] Vol. 1-3 (1989)), and resuspended in TE.
  • PCR Amplification The 5' ends of the ⁇ clones were PCR amplified using two different internal primers (608RI, 5'- GGTTGCGTACAGACACTCCGTC-3' (SEQ ID NO: 39); RCE 1R-RI, 5'- TTATTATACCACTTTTACCCG-3' (SEQ ID NO: 40)) and an external ⁇ primer (OMS178, 5' -TGGCGCAGGTAGCAGAG-3' (SEQ ID NO: 41) ) using Vent DNA polymerase) . The PCR reactions were then treated with Taq DNA polymerase for 10 minutes at 72°C to produce A overhangs.
  • the reactions were extracted with phenol/chloroform, precipitated with NaOAc/EtOH and resuspended in TE.
  • the Invitrogen® Original TA Cloning® kit was used to clone the PCR products into pCR® 2.1 and transformed into INV ⁇ F' .
  • Five clones containing inserts were identified: 13R/608-178-1, -3, 13R/RCE-178-3, 27R/608-178-1, -3. These five clones were sequenced using the ABI 377 sequencer.
  • the 3' ends of the ⁇ clones were subcloned by PCR amplifying the ends with an internal primer (8F, 5'- CTGCCTCACAGACATGCGTTG-3' (SEQ ID NO: 42)) and an external ⁇ primer (OMS179, 5' -GCGAAATACGGGCAGACATGG-3' (SEQ ID NO: 43) ) using Taq DNA polymerase.
  • an internal primer 8F, 5'- CTGCCTCACAGACATGCGTTG-3' (SEQ ID NO: 42)
  • an external ⁇ primer OLS179, 5' -GCGAAATACGGGCAGACATGG-3' (SEQ ID NO: 43)
  • Cloning® kit was used to clone the PCR products into pCR® 2.1 and transformed into INV ⁇ F'. Three clones containing inserts were identified: 13R/8F-179-1, 27R/8F-179-1, -2. These three clones were sequenced using the ABI 377 sequencer.
  • Adaptor-ligated cDNA constructed from total human fetus mRNA (#7438-1), human placenta (#7411-1) and murine 15-day embryo (#7459-1) were purchased from Clonetech (Palo Alto, CA) . Reactions were carried out under the recommended conditions of the manufacturer which included the following PCR conditions: IX KlenTaq PCR buffer (Clonetech), 200 ⁇ M dNTPs, 200nM oligonucleotide primers,
  • PCR reactions included the primer API (Clonetech) which is complementary to the cDNA adapter sequence and a gene specific primer.
  • PCR reactions with human cDNA included one of the gene specific primers hRCE(lR) (SEQ .
  • Extension of the primer was initiated by the addition of 40 ⁇ l of reverse transcriptase buffer (20 mM Tris [pH 8.7], 10 mM MgCl 2 , 100 ⁇ g/ml actinomycin D, 5 mM dithiothreitol, 0.33 mM deoxynucleoside triphosphates) and 200 units SuperScriptTMII (GibcoBRL, catalog # 18064-014) .
  • the extension reaction was placed at 37°C for 30 minutes and then terminated with the addition of 300 ⁇ l ethanol.
  • the nucleic acids were pelleted and resuspended in 5 ⁇ l formamide loading buffer/0.1 M NaOH (2:1). The reaction products were then separated on an 8% polyacrylamide/urea sequencing gel, and visualized by autoradiography.
  • the predominant extension product was approximately 41 nucleotides long.
  • the reactions were terminated with the addition of 120 ⁇ l ethanol and the nucleic acids pelleted.
  • the products were resuspended in 5 ⁇ l formamide loading buffer/0.1 M NaOH (2:1).
  • double stranded sequencing reactions were performed using 15R (SEQ ID NO: 21) as the primer and a subclone of the ⁇ clone 27R (27R/608-178-1, see above) .
  • the sequencing reactions were performed utilizing the Sequenase Version 2.0 DNA Sequencing Kit (United States Biochemical, catalog # US70770) as per the kit instructions.
  • the primer extension products and the sequencing reactions were resolved on an 8% polyacrylamide/urea sequencing gel followed by autoradiography.
  • Example 3 Expression of the human -RCE cDNA in a yeast mutant
  • the ORF beginning at position 116 of the hRCE gene (SEQ ID NO:l, nucleotides 116 to 997; Figure 2A) was expressed in a yeast strain deleted for both CAAX protease genes, yAFCl and yRCEl (Boyartchuk et al., supra) (Example 3) .
  • yeast strain deleted for both CAAX protease genes yAFCl and yRCEl (Boyartchuk et al., supra) (Example 3) .
  • To create the expression construct we amplified by PCR the human hRCE ORF from the first valine (defined by nucleotides 116-118) to the stop codon (defined by nucleotides 995-997) employing the human hRCE cDNA ⁇ clone 27R as a template.
  • the primers included in the reaction were hRCE-ValX (SEQ ID NO:22) and hRCE3P-RI (SEQ ID NO: 23) .
  • the primer hRCE-ValX incorporated an X£>al restriction site immediately upstream (5') of the GTG valine codon (defined by SEQ ID NO:l, nucleotides 116- 118) .
  • the primer hRCE3P-RI incorporated an £coRI site immediately downstream (3') of the TGA stop codon (defined by SEQ ID NO:l, nucleotides 995-997).
  • pACAl' contains 1000 base-pairs of the yeast MEV1 promoter including the translational initiation codon ATG.
  • the MEV1 promoter was amplified by PCR from yeast genomic DNA using the following primers:
  • MEV1-1 (containing HindiII site): 5 ' -GGA AAA GCT TTT GAA GCA CCT GTG GAG T-3' (SEQ ID NO:44); and MEV1-2 (containing Xbal site): 5'-GTA AC GTT CTA- GAC ATT TTG AGT ACG TC-3' (SEQ ID NO: 45) .
  • Vector pACA5' contains 700 base-pairs of the yeast ERG12 promoter and the first four codons of the ERG12 coding sequence.
  • the ERG12 promoter was amplified by PCR from yeast genomic DNA using the following primers:
  • ERG12-1 (containing Sphl site): 5'-GGC CGG GCA TGC AAC CTA TTC GAG ATG AAT GCG-3' (SEQ ID NO:46);
  • ERG12-2 (containing X ⁇ al site): 5 ' -CCG GCC TCT AGA CGG TAA TGA CAT ATT GAC AGT-3' (SEQ ID NO: 47) .
  • Both expression vectors include an £coRI-NarI fragment containing the yeast PGKl transcriptional termination sequence.
  • a fragment of D ⁇ A comprising the yeast PGKl termination sequence was amplified by PCR from yeast genomic D ⁇ A using the following primers:
  • PGK 1 (containing an EcoRI site) : 5 ' -GAT TGA ATT CAA TTG AAA TCG ATA G-3' (SEQ ID ⁇ O:48);
  • PGK 2 (containing a Narl site) : 5' -CCG AGG CGC CGA ATT TTC GAG TTA T-3' (SEQ ID NO: 49) .
  • the yeast strain JRY5317 (Boyartchuk et al., supra) is an ⁇ .-mating type strain in which both CAAX protease genes have been deleted.
  • JRY5317 was grown overnight in rich medium (2% Bacto Peptone, 1% Yeast Extract, 2% Glucose) .
  • the transformation was performed by the lithium acetate method (Gietz and Schiestl, Yeast, 7, pp. 253-263 (1991) ) .
  • Transformants were selected for uracil prototrophy on minimal medium.
  • Plasmids pACA7001 and pACA7002 were each transformed into the yeast strain JRY5317, an a-mating type strain in which both CAAX protease genes have been deleted (Boyartchuk et al., supra) (Example 3).
  • Halo assays were performed by replica plating patches of the yeast transformants onto a lawn of ⁇ sst2 tester lawn as described (Boyartchuk et al., supra) . Cells harboring an sst2 mutation are rendered supersensitive to mating pheromone and undergo a Gl arrest when exposed to pheromone. This creates a halo around the patch of mated yeast cells.
  • Example 4 Expression of the human hAFC cDNA in a yeast mutant.
  • the ORF described herein which encodes a homolog of the yeast AFCl gene (SEQ ID NO: 37; Figure 8) is expressed in a yeast strain deleted for both CAAX protease genes, yAFCl and yRCEl (Boyartchuk et al., supra) as described in Example 3.
  • the human AFC ORF (SEQ ID NO: 37) is amplified from the first GCG valine codon to the stop codon at ca. nucleotide 721 (see Figure 7) and inserted into yeast expression vectors as described in Example 3.
  • Example 5 A screening method for identifying inhibitors of mammalian CAAX processing enzymes .
  • a surrogate organism that is easy to manipulate and maintain such as Saccharomyces cerevisiae, may be transformed with an expression vector encoding hRCE or hAFC sequences. Such a recombinant organism may serve as a template for in vivo screening for inhibitors of the protease.
  • Such an assay could be direct or indirect.
  • a direct assay could be created based upon the cleavage of a CAAX-containing proteins such as a-factor in a halo assay or mating assay as described in (Boyartchuk et al., supra) .
  • Another assay could be based upon the requirement of CAAX proteolysis for Ras function.
  • Such an assay would utilize a yeast strain possessing hRCE and an activated allele of Ras such as the Glyl9 to Vall9 mutation.
  • Cells that harbor an activated allele of Ras are susceptible to acute heat shock at 55°C
  • one could screen for inhibitors of Ras by screening or selecting compounds that cause a heat-shock resistant phenotype.
  • an assay for hRCE could be based upon an indirect effect of modulating the CAAX protease activity.
  • cells could be transformed with either a normal or non-functional version of hRCE. The expression of other selected genes in an organism could then be assessed. Once a gene was found whose expression was dependent upon the function of hRCE, that gene could then serve as the basis of an assay.
  • Example 6 A screening method for identifying proteins that interact with mammalian CAAX processing enzymes
  • a protein which physically interacts with a mammalian CAAX protease according to this invention may be isolated using the "two-hybrid system” (see, e.g., CT. Chien et al., "The two-hybrid system: a method to identify and clone genes for proteins that interact with a protein of interest", Proc. Natl. Acad. Sci. U.S.A., 88(21), pp. 9578-82 (1991)).
  • This method takes advantage of the multi-domain structure of the yeast transcriptional activator protein GAL4, which has a DNA binding domain separable from a transcriptional activation domain.
  • Expression plasmids are constructed to encode one of two hybrid proteins.
  • One hybrid consists of the GAL4 DNA-binding domain fused to a mammalian CAAX protease polypeptide of this invention.
  • the second hybrid consists of the GAL4 activation domain fused to protein sequences encoded by a library of genomic DNA or cDNA fragments from a mammalian or other cellular source.
  • the plasmids are co-transformed into an appropriate yeast strain, or are individually transformed into yeast strains of opposite mating types having genetic markers that allow selection of diploids.) Interaction between the CAAX protease and a protein encoded by one of the library plasmids leads to transcriptional activation of a reporter gene of choice which contains an appropriately positioned GAL4 binding site.
  • the advantage of this method is that the gene which encodes the protein that interacts with the CAAX protease may be identified directly from the library.
  • Example 7 Mutagenesis of the Histidine-Rich Region of hRCEl Each histidine comprising the histidine-rich sequence "HxHH" of the human Reel protein (SEQ ID NO: 2, amino acids 172-175) were changed individually to an alanine residue by site-directed mutagenesis.
  • Plasmids pAB15, pAB16, and pAB17 were constructed as follows: An 888 bp Xbal-EcoRI fragment was cut from pAB13 (see pACA7002, Example 3) and cloned into pBluescript SK. Histidines 172, 174 and 175 were each mutated to an alanine using the QuikChangeTM Site-Directed Mutagenesis Kit following manufacturer's directions (Stratagene) .
  • hRCEl-Ml 5' -CTC TTT TTT GGA GTT GCC ££T TTT CAC CAT ATT ATT GAG CAG CTG-3'
  • hRCEl-M2 5' -CAG CTG CTC AAT AAT ATG GTG AAA AGC GGC AAC TCC AAA AAA GAG-3'
  • hRCEl-M4 5' -CAG CTG CTC AAT AAT ATG ⁇ GC AAA ATG GGC AAC TCC AAA AAA GAG-3'
  • hRCEl-M6 5' -GAA ACG CAG CTG CTC AAT AAT AG_£ GTG AAA ATG GGC AAC TCC-3'
  • Example 8 Whole Cell Co-Culture Assay for Identifying Rcelp Inhibitors
  • a strain of yeast is constructed in which the endogenous RAS1 gene has been deleted and the endogenous RAS2 gene has been replaced with the temperature-sensitive ras2-23 allele, which sensitizes the strains such that their growth rates depend upon the function of Rcelp.
  • Cells from this strain are transformed with a high-copy (2 ⁇ ) plasmid (e.g., a LEU2, URA3 or HIS3-marked plasmid whose presence can be selected in the appropriate leu, ura or his auxotrophic strain background) which either does (Strain 1) or does not (Strain 2) additionally encode: a) the RCEl gene driven by a strong (e.g., the ADHl ) promoter; and b) a marker gene (e.g., green fluorescent protein (GFP)) under the control of a strong promoter.
  • Strain 1 and Strain 2 transformants are selected on the appropriate medium (e.g., selecting for leu, ura or his prototrophy) .
  • Strain 1 (but not Strain 2) transformants are also selected based on their fluorescence (from expression of the plasmid-linked GFP marker) .
  • a typical 96-well plate will handle about 0.25ml of liquid.
  • One ODggo °f yeast in rich medium (YPD) is about 1 x 10 7 cells/ml.
  • YPD ODggo °f yeast in rich medium
  • a typical yeast overnight growth in a 96-well plate will produce an OD 6Q O ⁇ 4. This translates to a 40-fold increase in growth, or 5.3 generations.
  • a dilution series (e.g., ten-fold) of each selected potential inhibitor resuspended in an appropriate medium is prepared and an aliquot of each test sample added individually to the wells of the 96-well plates which contain the co-cultured strains.
  • an equal volume of each medium in which the test inhibitors are suspended (now without the test inhibitor) is added to a well containing the strain co-culture.
  • the co-cultures plus test chemicals and controls are incubated until the cells have divided numerous times.
  • the cells are diluted into fresh medium and allowed to undergo ca. 5 generation doublings (as determined by spectrographic absorbance readings) .
  • the relative abundance of Strain 1 in the mixture is assessed by fluorescence emission from the 96-well plates.
  • Example 9 Heat Shock Assays for Measuring hRCE Function and for Identifying Rcelp
  • Plasmid pABl was constructed as follows: The backbone plasmid for pABl was Yeplacl95 (2 ⁇ , URA3; Gietz and Sugino, 1988) and contains an insert consisting of two parts: green fluorescent protein (GFP) and phosphoglycerate kinase ⁇ PGKl ) terminator. The coding region of GFP from Aequoria victoria containing the S65T mutation (Ormo et al., 1996) was amplified by PCR using oligonucleotides:
  • 5P-GFP-ORF (5' -CAT GTC TAG AGG AGA AGA ACT TTT C-3') (SEQ ID NO: 51) ;
  • 3P-GFP-ORF (5' -CGC GAA TTC CTA TTT GTA TAG TTC A-3'); and has Xbal and EcoRI sites engineered on the 5' and 3' ends, respectively.
  • the PGKl terminator was amplified by PCR using the following oligonucleotides:
  • PGK 1 (containing an EcoRI site) : 5 * -GAT TGA ATT CAA TTG AAA TCG ATA G-3' (SEQ ID NO:48);
  • PGK 2 (containing a NarI site) : 5' -CCG AGG CGC CGA ATT TTC GAG TTA T-3' (SEQ ID NO: 49) and consists of 263 bp immediately downstream of the stop codon of PGKl .
  • EcoRI and Narl sites were engineered on the 5' and 3' ends of the PGKl terminator.
  • Plasmid pAB20 was constructed as follows: The val19 dominant allele of RAS2, RAS2 (D. Broek, et al . ,
  • This plasmid carries the coding region for
  • Yeast cells wild-type for RCEl (W303) ⁇ MAT&; leu2-3, 112; ade2-101 ; his3-l l ; trpl -1 ; ura3-l ) were used to create an isogenic, double mutant strain lacking both the Rcelp and Afclp CAAX proteases ⁇ rcel , ⁇ afcl ) .
  • the wild-type (W303) or ⁇ rcel , ⁇ afcl strains were separately transformed with plasmid pAB20 expressing the dominant val19 mutant RAS2 gene.
  • Transformed ⁇ rcel , ⁇ afcl cells then were co-transformed with one of the following URA3 marked, high-copy (2 ⁇ ) plasmids: pABl (vector control); pAB13 ⁇ hRCEl ) , pABl5 (H172A) , pABl ⁇ (H174A) ; and pAB17
  • H175A (as described in Figure 10) .
  • Yeast cell transformants were grown at 30°C for approximately 48 hours in synthetic complete media lacking uracil and leucine and supplemented with 2% glucose, to an approximate OD600 of 4-6.
  • the cultures were diluted 1:10 in water in 0.5 ml PCR tubes and placed in a PTC-100 thermocycler (MJ Research, Inc.).
  • the heat-shock regimen was performed as follows: 25°C 3 min; 55°C 1 min, 25°C 3 min.
  • the cultures were further diluted in water and plated for viability. The number of surviving colonies were compared with those from parallel samples that had not undergone heat shock. The results of this experiment are shown in Figure 11.
  • Heat shock assays to identify CAAX protease inhibitors may be modified by one of skill in the art to identify inhibitors of Rcelp in the presence of an activated RAS2 allele. A dilution series of each selected potential inhibitor may be tested (e.g., as described in Example 8). Rcelp inhibitors may be identified by their ability to increase in a concentration-dependent manner the viability of heat- shocked cells which express activated Ras2p, which express Rcelp from a chromosomal and/or from a single or multicopy plasmid. In addition, the cells can be deleted for the yRCEl gene to identify inhibitors of the human hRcelp protein.

Abstract

The present invention provides mammalian DNA sequences that display a high degree of sequence identity to their Saccharomyces cerevisae counterparts, RCE1 and AFC1. Specifically, complementary DNA (cDNA) sequences of the human and mouse RCE functional homologs are provide. Human cDNA sequences encoding proteins having a high degree of amino acid sequence identity to the yeast Afc1p protein are also provided. This invention is also directed to recombinant DNA molecules comprising the mammalian DNA sequences, DNA molecules and antisense RNA molecules which hybridize under stringent hybridization conditions to those DNA sequences, hosts transformed with their recombinant DNA molecules and protein expression products produced by culturing the transformed hosts. Antibodies directed against the protein expression products are also provided. Also provided are assays to identify inhibitors of one or more mammalian CAAX processing enzymes and kits for making the above products and performing the above assays. Finally, this invention provides pharmaceutical compositions comprising an inhibitor of a mammalian CAAX processing enzyme, and methods for treating a CAAX-protein mediated disease or disorder in a patient by administering such a pharmaceutical composition..

Description

MAMMALIAN CAAX PROCESSING ENZYMES
BACKGROUND OF THE INVENTION The eukaryotic cell is made up of a wide variety of specialized structures and membrane-enclosed compartments which create special intracellular environments. Many cellular components, such as proteins, must be targeted to particular intracellular locations for proper function. The lipid bilayer membrane that surrounds the cell is one such specialized cellular environment. Cellular components such as proteins are delivered to and must associate in a particular way with the correct membrane surface of the lipid bilayer. A number of mechanisms which the cell has evolved for targeting proteins to and inserting proteins into the lipid bilayer membrane of the cell have now been described.
Proteins targeted to the lipid bilayer membrane may be situated within the lipid bilayer membrane (i.e., a portion of the protein is inserted into and across the lipid bilayer) and are typically referred to as "integral membrane proteins". Integral membrane proteins possess intrinsic hydrophobic regions (comprising amino acid regions of relatively high intrinsic or apparent hydrophobicity) which are often inserted into the lipid bilayer as the protein is being synthesized. Alternatively, proteins targeted to the lipid bilayer membrane may be situated at and associate with the membrane surface without being inserted across the membrane. Such proteins are typically referred to as "peripheral membrane proteins".
Peripheral membrane proteins are usually less tightly associated with the lipid bilayer than integral membrane proteins. Peripheral membrane proteins may associate with the lipid bilayer by protein-protein interactions, by hydrophobic interactions between the bilayer and one or more intrinsically hydrophobic regions of the membrane protein, or by the co- or post- translational addition of a lipid group to the membrane protein. The most recently discovered lipid modification involves the covalent attachment of cholesterol biosynthetic intermediates to proteins, and is referred to generically as protein prenylation. See, e.g., W.R. Schafer and J. Rine, Annu. Rev. Genet.. 26, p. 209 (1992); S. Clarke, Annu. Rev. Biochem., 61, p. 355 (1992); each of which is incorporated herein by reference.
One class of peripheral membrane proteins processed by the latter mechanism has a terpenoid lipid added to the C-terminal end for such purposes. Proteins belonging to this class (often referred to as prenylated proteins) that possess a C-terminal tetrapeptide CAAX motif (where C is cysteine, A is usually aliphatic, and X can be a variety of different residues) undergo a series of post-translational processing reactions. These reactions include the attachment of either a 15-carbon farnesyl or a 20-carbon geranylgeranyl lipid moiety to the cysteine residue of the CAAX motif (prenylation) , proteolytic removal of the C-terminal three amino acids (AAX) and ethylesterification of the -carboxyl group of the terminal prenylated cysteine residue. Prenylation of CAAX protein substrates is catalyzes by one of two different type I protein prenyl- transferases. If the X position is occupied by leucine or phenylalanine, the cysteine residue of the protein is geranylgeranylated. If the X position is occupied by any other amino acid, the cysteine residue of the protein is farnesylated. See V. L. Boyartchuk et al., Science, 275, pp. 1796-1800 (1997) and M.N. Ashby, Current Opinion in Lipidol. , 9, pp. 99-102 (1998) (incorporated herein by reference) .
Prenylated proteins include, for example, many proteins involved in signal transduction such as the Ras oncogenes, fungal pheromones and the y subunits of trimeric guanosine triphosphate (GTP) -binding proteins (G proteins) . They also include a wide range of small GTP- binding proteins that participate in vesicular trafficking. See, e.g, P. J. Casey, Curr. Opin. Cell.
Biol.. 6, p. 219 (1994); J.E. Roth an and F. T. ieland, Science, 272, p. 227 (1996) . Prenylation is critical for the activity of many of these proteins.
The yeast Saccharomyces cerevisiae possesses numerous CAAX-containing proteins including, but not limited to, Raslp, Ras2p, Mfalp, Mfa2p, and Stelδp. Yeast have two genes encoding prenylation-dependent CAAX endoproteinases, AFC1 and RCEl (Boyartchuk et al., supra) . The Afcl protein (Afclp; encoded by AFC1 ) is a zinc protease that participates in the processing of the yeast mating pheromone a.-factor. The Reel protein (Rcelp; encoded by RCEl ) contributes to the processing of ≤-factor and the yeast Ras protein (Ras2p) . Rcelp is only a minor contributor in vivo to _a-factor processing but appears to be required for Ras2p processing. Yeast have a single gene, STE14, which encodes the membrane-associated, CAAX C-terminal methyltransferase (Stel4p) (Ashby et al . , Yeast, 9, p. 907 (1993) (incorporated herein by reference) ) . Farnesylation of the Ras protein of yeast and humans is required for proper membrane localization. The Ras protein in normal cells contributes to the orderly cell divisions needed during growth and development. Ras proteins are attached to the inner surface of the lipid bilayer membrane that surrounds each cell. Ras proteins function as a primary intracellular signaling mechanism for cell proliferation. They respond to external stimuli such as growth factors and hormones which bind to the cell membrane and trigger an intracellular cascade of chemical reactions known generally as "signal transduction". These chemical signals lead to the cell nucleus, where they activate gene transcription and the process of cell division. When the extracellular stimulus stops, the internal signaling pathway is deactivated and the cell returns to its resting state.
When dominant mutations occur in the Ras gene (turning it into a Ras oncogene) , mutant proteins are produced that lock the signaling pathway in an active state, resulting in uncontrolled cell division. Dominant mutations in the Ras gene are presently the second most common type of mutations in human cancers. They are found in more than 50% of all human cancers including pancreatic cancer (95%), colon cancer (50%), lung cancers (45%) and leukemias (30%) . The functional activity of Ras protein depends on its attachment to the cell membrane, which in turn is mediated by protein prenylation. Farnesylation of the Ras protein in yeast and humans is required for its oncogenic transforming activity. Because the Ras protein is modified sequentially by the three different processing enzymes described above (which farnesylate, cleave and methylate the C-terminal CAAX motif) , each of these processing enzymes represents a potential target that could be attacked pharmacologically to block the oncogenic action of mutant Ras protein.
Protein farnesyl transferases have become an active target for drug development. Inhibitors of protein farnesyl transferases are being developed, for example, as potential therapeutics for Ras-based tumors. See, e.g., F. Tamanoi, Trends Biochem. Sci.. 18, p. 349 (1993); N. E. Kohl et al., J. Cell. Biochem. SUPPI ■ . 22, p. 145 (1995). The methyltransferase enzyme will also be an attractive target for drug development.
The mammalian counterparts to the yeast CAAX endoproteinase and methyltransferase genes have not been isolated. The mammalian homologs to each of the three yeast CAAX processing enzymes each represent a potential target to block the oncogenic action of mutant Ras protein in tumors, or more generally, to modulate the activity of prenylated peripheral membrane proteins. It would thus be desirable to identify and to isolate the mammalian homologs to the yeast CAAX endoproteinase and methyltransferase genes.
SUMMARY OF THE INVENTION The present invention provides mammalian DNA sequences that display a high degree of sequence identity to their counterpart genes in Saccharomyces cerevisiae, RCEl and AFC1 , encoding prenylation-dependent CAAX endoproteinases. Specifically, complementary DNA (cDNA) sequences of the human and mouse RCEl functional homologs are provided. Human cDNA sequences encoding proteins having a high degree of amino acid sequence identity to the yeast Afclp protein are also provided. This invention is also directed to recombinant DNA molecules comprising the mammalian DNA sequences, DNA molecules and antisense RNA molecules which hybridize under stringent hybridization conditions to those DNA sequences, hosts transformed with the recombinant DNA molecules and protein expression products produced by culturing the transformed hosts. Antibodies directed against the protein expression products are also provided. Also provided are assays to identify inhibitors of one or more mammalian CAAX processing enzymes and kits for making the above products and performing the above assays. Finally, this invention provides pharmaceutical compositions comprising an inhibitor of a mammalian CAAX processing enzyme, and methods for treating a CAAX-protein mediated disease or disorder in a patient by administering such a pharmaceutical composition.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1A. Amino acid sequence alignment of yeast RCE with a human EST.
Figure IB. Amino acid sequence alignment of yeast AFC1 (NAG2) with a human EST.
Figure IC. Amino acid sequence alignment of yeast STE14 with a mouse EST.
Figure 2A. Human hRCE cDNA (SEQ ID NO:l) and predicted protein sequences (SEQ ID NO:2) .
Figure 2B. Mouse mRCE cDNA (SEQ ID NO: 3) and predicted protein sequences from the first in-frame ATG codon (SEQ ID NO:4) .
Figure 2C. Mouse mRCE cDNA sequence with additional 5' genomic sequence relative to Figure 2B (SEQ ID NO: 5). The first in-frame ATG codon is underlined. The sequence obtained exclusively from the mouse genomic clones is represented from nucleotides 1-152 (SEQ ID NO: 5). See (SEQ ID NO: 6) for the predicted protein sequence. Figure 3. Amino acid sequence alignment of the yeast Rcelp protein (RCEl. PRO), the human hRCE protein (HRCE1.PR0) and the mouse mRCE (MRCEORF.PRO) protein. The alignment utilized the Clustal method with PAM250 residue weight table.
Figure 4. Northern blots of hRCE mRNA expression levels in various human tissues. The source tissue for the poly (A) RNA is indicated along the top, and the size standards (kbp) along the left side of each blot. The hRCE message appears to be approximately 1.4 kbp and was expressed in all tissues examined.
Figure 5. Primer extension analysis of hRCE mRNAs . Human placental and pancreatic poly (A) RNA was hybridized to a hRCE specific primer and extended using reverse transcriptase. The extension products were resolved by polyacrylamide gel electrophoresis, along with the products of a sequencing reaction to serve as size standards. The transcriptional start sites appear to be in the region from -10 to -26 (relative to the valine codon GTG at position 116-118; Figure 2A) .
Figure 6. Yeast halo assays of wild-type strain (W303) (MAT-a; leu2-3,112; ade2-101; his3-ll; trpl-1; ura3-l), a strain lacking the AFCl gene (Δafcl) , a strain lacking the RCEl gene (Δrcel), and a strain lacking both AFCl and RCEl (ΔΔ) are shown in the top four panels. Halo assays of a strain lacking both AFCl and RCEl genes expressing either no human protein (ΔΔpJPl), hRCE expressed from the yeast MEV1 promoter (ΔΔpACA7001 ) , or hRCE expressed from the yeast ERG12 promoter (ΔΔpACA7002) are shown in the lower three panels. Halos are present when hRCE is expressed in yeast indicating that the human protein is functional in yeast (see Example 3) .
Figure 7. Human hAFC cDNA and protein sequences.
Figure 8. Amino acid sequence alignment between the full length yeast Afclp protein (YAFCl.PRO) and the human hAFC protein (HAFC1.PRO). The alignment utilized the Clustal method with PAM250 residue weight table.
Figure 8. Amino acid alignment between the full length yeast Afclp protein (YAFCl.PRO) and the human hAFC protein (HAFC1.PRO). The alignment utilized the Clustal method with PAM250 residue weight table.
Figure 9. Amino acid alignment between a region of the human hRCElp protein (SEQ ID NO: 2, amino acid residues 166-233) comprising the histidine-rich sequence, HxHH (SEQ ID NO: 2, amino acid residues 172-175) and nucleic acid sequences in GenBank. The alignment utilized the Clustal method with PAM250 residue weight table.
Figure 10. Yeast halo assays performed with the following strains: wild-type strain (W303) (MAT-a; leu2-3,112; ade2- 101; his3-ll; trpl-1; ura3-l); a strain lacking the RCEl gene (Δrcel); a strain lacking both AFCl and RCEl (Δafcl,
Δrcel) ; a strain transformed with a URA3+ marked, high-copy (2μ) plasmid carrying wildtype hRCE expressed from the yeast ERG12 promoter (pAB13 wild-type) ; a strain transformed with a pAB13-derived plasmid in which the histidine at position 172 of hRCE was changed to an alanine (pAB15 H172A) ; a strain transformed with a pAB13- derived plasmid in which the histidine at position 174 of hRCE was changed to an alanine (pABlβ H174A) ; a strain transformed with a pAB13-derived plasmid in which the histidine at position 175 of hRCE was changed to an alanine (pAB17 H175A) . Halos are present when hRCE is expressed in yeast indicating that the human protein is functional in yeast (see Example 3) .
Figure 11. Heat shock assays in wild-type yeast cells
(W303) (MAT&; l eu2-3, 112; ade2-101 ; his3-l l ; trpl -1 ; ura3-
1 ) transformed with plasmid pAB20 expressing the activated
19 RAS2-val gene (dominant RAS2 allele) ("WT") ; and in a
W303-derived strain lacking both the Rcelp and Afclp CAAX proteases {Δrcel , Δafcl ) co-transformed with one of the following URA3 marked, high-copy (2μ) plasmids: pABl
(vector control), pAB13 (hRCEl ) , pABl5 (H172A) , pABlβ
(H174A) ; and pAB17 (H175A) (as described in Figure 10) .
Percent (%) viability after heat shock treatment is shown for each strain (see Example 9) .
PRIMER SEQUENCES
Oligonucleotides or "primers" were all purchased from Operon Technologies (Alameda, CA) and consisted of the following sequences: hRCE(lR)-RI (SEQ ID NO: 7) CGG AAT TCT CAA TAA TAT GGT GAA AAT GGG C; hRCE(2R)-RI (SEQ ID NO: 8) CGG AAT TCG GAT GAA GAG GAA AGC AGT G; hRCE(3F)-RI (SEQ ID NO: 9) GCG AAT TCT TCG GTG CCT ACA CTG CTT TC; hRCE(4F)-RI (SEQ ID NO: 10) GCG AAT TCC CAT TTT CAC CAT ATT ATT GAG C; hRCE(5R) (SEQ ID NO: 11) CTA ACA TGG GCA GCA TAC AGG; hRCE(6R) (SEQ ID NO: 12) CTG GTG AAG CGT CGC TTG A; hRCE(7F) (SEQ ID NO: 13) ATC CAG CCA GGC ACA TCC; hRCE(8F) (SEQ ID NO: 14) TTG CCT CAC AGA CAT GCG TTG; hRCE(9F) (SEQ ID NO: 15) AAC GCA TGT CTG TGA GGC AAC; hRCE(10R)-RI (SEQ ID NO: 16) CGG AAT TCG CTA ACA TGG GCA GCA TAC AGG CC; hRCE(HR) (SEQ ID NO: 17) GGT CAG GAG CAC AGG GGA GC; hRCE(12F) (SEQ ID NO: 18) GAA GGG CTG GCT GGG GTC; hRCE(13R) (SEQ ID NO: 19) GGT CCC TGG GAC GCA ACT AC- hRCE(14F) (SEQ ID NO: 20) CGG GGC GGC GCT GGG CGG GGA TGG GC; hRCE(15R) (SEQ ID NO:21) AGG CTG CCC ACG TAG GAG CAG GCG AGG CTG A, hRCE-ValX (SEQ ID NO: 22) GTG CTC TAG AGT GTC AGT GTT CTC CTG CCT CAG CCT C; hRCE3P-RI (SEQ ID NO: 23) CCG GGA ATT CAG GAG CAC AGG GGA GCT TGT GAG TCC C; hRCE3P-Xba (SEQ ID NO: 24) CCG GTC TAG AGG AGC ACA GGG GAG CTT GTG AGT CCC; 608(1F)-RI (SEQ ID NO: 25) CGG AAT TCC CTG CTG TGT TCA CCT GCC CA; 608(2F)-RI (SEQ ID NO: 26) CGG AAT TCC TCC TAC ACC GCT GTC TTC GGT G; 608(1R)-RI (SEQ ID NO:27) CGG AAT TCC CAG CGC ATG TCT GTG AGG CAG; 608(2R)-RI
(SEQ ID NO: 28) CGG AAT TCT GGG CAG GTG AAC ACA GCA GG; hAFC(lR)-RI (SEQ ID NO: 29) GCG AAT TCC TGG AAG ATG ATC AAT
AGT CCA AT; hAFC(2R) (SEQ ID NO: 30) TCT GGG CTT TTC TGT ACT
TAA TT; hAFC(3F) (SEQ ID NO: 31) GGA ACA GTG AAG AAA TAA AAG CTA AAG; hAFC(4F) (SEQ ID NO: 32) TGA GGA GGT ACT CGC TGT
ACT AGG; hSTE14 (IF) -RI (SEQ ID NO:33) GCG AAT TCG AGA ACA TCT TCT GGC CAG, hSTE14 (2R) -RI (SEQ ID NO: 34) GCG AAT TCT GGC CAG AAG ATG TTC TCG; hSTE14(3F) (SEQ ID NO:35) CTA CCA GAT AGC CAT CAG AGC; hSTE14 (4R) (SEQ ID NO:36) GCT CTG ATG GCT ATC TGG TAG.
Definitions
Unless otherwise defined, all technical and scientific terms used herein have the meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The practice of the present invention employs, unless otherwise indicated, conventional techniques of chemistry, molecular biology, microbiology, recombinant DNA, genetics and immunology. See, e.g., Maniatis et al., 1982; Sambrook et al., 1989; Ausubel et al., 1992; Glover, 1985; Anand, 1992; Guthrie and Fink, 1991 (which are incorporated herein by reference) .
An "isolated" or "substantially pure" nucleic acid, polynucleotide (e.g., an RNA, DNA or a mixed polymer) or polypeptide is one which is substantially separated from other cellular components that naturally accompany the native sequence or protein in its natural host cell, e.g., ribosomes, polymerases, many other mammalian genome sequences and proteins. The term embraces a nucleic acid sequence or polypeptide that has been removed from its naturally occurring environment, and includes recombinant or cloned DNA isolates and chemically synthesized analogs or analogs that are biologically synthesized by heterologous systems. The term "nucleic acids" or "polynucleotides" is also extended to refer to nucleic acids that encode a RCEl or AFCl polypeptide, polypeptide fragment, homologs and variants of those polypeptides, and protein fusions and deletions of any of the above. These nucleic acids comprise a sequence which is either derived from, or substantially similar to a natural CAAX protease-encoding gene or one having substantial homology with a natural CAAX protease-encoding gene or a portion thereof. The polynucleotide compositions of this invention may include both sense and antisense strands of RNA, cDNA, genomic DNA, and synthetic forms and mixed polymers of the above. They may be modified chemically or biochemically or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., polypeptides), intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids, etc.) Also included are synthetic molecules that mimic polynucleotides in their ability to bind to a designated sequence via hydrogen bonding and other chemical interactions. Such molecules are known in the art and include, for example, those in which peptide linkages substitute for phosphate linkages in the backbone of the molecule.
The term "substantial homology or similarity" when referring to a nucleic acid or fragment thereof indicates that, when optimally aligned with appropriate nucleotide insertions or deletions with another nucleic acid (or its complementary strand) , there is nucleotide sequence identity in at least about 60% of the nucleotide bases, usually at least about 70%, more usually at least about 80%, preferably at least about 90%, and more preferably at least about 95-98% of the nucleotide bases. Alternatively, substantial homology or similarity exists when a nucleic acid or fragment thereof hybridizes to another nucleic acid, to a strand of another nucleic acid, or to the complementary strand thereof, under selective hybridization conditions. Typically, selective hybridization will occur when there is at least about 55% homology — preferably at least about 65%, more preferably at least about 75%, and most preferably at least about 90% — over a stretch of at least about 14 nucleotides. See, e.g., Kanehisa, 1984. The length of homology comparison, as described, may be over a stretch of at least about nine nucleotides, usually at least about 20 nucleotides, more usually at least about 24 nucleotides, typically at least about 28 nucleotides, more typically at least about 32 nucleotides, and preferably at least about 36 or more nucleotides.
Nucleic acid hybridization will be affected by such conditions as salt concentration, temperature, solvents, the base composition of the hybridizing species, length of the complementary regions, and the number of nucleotide base mismatches between the hybridizing nucleic acids, as will be readily appreciated by those skilled in the art .
"Stringent hybridization" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments differ depending on different physical parameters such as temperature, the base composition and concentration of the reactants and the salt concentration. Stringent temperature conditions will generally include temperatures in excess of 30°C, typically in excess of 37°C, and preferably in excess of 45°C. Stringent salt conditions will ordinarily be less than 1000 mM, typically less than 500 mM, and preferably less than 200 mM. However, the combination of parameters is much more important than the measure of any single parameter. See, e.g., Wetmur and Davidson, 1968. Probe sequences may also hybridize specifically to duplex DNA under certain conditions to form triplex or other higher order DNA complexes. The preparation of such probes and suitable hybridization conditions are well known in the art. Generally, stringent hybridization and wash conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature at which 50% of the target sequence hybridizes to a perfectly matched probe. An example of stringent hybridization conditions for hybridization of complementary nucleic acid sequences having more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin per 50mL at 42°C for at least ten hours. An example of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook et al . , supra, for SSC buffer). Often the high stringency wash is preceded by a low stringency wash to remove excess probe. An exemplary medium stringency wash for duplex DNA of more than 100 base pairs is lxSSC at 45°C for 15 minutes. An exemplary low stringency wash for such a duplex is 4xSSC at 40°C for 15 minutes. In general, signal-to-noise ratio of 2x or higher than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization.
As defined herein, nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical to those which do hybridize under stringent conditions if the polypeptides which they encode are substantially identical. This occurs, for example, when a nucleic acid is created synthetically or recombinantly using a high codon degeneracy as permitted by the redundancy of the genetic code.
The term "identical" in the context of nucleic acid or polypeptide sequences refers to the residues in the two sequences which are the same when aligned for maximum correspondence. When percentage of sequence identity is used in reference to proteins or peptides, it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions -- where amino acid residues are substituted for other amino acid residues having side chains with similar chemical properties (e.g., charge or hydrophobicity) which do not substantially change the functional properties of the molecule. Where sequences differ in conservative substitutions the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Means for making this adjustment are well known to those of skill in the art (see, e.g., Pearson et al., Methods in Molecular Bioloσy, 24, pp. 307-31 (1994)).
The terms "substantial homology" or "substantial identity", when referring to polypeptides, indicate that the polypeptide or protein in question exhibits at least about 30% identity with an entire naturally-occurring protein or a portion thereof, usually at least about 70% identity, and preferably at least about 95% identity.
The term "substantially similar function" refers to the function of a modified nucleic acid or a modified polypeptide (or protein) with reference to its wild-type nucleic acid or polypeptide counterpart. The modified
CAAX protease polypeptide will be substantially homologous to the wild-type CAAX protease polypeptide and will have substantially the same function. The modified polypeptide may have an altered amino acid sequence and/or may contain modified amino acids.
In addition to the similarity of function, a modified CAAX protease polypeptide may have other useful properties, such as an altered (e.g., longer) half-life. The similarity of function (activity) of the modified polypeptide may be substantially the same as the activity of the wild-type CAAX protease polypeptide. Alternatively, the similarity of function (activity) of the modified polypeptide may differ from the activity of the wild-type polypeptide in, for example, substrate specificity and/or enzymatic activity. The modified polypeptide is synthesized using conventional techniques, or is encoded by a modified nucleic acid and produced using conventional techniques. The modified nucleic acid is prepared by conventional techniques. A nucleic acid with a function substantially similar to the wild-type
RCEl or AFCl gene function produces the modified protein as described above.
Homology, for polypeptides, is typically measured using sequence analysis software. See, e.g., the Sequence Analysis Software Package of the Genetics
Computer Group, University of Wisconsin Biotechnology Center, 910 University Avenue, Madison, Wisconsin 53705. Protein analysis software matches similar sequences using measure of homology assigned to various substitutions, deletions and other modifications. The length of polypeptide sequences compared for homology will generally be at least about 16 amino acids, usually at least about 20 residues, more usually at least about 24 residues, typically at least about 28 residues, and preferably more than about 35 residues. "Conservatively modified variations" of a particular nucleic acid sequence refers to nucleic acids that encode identical or essentially identical amino acid sequences or DNA sequences where no amino acid sequence is encoded, because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given polypeptide sequence. When a nucleic acid sequence is changed at one or more positions with no corresponding change in the amino acid sequence which it encodes, that mutation is called a "silent mutation". Thus, one species of a conservatively modified variation according to this invention is a silent mutation. Accordingly, every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent mutation or variation. Furthermore, one of skill in the art will recognize that individual substitutions, deletions, additions and the like, which alter, add or delete a single amino acid or a small percentage of amino acids (less than 5%, more typically less than 1%) in an encoded sequence are "conservatively modified variations" where the alterations result in the substitution of one amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. The following six groups each contain amino acids that are conservative substitutions for one another:
1) Alanine (A), Serine (S) , Threonine (T) ;
2) Aspartic Acid (D) , Gluta ic Acid (E) ;
3) Asparagine (N) , Glutamine (Q) ; 4) Arginine (R) , Lysine (K) ; 5) Isoleucine (I), Leucine (L) , Methionine (M) , Valine (V) , and
6) Phenylalanine (F), Tyrosine (Y) , Tryptophan (W)
The terms "RCEl (or AFCl) protein" or "RCEl (or AFCl) polypeptide" refer to a protein or polypeptide encoded by the RCEl (or AFCl ) locus, and variants or fragments thereof. The term "polypeptide" refers to a polymer of amino acids and its equivalent and does not refer to a specific length of the product; thus, peptides, oligopeptides and proteins are included within the definition of a polypeptide. This term also does not exclude modifications of the polypeptide, for example, glycosylations, acetylations, phosphorylations, and the like. Included within the definition are, for example, polypeptides containing one or more analogs of an amino acid (including, e.g., unnatural amino acids), polypeptides with substituted linkages as well as other modifications known in the art, both naturally and non-naturally occurring. Ordinarily, such polypeptides will be at least about 50% homologous to the native CAAX protease sequences, preferably in excess of about 90%, and more preferably at least about 95% homologous. Also included are proteins encoded by DNA which hybridize under high or low stringency conditions, to CAAX protease-encoding nucleic acids and closely related polypeptides or proteins retrieved by antisera raised against such protein (s).
A polypeptide "fragment", "portion" or "segment" refers to a stretch of amino acid residues of at least about five to seven contiguous amino acids, often at least about seven to nine contiguous amino acids, typically at least about nine to 13 contiguous amino acids and, most preferably, at least about 20 to 30 or more contiguous amino acids . The term "protein modifications" refers to RCEl (or AFCl) polypeptides or fragments thereof that are substantially homologous in primary structural sequence but which include, e.g., in, vivo or in vitro chemical and biochemical modifications or which incorporate unusual amino acids. Such modifications include, for example, acetylation, carboxylation, phosphorylation, glycosylation, ubiquitination, labeling, e.g., with radionuclides, and various enzymatic modifications, as will be readily appreciated by those well skilled in the art. A variety of methods for labeling polypeptides and of substituents or labels useful for such purposes are well known in the art, and include radioactive isotopes
32 such as P, ligands which bind to labeled antiligands (e.g., antibodies), fluorophores, chemiluminescent agents, enzymes, and antiligands which can serve as specific binding pair members for a labeled ligand. The choice of label depends on the sensitivity required, ease of conjugation with the primer, stability requirements, and available instrumentation. Methods for labeling polypeptides are well known in the art. See Sambrook et al., 1989 or Ausubel et al., 1992.
The term "fusion protein" refers to fusion polypeptides comprising RCEl or AFCl polypeptides and fragments. Homologous polypeptide fusions may be between two or more polypeptide sequences or between sequences derived from RCElp or AFClp and a related protein. Heterologous fusions may be constructed which would exhibit a combination of properties or activities of the derivative polypeptides. For example, active site or other domains may be "swapped" between CAAX protease polypeptides and other polypeptides or fragments. Such homologous or heterologous fusion polypeptides may display altered biological characteristics (e.g., altered enzymatic activity, substrate specificity and/or altered cellular localization) . The terms "isolated", "substantially pure", "purified", "purified and isolated" and "substantially homogeneous" are used interchangeably to describe a protein or polypeptide that has been separated from components that accompany it in its natural state. A monomeric protein is substantially pure when at least about 60 to 75% of a sample exhibits a single polypeptide sequence. A substantially pure protein will typically comprise about 60 to 90% W/W of a protein sample, more usually about 95%, and preferably will be over 99% pure.
Protein purity or homogeneity may be indicated by a number of means well known in the art, such as polyacrylamide gel electrophoresis of a protein sample, followed by visualizing a single polypeptide band upon staining the gel with a stain well known in the art. For certain purposes, higher resolution may be provided by using HPLC or other means well known in the art for purification.
A RCEl or AFCl protein is substantially free of naturally associated components when it is separated from the native contaminants that accompany it in its natural state. Thus, a polypeptide that is chemically synthesized or synthesized in a cellular system different from the cell from which it naturally originates will be substantially free from its naturally associated components. A protein may also be rendered substantially free of naturally associated components by isolation, using protein purification techniques well known in the art.
Besides substantially full-length polypeptides, the present invention provides for biologically active fragments of the polypeptides. Significant biological activities include CAAX substrate binding, protease activity, immunological activity or other (presently unidentified) biological activities characteristic of the CAAX protease polypeptides of this invention.
Immunological activities include both immunogenic function in a target immune system, as well as sharing of immunological epitopes for binding, serving as either a competitor or substitute antigen for an epitope of the RCEl or AFCl protein. Expression of recombinant DNA molecules according to this invention may involve post-translational modification of a resultant polypeptide by the host cell. For example, in mammalian cells expression might include, among other things, glycosylation, lipidation or phosphorylation of a polypeptide, or cleavage of a signal sequence to produce a "mature" protein. Accordingly, the polypeptide expression products of this invention encompass full-length polypeptides and modifications or derivatives thereof, such as glycosylated versions of such polypeptides, mature proteins, polypeptides retaining a signal peptide, truncated polypeptides having comparable biological activity, and the like.
As used herein, "epitope" refers to an antigenic determinant of a polypeptide, i.e., a region of a polypeptide that provokes an immunological response in a host. This region needs not comprise consecutive amino acids. The term epitope is also known in the art as "antigenic determinant". An epitope may comprise as few as three amino acids in a spatial conformation which is unique to the immune system of the host. Generally, an epitope consists of at least five such amino acids, and more usually consists of at least 8-10 such amino acids. Methods for determining the spatial conformation of such amino acids are known in the art. The term "antibody" refers to a polypeptide encoded by an immunoglobulin gene, genes, or fragments thereof. The immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant regions, as well as a myriad of immunoglobulin variable regions. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes IgG, IgM, IgA, IgD and IgE, respectively. Antibodies exist for example, as intact immunoglobulins or as a number of well-characterized fragments produced by digestion with various peptidases. For example, trypsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab which itself is a light chain joined to a VH-CH1 by a disulfide bond. The F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the hinge region thereby converting the F(ab)'2 dimer to a Fab' monomer. The Fab' monomer is essentially an Fab with part of the hinge region. See Fundamental Immunology, Third Edition, W.E. Paul, ed. Raven Press, N.Y. (1993) (incorporated herein by reference) for a detailed description of antibodies and antibody fragments. One of skill in the art recognizes that such Fab' fragments may be synthesized de novo either chemically or using recombinant DNA technology. Thus, as used herein, the term antibody includes antibody fragments produced by the modification of whole antibodies or those synthesized de novo. The antibodies of the present invention are optionally derived from libraries of recombinant antibodies in phage or similar vectors (see, e.g., Huse et al., Science, 246, pp. 1275-81 (1989); Ward et al.,
Nature, 341, pp. 544-46 (1989); Vaughan et al . , Nature Biotech. , 14, pp. 309-14 (1996) which are incorporated herein by reference) .
DETAILED DESCRIPTION OF THE INVENTION
Identifying Human ESTs For Use as Probes
We performed extensive database searches for DNA sequences capable of encoding protein sequence with significant sequence identity with the protein products of the yeast RCEl , AFCl , and STE14 genes using the search engine XREF (Bassett et al., Trends in Genetics, 11, pp. 372-373 (1995) . Three expressed sequence tags (ESTs) were found with extensive amino acid sequence identity to the corresponding yeast genes (Figures 1A-C) . The human EST NCBI#604642 (Genbank Accession # W96411) showed considerable sequence identity with the yeast RCEl gene product, Rcelp, over approximately 90 amino acids (Figure 1A) . In addition, the EST possessed 3 conserved histidine residues in a tetrapeptide sequence we believe is critical for catalysis (HxHH at positions 194-197 of Rcelp; see Figure 1A, fragment 188-202) .
The human EST NCBI#482520 (Genbank Accession # N76181) showed 45 percent sequence identity with the product of the AFCl gene (Afclp) over approximately 80 amino acids (Figure IB) . The murine EST NCBI#636296 (Genbank Accession # AA022288) sequence showed 36 percent identity with the product of the STE14 gene (Stel4p) over 67 amino acids.
Isolating human hRCE cDNA Homologs
To isolate the human cDNA counterpart to the yeast RCEl gene, we screened a λgtll cDNA library made from a 9-week old human fetus. The library (ATCC #77433) was purchased from the American Type Culture Collection (see Example 1) . Briefly, phage were plated onto an E. coli host strain, lifted onto nylon membranes and hybridized under stringent conditions to radio-labeled DNA sequences which were isolated from the human EST (ID#358628) (see above) . Two phage plaques, 13R and 27R, showed a strong hybridization signal to the human EST
(#358628) probe and were purified to homogeneity by two successive rounds of plating and hybridization (Example 1) . Sequencing the human hRCE cDNA
Phage stocks of the λgtll clones 13R and 27R were prepared and titered, and DNA was prepared from these two clones (Example 1) . The λgtll clone 27R contains a human cDNA insert of approximately 1.4 kB. An E. coli strain (LE392) infected with a purified phage stock of λgtll clone 27R was deposited as strain designation "27R-λgtll" on May 7, 1997 with the American Type Culture Collection (ATCC) (Rockville, MD) according to the provisions of the Budapest Treaty, and was assigned the ATCC accession number designated: ATCC #209020. All restrictions on the availability to the public of the above ATCC deposit will be irrevocably removed upon the granting of a patent on this application. The 5' and 3' ends of the λ clones were amplified by PCR as described in Example 1. Additional sequence information for the human hRCE cDNA was obtained by 5' and 3' RACE methods (Siebert et al.. Nucleic Acids Research, 23, pp. 1087-1088 (1995)) (see Example 1). The consensus sequence derived from the two human λgtll cDNAs, 13R and 27R, together with the human RACE PCR products revealed the presence of an open reading frame extending from the 5' end of the sequence to nucleotide 996 (SEQ ID NO:l) (Figure 2A and see below). 5' RACE reactions were also performed on mouse cDNA and a number of products were isolated and cloned into pCR2.1 (obtained from Invitrogen, Inc.). Two genomic mRCE clones, 20 (a 10.5 kbp BamHI fragment in pBluescript SK) and 75 (an 8.5 kbp EcoRI fragment in pbluescript SK) were sequenced using the primer RCE-15R. These mouse clone sequences were assembled into one continuous DNA sequence ("a contig") with the aid of a number of murine ESTs (Genbank ID Nos: AA072190, AA154658, AA168614, W14344, AA154864, AA218396) . The continuous cDNA and genomic sequences are shown in Figures 2B and 2C. The mouse RCE {mRCE) cDNA consensus also revealed the presence of a continuous ORF that extended from the 5' terminus to nucleotide 1004 (SEQ ID NO:3; SEQ ID NO:5) (see Figure 2B) . The first ATG codon encountered within the mouse ORF is underlined in Figure 2C. The predicted protein sequence of mRCE (SEQ ID NO: 4; SEQ ID NO: 6) is shown in Figure 2B.
The first ATG codon encountered within the human hRCE ORF (SEQ ID NO:l) is well into the coding sequence and is located beyond a region of sequence identity with the yeast Rcelp sequence (see Figure 3) suggesting that translation begins at an upstream site. The 5' terminal "G" residue of the hRCE cDNA sequence (SEQ ID NO:l) (see Figure 2A) , which was fused to linker sequences during the construction of the cDNA clone, may be preceded by an "AT" dinucleotide, forming an in-frame ATG codon for methionine. As described above, the corresponding G residue in the mouse mRCE sequence forms an ATG codon which was revealed upon obtaining additional genomic sequence information. If that ATG codon in mRCE is a functional translational start site, it is likely that the 5' G residue in the human hRCE sequence shown in Figure 2A is also preceded by an AT dinucleotide and the hRCE ORF begins with a methionine residue.
Alternatively, translational initiation may begin at a non-ATG codon. There are two in-frame GTG codons at positions 116-118 and 122-124 (SEQ ID NO:l) which are known to function as translational start sites on eukaryotic mRNAs . Initiation at either of these codons would encode a protein of very similar size and composition as the yeast counterpart.
Human hRCE RNA Analysis
To resolve whether the DNA sequence we determined from the λgtll cDNA clones and numerous 5' RACE products reflected the bona fide 5' terminus of the hRCE mRNA, we performed Northern blot analyses and primer extension experiments with human mRNA (Example 2) .
The Northern blots revealed the presence of a single abundant mRNA species approximately 1400 base-pairs in length. The expression pattern indicated that the hRCE gene is expressed in all tissues examined. The highest expression was found in placenta and pancreatic mRNA samples. The lowest expression was found in the brain. The size of the hRCE mRNA was consistent with transcription beginning at or within the sequence shown in Figure 2A.
To map more precisely the actual 5' ends of the hRCE mRNA, we performed primer extension experiments (Example 2) . The reaction products appeared to be the same when the reaction were performed under different conditions, and were the same from the two tissue types (Figure 5) . The transcriptional start sites appear to range from -10 to -26 relative to the valine codon GTG at position 116-118 (SEQ ID NO:l) (Figure 2A) .
Expression of the human hRCE cDNA
The most simple interpretation of the above analyses is that the start-site of translation of both the hRCE and mRCE genes is at the GTG codon located at position 116. To test whether the ORF beginning at position 116 encodes a functional counterpart to the yeast gene, we expressed this portion of the human ORF (SEQ ID NO:l nucleotide 116 to 997, Figure 2A) in a yeast strain deleted for both CAAX protease genes, yAFCl and yRCEl (Boyartchuk et al., supra) (Example 3). The human hRCE ORF was cloned into one of two expression vectors (pACAl' or pACA5') under the control of the yeast MEV1 promoter and an ATG translational initiation codon, or the yeast ERG12 promoter and the first four codons of the ERG12 coding sequence, respectively. The resulting plasmids, pACA7001 (also referred to as pAB14) and pACA7002 (also referred to as pAB13) , represent translational fusions between the MEV1 ORF and the ERG12 ORF, respectively. Both expression vectors contained the yeast PGK1 transcriptional termination sequence.
E. coli strains transformed with either pAB13 or pAB14 were deposited as strain designations " " and " ", respectively, on June , 1998 with the American Type Culture Collection (ATCC) (Rockville, MD) according to the provisions of the Budapest Treaty, and were assigned the ATCC accession number designated: ATCC # and ATCC # . All restrictions on the availability to the public of the above ATCC deposits will be irrevocably removed upon the granting of a patent on this application.
The yeast a.-mating factor is a CAAX-containing protein and must undergo C-terminal proteolysis in order to secrete biologically active pheromone. We tested whether the human hRCE expression constructs, pACA7001 and pACA7002, were functional in yeast by their ability to restore production of a-factor in strain JRY5317 deleted for the two endogenous CAAX proteases. Plasmids pACA7001 and pACA7002 were each transformed into the yeast strain JRY5317, an a.-mating type strain in which both CAAX protease genes have been deleted (Boyartchuk et al., supra) (Example 3) . We performed a halo assay by replica plating patches of the yeast transformants onto a lawn of sst2 tester lawn. Cells harboring an sst2 mutation are rendered supersensitive to mating pheromone and undergo a Gl arrest when exposed to pheromone.
Cells harboring either one of the hRCE expression plasmids, but not the vector control (vector with no hRCE insert) , produced significant amounts of biologically active ^.-factor (Figure 6) . This experiment demonstrates that the human hRCE cDNA (Figure 2A) is the functional human counterpart to the yeast RCEl gene and has a conserved substrate specificity extending to yeast &- actor. This experiment also demonstrates that the ORF that begins at the GTG codon (SEQ ID N0:1, nucleotides 116-118) and ends at the TGA stop codon (SEQ ID N0:1, nucleotides 995-997) encodes a functional human CAAX protease.
Mammalian homologs of the yeast AFCl gene Comparison of the yeast AFCl gene with the NCBI
EST database revealed the presence of several ESTs with significant sequence identity. The Genbank Accession numbers for the ESTs are: AA359058, H02497, AA210930, AA295448, Z43273, N76181, T35312, F11310, R54272, AA01886, AA134125, Z45538, H15202, and N30505. The sequences were aligned and found to contain a continuous ORF from the 5' end to nucleotide 721 (SEQ ID NO:37) (see Figure 7). The predicted amino acid sequence (SEQ ID NO: 38) shares extensive sequence identity with the yeast Afclp protein, including the proposed active site sequence HExxH
(Boyartchuck et al., supra) which is characteristic of zinc metalloproteases (Figure 8) . Thus, this sequence likely encodes the functional counterpart in humans to the yeast CAAX protease, Afclp (Example 4) .
Protein Sequences Related to Human hRcelp
To identify other known protein sequences related to those comprising human hRCElp, we performed a BLASTP search of GenBank with a fragment of the human Reel protein sequence including the histidine-rich sequence "HxHH" (SEQ ID NO:2, amino acids 172-175). The search revealed sequence similarity with both yeast CAAX proteases, yRcelp and yAfclp (Figure 9) . The amino acid sequence of yeast Afclp has a zinc metalloprotease consensus sequence (VxxHExxH) (SEQ ID NO: 50) located at positions 294-301. In the alignment shown in Figure 9, the last histidine of the zinc metalloprotease consensus sequence of yAfclp, (H301; histidine at position 301) aligns with the histidine at position 172 (H172) of hRcelp. The sequence similarity of the histidine-rich region of hRcelp to the region of yAfclp comprising the presumptive zinc-binding site which functions in catalysis suggests that these regions serve similar biochemical functions. These similar histidine-rich regions may be involved in substrate binding and/or catalysis. Given the presence of histidine-rich sequences that are functionally important in both proteins and the diverse utilization of histidine residues by protease enzymes in binding zinc, a role in metal binding and hence, catalysis, is reasonable. Afclp in all respects appears to be a zinc- dependent metalloprotease based both upon amino acid sequence analysis as well as enzymatic sensitivity to the zinc chelator, o-phenanthroline (Boyartchuk et al., supra) . Reel has no similar sensitivity to o- phenanthroline nor to the chelator EDTA. Reports in the literature support the notion that the mammalian CAAX protease is either a serine or cysteine protease based upon its sensitivity to a farnesylated peptide aldehyde (Ma, Gilbert and Rando, Biochemistry. 32, pp. 2386-2393 (1993) ) and to the thiol modifying reagents PCMB and NEM (Hrycyna and Clarke, J. Biol. Chem., 267, pp. 10457-464 (1992) ; Ma, Chaudhuri and Rando, Biochemistry, 31, pp. 11772-777 (1992); Akopyan et al., BBRC.198 , pp. 787-794 (1994) ) .
Given the modest sensitivity of the mammalian CAAX endoprotease to thiol-modifying reagents and the similarity of hRcelp to Afclp (Figure 9) , it is tempting to speculate that Rcelp is related to the insulin degrading enzyme, IDE1. IDE1 is a zinc-dependent thiol protease that is a member of a family of such proteases that includes the E. coli protease III (Affholter, J.A., Fried, V.A., Roth, R.A., Science. 242, pp. 1415-1418 (1988)), and the yeast protease Axllp (Adames, N., Blundell, K., Ashby, M.N., Boone, C, Science, 270, pp. 464-467 (1995)). However, the sequence of the yeast or human Reel protein do not show any significant sequence similarity to any members of this family. Perhaps the greatest clue as to the functional properties of Rcelp comes from the observations that Rcelp
2+ is inhibited by Zn (Ki~100mM) (Ashby, M.N., King, D.S.,
Rine, J., Proc. Natl. Acad. Sci. U.S.A., 89, pp. 4613-4617
(1992)), that the conserved histidine residues of hRcepl (H172, H174, H175) are functionally important (see below) , and that the same histidine-rich region shows similarity to Afclp around its putative zinc-binding site described herein. These latter observations strongly suggest a role of metal-binding in the function of hRcelp. Zinc is the most likely metal ion to associate with hRcelp, although another divalent cation may be involved, such as cobalt, calcium or magnesium.
The notion that hRcelp requires zinc-binding, or another divalent cation, will provide valuable information in the design of potent inhibitors. For instance, hydroxamic acid derivatives have found significant utility for the inhibition of zinc proteases such as the matrix etalloproteinase (Levy, D.E. et al., J. Med. Chem. , 41, pp. 199-223 (1998) ) and angiotensin-converting enzyme (Parvathy, S. et al., Biochem. J., 327, pp. 37-43 (1997)). Hydroxamic acid confers it inhibitory effect on zinc metalloproteases through its metal chelating properties. Possible hydroxamic acid hRcelp inhibitors would thus include a prenylated cysteine amino acid linked to a hydroxa ate group at it carboxy terminus. The amino group of the inhibitor would be preferably linked to a small aliphatic group such as methyl, or more preferably to acetyl, or linked to a small number of amino acids, or left unmodified. The prenyl group would be linked to the cysteine residue through the sulfur group of the cysteine and would be composed of a trans, trans, trans- geranylgeranyl lipid, or more preferably, a trans, trans- farnesyl lipid.
Mutagenesis of the Histidine-Rich Region of hRCEl
To determine whether the histidine-rich sequence "HxHH" of the human Reel protein (SEQ ID NO: 2, amino acids 172-175) are required for biological activity, we changed each of these histidine residues individually to an alanine residue (H172A, H174A and H175A) by site-directed mutagenesis (Example 7) . Each mutant hRCEl gene was inserted into a high-copy expression vector and tested in parallel with a wild-type hRCEl expression construct for function in yeast by their ability to restore production of -factor as determined by halo assays in strain JRY5317 (deleted for the two endogenous CAAX proteases) (see Figure 6 and accompanying text, above, and Example 3) .
As shown in Figure 10, cells harboring the wild- type hRCE expression plasmid produced significant amounts of biologically active a.-factor producing a visible -halo (pAB13; also referred to as pACA7002 in Figure 6) . In contrast, cells harboring mutant hRCE expression plasmids in which the histidines at position 172, 174 or 175 were individually replaced by alanine produced significantly less biologically active a-factor, as evidenced by the reduced (pAB16; H174A and pAB17; H175A) or undetectable (pAB15; H172A) halos (Figure 10) . This experiment demonstrates that histidine residues H172, H174 and H175, comprising the histidine-rich region of human hRCEl (conserved between RCEl and AFCl ) contribute to biological activity of hRCElp protein in vivo.
The ability for the above wild-type and mutant hRCEl plasmids to express biologically active CAAX protease in vivo was also measured by heat shock assays in yeast cells transformed with a plasmid expressing the
19 mutant RAS2-val gene (dominant RAS2 allele) (see Figure
11 and Example 9) .
As described above, a deletion in the RCEl gene suppresses the temperature-sensitive phenotype of cells
19 expressing activated Ras2p-val (i.e., Rcelp activity is required for the functional expression of Ras2p) . Heat-
19 shock assays in cells expressing activated Ras2p-val may thus be used to assess Rcelp biological activity in vivo
(see Example 9) . As shown in Figure 11, cells that
19 express activated Ras2p-val in the presence of a functional RCEl gene (WT; wild-type yRCEl and pAB13; hRCEl ) are temperature-sensitive after heat shock and have reduced assay or with a wild-type Percent (%) viability after heat shock treatment is shown for each strain .
Vectors, Cloning and Nucleic Acids
The present invention provides nucleic acids and recombinant DNA vectors which comprise mammalian RCE and AFC DNA sequences. More specifically, vectors comprising all or portions of the cDNA sequences of the human and mouse RCE and AFC functional homologs described above are provided. The vectors of this invention also include those comprising DNA sequences which hybridize under stringent conditions to the human and mouse RCE and AFC cDNA sequences, and conservatively modified variations thereof.
The nucleic acids of this invention include single stranded and double-stranded genomic DNA, cDNA, RNA, or hybrids thereof and may be isolated from biological sources or synthesized chemically or by recombinant DNA methodology. The nucleic acids, recombinant DNA molecules and vectors of this invention may be present in transformed or transfected cells, cell lysates, or in partially purified or substantially pure forms .
As is well known in the art, DNA sequences may be expressed by operatively linking them to an expression control sequence in an appropriate expression vector and employing that expression vector to transform an appropriate unicellular host. Such operative linking of a DNA sequence of this invention to an expression control sequence, of course, includes, if not already part of the DNA sequence, the provision of a translation initiation codon, ATG, in the correct reading frame upstream of the DNA sequence.
A wide variety of host/expression vector combinations may be employed in expressing the DNA sequences of this invention. Useful expression vectors, for example, may consist of segments of chromosomal, non- chromosomal and synthetic DNA sequences.
DNA vector design for transfection into mammalian cells should include appropriate sequences to promote expression of the gene of interest, including: appropriate transcription initiation, termination and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence); sequences that enhance protein stability; and when desired, sequences that enhance protein secretion. A great number of expression control sequences — constitutive, inducible and/or tissue-specific — are known in the art and may be utilized.
Preferred DNA vectors also include a marker gene and means for amplifying the copy number of the gene of interest. DNA vectors may also comprise stabilizing sequences (e.g., ori- or ARS-like sequences and telomere- like sequences) , or may alternatively be designed to favor directed or non-directed integration into the host cell genome . Substantial progress in the development of mammalian cell expression systems has been made in the last decade and many aspects of the system are well characterized. A detailed review of the production of foreign proteins in mammalian cells, including useful cells, protein expression-promoting sequences, marker genes, and gene amplification methods, is disclosed in M. M. Bendig, Genetic Engineering, 7, pp. 91-127 (1988) .
Particular details of the transfection, expression and purification of recombinant proteins are well documented and are understood by those of skill in the art. Further details on the various technical aspects of each of the steps used in recombinant production of foreign genes in mammalian cell expression systems can be found in a number of texts and laboratory manuals in the art. See, e.g., F. M. Ausubel et al., ed., Current
Protocols in Molecular Biology, John Wiley & Sons, New York (1989) .
Useful expression vectors for bacterial hosts include bacterial plasmids, such as those from E. coli , including pBluescript, pGEX-2T, pUC vectors, col El, pCRl, pBR322, pMB9 and their derivatives, wider host range plasmids, such as RP4, phage DNAs, e.g., the numerous derivatives of phage lambda, e.g., NM989, λGTIO and λGTll, and other phages, e.g., M13 and Filamenteous single stranded phage DNA. Useful expression vectors for yeast cells include the 2μ plasmid and derivatives thereof. Useful vectors for insect cells include pVL 941.
In addition, any of a wide variety of expression control sequences — sequences that control the expression of a DNA sequence when operatively linked to it — may be used in these vectors to express the DNA sequences of this invention. Such useful expression control sequences include the expression control sequences associated with structural genes of the foregoing expression vectors. Examples of useful expression control sequences include, for example, the early and late promoters of SV40 or adenovirus, the lac system, the trp system, the TAC or TRC system, the T3 and T7 promoters, the major operator and promoter regions of phage lambda, the control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast a- mating system, the GAL1 or GAL10 promoters, and other constitutive and inducible promoter sequences known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof. See, e.g., The Molecular Biology of the Yeast Saccharo yces (eds. Strathern, Jones and Broach) Cold Spring Harbor Lab., Cold Spring Harbor, N.Y. for details on yeast molecular biology in general and on yeast expression systems (pp. 181-209) (incorporated herein by reference) ) .
For eukaryotic cells, expression control sequences typically include a promoter, an enhancer derived from immunoglobulin genes, SV40, cytomegalovirus, etc., and a polyadenylation sequence which may include splice donor and acceptor sites.
In a preferred embodiment, DNA sequences of this invention are cloned in the expression vector lambda ZAP II (Stratagene, La Jolla, CA) , in which expression from the lac promoter may be induced by IPTG.
In another preferred embodiment, DNA sequences of this invention are inserted in frame into an expression vector that allows high level expression of the corresponding polypeptide as a fusion protein. Examples of fusion proteins envisioned by this invention include but are not limited to fusions to reporter genes, such as the green fluorescent protein, lacZ and luciferase, and fusions to epitope tags such as myc, hemagglutinin (HA) or other related short immunogenic sequences. The DNA sequences of this invention may be linked by the acid labile cleavage site (Asp-Pro) to a leader sequence (such as the "MLE leader") suitable for promoting expression in E. coli . Full-length and truncated forms of the DNA sequences of this invention may be prepared as fusion constructs. A wide variety of unicellular host cells are useful in expressing the DNA sequences of this invention. These hosts may include well known eukaryotic and prokaryotic hosts, such as strains of E. coli , Pseudomonas, Bacill us, Strepto yces, fungi, yeast, insect cells such as Spodoptera frugiperda (SF9) , animal cells such as CHO and mouse cells, African green monkey cells such as COS 1, COS 7, BSC 1, BSC 40, and BMT 10, and human cells, as well as plant cells in tissue culture.
It should of course be understood that not all vectors and expression control sequences will function equally well to express the DNA sequences of this invention. Neither will all hosts function equally well with the same expression system. However, one of skill in the art may make a selection among these vectors, expression control sequences and hosts without undue experimentation and without departing from the scope of this invention. For example, in selecting a vector, the host must be considered because the vector must be replicated in it. The vector's copy number, the ability to control that copy number, the ability to control integration, if any, and the expression of any other proteins encoded by the vector, such as antibiotic or other selection markers, should also be considered.
In selecting an expression control sequence, a variety of factors should also be considered. These include, for example, the relative strength of the sequence, its controllability, and its compatibility with the DNA sequence of this invention, particularly with regard to potential secondary structures. Unicellular hosts should be selected by consideration of their compatibility with the chosen vector, the toxicity of the product coded for by the DNA sequences of this invention, their secretion characteristics, their ability to fold the polypeptide correctly, their fermentation or culture requirements, and the ease of purification from them of the products coded for by the DNA sequences of this invention.
Within these parameters, one of skill in the art may select various vector/expression control sequence/host combinations that will express the DNA sequences of this invention on fermentation or in other large scale cultures.
Given the strategies described herein, one of skill in the art can construct a variety of vectors and nucleic acid molecules comprising functionally equivalent nucleic acids. DNA cloning and sequencing methods are well known to those of skill in the art and are described in an assortment of laboratory manuals, including Sambrook et al, supra, 1989; and Current Protocols in Molecular Biology, F.M. Ausubel et al., eds . , Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (1994 Supplement) . Product information from manufacturers of biological, chemical and immunological reagents also provide useful information.
Recombinant Mammalian RCE and AFC Polypeptides and Antibodies Directed Against Them.
The recombinant DNA molecules and more particularly, the expression vectors of this invention may be used to express recombinant RCE and AFC polypeptides in a heterologous host cell. The RCE and AFC polypeptides of this invention may be full-length or less than full-length recombinant polypeptide fragments recombinantly expressed from the DNA sequences according to this invention. Such polypeptides include variants, fragments and uteins having biological activity. The polypeptides of this invention may be soluble, or may be engineered to be membrane- or substrate-bound using techniques (such as cross-linking) well known in the art. The polypeptides of this invention may be fused to other molecules, such as genetic, enzymatic or chemical or immunological markers such as epitope tags. Fusion partners include, inter alia, GST, immunoglobulins, bacterial beta-galactosidase, trpE, protein A, β-lactamase, amylase, maltose binding protein, alcohol dehydrogenase, polyhistidine (for example, six histidine at the amino and/or carboxyl terminus of the polypeptide) , green fluorescent protein (GFP) , yeast α mating factor, GAL4 transcription activation or DNA binding domain, and luciferase. See, e.g., Godowski et al . , 1988.
Fusion proteins will typically be made by either recombinant nucleic acid methods, as described above, or may be chemically synthesized. Techniques for the synthesis of polypeptides are described, for example, in Merrifield, 1963.
Transformation and other methods of introducing nucleic acids into a host cell (e.g., transfection and viral infection) can be accomplished by a variety of methods which are well known in the art. Animal (and yeast) cells may be transfected with a plasmid, a cosmid or the like, infected with a viral vector of this invention for transient or stable expression of RCE or AFC polypeptides. For example, for transient expression in animal cells, investigators have used spheroplast fusion, DEAE dextran, and electroporation. For stable expression they have used calcium phosphate, spheroplast fusion, and electroporation.
The polypeptide gene products of the mammalian RCE or AFC genes of this invention may be used to produce polyclonal or monoclonal antibodies using a variety of techniques well known to those of skill in the art.
Alternatively, peptides corresponding to specific regions of the RCE or AFC polypeptides may be synthesized and used to create immunological reagents according to well known methods . Antibodies directed against the mammalian RCE and AFC polypeptides of this invention are immunoglobulin molecules or portions thereof that are immunologically reactive with at least one novel mammalian RCE or AFC polypeptide of the present invention. It should be understood that the antibodies of this invention include antibodies immunologically reactive with fusion proteins and multimeric proteins comprising a novel mammalian RCE or AFC polypeptide.
Antibodies directed against a novel mammalian RCE or AFC polypeptide may be generated by immunization of a mammalian host. Such antibodies may be polyclonal or monoclonal. Preferably they are monoclonal. Methods to produce polyclonal and monoclonal antibodies are well known to those of skill in the art. For a review of such methods, see Antibodies, A Laboratory .Manual , supra, and
D.E. Yelton, et al., Ann. Rev, of Biochem., 50, pp. 657-80 (1981) . Determination of immunoreactivity with a novel mammalian RCE or AFC polypeptide of this invention may be made by any of several methods well known in the art, including by immunoblot assay and ELISA.
—8 Monoclonal antibodies with affinities of 10
—1 — — TO —1
M or preferably 10 to 10 M or stronger are typically made by standard procedures as described, e.g., in Harlow and Lane, 1988 or Goding, 1986. Briefly, appropriate animals are selected and the desired immunization protocol followed. After the appropriate period of time, the spleens of such animals are excised and individual spleen cells fused, typically, to immortalized myeloma cells under appropriate selection conditions. Thereafter, the cells are clonally separated and the supernatants of each clone tested for their production of an appropriate antibody specific for the desired region of the antigen.
Other suitable techniques involve in vitro exposure of lymphocytes to the antigenic polypeptides, or alternatively, to selection of libraries of antibodies in phage or similar vectors. See Huse et al., 1989. The polypeptides and antibodies of the present invention may be used with or without modification. Frequently, polypeptides and antibodies will be labeled by joining, either covalently or non-covalently, a substance which provides for a detectable signal. A wide variety of labels and conjugation techniques are known and are reported extensively in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent agents, chemiluminescent agents, magnetic particles and the like. Patents teaching the use of such labels include U.S. Patents 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149 and 4,366,241. Also, recombinant immunoglobulins may be produced (see U.S. Patent 4,816,567) .
7Λn antibody of this invention may also be a hybrid molecule formed from immunoglobulin sequences from different species (e.g., mouse and human) or from portions of immunoglobulin light and heavy chain sequences from the same species. It may be a molecule that has multiple binding specificities, such as a bifunctional antibody prepared by any one of a number of techniques known to those of skill in the art including: the production of hybrid hybridomas; disulfide exchange; chemical cross- linking; addition of peptide linkers between two monoclonal antibodies; the introduction of two sets of immunoglobulin heavy and light chains into a particular cell line; and so forth. The antibodies of this invention may also be human monoclonal antibodies, for example those produced by immortalized human cells, by SCID-hu mice or other non- human animals capable of producing "human" antibodies, or by the expression of cloned human immunoglobulin genes. In sum, one of skill in the art, provided with the teachings of this invention, has available a variety of methods which may be used to alter the biological properties of the antibodies of this invention including methods which would increase or decrease the stability or half-life, immunogenicity, toxicity, affinity or yield of a given antibody molecule, or to alter it in any other way that may render it more suitable for a particular application.
Assays to Identify Inhibitors of Mammalian CAAX-Processing Enzymes
This invention is directed to methods for identifying inhibitors of mammalian CAAX processing enzymes by in vitro enzyme inhibition assays using the polypeptide expression products of the mammalian RCE and AFC sequences. In particular, this invention provides a heterologous host cell or organism comprising a recombinant DNA molecule (vector) capable of expressing a mammalian gene encoding Rcelp or Afclp. A host cell that overexpresses Rcelp or Afclp is also provided. One preferred host organism is a yeast such as Saccharomyces transformed with a high copy plasmid encoding a RCE or AFC gene. The resulting recombinant organism represents an enriched source for the corresponding mammalian enzyme. Inclusion of such an extract in an in vitro CAAX-protease assay allows the identification of inhibitors that can reduce or eliminate the mammalian CAAX protease activity.
One method for isolating a protein inhibitor (or an activator) of a mammalian CAAX protease is the "two- hybrid system" (see, e.g., CT. Chien et al., "The two-hybrid system: a method to identify and clone genes for proteins that interact with a protein of interest", Proc. Natl. Acad. Sci. U.S.A., 88(21), pp. 9578-82 (1991)) (incorporated herein by reference) (Example 6) . This invention is also directed to methods for identifying inhibitors of mammalian CAAX processing enzymes by screening host cells transformed with the DNA sequences of this invention in in vivo biological activity assays, including but not limited to those described herein.
One preferred in vivo method to identify inhibitors of Rcelp or Afclp first requires identifying genes whose expression requires the presence of the appropriate protease. A gene expression reporter construct may be created and introduced into cells. Such a cell line or strain will allow indirect monitoring of protease function. Incubation of these cells with select candidate protease inhibitors or a combination of candidate inhibitors such as a chemical library facilitates identification of chemicals that alter protease activity in vivo.
Another preferred in vivo method to identify inhibitors of Rcelp or Afclp capitalizes on the extreme sensitivity of co-culture experiments to identify differences in the relative growth rates (or fitness) of organisms. Using this approach, two strains of cells which may possess only a subtle difference are mixed and grown for a number of generations. At various points of the growth period, portions of the co-culture are analyzed to determine the relative proportion of each constituent strain. Very small differences in growth rates between two strains will result in the more fit strain becoming the predominant, if not the exclusive, strain in the co- culture after a number of generations. Co-cultures may be grown and the cells diluted into fresh medium to continually increase the number of cell generations in culture, enabling even very subtle differences in growth rates to be measured.
Such co-culture experiments may be used to identify compounds that are competitive inhibitors of Rcelp or Afclp (Example 8) . Competitive inhibition of an enzyme can be overcome by increasing the amount of enzyme. Therefore, a strain that harbors a significant amount of Rcelp (or Afclp) will be less sensitive to the effects of a competitive inhibitor than will be a strain with a normal amount of Rcelp (or Afclp) .
In one embodiment, a yeast strain is constructed (Strain 1) that harbors a high-copy (2μ) plasmid that encodes the RCEl gene. Preferably, the RCEl gene is driven by a very strong promoter, such as that of the ADHl gene. This RCEl overexpressing strain also possesses, preferably (but not necessarily) on the same high-copy plasmid, a marker gene the presence of which (i.e., the gene or gene product) can be easily detected, such as the green fluorescent protein (GFP) . A second strain (Strain 2) is required for the co-culture experiment and is preferably Strain 1 without the high-copy plasmid carrying the RCEl gene (and optionally comprising a marker gene) .
Strain 1 and Strain 2 are grown independently to mid-log phase and then mixed at a selected ratio to create an appropriate co-culture. Preferably, the mixture will contain a large excess of Strain 2 relative to Strain 1 (e.g., 5,000:1; see Example 8). Because Strain 1 (which harbors the multi-copy plasmid encoding RCEl and GFP) is present in such low abundance, the overall mixture will not exhibit significant fluorescence. During growth of the co-culture, if a situation were encountered where Strain 1 has a growth advantage over Strain 2, the relative abundance of Strain 1 would increase and thereby cause an easily identified increase in fluorescence of the mixture.
The co-cultures are aliquoted into separate growing chambers (e.g., the wells of 96-well plates). To identify inhibitors of Rcelp, a selected concentration or range of concentrations from selected putative inhibitors (e.g., portions of a chemical library) are added to individual wells of 96-well plates which contain aliquots of the co-culture. Both of these steps lend themselves well to automation and large numbers of chemicals may be tested in parallel. The co-cultures containing added test chemicals are incubated until the cells have divided numerous times. At various points during the growth period, the relative abundance of Strain 1 in the mixture may be assessed by fluorescence emission from the 96-well plates. The cells are diluted into fresh medium and allowed to continue to grow. This regimen of growth followed by dilution into fresh medium may be repeated until the cells have experienced multiple generation doublings (e.g., 2-20). Preferably, the cells undergo 10 generation doublings. More preferably, the cells undergo 5 generation doublings. The preferred number of generation doublings of the co-culture may be determined as described in Example 8. Preferably, the concentrations and ratios of Strains 1 and 2 in the beginning co-culture are selected to avoid the need to dilute the cultures, which involves more work and time.
In a more preferred embodiment of the above assay, Strain 1 and Strain 2 do not express (e.g., are deleted for) the endogenous RAS1 gene and possess the ras2-23 allele in place of RAS2. The ras2-23 allele, for example, is temperature sensitive and does not adequately function at 37°C to confer viability to a rasl : :HIS3 strain (H. Mitsuzawa et al., Genetics, 123, pp. 739-748 (1989)). At a semi-permissive temperature (34°C) , growth of this strain depends upon the function of Rcelp (V.L. Boyartchuk et al., Science, 275, pp. 1796-1800 (1997)). Therefore, the presence of the ras2-23 allele, in the absence of RAS1, sensitizes Strain 1 and Strain 2 such that their growth rates depend upon the function of Rcelp.
Utility of Mammalian RCE and AFC Nucleic Acid Sequences, Polypeptides, and Antibodies And Inhibitors Thereof
The sequences of hRCE (SEQ ID NO:l), mRCE (SEQ ID NO: 3 or SEQ ID NO: 5) and hAFC (SEQ ID NO: 37) — in their entirety or fragments thereof — may be used in hybridization experiments to: A) determine RCE expression levels in various tissues, different developmental or disease states or other conditions; B) isolate and clone related genes, mRNA or cDNAs which hybridize to those sequences; and C) isolate related or identical mRNA sequences for the purpose of in vitro translation. The sequences comprising hRCE, mRCE or hAFC may also be used to construct fusion proteins in order to address the localization of each of these proteins in vivo. Such fusion proteins may additionally contain a reporter gene, such as the green fluorescent protein, lacZ or luciferase. Alternatively, such sequences may be fused to an epitope tag, such as myc, hemagglutinin (HA) or any other related short sequences. The above DNA sequence manipulations may be accomplished using methods which are well known to those of skill in the art.
RCE and AFC sequences may be used to construct vectors that express protein products having biological activity (e.g., CAAX processing) in a heterologous organism, preferably, in a unicellular organism. Expression products may be purified by conventional means which are well known to those of skill in the art. Mammalian RCE and AFC DNA sequences or their expression products may be purified and then immobilized to beads or solid support matrices which can then be used to isolate and to identify cellular factors which bind to those DNA or amino acid sequences.
The hRCE, mRCE or hAFC sequences (in their entirety or fragments thereof) may be expressed in a heterologous organism to:
A) examine many biochemical features of the protein such as catalytic mechanism, Kcat, Km, Vmax, and oligomeric status;
B) determine the substrate specificities of the CAAX processing enzyme such as the tolerance of non-aliphatic residues at the Al and A2 position, and permissive and non-permissive substitutions at the X position, lipid specificities attached to the cysteine and amino acid sequence specificities N-terminal of the cysteine; C) create an in vitro assay for the purpose of identifying and characterizing inhibitors of the protease;
D) create antibodies (either polyclonal or monoclonal) , and hybrid, chimeric and humanized antibodies, for the purpose of localization of protein or to selectively inactivate protease activity;
E) assess the functional importance of Ras CAAX processing in yeast. Yeast which express an activated allele of Ras display a heat shock sensitivity when exposed to an acute treatment at 55°C The endogenous protease gene products may be inactivated (e.g., by mutation of the AFCl and RCEl genes) by standard methods, which causes a suppression of the heat-shock sensitivity. Normal, mutant and altered forms of the hRCE or mRCE cDNAs may be expressed in yeast cells lacking endogenous AFCl and RCEl activities (e.g., the deletion strains described herein) and the yeast cells tested for relative sensitivity to heat-shock.
In addition, hRCE, mRCE or hAFC sequences may be used to create an antisense vector. The antisense vector may be used to synthesize an RCE or AFC antisense molecule to specifically inactivate gene expression and thereby to inactivate the production of the RCEp or AFCp proteins. Such vectors could be constructed such that it is operably linked and under the control of an inducible, tissue specific, developmental or constitutive promoter. Thus, RCE or AFC activity could be selectively turned on or off at desired locations that would therapeutically effect a CAAX processing-mediated or affected diseased state or disorder in a patient. Similarly, the hRCE, mRCE or hAFC sequences of this invention may be used to construct specific ribozymes that are directed to the endogenous RCE or hAFC mRNAs . Methods for constructing various types of ribozymes are published and may be performed by those of skill in the art using the RCE or hAFC sequences described herein.
The mouse mRCE cDNA sequences of this invention may be used to construct a genetic knockout in a mouse. The genetically altered mouse may in turn be used as a mammalian model for human diseases and disorders involving proteins processed by the RCE gene product. Such genetically altered mice will thus be useful in determining the functional importance of mRCE in various mammalian disease states and disorders, and to assess the therapeutic value of inhibitors of mRCE or other therapeutic agents on the diseased state in mouse. An example of one such diseased state is the tumorigenesis induced in mice by expressing activated forms of Ras
. . .„ vall2 vall2 protein (H-ras or K-ras ) .
The antibodies of this invention have a variety of uses. For example, they are useful as reagents to screen for expression of mammalian RCE or AFC polypeptides, either in expression libraries constructed from mammalian DNA sources or from other samples in which the proteins may be present. Moreover, by virtue of their specific binding affinities, the antibodies of this invention are also useful to purify or remove polypeptides from a given sample, and to block or bind to specific epitopes on the polypeptides.
Such immunological reagents may be used for a variety of applications, some examples of which are below.
1. Determine protein levels of Rcelp and Afclp under different conditions in a particular tissue. The assay may be performed by an immobilization-based protocol such as an immunoblot where the proteins are resolved by SDS-PAGE and transferred to a solid support or by an ELISA assay. Such an approach would reveal steady state levels of the proteins. The synthesis rate and protein stability of the respective proteins could be determined by a pulse-chase experiment comprising an immunoprecipitation step using an antibody of this invention.
2. Determine the apparent molecular weights of Rcelp and Afclp. An immunoblot of protein extracts that have been resolved by SDS-PAGE may be used to identify the size of the proteins when they are expressed in a particular cell type or tissue source or when they are expressed recombinantly in, for example, a unicellular host.
Deviations from the molecular weight predicted from the amino acid sequence may reveal polypeptide products from alternatively spliced mRNAs or post-translational processing events such as proteolysis, glycosylation, lipidation or phosphorylation.
3. Subcellular localization of RCE and AFC polypeptides. Tissue or cell extracts may be fractionated into cellular components, the proteins resolved by electrophoresis and detected using the immunological reagents of this invention. Association of the protein with a specific cellular organelle may reveal the sub- cellular localization of the protease. Alternatively, antibodies directed against Rcelp or Afclp could be labeled with a fluorescent tag such as fluorescein and incubated with permeabilized cells in culture. Fluorescent microscopy could then be employed to identify the localization of the protein in vivo.
4. Determining the functional requirement of Ras C-terminal proteolysis. Micro-injection of activated forms of Ras into NIH-3T3 cells or Xenopus oocytes results in a rapid and quantifiable phenotype for assaying Ras function (Schafer et al.. Science, 245, pp. 379-385
(1989) ) . Co-injecting an Rcelp inactivating antibody with an unprocessed form of activated Ras would allow the determination of the functional requirement of Ras proteolytic processing for Ras biological function.
5. Identifying the topology of Rcelp or Afclp within a lipid bilayer. Monoclonal antibodies directed against discrete regions of either polypeptide could be incubated with permeabilized cells or membrane fractions to map regions of the respective proteins that are cytosol-facing or cytosol-excluded. Cytosol-facing domains would be accessible for interaction with epitope-specific antibodies and could be identified by fluorescent microscopy or immunoblotting techniques.
6. Confirming protein-protein interactions. Candidate Rcelp- or Afclp-interacting proteins identified genetically or by an assay such as the "two-hybrid screen" could be tested for their ability to interact in vitro. Cells ectopically expressing the candidate protein and the respective protease could be cross-linked and immunoprecipitated with protein- specific antisera. Once precipitated, the complexes could be resolved by SDS-PAGE and immunoblotted with antisera directed against the second member of the putative complexes .
All of the above utilities apply to the corresponding use of the AFC sequences, polypeptides and inhibitors and antibodies of those polypeptides according to this invention whether or not specifically referred to above. They also apply to the corresponding use of related sequences which may be identified, for example, by hybridization to the sequences of this invention, and their corresponding expression products. This invention is also directed to methods for identifying inhibitors of mammalian CAAX processing enzymes by screening host cells transformed with the DNA sequences of this invention in in vivo biological activity assays such as those described herein, or by in vitro enzyme inhibition assays using the polypeptide expression products of the mammalian RCE and AFC sequences. Inhibitors of the mammalian CAAX processing enzymes will be useful as therapeutics for treating certain CAAX protein-mediated diseased states or disorders in a patient, for example, for treating cancers or tumors correlated to Ras activation. In the therapeutic method of this invention, a small molecule inhibitor of a mammalian Rcelp or Afclp protein is administered to a patient suffering from a CAAX protein-mediated disease or disorder.
The inhibitors of mammalian CAAX processing enzymes of this invention may be formulated into pharmaceutical compositions and administered in vivo at an effective dose to treat the particular clinical condition addressed. Determination of a preferred pharmaceutical formulation and a therapeutically efficient dose regiment for a given application is within the skill of the art taking into consideration, for example, the condition and weight of the patient, the extent of desired treatment and the tolerance of the patient for the treatment.
Administration of the inhibitors of mammalian CAAX processing enzymes of this invention, including isolated and purified forms, their salts or pharmaceutically acceptable derivatives thereof, may be accomplished using any of the conventionally accepted modes of administration of agents which are used to treat, for example, tumors and cancers.
The pharmaceutical compositions of this invention may be in a variety of forms, which may be selected according to the preferred modes of administration. These include, for example, solid, semi- solid and liquid dosage forms such as tablets, pills, powders, liquid solutions or suspensions, suppositories, and injectable and infusible solutions. The preferred form depends on the intended mode of administration and therapeutic application. Modes of administration may include oral, parenteral, subcutaneous, intravenous, intralesional or topical administration.
The inhibitors of this invention may, for example, be placed into sterile, isotonic formulations with or without cofactors which stimulate uptake or stability. The formulation is preferably liquid, or may be lyophilized powder. For example, the inhibitors may be diluted with a formulation buffer comprising 5.0 mg/ml citric acid monohydrate, 2.7 mg/ml trisodium citrate, 41 mg/ml mannitol, 1 mg/ml glycine and 1 mg/ml polysorbate 20. This solution can be lyophilized, stored under refrigeration and reconstituted prior to administration with sterile Water-For-Injection (USP) . The compositions also will preferably include conventional pharmaceutically acceptable carriers well known in the art (see for example Remington's Pharmaceutical Sciences, 16th Edition, 1980, Mac Publishing Company) . Such pharmaceutically acceptable carriers may include other medicinal agents, carriers, genetic carriers, adjuvants, excipients, etc., such as human serum albumin or plasma preparations. The compositions are preferably in the form of a unit dose and will usually be administered one or more times a day. The pharmaceutical compositions of this invention may also be administered using microspheres, liposomes, other microparticulate delivery systems or sustained release formulations placed in, near, or otherwise in communication with affected tissues or the bloodstream. Suitable examples of sustained release carriers include semipermeable polymer matrices in the form of shaped articles such as suppositories or microcapsules. Implantable or microcapsular sustained release matrices include polylactides (U.S. Patent No. 3,773,319; EP 58,481), copolymers of L-glutamic acid and gamma ethyl-L-glutamate (Sidman et al., Biopolymers, 22, pp. 547-56 (1985)); poly (2-hydroxyethyl-methacrylate) or ethylene vinyl acetate (Langer et al., J. Bio ed. Mater. Res. , 15, pp. 167-277 (1981); Langer, Chem. Tech., 12, pp. 98-105 (1982) ) . Liposomes containing inhibitors of CAAX processing enzymes may be prepared by well-known methods (See, e.g. DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. U.S.A.. 82, pp. 3688-92 (1985); Hwang et al., Proc. Natl. Acad. Sci. U.S.A.. 77, pp. 4030-34 (1980); U.S. Patent Nos. 4,485,045 and 4,544,545). Ordinarily the liposomes are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content is greater than about 30 mol.% cholesterol. The proportion of cholesterol is selected to control the optimal rate of MAG derivative and inhibitor release.
The inhibitors of mammalian CAAX processing enzymes of this invention may also be attached to liposomes, which may optionally contain other agents to aid in targeting or administration of the compositions to the desired treatment site. Attachment of inhibitors to liposomes may be accomplished by any known cross-linking agent such as heterobifunctional cross-linking agents that have been widely used to couple toxins or chemotherapeutic agents to antibodies for targeted delivery. Conjugation to liposomes can also be accomplished using the carbohydrate-directed cross-linking reagent 4- (4- maleimidophenyl) butyric acid hydrazide (MPBH) (Duzgunes et al., J. Cell. Biochem. Abst. Suppl . 16E 77 (1992)).
The following are examples which illustrate the invention. These examples should not be construed as limiting: these examples are included for purposes of illustration only.
Example 1 : Cloning and Sequencing of the Human hRCE cDNA
Isolation of 13R/27R λgtll clones
To isolate the human cDNA counterpart to the yeast RCEl gene, we screened a λgtll cDNA library made from a 9-week old human fetus. The library (ATCC #77433) was purchased from the American Type Culture Collection.
Approximately 6 x 10 phage were plated onto the host E. coli strain LE392 and lifted onto Biodyne nylon membranes (ICN) by standard methodology (see, e.g., Sambrook et al..
Molecular Cloning — A Laboratory Manual [2nd ed.] Vol. 1-3
(1989) ) .
A plasmid vector containing the human EST
(ID#358628) with similarity to the yeast Rcelp protein was purchased from Research Genetics and served as a probe.
The EST was recovered from the vector by digesting with
.EcoRI restriction enzyme and agarose gel purification. The EST, approximately 750 base pairs in length, was
32 labeled with P by random priming and hybridized to the phage lifts at high stringency according to the nylon membrane manufacturers specifications at 65°C for 18 hours. The hybridization buffer consisted of 5X Denhardt's solution, 5X SSC, 0.1% SDS, and 100 μg/ml denatured salmon sperm DNA. The filters were washed at high stringency at 65°C in 0.1% SDS and 0.1% SSC and exposed to autoradiography . Two plaques, 13R and 27R, which showed a strong hybridization signal to the human EST (#358628) were purified to homogeneity by two successive rounds of plating and hybridization according to standard procedures . Phage stocks were prepared by overlaying plates of E. coli strain LE392 that had been infected with λgtll clones 13R and 27R with 20 mis SM. After gentle shaking at room temperature, approximately 12 mis was drawn off each plate, the cells were pelleted, and the stocks were stored at 4°C with a few drops of chloroform. The stocks
3 were titered with LE392 and determined to be 2.6x10 pfu/μl (13R) and l.δxlO3 pfu/μl (27R) .
To prepare DNA from these two clones, LE392 cells (fresh overnight culture concentrated in 40% volume in SM) were infected with 100,000 pfu of 13R and 27R. The infection was incubated for 30 minutes at 42°C, plated on large LB plates with top agar, and incubated at 42°C overnight. The plates were then overlayed with 15 mis λ diluent and allowed to shake gently for 2 hours at room temperature. The λ diluent was then drawn off and the cells pelleted out. DNA was then prepared following standard procedures (see, e.g., Sambrook et al., Molecular Cloning — A Laboratory Manual [2nd ed.] Vol. 1-3 (1989)), and resuspended in TE.
PCR Amplification The 5' ends of the λ clones were PCR amplified using two different internal primers (608RI, 5'- GGTTGCGTACAGACACTCCGTC-3' (SEQ ID NO: 39); RCE 1R-RI, 5'- TTATTATACCACTTTTACCCG-3' (SEQ ID NO: 40)) and an external λ primer (OMS178, 5' -TGGCGCAGGTAGCAGAG-3' (SEQ ID NO: 41) ) using Vent DNA polymerase) . The PCR reactions were then treated with Taq DNA polymerase for 10 minutes at 72°C to produce A overhangs. The reactions were extracted with phenol/chloroform, precipitated with NaOAc/EtOH and resuspended in TE. The Invitrogen® Original TA Cloning® kit was used to clone the PCR products into pCR® 2.1 and transformed into INVαF' . Five clones containing inserts were identified: 13R/608-178-1, -3, 13R/RCE-178-3, 27R/608-178-1, -3. These five clones were sequenced using the ABI 377 sequencer.
The 3' ends of the λ clones were subcloned by PCR amplifying the ends with an internal primer (8F, 5'- CTGCCTCACAGACATGCGTTG-3' (SEQ ID NO: 42)) and an external λ primer (OMS179, 5' -GCGAAATACGGGCAGACATGG-3' (SEQ ID NO: 43) ) using Taq DNA polymerase. The Invitrogen® Original TA
Cloning® kit was used to clone the PCR products into pCR® 2.1 and transformed into INVαF'. Three clones containing inserts were identified: 13R/8F-179-1, 27R/8F-179-1, -2. These three clones were sequenced using the ABI 377 sequencer.
Additional sequence information for the human hRCE cDNA was obtained by 5' and 3' RACE methods (Siebert et al.. Nucleic Acids Research, 23, pp. 1087-1088 (1995)). Adaptor-ligated cDNA constructed from total human fetus mRNA (#7438-1), human placenta (#7411-1) and murine 15-day embryo (#7459-1) were purchased from Clonetech (Palo Alto, CA) . Reactions were carried out under the recommended conditions of the manufacturer which included the following PCR conditions: IX KlenTaq PCR buffer (Clonetech), 200μM dNTPs, 200nM oligonucleotide primers,
10 pg/μl cDNA, and 5 units of KlenTaq polymerase in a 50μl reaction volume in the presence or absence of 5% DMSO. All PCR reactions included the primer API (Clonetech) which is complementary to the cDNA adapter sequence and a gene specific primer. PCR reactions with human cDNA included one of the gene specific primers hRCE(lR) (SEQ. ID NO: 7), hRCE(2R) (SEQ ID NO: 8), hRCE(lOR) (SEQ ID NO: 16) or hRCE(15R) (SEQ ID NO: 21) for 5' RACE reactions and hRCE(3F) (SEQ ID NO: 9) or hRCE(4F) (SEQ ID NO: 10) for 3' RACE reactions. 5' RACE reactions of murine cDNA included API (Clonetech) and one of the following gene specific primers: 608 (1R) (SEQ ID NO:27), 608 (2R) (SEQ ID NO:28) or hRCE(lOR) (SEQ ID NO: 16). All PCR reactions were performed in an ABI 9600 thermocycler . 5' and 3' RACE products were resolved by agarose gel electrophoresis, excised and recovered. The DNA products were subcloned into the T/A cloning vector, pCR2.1 (Invitrogen, Carlsbad, CA) and sequenced.
Example 2: Human hRCE RNA Analysis
Northern Blot Analysis Two multiple tissue mRNA blots (#7760-1, #7759-
1) were purchased from Clonetech and hybridized to the human EST (ID#358628) uniformly labeled with 32P as described in Example 1. Hybridization and washing were done at high stringency at 65°C using the conditions recommended by Clonetech.
Primer Extension Analysis
5 μg human poly (A) RNA from pancreas (Clonetech,
32 catalog # 6539-1) was hybridized to 0.9 pmol of P end- labeled primer (15R, 5' -AGGCTGCCCACGTAGGAGCAGGCGAGGCTGA-3' (SEQ ID NO:21) 2.6x10 cpm/pmol) in hybridization buffer (2 mM Tris [pH 7.6], 0.2 mM EDTA, 250 mM KC1) in a volume of 10 μl. The reaction mixture was placed at 73°C for 2 minutes and then at 53°C for 1 hour. Extension of the primer was initiated by the addition of 40 μl of reverse transcriptase buffer (20 mM Tris [pH 8.7], 10 mM MgCl2, 100 μg/ml actinomycin D, 5 mM dithiothreitol, 0.33 mM deoxynucleoside triphosphates) and 200 units SuperScript™II (GibcoBRL, catalog # 18064-014) . The extension reaction was placed at 37°C for 30 minutes and then terminated with the addition of 300 μl ethanol. The nucleic acids were pelleted and resuspended in 5 μl formamide loading buffer/0.1 M NaOH (2:1). The reaction products were then separated on an 8% polyacrylamide/urea sequencing gel, and visualized by autoradiography. The predominant extension product was approximately 41 nucleotides long.
To control for the possibility of secondary structures at the 5' end of the hRCEl mRNA affecting the primer extension analysis, the reactions were performed utilizing the GeneAmp® Thermostable rTtΛ Reverse Transcriptase RNA PCR kit (Perkin Elmer, catalog # N808- 0069) . Human Poly (A) RNA (5 μg) from pancreas (Clontech, catalog # 6539-1) and placenta (Clontech, catalog # 6518-
32
1) were separately hybridized to 0.9 pmol of P end- g labeled primer (15R, 2.6x10 cpm/pmol (SEQ ID NO:21)) in rTth reverse transcriptase buffer (10 mM Tris [pH 8.3], 90 mM KC1) in a volume of 14.4 μl . The hybridization was carried out at 70°C for 15 minutes. 5.6 μl extension mix (final concentrations: 1 mM MnCl2, 1 mM dATP, 1 mM dCTP, 1 mM dGTP, 1 mM dTTP, 5 units r Tth DNA polymerase) was added to the hybridization reactions and incubated at 70°C for 20 minutes. The reactions were terminated with the addition of 120 μl ethanol and the nucleic acids pelleted. The products were resuspended in 5 μl formamide loading buffer/0.1 M NaOH (2:1). In order to size the primer extension products double stranded sequencing reactions were performed using 15R (SEQ ID NO: 21) as the primer and a subclone of the λ clone 27R (27R/608-178-1, see above) . The sequencing reactions were performed utilizing the Sequenase Version 2.0 DNA Sequencing Kit (United States Biochemical, catalog # US70770) as per the kit instructions. The primer extension products and the sequencing reactions were resolved on an 8% polyacrylamide/urea sequencing gel followed by autoradiography.
Example 3: Expression of the human -RCE cDNA in a yeast mutant
The ORF beginning at position 116 of the hRCE gene (SEQ ID NO:l, nucleotides 116 to 997; Figure 2A) was expressed in a yeast strain deleted for both CAAX protease genes, yAFCl and yRCEl (Boyartchuk et al., supra) (Example 3) . To create the expression construct, we amplified by PCR the human hRCE ORF from the first valine (defined by nucleotides 116-118) to the stop codon (defined by nucleotides 995-997) employing the human hRCE cDNA λ clone 27R as a template. The primers included in the reaction were hRCE-ValX (SEQ ID NO:22) and hRCE3P-RI (SEQ ID NO: 23) . The primer hRCE-ValX incorporated an X£>al restriction site immediately upstream (5') of the GTG valine codon (defined by SEQ ID NO:l, nucleotides 116- 118) . The primer hRCE3P-RI incorporated an £coRI site immediately downstream (3') of the TGA stop codon (defined by SEQ ID NO:l, nucleotides 995-997).
Following amplification, the PCR product was digested with £coRI and X-bal and subcloned into the respective sites of the expression vector, pACAl' or pACA5' . pACAl' contains 1000 base-pairs of the yeast MEV1 promoter including the translational initiation codon ATG. The MEV1 promoter was amplified by PCR from yeast genomic DNA using the following primers:
MEV1-1 (containing HindiII site): 5 ' -GGA AAA GCT TTT GAA GCA CCT GTG GAG T-3' (SEQ ID NO:44); and MEV1-2 (containing Xbal site): 5'-GTA AC GTT CTA- GAC ATT TTG AGT ACG TC-3' (SEQ ID NO: 45) .
Vector pACA5' contains 700 base-pairs of the yeast ERG12 promoter and the first four codons of the ERG12 coding sequence. The ERG12 promoter was amplified by PCR from yeast genomic DNA using the following primers:
ERG12-1 (containing Sphl site): 5'-GGC CGG GCA TGC AAC CTA TTC GAG ATG AAT GCG-3' (SEQ ID NO:46); and
ERG12-2 (containing Xώal site): 5 ' -CCG GCC TCT AGA CGG TAA TGA CAT ATT GAC AGT-3' (SEQ ID NO: 47) .
Both expression vectors include an £coRI-NarI fragment containing the yeast PGKl transcriptional termination sequence. A fragment of DΝA comprising the yeast PGKl termination sequence was amplified by PCR from yeast genomic DΝA using the following primers:
PGK 1 (containing an EcoRI site) : 5 ' -GAT TGA ATT CAA TTG AAA TCG ATA G-3' (SEQ ID ΝO:48); and
PGK 2 (containing a Narl site) : 5' -CCG AGG CGC CGA ATT TTC GAG TTA T-3' (SEQ ID NO: 49) . The resulting plasmids, pACA7001 (alternatively referred to herein as pAB14) and pACA7002 (alternatively referred to herein as pAB13), represent translational fusions between the MEV1 ORF and the ERG12 ORF, respectively. The yeast strain JRY5317 (Boyartchuk et al., supra) is an ^.-mating type strain in which both CAAX protease genes have been deleted. For transformation of the hRCE expression plasmids, JRY5317 was grown overnight in rich medium (2% Bacto Peptone, 1% Yeast Extract, 2% Glucose) . The transformation was performed by the lithium acetate method (Gietz and Schiestl, Yeast, 7, pp. 253-263 (1991) ) . Transformants were selected for uracil prototrophy on minimal medium.
Plasmids pACA7001 and pACA7002 were each transformed into the yeast strain JRY5317, an a-mating type strain in which both CAAX protease genes have been deleted (Boyartchuk et al., supra) (Example 3). Halo assays were performed by replica plating patches of the yeast transformants onto a lawn of α sst2 tester lawn as described (Boyartchuk et al., supra) . Cells harboring an sst2 mutation are rendered supersensitive to mating pheromone and undergo a Gl arrest when exposed to pheromone. This creates a halo around the patch of mated yeast cells.
Example 4: Expression of the human hAFC cDNA in a yeast mutant.
The ORF described herein which encodes a homolog of the yeast AFCl gene (SEQ ID NO: 37; Figure 8) is expressed in a yeast strain deleted for both CAAX protease genes, yAFCl and yRCEl (Boyartchuk et al., supra) as described in Example 3. To create the expression construct, the human AFC ORF (SEQ ID NO: 37) is amplified from the first GCG valine codon to the stop codon at ca. nucleotide 721 (see Figure 7) and inserted into yeast expression vectors as described in Example 3.
Example 5: A screening method for identifying inhibitors of mammalian CAAX processing enzymes .
A surrogate organism that is easy to manipulate and maintain, such as Saccharomyces cerevisiae, may be transformed with an expression vector encoding hRCE or hAFC sequences. Such a recombinant organism may serve as a template for in vivo screening for inhibitors of the protease. Such an assay could be direct or indirect. A direct assay could be created based upon the cleavage of a CAAX-containing proteins such as a-factor in a halo assay or mating assay as described in (Boyartchuk et al., supra) . Another assay could be based upon the requirement of CAAX proteolysis for Ras function. Such an assay would utilize a yeast strain possessing hRCE and an activated allele of Ras such as the Glyl9 to Vall9 mutation. Cells that harbor an activated allele of Ras are susceptible to acute heat shock at 55°C Thus, one could screen for inhibitors of Ras by screening or selecting compounds that cause a heat-shock resistant phenotype.
Alternatively, an assay for hRCE could be based upon an indirect effect of modulating the CAAX protease activity. For example, cells could be transformed with either a normal or non-functional version of hRCE. The expression of other selected genes in an organism could then be assessed. Once a gene was found whose expression was dependent upon the function of hRCE, that gene could then serve as the basis of an assay.
Example 6: A screening method for identifying proteins that interact with mammalian CAAX processing enzymes A protein which physically interacts with a mammalian CAAX protease according to this invention may be isolated using the "two-hybrid system" (see, e.g., CT. Chien et al., "The two-hybrid system: a method to identify and clone genes for proteins that interact with a protein of interest", Proc. Natl. Acad. Sci. U.S.A., 88(21), pp. 9578-82 (1991)). This method takes advantage of the multi-domain structure of the yeast transcriptional activator protein GAL4, which has a DNA binding domain separable from a transcriptional activation domain. Expression plasmids are constructed to encode one of two hybrid proteins. One hybrid consists of the GAL4 DNA-binding domain fused to a mammalian CAAX protease polypeptide of this invention. The second hybrid consists of the GAL4 activation domain fused to protein sequences encoded by a library of genomic DNA or cDNA fragments from a mammalian or other cellular source. (The plasmids are co-transformed into an appropriate yeast strain, or are individually transformed into yeast strains of opposite mating types having genetic markers that allow selection of diploids.) Interaction between the CAAX protease and a protein encoded by one of the library plasmids leads to transcriptional activation of a reporter gene of choice which contains an appropriately positioned GAL4 binding site. The advantage of this method is that the gene which encodes the protein that interacts with the CAAX protease may be identified directly from the library.
Example 7 : Mutagenesis of the Histidine-Rich Region of hRCEl Each histidine comprising the histidine-rich sequence "HxHH" of the human Reel protein (SEQ ID NO: 2, amino acids 172-175) were changed individually to an alanine residue by site-directed mutagenesis.
Plasmids pAB15, pAB16, and pAB17 were constructed as follows: An 888 bp Xbal-EcoRI fragment was cut from pAB13 (see pACA7002, Example 3) and cloned into pBluescript SK. Histidines 172, 174 and 175 were each mutated to an alanine using the QuikChange™ Site-Directed Mutagenesis Kit following manufacturer's directions (Stratagene) . The following complementary pairs of oligonucleotides were used (underlined bases are those that were changed relative to the wild-type sequence (5'- CTC TTT TTT GGA GTT GCC CAT TTT CAC CAT ATT ATT GAG CAG CTG -3') (SEQ ID NO:l, nucleotides 611-655):
For H172A hRCEl-Ml: 5' -CTC TTT TTT GGA GTT GCC ££T TTT CAC CAT ATT ATT GAG CAG CTG-3' hRCEl-M2: 5' -CAG CTG CTC AAT AAT ATG GTG AAA AGC GGC AAC TCC AAA AAA GAG-3'
For H174A hRCEl-M3: 5' -CTC TTT TTT GGA GTT GCC CAT TTT GC_Q CAT ATT ATT GAG CAG CTG-3'
hRCEl-M4: 5' -CAG CTG CTC AAT AAT ATG ≤GC AAA ATG GGC AAC TCC AAA AAA GAG-3'
For H175A hRCEl-M5: 5' -GGA GTT GCC CAT TTT CAC GCT ATT ATT GAG CAG CTG CGT TTC-3'
hRCEl-M6: 5' -GAA ACG CAG CTG CTC AAT AAT AG_£ GTG AAA ATG GGC AAC TCC-3'
Mutations were confirmed by sequencing on an ABI Prism 377 DNA Sequencer using the ABI Prism Dye Terminator Sequencing Kit with AmpliTaq DNA Polymerase, FS . The 888 bp Xbal-EcoRI fragments from those clones with the desired mutations were then cloned back into pAB13 (replacing the corresponding wild-type hRCEl Xbal-EcoRI fragment) generating pABl5 (hRCEl-H172A) , pAB16 (hRCEl-H174A) and pAB17 (hRCEl-H175A) .
Example 8: Whole Cell Co-Culture Assay for Identifying Rcelp Inhibitors
A strain of yeast is constructed in which the endogenous RAS1 gene has been deleted and the endogenous RAS2 gene has been replaced with the temperature-sensitive ras2-23 allele, which sensitizes the strains such that their growth rates depend upon the function of Rcelp. Cells from this strain are transformed with a high-copy (2μ) plasmid (e.g., a LEU2, URA3 or HIS3-marked plasmid whose presence can be selected in the appropriate leu, ura or his auxotrophic strain background) which either does (Strain 1) or does not (Strain 2) additionally encode: a) the RCEl gene driven by a strong (e.g., the ADHl ) promoter; and b) a marker gene (e.g., green fluorescent protein (GFP)) under the control of a strong promoter. Strain 1 and Strain 2 transformants are selected on the appropriate medium (e.g., selecting for leu, ura or his prototrophy) . Strain 1 (but not Strain 2) transformants are also selected based on their fluorescence (from expression of the plasmid-linked GFP marker) .
Strain 1 and Strain 2 are grown independently to a point where they are in mid-log phase (O.D.ggo = 1-0) and are then mixed at an initial ratio of 1 cell of Strain 1 for every 5,000 cells of Strain 2. The mixture does not exhibit significant fluorescence because Strain 1, which harbors the multi-copy plasmid encoding RCEl and the fluorescent marker GFP, is present in such low abundance. Following the initial mixture of the two strains, the co- culture is aliquoted into wells of 96-well plates.
A typical 96-well plate will handle about 0.25ml of liquid. One ODggo °f yeast in rich medium (YPD) is about 1 x 107 cells/ml. Taking into consideration that we want a 5,000-fold excess of Strain 2 in the mixture and we want to have at least 50 cells of Strain 1 in the mixture to take into account inevitable variations, then a co- culture experiment where the starting OD6Q0 = 0.1 would include 50 cells of Strain 1 and 250,000 cells of Strain 2. This gives a Strain 1-to-Strain 2 ratio of 1:5,000. A typical yeast overnight growth in a 96-well plate will produce an OD6QO ~4. This translates to a 40-fold increase in growth, or 5.3 generations.
To identify competitive inhibitors of Rcelp, a dilution series (e.g., ten-fold) of each selected potential inhibitor resuspended in an appropriate medium is prepared and an aliquot of each test sample added individually to the wells of the 96-well plates which contain the co-cultured strains. As a control, an equal volume of each medium in which the test inhibitors are suspended (now without the test inhibitor) is added to a well containing the strain co-culture. The co-cultures plus test chemicals and controls are incubated until the cells have divided numerous times. The cells are diluted into fresh medium and allowed to undergo ca. 5 generation doublings (as determined by spectrographic absorbance readings) . At various points during this growth period, the relative abundance of Strain 1 in the mixture is assessed by fluorescence emission from the 96-well plates.
Example 9 : Heat Shock Assays for Measuring hRCE Function and for Identifying Rcelp
Inhibitors
Plasmids
Plasmid pABl was constructed as follows: The backbone plasmid for pABl was Yeplacl95 (2μ, URA3; Gietz and Sugino, 1988) and contains an insert consisting of two parts: green fluorescent protein (GFP) and phosphoglycerate kinase { PGKl ) terminator. The coding region of GFP from Aequoria victoria containing the S65T mutation (Ormo et al., 1996) was amplified by PCR using oligonucleotides:
5P-GFP-ORF (5' -CAT GTC TAG AGG AGA AGA ACT TTT C-3') (SEQ ID NO: 51) ; and
3P-GFP-ORF (5' -CGC GAA TTC CTA TTT GTA TAG TTC A-3'); and has Xbal and EcoRI sites engineered on the 5' and 3' ends, respectively.
The PGKl terminator was amplified by PCR using the following oligonucleotides:
PGK 1 (containing an EcoRI site) : 5 * -GAT TGA ATT CAA TTG AAA TCG ATA G-3' (SEQ ID NO:48); and
PGK 2 (containing a NarI site) : 5' -CCG AGG CGC CGA ATT TTC GAG TTA T-3' (SEQ ID NO: 49) and consists of 263 bp immediately downstream of the stop codon of PGKl . EcoRI and Narl sites were engineered on the 5' and 3' ends of the PGKl terminator.
Plasmid pAB20 was constructed as follows: The val19 dominant allele of RAS2, RAS2 (D. Broek, et al . ,
Cell. 41, pp. 763-769 (1985)), was cloned as a Xhol- Hindlll fragment (1906 bp) into the CEN-LEU2 plasmid pRS315 (R.S. Sikorski and P. Hieter, Genetics, 122, pp.
19-27 (1989) ) . This plasmid carries the coding region for
Ras2p as well as 639 bp upstream and 298 bp downstream of the RAS2 open reading frame. Plasmids pAB13 hRCEl , pAB15 H172A, pABlβ H174A; and pAB17 H175A were made as described in Examples 3 and
7.
Heat shock assays in veast cells
Yeast cells wild-type for RCEl (W303) {MAT&; leu2-3, 112; ade2-101 ; his3-l l ; trpl -1 ; ura3-l ) were used to create an isogenic, double mutant strain lacking both the Rcelp and Afclp CAAX proteases {Δrcel , Δafcl ) . The wild-type (W303) or Δrcel , Δafcl strains were separately transformed with plasmid pAB20 expressing the dominant val19 mutant RAS2 gene. Transformed Δrcel , Δafcl cells then were co-transformed with one of the following URA3 marked, high-copy (2μ) plasmids: pABl (vector control); pAB13 {hRCEl ) , pABl5 (H172A) , pABlβ (H174A) ; and pAB17
(H175A) (as described in Figure 10) . Yeast cell transformants were grown at 30°C for approximately 48 hours in synthetic complete media lacking uracil and leucine and supplemented with 2% glucose, to an approximate OD600 of 4-6. The cultures were diluted 1:10 in water in 0.5 ml PCR tubes and placed in a PTC-100 thermocycler (MJ Research, Inc.). The heat-shock regimen was performed as follows: 25°C 3 min; 55°C 1 min, 25°C 3 min. The cultures were further diluted in water and plated for viability. The number of surviving colonies were compared with those from parallel samples that had not undergone heat shock. The results of this experiment are shown in Figure 11.
Heat shock assays to identify CAAX protease inhibitors The above heat shock assay may be modified by one of skill in the art to identify inhibitors of Rcelp in the presence of an activated RAS2 allele. A dilution series of each selected potential inhibitor may be tested (e.g., as described in Example 8). Rcelp inhibitors may be identified by their ability to increase in a concentration-dependent manner the viability of heat- shocked cells which express activated Ras2p, which express Rcelp from a chromosomal and/or from a single or multicopy plasmid. In addition, the cells can be deleted for the yRCEl gene to identify inhibitors of the human hRcelp protein.

Claims

What is claimed is:
1. An isolated, recombinant or synthetic DNA molecule which comprises a nucleic acid sequence selected from SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 37.
2. The DNA according to claim 1 wherein the DNA comprises a nucleic acid sequence of SEQ ID NO: 1.
3. An isolated, recombinant or synthetic DNA or polynucleotide comprising a nucleic acid sequence that is substantially homologous to any of the DNA molecules of claims 1 or 2.
4. An isolated, recombinant or synthetic DNA or polynucleotide comprising a nucleic acid sequence that hybridizes under stringent conditions to any of the nucleic acid sequences of claims 1 or 2.
5. The DNA or polynucleotide according to any one of claims 3 or 4, wherein said polynucleotide is selected from the group consisting of RNA, cDNA, genomic DNA, synthetic nucleic acids, chemically or biochemically modified nucleic acid comprising non-natural or derivatized nucleic acid bases, or mixed polymers of any of the above.
6. An isolated, recombinant or synthetic DNA or polynucleotide comprising the antisense RNA or antisense DNA sequence of the DNA or polynucleotide according to any one of claims 1-5.
7. A DNA molecule according to any one of claims 1-6, further comprising an expression control sequence operatively linked to the DNA or polynucleotide sequence.
8. A polynucleotide comprising at least eight consecutive nucleotides of the DNAs of claims 1 or 2 or the polynucleotides of claims 3-6.
9. A vector comprising a DNA according to claim 1 or 2 or a polynucleotide according to claims 3-6.
10. A host cell transformed with a molecule according to any one of claims 1 to 9.
11. The host cell according to claim 10, wherein the molecule is a DNA molecule which is integrated into the genome of said host cell.
12. A method for producing a polypeptide comprising the step of culturing a host cell transformed with a DNA molecule according to claim 7.
13. A polypeptide encoded by a DNA or polynucleotide comprising a nucleic acid sequence selected from SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 37.
14. A polypeptide encoded by a DNA or polynucleotide comprising a nucleic acid sequence selected from the group consisting of:
(a) a nucleic acid sequence that is substantially homologous to any of the nucleic acid sequences of claims 1-5; and
(b) a nucleic acid sequence that hybridizes under stringent conditons to any one of the nucleic acid sequences of claims 1-5.
15. A polypeptide comprising amino acid sequences selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 38.
16. A polypeptide characterized by being at least 80% identical in amino acid sequence to any one of the amino acid sequences selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 38.
17. An antibody that binds specifically to an epitope of any one of the polypeptides of claims 13- 15.
18. A pharmaceutical composition comprising any one of the polypeptides of claims 13-15.
19. A pharmaceutical composition comprising an inhibitor of a mammalian CAAX processing enzyme, said enzyme comprising amino acid sequences selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 38.
20. A method for identifying a competitive inhibitor of any one of the polypeptides of claims 13-15, wherein said method comprises the steps of: a) co-culturing two distinguishable strains of cells which differ by the presence or absence of a functional RCEl or AFCl gene; and b) selecting compounds which increase the relative proportion of cells expressing the functional RCEl or AFCl gene compared to those which do not.
21. A method for treating a CAAX-protein mediated disease or disorder in a patient by administering a pharmaceutical composition comprising any one of the polypeptides of claims 13-15.
22. A method for treating a CAAX-protein mediated disease or disorder in a patient by administering a pharmaceutical composition according to claim 19..
PCT/US1998/011415 1997-06-02 1998-06-02 Mammalian caax processing enzymes WO1998054333A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU77216/98A AU7721698A (en) 1997-06-02 1998-06-02 Mammalian caax processing enzymes
US09/454,889 US6518035B1 (en) 1998-06-02 1999-12-03 Targeted methods of drug screening using co-culture methods

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US4736997P 1997-06-02 1997-06-02
US60/047,369 1997-06-02
US5238997P 1997-07-14 1997-07-14
US60/052,389 1997-07-14

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US21312098A Continuation 1998-06-02 1998-12-15

Publications (2)

Publication Number Publication Date
WO1998054333A2 true WO1998054333A2 (en) 1998-12-03
WO1998054333A3 WO1998054333A3 (en) 1999-04-08

Family

ID=26724942

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1998/011415 WO1998054333A2 (en) 1997-06-02 1998-06-02 Mammalian caax processing enzymes

Country Status (1)

Country Link
WO (1) WO1998054333A2 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0887416A2 (en) * 1997-06-24 1998-12-30 Smithkline Beecham Corporation Human AFC1
WO1999035275A1 (en) * 1998-01-08 1999-07-15 Universidad De Oviedo Human genetic sequences which are homologous to the yeast genes involved in the proteolytic processing of prenylated proteins
WO1999061628A2 (en) * 1998-05-22 1999-12-02 Onyx Pharmaceuticals, Inc. Novel nucleic acids and polypeptides related to a farnesyl-directed endopeptidase
US6060277A (en) * 1997-06-24 2000-05-09 Smithkline Beecham Corporation Human AFC1
US6261793B1 (en) * 1999-03-04 2001-07-17 Schering Corporation RAS converting endoprotease (RCE) and methods
US6417332B1 (en) * 1998-02-12 2002-07-09 Smithkline Beecham Corporation Human RCE1
US6518035B1 (en) 1998-06-02 2003-02-11 Rosetta Inpharmatics, Inc. Targeted methods of drug screening using co-culture methods
US6667398B2 (en) 2000-06-22 2003-12-23 Pfizer Inc Process for the preparation of pyrazolopyrimidinones
US7067660B2 (en) * 2000-06-22 2006-06-27 Pfizer Inc. Process for the preparation of pyrazolopyrimidinones

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0679716A1 (en) * 1993-11-12 1995-11-02 Kenichi Matsubara Gene signature
WO1998005786A2 (en) * 1996-08-07 1998-02-12 The Regents Of The University Of California Afc1 and rce1:isoprenylated caax processing enzymes
EP0887416A2 (en) * 1997-06-24 1998-12-30 Smithkline Beecham Corporation Human AFC1
EP0887415A2 (en) * 1997-06-24 1998-12-30 Smithkline Beecham Corporation Human RCE1

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0679716A1 (en) * 1993-11-12 1995-11-02 Kenichi Matsubara Gene signature
WO1998005786A2 (en) * 1996-08-07 1998-02-12 The Regents Of The University Of California Afc1 and rce1:isoprenylated caax processing enzymes
EP0887416A2 (en) * 1997-06-24 1998-12-30 Smithkline Beecham Corporation Human AFC1
EP0887415A2 (en) * 1997-06-24 1998-12-30 Smithkline Beecham Corporation Human RCE1

Non-Patent Citations (12)

* Cited by examiner, † Cited by third party
Title
AKOPYAN T.N. ET AL.: "Proteolytic processing of farnesylated peptides: assay and partial purification from pig brain membranes of an endopeptidase which has the characteristics of E.C. 3.4.24.15" BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, vol. 198, no. 2, 28 January 1994, pages 787-794, XP002049582 cited in the application *
BOYARTCHUK V.L. ET AL.: "Modulation of Ras and a-factor function by carboxyl-terminal proteolysis" SCIENCE, vol. 275, 21 March 1997, pages 1796-1800, XP002049583 cited in the application *
CHEN Y. ET AL.: "Solubilization, partial purification, and affinity labeling of the membrane-bound isoprenylated protein endoprotease" BIOCHEMISTRY, vol. 35, 12 March 1996, pages 3227-3237, XP000199958 *
Database EMBL Emest12, Entry HSAA19855 Accession number AA210930 2 February 1997 96% identity with Seq.ID:37 nt.216-590 XP002091645 *
Database EMBL Emest13, Entry HSW4115 Accession number W96411 18 July 1996 98% identity with Seq.ID:1 nt.488-841 88% identity with Seq.ID:3 nt.481-830 reverse orientation XP002083134 cited in the application *
Database EMBL Emest15, Entry MM3448 Accession number W14344 30 April 1996 87% identity with Seq.ID:1 nt.617-990 100% identity with Seq.ID:3 nt.610-983 XP002083135 cited in the application *
Database EMBL Emest16, Entry MMAA21859 Accession number AA021859 29 November 1996 89% identity with Seq.ID:1 nt.7-664 98% identity with Seq.ID:3 nt.3-672 XP002083136 *
Database EMBL Emest16, Entry MMAA68614 Accession number AA168614 22 December 1996 90% identity with Seq.ID:1 nt.390-839 100% identity with Seq.ID:3 nt.383-832 XP002083133 cited in the application *
Database EMBL Emest18, Entry HSZZ64182 Accession number AA359058 18 April 1997 100% identity with Seq.ID:37 nt.1-306 XP002091647 *
Database EMBL Emest8, Entry HS1228356 Accession number AA428315 25 May 1997 100% identity with Seq.ID:1 nt.130-529 90% identity with Seq.ID:3 nt.123-522 XP002082655 *
Database EMBL Emest8, Entry HS181310 Accession number N76181 5 April 1996 100% identity with Seq.ID:37 nt.430-721 XP002091646 *
NISHII W. ET AL.: "Partial purification and characterization of a CAAX-motif-specific protease from bovine brain using a novel fluorometric assay" JOURNAL OF BIOCHEMISTRY, vol. 122, 1997, pages 402-408, XP000199959 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0887416A2 (en) * 1997-06-24 1998-12-30 Smithkline Beecham Corporation Human AFC1
EP0887416A3 (en) * 1997-06-24 1999-03-31 Smithkline Beecham Corporation Human AFC1
US6060277A (en) * 1997-06-24 2000-05-09 Smithkline Beecham Corporation Human AFC1
WO1999035275A1 (en) * 1998-01-08 1999-07-15 Universidad De Oviedo Human genetic sequences which are homologous to the yeast genes involved in the proteolytic processing of prenylated proteins
US6417332B1 (en) * 1998-02-12 2002-07-09 Smithkline Beecham Corporation Human RCE1
WO1999061628A2 (en) * 1998-05-22 1999-12-02 Onyx Pharmaceuticals, Inc. Novel nucleic acids and polypeptides related to a farnesyl-directed endopeptidase
WO1999061628A3 (en) * 1998-05-22 2000-02-03 Onyx Pharma Inc Novel nucleic acids and polypeptides related to a farnesyl-directed endopeptidase
US6518035B1 (en) 1998-06-02 2003-02-11 Rosetta Inpharmatics, Inc. Targeted methods of drug screening using co-culture methods
US6261793B1 (en) * 1999-03-04 2001-07-17 Schering Corporation RAS converting endoprotease (RCE) and methods
US6667398B2 (en) 2000-06-22 2003-12-23 Pfizer Inc Process for the preparation of pyrazolopyrimidinones
US7005520B2 (en) 2000-06-22 2006-02-28 Pfizer Inc. Process for the preparation of pyrazolopyrimidinones
US7067660B2 (en) * 2000-06-22 2006-06-27 Pfizer Inc. Process for the preparation of pyrazolopyrimidinones

Also Published As

Publication number Publication date
WO1998054333A3 (en) 1999-04-08

Similar Documents

Publication Publication Date Title
US5686412A (en) Protein kinases
US6627405B1 (en) 53BP2 complexes
US6071700A (en) Heterologous polypeptide production in the absence of nonsense-mediated MRNA decay functions
JPH10500568A (en) TNF / NGF superfamily receptor and soluble oligomer TNF / NGF superfamily receptor modulator
CA2323070A1 (en) Chitinase chitin-binding fragments
WO1998054333A2 (en) Mammalian caax processing enzymes
EP0793725B1 (en) Nuclear protein serine/threonine kinases
CA2295439A1 (en) Mammalian neuro-growth factor like protein
EP0690874B1 (en) Materials and methods relating to proteins that interact with casein kinase i
CA2310987A1 (en) Tnf ligand family gene
WO1993021330A1 (en) Human glutamine:fructose-6-phosphate amino-transferase
US6159732A (en) Nucleic acid encoding mammalian Ubr1
US6307035B1 (en) BRCA1 associated polynucleotide (BAP-1) and uses therefor
US6312688B1 (en) Tyrosine-phosphatase-related protein
WO2001016314A1 (en) Iren protein, its preparation and use
WO1998005968A9 (en) Brca1 associated protein (bap-1) and uses therefor
US6746837B2 (en) Multiple drug resistance gene atrD of aspergillus nidulans
JPH10117788A (en) Human myt-1 kinase clone
CA2313464A1 (en) Mammalian alpha helical protein-1
CA2315943A1 (en) Multiple drug resistance gene atrc of aspergillus nidulans
AU713686B2 (en) EB1 gene product binds to APC
US6303770B1 (en) Nucleic acids encoding mammalian alpha helical protein-1
AU5775300A (en) Tankyrase2 materials and methods
US5840536A (en) Growth factor receptor-binding insulin receptor
CA2304209A1 (en) Ras-binding protein (pre1)

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM GW HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

WWE Wipo information: entry into national phase

Ref document number: 09213120

Country of ref document: US

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM GW HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase in:

Ref country code: JP

Ref document number: 1999501041

Format of ref document f/p: F

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: CA