AU779746B2

AU779746B2 - Prenylcysteine carboxyl methyltransferase, DNA encoding same, and a method of screening for inhibitors thereof

Info

Publication number: AU779746B2
Application number: AU42439/02A
Authority: AU
Inventors: Mark Philips
Original assignee: New York University NYU
Current assignee: New York University NYU
Priority date: 1997-06-11
Filing date: 2002-05-22
Publication date: 2005-02-10
Anticipated expiration: 2018-06-11
Also published as: AU4243902A

Description

1

AUSTRALIA

Patents Act 1990 NEW YORK UNIVERSITY COMPLETE SPECIFICATION DIVISIONAL PATENT Invention Title: Prenylcysteine carboxyl methyltransferase, DNA encoding same, and a method of screening for inhibitors thereof The following statement is a full description of this invention including the best method of performing it known to us:- PRENYLCYSTEINE CARBOXYL METHYLTRANSFERASE,

DNA

ENCODING SAME, AND A METHOD OF SCREENING

FOR

INHIBITORS

THEREOF

BACKGROUND OF THE INVENTION Field of the Invention The present invention relates to the field of posttranslational modification of proteins, and more particularly, to an enzyme catalyzing carboxyl methylation of signaling molecules and DNA encoding same. The present invention also relates to a method of screening for inhibitors of carboxyl methylation, which inhibitors may serve as therapeutic agents in the treatment of inflammation and cancer.

Description of the Related Art Proteins that are posttranslationally modified by prenylation (farnesylation or geranylgeranylation) of a Cterminal cysteine are usually further modified by proteolysis and a-carboxyl methylation. There are two major classes of proteins that are prenylated on cysteine residues. The first major class of these prenylated proteins are those which end with the sequence CysXaaXaaXaa (SEQ ID NO:6), hereinafter referred to as CXXX, indicating that the primary translation product has a cysteine four residues from the C-terminus.

Proteins of this class are often referred to as CAAX proteins, where A=aliphatic amino acid and X= any amino acid, because most, but not all, such proteins have aliphatic residues at the indicated positions (Clarke, 1992).

Among the large number of proteins ending with the CXXX consensus signal for prenylation are yeast mating pheromones, nuclear lamins, members of ras and Rho families of the Ras superfamily of GTPases, and the y subunits of heterotrimeric guanine nucleotide-binding regulatory

(G)

proteins.

In most cases, the final amino acid of the CXXX sequence determines whether the primary translation product is farnesylated or geranylgeranylated on the C-terminal cysteine 2 residue by two related (type I) but distinct prenyltransferases. If the final amino acid of the C-terminus is leucine or phenylalanine, the primary translation product is geranylgeranylated (Finegold et al., 1991; Kinsella et al., 1992). Other amino acids, such as serine, methionine, or glutamine, at the final amino acid position, causes the primary translation product to be farnesylated. Thus, cytosolic type I prenyltransferases recognize the CXXX consensus sequence and catalyze the attachment of a farnesyl or 20-carbon geranylgeranyl polyisoprene chain via a thioether linkage to the cysteine residue (Glomset et al., 1990; Maltese, 1990; Seabra et al., 1991).

Once prenylated, these proteins become substrates for a protease that removes the XXX sequence (final three amino acids at the C-terminus), leaving the prenylcysteine as the new C-terminus. It is this C-terminal prenylcysteine moiety that then becomes a substrate for prenylcysteine carboxyl methyltransferase, which methylesterifies the a carboxyl group (Clarke et al., 1988; Stephenson et al., 1990).

Unlike prenylation and proteolysis, however, carboxyl methylation is reversible under physiologic conditions (Venkatasubramanian et al., 1980; Chelsky et al., 1985).

Figure 1 schematically illustrates the series of posttranslational modifications of proteins having the C-terminal CXXX consensus sequence using a Rho-protein as an example.

The other major class of prenylated proteins is the rab family of ras-related GTPases which end with the sequence CysXaaCys or CysCys. One or both of the C-terminal cysteine residues are geranylgeranylated by a distinct type II geranylgeranyl transferase. The C-terminal geranylgeranylated cysteine residL of rab proteins are also substrates for prenylcysteine .:oxyl methyltransferase.

Yeasi ing pheromones absolutely require carboxyl methylation for their interaction with receptors. The gene product of the wild-type Stel4 gene characterized and sequenced in Saccharomyces cerevisiae was identified to be the farnesylcysteine C-terminal carboxyl methyltransferase which mediates the C-terminal methylation of the yeast a-factor pheromone and the ras proteins of S. cerevisiae (Hrycyna et al., 1990 and 1991; Ashby et al., 1993). In human neutrophils, the laboratory of the present inventor reported that prenylcysteine carboxyl methylation of Rho GTPases is stimulated by inflammatory agonists and that agents that block prenylcysteine carboxyl methyltransferase inhibit neutrophil signal transduction, (Philips et al., 1993). Thus, inhibitors of prenylcysteine carboxyl methyl transferase are expected to serve a therapeutic role as antiinflammatory agents.

The family of Ras proteins which are prenylated play a central role in the regulation of cell grdoth and integration of the regulatory signals which govern the cell cycle and proliferation. It is now known that there are four alleles of ras, H-ras, K-ras (both exon 4a and 4b splice variants), and N-ras, in mammalian cells. Mutants of these ras genes were among the first oncogenes to be identified for their ability to transform cells to a cancerous phenotype (Barbacid, 1987), and mutations of the ras genes (H-ras,

K-

ras, and N-ras) have been demonstrated to be associated with unregulated cell proliferation and are found in an estimated of all human cancers (Rodenhuis, 1992).

The function of normal and oncogenic Ras proteins is absolutely dependent on the series of posttranslational modifications (see Fig. Indeed, such posttranslational modifications are necessary for both membrane targeting and localization. Because Ras function is dependent on the localization to the plasma membrane and association with the plasma membrane, and because constitutively active mutant Ras proteins and several other prenylated Ras-related GTPases have the capacity to transform cells, it has been recognized that each of the enzymatic posttranslational modification steps are new drug targets (Gibbs, 1991). There has been much activity in developing inhibitors of prenyltransferases, which catalyze the first step of the posttranslational modifications, to block the membrane targeting and localization of the Ras protein (Koblan et al., 1996). Tests have shown that prenyltransferase inhibitors block the maturation of the Ras protein and reverse the cancerous transformation induced by mutant ras genes in cell culture and that the formation of new tumors by abnormal Ras proteins was prevented in animals.

Furthermore, the farnesyl transferase inhibitors appear to be quite specific and do not affect normal cells (Koblan et al., 1996; Gibbs et al., 1996; Oliff et al., 1996).

It is expected that inhibitors that target and block prenylcysteine carboxyl methyltransferase, the third step of the posttranslational modification, will also serve as a therapeutic in cancer treatment as well as in other hyperproliferative disorders, such as psoriasis, precancer, etc. Prenylcysteine carboxyl methyltransferases, however, have defied biochemical purification because they cannot be extracted from biological membranes in an active state. Thus, no tool useful in identifying such inhibitors has existed in the prior art.

Citation of any document herein is not intended as an admission that such document is pertinent prior art, or considered material to the patentability of any claim of the present application. Any statement as to content or a date of any document is based on the information available to the applicant at the time of filing and does not constitute an admission as to the correctness of such a statement.

SUMMARY OF THE INVENTION The present invention relates to mammalian prenylcysteine carboxyl methyltransferase (pcCMT), and particularly to a human pcCMT that is the first mammalian pcCMT to be isolated away from human cells and other human proteins in active form and to specific antibodies thereto.

The present invention also relates to a recombinant

DNA

molecule which includes a nucleotide sequence encoding human prenylcysteine carboxyl methyltransferase, as well as an expression vector thereof, and a host cell transformed with the expression vector. The present invention further relates to methods of screening for inhibitors of mammalian prenylcysteine carboxyl methyltransferase activity, which inhibitors are expected to serve as therapeutics in the treatment of inflammation and hyperproliferative disorders such as cancer.

Accordingly in one aspect the invention provides an isolated polypeptide, comprising: a) the amino acid sequence of SEQ ID b) an amino acid sequence having at least 70% identity to a) and having prenylcysteine carboxyl methyltransferase (pcCMT) activity either when isolated or when membrane bound; c) a fragment of a) or b) which has pcCMT.activity either when isolated or when membrane bound; or d) a protein encoded by a nucleotide sequence which hybridizes to the complement of nucleotides 44 to 895 of SEQ ID NO:4 in 50% formamide and 5XSSC at 40*C, followed by nonstringent washing, said protein having pcCMT activity either when isolated or when membrane bound.

In another aspect the invention provides an isolated polypeptide comprising a fragment of the amino acid sequence of SEQ ID NO:5, which fragment is capable of inhibiting the carboxyl methylation activity of the prenylcysteine carboxyl methyltransferase (pcCMT) of SEQ ID In another aspect the invention provides a membrane on which prenylcysteine carboxyl methyltransferase (pcCMT) is present, said pcCMT being non-native to said membrane and having the amino acid sequence of a protein encoded by a nucleotide sequence which hybridizes to the complement of nucleotides 44 to 895 of SEQ ID NO:4 in 50% formamide and 5XSSC at 40*C, followed by nonstringent washing wherein said pcCMT on said membrane has pcCMT activity.

In another aspect the invention provides a membrane on which prenylcysteine carboxyl methyltransferase (pcCMT) is present, said pcCMT being non-native to said membrane and having an amino acid sequence having at least 70% identity to SEQ ID NO:5 and having pcCMT activity, or a fragment thereof having pcCMT activity.

In another aspect the invention provides a recombinant DNA molecule comprising a nucleotide sequence encoding a prenylcysteine carboxyl methyltransferase (pcCMT) comprising an amino acid sequence selected from the group consisting of a sequence having at least 70% identity to the full_length of SEQ ID NO:5 and a fragment of a sequence having at least identity to the full length of SEQ ID NO:5, wherein said pcCMT or fragment has prenylcysteine carboxyl methyltransferase activity either when isolated or membrane bound.

In another aspect the invention provides a recombinant DNA molecule comprising a nucleotide sequence encoding a protein having prenylcysteine carboxyl methyltransferase (pcCMT) activity either when isolated or when membrane bound, wherein said nucleotide sequence hybridizes to the complement of nucleotides 44 to 895 of SEQ ID NO:4 in 50% formamide and at 400C, followed by nonstringent washing.

In another aspect the invention provides a method of screening for and identifying inhibitors of prenylcysteine carboxyl methyltransferase (pcCMT), comprising the steps of: providing a mating competent yeast MATa strain stably transformed from a sterile strain lacking a functional pcCMT with a recombinant DNA molecule of claim 22 encoding a pcCMT to ectopically express the pcCMT for mating competence; incubating the yeast MATa strain in the presence or absence of a potential inhibitor of pcCMT; assaying the yeast strain incubated in the presence of the potential inhibitor for mating competence relative to the yeast strain incubated in the absence of the potential inhibitor; and identifying as an inhibitor any potential inhibitor for which the assaying step determines that the mating competence of the yeast strain in the presence of the potential inhibitor is reduced.

In another aspect the invention provides a molecule for delivery of a therapeutic or detectable moiety to a protein consisting of the amino acid sequence of SEQ ID said molecule consisting essentially of a therapeutic or detectable moiety and an antibody, or an antigen-binding portion thereof, specific for an epitope of the protein consisting of the amino acid sequence of SEQ ID In another aspect the invention provides a molecule consisting essentially of a chimeric T cell receptor bearing an antigen-binding portion of an antibody specific for an epitope of the protein consisting of the amino acid sequence of SEQ ID In another aspect the invention provides an isolated polypeptide, comprising: a) the amino acid sequence of SEQ ID b) an amino acid sequence having at least identity to a) and having prenylcysteine carboxyl methyltransferase (pcCMT) activity either when isolated or when membrane bound; g'e c) a fragment of a) or b) which has pcCMT activity 25 either when isolated or when membrane bound.

:In another aspect the invention provides an isolated antibody or antigen-binding fragment thereof specific for the protein consisting of the amino acid sequence of SEQ ID In another aspect the invention provides A method of screening for and isolating inhibitors of prenylcysteine carboxyl methyltransferase, comprising the steps of: providing a membrane preparation in which a prenylcysteine carboxyl methyltransferase molecule is present, wherein said prenylcysteine carboxyl methyltransferase molecule is not native to the source of the membrane preparation and/or is overexpressed in said source of the membrane preparation; incubating the membrane preparation with a substrate of the isolated prenylcysteine carboxyl methyltransferase and with or without a potential inhibitor of prenylcysteine carboxyl methyltransferase; determining the level of prenylcysteine carboxyl methyltransferase activity in the presence of the potential inhibitor relative to the level of prenylcysteine carboxyl methyltransferase in the absence of the potential inhibitor; and isolating as an inhibitor any said potential inhibitor for which said determining step determines that the level of prenylcysteine carboxyl methyltransferase activity in the presence of said potential inhibitor is substantially 20 less than that in the absence of said potential inhibitor.

Throughout this specification the word "comprise", or variations such as "comprises" or "comprising", will be understood to imply the inclusion of a stated element, integer or step, or group of elements, integers or steps, S 25 but not the exclusion of any other element, integer or step, or group of elements, integers or steps.

Any discussion of documents, acts, materials, devices, articles or the like which has been included in the present specification is solely for the purpose of providing a context for the present invention. It is not to be taken as an admission that any or all of these matters form part of the prior art base or were common general knowledge in the field relevant to the present invention as it existed in Australia before the priority date of each claim of this application.

BRIEF DESCRIPTION OF THE DRAWINGS Figure 1 schematically illustrates the posttranslational processing of a Rho-family Ras-related protein.

Figure 2 shows an amino acid sequence alignment between the deduced amino acid sequence of human pcCMT (SEQ ID NO:5) and the related genes and sequences of mouse expressed sequence tag (EST) mh77d06.rl (SEQ ID NO:7), Xenopus laevis mam4 (SEQ ID NO:8; Imai et al., 1997), two C. elegans genes identified by search of the NCBI GenBank database, S. cerevisiae Stel4 (SEQ ID NO:11; Ashby et al., 1993) and S. pombe mam4 (SEQ ID NO:12; Imai et al., 1997).

Amino acid identities between human pcCMT and the other gene products are indicated by dark shading. Hydrophobic 20 stretches of human pcCMT that may represent membranespanning domains are indicated with solid bars and a region of 36% identity between human pcCMT (amino acid residues 1-66 of SEQ ID NO:5) and the amino acids 750-821 of human band 3 anion transporter is indicated with a 25 dashed bar. An internal peptide used to raise an antipcCMT antiserum is indicated by a dotted line. A potential N-glycosylation site is indicated with an arrowhead.

Figure 3 shows a Kyte-Doolittle hydrophobicity plot of human pcCMT.

Figures 4A-G show plot analysis of pcCMT structure where Fig. 4A shows hydrophilicity, Fig. 4B shows surface probability, Fig. 4C shows flexibility, Fig. 4D shows antigenic index, Fig. 4E shows amphiphilic helix, Fig. 4F shows amphiphilic sheet, and Fig. 4G shows secondary structure.

Figures 5A and 5A show the carboxyl methylation of ras-related GTPases by COS-1 cell membranes expressing ectopic human pcCMT. In Fig. 5A, neutrophil cytosol (CS) containing a mixture of Rho family proteins complexed with Rho-GDI (lanes 2, 3, 6, 7, 10, 11) or RhoA-GDI complexes partially purified *e 6 from CS (lanes 4, 8, 12) were incubated with the methyl donor S-adenosyl-L-[methyl-'H]methionine and light membranes derived from nitrogen cavitation of human neutrophils (lanes 1-4) or COS-1 cells transiently transfected with either human pcCMT (lanes 5-8) or vector alone (lanes 9-12). In some reactions (lanes 3, 7, 11) a competitive pcCMT inhibitor, AFC, was included. Reaction products were analyzed by SDS-PAGE and fluorography. The position of carboxyl methylated p21s is indicated as is the position of protein phosphatase 2a (PP2a) that is methylated on a C-terminal leucine by an enzyme unrelated to pcCMT (Lee et al., 1993). In Fg. 5B, the analysis as done in Fig. 5A shows GTP-dependence (±10 pM of carboxyl methylation of partially purified human neutrophil Rac2 and RhoA, but not cdc42hs, by COS-1 cell membranes expressing human pcCMT.

Figures 6A and 6B show Northern analysis of human tissues for expression of pcCMT mRNA and Figure 6C shows immunoblot analysis of expressed pcCMT protein. In Fig. 6A polyadenylated mRNA was prepared from HL60 cells that had been grown in DMEM 10% fetal calf serum without (lane 1) or with (lane 2) 5% DMSO for five days to induce granulocytic differentiation. Membranes were probed with ["PI-labeled full-length pcCMT cDNA (upper panel) and later stripped and reprobed for p-actin mRNA (lower panel). The results shown are representative of two independent experiments. In Fig.

6B, a membrane containing polyadenylated mRNA from the indicated human tissues was modified as in Fig. 6A for pcCMT mRNA (upper panel) and subsequently pactin mRNA (lower panel) according to the manufacturers' instructions (Clontech, Palo Alto, CA). In Fig. 6C, lysates of HL60 cells metabolically labeled with methionine/c :eir were immL...precipitated (lanes 2 and 3) with preimmune c- imm- a ierum from rabbit immunized with a pcCMT peptide. M..ibranes from human neutrophils (PMN) and from COS-1 cells transfected with pcCMT (CMT) or vector alone (VEC) were immunoblotted with the same sera (lanes 4 and 7 Figures 7A-7I show the subcellular localization of human pcCMT. In Figs. 7A-7E CHO cells (Figs. 7A-7C) and COS- 1 cels (Figs. 7D and 7E) were transiently transfected with

GFP

alone (Fig. 7A) or GFP-tagged human pcCMT (Figs. 7B-7E) and examined unfixed. In Figs. 7C and 7E, the image is intentionally overexposed to reveal the distinct fluorescence of the ER and absence of plasma membrane fluorescence.

Arrowheads indicate nuclear envelope fluorescence and asterisks indicate a perinuclear area of intense Golgi fluorescence. In Fig. 7F, CHO cells transfected with GFPtaagged pcCMT were fixed, permeablized, and stained for the ER marker ribophorin I (Texas-Red) and viewed by confocal microscopy (0.4 JM optical section) where yellow pseudocolor indicates overlap. In Fig. 7G, CHO cells transfected with Myc-tagged human pcCMT were fixed, permeablized, and stained with anti-Myc antibody 9E10 (Texas-Red). In Figs. 7H and 71, untransfected COS-1 cells were stained for endogenous pcCMT with preimmune serum (Fig. 7H) or immune anti-peptide antiserum (Fig. 71). Bars indicate 10 JM.

Figures 8A-8C show the effect of a pcCMT inhibitor (AFC) on the subcellular localization of Nras. In Fig. 8A, COS-l cells were transiently transfected with GFP-Nras and COS-1 cells were 2pfuorescence microscopy to observed live at 24 hours by epifluorescence microscopy to reveal both surface membrane (arrowheads) and intense perinuclear (Golgi) fluorscenCe. The position of the nuclease is marked Fig. 8B shows the cytosolic localization of GFP-NrasCd86S, a mutant defective in prenylation. Fig. 8C shows the predominantly cytosolic localization of GFP-Nras in cells exposed to 200 pM AFC during the 24-hour transfection.

Bars indicate 10 AM.

8 DETAILED DESCRIPTION OF THE INVENTION A gene expressed in human myeloid cells which encodes a prenylcysteine carboxyl methyltransferase (pcCMT) has been identified and characterized, and is the first such gene to be isolated from a mammal. As shown in Figure 2, the deduced 284 amino acid sequence (SEQ ID NO:5) of human pcCMT was found to have 31% identity (over amino acid residues 277) to a previously described pcCMT of S. cerevisae, designated Stel4 (Ashby et al., 1993). However, the first amino acid residues showed no similarity to the yeast gene product but was found to be homologous to theillth and 12th membrane-spanning domains of a mammalian anion transporter, band 3. Hydropathy analysis, as shown in Figure 3, revealed six membrane spanning domains, consistent with membrane localization and detergent lability of pcCMT activity.

Despite persistent attempts, prenylcysteine carboxyl methyltransferases have previously defied biochemical purification because pcCMT cannot be extracted from biological membranes in an active state. Before the human pcCMT according to the present invention was cloned and characterized as described in the Example herein, the only pcCMT characterized at the molecular level (by complementation of a mating-deficient mutant) was the S. cerevisiae Stel4 gene product. 'Attempts to clone pcCMT from vertebrates by homology to Stel4 have failed. However, according to the present invention, a cDNA encoding human pcCMT was finally cloned and sequenced, and the protein was characterized from its deduced amino acid sequence. It was only after the present human pcCMT was cloned, sequenced and characterized by the laboratory of the present inventor that the cloning of a Schizosaccharomyces pombe mam4 gene and a Xenopus laevis mam4 gene by complementation and transcomplementation of S. pombe mam4 mutants, respectively, was independently reported (Imai et al., 1997). Even these sequences, however, do not render obvious the gene and protein sequences of the present invention.

In the context of the present invention, the term "prenylcysteine carboxyl methyltransferase", also referred to herein as "pcCMT", is intended to encompass any mammalian pcCMT and any naturally occurring variants thereof having at least 70% identity to the amino acid sequence of SEQ ID The term is also intended to include muteins in which one or more amino acid residues of SEQ ID NO:5 has been substituted with a different amino acid, deleted, or added, provided that the amino acid sequence of the mutein has at least identity to SEQ ID NO:5, such as at least 85% identity, at least 90% identity or at least 95% identity. To be a naturally occurring variant or mutein of the human pcCMT according to the present invention, it must have or substantially retain the same carboxyl methyltransferase activity of the human pcCMT of SEQ ID For the purpose of this specification and claims, unless otherwise stated, the percentage sequence identity between two sequences is to be determined by aligning to maximize the local similarity score between the two sequences, and expressing the number of identical aligned pairs as a percentage of the total length of the overlap region, including nulls (gaps), or the original of the shorter sequence, whichever of or is larger.

The two sequences are to be aligned by a rigorous (linear programming) based local alignment algorithm in which the overall similarity score for a given alignment is obtained by summing the pairwise alignment scores, for each aligned pair of amino acids, and a gap-penalty for each gap introduced into either sequence in an attempt to improve the overall similarity score for the alignment.. The pairwise alignment scores are derived from a 20 x 20 scoring matrix for amino acids. The gap penalties are a linear combination of a gap initiation penalty imposed for the first null of a given gap, and a gap extension penalty for each additional null added to that gap. Only internal gaps will be penalized. For amino acid sequence alignment, the scoring matrix will be the PAM250 matrix, in the form wherein the scores range from +17 to -8; the gap initiation penalty will be -12; and the gap extension penalty will be -4.

In addition, the term "isolated mammalian prenylcysteine carboxyl methyltransferase" is intended to encompass any mammalian pcCMT that is isolated away from its native cells, native cell fractions, and other native proteins. For instance, an "isolated mammalian prenylcysteine carboxyl methyltransferase" includes, but is not limited to, mammalian pcCMT expressed in non-native host cells, mammalian pcCMT obtained as part of a membrane preparation of non-native cells, mammalian pcCMT extracted from membrapes of non-native cells, and mammalian pcCMT associated with phospholipid vesicles/liposomes. By the term "membrane", it is intended to not only encompass the various membranes of cells but any artificial membrane-like structures, such as for example phospholipid vesicles/liposomes.

Proteins may be modified by any art-recognized means, including: expression, in a suitable host cell, of a modified gene which encodes the modified protein; biosynthesis of a precursor protein in a host cell which post-translationally modifies the precursor protein crosslinking of cysteines; glycosylation; hydroxylation of prolines; phosphorylation of tyrosines; lipidation of Cterminals; cleavage); biosynthesis of the protein in a host cell engineered to provide a tRNA charged with a modified amino acid; nonbiological synthesis of the modified protein by stepwise addition of amino acids and/cr peptide fragmer- condensation; chemical or enzymatic modification of a precursor protein, with or without use of a linker; fragmentation of a precursor protein and, optionally, linkage of one or more fragments to I 11 additional amino acids or peptides; and combinations thereof.

For the sake of convenience, modifications are divided into those achievable by above, which will be termed "mutations", and those which are not, which will be termed ,derivatizations." While normally derivatives are prepared by first synthesizing the lead molecule and then derivatizing it, derivatives may also be prepared directly. However, implicit in the term "derivatives" is that it is technically feasible to obtain the derivative by modification of the lead molecule, even if that is not the preferred method of production.

Modifications of proteins may be classified as follows: Favorable these modifications enhance the utility of a protein for a particular use.

Neutral these modifications neither enhance nor diminish the utility of a protein for a particular use.

Unfavorable these modifications diminish, but do not necessarily eliminate, the utility of a protein for a particular use.

Inactivating these modifications eliminate the utility of a protein for a particular use.

Tolerable these modifications are favorable, neutral, or unfavorable, but not inactivating.

It is possible, since a protein may have more than one use, that a modification is favorable for one use, unfavorable for a second, neutral for a third, and inactivating for a fourth.

In general, we will discuss the effect of modifications on the suitability of the protein for uses which are specific, to a greater or lesser degree, on the specific structure of the protein, rather than on those which are dependent only on the protein's.amino acid composition, molecular weight, or gross physical properties.

A protein may be modified for a variety of reasons, including one or more of the following purposes: to render the molecule more detectable, as in the case of radiolabeled, enzyme labeled, and fluorescent labeled derivatives; to render the molecule more stable with respect to a particular physical, chemical, or biological agent, such as heat, light, oxidizing agents, reducing agents, and enzymes; S to render the molecule more soluble in a solvent of interest, to facilitate its administration, or less soluble, to facilitate its precipitation_ r to allow its use in capturing another molecule; 0 to limit the nature of the reactions which the molecule can participate in, placing protective groups on amino acid side chains, or conversely, to expand the possible reactions, placing a reactive group on the molecule to facilitate a subsequent coupling reaction; 0 to render the molecule less (or more) immunogenic; 0 to increase (or decrease) the time which the molecule resides in a particular organ or tissue if administered to a subject, or to hasten (or slow) its arrival in a particular organ or tissue; to enhance (or diminish) one or more of its biological or immunological activities, e.g., to increase or decrease its affinity for a receptor, or to alter its specificity; or 0 to confer a new activity which complements a prior acti- attaching a toxin to -n antitumor antibody); or 0 to inhibit an undesirable side effect of the molecule.

Most residues of a protein can tolerate some degree of mutation. Mutations may take the form of single or multiple substitutions, insertions, or deletions. Preferably, insertions or deletions are directed to the termini of the molecule, or to surface loops or interdomain boundaries.

preferably, internal insertions and deletions are of no more than five residues, absent evidence (such as an example in a homologous protein) that a larger internal insertion or deletion could be tolerated.

There is no preferred maximum with respect to an insertion at a terminus, which is more aptly termed an "addition" or "fusion". It is routine to fuse one protein to another to facilitate expression, or to provide a fusion protein which has the combined biological activities of its components. A fusion protein may be useful as a precursor, which can be cleaved to liberate an active protein, even if the fusion protein itself lacks activity.

With regard to deletion at a terminus, more aptly termed "truncation", the purpose of the modification is important. It is routine to extensively truncate a protein when one is interested only in its immunological properties.

One may abstract from a protein an epitope as small as five amino acids, and use it by itself to elicit a T cell response, or conjugated to copies of itself or to an immunogenic carrier to elicit a B cell response. When it is a biological activity which must be preserved, the limits on truncation may be more stringent.

Preferably, after considering substitutions, and any internal deletions and insertions, the mutant is at least more preferably at least 90%, and most preferably at least identical in sequence to the original protein.

A protein is more likely to tolerate a mutation which is a substitution rather than an insertion or deletion; an insertion or deletion at the termini, than internally, or, if internally, at a loop or an interdomain linker; affects a surface residue rather than an interior residue; affects a part of the molecule distal to the binding site; is a substitution of one amino acid for another of similar size, charge, and/or hydrophobicity; and is at a site which is subject to substantial variation among a family of homologous proteins to which the protein of interest belongs.

These considerations can be used to design functional mutants.

Preferably, for the framework residues, and more preferably for the whole chain, the predicted-or experimentally determined 3D structure of the modified protein has a main chain (C-carbon) conformation whose root-meansquare deviation from the predicted or experimentally determined 3D structure of the original protein is preferably less than 5A, more preferably less than 3A, still more preferably less than 2A, most preferably less than 1A.

The most accurate method for the prediction of a protein structure is model building from a protein or proteins of known structure that have been identified as homologous from sequence analysis. Surprisingly, proteins with very little detectable sequence identity can still fold into very similar structures.

The coordinates of protein structures can be obtained from Protein Data Bank or the Cambridge Crystal Structure Data Centre. Sequence databases include the Protein Identification Resource (National Biomedical Research Foundation), GENBANK (Los Alamos National Laboratory), EMBL (European Molecular Biology Laboratory) and SBASE (International Center for Genetic Engineering and Biotechnology). Denied alignment databar.s, in which 3D structure and amino acid sequenc- have been correlated, include NRL-3D Naval Research Lab), HSSP (EMBL), 3D-ALI (EMBL), FSSP (EMBL), and the Overington database (J.P.

Overington, Pfizer Central Research). For complete addresses see Table 2 in Johnson et al. (1994).

The basic approach is to identify related sequences and structures; identify structurally equivalent residues; model structurally conserved regions (SCRs); and model structurally variable regions (SVRs). The model of the SCRs acts, to a greater or lesser degree, as a constraint in the modeling of the SVRs. Because the core residues are usually more structurally conserved than surface residues, they are usually modeled first. For similar reasons, helices and strands are usually modeled before loops. Typically, the main chain (Ca atom) conformation is determined first, and then the side chain conformations. Modeling steps may be iterated to arrive at successively improved approximations of the true structure. Typically, the predicted structures are more accurate for protein cores than for protein loops.

It is not necessary that more than one 3-D structure be available for model building. However, if the 3-D structures of two or more homologous proteins are known, the accuracy of the model can be improved. Preferably, the 3-D structures are "weighted" to reflect the relatedness of the homologous protein to the protein of interest. One popular scheme is to weight by the square of the percentage sequence identity.

Moreover, information regarding homologous substructures of nonhomologous proteins may be used in addition to, or even in lieu of, a 3-D structure of a homologous protein. See Jones et al. (1986), Unger et al.

(1989), Claesseus et al. (1989), and Levitt et al. (1992) for the building of models by combining "spare parts" from different proteins.

It is not necessary for a molecular biologist to be an expert in protein modeling, as several programs exist which automate the modeling process. These include COMPOSER (Tripos Associates) If a 3-D structure is available for the binding partner, as well as for a binding protein of interest, molecular modeling software may be used to predict potential binding sites, or to predict the effect of a proposed mutation on a binding site, by attempting to "dock" the binding partner to the site. See, Guruprasad et al. (1996) and Constantino et al. (1996).

In general, within families of proteins of similar sequence and function, surface residues are more likely to vary than are interior residues. This is most likely because the surface residues are unlikely to be involved in interactions with other residues which are necessary to maintain the overall conformation of the protein.

Some surface residues are directly involved in the binding surface by which a protein exercises a particular binding activity. Mutation of such residues is likely to affect binding; however, it is not necessarily undesirable to make such mutations. For example, mutation of the binding site of a serine protease can alter what is bound, as opposed to simply rendering the protein inactive altogether.

The most reliable method of identifying the surface residues of a protein is to determine the protein's 3-D structure by X-ray diffraction. Even an incomplete 3D structure can be useful in designing mutants. Residues with high mobility, as evidenced by a higher than average crystallographic thermal factor, are those least susceptible to destabilizing mutations. See Alber et al. (1987).

Although many amino acid substitutions can be made at surface positions with no adverse effects, substitutions at internal positions tend to be severely destabilizing. Within families of homologous proteins, the most conserved residues, apart from functional amino acids, are those which are buried.

The main contribution to the free energy of protein folding, and hence to protein stability, comes from burying hydrophobi- side chains in the interior, thereby shielding them from "vent. Packing densities are typically high. In general, the ability of a protein to tolerate mutations which alter the volume of core residues is dependent more on the net change in the total core residue volume, then on the magnitude of the individual residue volume changes. In other words, an increase in the volume of one core position can compensate for a decrease in the volume of another core position.

Preferably, the net change in the total core residue volume is not more than 10%, more preferably, not more than See Lim et al (1989) and Lim et al. (1992).

In the absence of evidence to the contrary, all residues identified as interior residues may be assumed to be part of a single core. However, if it is likely that the protein folds to form several distinct cores, the above-stated volume conservation rule should be applied separately to each core.

Amino acids differ in terms of their propensity to be buried residues. The following table shows, for each residue, the percentage which were in buried positions, based on a study of the 3D structures of unrelated proteins: Amino Acid Buried Gly 36 Ala 38 Val 54 Ile Leu Pro 18 Ser 22 Thr 23 Cys 48 Met Tyr Trp 27 His 17 12 Gin 7 Asp Glu 18 Lys 3 Arg 1 The makeup of the buried core of a protein is dependent, not only on the propensity of each amino acid, if present, to be buried, but also on the overall frequency of occurrence of that amino acid in the protein. The most commonly buried residues are, in descending order, Val, Gly, Leu, Ala, Ile and Ser.

Lim et al (1992) reported that replacing a single hydrophobic amino acid (Leu, Val) in the protein core with a hydrophilic amino acid (Asn, Gin) prevented the complete folding of the protein and destroyed biological activity.

Buried Cys, form), Asp, Gly, His, Pro, and Trp are more than 70% likely to be unchanged in a homologous protein. Therefore, if these residues occur in a buried position in the protein of interest it is preferable to leave them unchanged. Their conservation is probably explainable as follows: Cys (disulfide bonding), Asp (charged but rigid side chain), Gly (chain flexibility), His (charged and uncharged states both available at physiological pH), Pro (unique chain geometry), and Trp (largest side chain).

The other residues, with the exception of Met, are 40-60% likely to left unchanged when buried (Met is unchanged only 26% of the time, but it is 25% likely to be replaced by Leu).

The following buried residue substitution probabilities exceed Ala-Val, Glu-Gln, Phe-Leu, Ile-Leu, IleVal, Lys-Arg, Leu-Ile, Leu-Val, Met-Leu, Met-Val, Asn- Asp, AsnSer, Arg-Lys, Arg-Gln, Ser-Ala, Thr- Ser, Val-Ile, Val- Leu, Tyr->Phe, Cys(-SH)-*Ala.

These further substitutions have probabilities in the 5-10% range: Ala-Ser, Asp-Asn, Glu-Arg, Glu->Val, Phe-Ile, Phe-.Val, Phe-+Tyr, His-Val, Leu-Phe, Met-Ala, Met- Ile, Gln-+Glu, Gln-His, Gln-+Met, Ser-Gly, Ser->Thr, ThrVal, Val-Ala, Trp-Phe, Tyr-Leu, Cys(-SH)-)Ser.

See Overington et al. (1992), Table The most consistent exchange groups appear to be (Arg, Lys), (Leu, Ile, Met, Val, Phe), and (Ser, Thr, Ala).

However, Ala and Val appear to be fairly promiscuous.

In general, therefore, it is preferable to avoid mutating buried residues at all. However, if they are mutated, one should limit the overall change in the volume of the core, and most preferably should limit the mutation to the replacement of one residue with another whose typical substitution probabilitY exceeds zero, more preferably is at least and most preferably at least 10%. Mutation of buried Cys(-S-S), Asp, Gly, His, Pro and Trp should be avoided, absent justification by other evidence. The safest core mutations are exchanges of one hydrophobic amino acid for another, and of Arg for Lys (or vice versa).

Nonetheless, judicious mutation at internal residues may be used to improve protein stability. Such mutations could introduce additional stabilizing interactions (hydrogen bonds, ion pairs) compatible with the native structure, or could reduce the mobility of nearby interacting groups by replacing smaller amino acids with larger ones, or linear side chains with branched or aromatic ones. See Alber (1987).

The most useful information for determining which residues are safely mutatable in muteins is knowledge of the sequence of pcCMT proteins. The sequences of these homologous pcCMT proteins may then be aligned, and residues which are not conserved are more likely to be safely mutatable. The degree of confidence which one has as to the tolerance of a residue to mutation is a function of the degree of variation of amino acid type at that site among the protein family, as well as the extent to which all of the proteins in the family, despite their differences, retain the desired activity.

While studies of homologous proteins are useful in identifying sites which, by virtue of their variability, are likely to be tolerant of mutation, it is less certain that sites which are strongly conserved are necessarily invariant.

Random mutagenesis studies have indicated that proteins are able to accommodate, both structurally and functionally, a far greater variety of mutations than occur naturally.

Cognate proteins are homologous proteins, expressed in a different species of organism, which perform the same biological function as that performed by the protein of interest, although they may differ in activity, specificity, timing of expression, etc. Such cognate pcCMT proteins from other mammals can be obtained upon possession of the human pcCMT cDNA or genomic DNA as the first identified mammalian pcCMT gene (the "starting DNA") encoding the pcCMT protein of interest (the "starting protein") which greatly facilitates the isolation of cognate mammalian proteins.

In one embodiment, the human pcCMT DNA, or a fragment thereof, is used as a hybridization probe to screen a cDNA or genomic DNA library for clones containing inserts which encode either the entire cognate protein, or a recognizable fragment thereof. The minimum length of the hybridization probe is dictated by the need for specificity.

The library is preferably derived from the genomic DNA or mRNA of mammalian cells which are likely to be relatively high producers of pcCMT.

If the mammal in question is known to have substantially different codon preferences from that of human cDNA or genomic DNA, a synthetic hybridization probe may be used which encodes the same amino acid sequence but whose codon utilization is more similar to that of the DNA of the target mammal. Alternatively, the synthetic probe may employ inosine as a substitute for those bases which are most likely to be divergent, or the probe may be a mixed probe which mixes the codons for the source DNA with the preferred codons (encoding the same amino acid) for the target mammal.

By routine methods, the Tm of a perfect duplex of starting DNA is determined. One may then select a hybridization temperature which is sufficiently lower than the perfect duplex Tm to allow hybridization of the starting DNA (or other probe) to a target DNA which is divergent from the starting DNA. A 1% sequence divergence typically lowers the Tm of a duplex by 1-2 0 C, and the DNAs encoding homologous proteins of different species typically have sequence identities of -round 50-80%. F-rferably, the library is screened under conditions wher- :he te:.-erature is at least more preferably at least 50 0 below the perfect duplex Tm. Since salt reduces the Tm, one ordinarily would carry out the search for DNAs encoding homologous proteins under low salt hybridization conditions, <1M NaC1.

For the use of probes to identify homologous genes in other species, see, Schwinn et al. (1990) (hamster 67-bp cDNA probe vs. human leukocyte genomic library; human 0.32kb DNA probe vs. bovine brain cDNA library, both with hybridization at 420C in 6xSSC); Jenkins et al. (1990) (Chicken 770-bp cDNA probe vs. human genomic libraries; hybridization at 40 0 C in 50% formamide and 5xSSC); Murata et al. (1992) (1.2-kb mouse cDNA probe v. human eosinophl cDNA library; hybridization at 65°C in 6xSSC); Guyer et al. (1990) (2.95-kb human genomic DNA probe vs. porcine genomic

DNA

library; hybridization at 42 0 C in Hybridization conditions for hybridizing to a homologous sequence which encodes a protein which has, for example, at least 70% identity to the amino acid sequence encoded by the probe sequence SEQ ID NO:5) is well within the skill of those in the art, particularly in view of numerous reference texts on hybridization, such as Ausubel et al, Current Protocols in Molecular Biology, Chapter 6, John Wiley Sons, Inc. (1987-1998).

If human pcCMT and the cognate mammalian protein are immunologically cross-reactive, antibodies raised against the human pcCMT may be used in a second approach to identifying genes encoding cognate mammalian proteins. In that approach the library is caused to express the polypeptides encoded by the cloned inserts, and the expression products are screened using the antibodies raised against human pcCMT. If the proteins are cross-reactive, the cognate mammalian protein will be detected and the corresponding gene can be recovered.

In order to derive guidance from the sequences of cognate pcCMT proteins, a homology or alignment score for each possible alignment is calculated, and the highest such score for each pair of cognate pcCMT proteins is determined. Global alignment algorithms consider both complete sequences in generating similarity scores for a given alignment, and, in general, allow "gapping". They are most appropriate when the sequences are known or expected to be similar over their entire length. Local alignment algorithms, however, search 22 for similar fragments of two sequences, and, in general, do not allow gaps. They are useful in locating common subdomains between long sequences that otherwise share little similarity.

Binding sites may also be identified by mutagenesis strategies designed to locally perturb the protein. One such strategy is alanine scanning mutagenesis. In this technique, all non-alanine residues of the protein (or of a region of the protein suspected to contain the binding site are replaced, one-by-one, with alanine, yielding a collection of single substitution mutants. Alanine is used because it is the most common amino acid residue in proteins, it has a small side chain, and therefore is not likely to sterically hinder other residues, and its side chain (-CH 3 does not form Hbonds, but is not especially hydrophobic. Cunningham and Wells (1989) conducted an Ala scanning mutagenesis study of residues 2-19, 54-74, and 167-191 in hGH. A total of 62 Ala mutations were produced. Of these, fourteen mutants could not be produced in quantities sufficient for affinity testing.

Presumably, these mutations globally destabilized the protein, rendering it vulnerable to proteolysis. Eleven mutants seemingly enhanced binding, although it is unclear which improvements were significant. Of the remaining 37 mutants, only four impaired binding by 10-fold or more, and only nine by 5-fold or more. See generally Genentech, W090/04788.

For other uses of Ala-scan mutagenesis, see Yu et al. (1995) (complete scan of a single disulfide derivative of the 58-residue protein BPTI); Allen et al. (1987) (Ala-scan of residues 52-61 of hen egg white lysozyme); Ruf et al. (1994) (Ala-scan of residues other than Gly, Pro and Cys; multiple Ala mutants examined first, then single Ala mutants); Williams et al. )95) (Ala-scan i- insulin receptor of chargamino a is, aromatic residues, and residues adj. :nt to or other than prolines, cysteines, or potential Nlinked glycosylation sites); Kelly and O'Connell, Biochemistry, 32:6828-35 (Ala-scan of antibody CDR). Alascanning mutagenesis may be applied to all residues of a protein, or to residues selected on some rational basis, such as amino acid type charged and aromatic residues), degree of variability in a homologous protein family, or relevance to function as shown by homologue-scanning mutagenesis.

Preferably, further mutations (especially nonconservative mutations) are made at sites where an alanine substitution does not worsen the activity of interest by more than 20-fold, more preferably, by more than 10-fold, even more preferably, by more than 5-fold, still more preferably, by more than 2-fold. Most preferably, mutations are made at sites at which an alanine substitutions improves activity.

Preferably, if multiple mutations are made, the expected (additive) effect of the mutations is one which does not worsen the activity more than 10-fold, more preferably, by more than 5 fold, still more preferably, by more than two fold. Most preferably, the expected effect is to improve activity. The expected effect of a conservative substitution is the effect of that mutation as a single substitution if known, or otherwise neutral. The expected effect of a nonconservative substitution is the effect of that mutation as a single substitution if known, or otherwise the effect of a single substitution of a different residue of the same exchange group as the actual replacement residue, if known, or otherwise the effect of a single Ala substitution.

Another approach is homologue-scanning mutagenesis.

This involves identifying a homologue which can be distinguished in an activity assay from the protein of interest, and screening mutants in which a segment of the protein of interest is replaced by corresponding segments of the homologue (or vice versa). If the replacement alters the activity of the modified protein, the segment in question presumably contributes to the observed difference in activity between the protein of interest and the homologous protein, and comparison of the interchanged segments helps to explain the character of the binding site involved in that activity.

For example, segments of prolactin, which does not bind the GH receptor, have been used to replace segments of growth hormone, which does. If a substitution disrupts GH binding, it implies that the replaced segment was part of the GH receptor binding site, and one may then focus on how the replaced and replacing segments differ. See W090/04788.

If a residue is determined to be a part of the binding site, one may prepare all possible single substitution mutants of that site.

It is possible to incorporate two or more tolerable mutations into a protein.

Generally speaking, as a first approximation, it is reasonable to assume that the effect of two mutations will be additive in nature. See Wells (1990); Sandberg and Terwilliger (1993); Gregoret and Sauer (1993); Schreiber and Fersht (1995); Lowman and Wells (1993); Lawman et al. (1991); Lin et al. (1994); Venkatachalam et al. (1994); Akasako et al.

(1995); Behravar et al. (1991); Lin et al. (1994); Zuckerman et al. (1992) Nonadditive effects are more likely to occur between residues that are in Van der Waals contact with each other.

See Sandberg and Terwilliger (1993). According to Schreiber and Fersht (1995), nonadditive effects are more likely to occur between residues less than 7A apart (10A in the case of charged residues). The effect of a second mutation on a first one may be synergistic, additive, partially additive, neutral, antagonistic, or suppressive. Long range but low magnitude departures from additivity may occur reasonably often, see LiCata and Ackers (1995), but do not significantly impair the value of multiple mutation in protein engineering.

Gregoret et al. (1993) assumed that, under selective conditions, the frequency of occurrence of a mutation in an active mutant was an indication of whether the mutant conferred resistance, and found that an additive model (multiplying the mutational frequencies of a pair of single Ala substitution mutants) was about 90% effective in predicting the activity class of a binomial (multiple Ala substitution) mutant.

The most common reason for combining mutations is to benefit from their additive or synergistic effect in combination. For example, if a mutation has both favorable and unfavorable activities, it may be possible to combine it with a second mutation that neutralizes the unfavorable activity of the first mutation.

One use of multiple mutation is to achieve, by combining mutations which individually have a small but favorable effect on activity, a mutant with a more substantial improvement in activity. It is not necessary that the mutations be strictly additive; it is sufficient that they be at least partially additive for the combination to be advantageous. See Blacklow et al. (1991) (improved catalytic effectiveness of triosephosphate isomerase); Akasako et al.

(1995) (multiple thermostabilizing mutations in ribonuclease HI); Lowman et al. (1991) (HGH-receptor binding properties of human placental lactogen improved about 500-fold by five simultaneous, mutations, with "reasonably additive" effects); Lowman and Wells (1993) (HGH-receptor binding properties of HGH improved about 400-fold by combination of substitutions. Sandberg and Terwilliger (1993), reported that there was only a weak correlation between changes in DNA binding protein stability and changes in DNA binding affinity, and hence that it was possible to combine mutations so as to selectively change one property without changing the other.

Watanabe et al. (1994) suggests that increasing the number of proline residues, especially at second sites of beta turns and N-caps of alpha helces, increases the thermostability of the protein in an additive manner.

Gloss et al. (1992), converted all cysteines of a protein to alanine. They point out that this cysteine-free mutant provides a platform onto which uniquely placed cysteine residues may be engineered, thereby allowing the introduction of unnatural amino acids through exploitation of the unique reactivity of the thiol group.

The interactivity of two residues is generally determined by preparing both single substitution mutants as well as a double substitution mutant, and determining whether the effects are additive or not. Therefore, if single Ala substitutions have been shown to favorably or unfavorably affect activity, one may prepare a double Ala mutant and compare its activity to that of the single substitution mutants. While it is certainly possible that two mutations which, by themselves, do not affect activity, may do so when combined, this is unlikely, especially if the sites are not close together.

While in theory one could prepare all possible double Ala mutants, this would mean preparing N(N-1) mutants, where N was the number of non-Ala residues in the protein. In general, one would limit the double substitution studies to sites known to favorably affect the activity. Possibly, one would also consider sites which were strongly unfavorable (to look for antagonistic interactions).

Another approach is binomial Ala-scanning mutagenesis. Here, one constructs a library in which, at each position of interest of a given protein molecule, the residue is randomly either the native residue, or Ala. See Gregoret and Sauer (1993). If it is feasible to screen a library of 1010 mutants, then the combined effects of up to 30 different Ala substitutions (227~1010) can be studied in one experiment.

It should be noted that the Ala:non-Ala ratio at each position may be, but need not be equal. The choice made for this ratio will determine the degree of substitution will predominate, according to a binomial distribution.

If the protein is too large for all sites of interest to be sampled by binomial Ala-scanning mutagenesis in a single experiment, one may divide the protein into segments a i subject each s-gment in turn to such mutagenesis, and then, as a cross-check, similarly mutate one residue from each segment.

Even when mutations are not additive in effect, this is not necessarily undesirable. Green and Shortle, (1993) reported that mutations which individually reduced stability, when not additive in their effects, were almost exclusively subadditive, the reduction in stability was less than that expected by summing the individual destabilizations.

This is credited to an overlap of the "spheres of perturbation" surrounding the two mutations. Ballinger et al.

(1995) reported that a combination subtilisin BPN' mutant had a larger than additive shift in specificity toward dibasic substrates, which is a desirable change.

Certain multiple mutations are worthy of special comment.

Primary shifts: In a primary shift the residue at position n becomes the replacement amino acid at position n+s, or vice versa. For example, instead of Cys at 30, one might have Cys at 31. The result is a mere displacement, rather than a loss, of the amino acid in question. In a primary shift, s (the shift distance) is most often equal to one, but may be two, three or more. The greater the value of s, the more the shift resembles an ordinary double mutation.

Primary transpositions: In a primary transposition, the residues at positions n and n+s in the primary amino acid sequence are swapped. Such swaps are less likely to perturb the protein than the individual replacements, examined singly, might suggests. A primary transposition is, in effect, a combination of two complementary shifts.

Secondary Transposition: Here, two amino acids which interact as a result of the folding of the protein are swapped. A classic example would be members of a salt bridge.

If there is an Asp in one segment forming a salt bridge with a Lys in another segment, the Asp and Lys can be swapped, and a salt bridge can still form.

Coordinated Replacement: Here, replacement of residue x is coordinated with replacement of residue y. Thus, replacement of one Cys may be coordinated with replacement of a second Cys with which it otherwise forms a disulfide bond, and if one amino acid of a pair forming a salt bridge is replaced by an uncharged the other may likewise be replaced.

Techniques of detecting coordinated amino acid changes in families of homologous proteins are discussed in Altschuh et al. (1988) Primary shifts, primary transpositions, secondary transpositions and coordinated replacements are more likely to be tolerated than other multiple mutations involving the same individual amino acid changes.

The amino acid sequence alignment for homology between the deduced amino acid sequence of human pcCMT and related genes and sequences is shown in Figure 2. Amino acid identity between human pcCMT and other gene pXoducts in Figure

I

2 is indicated with dark shading. Those of skill in the art reviewing the amino acid sequence alignment shown in Figure 2 will understand the positions where amino acid substitutions, deletions, additions can be made with the highest expectation of retaining the enzymatic activity of the native enzyme. For instance, it can be seen at amino acid positions where the areas of dark shading are mixed with areas of no shading in Figure 2, or in which there is no dark shading, that conservative amino acid substitutions may be predicted to still substantially retain the carboxyl methyltransferase activity of the human pcCMT, especially where there are alternative amino acids shown in the non-shaded area at a conserved residue position having like properties in protein structure. Amino acid residues at positions in which there is no dark shading show no conservation, identity, homology), and in which the charge, hydrophobicity, steric properties, etc., of the amino acid residue are predicted not to substantially affect the enzymatic activity and protein structure of "cCMT, can be substituted with virtually any of the cct,.ion amlno acics. Secondary structure prediction (Fig.

4G), such as routine2v obtained by a Chou-Fasman analysis or other similar analysis, and the other structural predictions shown in Figs. 4A-4F provide those of skill in the art with further information on what amino acids may be appropriate substitutes at a particular residue position.

Preferably, the muteins of SEQ ID NO:5 have at least identity to SEQ ID NO:5, and more preferably at least identity, and most preferably at least 95% identity while substantially retaining the carboxyl methyltransferase activity of the human pcCMT of SEQ ID Examples of production of amino acid substitutions in proteins which can be used for obtaining muteins of human pcCMT for use in the present invention include any known method steps, such as presented in U.S. patent RE 33,653, 4,959,314, 4,588,585 and 4,737,462, to Mark et al.; 5,116,943 to Koths et al., 4,965,195 to Namen et al.; 4,879,111 to Chong et al.; and 5,017,691 to Lee et al.; and lysine substituted proteins presented in U.S. patent No. 4,904,584 (Shaw et al.).

At the genetic level, these muteins are generally prepared by site-directed mutagenesis of nucleotides in the DNA encoding the human pcCMT, thereby producing DNA encoding the mutein, and thereafter synthesizing the DNA and expressing the polypeptide in recombinant cell culture. The muteins typically exhibit the same or increased qualitative biological activity as the naturally occurring protein, Ausubel et al.

(1987-1997); Sambrook et al. (1989).

Preparation of a human pcCMT mutein in accordance herewith, or an alternative nucleotide sequence encoding the same polypeptide but differing from the natural sequence due to changes permitted by the known degeneracy of the genetic code, can be achieved by site-specific mutagenesis of DNA.

Site-specific mutagenesis allows the production of muteins through the use of specific oligonucleotide sequences that encode the DNA sequence of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. Typically, a primer of about 20 to 25 nucleotides in length is preferred, with about 5 to 10 complementing nucleotides on each side of the sequence being altered. In general, the technique of site-specific mutagenesis is well known in the art, as exemplified by publications such as Adelman et al. (1983), the disclosure of which is incorporated herein by reference.

As will be appreciated, the site-specific mutagenesis technique typically employs a phage.vector that exists in both a single-stranded and double-stranded form.

Typical vectors useful in site-directed mutagenesis include vectors such as the M13 phage, for example, as disclosed by Messing et al. (1981), the disclosure of which is incorporated herein by reference. These phage are readily available commercially and their use is generally well known to those skilled in the art. Alternatively, plasmid vectors that contain a single-stranded phage origin of replication (Veira et al., 1987) may be employed to obtain single-stranded DNA.

In general, site-directed mutagenesis in accordance herewith is performed by first obtaining a single-stranded vector that includes within its sequence a DNA sequence that encodes the relevant polypeptide. An oligonucleotide primer bearing the desired mutated sequence is prepared synthetically by automated DNA/oligonucleotide synthesis. This primer is then annealed with the single-stranded DNA-sequence-containing vector, and subjected to DNA-polymerizing enzymes such as E.

coli polymerase I Klenow fragment, to complete the synthesis of the mutation-bearing strand. Thus, a mutated sequence and the second strand bears the desired mutation. This heteroduplex vector is then used to transform appropriate cells, such as E. coli JM101 cells, and clones are selected that include recombinant vectors bearing the mutated sequence arrangement.

After such a clone is selected, the mutated human pcCMT may be removed and placed in an appropriate vector, generally a transfer or expression vector of the type that may be employed for transfection of an appropriate host.

A recombinant DNA molecule encoding the mammalian pcCMT according to the present invention can be obtained by cDNA cloning using the characterized human pcCMT gene as the starting DNA to isolate cognate mammalian pcCMT as disclosed above in the discussion on cognate mammalian proteins. This recombinant DNA is then incorporated into a replicable expression vector such as a plasmid capable of autonomous replication or chromosomal integration in recipient host cells. Standard reference works setting forth the general principles of recombinant DNA technology include Ausubel et al. (1987-1997), Watson et al. (1987); Darnell et al. (1986); Lewin, (1985); Old et al. (1981); and Sambrook et al. (1989).

These references are hereby incorporated by reference.

In order to be capable of expressing the mammalian pcCMT, an expression vector should have specific nucleotide sequences containing transcriptional and translational regulatory information linked to the DNA coding for the mammalian pcCMT in such a way as to permit gene expression and production of the protein. First, in order for the gene to be transcribed, it must be preceded by a promoter recognizable by RNA polymerase, to which the polymerase binds and thus initiates the transcription process.

A DNA is said to be "capable of expressing" a polypeptide, such as pcCMT, if it contains nucleotide sequences which contain transcriptional and translational regulatory information and such sequences are "operably linked" to nucleotide sequences which encode the polypeptide.

An operable linkage is a linkage in which the regulatory

DNA

sequences and the DNA sequence sought to be expressed are connected in such a way as to permit gene expression. The regulatory regions needed for gene expression in general include a promoter region as well as the DNA sequences which, when transcribed into RNA, will signal the initiation of protein synthesis. Such regions will normally include those 5'-non-coding sequences involved with initiation of transcription and translation. There are a variety of promoters in common use for high level protein expression in mammalian and insect cell systems.

The DNA molecule containing the nucleotide sequence coding for mammalian pcCMT of the invention, and the operably linked transcriptional and translational regulatory signals can be inserted into a vector which either transiently expresses the mammalian pcCMT in host cells or is capable of integrating the desired gene sequences into the host cell chromosome. In order to be able to select the cells which have stably integrated the introduced DNA into their chromosomes, one or more markers which allow for selection of host cells which contain the expression vector is used. The marker may provide for prototropy to an auxotrophic host, biocide resistance, resistance to antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene can either be directly linked to the DNA gene sequences to be expressed, or introduced into the same cell by cotransfection. Additional elements may-alto be needed for optimal synthesis of mRNA. These elements may include splice signals, as well as transcription promoters, enhancers, and termination signals. cDNA expression vectors incorporating such elements include those described by Okayama (1983).

Expression vectors which are capable of transiently expressing a high level of the desired protein when transfected into eukaryotic host cells, are well-known in the art and are generally either publicly available or commercially available from molecular biology suppliers (e.g, plasmid pcDM8, is available from Invitrogen, San Diego, CA).

Once the vector or DNA sequence containing the construct(s) has been prepared for expression, the expression vector may be introduced into an appropriate host cell by any variety of suitable means, such as transformation, transfection, lipofection, conjugation, protoplast fusion, electroporation, calcium phosphate precipitation, direct microinjection, etc.

Eukaryotic host cells can be mammalian cells, e.g., human cells, monkey (COS cells), mouse and Chinese hamster ovary Lis, because they provide ;:c3t-translational modifi tionr. to protein molecules inclu g correct folding, correct disulfide bond formation as well as glycosylation at correct sites. However, insect cells, baculovirus, can overproduce polypeptides and can also carry out posttranslational peptide modifications including glycosylation.

Ectopic protein expression in COS, CHO and insect cells are well known in the art and protocols, such as those provided in Ausubel et al. (1987-1997), sections 16.9-16.14, on expression of proteins in insect cells using baculovirus vectors and in mammalian cells, can be suitably used. The transfected host cells expressing mammalian pcCMT are harvested to obtain membrane preparations for determining the carboxyl methyltranferase activity as described in Philips et al.

(1995b).

The method of screening for inhibitors of pcCMT as therapeutic antiproliferative and antiiflammatory agents according to the present invention involves providing a membrane preparation, incubating the membrane preparation with substrate, such as described by Philips et al. (1995b) in the presence or absence (control) of a potential pcCMT inhibitor, and determining whether the level of carboxyl methyltransferase activity is blocked/inhibited by the potential inhibitor relative to the control. While the use of membrane preparations of cells associated with a high level of ectopic protein expression offers the advantage of high expression and activity levels in these membrane preparations relative to the low level of carboxyl methyltransferase that are present endogenously in these membrane preparations, membrane preparations with endogenous levels may also be used.

Generally, aside from a low endogenous level of pcCMT activity in the membrane preparation, the prenylcysteine carboxyl methyltransferase, which is present in the membrane preparation used to screen for pcCMT inhibitors according to the method of the present invention, is not native to the source of the membrane preparation.

Potential prenylcysteine carboxyl methyltransferase (pcCMT) inhibitors in at least two ways.

In a first method, cellular membranes or artificial liposomes containing pcCMT are used as a source of enzyme, small molecules such as N-acetyl-S-trans-farnesyl -L-cysteine (AFC), peptides that terminate in a prenylcysteine residue, or prenylated proteins rhoA) serve as substrates, and

S-

adenosyl-L-methionine (SAM) radiolabeled on the methyl group is added as the methyl donor. pcCMT enzymatic activity can then be directly determined by measuring the transfer of the radiolabeled methyl group to the substrate. Candidate pcCMT inhibitors can be included in the reaction and extent of inhibition measured. A high throughput system based on this assay can be designed microtiter plate assay system) to screen a large number of compounds for potential inhibitors.

Although endogenous pcCMT membranes isolated from cells) could, in principle, be used as the source of enzyme in this assay, the use of recombinant human pcCMT offers several advantages. Most important is the ability to overexpress the recombinant enzyme ectopically in mammalian-'Tells such as CHO or COS (as demonstrated in Dai et al.) or in insect cells such as sf9 such that the membrane harvested from the transfected cells offers a markedly enriched, homogeneous, reproducible, and replenishable source of enzyme. Moreover, unlike the endogenous activity that may represent more than one enzyme, the overexpressed recombinant enzyme is fully characterized at the molecular level and would therefore lend itself to mutational analysis useful in rational drug design.

The second method would take advantage of the biology of a model genetic organism, S. cerevisiae. In contrast to the inhibitor screen described above that relies directly on biochemical activity, the S. cerevisiae screen would have a biological readout: the inhibition of mating competence. The ability of MATa strains of S. cerevisiae to mate with MATa strains is absolutely dependent on the production of biochemically active a factor from MATa strains.

MATa strains of S. cerevisiae. that lack a functional pcCMT, i.e. stel4 mutants, are unable to mate (sterile) because the a factor t they produce cannc be c:.rboxyl methylated on prenylcy. ine, a iodification requi---d for biological activity. The laboratory of the present inventors has shown (Dai et al.) that the human pcCMT gene can functionally substitute for the S. cerevisiae STE14 gene (complementation).

Thus, as a method to screen for inhibitors of human pcCMT, a strain of MATa S. cerevisiae that have the human pcCMT gene substituted for the endogenous STE14 locus can be constructed by standard techniques Rothstein, Methods in Enzymology 194:281-301, 1991). This strain would then be competent for mating with MATa cells. Standard mating assays

F.

Sprague, Jr., Methods in Enzymology 194:77-93, 1991) could then be employed in the presence or absence of potential inhibitors of pcCMT. A potential pcCMT inhibitor would then be scored as a compound that blocked the mating of the MATa strain engineered to contain the human pcCMT gene with a MATa tester strain. The standard assays for mating competence include prototrophic growth of diploids on media nonpermissive for either haploid strain or assays of a factor activity such as the halo assay whereby a factor produced by a patch of MATa cells inhibits the growth of nearby MATa tester cells spread as a lawn. The advantage of this screen is that it is exceedingly cheap and simple and therefore perfectly suited to high throughput screening. The anti-pcCMT activity of any potential inhibitor identified by the S. cerevisiae mating screen could then be verified with the biochemical assay described above. An embodiment of a method of screening for inhibitors of pcCMT using membrane preparations according to the present invention is described in Example 2, where a biotinylated and prenylated peptide substrate for pcCMT, a labeled co-factor, and a membrane preparation containing pcCMT are incubated together in an avidin-coated or streptavidincoated multiwell dish. The biotinylated substrate is bound to the avidin-coating and the action of carboxyl methyltransferase activity on the substrate in the presence of labeled co-factor generates a radiolabeled product which can be separated from the labeled co-factor by washing. The amount of labeled product is then detected, the level of radiolabeled product in avidin- or streptavidin-coated well is determined by measuring cpm, relative to a control. This format allows for rapid high throughput screening for pcCMT inhibitors. Previously published methods of screening for enzyme inhibitors involve the use of test tubes where the product is extracted into heptane and then measured following a tedious demethylation procedure (Pillinger et al., 1994; Shi et al., 1992).

As previously disclosed, it is intended that the membrane preparations containing a pcCMT encompass artificial membrane preparations, such as phospholipid vesicles/liposomes, which reconstitutes membrane extracted pcCMT. A preferred method of reconstituting extracted pcCMT into liposomes is described in Philips et al. (1995b) where pcCMT extracted from membranes with detergent was reconstituted in the presence of diacylphosphatidylcholine and an anionic phospholipid. Residual detergent in the sample was removed with Extractigel (Pierce), a deter gt removing gel.

Small prenylcysteine analogs, such as N-acetyl-Strans,trans-farnesyl-L-cysteine (AFC) and N-acetyl-S-alltrans-geranylgeranyl-L-cysteine (AGGC), are known to be substrates for pcCMT as well as inhibitors of the carboxyl methylation activity of pcCMT (Volker et al., 1995). Other potential inhibitory compounds can be more readily and reliably screened according to the method of the present invention in membrane preparations where a high level of ectopically expressed pcCMT is localized.

Once pcCMT inhibitors are identified and isolated according to the method of the present invention, techniques that are well-known and commonly used by pharmaceutical companies in drug development may be used to provide stability, enhance cell penetration, the design peptidomimetic inhibitors, etc., such as along the lines of the development of farnesyl-protein transferase -inhibitors (Gibbs et al., 1996; Koblan et al., 1996). It is also contemplated that peptides or fragments of pcCMT may serve as an inhibitor of carboxyl methylation in the localization and function of membrane proteins, such as Ras, when overexpressed in the cell. The presence of such peptides in the cytosol may serve as antagonists and prevent the carboxyl methylation. Peptides or fragments of pcCMT can be readily generated for screening by removing successive residues from either or both the Nterminus or C-terminus of pcCMT, or from peptides obtained thereof by enzymatic or chemical cleavage of the polypeptide.

While the most important utility of the pcCMT of the present invention is as a tool in the search for inhibitors which could be used therapeutically in the treatment of inflammation or hyperproliferative disorders such as cancer, as discussed above, other utilities for such proteins would be readily apparent. For example, as the protein has enzymatic activity, it can be used in chemical processes in which carboxyl methylation of appropriate substrates is desired.

Preparations of membranes containing pcCMT, in accordance with the present invention, may be used for carboxyl methylation of natural protein substrates, particularly when a high level of carboxyl methyltransferase activity from the ectopic expression of pcCMT is present in these membrane preparations.

For example, proteins, which are substrates for pcCMT, i.e., Ras protein, nuclear lamins, G proteins, etc., have a prenylcysteine residue at the C-terminus. Such substrates may be methylesterified at the a-carboxyl group by a functional pcCMT to produce the functionally active form. Thus, membrane preparations containing high levels of mammalian pcCMT activity are well-suited for supplying quantities of functional Ras proteins, among others, that can be made commercially available. For a treatment of hyperproliferative and inflammatory disorders, gene therapy at the affected site, e.g. tumor, etc., may be used once a variant of pcCMT that would act as a dominant negative allele is identified.

The protein of the present invention is also useful to raise polyclonal or monoclonal antibodies by techniques well known in the art, one of which is described in the following example. Such antibodies are useful as research tools, for example, in localization of subcellular pcCMT or in otherwise identifying, the presence of the protein in situ or in vitro.

Antibodies to mammalian pcCMT, and to human pcCMT in particular, may also be used to disrupt the action of pcCMT, thereby treating disorders associated with inflammation or hyperproliferation. Additionally, the antibodies specific to pcCMT can be used in methods to detect the presence of, or measure the quantity of pcCMT in a cell or cell extract.

It should be understood that when the term "antibodies" is used with respect to the anti-pcCMT antibody, this is intended to include intact antibodies, such as polyclonal antibodies or monoclonal antibodies (mAbs), as well as proteolytic fragments thereof such as the Fab or F(ab') 2 fragments. Furthermore, the DNA encoding the variable region of the antibody can be inserted into other antibodies to produce chimeric antibodies (see, for example, U.S. Patent 4,816,567) or into T-cell receptors to prodice T-cells with the same broad specificity (see Eshhar et al. 1990; Gross et al., 1989). Single chain antibodies can also be produced and used. Single chain antibodies can be single chain composite polypeptides having antigen binding capabilities and comprising a pair of amino acid sequences homologous or analogous to the variable regions of an immunoglobulin light and heavy chain (linked VH-VL or single chain Both VH and VL may copy natural monoclonal antibody sequences or one or both of the chains may comprise a CDR-FR construct of the type described in U.S. Patent 5,091,513 (the entire contents of which are hereby incorporated herein by reference). The separate polypeptides analogous to the variable regions of the light and heavy chains are held together by a polypeptide linker. Methods of production of such single chain antibodies, particularly where the DNA encoding the polypeptide structures of the Vg and VL chains are known, may be accomplished in accordance with the methods described, for example, in U.S. Patents 4,946,778, 5,091,513 and 5,096,815, the entire contents of each of which are hereby incorporated herein by reference.

A "molecule which includes the antigen-binding portion of an antibody," is intended to include not only intact immunoglobulin molecules of any isotype and generated by any animal cell line or microorganism, but also the antigen-binding reactive fraction thereof, including, but not limited to, the Fab fragment, the Fab' fragment, the F(ab') 2 fragment, the variable portion of the heavy and/or light chains thereof, and chimeric or single-chain antibodies incorporating such reactive fraction, as well as any other type of molecule or cell in which such antibody reactive fraction has been physically inserted, such as a chimeric

T-

cell receptor or a T-cell having such a receptor, or molecules developed to deliver therapeutic moieties by means of a portion of the molecule containing such a reactive fraction.

Such molecules may be provided by any known technique, including, but not limited to, enzymatic cleavage, peptide synthesis or recombinant techniques.

An antibody is said to be "capable of binding" a molecule if it is capable of specifically reacting with the molecule to thereby bind the molecule to the antibody. The term "epitope" is meant to refer to that portion of any molecule capable of being bound by an antibody which can also be recognized by that antibody. Epitopes or "antigenic determinants" usually consist of chemically active surface groupings of molecules such as amino acids or sugar side chains and have specific three dimensional structural characteristics as well as specific charge characteristics.

An "antigen" is a molecule or a portion of a molecule capable of being bound by an antibody which is additionally capable of inducing an animal to produce antibody capable of binding to an epitope of that antigen. An antigen may have one or more than one epitope. The specific reaction referred to above is meant to indicate that the antigen will react, in a highly selective manner, with its corresponding antibody and not with the multitude of other antibodies which may be evoked by other antigens.

In order to predict antigenic epitopes present in pcCMT, the amino acid sequence can be obtained from the cDNA of pcCMT can be inspected visually or analyzed by computer, for example, using the program of PEPTIDESTRUCTURE (Jameson et al., 1988). This program allows determination of hydropathicity values which are then used to determine which peptide sequences within the overall protein sequence are likely to be most immunogenic based on their potential secondary structure. Such peptides may be synthesized chemically, or alternatively, and preferably, by recombinant DNA methods.

Polyclonal antibodies are heterogeneous populations of antibody molecules derived from the sera of animals immunized with an antigen.

Monoclonal antibodies (mAbs) are a substantially homogeneous population of antibodies to specific antigens.

MAbs may be obtained by methods known to those skilled in the art. See, for example Kohler and Milstein/1975); U.S. Patent No. 4,376,110; Ausbel et al. (1988); and Colligan et al.

(1993), the contents of which reference are incorporated entirely herein by reference. Such antibodies may be of any immunoglobulin class including IgG, IgM, IgE, IgA, GILD and any subclass thereof. The hybridoma producing the mAbs of this invention may be cultivated in vitro (see Example 3) or in vivo. High titers of mAbs can be obtained in in vivo production where cells from the individual hybridomas are injected intraperitoneally into pristane-primed Balb/c mice to produce ascites fluid containing high concentrations of the desired mAbs. MAbs of isotype IgM or IgG may be purified from such ascites fluids, or from culture supernatants, using column chromatography methods well known to those of skill in the art.

Chimeric antibodies are molecules, the different portions of which are derived from different animal species, such as those having a variable region derived from a murine mAb and a human immunoglobulin constant region. Chimeric antibodies are primarily used -L reduce immunogenicity in application and to increase yields in production, for example, where murine mAbs have higher yields from hybridomas but higher immunogenicity in humans, such that human/murine chimeric mAbs are used. Chimeric antibodies and methods for their production are known in the art (Cabilly et al. 1984; Morrison et al. 1984; Boulianne et al. 1984; Cabilly et al.

European Patent Application 125.023 (published November 14, 1984); Neuberger et al. 1985; Taniguchi et al., European Patent Application 171496 (published February 19, 1985); Morrison et al., European Patent Application 173494 (published March 5, 1986); Neuberger et al., PCT Application WO 8601533, (published March 13, 1986); Kudo et al., European Patent Application 184187 (published June 11, 1986); Morrison et al., European Patent Application 173494 (published March 5, 1986); Sahagan et al. (1986); Robinson et al., International Patent Publication WO 9702671 (published May 7, 1987); Liu et al.

1987); Sun et al. 1987; Better et al. 1988; and Harlow and Lane, supra. These references are hereby incorporated by reference.

An anti-idiotypic (anti-Id) antibody is an antibody which recognizes unique determinants generally associated with the antigen-binding site of an antibody. An Id antibody can be prepared by immunizing an animal of the same species and genetic type mouse strain) as the source of the mAb with the mAb to which an anti-Id is being prepared. The immunized animal will recognize and respond to the idiotypic determinants of the immunizing antibody by producing an antibody to these idiotypic determinants (the anti-Id antibody). See, for example, U.S. patent No. 4,699,880, which is herein entirely incorporated by reference.

The anti-Id antibody may also be used as an 2 ,immunogen" to induce an immune response in yet another animal, producing a so-called anti-anti-Id antibody. The anti-anti-Id may bear structural similarity to the original mAb which induced the anti-Id. Thus, by using antibodies to the idiotypic determinants of a mAb, it is possible to identify other clones expressing antibodies of identical specificity..

Accordingly, mAbs generated against the pcCMT protein of the present invention may be used to induce anti-Id antibodies in suitable animals, such as Balb/c mice. Spleen cells from such immunized mice are used to produce anti-Id hybridomas secreting anti-Id mAbs. Further, the anti-Id Abs can be coupled to a carrier such as keyhole limpet hemocyanin (KLH) and used to immunize additional Balb/c mice. Sera from these mice will contain anti-anti-Id antibodies that have the binding properties of the original mAb specific for a pcCMT protein epitope.

As mentioned above, the term "antibody" is also meant to include both intact molecules as well as fragments thereof, such as, for example, Fab and F(ab') 2 which are capable of binding antigen. Fab and fragments lack the Fc fragment of intact antibody, clear more rapidly from the circulation, and may have less non-specific tissue binding than an intact antibody (Wahl et al., 19&31.1 It will be appreciated that Fab and F(ab') 2 and other fragments of the antibodies useful in the present invention may be used for the detection and quantitation of pcCMT protein according to the methods disclosed herein for intact antibody molecules. Such fragments are typically produced by proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2 fragments).

The antibodies, or fragments of antibodies, of the present invention may be used to quantitatively or qualitatively detect the presence of cells which express the pcCMT protein. This can be accomplished by immunofluorescence techniques employing a fluorescently labeled antibody (see below) coupled with light microscopic, flow cytometric, or fluorimetric detection.

The biological sample, such as membrane preparation, may be treated with a solid phase support or carrier (which terms are used interchangeably herein) such as nitrocellulose, or other solid support which is capable of immobilizing cells, cell p-rticles or soluble proteins. e support may then be washed with suitable buffers followe: y treatment with the detectably labeled pcCMT-specific an- ody. The solid phase support may then be washed with the buffer a second time to remove unbound antibody. The amount of bound label on said solid support may then be detected by conventional means.

By "solid phase support or carrier" is intended any support capable of binding antigen or antibodies. Well-known supports, or carriers, include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, gabbros, and magnetite. The nature of the carrier can be either soluble to some extent or insoluble for the purposes of the present invention. The support material may have virtually any possible structural configuration so long as the coupled molecule is capable of binding to an antigen or antibody.

Thus, the support configuration may be spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, or the external surface of a rod. Alternatively, the surface may be flat such as a sheet, test strip, etc. Those skilled in the art will know many other suitable carriers for binding antibody or antigen, or will be able to ascertain the same by use of routine experimentation.

The binding activity of a given lot of anti-pcCMT antibody may be determined according to well known methods.

Those skilled in the art will be able to determine operative and optimal assay conditions for each determination by employing routine experimentation.

Other such steps as washing, stirring, shaking, filtering and the like may be added to the assays as is customary or necessary for the particular situation.

One of the ways in which the pcCMT-specific antibody can be detectably labeled is by linking the same to an enzyme and use in an enzyme immunoassay (EIA). This enzyme, in turn, when later exposed to an appropriate substrate, will react with the substrate in such a manner as to produce a chemical moiety which can be detected, for example, by spectrophotometric, fluorimetric or by visual means. Enzymes which can be used to detectably label the antibody include, but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, betagalactosidase, ribonuclease, urease, catalase, glucose-6phosphate dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be accomplished by colorimetric methods which employ a chromogenic substrate for the enzyme. Detection may also be accomplished by visual comparison of the extent of enzymatic reaction of a substrate in comparison with similarly prepared standards.

Detection may be accomplished using any of a variety of other immunoassays. For example, by radioactively labeling the antibodies or antibody fragments, it is possible to detect pcCMT protein through the use of a radioimiMnoassay

(RIA)

(Chard, 1978, incorporated by reference herein). The radioactive isotope can be detected by such means as the use of a gamma counter or a liquid scintillation counter or by autoradiography. Radioactively labeled antibodies or antibody fragments can also be used for their capacity to kill cells bound by such antibodies, or cells in the immediate vicinity which are exposed to the radiation from such antibodies.

It is also possible to label the antibody with a fluorescent compound. When the fluorescently labeled antibody is exposed to light of the proper wave length, its presence can then be detected due to fluorescence. Among the most commonly used fluorescent labelling compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o-phthaldehyde and fluorescamine.

The antibody can also be detectably labeled using fluorescence emitting metals such as 152 Eu, or others of the lanthanide series. These metals can be attached to the antibody using such metal chelating groups as diethylenetr. aminepentaacetic acid (PTPA) or ethylenediaminetetraacetic acid (EDTA).

The antibody also can be detectably labeled by coupling it to a chemiluminescent compound. The presence of the chemiluminescent-tagged antibody is then determined by detecting the presence of luminescence that arises during the course of a chemical reaction. Examples of particularly useful chemiluminescent labeling compounds are luminol, isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester.

Likewise, a bioluminescent compound may be used to label the antibody of the present invention. Bioluminescence is a type of chemiluminescence found in biological systems in which a catalytic protein increases the efficiency of the chemiluminescent reaction. The presence of a bioluminescent protein is determined by detecting the presence of luminescence. Important bioluminescent compounds for purposes of labeling are luciferin, luciferase and aequorin.

The antibody molecules of the present invention may be adapted for utilization in an immunometric assay, also known as a "two-site" or "sandwich" assay. In a typical immunometric assay, a quantity of unlabeled antibody (or fragment of antibody) is bound to a solid support and a quantity of detectably labeled soluble antibody is added to permit detection and/or quantitation of the ternary complex formed between solid-phase antibody, antigen, and labeled antibody.

Typical, and preferred, immunometric assays include "forward" assays in which the antibody bound to the solid phase is first contacted with the sample being tested to "extract" the antigen from the sample by formation of a binary solid phase antibody-antigen complex. After a suitable incubation period, the solid support is washed to remove the residue of the fluid sample, including unreacted antigen, if any, and then contacted with the solution containing an unknown quantity of labeled antibody (which functions as a "reporter molecule"). After a second incubation period to permit the labeled antibody to complex with the antigen bound to the solid support through the unlabeled antibody, the solid support is washed a second time to remove the unreacted labeled antibody.

In another type of "sandwich" assay, which may also be useful with the antigens of the present invention, the socalled "simultaneous" and "reverse" assays are used.

A

simultaneous assay involves a single incubation step as the antibody bound to the solid support and labeled antibody are both added to the sample being tested at the same time. After the incubation is completed, the solid support is washed to remove the residue of fluid sample and uncomplexed labeled antibody. The presence of labeled antibody associated with the solid support is then determined as it would be in a conventional "forward" sandwich assay.

In the "reverse" assay, stepwise addition first of a solution of labeled antibody to the fluid sample followed by the addition of unlabeled antibody bound to a solid support after a suitable incubation period is u]i3Azed. After a second incubation, the solid phase is washed in conventional fashion to free it of the residue of the sample being tested and the solution of unreacted labeled antibody. The determination of labeled antibody associated with a solid support is then determined as in the "simultaneous" and "forward" assays.

Antibodies or other molecules which include the antigen-binding portion of an antibody may also be used for isolation and purification of pcCMT. Thus, for example, antibodies specific to pcCMT can be immobilized on a solid phase support or carrier with which an impure solution containing pcCMT such as a membrane extract is brought into contact. The pcCMT will bind to the antibodies which are in turn bound to the support while all of the contaminants are washed away. Pure pcCMT can then be eluted from the support by means well-known in the art.

Other utilities will be readily apparent to those of ordinary skill in the art such as, for example, as a tool to assist research into the mechanism of action of the oncogen ras gene.

Having now generally described the invention, the same will be more readily understood through reference to the following examples which are provided by way of illustration and is not intended to be limiting of the present invention.

EXAMPLE 1 In this example, the present inventors report the molecular characterization and the subcellular localization of the first mammalian (human myeloid) pcCMT. Ectopically expressed recombinant pcCMT were shown to have enzymatic activity identical to that observed in neutrophil membranes.

It was unexpectedly discovered by the present inventors that mammalian pcCMT, which is the third of three enzymes that posttranslationally modify C-terminal CAAX motifs and thereby target CAAX proteins to the plasma membrane, was not expressed at the plasma membrane but rather restricted to the endoplasmic reticulum. Using pcCMT tagged with green fluorescent protein (GFP), the present inventors have demonstrated below that pcCMT is expressed in the endoplasmic reticulum but excluded from the plasma membrane that is the target of many CAAX proteins, including Ras. Thus, the final enzyme in the sequence of enzymes that modify CAAX motifs is surprisingly located in membranes topologically removed from the CAAX protein target membrane.

The experimental procedures used in this example are as follows: Northern Blotting: Total RNA (40 pg) from cells grown with or without 1.25% dimethylsulfoxide (Me 2 SO) for days was fractionated on a 1.2% denaturing formaldehyde agarose gel, transferred to a nylon filter, and hybridized with [3P]dATP-labeled full-length HL60 pcCMT cDNA in Express Hybridization solution (Clontech, Palo Alto,CA) at 680C.

A

human multiple-tissue Northern blot (Clontech) was hybridized with the same probe according to-the manufacturer's instructions. The same filters were stripped and rehybridized with a P-actin probe.

Antiserum: A peptide corresponding to deduced amino acids 185-201 of SEQ ID NO:5 (HL60 pcCMT) was synthesized, linked to keyhole limpet hemocyanin, and used to immunize rabbits. The resulting antiserum was used for immunoblotting of neutrophil light membranes prepared by nitrogen cavitation and discontinuous sucrose density sedimentation (Philips et 48 al., 1991), for immunoprecipitation of lysates of HL60 cells metabolically labeled with L-[SS]methionine, and for indirect immunofluorescence of COS-1 cells.

Expression of Recombinant pcCMT: The 3.6-kb fulllength HL60 pcCMT cDNA and an 852-bp cDNA fragment comprising the ORF were subcloned into the eukaryotic expression vectors pCDM8 and pcDNA3.1 (Invitrogen, Carlsbad, CA). Myc-tagged pcCMT was made by adding a sequence encoding the 10-amino acid epitope of c-Myc recognized by monoclonal antibody 9E10 to the C terminus of pcCMT using patch PCR and subcloning into pcDNA3.1. pcCMT tagged at the C terminus-with GFP was generated by subcloning the pcCMT ORF intQ the expression vector pGFP-N3 (Clontech). For pcCMT enzymatic assays, COS-1 cells were grown in 10-cm dishes and transfected with pcCMT with DEAE-dextran as described (Aruffo et al., 1987). For fluorescence microscopy, exponentially growing COS-1, CHO, and NIH3T3 cells were plated the day prior to transfection at confluence and transfected with 2 Ag of DNA using LipofectAMINE (Life Technologies, Inc.), Gaithersburg,

MD)

according to the manufacturer's instructions. Cells were observed at 24 and 48 hours after transfection.

Prenylcysteine Carboxyl Methyltransferase Assays: COS-1 cells transfected with HL60 pcCMT or vector alone were harvested at 48 hours with 5 mM EDTA and subjected, in parallel with human neutrophils, to nitrogen cavitation and discontinuous sucrose density centrifugation as described (Philips et al., 1991). Light membranes were assayed for pcCMT activity qualitatively by S-adenosyl-L-[methyl-'H] methionine labeling (SDS-polyacrylamide gel electrophoresis and flurography) of Rho GTPases partially purified from neutrophil cytosol and quantitatively by carboxyl methylation of N-acetyl-S-trans,trans-farnesyl-L-cyste-ne (AFC) (heptane partition, alkaline hydrolysis, and measurement of vapor phase ['H]methanol) as described (Philips et al., 1995).

S. cerevisiae Strains, Growth, and stel4 Complementation: Complete (YEPD), synthetic and synthetic dropout media were prepared as described previously (Michaelis et al., 1988), except that dropout medium lacked cysteine. All experiments were performed at 30 0 C. Yeast transformations were carried out by either the lithium acetate method (Ito et al., 1983) or the Elble method (Elble, 1992).

The stel4 deletion strains used are SM1188 (MATuastel4-3::TRPI trpl leu2 ura3 his4 canl) and SM1639 (MATaste14-4::URA3 trpl leu2 his4 cani) (Sapperstein et al, 1994). Plasmid expression of STE14 protein was accomplished by pSM123 7 (CEN URA3 STEI4).

Human pcCMT was expressed in yeast by subcloning the pcCMT ORF into the BamHI SalI sites of the yeast expression plasmid pGl (2A TRPI) that utilizes a GPD promoter and a PGK transcription terminator and polyadenylation signal (Schena et al., 1991). The resulting construct was designated pGl-hCMT 2 TRPI hCMT). Patch mating tests were carried out essentially as described previously (Michaelis et al., 1988).

Briefly, patches of MATa cells grown on selective media were replica plated onto a lawn of the MATa mating tester, SM1068 (lysl), that had been spread on an SD plate. Plates were incubated at 300C for 3 days. Growth of the prototrophic diploids indicated mating.

Fluorescence Microscopy: Live cells that had been transfected with pcCMT-GFP and fixed/permeabilized (2% paraformaldehyde/0.

2 Triton X-100 at 4 0 C or methanol at 0 C) cells stained with 9E10 anti-Myc antibody (Santa Cruz Biotechnology, Santa Cruz,CA), C6 anti-ribophorin I antiserum (provided by Dr. Gert Kriebich, NYU), or the anti-pcCMT antiserum described above, followed by Texas Red-conjugated secondary antisera, were imaged with a Zeiss Axioscope equipped with a Princeton Instruments cooled CCD camera and a KAF 1400 chip or a Molecular Dynamics confocal microscope equipped with an argon laser.

Cloning of Human Prencl1steine Carboxyl Methytransferase (pcCMT) At the time this work was undertaken, the only Sisoprenylcysteine methyltransferase characterized at the molecular level was the Stel4 gene of S. cerevisiae. Attempts by the laboratory of the present inventor at cloning a homologous gene from human DNA by PCR using degenerate oligonucleotide primers based on the amino acid sequence of Stel4 were unsuccessful. Therefore, a search of the GenBank in the Expressed Sequence Tag (EST) Database of the National Center for Biotechnology Information

(NCBI)

(http://www2.ncbi.nlm.nih.gov/dbST/dbest_query.html) for mammalian sequences related to Stel4 was performed using the text string "isoprenylcysteine". This search identified a 426 bp partial cDNA as an expressed sequence tag from a murine placental cDNA library (EST name: mh77d06.rl; GenBank Acc: AA022288; GenBank gi: 1486061) that was noted to have amino acid sequence homology to Stel4. The following strategy was then used to generate a cDNA probe with -which to screen a human cDNA library for a pcCMT gene.

Membranes prepared from HL60 cells (human promyelocytic leukemia cell line obtained from American Type Culture Collection, Rockville, MD) have also been reported to express pcCMT enzymatic activity, and, accordingly, total

RNA

was prepared from HL60 cells and the first strand cDNA was generated with oligo(dt) primers and the reverse transcriptase from Moloney Murine Leukemia Virus. After first strand cDNA synthesis, the original mRNA template was removed by treatment with RNase H, followed by second strand synthesis and PCR amplification of the resultant cDNA was accomplished using Taq DNA polymerase and a pair of primers that were designed based on the EST mh77d06.rl sequence (forward primer: GCCGGACTCAACGCGCTGCTGCTGCTACTCTA-3'(SEQ ID NO:1); reverse primer: 5'-CGTGTACTCCAGGCTGTGATTCAGGAGGAA-3' (SEQ ID NO:2)).

Thirty-five cycles of PCR were-performed using the following conditions: denaturation at 94 0 C for 1 minute; annealing at 50 0 C for 1 minute; extension at 720C for 2 minutes. The PCR products were separated on a 1.5% agarose gel and two bands were observed, the more abundant of which corresponded to a DNA of the predicted size (~250bp).

The more abundant PCR reaction product of approximately 250bp in size was gel purified and sequenced by the dideoxy procedure, and the result revealed a 218 bp sequence (not counting the primers; SEQ ID NO:3) that had high 51 similarity to the murine EST mh77d06.rl sequence. To subclone this amplified cDNA, the amplification product was purified from the PCR reaction by phenol/chloroform extraction and directly inserted into the pCR'"2.1 vector using a TA cloning kit (Invitrogen, San Diego CA). E. coli were transformed with the resulting recombinant plasmid and grown to log phase. The plasmid was isolated with a Mini Prep Kit (Qiagen) and the insert was released by EcoRI digestion. The resultant insert, labeled with ["P]dATP using a random primer extension labeling system (Ambion, Inc., Austin, TX), was used to screen a cDNA library to identify a human pcCMT gene. The screened library was a size-fractionated, unidirectional HL60 cDNA library constructed in the X lambdaZAP® system (Stratagene, La Jolla, CA) and provided by Dr. Philip Murphy of the Laboratory of Host Defenses, National Institute of Allergy and Immunology.

Screening was performed in the E. coli strain XL1-blue (Stratagene). An aliquot of the library was plated at a density of 2000 pfu/150-mm petri dish. A total of 20,000 plaques on ten petri dishes were screened. Two duplicate lifts were made from each petri dish, using nitrocellulose membranes.The lifts were probed with the [2]dATP-labeled 218 bp PCR fragment (SEQ ID NO:3) as described above.

Hybridization was performed in 5X SSPE, X Denhardt solution, and 0.2% SDS at 56 0 C overnight. The filters were washd 0.1X 0.2X SSC/0.1% SDS at room temperature for 30 minutes, and 0.1X SSC/0.1% SDS at 55 0 C for 40 minutes with two changes and then allowed to dry. Autoradiograms of the filters were analyzed, and plaques were scored as positive only if they appeared on the autoradiograms of duplicate filters. One of the initial 20,000 plaques screened was determined to be positive by this criteria, and a single recombinant phage containing the hybridizing cDNA insert was selected in a secondary screening.

Analysis of the recombinant phage revealed a 3.6 kb insert, and sequencing of the insert (SEQ ID NO:4) revealed an open reading frame (nucleotides 44 to 895 of SEQ ID NO:4) that is predicted to direct the synthesis of a 284 amino acid protein having the amino acid sequence of SEQ ID NO:5 (deduced molecular weight of 30 kD) that exhibits 26% identity to the Stel4 gene product. Hydropathy analysis revealed six hydrophobic sequences that may represent membrane-spanning domains (Fig. This is consistent with the five or six putative membrane-spanning domains of STE14 protein (Saperstein et al., 1994), with recent data demonstrating that STE14 protein is an integral membrane protein (Romano et al., submitted for publication) and with our observation that active pcCMT cannot be extracted from membranes with detergents but can be partially reconstituted in phospholipid vesicles (Pillinger et al., 1994). The-fist 65 amino acids of human pcCMT, containing the first two hydrophobic sequences, are 36% identical to amino acids 750-821 of the human band 3 anion transporter, which represent the llth and 12th of 14 well-characterized membrane-spanning domains, further supporting the hypothesis that human pcCMT is a multiple membrane-spanning protein. Comparison of human pcCMT with all related sequences in the GenBank (Fig. 2) revealed the highest degree of divergence in the N-terminal third of the molecule, suggesting that the catalytic domain is Cterminal. Sequences homologous to the S-adenosylmethionine binding regions described in aspartyl and glutamyl protein carboxyl methyltransferase (Volker et al., 1989) were not apparent, arguing against an evolutionary link to these carboxyl methyltransferases.

Further analysis of the cDNA revealed that the initiation codon is flanked by a Kozak consensus sequence, and that no upstream termination codon was found. A polyA sequence 3' to the opening reading frame was found to be present.

Exoression of Human Prenvlcysteine Carboxyl Methyltransferase To determine whether the HL60 STE14 protein homolog has prenylcysteine carboxyl methyltransferase activity, the cDNA was subcloned into a mammalian expression vector PCDM8 (Invitrogen, San Diego, CA) and transiently overexpressed ectopically in COS-1 cells. Membranes prepared from these 53 cells were used as a source of recombinant enzyme.

Farnesylated Ras GTPases require detergent for extraction from membranes. Therefore, geranylgeranylated neutrophil cytosolic Rho accessory protein, guanine nucleotide dissociation inhibitor (GDI), were utilized as endogenous substrates in an in vitro assay (Figs. 5A and SB). Untransfected COS-1 cell membranes had little pcCMT activity toward Rho proteins compared with that of membranes derived from human neutrophils (Fig. 5A). Transfection with the HL60 cDNA conferred pcCMT activity toward Rho GTPases on COS-1 membranes, and this activity was blocked by the competitive pcCMT inhibitor,

AFC,

with an EDs 5 (10 M) identical to that for endogenous neutrophil pcCMT. Carboxyl methylation.of Rac2 and RhoA by pcCMT-transfected COS-1 cell membranes was enhanced by guanosine 5'-3-O-(thio)triphosphate (GTPyS) (Fig. 5B), similar to the activity observed in neutrophil membranes (Philips et al., 1993). Carboxyl methyltransferase activity was quantitated in membranes of COS-1 cells transfected with pcCMT using prenylcysteine analogs as defined substrates (Philips et al., 1995). Untransfected COS-1 cell membranes had 23 10% (n 5) of the pcCMT specific activity of neutrophil membranes toward AFC. Transfection of COS-1 cells with HL60 pcCMT resulted in an 18 -53-fold increase in specific AFC carboxyl. methyltransferase activity (Table 1; 3.0 0.8 to 74.5 6.8 pmol/mg-min, n 5, p <0.0005).

TABLE 1 Prenylcysteile carboxyl methyltrafisferase activity in membranes of neutrophils and COS cells pcCMT transfectioi 4at1Experiment 2 Experiment 3 Source of Membrane Tncrease Activity Increase Activity Increase 1 14:^1 A I CCVIpmol/mg/min (f old) po/fgml (odi pmO.Lfn/LLU.LJ ,pmol/mg/min l) 1 .03 6" 2 6.5 3.5 11.4 7 GaS Vecto 5.3 1 1.9 1 1.7 1 OS Vct or N 6 12 17.7 9 35.3 21 OS +PCCMTOF NDN 62. 84.6 45 90.7 53 PC M OR tN DI rrni (48h) transfections were Ii *kpCDM8; tpcDMB pcCMT full-length CD1NA; tpCDMB pcCMT ORE OnY performed with DEAE-dextran. Membranes were prepared by nitrogen cavitation followed by discontinuous sucrose density centrifugation. Carboxyl methyltransferase activity was determined using AFC as substrate.

Each experiment represents an independent set of transfections.

Carboxyl methylation of N-acetyl-S-all-transgeranylgeranyl-L-cysteine (AGGC) was increased in parallel with methylation of AFC (18-versus 20-fold increase, n 2), consistent with previous studies demonstrating that a single activity carboxyl methylated both farnesylated and geranylgeranylated substrates (Volker et al., 1991b). The Michaelis constants of the recombinant enzyme, Km 7 pM for AFC and 0.6 pM for AGGC, were similar to those for the endogenous neutrophil enzyme (Pillinger et al., 1994). Thus, the cDNA described above encodes authentic human myeloid pcCMT.

CMT

To determine whether human pcCMT could substitute in vivo for the S. cerevisiae pcCMT, STE14 protein, a complementation analysis was done using mating as a biological readout. The HL60 pcCMT cDNA expressed from a plasmid in a Astel4 yeast strain partially restored the mating phenotype, indicating that a factor could be carboxyl methylated by human pcCMT. Thus, HL60 pcCMT is a functional human homolog of S.

cerevisiae STE14 protein.

Expression of the human pcCMT gene was examined by Northern analysis. HL60 cells expressed two pcCMT mRNAs, a 3.6-kb transcript consistent with the isolated cDNA and a transcript suggesting a related gene or an alternatively spliced message (Fig. 6A). The level of expression of both transcripts was diminished by granulocytic differentiation of cells induced by MeSO, consistent with our observation that HL60 membranes have 5-fold greater specific pcCMT activity than membranes of mature, peripheral blood neutrophils. Both transcripts were ubiquitously expressed in human tissues (Fig. 6B).

To characterize endogenous pcCMT, a polyclonal antiserum was raised against an internal HL60 pcCMT peptide (amino acids.18 4 2 01 of SEQ ID NO:5). Immunoblots of neutrophil and pcCMT-transfected COS-1 cell membranes using this antiserum revealed a 33-kDa protein (Fig. 6C) corresponding to the predicted size of the protein encoded by the HL60 pcCMT cDNA, confirming that this cDNA, which lacked a termination codon 5' of the ORF is, in fact, full-length. The same antiserum immunoprecipitated a 33-kDa protein from cells (Fig. 6C). Although the deduced amino acid sequence of pcCMT reveals a potential N-glycosylation site (Fig. these data argue against glycosylation.

To tag pcCMT for both the study of the biology of pcCMT and the isolation of the enzyme, sequences encoding Cterminal epitope tags (Myc, FLAG, and GFP) were added to the 3'-end of the pcCMT gene as a C-terminal extension. Myc is a widely-used 10-amino acid epitope recognized by the very efficient, commercially available 9E10 monoclonal antibody, and FLAG is an 8-amino acid epitope effi-e-intly recognized by a proprietary monoclonal antibody commercially available from Eastman Kodak. Green fluorescent protein (GFP) is a 27 kDa protein derived from a jellyfish that is intrinsically fluorescent and, therefore, can tag molecules with a fluorescent epitope that can be visualized in living cells.

Table 2 presents the results obtained on the effect of these C-terminal epitope tags on relative enzymatic activity of human pcCMT ectopically expressed in COS cell membranes.

Neither the Myc nor FLAG epitopes as C-terminal extensions of pcCMT affect its carboxyl methyltransferase activity when ectopically expressed in COS cell membranes. This shows that epitope tagged pcCMT can be easily quantitated in cells by immunochemical techniques and the isolation of the enzyme is facilitated because Myc- or FLAG-tagged pcCMT could be easily affinity purified and then enzymatically reconstituted in liposomes to provide a biochemically defined target for screening potential pcCMT inhibitors. The data on GFP-tagged CMT are inconclusive and will be repeated.

TABLE 2 Effect of C-Terminal Epitope Tags (Myc, FLAG, and GFP) on Relative Enzymatic Activity of Human pcCMT Ectopically Expressed in Cos Cell Membranes Experiment Construct #1 #2 #3 Vector Alone 1 1 1 pcCMT 14.1 5.4 19.6 pcCMT-Myc 11.1 pcCMT-FLAG 5.45 18.5 pcCMT-GFP 2.1 Ras is constitutively carboxyl methylated (Clarke et al., 1988) and expressed at the plasma membrane (Willingham et al., 1980), a localization required for its biological activity (Willingham et al., 1980). Like all CAAX proteins, Ras lacks a signal peptide, is synthesized in the cytosol, and modified immediately posttranslationally by a cytosolic prenyltransferase (Casey et al., 1996). Because prenylcysteine carboxyl methylation is catalyzed by an intrinsic membrane protein and represents the last of the three posttranslational modifications of the CAAX cysteine that enhance the affinity of Ras for membranes, the simplest model of plasma membrane targeting predicts pcCMT expression in the target membrane. However, activities associated with the further processing of prenylated proteins, including pcCMT (Stephenson et al., 1992), S-isoprenyl-CAAX high affinity binding (Thissen et al., 1993), S-isoprenyl-CAAX protease (Hancock et al., 1991; Ma et al., 1992), and palmitoyltransferase (Kasinathan et al., 1990) activities, have all been reported in microsomal fractions. Furthermore, one of the two S-isoprenyl-CAAX proteases recently identified in yeast has a putative ER retention sequence (Boyartchuk et al., 1997; Fujimura-Kamada et al., 1997), and a double deletion of these genes led to mislocalization of yeast Ras2p to internal membranes and cytosol (Boyartchuk et al., 1997).

Nevertheless, none of these studies excluded expression of prenylcysteine-modifying activities from plasma membranes.

Indeed, the present inventors have reported pcCMT activity in neutrophil subcellular fractions enriched for surface membrane (Pillinger et al., 1994).

To determine the subcellular localization of pcCMT, a recombinant pcCMT tagged at the C terminus with GFP was constructed. In CHO (Figs. 7A-7C), COS-1 (Figs. 7D and 7E), and HIH3T3 (not shown) cells, CMT-GFP was visualized in the ER, Golgi, and nuclear membranes but not in the plasma membrane. This assay afforded a sensitivit. and resolution that allowed visualization of individual ER canaliculi extending to the periphery of the cell (Figs. 7C and 7D) and was, therefore, definitive in excluding expression in the plasma membrane. ER expression was confirmed by colocalization with the ER marker ribophorin I (Fig. 7F), a component of the glycosyltransferase complex restricted to ER.

Because the 27-kDa GFP added to the C terminus of pcCMT might alter its native subcellular localization, the CMT-GFP localization was confirmed with a pcCMT tagged at the C terminus with a 10-amino acid Myc epitope (Fig. 7G). Finally, an anti-peptide antiserum reactive with pcCMT was used to confirm these data by localizing endogenous pcCMT to ER and nuclear membranes in COS-1 cells (Figs. 7H and 71). The absence of Golgi staining for endogenous pcCMT suggests that the Golgi localization of ectopically expressed pcCMT may result from gene overexpression.

These data demonstrate that mammalian pcCMT is an intrinsic membrane protein localized to a compartment topologically removed from the plasma membrane, and show that R s-related, GTPases and that are targeted to the cytoplasmic face of the plasma membrane where they participate in signaling events, complete their posttranslational processing in the ER.

In order to determine the role of carboxyl methylation on membrane targeting of Ras, Ras molecules tagged at their N-termini with the intrinsically fluorescent protein, green fluorescent protein (GFP) was constructed. As predicted, GFP-Nras was localized in the plasma membrane (Fig.

8A). Somewhat surprising was the localization of GFP-Nras in the membrane of the Golgi as well. In contrast, when a Ras mutant with an altered CAAX motif that cannot be farnesylated (NrasC186S) was tagged with GFP, all of the GFP-Nras remained in the cytosol (Fig. 8B), confirming that farnesylation is required for plasma membrane localization.

Farnesyltransferase inhibitors would, therefore, be expected to result in the retention of GFP-Nras in the cytosol. When a carboxyl methyltransferase inhibitor, AFC, was applied to this system, the proper membrane localization of GFP-Nras was blocked and most of the molecules remained cytosolic (Fig.

8C), similar to the GFP-Nras blocked at the level of farnesylation. Since farnesylation and carboxyl methylation are distinct steps in a sequential biochemical process, inhibitors of these enzymes would be expected to have synergistic effects.

It is surprising that proteins, such as Ras, synthesized on free ribosomes, prenylated in the cytosol, and destined form the cytoplasmic leaflet of the plasma membrane, are diverted to the ER for processing. Moreover, the ER restriction of pcCMT suggests that an uncharacterized transport pathway must mediate the translocation of fully processed GTPases from internal membranes to the cell surface.

Such a pathway could utilize the cytoplasmic surface of secretory vesicles, cytosolic accessory molecules analogous to GDI, or a novel transport system.

EXAMPLE 2 Carboxvl Methyltransferase Assay Using Strentavidin-bound Biotinvlated Substrate In a 96 multiwell format, the assay has a capacity for testing five compounds in quadruplicates, with controls, as inhibitors of pcCMT activity. For a 100/l final assay volume, a mixture of a prenylated (farnesylated or geranylgeranylated) peptide substrate (approx. 2/mols of a GlyGlyTyrGlnLysArgAlaCys peptide), which is also biotinylated at its N-terminus, is mixed with.a solution of the test compound. A 45A1 volume of this mixture is transferred to each well in a 96 well REACTI-BIND plate (PJerce) and to which a 40/l/well mixture of S-adenosyl-L- H] -methionine (lCi/10 l of specific activity 56.1 Ci/mmol) with 3.6 ml of a membrane preparation containing pcCMT activity. The complete reaction mixture is then incubated at 37 0 C for one hour. After washing with a wash buffer, 100il of WALLAC OY scintillation fluid (Wallac) is added to all wells and allowed to stand for 45-60 minutes before transferring the scintillation fluid from all the wells into a WALLAC OY plate to measure counts per minute (cpm).

Having now fully described this invention, it will be appreciated by those skilled in the art that the same can be performed within a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation.

While this invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications. This ap ication is intended to cover any variations, uses, or ad. -ations of the i-.ventions following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains and as may be applied to the essential features hereinbefore set forth as follows in the scope of the appended claims.

All references cited herein, including journal articles or abstracts, published or unpublished U.S. or foreign patent applications, issued U.S. or foreign patents, or any other references, are entirely incorporated by reference herein, including all data, tables, figures, and text presented in the cited references. Additionally, the entire contents of the references cited within the references cited herein are also entirely incorporated by reference.

Reference to known method steps, conventional method steps, known methods or conventional methods is not in any way an admission that any aspect, description or embodiment of the present invention is disclosed, taught or suggested in the relevant art.

The foregoing description of the specific embodiments will so fully reveal the general nature of the invention that others can, by applying knowledge within the skill of the art (including the contents of the references cited herein), readily modify and/or adapt for various applications such specific embodiments, without undue experimentation, without departing from the general concept of the present invention. Therefore, such adaptations and modifications are intended to be within the meaning and range of equivalents of the disclosed embodiments, based on the teaching and guidance presented herein. It is to be understood that the phraseology or terminology herein is for the purpose of description and not of limitation, such that the terminology or phraseology of the present specification is to be interpreted by the skilled artisan in light of the teachings and guidance presented herein, in combination with the knowledge of one of ordinary skill in the art.

REFERENCES:

Akasako et al., Biochemistry 34:8115-8122 (1995) Alber et al., Biochemistry 26:3751-3758 (1987) Allen et al., Nature 327:713 (1987) Altschuh et al., Protein Eng. 2:193-199 (1988) Aruffo et al., Proc. Natl. Acad. Sci. USA 84:8573-8577 (1987) Ashby et al., Yeast 9:907-913 (1993) Ausubel et al. (Eds), Current Protocols in Molecular Biology, Greene Publications and Wiley Interscience (New York, NY, 1987-1997) Ballinger et al., Biochemistry 34:13312-13319 (1995) Barbacid, Ann. Rev. Biochem. 56:779-827 (4-97) Behravar et al., Eur. J. Biochem. 198:589-592 (1991) Better et al., Science 240:1041-1043 (1988) Blacklow et al., Biochemistry 30:8470-8476 (1991) Boulianne et al., Nature 312:643-646 (1984) Boyartchuk et al., Science 275:1796-1800 (1997) Cabilly et al., Proc. Natl. Acad. Sci. USA 81:3273-3277 (1984) Casey et al., J. Biol. Chem. 271:5289-5292 (1996) Chard (In: Work et al., Laboratory Techniaues in Biochemistry in Molecular Biology, North Holland Publishing Company (New York 1978) Chelsky, D. et al., Biochemistry 24:6651 (1985) Claesseus et al., Protein Eng. 4:335 (1989) Clarke et al., Proc. Natl. Acad. Sci. USA 85:4643-4647 (1988) Clarke, Ann. Rev. Biochem. 61:355-386 (1992) Colligan et al. (Eds), Current Protocols in Immunology, Green Publishing Assoc., and Wiley Interscience (New York, 1993) Cc...tantino et al., J. Med. Chem. 39:3998-4006 (1996) Darnell et al., Molecular Cell Biology, Scientific American Books, Inc., (New York, 1986) Elble, Biotechniques 13:18-20 (1992) 63 Eshhar et al., Br. J. Cancer Suopl., 10:27-29 (1990) Finegold et al., Proc. Natl. Acad. Sci. USA 88:4448 (1991) Fujimura-Kamada et al., J. Cell Biol. 136:271-285 (1997) Gibbs et al., Cell 65:1-4 (1991) Gibbs et al., Breast Cancer Res. and Treatment 38:75-83 (1996) Glomset et al., Trends Biochem. Sci. 15:139-142 (1990) Gloss et al., Biochemistry 31:32-39 (1992) Green et al., Biochemistry 32:10131-10139 (1993) Gregoret et al., Proc. Nat. Acad. Sci. USA 90:4246-4250 (1993) Gross et al., Proc. Natl. Acad. Sci. USA 86:10024-10028 (1989) Guruprasad et al., Protein En.q 9:849-856 (1996) Guyer et al., J. Biol. Chem. 265:17307-17317 (1990) Hancock et al., Cell 63:133-139 (1990) Hancock et al., EMBO J. 10:641-646 (1991) Harlow et al, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory (1988) Hrycyna et al., Mol. Cell. Biol. 10:5071-5076 (1990) Hrycyna et al., EMBO J. 10:1699-1709 (1991) Huzoor-Akbar et al., Proc. Natl. Acad. Sci. USA 90:868-872 (1993) Imai et al., Mol. Cell. Biol. 17:1543-1551 (1997) Ingrosso et al., J. Biol. Chem. 264:20131-20139 (1989) Ito et al., J. Bacteriol. 153:163-168 (1983) Jenkins et al., J. Biol. Chem. 265:19624-19631 (1990) Jameson et al., CABIOS 4:181-186 (1988) Johnson et al., Crit. Rev. Biochem. Mol. Biol. 29(1):1-68 (1994) Jones et al., Curr. Comm. Molec. Biol. 1986 Kasinathan et al., J. Biol. Chem. 265:5139-5144 (1990) Kelly et al., Biochemistry 32:6828-6835 Kinsella et al., J. Biol. Chem. 267:8934 (1992) Koblan et al., Biochem. Soc. Transactions 24:688-692 (1996) Kohler et al. Nature 256:495-497 (1975) Lawman et al., J. Biol. Chem. 266:10982-10988 (1991) Lee et al., J. Biol. Chem. 268:19192-19195 (1993) 64 Lewin, Genes II, John Wiley Sons (New York, 1985) Levitt et al., J. Mol. Biol. 226:507 (1992) LiCata et al., Biochemistry 34:3133-3139 (1995) Lim et al., Biochemistry 31:4324-4333 (1992) Lim et al., Nature 339:31-36 (1989) Lin et al., Proc. Nat. Acad. Sci. USA 91:10265-10269 (1994) Liu et al., Proc. Natl. Acad. Sci. USA 84:3439-3443, 1987 Lowman et al., J. Biol. Chem., 266:10982-8 (1991) Lowman et al., J. Mol. Biol. 234:564-578 (1993) Ma et al., Proc. Natl. Acad. Sci. USA 89:6275-6279 (1992) Maltese, FASEB J. 4:3319-3328 (1990) Marcus et al., Mol. Cell. Biol. 11:3603-361r (1991) Messing et al., Third Cleveland Symposium on Macromolecules and Recombinant DNA, Editor A. Walton, Elsevier, Amsterdam (1981) Michaelis et al, Mol. Cell. Biol. 8:1309-1318 (1988) Morrison et al., Proc. Natl. Acad. Sci. USA 81:6851-6855, 1984 Murata et al., J. Exo. Med. 175:341-351 (1992) Neuberger et al., Nature 314:268-270, 1985 Okayama, Mol. Cel. Biol. 3:280 (1983) Old et al., Princioles of Gene Manioulation: An Introduction to Genetic Encineering, 2d edition, University of California Press (Berkeley, CA, 1981) Oliff et al., Sci. American 275:144-149 (1996) Overington et al., Protein Science 1:216-226 (1992) Paris et al., Biochemistry 35:8473-8477 (1996) Philips et al., J. Biol. Chem. 266:1289-1298 (1991) Philips et al., Science 259:977-980 (1993) Philips et al., Proc. Natl. Acad. Sci. USA 92:2283-2287 (1995a) Philips et al., Meth. Enzvmol. 256:49-63 (1995b) Pillinger et al., J. Biol. Chem. 269:1486-1492 (1994) Rodenhuis, Semin. Cancer Biol. 3:241-247 (1992) Ruf et al., Biochemistry 33:1565-1572 (1994) Sahagan et al., J. Immunol. 137:1066-1074 (1986) Sambrook et al., Molecular Clonina: A Laboratory Manual, Cold Spring Harbor Laboratory (Cold Spring Harbor, NY, 1989) Sandberg et al., Proc. Nat. Acad. Sci. USA 90:8367-83 (1993) Saperstein et al., Mol. Cell. Biol. 14:1438-1449 Schena et al., Methods Enzymol. 194:389-398 (1991) Schreiber et al., J. Mol. Biol. 248:478-86 (1995) Schwinn et al., J. Biol. Che. 265:8183-81 (1990) Seabra, et al., Cell 65:429-434 (1991) Shi et al., J. Biol. Chem. 267:9547-9551 (1992) Silvius et al., Biochemistry 33:3014-3022 (1994) Stephenson et al., J. Biol. Chem. 265:16248-16254 (1990) Stephenson et al., J. Biol. Chem. 267:13314-13319 (1992) Stock et al., Annu. Rev. Biophys. Biophys. Chem. 20:109-136 (1991) Sun et al., Proc. Natl. Acad. Sci. USA 84:214-218 (1987) Thissen et al., J. Biol. Chem. 268:13780-13783 (1993) Unger et al., Proteins 5:335 (1989) Veira et al., Meth. Enzymol. 153:3 (1987) Venkatachalam et al., J. Biol. Chem. 269:23444-23450 (1994) Venkatasubramanian et al., Mol. Immunol. 17:201 (1980) Volker et al., J. Biol. Chem. 266:21515-21522 (1991a) Volker et al., FEBS Lett. 295:189-194 (1991b) Volker et al., Meth. Enzvmol. 250:216-225 (1995) Wahl et al., J. Nucl. Med. 24:316-325 (1983) Watanabe et al., Eur. J. Biochem 226:277-283 (1994) Watson et al., Molecular Biology of the Gene, Vols I and II, The Benjamin/Cummings Publishing Company, Inc., (Menlo Park, CA, 1987) Wells, Biochemistry 29:8509-17 (1990) Williams et al., J. Biol. Chem. 270:3012-3016 (1995) Willingham et al., Cell 19:1005-1014 (1980) Willumsen et al., EMBO J. 3:2581-2585 (1984) Yu et al., J. Mol. Biol. 249:388-397 (1995) Zuckerman et al., Proc. Nat. Acad. Sci. USA 89:4505-4509 (1992) Page(s) are claims pages They appear after the sequence listing(s) SEQUENCE LISTING GENERAL INFORMATION: APPLICANT: PHILIPS, Mark R.

(ii) TITLE OF INVENTION: PRENYLCYSTEINE CARBOXYL METHYLTRANSFERASE, DNA ENCODING SAME,AND A METHOD OF SCREENING INHIBITORS THEREOF (iii) NUMBER OF SEQUENCES: 12 (iv) CORRESPONDENCE ADDRESS: ADDRESSEE: BROWDY AND NEIMARK, P.L.L.C.

STREET: 419 Seventh Street Ste. 300 CITY: Washington STATE: D.C.

COUNTRY: U.S.A.

ZIP: 20004 COMPUTER READABLE FORM: MEDIUM TYPE: Floppy disk COMPUTER: IBM PC compatible OPERATING SYSTEM: PC-DOS/MS-DOS SOFTWARE: PatentIn Release Version #1.30 (vi) CURRENT APPLICATION DATA: APPLICATION NUMBER: FILING DATE:

CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION: NAME: YUN, Allen C.

REGISTRATION NUMBER: 37,971 REFERENCE/DOCKET NUMBER: PHILIPS=1 (ix) TELECOMMUNICATION INFORMATION: TELEPHONE: (202) 628-5197 TELEFAX: (202) 737-3528 INFORMATION FOR SEQ ID NO:1: SEQUENCE CHARACTERISTICS: LENGTH: 32 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION SEQ ID NO:1: GCCCGACTCA ACGCGCTGCT GCTGCT. 'C TA INFORMATION FOR SEQ ID NO:2: SEQUENCE CHARACTERISTICS: LENGTH: 30 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA 67 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: CGTGTACTCC AGGCTGTGAT TCAGGAGGAA INFORMATION FOR SEQ ID NO:3: SEQUENCE CHARACTERISTICS: LENGTH: 218 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: CGGCCGCCTC GCTACCAGAT AGCCATCCGA GCTTGTTTCC TGGGGTTTGT GTTCGGCTGC GGCACGCTGC TAAGTTTTAG CCAGTCTTCT TGGAGTCACT TTGGCTGGTA CATGTGCTCC CTGTCATTGT TCCACTATTC TGAATACTTG GTGACAGCAG TCAATAATCC CAAAAGTCTG TCCTTGGATT CCTTCCTCCT GAATCACAGC CTGGAGTA INFORMATION FOR SEQ ID NO:4: SEQUENCE CHARACTERISTICS: LENGTH: 3597 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: NAME/KEY: CDS LOCATION: 44..895 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: GGCACGAGCG GCGCCGCCGC CCGCTAGTCC GCCGCCCGGC GCC ATG GCG GGC TGC Met Ala Gly Cys GCG GCG CGG GCT CCG CCG GGC TCT GAG GCG CGT CTC AGC Ala Ala Arg Ala Pro Pro Gly Ser Glu Ala Arg Leu Ser 10 TTC CTG CTG GGC GCC TCG GTG CTC GCG CTG CCG CTG CTC Phe Leu Leu Gly Ala Ser Val Leu Ala Leu Pro Leu Leu 30 GGC CTG CAG GGC CGC ACC GGG CTG GCG CTC TAC GTG GCC Gly Leu Gln Gly Arg Thr Gly Leu Ala Leu Tyr Val Ala 45 GCG CTG CTG CTG CTG CTC TAT CGG CCG CCT CGC TAC CAG Ala Leu Leu Leu Leu Leu Tyr Arg Pro Pro Arg Tyr Gln 60 CGA GCT TGT TTC CTG GGG TTT GTG TTC GGC TGC GGC ACG Arg Ala Cys Phe Leu Gly Phe Val Phe Gly Cys Gly Thr 75 CTC GCC Leu Ala 103

ACG

Thr CGC GCC Arg Ala GGG CTC AAC Gly Leu Asn ATA GCC ATC Ile Ala Ile CTG CTA AGT Leu Leu Ser 199 247 295

TTT

Phe

TCA

Ser

AAA

Lys

ACA

Thr

TTT

Phe

CTG

Leu 165

GCT

Ala

CAT

His

TAC

Tyr

AAC

Asn

CGC

Arg 245

GAG

Glu

ATA

AGC CAG Ser Gin TTG TTC Leu Phe AGT CTG Ser Leu GTA GCT Val Ala 135 TGG CCA Trp .Pro 150 ATG GTG Met Val GGC TCC Gly Ser ACT CTG Thr Leu GTC GGG Val Gly 215 CCC ATC Pro Ile 230 GAT CGA Asp Arg GAG TAC Giu Tyr AAG GGG

TCT

Ser

CAC

His

TCC

Ser 120

GCT

Al a

GAA

Glu

GTC

Val

AAT

Asn

GTG

Val 200

TGG

Trp

TGC

Cys

ACA

Thr

CTG

Leu

GTC

TCT

Ser

TAT

Tyr 105

TTG

Leu

CTT

Leu

CTG

Leu

TTC

Phe

TTC

Phe 185

ACC

Thr

TTT

Phe

GGC

Gly

GAA

Giu

GAG

Glu 265

AAG

TGG

Tru 90

TCT

Ser

GAT

Asp

TCT

Ser

AAG

Lys

GGA

Gly 170

AAC

Asn

AGT

Ser

TAC

Tyr

GTC

Val

GAA

Glu 250

TAT

Tyr

GTG

AGT

Ser

GAA

Glu

TCC

Ser

TCT

Ser

CAG

Gin 155

GAA

Glu

CAC

His

GGA

Gly

TGG

Trp

AGC

Ser 235

GAA

Glu

AAG

Lys

GAC

CAC

His

TAC

Tyr

TTT

Phe

TGG

Trp 140

ATT

Ile

TGT

Cys

GTG

Val

GTG

Val

AGT

Ser 220

TAT

Tyr

GAA

Glu

AAG

Lys

CTG

TTT GGC TGG TAC ATG TGC TCC CTG Phe Gly Trp Tyr Met Cys Ser Leu 95 100 TTG GTG ACA GCA GTC AAT AAT CCC Leu Val Thr Ala Val Asn Asn Pro 110 115 CTC CTG AAT CAC AGC CTG GAG TAT Leu Leu Asn His Ser Leu Giu Tyr 125 130 TTA GAG TTC ACA CTT GAA AAT ATC Leu Giu Phe Thr Leu Glu Asn Ile 145 ACC TGG CTC AGT GTC ACA GGG CTG Thr Trp Leu Ser Val Thr Gly Leu 160 CTG AGG AAG GCG GCCOG TTT ACA Leu Arg Lys Ala Ala Met Phe Thr 175 180 GTA CAG AAT GAA AAA TCA GAT ACA Val Gin Asn Glu Lys Ser Asp Thr 190 195 TAC GCT TGG TTT CGG CAT CCT TCT Tyr Ala Trp Phe Arg His Pro Ser 205 210 ATT GGA ACT CAG GTG ATG CTG TGT Ile Giy Thr Gin Val Met Leu Cys 225 GCC CTG ACA GTG TGG CGA TTC TTC Ala Leu Thr Val Trp Arg Phe Phe 240 ATC TCA CTA ATT CAC TTT TTT GGA Ile Ser Leu Ile His Phe Phe Gly 255 260 AGG GTG CCC ACG GGC CTG CCT TTC Arg Val Pro Thr Gly Leu Pro Phe 270 275 TGACGGGCAG TGGCCCCGGT GACCTTGGGG 343 391 439 487 535 583 631 679 727 775 823 871 925 Ile Lys Gly Val Lys Val Asp Leu 280

CCTCCGACCC

TTCTTAATCG

AGAAGGCCTG

GAACATGGGT

AGTTACCTGG

GGATAACCAG

AGGGGTAGCA

CCGAAAAACA

TGTGCAGCCT

TTTT.MTGTCA

AGGACCCAAG

GAAAGACGAG

GGAGGACCGA

GACTCGCTGG

GGTACCCCTT

CAGACCGCTT

GGGACAAAAC

TTAGTCACTC

GCCCCACTGG

TAAGGGGCAA

GGCCACACGC

GAGAGATGGA

CTTATCAGCG

TAACCTCTTT

TGTTTCCGGT

TTCTGGAATG

AGCAGTCTGT

ATCACAGCAA

CACTGCCCCC

CTCTGTCCTC

GTAAAAATGC

ATTTCTGTCC

TGGCCGCTGC

TCACTCAAGA

CCTTATGCCG

TATTCCACAG

GAGGCCAGAG

AGCAACACTC

ATTTACAACC

CCCACTGCAT

CACATGGATT

CCAAGCGGTC

AATCAAGGCG

CGCCCTCCAG

TGTAAGTAAA

CACAGCAGAA

TTTCATTTAA

GAACATCTAT

985 1045 1105 1165 1225 1285 1345 1405

ACAATTTTAA

ACATACCTAT

TTGGTGTTCT

TGGCTGAGTG

TTTCCCCCGG

TCTTTTGATC

AAGTGCTGCT

ATTTCAATGC

CCTCATGTCA

GCAGATTGAC

AGTATGTTTG

TAAGAGGTGA

CCTGCCATGC

GGGGCCAGAG

CCGTGCCTAG

TGTGAGGACT

AATGAAATCT

CAGATACCTA

CTAGATTCAT

CATTGATTGG

GATTGCAGCG

AGTGATTGAC

CAGTGGCAGG

AAATACTTCC

TTTTTTATAA

AGTTCAACTT

GCTTTGTGTG

GGGTCTCAGC

CAAGAAGTGC

CTCCGTTTAT

CAGTCTCAGA

TTTCAAACTG

TCTCAAAAAT

TTTTGCAAGT

GACGCTGGCT

AGTGTTGCGG

CACCACTCTG

ACCTCCCCAT

GAGAAGAACA

GTACTGTATG

AAATGTATGT

GGCTTAATGA

TTTGAAAACC

TGAGCCACCA

TTCTTTTTAG

GCCCGTGGAG

rCATAGGATG

STAGCAGTAC

ITATCCTGCA

kCCGTGGCCC rCCTGTTGTC

TTACTGAATG

TTGGGATTTG

ALCTCTTGTTT

TTTTCCAAAG

AATCTTAAAT

TTGGGAGAGT

TCATTCTGGA

TGGCACTGAA

CTGCCCCACC

CTGTGGAALTG

ATGTCAAATG

AA7AGCTACAC

TCAGAAAAGC

ATTTTTTCAT

AGACAGACGG

CGCCCAGCCA

TTCCATGATC

AATCAGGTTA

TTCTCGATTT

TTCTGAAGTG

GCCTTTCGTG

TGAGCCTCTT

GCTCCTTGAG

GATTGATACG

TGTAAGATCT

ACTGTCCCTT

GCTTTTGTAG

CCTACTCTCT

GCCATCTGCA

CTTTGGCCCT

ACATTCAA.AG

GCAAGTAAGC

CAGGCCAAGG

AGTGATAGGC

CCTGCCCTGT

ATTCTCATTT

TCTGTGTTCT

GGATTTGAGC

ACTGCAAATA

AAGTTGGCTT

CTAGACAGGA

GAAGTGTGAA

ACGCGGCCTG

GGGTCAATAC

TTTTTAATAC

AAAAATGGGC

ATTTTATCAA

TGTTATATAT

CCGGGCACGC

AGATGAACTC

TTGAGATTAT

ATGAGGTAAA

AAAAAACTGT

GAAAATAGGT

AGGGCTiTGCA

AGCAGGACAG

CCTAAAAAGT

ACCGGGGAAA

CGCCAACAGC

CACTGCACTT

TGGTTTGGCA

GGGCCACTGC

GCAGAGGCAG

TCATCTATTT

GGGTATTCCT

CTAGATAACT

GGACCATGGC

GGCTCAA-AGG

GCGTGGCCTT

TACCATAGCT

GTTCTCAAAA

ATATACAACT

GTCCCAACTA

TGAGTCACAC

ACTTGGCCTC

TGTGAAGGCG

TGAGGAGCCA

CACCTACCTC

TCAGATGTGG

AGACATTTGG

CTCAGAAATA

ACCAAAGAGG

CTGTAATCCC

CTTAAGGACA

TTTTAGCTTT

GGCTTTCTGG

CAAGAGATTT

ACTGTCAGAA

GGCCGCTGCT

AAAACAAGGC

AGGCTTTATT

ATGTTCCTAA

TAGCTGCTTA

TCTTCCTGCC

AGGTTAGAAG

TGCAGAGGCC

CCTATCCCAT

AATCATAGCT

AGCTCAATGC

CTACACGATT

CCTGGCTGGC

AGCATCAGTT

AAACATTGAG

TATATTCTCA

TTACATTGTC

ACAAATCCAA

AGATTCAGTC

ATCGAAGCTT

AGCGTGAGAT

GCGTCGGCGC

CTAAACCTTT

ACAGGGGTGT

GAGCGACATC

TTAATTGTGC

TGACTTATTT

CTTACGGGTT

AAAGTGCTGG

GGATTTGGTA

ATAAATTTAG

GTATTTGCTG

ATTTTTCCAT

CAAAGCTACC

GAGTGGCAGT

AGCAGCGCAC

CATCCCTTCT

ACCAGGAAGC

GGAGTACCCC

TTAGGTAGTT

GCCTCGGCCT

GTGGCACTTG

TGCAAGGAGA

1465 1525 1585 1645 1705 1765 1825 1885 1945 2005 2065 2125 2185 2245 2305 2365 2425 2485 2545 2605 2665 2725 2785 2845 2905 2965 3025 3085 3145 3205 3265 3325 3385 3445 CCAAGGCCAC ATCACCAATT

TGCAGGTTTT

TGGAAACAGC

TTACAGAAGA

CTGCCACCCC

GTTCATTTAC

TGCGTTAGCC

CACGATACGC

GGGCTTGCCC

CTCTGTCATG

TCATGGGTTT

AAAGTGGAGA

ATAGAGTGAA

GGTCGCGGGG

TTCACGAGCT

CAACCTGGGG

GATCAGGCTT

ACAGCACACC

ACCCTAGTTT

CTGGGAAGTG

GGAAGACCCA

GGAACTGAAC GGAGTAATTA TTCTACTCTT CTTTTTACAT AAATGTTTAT TTAAATATTC 3505 TAAATTGGAT TTTCATTCAC AGATACTGAT TATTCTTTCC AGTTCTTAA.A TAAAACTGCA 3565 CTTGATTTCA CTCAAAAAAA AAAAAA AA 3597 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 284 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID Met Ala Gly Cys Ala Ala Arg Ala Pro Pro Gly Ser Giu Ala Arg Leu 1 5 10 Ser Leu Ala Thr Phe Leu Leu Gly Ala Ser Val Leu Ala Leu Pro Leu 25 Leu Thr Arg Ala Gly Leu Gin Gly Arg Thr Gly Leu Ala Leu Tyr Val 40 Ala Gly Leu Asn Ala Leu Leu Leu Leu Leu Tyr Arg Pro Pro Arg Tyr 55 Gin Ile Ala Ile Arg Ala Cys Phe Leu Gly Phe Val Phe Gly Cys Gly 70 75 Thr Leu Leu Ser Phe Ser Gin Ser Ser Trp Ser His Phe Gly Trp Tyr 90 Met Cys Ser Leu Ser Leu Phe His Tyr Ser Giu Tyr Leu Val Thr Ala 100 105 110 Val Asn Asn Pro Lys Ser Leu Ser Leu Asp Ser Phe Leu Leu Asn His 115 120 125 Ser Leu Giu Tyr Thr Val Ala Ala Leu Ser Ser Trp Leu Glu Phe Thr 130 135 140 Leu Giu Asn Ile Phe Trp Pro Glu Leu Lys Gin Ile Thr Trp Leu Ser 145 150 155 160 Val Thr Gly Leu Leu Met Val Val Phe Gly Giu Cys Leu Arg Lys Ala 165 170. 175 Ala Met Phe Thr Ala Gly Ser Asn Phe Asn-His Val Val Gin Asn Giu 180 185 190 Lys Ser Asp Thr His Thr Leu V~l Tl-.r Ser Gly Val Tyr Ala Trp Phe 195 2 ul 205 Arg His Pro Ser Tyr Vai Gly Trp Phe Tyr Trp Ser Ile Gly Thr Gin 210 215 220 Val Met Leu Cys Asn Pro Ile Cys Gly Val Ser Tyr Ala Leu Thr Vai 225 230 235 240 Trp Arg Phe Phe Arg Asp Arg Thr Glu Giu Giu Giu Ile Ser Leu Ile 245 250 255 71 His Phe Phe Gly Glu Glu Tyr Leu Glu Tyr Lys Lys Arg Val Pro Thr 260 265 270 Gly Leu Pro Phe Ile Lys Gly Val Lys Val Asp Leu 275 280 INFORMATION FOR SEQ ID NO:6: SEQUENCE CHARACTERISTICS: LENGTH: 4 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: Cys Xaa Xaa Xaa 1 INFORMATION FOR SEQ ID NO:7: SEQUENCE CHARACTERISTICS: LENGTH: 173 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: Pro Gly Tyr Ser Arg Arg Phe Gly Pro Leu Val Arg Arg Arg Leu Phe 1 5 10 Arg Arg Pro Ala His Gly Gly Cys Ala Ala Arg Phe Arg Gly Leu Arg 25 Gly Ala Pro Gin Pro Arg Tyr Ile Pro Pro Gly Arg Leu Gly Ala Arg 40 Ser Ala Ala Ala His Ala Arg Arg Pro Ala Gly Arg Thr Ala Leu Ala 55 Leu Tyr Val Ala Gly Leu Asn Ala Leu Leu Leu Leu Leu Tyr Arg Pro 70 75 Pro Arg Tyr Gin Ile Ala Ile Arg Ala Cys Phe Leu Gly Phe Val Phe 90 Gly Cys Gly Val Leu Leu Ser Phe Ser Gin Ser Ser Trp Asn His Phe 100 105 110 Gly Trp Tyr Val Cys Ser Leu Ser Leu Phe His Tyr Glu Ser Tyr Leu 115 120 125 Val Thr Ala Val Asn Asn Pro Lys Ser Leu Ser Leu Asp Ser Phe Leu 130 135 140 Leu Asn His Ser Leu Glu Tyr Thr Val Ala Ala Leu Ser Ser Trp Ile 145 150 155 160 72 Glu Phe Thr Leu Glu Asn Ile Phe Trp Pro Glu Leu Lys 165 170 INFORMATION FOR SEQ ID NO:8: SEQUENCE CHARACTERISTICS: LENGTH: 288 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: Met Ala Gly Ala Arg Leu Leu Gin Giu Phe Phe Val1 Ala Phe Phe Leu Leu Val 145 Thr Leu Val Tyr Ile 225 Thr Thr Thr Phe Gin Gly Gly Val Leu 130 Giu Trp Arg Gin Ser 210 Gly Val1 Leu Giu Phe Leu Cys Trp Thr 115 Asn Phe Leu Lys Asn 195 Trp Thr Al a Gly Gin Ile Tyr Giy Tyr 100 Ala His Thr Ser Cys 180 Giu Phe Gin Ser Ala Thr Ala Gin Leu Val Met Ser Ile Val 165 Ala Lys Arg Val Trp 245 Ser Leu Ala Val Leu Cys Asn Leu Giu 150 Ile Met Ser His Leu 230 Arg Val Leu Leu 55 Ala Leu Ser Asn Glu 135 Thr Gly Leu Asp Pro 215 Leu Phe Ile Ala 40 Asn Ile Ser Leu Pro 120 Tyr Thr Leu Thr Ser 200 Ser Cys Phe Ser 25 Ala Gly Arg Ile Ser 105 Arg Thr Ile Ile Al a 185 His Tyr Asn Ser Gly Arg Leu Pro Ala Pro Leu Leu Ala Ser 75 Thr Gin 90 Phe Phe Ser Leu Leu Ala Tyr Pro 155 Met Vai 170 Giy Ser Thr Leu Gly Val Pro Leu 235 Giu Arg 250 Val Ser Leu T~eu Giy Arg Leu Leu Phe Leu Ser Pro His Tyr Ser Ile 125 Ala Leu 140 Asp Leu Leu Phe Asn Phe Val Thr 205 'p Phe 220 Cys Leu Ile Glu Ile Thr Ile Leu Gly Trp Ser 110 Asp Ser Lys Gly Asn 190 Ser Tyr Val Glu Val Ser Al a Tyr Phe Lys Giu Ser Ser Gin Giu 175 His Gly Gly Giu 255 Ser Ser Leu Lys Ala Pro Tyr Phe Trp Ile 160 Val Ile Val Ser Asp 240 Glu Phe Ser Leu Ile 260 His Phe Phe Gly Glu 265 Asn Tyr Leu Giu Tyr Lys Lys 270 Lys Val Pro Thr Gly Leu Pro Phe Ile Lys Gly Val Lys Met Glu Pro 275 280 285 INFORMATION FOR SEQ ID NO:9: SEQUENCE CHARACTERISTICS: LENGTH: 293 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: Met Pro Pro Ile Pro Pro Pro Thr Phe Val Gly Arg Ile Ala Phe His Leu Lys Ser Phe Phe Gly Ala Arg Leu Leu Glu Gly Met Phe Thr Leu 130 Leu Ala 145 Phe Pro Cys Ile Leu Ala Leu Val 210 Gly Trp 225 Ile Cys Ser Al a Ile Lys Gly Glu His 115 Arg Ala Glu Ile Phe 195 Lys Phe Cys Asp Val Leu Arg Cys Val 100 Phe Pro Ser Ile Gly 180 Thr Asp Leu Val Asp 260 Asp Val Ala Val Ala Leu Thr Asp Ile Lys 165 Giu His Gly Trp Val 245 Asp Ala Ser Ph.

Val Glu Glu Ser Ser 150 Met Ile Arg Ile Ala 230 Tyr Phe Thr Leu 55 Thr Ala His Phe Phe 135 Trp Arg Phe Leu Tyr 215 Val Ala Arg Val1 40 Leu Asn Val Leu Val 120 Leu Ile Gly Arg Ala 200 Ala Ser Tyr Thr 25 Ser Thr Lys Ser Ser 105 Phe Leu Glu Ile Lys 185 Met Tyr Thr Val Ala Ala Ile Ser Leu 90 His Thr Asn Phe Leu 170 Val Thr Leu Gin Thr 250 Ile Ser Leu Ile 75 Ala Tyr Ala His Leu 155 Trp Gly Lys Arg Ile 235 Trp Asp Thr Ile Leu Tyr Phe Leu Ser 140 Ile Ile Met Arg His 220 Ile His Ala Ser Ala Met Ser Leu Thr 125 Val Glu Gly Val Ser 205 Pro Leu Phe Phe Ser Tyr Pro Val Phe 110 Asn Gly Ala Thr His 190 Asp Gly Cys Phe Met Phe Leu Ala Ser Leu Arg Tyr Tyr Leu 175 Ala His Tyr Asn Ala Al a Val Phe Al a His Ser Arg Trp Phe 160 Gly Gly Arg Phe Pro 240 Ser 255 Arg Ile Tyr Giu Glu Lys Asp Leu Ile Ser Phe Phe 265 Gly Asp Ser 270 74 Tyr Val Glu Tyr Gin Gin Asn Val Trp Cys Gly Val Pro Phe Val Arg 275 280 285 Gly Tyr Gin Arg Pro 290 INFORM4ATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 295 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID Met Ala Pro Asn Ser Thr Pro Pro Pro Thr Phe Phe Gly Arg Ile Val Phe Ile Phe Ala Ala Al a Leu Arg Tyr 145 Asn Phe Ala His Tyr 225 His Al a Leu so Val1 Al a His Ser Arg 130 Trp Phe Gly Gly Thr 210 Phe Leu Ser Val Gly Phe Glu Val1 115 Thr Leu Tyr Cys Leu 195 Leu Gly Thr Phe Gly Glu Leu Gly 100 Phe Leu Al a Pro Ile 180 Ala Ile Trp Ser Asp Thr Val Leu Leu Ser Cys 70 Gly Cys Glu Leu His Phe Gly Pro Ala Ser 150 Glu Ile 165 Ile Gly Phe Thr Asn -Thr Phe Ile 230 Asp Val Ile Ala 40 Ala Ser 55 Glu Phe Ala Val Trp Giu Ser Glu 120 Asp Ser 135 Ile Gly Lys Met Glu Ile Hi 'u Jo Gly Ile 215 Trp Ala Phe 25 Ala Val Ile Al a Tyr 105 Phe Phe Trp Tyr Phe 185 Met Tyr Val Arg Val1 Ile Asn Val 90 Phe Val1 Leu Ile Ser 170 Arg Ala Ala Ser Thr Ala Ile Phe Ala Phe Al a Val Asn 75 Asn Ser Phe Leu Glu 155 Val1 Lys Arg Tyr Thr 235 Ser Leu Gin Leu Arg Thr Lys 140 Phe Leu Val Thr Met 220 Gin Val1 Leu Ile Val Tyr Ala 125 His Leu Trp Gly Arg Ile Thr Val Leu Tyr Phe 110 Leu Ser Ile Ile Met 190 Arg His Val Gly Ser Ala Tyr Met Pro Thr Val Leu Phe Thr Asn Phe Gly Glu Ala 160 Gly Thr 175 Val His Ser Gly Pro Gly Leu Cys 240 Asn Pro Ile Ser Phe Val Ile Tyr Thr Phe Val Thr Trp Arg Phe Phe 255 24S 250 Ala Asn Arg Ile Glu Ile Glu.Glu Lys Asp Leu Ile Ser Phe Phe Gly 260 265 270 Asp Asp Tyr Ala Glu Tyr Gin Arg Lys Thr Trp Ser Gly Val Pro Phe 275 280 285 Ala Arg Gly Tyr Gin Lys Pro 290 295 INFORMATION FOR SEQ ID NO:11: SEQUENCE CHARACTERISTICS: LENGTH: 239 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: Met His Gin Asp Phe Gin Glu Asp Glu His Glu Tyr Pro Asp Ile Arg 1 5 10 Arg Asn Pro Leu His Glu Val Thr Met Thr Ser Tyr Ile Leu Gly Ile 25 Leu Leu Gly Ile Phe Val Gly Leu Phe Pro Gin Ile Arg Phe Lys Asn 40 Phe Asn Leu Phe Ile Ile Ala Leu Ser Leu Phe His Phe Leu Glu Tyr 55 Tyr Ile Thr Ala Lys Tyr Asn Pro Leu Lys Val His Ser Glu Ser Phe 70 75 Leu Leu Asn Asn Gly Lys Ser Tyr Met Ala Ala His Ser Phe Ala Ile 90 Leu Glu Cys Leu Val Glu Ser Phe Leu Phe Pro Asp Leu Lys Ile Phe 100 105 110 Ser Tyr Ser Leu Ala Thr Lys Leu Cys Thr Val Leu Gly Cys Leu Leu 115 120 125 Val Ile Leu Gly Gin Tyr Thr Arg Thr Ile Ala Met His Thr Ala Gly 130 135 140 His Ser Phe Ser His Ile Val Lys Thr Lys Lys Glu Ser Asp His Val 145 150 155 160 Leu Val Lys Thr Gly Val Tyr Ser Trp Ser Arg His Pro Ser Tyr Leu 165 170 175 Gly Phe Phe Trp Trp Ala Ile Gly Thr Gin Leu Leu Leu Leu Asn Pro 180 185 190 Leu Ser Leu Val Ile Phe Ile Phe Val Leu Trp Lys Phe Phe Ser Asp 195 200 205 Arg Ile Arg Val Glu Glu Lys Tyr Leu Ile Glu Phe Phe Ser Ala Glu 210 215 220 Tyr Ile Glu Tyr Lys Asn Lys Val Gly Val Gly Ile Pro Phe Ile 225 230 235 INFORMATION FOR SEQ ID NO:12: SEQUENCE CHARACTERISTICS: LENGTH: 236 amino acids TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: Met Gly Asn Leu His Thr Ser Ile Ala Val Ala Ser Ile Cys Leu Thr 1 5 10 Ser Ala Phe Leu Gly Cys Val Phe Gly Leu Gly Phe Phe Val Trp Ile 25 Ile Tyr Gly Tyr Ser Ile Gly Gly Phe Phe Ala Phe-Leu Ser Leu Phe 40 His Phe Leu Glu Phe Tyr Ile Thr Ala Arg Phe Gln Gly Ser Gin Leu 55 Ser Trp Asp Ser Phe Ile Leu Asn Asn Gly Lys Ala Tyr Trp Leu Ala 70 75 Met Leu Val Gly Leu Leu Glu Cys Leu Leu Ser Gly Gly Lys Ser Phe 90 Ala Lys Val Ile Asn Cys Leu Arg Phe Pro Ser Phe Leu Ile Asn Phe 100 105 110 Ile Phe Ser Val Tyr Gin Thr Ser Ala Leu Gly Phe Leu Cys Leu Gly 115 120 125 Gin Tyr Leu Arg Ser Ser Ala Met Val Gin Ala Gly Gin Ser Phe Ser 130 135 140 His Ile Val Ala Ser Lys Arg Asn Lys Asp His Leu Leu Val Thr Asp 145 150 155 160 Gly Ile Tyr Ala Tyr Val Arg His Pro Ser Tyr Val Gly Phe Phe Ile 165 170 175 Trp Ala Leu Gly Thr Gin Met Leu Leu Gly Asn Phe Val Ser Thr Leu 180 185 190 Leu Phe Ser Leu Val Leu Trp Lys Phe Phe Ser Gin Arg Ile Thr Thr 195 200 205 Glu Glu Ala Tyr Leu Val Ser Phe Phe Gly Asp Ser Tyr Glu Gin Tyr 210 215 220 Arg Lys Lys Pro Ser Gly Ile Pro Leu Ile Pro 225 230 235

Claims

1. An isolated polypeptide, comprising: a) the amino acid sequence of SEQ ID b) an amino acid sequence having at least identity to a) and having prenylcysteine carboxyl methyltransferase (pcCMT) activity either when isolated or when membrane bound; c) a fragment of a) or b) which has pcCMT activity either when isolated or when membrane bound.

2. An isolated polypeptide in accordance with claim 1, comprising an amino acid sequence having at least identity to SEQ ID NO:5 and having pcCMT activity either when isolated or when membrane bound, or a fragment thereof having pcCMT activity either when isolated or when membrane bound.

3. An isolated polypeptide in accordance with claim 1, wherein said molecule is associated with a membrane.

4. An isolated polypeptide in accordance with claim S* 1, comprising a sequence which has at least 85% identity S. 20 to SEQ ID NO:5 and which has pcCMT activity either when Sisolated or when membrane bound, or a fragment thereof which has pcCMT activity either when isolated or when membrane bound.

5. An isolated polypeptide in accordance with claim 25 1, comprising a sequence which has at least 90% identity to SEQ ID NO:5 and which has pcCMT activity either when isolated or when membrane bound, or a fragment thereof which has pcCMT activity either when isolated or when membrane bound.

6. An isolated polypeptide in accordance with claim 1, comprising a sequence which has at least 95% identity to SEQ ID NO:5 and which has pcCMT activity either when m:\specifications\500000\500000\50061 3clmmaw.doc 78 isolated or when membrane bound, or a fragment thereof which has pcCMT activity either when isolated or when membrane bound.

7. An isolated polypeptide in accordance with claim 1, comprising the amino acid sequence of SEQ ID NO:5, or a fragment thereof which has pcCMT activity either when isolated or when membrane bound.

8. An isolated polypeptide in accordance with claim 1, comprising a sequence which has at least 85% identity to SEQ ID NO:5 and which has pcCMT activity either when isolated or when membrane bound.

9. An isolated polypeptide in accordance with claim 1, comprising a sequence which has at least 90% identity to SEQ ID NO:5 and which has pcCMT activity either when isolated or when membrane bound.

An isolated polypeptide in accordance with claim 1, comprising a sequence which has at least 95% identity to SEQ ID NO:5 and which has pcCMT activity either when isolated or when membrane bound. 20

11. An isolated polypeptide in accordance with claim 1, comprising the amino acid of SEQ ID

12. An isolated polypeptide comprising a fragment of the amino acid sequence of SEQ ID NO:5, which fragment is *e capable of inhibiting the carboxyl methylation activity of *o. 25 the prenylcysteine carboxyl methyltransferase (pcCMT) of SEQ ID

13. A membrane on which prenylcysteine carboxyl methyltransferase (pcCMT) is present, said pcCMT being non-native to said membrane and having an amino acid sequence having at least 70% identity to SEQ ID NO:5 and having pcCMT activity, or a fragment thereof having pcCMT activity. m:\specifications\500000\500000\50061 3clmmaw.doc

14. A membrane in accordance with claim 13, wherein said membrane is a membrane of a cell in which said non- native pcCMT is recombinant pcCMT which has been expressed so as to become membrane-bound.

15. A membrane in accordance with claim 14, wherein said non-native pcCMT is expressed onto said membrane at a much higher level than is any native endogenous pcCMT.

16. A membrane in accordance with claim 13, wherein said pcCMT is the polypeptide of SEQ ID NO:5 and said membrane is non-human.

17. A recombinant DNA molecule comprising a nucleotide sequence encoding a prenylcysteine carboxyl methyltransferase (pcCMT) comprising an amino acid sequence selected from the group consisting of a sequence having at least 70% identity to the full_length of SEQ ID and wherein said pcCMT has prenylcysteine carboxyl methyltransferase activity either when isolated or membrane bound.

18. The recombinant DNA molecule according to claim 20 17, wherein the nucleotide sequence encodes the amino acid sequence of SEQ ID S*5.

19. The recombinant DNA molecule according to claim 18, wherein the nucleotide sequence comprises nucleotides 44 to 895 of SEQ ID NO:4. 25

20. An expression vector comprising the recombinant DNA molecule of claim 17.

*21. A host transformed with the expression vector *f .o according to claim 4.

22. A method of screening for and identifying inhibitors of prenylcysteine carboxyl methyltransferase (pcCMT), comprising the steps of: m:\specificatons\500000\500000\50061 3clmmaw.doc providing a mating competent yeast MATa strain stably transformed from a sterile strain lacking a functional pcCMT with a recombinant DNA molecule of claim 22 encoding a pcCMT to ectopically express the pcCMT for mating competence; incubating the yeast MATa strain in the presence or absence of a potential inhibitor of pcCMT; assaying the yeast strain incubated in the presence of the potential inhibitor for mating competence relative to the yeast strain incubated in the absence of the potential inhibitor; and identifying as an inhibitor any potential inhibitor for which the assaying step determines that the mating competence of the yeast strain in the presence of the potential inhibitor is reduced.

23. The method in accordance with claim 22, wherein said recombinant DNA molecule encodes a molecule having pcCMT activity when expressed in said yeast, comprising an amino acid sequence selected from the group consisting of a sequence having at least 70% identity to SEQ ID NO:5 and a fragment thereof having pcCMT activity.

24. A method in accordance with claim 22, further including the step of isolating the inhibitor identified in said identifying step.

25 25. An isolated antibody or antigen-binding fragment thereof specific for the protein consisting of the amino acid sequence of SEQ ID

26. The antibody or antigen-binding fragment thereof in accordance with claim 25, wherein said antibody is a 30 polyclonal antibody. m:\specifications\500000\500000\50061 3clmmaw.doc 81

27. The antibody or antigen-binding fragment thereof in accordance with claim 25, wherein said antibody is a monoclonal antibody.

28. The antibody or antigen binding fragment thereof in accordance with claim 25, wherein said antigen binding fragment is the Fab fragment, the Fab' fragment, the F(ab') 2 fragment or the variable portion of the heavy and/or light chain of said antibody.

29. The antibody or antigen binding fragment thereof in accordance with claim 25, wherein the antibody is a chimeric antibody or a single chain antibody.

A molecule for delivery of a therapeutic or detectable moiety to a protein consisting of the amino acid sequence of SEQ ID NO:5, said molecule consisting essentially of a therapeutic or detectable moiety and an antibody, or an antigen-binding portion thereof, specific for an epitope of the protein consisting of the amino acid sequence of SEQ ID 2

31. A molecule consisting essentially of a chimeric T 20 cell receptor bearing an antigen-binding portion of an antibody specific for an epitope of the protein consisting of the amino acid sequence of SEQ ID

32. A method of screening for and isolating inhibitors of prenylcysteine carboxyl methyltransferase, comprising 25 the steps of: providing a membrane preparation in which a prenylcysteine carboxyl methyltransferase molecule is present, wherein said prenylcysteine carboxyl o methyltransferase molecule is not native to the source of 30 the membrane preparation and/or is overexpressed in said source of the membrane preparation; m:\specifications\500000\500000\50061 3clmmaw.doc 82 incubating the membrane preparation with a substrate of the isolated prenylcysteine carboxyl methyltransferase and with or without a potential inhibitor of prenylcysteine carboxyl methyltransferase; determining the level of prenylcysteine carboxyl methyltransferase activity in the presence of the potential inhibitor relative to the level of prenylcysteine carboxyl methyltransferase in the absence of the potential inhibitor; and isolating as an inhibitor any said potential inhibitor for which said determining step determines that the level of prenylcysteine carboxyl methyltransferase activity in the presence of said potential inhibitor is substantially less than that in the absence of said potential inhibitor.

33. The method in accordance with claim 32, wherein the prenylcysteine carboxyl methyltransferase is not native to the source of the membrane preparation.

34. The method in accordance with claim 32, wherein the prenylcysteine carboxyl methyltransferase is an 20 isolated prenylcysteine carboxyl methyltransferase molecule comprising an amino acid sequence selected from the group consisting of a sequence having at least 70% identity to SEQ ID NO:5 and a fragment thereof, wherein said molecule has prenylcysteine carboxyl methyltransferase activity 25 either when isolated or membrane bound.

35. An isolated polypeptide according to any one of claim 1 to 12 substantially as hereinbefore described with reference to the examples.

36. A membrane according to any one of claims 13 to 30 46 substantially as hereinbefore described with reference to the examples. m:\speciications\500000\500000\50061 3clmmaw.doc 83

37. A recombinant DNA molecule according to any one of claims 17 to 19 substantially as hereinbefore described with reference to the examples.

38. A vector according to claim 20 substantially as hereinbefore described with reference to the examples.

39. A host according to claim 21 substantially as hereinbefore described with reference to the examples.

A method according to any one of claims 22 to 24 and 32 to 34 substantially as hereinbefore described with reference to the examples.

41. An antibody or fragment according to any one of claims 25 to 29 substantially as hereinbefore described with reference to the examples.

42. A molecule according to claims 30 or 31 substantially as hereinbefore described with reference to the examples. Dated this thirtieth day of November 2004 e 20 New York University Patent Attorneys for the Applicant: F B RICE CO **ooo *oo• m:\specifications\500000\500000\50061 3clmmaw.doc