WO1998036064A1 - Phosphonate binding proteins - Google Patents

Phosphonate binding proteins Download PDF

Info

Publication number
WO1998036064A1
WO1998036064A1 PCT/US1998/002709 US9802709W WO9836064A1 WO 1998036064 A1 WO1998036064 A1 WO 1998036064A1 US 9802709 W US9802709 W US 9802709W WO 9836064 A1 WO9836064 A1 WO 9836064A1
Authority
WO
WIPO (PCT)
Prior art keywords
bisphosphonate
protein
val
seq
gly
Prior art date
Application number
PCT/US1998/002709
Other languages
French (fr)
Other versions
WO1998036064A9 (en
Inventor
Jonathan Shaun Cook
Frank Hallock Ebetino
Kenneth John Ibbotson
Xiaohui Ji
Michael John Rogers
Donald Jeremy Watts
Roger Graham Goodwin Russell Russell
Xiao Juan Xiong
Original Assignee
The Procter & Gamble Company
University Of Sheffield Med. School
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The Procter & Gamble Company, University Of Sheffield Med. School filed Critical The Procter & Gamble Company
Priority to AU66540/98A priority Critical patent/AU6654098A/en
Publication of WO1998036064A1 publication Critical patent/WO1998036064A1/en
Publication of WO1998036064A9 publication Critical patent/WO1998036064A9/en

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5091Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing the pathological state of an organism
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/37Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5008Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/90Enzymes; Proenzymes
    • G01N2333/914Hydrolases (3)
    • G01N2333/916Hydrolases (3) acting on ester bonds (3.1), e.g. phosphatases (3.1.3), phospholipases C or phospholipases D (3.1.4)

Definitions

  • the present invention relates to proteins capable of binding bisphosphonate or bisphosphonate analogues, to methods for producing and identifying these proteins, to their corresponding genes and to various uses of the proteins, for example, in therapy and in the screening, isolation, synthesis, design and evaluation of bisphosphonate-based drugs.
  • Bone destruction can result from various cancers and from rheumatoid arthritis. Metabolic bone disorders commonly involve excessive bone resorption and include Paget's disease, hypercalcaemia (both tumour-induced and non-tumour induced), bone metastases and osteoporosis.
  • Paget's disease (a focal increase in bone turnover) is fairly common, and in some countries affects up to 5% of the population over 50 years of age.
  • the disease may be caused by a slow virus infection, and leads to bone pain, deformities and fractures.
  • Bone metastases can induce bone destruction either through local invasion or via the secretion of bone-resorbing agents into the blood stream.
  • Hypercalcaemia can result either from an increase in the flow of calcium from bone or the intestine to the blood, or trom an increase in tubular reabsorption of calcium in the kidney. It can induce a Ai variety of physiological and/or pharmacological disturbances and can b lite threatening.
  • Osteoporosis is characterized by a reduction in the quantity of bone. This loss of bone tissue often causes mechanical failure, and bone fractures frequently occur in the hip and spine of women suffering from postmenopausal osteoporosis.
  • Kyphosis (abnormally increased curvature of the thoracic spine) is another common feature.
  • Secondary osteoporosis is the result of an identifiable disease process or agent, while primary osteoporosis (which constitutes about 90% of all cases) includes postmenopausal osteoporosis, age-associated osteoporosis (affecting a majority of individuals over the age of 70) and idiopathic osteoporosis affecting middle-aged and younger men and women.
  • Bone remodeling occurs throughout life, renewing the skeleton and maintaining the strength of bone.
  • This remodeling involves the erosion and filling of discrete sites on the surface of bones, by an organized group of cells called “basic multicellular units” or "BMUs".
  • BMUs primarily consist of osteoclasts, osteoblasts, and their cellular precursors.
  • bone is resorbed at the site of an "activated" BMU by an osteoclast, forming a resorption cavity. This cavity is then filled with bone by osteoblasts.
  • compositions and methods are described in the medical literature for the treatment of the above-described diseases and conditions, and most attempt to either slow the loss of bone or produce a net gain in bone mass.
  • oestrogen has been used as a means both to prevent and to treat osteoporosis in postmenopausal women.
  • oestrogen has been associated with certain side effects, such as uterine bleeding.
  • the hormone calcitonin has also been used to treat Paget's disease (and to a lesser extent tumour bone disease), and can be effective in decreasing bone turnover.
  • Paget's disease and to a lesser extent tumour bone disease
  • the incidence of relapse is high, and side effects limit the therapeutic usefulness of calcitonin.
  • Inorganic pyrophosphate has high affinity for bone mineral and is able to inhibit the precipitation and dissolution of calcium phosphate crystals in vitro and to inhibit bone mineralization in vivo. These activities are thought to arise from direct physicochemical effects (such as adsorption to hydroxy apatite, inhibition of dissolution of hydroxyapatite and crystal growth inhibition). Pyrophosphate has therefore found application as an antitartar agent for use in toothpastes.
  • inorganic pyrophosphate PPi
  • PPi inorganic pyrophosphate
  • oral administration due to the presence of the relatively labile phosphorus-oxygen bond P-O-P.
  • Bisphosphonates which are characterized by phosphorus-carbon (P-C-P) bonds, are stable analogues of naturally occurring inorganic pyrophosphates which to a great extent overcome the limitations associated with inorganic pyrophosphates. Bisphosphonates are resistant to chemical and enzymatic hydrolysis but retain the therapeutic activity of PPi.
  • bisphosphonates exhibit properties which extend beyond those attributable to purely physicochemical phenomena.
  • bisphosphonates have been found to be inhibitors of osteoclast-mediated bone resorption in organ cultures of bone and in animal models.
  • Bisphosphonates therefore have broader clinical utility than PPi, and have found widespread application in several main clinical areas, e.g., (a) as bone imaging agents for diagnostic purposes, usually in the form of " echnetium derivatives, (b) as anti- resorptive agents to combat bone loss associated with Paget's disease, hypercalcaemia associated with malignancy and bone metastases and osteoporosis, (c) as calcification inhibitors in patients with ectopic calcification and ossification, and (d) as antitartar agents for use in toothpastes.
  • bone imaging agents usually in the form of " echnetium derivatives
  • anti- resorptive agents to combat bone loss associated with Paget's disease, hypercalcaemia associated with malignancy and bone metastases and osteoporosis
  • c as calcification inhibitors in patients with ectopic calcification and ossification
  • antitartar agents for use in toothpastes.
  • Bisphosphonates are now among the most important therapeutic agents for the treatment of pathological disorders of bone metabolism, including osteoporosis. Moreover, since some bisphosphonates appear to have anti-inflammatory as well as anti-resorptive effects in vivo, they may also have utility in the treatment of inflammation and rheumatoid arth_-i ⁇ s. Bisphosphonates described in the literature generally have the structure:
  • bisphosphonates appear to contain a "bone binding moiety” (the P-C-P group) and a “bioactive moiety", R2.
  • the "bone binding moiety” appears to endow the bisphosphonate compound with the ability to adsorb to bone and/or to hydroxyapatite (a model for bone), while the R2 bioactive moiety appears to determine the potency of the bisphosphonate.
  • R ⁇ is a moiety that assists in binding to bone or in bioactivity (as R2), R3 is H or alkyl.
  • R2 is a moiety that assists in binding to bone or in bioactivity
  • R3 is H or alkyl.
  • R ⁇ is a moiety that assists in binding to bone or in bioactivity (as R2)
  • R3 is H or alkyl.
  • Such compounds are described, for example, in U.S. Patent 5,391,743; Published European Patent application 186405 (published July 2, 1986); Published European Patent aplication 298553 (published January 1, 1989); published PCT applications WO 93/05044 (published December 9, 1993), WO 93/04469 (published December 9, 1993), WO 93/04979 (published December 9, 1993), WO 93/04978 (published December 9, 1993), and WO 93/04977 (published December 9, 1993), hereby inco ⁇ orated by reference.
  • bisphosphonate "mimetics” which may have some, or none of the analogue's characteristics, yet bind to the bisphosphonate binding protein, and this may be useful in treating the same maladies as the bisphosphonates.
  • bisphosphonates may affect the differentiation and recruitment of osteoclast precursors or alter the capacity of mature osteoclasts to resorb bone by altering the permeability of the osteoclast membrane to small ions. Another hypothesis is that they act by affecting lysosomal enzyme production or cell metabolism or through toxic effects on osteoclasts (Carano et al., 1990,. J. Gin. Invest. 85:456-461). A further suggestion is that other cells in the bone microenvironment that regulate the activity of osteoclasts are involved in the antiresorptive mechanism.
  • bisphosphonate drugs exert their effect (at least in part) via interaction with specific target proteins (hereinafter referred to as bisphosphonate binding proteins).
  • the present invention also relates to bisphosphonate binding proteins that are useful in therapy and in the identification, isolation and/or design of novel drugs.
  • the invention also relates to isolated DNA encoding the bisphosphonate binding protein (or derivatives, fragments, etc.) thereof.
  • the invention also relates to nucleic acid probes which are selectively hybridizable with the DNA of the invention.
  • the invention also relates to a method for producing a bisphosphonate binding protein.
  • the invention also relates to a method for designing and synthesizing a therapeutically active bisphosphonate or a mimetic thereof using a three-dimensional model of the bisphosphonate binding protein, or the bisphosphonate binding site of such protein
  • the invention also relates to the bisphosphonate binding protein, antibody thereto, mimetic or antagonist thereof for use in therapy, diagnosis or testing, both in vivo and in vitro.
  • the invention also relates to the protein of the invention for use in a method of screening bisphosphonates for therapeutic activity.
  • screening contemplates test kits or screening kits, comprising the protein.
  • the invention also relates to the protein of the invention which is labeled for use in the test kits of the invention.
  • Also contemplated by the invention is a host cell comprising the vector of the invention.
  • Figure 1 Affinity chromatography of a cell extract of Dictyostelium discoideum on an AHBuBP-affinity column.
  • Figure 2 Profile of eluted proteins from the AHBuBP-affinity column.
  • FIG. 3 SEP. ID NO. 1 and SEP. ID NO. 2.: Nucleotide and deduced amino acid sequence of the cDNA for wild-type DPI . Since this cDNA is identical to the sequence of discoidin II, it is clear that DPI is in fact discoidin II.
  • the nucleotide sequence shown is that of the 968-bp cDNA of DPI from wild-type ⁇ discoideum Ax-2. The adenine base of the ATG initiation sequence is assigned as 1 in the numbering. Nucleotides are numbered in the right margin and amino acids on the left. An open reading frame of 771-bp encoding 257 amino acids is shown with a single-letter code for the translated amino acids.
  • a termination codon (TAA) at the end of the coding sequence is marked with an asterisk.
  • a putative signal peptide at the beginning of the amino acid sequence is indicated in italics.
  • the nucleotide sequence of the PCR fragment XJ-450 used to screen the cDNA library is underlined. Multiple polyadenylation signal sequences (AATAAA) are shown in italics.
  • Figure 4 Comparison of the deduced amino acid sequences for DPI (SEQ. ID NO. 3) and discoidin IA (DISC 1A) (SEQ. ID NO. 4). Asterisks indicate positions of identity while dots indicate positions of conservation. Amino acid sequences obtained for peptides of DPI are underlined and the corresponding numbers are shown in parentheses underneath the sequences. Regions used for generating primers XJ-1 (SEQ. ID NO » and XJ-2 (SEQ. ID NO. 6) are in italics. The alignment is performed by use ot the multiple alignment program of CLUSTALV (Higgins et al., 1992, d,mp i rr l Biosci. 8: 189-191).
  • FIG. 5 Northern blot anal ses ot D I mRNA expression in axenic strains. 0.5 ug samples of mRNA were fractionated on formaldehyde gels. After transfer to Hybond-N membrane (Amersham, Bucks. I . ). the samples were hybridized to 3 p . labeled DNA probes.
  • mRNAs on the blot were subjected to successive hybridizations with 32P-labeled PCR fragments of DPI (XJ-450) from sequence ID #1, discoidin I A sequence ID #6 and Dd-tcpl (control). Lane 1 shows axenically- grown Ax-2, lane 2 shows bacterial ly-grown Ax-2.
  • the columns on the right hand side represent the relative abundance of the hybridized mRNA transcripts in the corresponding lanes of the blots.
  • FIG. 6 Southern blot analysis of DPI. Genomic DNA from the wild-type Ax-2 strain is subjected to restriction digestion and the products were separated on a 1% agarose gel (a). After transfer to Hybond-N membrane, the samples were hybridized to the 32P-labeled XJ-450 probe (b) or the discoidin I A probe (c). Note that in each lane there is only a single band of hybridization to the DPI (discoidin II) cDNA XJ-450 whereas there were multiple bands for hybridization to the discoidin IA cDNA probe.
  • Figure 7 Sequence comparison matrices for DPI (horizontal) with (a) human coagulation factor V, Wood et al., Nature 312:330-337 (1984); Jenny, et al., Proc. Natl. Acad. Sci., USA 84:4846-4850 (1987), (b) human coagulation factor VIII, Wood et al., Nature 312:330-337 (1984), (c) ORF7 linked to the Rhodopseudomonas blastica atp operon (Yat7-Rhobl) Tybnlewicz et al., J. Mol. Biol. 179:185-214 (1984) and (d) milk fat globule (MFG) protein Larocca et al.
  • the matrices were plotted using MDM ⁇ g mutation data matrix Pam.250 as the scoring system (Schwartz and Dayhoff, 1978, Atlas of Protein Sequence and Structure. 5: 353-358, National Biomedical Research Foundation, Washington DC), with a window size of 8 and a minimum score of 50%.
  • Figure 8 Hydrophilic plot and secondary structure prediction for DPI.
  • the profile is constructed using the Kyte-Doolittle algorithm (Kyte and Doolittle, 1982, J. Mol. Biol. 157, 105-132) with a sliding window of seven amino acids. Values above the zero axis correspond to hydrophilic segments.
  • Secondary structure predictions based on algorithms of both Chou-Fasman (Adv. Enzymol. Relat. Areas Mol. Bio., Al, 45-148, 1978) (CH, shadowed boxes) and Robson-Garnier (Gamier et al., 1978, J. Mol. Biol. 120, 97-120) (RG. filled boxes) are shown in the lower half of the Figure.
  • Figure 9 Prediction of a probable cleavage site for a signal peptide of DPI by the method of van Heijne (1986) using the programme MacProt (Markiewicz, 1991, BioTechniques 10: 156-151 +160+162-1 >; Luttke, 1990, Comp. Method. Prog. Biomed. 31 :105-1 12).
  • a window size of 15 residues of weight matrix for eukaryotic proteins is used.
  • a protein having a signal peptide has one segment scoring greater than + 3.5 while cytosolic proteins have a score less than + 3.5.
  • Figure 10 Surface probability of the RGD-containing region of DPI .
  • a surface probability profile of residues 76-88 is constructed using the Janin et al. (J. Mol. Biol. 125: 357-386, 1978) and Emini et al. (J. Virol. 55: 836-839, 1985) algorithms.
  • a probability value above 0.5 is assigned to a water-accessible, "exposed” sequence and values below 0.5 to "buried” segments.
  • Figure 1 N-terminal amino acid sequence alignment of hDPl (SEQ. ID NO. 7) and the rat round spermatid 29,000 Mr protein (RSP-29) (SEQ. ID NO. 8) Onoda and Djaliew 1993 A. Asterisks indicate positions of identity, colons indicate positions of conservation and dots positions where the two sequences contain different amino acids.
  • Figure 12 Schematic representation of the amplification of hDPl cDNA by the polymerase chain reaction.
  • Degenerate primers DJ1, DJ5, DJ9 (sense) and DJ10 (antisense) were designed from a knowledge of the peptide sequences of hDPl.
  • the nucleotide sequence of the primer UAP (antisense) had been inco ⁇ orated into the first strand cDNA during reverse transcription using primer oligo(dT)17-AP.
  • Figure 13 Nucleotide sequence and deduced amino acid sequence of hDPl cDNA.
  • the nucleotide sequence of hDPl (SEQ. ID NO. 9 and SEQ. ID NO 10) cDNA isolated from a human testis cDNA library consists of 1161 base pairs. The largest open reading frame consists of 936 bp and contains six ATG codons. The deduced amino acid sequence is shown in a single letter code.
  • the mature hDPl protein appears to be encoded by the nucleotide sequence started from the second ATG codon and contains 260 amino acids (shown in bold type). The adenine base of this second ATG codon and the methionine encoded by this codon are assigned as 1 respectively in the numbering.
  • Nucleotides are numbered in the right margin and amino acids in the left. A putative N-terminal sequence of 48 amino acids starting from the first ATG codon is shown in italics. A stop codon (TGA) at the terminus of the translation sequence is marked with an asterisk. The amino acid sequences of peptides 1 and 2 are underlined.
  • FIG 14 Northern bio * nalyses of hDPl in human tissues.
  • Human multiple tissue blots (I and II) containing 2 ug poly(A )RNA from various tissues were purchased from Clontech (Palo Alto, California).
  • Northern blots III and IV were prepared by running 1 ug of poly(A )RNA in a formaldehyde gel and blotting onto Hybond-N membrane. Hybridization is carried out using a [ P]-labeled PCR fragment of hDPl cDNA.
  • a DNA fragment of 3-act ⁇ n is also labeled with [ P] and hybridized to the same blots.
  • the height of the bar on top of each lane represents the relative abundance of hDPl mRNA in that tissue.
  • Figure 15 Southern analysis of hDPl on a zoo-blot.
  • the Southern blot containing 8 ug of genomic DNA per lane from nine eukaryotic species is purchased from Clontech.
  • the DNA had been digested with ECQRI, run on a 0.7% agarose gel and transferred to a nylon membrane.
  • Hybridization is carried out using a [ P]- labeled PCR fragment of hDPl (SEQ. ID NO. 13) cDNA.
  • Lanes 1-9 contain, in order, genomic DNA from yeast, chicken, rabbit, cow, dog, mouse, rat, monkey and man.
  • Figure 16 Northern blot analyses of hDPl homologues in Dictyostelium discoideum.
  • a [ P]-labeled PCR fragment of hDPl cDNA is hybridized to a northern blot containing 5 ug poly(A )RNA isolated from Dictyostelium amoebae: Lane 1 , strain Ax-2 grown with bacteria; Lane 2 strain Ax-2 grown axenically.
  • Figure 17 Sequence alignment of hDPl.
  • (a) and (b)- show matrix plots of hDPl (horizontal) against ORF3 linked to the Rhodopseudomonas blastica atp operon (ATPase-ORF3, vertical) and ORF1 linked to the genes for arginyl tRNA synthetase and ribonuclease H of Buchnera aphidicola (ATS-ORF1, vertical).
  • the matrices were plotted using MDM ⁇ g mutation data matrix Pam250 as the scoring system (Schwartz and Dayhoff, 1978, Atlas of Protein Sequence and Structure, Vol.
  • FIG. 1 Sequence comparison of hDPl (SEQ. ID NO. 10) with aspartate aminotransferase (AAT) (SEQ. ID NO. 1 1 ) from chicken mitochondria.
  • hDPl is shown in the upper line and DPI in the lower one.
  • " indicates identity between aligned residues; ":” indicates similarity.
  • the comparison is performed using the program "bestfit" of the GCG package ( Devereux et al., 1984, Nucelic Acids Res. 12: 387-395). Identity between the two proteins is 14.3% and the similarity is 44.1%.
  • Figure 19 Hydrophilic plot and secondary structure prediction for hDPl .
  • the hydrophilic profile is constructed using the Kyte-Doolittle algorithm (Kyte and Doolittle, 1982, J. Mol. Biol. 157, 105-132) with a sliding window of seven amino acids. Values above the zero axis correspond to hydrophilic segments. Secondary structure prediction is based on algorithms of both Chou-Fasman (Adv. Enzymol. Relat. Areas Mol. Biol. 47, 45-148, 1978) (CH, shadowed boxes) and Robson- Garnier (Gamier et al., 1978, J. Mol. Biol. 120, 97-120) (RG, filled boxes), shown in the lower half of the Figure.
  • Figure 20 Amino acid sequence comparison of hDPl (SEQ. ID NO. 10) with DPI (SEQ. ID NO. 3) (discoidin II). hDPl is shown in the upper line and DPI in the lower. "
  • Figure 21 Nucleotide and predicted amino acid sequence of DdCyP2 cDNA (SEQ. ID NO. 12). The nucleotide sequence shown is a 685 bp cDNA of DdCyP2 (SEQ. ID NO. 13) isolated from a Dictyostelium discoideum strain Ax2 cDNA library. Nucleotides are numbered in the left margin and amino acids on the right. An open reading frame of 540 bp encoding 180 amino acids is shown using a single letter code for the translated amino acids. A start codon (ATG) and stop codon (TAA) are underlined. An RGD motif is in bold.
  • Figure 22 Alignment of the amino acid sequence of Dd CyP2 and the sequences of selected members of the cyclophilin A family of proteins.
  • the sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group).
  • the sequences of the CyPs were derived from the following sources: (1) Dd CyP2; (2) Dd CyPl (Barisic et al., 1991, Developmental Genetics 12:50-53); (3)
  • Figure 23 (SEQ. ID NO. 13)Alignment of the amino acid sequences of Dd CyP2 and four human cyclophilins. The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences of the CyPs were derived from the following sources: (1) human CyPA (p05092, Haendler et al., 1987, EMBO J. 6: 947-950), (2) human CyPD (p30405, Bergsma et al., 1991, J. Biol. Chem. 266: 23204-23214); (3) Dd CyP2; (4) human CyPB (p23284, Price et al., 1991, Proc. Natl. Acad. Sci. 88: 1903-1907); (5) human CyPC (Schneider et al., 19 '4, Biochemistry 33 : 8218-8224).
  • human CyPA p05092, Haendler e
  • FIG. 24 (SEQ. ID NO. 13)_Alignment of the amino acid sequences of Dd CyP2 and CyPBs.
  • the sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group).
  • the sequences of the CyPBs • were derived from the following sources: ( 1 ) human (p23284, Price et al., 1991, Proc. Natl. Acad. Sci. 88: 1903-1907): (2) mouse (p24369, Hasel et al., 1991, Mol. and Cellular Bio. 11: 3484-3491); (3) chick (p24367, Caroni et al., 1991, J. Biol. Chem.
  • FIG. 25 (SEQ. ID NO. 13) Alignment of the amino acid sequences of two Dd CyPs and some plant CyPs. The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences for the CyPs were derived from the to I l win sources: (1) Arabidopsis thaliana (114844, Lippuner et al., 1994, J. Biol ⁇ hem 269: 7863-7868) (2) tomato (m55019, Gasser et al., 1990, Proc. Natl.
  • isolated is used herein to indicate that the binding protein is substantially isolated with respect to the complex cellular milieu in which it naturally occurs.
  • the absolute level of purity is not critical, and those skilled in the art can readily determine appropriate levels of purity depending upon the intended use for the protein.
  • the isolated bisphosphonate binding protein will form part of a composition, buffer system or pharmaceutical excipient, which may for example contain other components (including other proteins, such as albumin).
  • the bisphosphonate binding protein may be purified to essential homogeneity, for example as determined by PAGE or column chromatography (for example HPLC).
  • bisphosphonate binding protein is used herein to define a protein which can bind bisphosphonate and/or act (either directly, or indirectly) as a target for bisphosphonate(s) in vivo.
  • the bisphosphonate binding proteins of the invention may bind specifically to bisphosphonates, or may exhibit broader binding affinity and bind to other molecules in addition to bisphosphonate (with either the same or with a different affinity).
  • the bisphosphonate binding proteins of the invention may be targets of the physiological and/or pharmacological action of bisphosphonates and may mediate the physiological and or pharmacological effects of bisphosphonates in vivo (such as growth inhibition in Dictyostelium discoideum and antireso ⁇ tive action in humans), either alone or as part of a complex comprising other proteins and/or molecules.
  • the bisphosphonate binding proteins of the invention may be bisphosphonate receptors, and may be involved in signal transduction during a cellular response to bisphosphonate.
  • the bisphosphonate binding protein may also be an enzyme for which bisphosphonates are substrate analogues.
  • bisphosphonate is used herein in a broad sense to cover not onl> bisphosphonates sensu stricto. as defined by the literature above, but also bisphosphonate analogues.
  • Bisphosphonate analogues are those ligands which can compete with osteoactive bisphosphonate for binding to the cellular targets of the osteoactive bisphosphonates, or which can compete in vitro (for example, on an affinity column) with osteoactive bisphosphonate (in either the free state or in the form of a derivative linked to an affinity column) for binding to the binding proteins of the invention.
  • These include HOOC-C-PO H 2 , HO3S-C-PO3H2 RHO3P-C- PO3H2 and the like.
  • Such compounds are disclosed for example in F.
  • R2 is a "bioactive moiety" as described above;
  • Ri is a moiety that assists binding to bone, or assists bioactivity; and each A is individually an acidic moiety, an ester or the like; or any moiety that binds to bone.
  • NE10788 Bisphosphonate analogues include, but are not limited to pyridoxal phosphate (PLP), O-phosphorylethanolamine, O-phosphorylcholine, phosphatidyl ethanolamine and phospholipid bisphosphonate analogues.
  • PRP pyridoxal phosphate
  • O-phosphorylethanolamine O-phosphorylcholine
  • phosphatidyl ethanolamine phospholipid bisphosphonate analogues.
  • Oleoactive bisphosphonates are those bisphosphonates which act as antireso ⁇ tive drugs in vivo.
  • Bosine mimetic is a functional term that describes any molecule which performs similarly to a bisphosphonate in vivo, or in vitro. The structure of such molecules are secondary to their function.
  • the bisphosphonate binding protein is a cyclophilin, a discoidin or a round spermatid protein (or homologue thereof).
  • discoidin is a term of the art, and defines a family of proteins sharing sequence similarity or homology with the discoidin proteins of Dictyostelium (particularly discoidin I) which are cytoplasmic components.
  • discoidin covers discoidin homologues (e.g. mammalian homologues) which share conserved domains characteristic of the discoidin family.
  • cyclophilin is also a term of the art and defines a family of proteins falling into at least two groups: the cyclophilin A group and the cyclophilin B group. There are many cyclophilins described, all showing sequence similarities and many characterized by their ability to bind to the immunomodulatory drug cyclosporin A (CsA). Most cyclophilins possess enzyme activity, being peptidyl prolyl cis-trans isomerases.
  • round spermatid protein (RSP) is used herein to define a family of proteins sharing sequence similarity and/or exhibiting homology with the rat RSP-29 protein (described by Oneda and Djakiew, 1993, Molecular and Cellular Endocrinology 93, 53-61) which is a protein of rat round spermatids.
  • RSP-29 has homologues in many different species, including higher eukaryotes (monkey to chicken), lower eukaryotes .Dictyostelium discoideum) and even prokaryotes (Rhodopseudomonas blastica and Buchnera aphidicola).
  • the round spermatid proteins of the invention may therefore control cell differentiation in the testes as well as in the bone, and may constitute a hitherto unrecognized family of factors involved in cell differentiation (including the differentiation of bone cells such as osteoclast progenitor cells).
  • family is used herein to indicate a group of proteins or genes which share substantial sequence similarities, either at the level of the primary sequence of the proteins themselves, or at the level of the DNA encoding them.
  • sequence similarities may extend over the entire protein/gene, or may be limited to particular regions or domains. Similarities may be based on nucleotide/amino acid sequence identity as well as similarity (for example, those skilled in the art recognize certain amino acids as similar, and identify substitutions of similar amino acids as conservative changes).
  • Some members of a protein family may be related in the sense that they share a common evolutionary ancestry, and such related proteins are herein referred to as "homologues".
  • the members of a protein family may not necessarily share the same biochemical properties or biological functions, though their similarities are often reflected in common functional features (such as effector binding sites and substrates).
  • the criteria by which such families are recognized are well-known in the art, and include computer analysis of large collections of sequences at the level of DNA and protein as well as biochemical techniques such as hybridization analysis.
  • the bisphosphonate binding domain of the binding proteins of the invention may be characterized by the amino acid sequence motifs RGD and or SEQ. ID. NO. 34. While other binding domains may occur, preferably, the bisphosphonate binding protein of the invention comprises the amino acid sequence RGD and/or SEQ. ID. NO. 34. Other binding proteins of the invention may have different binding domains characterized by different amino acid sequence motifs.
  • the binding protein of the invention may bind pyridoxal phosphate, O-phosphorylethanolamine, O- phosphorylcholine, phosphatidyl ethanolamine or phospholipid bisphosphonate analogues, and or comprise the C-terminal region of human DPI (hDPl) and/or the amino acid sequence RGD and/or the amino acid sequence SEQ. ID. NO. 34 and/or a pyridoxal phosphate-binding domain/pseudodomain and/or a phosphatidyl ethanolamine-binding domain/pseudodomain, and/or be a Dictyostelium discoideum bisphosphonate binding protein (or homologue thereof).
  • Dictyostelium discoideum bisphosphonate binding proteins are particularly preferred because the present inventors have found that this organism expresses bisphosphonate binding proteins while being relatively easy to grow in large quantities.
  • Some Dictyostelium discoideum bisphosphonate binding proteins have homologues in other organisms (for example mammalian, e.g. human cells), and such homologues are also covered by the invention.
  • fragments of the protein of the invention which comprise the bisphosphonate binding domain, for example, comprising the amino acid sequence RGD and/or SEQ. ID. NO. 34 the C-terminal region of human DPI (hDPl), a pyridoxal phosphate-binding domain/pseudodomain and/or a phosphatidyl ethanolamine-binding domain/pseudodomain, or any fragment substantially retains bisphosphonate binding activity.
  • proteodomain is used herein to indicate that the domain is structurally and functionally related to its correlate domain, but binds a different (although usually structurally related) ligand.
  • such fragments consist essentially of any bisphosphonate binding domain in the protein, preferably the bisphosphonate binding domain described above.
  • the fragments may be fused to other peptide sequences preferably having, for example, enzyme activity or to antibodies (such as monoclonal antibodies or antibody fragments).
  • the bisphosphonate binding domain may be identified by any of a variety of methods known to those skilled in the art such as sequence analysis, protection analysis, affinity labeling (e.g. photoaiTinity labeling) and/or the generation of a collection of fragments via e.g. protease treatment. Such techniques are known in the art
  • One particularly preferred bisphosphonate binding protein is DPI (SEQ. ID NO. 3), or homologues, fragments, muteins. equivalents or derivatives (e.g. fusion derivatives or synthetic peptides) there t v.h ⁇ ch substantially retain bisphosphonate binding activity.
  • Another particularly preferred bisphosphonate binding protein is hDPl (SEQ. ID NO. 9), or homologues, fragments, muteins, equivalents or derivatives (e.g. fusion peptide derivatives or synthetic peptides) thereof which substantially retain bisphosphonate binding activity.
  • Yet another particularly preferred bisphosphonate binding protein is a second Dictyostelium discoideum cyclophilin herein designated DdCyP2 (SEQ. ID NO. 13), or homologues (for example, human cyclophilin B), fragments, muteins, equivalents or derivatives (e.g. fusion derivatives or synthetic peptides) thereof which substantially retain bisphosphonate binding activity.
  • DdCyP2 Dictyostelium discoideum cyclophilin
  • homologues for example, human cyclophilin B
  • fragments for example, human cyclophilin B
  • muteins for example, human cyclophilin B
  • derivatives e.g. fusion derivatives or synthetic peptides
  • homologue defines proteins which are related in the evolutionary sense to the proteins of the invention, and may for example define the equivalent protein from a different organism.
  • mutant is used herein to define proteins that are mutant forms of the proteins of the invention, i.e. proteins in which amino acids have been added, deleted or substituted.
  • equivalent as used herein and applied to the proteins of the invention defines proteins which exhibit substantially the same functions as those of the proteins of the invention while differing in structure (i.e. amino-acid sequence). Such equivalents may be generated for example by identifying sequences of functional importance, selecting an amino acid sequence on that basis and then synthesizing a peptide based on the selected amino acid sequence. Such synthesis can be achieved by any of many different methods known in the art, including solid phase peptide synthesis and the assembly (and subsequent cloning) of oligonucleotides.
  • derivatives as applied herein to the binding proteins of the invention is used to define proteins which are modified versions of the binding proteins of the invention.
  • derivatives may include fusion proteins, in which the proteins of the invention have been fused to one or more different proteins or peptides (for example an antibody or a protein domain conferring a biochemical activity, such as enzymic or conjugative activity, to act as a label, or to facilitate purification).
  • the homologues, fragments, muteins, equivalents or derivatives of the proteins of the invention may cross-react with antibodies to the proteins of the invention, and in particular may cross-react with antibodies directed against any or all of the proteins DPI, hDPl or DdCyP2.
  • the invention contemplates a method for producing a bisphosphonate binding protein.
  • a method for producing a bisphosphonate binding protein comprises the steps of: (a) linking bisphosphonate to a chromatography column to produce an affinin column; (b) loading the binding protein (for example in the form of a cell extract ) onto the affinity column such that it becomes bound thereto; and (c) selectively eluting the binding protein from the affinity column.
  • Methods of preparation of affinity chromatography materials are known in the art.
  • the step of selectively eluting the binding protein need not result in the elution of the binding protein alone in completely pure form, but merely results in a net purification (in most circumstances, a substantial net purification) of the binding protein.
  • Selective elution as applied in the method of the invention is a process by which the binding protein is more or less specifically eluted from the column, accompanied by varying amounts of contaminating proteins and/or other (macro)molecules.
  • the selective elution results in the binding protein eluting as the (or a) major protein component.
  • the elution conditions are such that the binding protein is washed off the column substantially free of all other proteins.
  • the affinity chromatography material preferably comprises a bisphosphonate linked to the column.
  • the bisphosphonate preferably comprises one or more bisphosphonate species covalently linked to the column, for example via a primary amine or sulphydryl group.
  • any linkage may be used, so long as the linked bisphosphonate can function as a ligand in the process of affinity chromatography.
  • the cell extract may be a cytosolic, nuclear or membrane fraction, and can be prepared by any of a large number of techniques known to those skilled in the art. Such techniques include sonication, enzymatic lysis, centrifugation, precipitation, osmotic rupture or mechanical rupture.
  • the cell extract is preferably a cytosolic extract, but may optionally include other cellular fractions.
  • the cell from which the extract is prepared is one which is sensitive to bisphosphonate.
  • examples of such cells include mammalian or amoebal (e.g. Entamoeba spp. or Dictyostelium spp.) cells. Particularly preferred are cells of Dictyostelium discoideum.
  • Cell lines for example mammalian, e.g. human
  • expressing or overexpressing the bisphosphonate binding protein of the present invention are also useful as a source of binding protein for purification.
  • the binding protein may be selectively eluted from the column by loading an excess of unlinked bisphosphonate onto the column.
  • the unlinked bisphosphonate may be different from the bisphosphonate linked to the column, and may for example be an osteoactive bisphosphonate.
  • the unlinked bisphosphonate for elution is preferably selected from AHBuBP, 3PHEBP, a monophosphonate analogue of 3PHEBP or PLP.
  • the bisphosphonate linked to the chromatography column in step (a) is preferably an osteoactive bisphosphonate. Particularly preferred are AHBuBP (Alendronate) and AHPrBP (Pamidronate).
  • the invention contemplates a method for producing a bisphosphonate binding protein, comprising the steps of: (a) providing a bisphosphonate-resistant Dictyostelium mutant (e.g. a Dictyostelium discoideum mutant) bearing a mutated bisphosphonate binding protein gene; (b) cloning the wild-type gene corresponding to that mutated in step (a) to produce a cloned gene encoding a bisphosphonate-binding protein, and either: (i) expressing the cloned gene encoding a bisphosphonate binding protein to produce the bisphosphonate binding protein, or (ii) preparing a probe which is selectively hybridizable to the cloned gene and identifying a further gene on the basis of its selective hybridization with the probe and expressing said further gene.
  • a bisphosphonate-resistant Dictyostelium mutant e.g. a Dictyostelium discoideum
  • this gene may be a heterologous gene, i.e. a corresponding gene from a different biological source, a mutant thereof, etc..
  • the mutant for use in the above-described method is conveniently provided by the step of continuously culturing wild- type Dictyostelium discoideum Ax-2 amoebae in the presence of bisphosphonate (e.g. AHBuBP), the bisphosphonate being present at a concentration sufficient to substantially prevent growth but not sufficient to cause immediate lysis (for example, at a concentration of from 50-100 uM for AHBuBP).
  • bisphosphonate e.g. AHBuBP
  • the invention as described above does not rely only on the generation of spontaneous mutants, and any of a wide variety of known mutagenesis techniques could also be used (for example those involving the treatment of the amoebae with various mutagens).
  • the invention also embraces bisphosphonate binding protein obtainable by the methods of the invention, as well as bisphosphonate binding proteins which have been obtained by the various methods of the invention.
  • the invention relates to isolated DNA encoding the bisphosphonate binding protein (or derivatives, fragments, and the like) thereof.
  • the term "isolated” indicates that the DNA is substantially isolated with respect to the complex cellular milieu in which it naturally occurs, or simply present in a different nucleic acid sequence context from that in which it occurs in nature (for example, when cloned or in the form of a restriction fragment).
  • the DNA of the invention may be isolated in the sense used herein, yet present in any of a wide variety of vectors and in any of a wide variety of host cells (or other milieu, such as buffers, viruses or cellular extracts).
  • the DNA of the present invention embraces DNA having any sequence so long as it encodes the bisphosphonate binding protein, fragment, mutein, etc. of the invention.
  • any particular amino acid sequence may be encoded by many different DNA sequences. Determining whether a sequence encodes the above material is well within the scope of the skilled artisan as is the designing of the DNA or RNA sequence given the amino acid sequence
  • the isolated DNA of the invention has the sequence shown in Fig. 3 (SEQ. ID NO. 1), Fig. 13 (SEQ. ID NO. 9) or Fig. 21 (SEQ. ID NO. 12).
  • the invention also contemplates a recombinant expression vector comprising the DNA of the invention.
  • the nature of the vector is not critical to the invention, and any vector may be used, including plasmid, virus, bacteriophage, transposon, minichromosome, liposome or mechanical carrier.
  • recombinant expression vector refers to a DNA construct used to express DNA which encodes a desired protein and which includes a transcriptional subunit comprising an assembly of 1) genetic elements having a regulatory role in gene expression, for example, promoters and/or enhancers, 2) a structural or coding sequence which is transcribed into mRNA and translated into protein, and 3) appropriate transcription and translation, initiation and termination sequences.
  • recombinant expression vectors of the present invention can be constructed.
  • Possible vectors for use in the present invention include, but are not limited to: for mammalian cells, pJT4 (discussed further below), pcDNA-1 (Invitrogen, San Diego, CA) and pSV-SPORT 1 (Gibco-BRL, Gaithersburg, MD); for insect cells, pBlueBac III or pBlueBacHis baculovirus vectors (Invitrogen, San Diego, CA); and for bacterial cells, pET-3 (Novagen, Madison, WI).
  • the DNA sequence coding for the bisphosphonate binding protein of the invention can be present in the vector operably linked to regulatory elements.
  • the vector may preferably comprise an expression element or elements operably linked to the DNA of the invention to provide for expression thereof at suitable levels.
  • an expression element or elements may for example be selected from promoters, enhancers, ribosome binding sites, operators and activating sequences.
  • Such expression elements may be regulatable, for example inducible e.g., via the addition of an inducer.
  • "operably linked” refers to a condition in which portions of a linear DNA sequence are capable of influencing the activity of other portions of the same linear DNA sequence.
  • DNA for a signal peptide is operably linked to DNA for a polypeptide if it is expressed as a precursor which participates in the secretion of the polypeptide; a promoter is operably linked to a coding sequence if it controls the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit translation.
  • operably linked means contiguous and, in the case of secretory leaders, contiguous in reading frame.
  • the vector of the invention can also be a viral vector, being for example based on simian virus 40, adenoviruses (e.g. human adenoviruses), retroviruses, and papillomavirus.
  • adenoviruses e.g. human adenoviruses
  • retroviruses e.g. human adenoviruses
  • papillomavirus e.g. human adenoviruses
  • the vector may further comprise a positive selectable marker and/or a negative selectable marker.
  • a positive selectable marker facilitates the selection and/or identification of cells containing the vector.
  • a host cell comprising the vector of the invention.
  • Any suitable host cell may be used, including prokaryotic host cells (such as Escherichia coli and Bacillus subtilis).
  • eukaryotic host cells including mammalian cells, amoebal cells and yeast cells).
  • Host cells may be stably transfected or transiently transfected within a recombinant expression plasmid or infected by a recombinant virus vector.
  • Other host cells include permanent cell lines derived from insects such as Sf-9 and Sf-21. and permanent mammalian cell lines such as Chinese hamster ovary (CHO) and SV40-transformed African green monkey kidney cells (COS).
  • the invention also embraces a method for producing a bisphosphonate binding protein comprising the steps of: (a) culturing the host cell of the invention such that the bisphosphonate binding protein is expressed, and (b) isolating the bisphosphonate binding protein expressed in step (a).
  • Such recombinantly-produced bisphosphonate binding protein (recombinant bisphosphonate binding protein) can be produced relatively inexpensively in large quantities, and can be relatively easily pun tied, using the method of the invention.
  • the invention also contemplates a method for producing a bisphosphonate binding protein comprising the steps of: (a) probing a gene library with a nucleic acid probe which is sclec ely hybridizable with the DNA of the invention (for example having a sequence vvhich is comprised in a gene corresponding to any of the sequences shown in Fig. 3 (SEQ. ID NO. 1), Fig. 13 (SEQ. ID NO. 9) or Fig. 21 (SEQ. ID ⁇ ) 1 2 >. to produce a signal which identifies a gene that selectively hybridizes to the probe, and (b) expressing the gene identified in step (a) (for example by cloning into a host cell) to produce bisphosphonate binding protein.
  • a nucleic acid probe which is sclec ely hybridizable with the DNA of the invention (for example having a sequence vvhich is comprised in a gene corresponding to any of the sequences shown in Fig. 3
  • the term "selectively hybridizable" indicates that the sequence of the probe is such that binding to a unique (or small class) of target sequences can be obtained under more or less stringent hybridization conditions.
  • This method of the invention is not dependent on any particular hybridization conditions, which can readily be determined by the skilled worker (e.g. by routine methods or on the basis of thermodynamic considerations).
  • the invention also embraces a bisphosphonate binding protein obtainable by the above-described methods and other methods.
  • the invention also relates to a nucleic acid probe which is selectively hybridizable with the DNA of the invention.
  • selectively hybridizable indicates that the sequence of the probe is such that binding to a unique (or small class) of target sequences can be obtained under more or less stringent hybridization conditions.
  • the nucleic acid probe is selectively hybridizable with (for example having a sequence which is comprised in) a gene corresponding to any of the sequences shown in Fig. 3 (SEQ. ID NO. 1), Fig. 13 (SEQ: ID NO. 9) or Fig. 21 (SEQ. ID NO. 12).
  • the protein of the invention may be labeled, for example with a fluorescent label, an antibody, a radioisotope or an enzyme.
  • a fluorescent label for example with an antibody, a radioisotope or an enzyme.
  • Such labeled proteins may be particularly suitable for use in the test kits of the invention.
  • the invention also covers various uses of the bisphosphonate binding proteins of the invention.
  • the invention contemplates a method for screening for therapeutically active bisphosphonates, new and potentially active bisphosphonates and their analogues, as well as entirely new classes, not broadly defined as bisphosphonates, comprising the steps of: (a) contacting a bisphosphonate with the bisphosphonate binding protein of the invention, and (b) determining whether binding between the bisphosphonate and binding protein occurs, wherein binding is indicative of a therapeutically active bisphosphonate.
  • the method described above is useful for screening large numbers of bisphosphonates and bisphosphonate like compounds for therapeutic activity. It may also be used to classify or differentiate potential osteoactive or anti-arthritic bisphosphonates according to their mode of action (e.g., by their cellular targets). In fact, it may be used in ranking efficacy of the compounds to be screened. Preferably, the method is employed in high throughput screening.
  • Compounds identified by the method of the invention can be further modified or used directly as therapeutic compounds to activate or inhibit the natural functions of the binding protein encoded by the isolated DNA molecule, for example in the treatment of osteoporosis.
  • the invention also provides a method for evaluating the therapeutic activity of a bisphosphonate comprising the steps of: (a) contacting the bisphosphonate with a bisphosphonate binding protein of the invention, and (b) measuring the binding affinity of the bisphosphonate binding protein for the bisphosphonate.
  • the method described above is useful for ranking the therapeutic activity of potential bisphosphonate drugs, bisphosphonate analogs, and "bisphosphonate mimetics" (acting in a similar manner to a bisphosphonate) drugs.
  • the invention provides a method for synthesizing a therapeutically active bisphosphonate comprising the steps of: (a) generating a three-dimensional model of the bisphosphonate binding site of the bisphosphonate binding protein of the invention, and (b) modeling the therapeutically active bisphosphonate with reference to the three-dimensional model generated in step (a).
  • the three-dimensional model is generated by computer analysis of the amino-acid sequence of all or a portion of the bisphosphonate binding protein (for example the bisphosphonate binding domain).
  • the three-dimensional model could be generated by X ray crystallography or by NMR of the binding protein (or fragments/derivatives thereof).
  • the invention also embraces a therapeutically active bisphosphonate which has been screened, evaluated or synthesized by the methods described above.
  • test kits comprising the bisphosphonate binding protein of the invention.
  • kits are useful, for example, in the screening and evaluating methods of the invention.
  • the bisphosphonate binding proteins comprised in the kits of the invention are preferably bound to a solid support and may conveniently include a labeled (e.g. radioactively-labeled, fluorescently labeled, enzymatically labeled) bisphosphonate (for example for use in competitive binding assays or in displacement assays).
  • a labeled e.g. radioactively-labeled, fluorescently labeled, enzymatically labeled
  • antibodies which bind to the binding protein of the invention are also contemplated by the invention.
  • Such antibodies can be prepared by employing standard techniques well known to those skilled in the art, using any of the bisphosphonate binding proteins of the invention as antigens for antibody production. These antibodies can be employed in assays, diagnostic applications, therapeutic applications, and the like. Preferably, and particularly for therapeutic applications, the antibodies are monoclonal antibodies.
  • the antibodies of the invention may advantageously bind specifically to the bisphosphonate binding proteins of the invention.
  • Antibodies specific for the bisphosphonate binding site may act as bisphosphonate mimetics. Specific binding may be exploited in imaging techniques, for example to assess the extent to which bisphosphonate targets are available for bisphosphonate action, or to determine the degree of occupancy of bisphosphonate targets in patients undergoing bisphosphonate therapy. They may also be used to identify and isolate new bisphosphonate binding proteins.
  • the invention also contemplates antibody derivatives, including antibody fragments (e.g. Fab fragments), chimaeric antibodies (including humanized antibodies) and antibody derivatives (such as fusion derivatives comprising an antibody-derived variable region and a non-immunoglobulin peptide having for example enzyme or conjugative activity).
  • antibody fragments e.g. Fab fragments
  • chimaeric antibodies including humanized antibodies
  • antibody derivatives such as fusion derivatives comprising an antibody-derived variable region and a non-immunoglobulin peptide having for example enzyme or conjugative activity.
  • the invention also contemplates mimetics (for example, the antibodies of the invention described above) or antagonists of the bisphosphonate binding protein of the invention.
  • the bisphosphonate binding proteins of the invention find utility in a wide range of therapeutic applications, and in another aspect the invention contemplates the bisphosphonate binding protein, antibody thereto, mimetic or antagonist thereof for use in therapy.
  • the bisphosphonate binding proteins, antibodies thereto, mimetics or antagonists thereof can be administered in a clinical setting by intraperitoneal, intramuscular, intravenous, or subcutaneous injection, implant or transdermal modes of administration, and the like.
  • the therapy involves: (i) the regulation of bone metabolism, for example in the treatment of Paget's disease, hypercalcaemia (both tumour-induced and non-tumour induced), bone metastases and osteoporosis, or (ii) the regulation of sperm maturation, for example for contraception or fertility treatment, or ( iii) the regulation of bone metabolism via interaction with cyclosporin A, and related compounds that bind to immunophilin proteins.
  • Knowledge of the hDP 1 gene sequence disclosed herein and its expression may be useful for understanding the spermatogenic process.
  • hDPl appears to be one of the factors from round spermatids that modulate the Sertoli cell function, it provides a basis for the development of novel contraceptives, or agents that affect reproduction in other ways, such as infertility, by affecting sperm maturation.
  • cyclosporins e.g. cyclosporin A
  • related compounds as modulators of bone metabolism, in the treatment of bone metabolism disorders and as adjuncts to bisphosphonate treatments.
  • Example 1 Isolation of DPI . a bisphosphonate-binding discoidin from
  • Two bisphosphonate-affinity columns were prepared.
  • AHBuBP and AHPrBP each have a primary amino group available for the coupling to the matrix.
  • Spectra/Gel MAS Beads (Spectrum, Los Angeles, CA, USA) were used as the matrix to which an aldehyde group is attached through a five atom hydrophilic spacer arm.
  • the bisphosphonates were coupled to the aldehyde via their amino groups in the presence of sodium cyanoborohydride (NaCNBH_ ⁇ ) as described in detail below.
  • a 1 M stock solution of sodium cyanoborohydride (NaCNBH4) is prepared and left at room temperature for 1 to 3 hours until the bubbling subsided before use.
  • the gel is washed on a Buchner funnel with three volumes of coupling buffer (0.1 M phosphate buffer, pH 7.8).
  • AHBuBP or AHPrBP is dissolved to a high concentration (400 mM for AHBuBP and 150 mM for AHPrBP) in the coupling buffer and added to the same volume of the gel.
  • Sodium cyanoborohydride from the stock solution is added to a final concentration of 0.1 M and the suspension is agitated at room temperature overnight.
  • the coupled gel is agitated with a small amount of sodium borohydride (NaBH4) for 2 hours at room temperature in order to reduce any unreacted aldehyde groups.
  • NaBH4 sodium borohydride
  • the gel is then washed with 10 volumes of 0.5 M NaCl, followed by column equilibration buffer (30 mM MOPS, 0.1 mM EDTA and 0.5 mM DTT, pH 7.4). 15 ml of the gel were packed into a glass column and stored in the equilibration buffer containing 0.02% (w/v) sodium azide at 4°C.
  • the column is regenerated by washing with a minimum of 10 volumes of 0.5 M NaCl, 0.1 Tris-HCl, pH 8.5, followed by a minimum of 10 volumes of 0.5 M NaCl, 0.1 M Na acetate. pH 4.5.
  • the column is cleared of any irreversibly bound protein by washing with 5 M urea.
  • the column is stored at 4°C in the equilibration buffer containing 0.02% (w/v) sodium azide. Growth and harvest of Dictyostelium discoideum cells
  • Dictyostelium discoideum amoebae strain Ax-2 were grown as shaken cultures in HL5-glucose medium at 22°C (Watts and Ashworth, Biochem. J. 119: 171-174, 1970). Cells from two 700 ml cultures were harvested at a density of 5-10 x 10 cells/ml and disrupted by sonication in 20 ml 30 mM MOPS, 0.1 mM EDTA, 0.5 mM DTT, pH7.4 at 4°C. Cell debris is removed by ultracentrifugation at 184,000g at 4°C for 2 hours. Purification of bisphQsphQna.e-binding protein DPI
  • the supernatant, containing cytosolic proteins, is then loaded onto a pre- equilibrated affinity column at a rate of 6- 12 ml/hour.
  • the column is washed first with equilibration buffer at a rate of 20 ml/hour until protein, monitored by absorbance at 280 nm, no longer appeared in the eluate.
  • the ionic strength of the eluant is then increased by including 0.1 VI KC 1 in the equilibration buffer in order to wash away further non-specifically bound proteins.
  • specific elution of proteins bound to immobilized bisphosphonate is achieved by eluting with buffer containing 5 mM bisphosphonate.
  • the concentrated samples were examined by SDS- PAGE analysis on 12% polyacrylamide gels with standard molecular weight markers (200kD, 97kD, 68kD, 43kD, 28kD and 18kD) (Gibco, Paisley, Scotland) and stained with Coomassie brilliant blue R-250 (Laemmli, Nature 227, 680-685, 1970).
  • FIG. 1 A typical elution profile, following loading of cell-free extract onto the immobilized bisphosphonate affinity columns is shown in Fig. 1.
  • a cell extract prepared by sonication of Dictyostelium cells in equilibration buffer at 4°C is loaded onto an AHBuBP-affinity column at a rate of 10 ml/hour.
  • Fractions (each represented by a chromatography monitor peak) of non-specifically-bound proteins were eluted with equilibration buffer (peak 1), equilibration buffer after overnight incubation (peak 2) and equilibration buffer containing 0.1 M KC1 (peak 3), respectively (as described above).
  • DPI is eluted when 5 mM AHBuBP dissolved in equilibration buffer is used directly (peak 4), after overnight incubation (peak 5) and after further incubation for a few hours (peak 6).
  • Each peak from the column is lyophilised, desalted by dialysis and concentrated in a Microsep microconcentrator. An aliquot from each fraction is electrophoresed on a 12% polyacrylamide gel in denaturing conditions (Laemmli, Nature 227, 680-685, 1970) and the gel is stained with Coomassie Blue. Elution with equilibration buffer, equilibration buffer after overnight incubation and with buffer containing 0.1 M KC1 eluted proteins which had no affinity for the immobilized ligand (Fig. 2, Lanes 2, 3 and 4 respectively).
  • Lane 1 contains molecular weight standards.
  • DPI binds to both the AHBuBP and the AHPrBP affinity columns very strongly, since repeated overnight incubation of the columns with 5 mM bisphosphonate followed by elution the following day continued to elute DPI over a period of 10 days.
  • DPI is the only protein eluted from the affinity-column by bisphosphonates since it is the only protein that could be detected in a 3, 000-fold concentrate of the bisphosphonate eluate by SDS-PAGE analysis and staining with Coomassie blue. The high purity is later confirmed by matching all of the fragments obtained by peptide sequencing of protein in the bisphosphonate eluate with the predicted amino acid sequence of the DPI cDNA. Not all bisphosphonates elute DPI from the columns.
  • pyrophosphate, inorganic phosphate nor galactose is effective at eluting DPI.
  • PLP pyridoxal 5'-phosphate
  • PLP pyridoxal 5'-phosphate
  • the purified DPI protein is digested with endoprotease Lys-C or endoprotease Asp-N in the presence of urea as denaturing agent to produce a collection of peptides for sequencing. These peptides were separated by HPLC and sequenced by Edman degradation (Edman, Ada Chem. Scand. 4: 283-293, 1950).
  • Endoproteinase Lys-C cleaves at the carboxyl side of a lysine peptide bond and Asp-N cleaves at the amino side of an Asparagine peptide bond.
  • Lower case letters indicate a tentative sequence assignment.
  • "X” indicates that no definite assignment could be made at that position.
  • Bold- typed regions are those used for general” > oligonucleondes XJ-1 (peptide 5) and XJ-2 (peptide 6) in the chart.
  • Example 2 Cloning and sequencing of DPI
  • RNA is isolated from approximately 10 Dictyostelium amoebae using a Quick-Micro mRNA purification kit (Pharmacia, Piscataway NJ, USA).
  • a Quick-Micro mRNA purification kit Pharmacia, Piscataway NJ, USA.
  • 2.5 ug mRNA were precipitated with potassium acetate and ethanol, collected by centrifugation at 14,000 g for 5 minutes at 4°C, and resuspended in 50 ul DEPC-treated water.
  • 12.5 ng oligo(dT) ⁇ 7-adapter primer 12.5 ng oligo(dT) ⁇ 7-adapter primer
  • GACTCGAGTCGACATCGA(dT)i7 (SEQ. ID NO. 23), were added and the solution heated to 70°C for 10 minutes followed by quick chilling on ice for 3 minutes.
  • 5 x reaction buffer 250 mM Tris-HCl, pH8.3, 0.375 mM KC1, 15 mM MgC ), 80 units RNasin (Promega, Madison WI, USA), 1.25 mM dNTP and 0.2 mM DTT were added and the mixture is prewarmed at 37°C for
  • oligonucleotide primers XJ-1 and XJ-2 were used to design the degenerate oligonucleotide primers XJ-1 and XJ-2. Alignment of the peptides from DPI with the sequence of discoidin I suggested that peptide 5 would be closer than peptide 6 to the N-terminal end of DPI . Hence XJ-1 is expected to be 5' of XJ-2 in the mRNA sequence.
  • the cDNA coding for DPI is amplified by 3'-RACE (Frohman et al. Proc. Natl. Acad. Sci. USA 85, 8998-9002, 1988) using degenerate oligonucleotide primers:
  • PCR products were diluted 1 : 1 ,000 with distilled water and used as the template for the second-round PCR reaction with the XJ-2 and XJ-ADA primers. Reaction conditions were identical to those used in the first round. PCR products were size fractionated on a 1% agarose gel, visualized and isolated (Sambrook J., Fritsch E. F., and Maniatis T.
  • the primer that is used for reverse transcription of mRNA consisted of an oligo(dT)i7 region to anneal to the Poly(A) tail of the mRNA and an adapter region of 18 nucleotides (Froh an et al., Proc. Natl. Acad. Sci. USA 85, 8998-9002, 1988). Inco ⁇ oration of the adapter sequence into the first-strand cDNA allowed use of the primer XJ-ADA for the amplification of the 3' region of the DPI cDNA using PCR.
  • PCR is performed using XJ-1 and XJ-ADA as 5' and 3' primers respectively to amplify the first-strand cDNA.
  • Two DNA fragments, with approximate sizes of 600bp and 500bp were detected on subsequent agarose gel electrophoresis. Sequencing of these two fragments showed that neither of them encoded any of the peptide sequences obtained from DPI.
  • primers XJ-2 and XJ-ADA were used to amplify the first-strand cDNA, one band of about 400bp is observed on an agarose gel. Sequencing of this band indicated that this cDNA is a fragment from the 3' end of discoidin IA cDNA.
  • primer XJ-2 is derived from a peptide sequence for which there is a homologous sequence in discoidin I.
  • primers XJ-1 and XJ-ADA were further amplified using primers XJ-2 and XJ-ADA, a DNA fragment of 450bp is obtained. Cloning and sequencing of this fragment showed that its corresponding amino acid sequence contained the sequences of peptides 3, 6, 7 and 8 of DPI (Fig. 3 (SEQ. ID NO. 2) and Fig. 4).
  • This DNA fragment designated XJ-450, is used as a probe for the subsequent cDNA library screening and northern and Southern blot analyses. Preparation and screening of cDNA libraries
  • RNA isolated from the wild-type Ax-2 strain of Dictyostelium is used as the template for cDNA synthesis for construction of libraries in lambda ZAP bacteriophage (Stratagene, San Diego, CA, USA).
  • the libraries were screened with the PCR amplified cDNA fragment (XJ-450) of DPI isolated from the pCRII vector. Hybridisations were performed at 42°C for 24 to 48 h in 5 x SSPE - 1 x Denhardt's solution - 100 ug salmon testis DNA ml- 50% formamide. Membranes were washed in 0.25 x SSPE, 0.5% SDS at 55°C. Rescreening of positive plaques yielded full-length cDNAs for DPI from the wild- type Ax-2 library.
  • RNA isolated from Dictyostelium amoebae is size fractionated on a denaturing formaldehyde agarose gel, transferred onto Hybond-N membrane (Amersham, Bucks, UK), hybridized to the appropriate cDNA fragment and washed as described above.
  • Hybond-N membrane Amersham, Bucks, UK
  • the northern blots were probed with a fragment of Dd-tcpl, a Dictyostelium homologue of the tcpl family of molecular chaperones.
  • Genomic DNA is isolated from the wild-type Ax-2 strain by using a modification of the methods described by Nellen et al. (Molecular Biology in Dictyostelium: Tools and Applications. In: Spudich JA ed: Methods in Cell Biology Vol. 28: 67-100. Academic Press, 1987) and Amau (Laboratory Products Technology I September 1993:20.).
  • Insoluble material is removed by centrifugation at lOOOOg for 2 min. at room temperature and the DNA in the supernatant is purified (Magic Miniprep DNA Purification System, Promega, Madison, WI, USA). Purified DNA is quantified by absorbance at OD ⁇ O and 10 ⁇ g samples were digested overnight with the required restriction enzymes. The DNA is then size fractionated on a 1% agarose gel. transferred to Hybond-N membrane and hybridized and washed as described aho . e ⁇ Southern blot of genomic DNA prepared by Clontech (Palo Alto.
  • C.V I S ⁇ ⁇ is hybridized with the XJ-450 DPI DNA probe using low stringency for b ⁇ d ⁇ /jt ⁇ on (37°C) and washing (3 x SSPE, 0.5% SDS).
  • Northern analysis of Poly(A+) RNA isolated from amoebae that had been grown either axenically to a density of about 5 x 10 cells/ml or to a lower density of 1 x 10 cells/ml, or with a bacterial substrate is shown in Fig. 5a.
  • the probe hybridized to a mRNA transcript of about 1.2 kb from axenic Ax-2 amoebae harvested at a density of about 5 x 10 cells/ml.
  • DPI mRNA is also expressed in axenic wild-type amoebae harvested at a lower density of 1 x 10 cells/ml (Fig. 5b).
  • a PCR fragment of discoidin IA is labeled with 3? P-dCTP and hybridized to the same northern blots.
  • discoidin I is also expressed poorly in bacterially- grown wild-type Ax-2 amoebae. (Fig. 5a).
  • discoidin I is encoded by a family of genes (Rowekamp et al., Cell 20: 495-505, 1980; Poole et al., J. Mol. Biol. 153:273- 289, 1981) but suggests that there is only a single gene for DPI (discoidin II). Structural features of DPI
  • the predicted amino acid sequence suggests that DPI is a neutral protein with an estimated pi of 6.77. Its calculated molecular mass is 28,573Da which is slightly larger than that of discoidin IA (28,258Da) or discoidin IC (28,391 Da) (Poole et al., J. Mol. Biol. 153: 273-289. 1981 ).
  • the amino acid sequence predicted from the full-length cDNA includes all nine peptide sequences obtained by enzymatic digestion of DPI.
  • the amino acid and cDNA sequences of DPI were later found to be identical to those of discoidin II. The homology between discoidin II and I is significant, with 49% identify in their predicted amino acid sequences (Fig.
  • DPI discoidin II
  • V human coagulation factors
  • VIII Wiod et al., Nature 312:330-337, 1984.
  • milk fat globule protein Larocca et al, Cancer Res. 51 : 4994-4998, 1991
  • open reading frame 7 linked to the Rhodopseudomonas blastica atp operon
  • DPI is mainly composed of b-pleated sheets (Fig. 8).
  • An Arg-Gly-Asp (RGD) motif occurs at residues 81-83 and is also present in discoidin I.
  • the amino acids flanking the RGD motif are also highly conserved between discoidin I. and DPI . Owing to its hydrophilicity, the RGD motif probably lies on the protein surface (Fig. 10). The motif may have the same role in DPI and in discoidin I.
  • Fig. 3 SEQ. ID NO. 1
  • Fig. 3 SEQ. ID NO. 1
  • Suitable vectors include pCD (Okayama et al.. Mol. Cell. Biol. 2: 161-170 (1982) and pJL3/4 (Gough elaL EMBO J.. 4: 645-653 (1985).
  • the resultant expression vectors are then transformed into appropriate host cells, which thereby express the DPI protein.
  • Fig. 3 SEQ. ID NO. 2
  • Fig. 3 SEQ. ID NO. 2
  • it could be manipulated to introduce bacterial expression signals, and the resultant recombinant DNA could then be introduced into bacterial host cells (for example Escherichia coli) for expression therein.
  • Plasmid DNA from the pMT2 subclone is then transfected into monkey COS-1 cells by the DEAE-dextran procedure (Sompayrac and Danna (1981) Proc. Natl. Acad. Sci. U.S.A., 78: 7575-7578; Luthman and Magnusson (1983), Nucl. Acids Res. 1 1 : 1295-1308). Serum-free 24 hour conditioned medium is then collected from the cells starting 40 to 70 hours post-transfection.
  • Example 4 Isolation of hDPl, a human bisphosphonate binding protein
  • AHBuBP tri-sodium salts of AHBuBP (Alendronate) is from GENTILI Materials S.P.A., Pisa, Italy. Human multiple tissue blots and a zoo-blot were purchased from Clontech. Human poly(A ) RNA from testis, HeLa cells and HL60 cells is purchased from Clontech. A cDNA library of human testis is custom-made in lambda ZAP bacteriophage by Stratagene (La Jolla, CA, USA) using poly(A ) RNA purchased from Clontech (Palo Alto, CA, USA). The human leukocyte concentrate is obtained from Sheffield Blood Transfusion Services (Longley Lane, Sheffield, UK). All other chemicals were purchased from Sigma Chemical Co., Poole, UK. Construction of a bisphosphonate affinity column
  • a bisphosphonate-affinity column is constructed as described previously in Example 1. Preparation of human leukocytes
  • the supernatant containing the cytosolic proteins is applied to the pre- equilibrated AHBuBP-affinity column at a rate of 6-12 ml/hour.
  • the column is first eluted with equilibration buffer at a rate of 20 ml/hour until protein no longer appeared in the eluate.
  • the ionic strength of the eluant is then increased by including 0.1 M KC1 to elute further non-specifically bound proteins until protein no longer appeared in the eluate.
  • buffer containing 5 mM AHBuBP specifically eluted proteins bound to the immobilized AHBuBP on the column.
  • N-terminal amino acid sequencing of hDPl (SEQ. ID NO. 13) is performed on a concentrated solution of the AHBuBP eluate and also on the band of hDP 1 (SEQ. ID NO. 13) transferred onto PVDF membrane by protein blotting following SDS-PAGE. Sequence is determined by Edman degradation (Edman, Ada Chem. Scand. 4: 283-293, 1950) both directly from sample solution and from the (PVDF) membrane containing blotted protein. Internal peptides were generated by endoprotease Lys-C digestion, purified by HPLC, and sequenced.
  • YXIGEPTVPsTLAEeFtYNpF (SEQ. ID NO. 26) [peptide 11] - where X indicates that no definite assignment is made for that residue, and a lower case letter indicates a tentative assignment for that residue.
  • N-terminal amino acid sequence (peptide 10) (SEQ. ID NO. 25) is highly homologous to the N-terminal sequence of a rat spermatid (SEQ. ID NO. 8) 29,000 MW protein ( Figure 11). No protein appeared to contain sequences having significant homology with peptide 1 1 (SEQ. ID NO. 26).
  • poly(A ) RNA of human testis is used to prepare total cDNA that contained the hDPl cDNA.
  • oligonucleotides containing 17-22 nucleotides were synthesized according to the peptide sequences of hDPl for the subsequent PCR to amplify the corresponding hDPl cDNA (Table 2).
  • Synthesis of the first strand cDNA from human testis poly(A+) mRNA is carried out using a 3' RACE kit (Gibco, Paisley, UK) following the manufacturer's instructions.
  • AP-oligo(dT) is used as the primer since this introduced the sequence of the universal amplification primer (UAP) into the cDNA sequence to facilitate the subsequent 3' RACE procedure (Frohman et al., Proc. Natl. Acad. Sci. USA 85, 8998-9002, 1988).
  • hDPl SEQ. ID NO. 15
  • cDNA The strategy for the amplification of hDPl (SEQ. ID NO. 15) cDNA is shown in Fig. 12.
  • the PCR is performed in a volume of 50 ⁇ l in the presence of AmpliTaq reaction buffer. 1.5 mM MgCl2, 0.2 mM dNTP, diluted cDNA template, 2.5 units of AmpliTaq DNA polymerase (Perkin Elmer) and 0.2 ⁇ M of each primer DJ-1 (SEQ. ID NO. 27) and UAP (SEQ. ID NO. 32) (Table 2).
  • the mixture is subjected to 40 cycles of amplification in a Perkin Elmer 9600 PCR machine.
  • Second-round PCR is carried out by using the diluted reactions of the first-round as template and different combinations of internal primers, i.e. DJ-1 (SEQ. ID NO. 27) plus DJ-10 (SEQ. ID NO. 30), DJ-5 (SEQ. ID NO. 28) plus DJ- 10, DJ-9 (SEQ. ID NO. 29) plus UAP (SEQ. ID NO. 32) (Table 2).
  • hA is used a.s the template for PCR amplification with primers DJ-5 and DJ-10, a fragment ⁇ hC ) of 570 bp is obtained. All these sequences were found within the full length sequence (SEQ. ID NO. 9). All these three fragments were from specific amplifications since leaving out either primer of a pair, or the template, eliminated the corresponding product.
  • hC is used as the probe for screening the cDNA library and for the subsequent northern and Southern blotting. cDNA library screening and sequence of the full-length of hDPl cDNA
  • 25 ng of a hDPl PCR fragment is labeled with 32 P-dCTP using random primers (Feinberg and Vogelstein, Anal. Biochem. 132: 6-13, 1983) (Prime-It II kit, Stratagene) and is used as a probe to screen a human testis cDNA library in lambda ZAP bacteriophage (Short et al., Nucleic Acid Res. 16: 7853-7860, 1988). After three rounds of screening, bacteriophage plaques specifically hybridized to the hDPl probe were selected and the inserts were subcloned into pBluescript SK(-) plasmids by in vivo excision (Short et al., Nucleic Acid Res.
  • Recombinant plasmids were subjected to restriction digestion and amplification using T3 and T7 primers which anneal to sequences on each side of the inserts.
  • Recombinant plasmids which contained a 1.2 kb hDPl cDNA insert, were identified by Southern blotting and sequenced using a Taq DyeDeoxy Terminator Cycle Sequencing kit (ABI) by the method of Sanger et al. (Proc. Natl. Acad. Sci. USA 74: 5463-5467, 1977).
  • hC When hC is used to screen a testis cDNA library constructed in lambda ZAP bacteriophage, three clones were isolated. All appeared to have the same nucleotide sequence. The largest open reading frame consisted of 936 bp and contained six ATG codons. The N-terminal sequence of hDPl corresponded with the deduced amino acid sequence from the cDNA only if the second ATG codon of the cDNA encoded the N-terminal methionine of hDPl . The cDNA nucleotide sequence then indicated that hDPl consisted of 260 amino acids and had a calculated molecular mass of 28,872 and an estimated isoelectric point of 7.0 (Fig. 13). The predicted molecular weight is in good agreement with that of the protein observed on a polyacrylamide gel.
  • hDPl-rP The full-length cDNA contains a 5' untranslated region of 157 nucleotides and a 3'- untranslated region of 265 nucleotides. However, neither a poly(A) tail nor a classical AATAAA polyadenylation signal is apparent in the cDNA towards the ' end, probably owing to restriction digestion beyond the polyadenylation site in construction of the cDNA library.
  • P-dCTP labeled hDPl cDNA fragment is hybridized to human multiple tissue northern blots (Clontech) and to a northern blot containing mRNA isolated from cultured human cells THP1 (a monocytic cell line), MG63 (an osteoblast-like cell line) and primary bone cells.
  • the hDPl probe is also hybridized to a northern blot which contained mRNA from Dictyostelium discoideum and to a zoo- blot which contained genomic DNA from different species (Clontech, Palo Alto, CA. USA).
  • Hybridization is performed at 42°C in the presence of 50% formamide (Sambrook et al. , Molecular Cloning: A laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989). Washing at high stringency is carried out at 42°C using 3 x 100 ml 0.25 x SSPE plus 0.5 % SDS for 3 x 15 min.
  • 2kb DNA fragment of human b-actin (Clontech, Palo Alto, CA, USA) is also labeled with 32 P-dCTP and is used as a positive control for hybridization.
  • the hybridization signal is quantified with a BioRad Phosphoimager.
  • DNA is also prepared from a lambda gtl 1 cDNA library of normal human osteoclast-like multinucleated cells using the Lambda DNA Purification Kit (Stratagene, Cambridge, UK). After being digested by restriction enzymes, the bacteriophage DNA is separated on a 1% agarose gel, transferred to a Hybond-N membrane and hybridized to the radioiabeled hDPl probe.
  • hDPl expression of hDPl in several human tissues is studied by northern analysis with the PCR fragment hC of hDPl cDNA as a probe.
  • the hDPl cDNA fragment hybridized to mRNA from all the human tissues and cell lines studied (Fig. 14).
  • N-terminal amino acid sequence of hDPl shared striking homology with a rat 29 kDa round spermatid protein in which the N-terminal 32 amino acids had been determined by peptide sequencing (Fig. 11) (SEQ. ID NO. 8) (Onoda and Djakiew, Mol. Cell. Endocrinol. 93: 53-61 , 1993).
  • Fig. 11 peptide sequencing
  • SEQ. ID NO. 8 Onoda and Djakiew, Mol. Cell. Endocrinol. 93: 53-61 , 1993.
  • a search through data banks with the complete deduced amino acid sequence of hDPl showed that there is no exact match to any known sequence.
  • hDPl has been suggested by searching the database of protein motifs. These include a site of Ca 2+ -calmodulin dependent kinase II at residues 228-232 (REKTV) near the C-terminus and a part of a zinc- finger at residues 53-61 (LTTHH). Sequence comparison of hDPl and DPI
  • hDPl The protein sequence of hDPl is also compared with that of DPI . No striking homology is observed between the two proteins (with 13% identity and 38% similarity). One region of identity is seen between residues 44-47 (His-Gly-Val- Lys) (SEQ. ID NO. 10) of hDPl and residues 28-31 of DPI (SEQ ID NO. 3). This sequence is hydrophilic and is located at the surface of hDPl protein (Fig. 20).
  • Example 6 Recombinant Expression of hDPl
  • the cDNA inserts of the pBluescript SK(-) subclones described in Example 5 were subcloned into the mammalian cell expression vector pMT2 using appropriate restriction enzymes followed by ligation.
  • the structure of the resultant expression vector is confirmed by restriction mapping on agarose gels.
  • Plasmid DNA from the pMT2 subclone is then transfected into monkey COS-1 cells by the DEAE-dextran procedure (Sompayrac and Danna (1981) PNAS, 78: 7575-7578; Luthman and Magnusson (1983), Nucl. Acids Res. 11 : 1295-1308). Serum-free 24 hr conditioned medium is then collected from the cells starting 40 to 70 hours post-transfection.
  • Example 7 Isolation of DdCyP2. a bisphosphonate binding cyclophilin from
  • Dictyostelium AHPrBP, AHBuBP and CI2MBP were from GENTILI S.P.A., Pisa, Italy. All other bisphosphonates and the monophosphonate (NE 10788) were from Procter and Gamble Pharmaceuticals, Cincinnati, OHIO, USA. PLP is from Sigma (Poole,
  • Dictyostelium discoideum strain Ax-2 is grown in HL-5 glucose medium (2 x 700 ml) on a shaker at 22°C to a density of 2-8 x 10 cells/ml. Cells were harvested and washed once with distilled water and once with equilibration buffer at 4°C. The cell pellet is resuspended in 3-4 volumes of equilibration buffer and disrupted by sonication for 4 x 20 seconds in a MSE Soniprep with an interval of 1 minute on ice between each sonication. The cell lysate is centrifuged at 45,000 ⁇ m for 2.0 hours at 4°C in a Beckman 50.2 Ti rotor. Isolation of the bisphosphonate-binding proteins from Dictyostelium
  • the supernatant, containing the cytosolic proteins is loaded onto the AHBuBP-affinity column, which had been pre-equilibrated with the equilibration buffer, at about 6-12 ml/hour.
  • the column is washed with the equilibration buffer at a rate of 20 ml/hour to wash off unbound proteins and then washed with 0.1 M KC1 in the equilibration buffer to wash off further non- specifically bound proteins.
  • bisphosphonates or PLP at a concentration of 5 mM in equilibration buffer were used to elute any • bisphosphonate-binding proteins.
  • the eluates (about 25 ml) containing the proteins were lyophilised overnight and redissolved in 2-3 ml of 10 mM ammonium formate. They were dialysed against four changes of 2500 ml ammonium formate over 48 hours. The dialysed proteins were further concentrated by centrifugation in Microsep microconcentrators with a molecular weight cut-off of 10 kDa (Filtron, Northborough, MA, USA) at 5,000 g for 1-3 hours. All purification and concentration processes were carried out at 4°C.
  • Proteins in each concentrated fraction were separated by electrophoresis in denaturing conditions on a 12% polyacr lamide gel. Molecular weight markers (Gibco, Paisley, Scotland) were also run on the gel. The gels were stained either with silver or Coomassie blue .
  • PLP eluted two major proteins * uh molecular weights of 18 kDa and 28 kDa respectively.
  • the 18 kDa protein i which is present in approximately the same amount as the 28 kDa protein) is later named DdCyP2 (SEQ. ID NO. 13).
  • the 28 kDa protein is probably DPI .
  • Peptide sequencing and peptide sequence anal is Following electrophoresis, the gel is equilibrated for 5 minutes in the transfer buffer [10 mM CAPS, 10% (v/v) methanol, pH 1 1.0] prior to blotting to remove electrophoresis buffer salts and detergents.
  • the membrane is cut to the dimensions of the gel, wetted with 100% methanol, and then equilibrated with the transfer buffer for 15-30 minutes. Two filter papers (Whatman 3 MM) and fibre pads were completely saturated in the transfer buffer. The gel, membrane and pads were assembled into the holder cassette. The transfer is performed at constant voltage (50 V) for 1 hour at room temperature with the Bio-Ice cooling unit in the buffer chamber.
  • Proteins blotted onto ProBlott membrane were detected by Coomassie brilliant blue staining following the instructions supplied with the membrane.
  • the blotted membrane is rinsed with deionised water and saturated with 100% methanol.
  • the membrane is then stained with 0.1% (w/v) Coomassie blue R-250 in 40% (v/v) methanol, 1% (v/v) acetic acid. Protein bands appeared within one minute.
  • the membrane is destained with 50% (v/v) methanol, and rinsed extensively with deionised water and air dried. The bands of interest were excised for protein sequencing.
  • the N-terminal sequence of DdCyP2 is determined three times directly after blotting onto ProBlott membrane using repeated cycles of Edman degradation.
  • the consensus sequence obtained is: NH2 - GKDPKITNKV FFDE (SEQ ID NO. 33).
  • An AHPrBP-affinity column is also used for isolation of bisphosphonate- binding proteins by the same purification procedure as that used with the AHBuBP- affinity column.
  • AHBuBP and PLP both eluted only the 28 kDa protein off this column
  • No DdCyP2 could be detected in either the PLP fraction or the AHBuBP fraction on a SDS-PAGE gel.
  • DdCyP2 is never eluted from the AHPrBP column, even with 5 M urea. It is probable that DdCyP2 does not bind to the AHPrBP-affinity column. Homology analysis
  • DdCyP2 had strong homology to cyclophilins, especially cyclophilin Bs.
  • the molecular weight of cyclophilins is usually 18-22 kDa, which is very similar to that ofDdCyP2.
  • Example 8 Cloning and sequencing of DdCyP2
  • DdCyP2 (SEQ. ID NO. 12) is cloned and sequenced from a Dictyostelium discoideum strain Ax-2 cDNA library, using similar methods to those described in Example 2 (above). The nucleotide and predicted amino acid sequence are shown in Fig. 21 (SEQ. ID NO.12 and SEQ. ID NO.13).
  • the amino acid sequence of DdCyP2 is compared with that of the other bisphosphonate-binding protein DPI of Dictyostelium. There is little similarity in their amino acid sequences (17% amino acid identity with four gaps), except that they both contain the RGD motif.
  • CyPBs Arg-Gly-Asp
  • CyPA animal cyclophilin Bs
  • CyPBs are a type of CyP occurring in the endoplasmic reticulum (ER).
  • ER endoplasmic reticulum
  • the CyPB family has a N-terminal signal sequence, which directs it to the ER, and a conserved C-terminal undecapeptide extension which is not found in other CyPs.
  • DdCyP2 A comparison of the amino acid sequences of DdCyP2 and the CyPBs from various species showed about 60% identity with one gap introduced into the CyPBs.
  • DdCyP2 On the basis of the "RGD" motif, DdCyP2 is like a CyPB, but it does not have the extra N- and C-terminal sequences typical of CyPBs. DdCyP2 is therefore like a hybrid between CyPA and CyPB; no similar CyPs have been reported in other cells.
  • DdCyP2 The amino acid sequence of DdCyP2 is superimposed on that of hCypA by using the program FRODO (Jones, Method Enzymol. 115: 157-171, 1985). DdCyP2 is probably very similar to human cyclophilin A (hCypA) in its three-dimensional structure. The RGD tripeptide and the seven amino acid insertion in DdCyP2 both appear on the surface of the structure.
  • DdCyP2 Large quantities of DdCyP2 can be obtained using procedures similar to those described above in Example 3. Expression in mammalian cells
  • the cDNA for DdCyP2 described in Example 8 is subcloned into the mammalian cell expression vector pMT2 using appropriate restriction enzymes followed by ligation.
  • the structure of the resultant expression vector is confirmed by restriction mapping on agarose gels.
  • Plasmid DNA from the pMT2 subclone is then transfected into monkey COS-1 cells by the DEAE-dextra" procedure (Sompayrac and Danna (1981) PNAS. 78: pages 7575-7578; Luthman and Magnusson (1983), Nucl. Acids Res. 11: pages 1295-1308). Serum-free 24 hr conditioned medium is then collected from the cells starting 40 to 70 hours post-transfection.
  • hDPl 20-200 ug of a highly pure preparation of hDPl is emulsified with Freund's complete adjuvant (FCA) using two 3 ml Luer-Lock syringes with 18-gauge needles.
  • FCA Freund's complete adjuvant
  • the emulsified immunogen mixture is then injected intradermally into a shaved rabbit using a 22-gauge needle.
  • a total volume of 0.5-1.0 ml is injected into 10-12 sites along the upper sides of the rabbit.
  • Boosters are given after 21 days and 40 days.
  • the rabbit After 40 days, the rabbit is bled and the blood allowed to clot for 2-4 hours at room temperature. The serum is then decanted and centrifuged at 1 OOOg for 10 min. to remove blood cells.
  • Immunoglobulin is then partially purified from the serum by DEAE- Sephacel ion-exchange chromatography.
  • the serum is first dialyzed against phosphate buffered saline, pH 7.4, and then applied to the column.
  • Anti-hDPl IgG antibody is then collected as it flows off the column.
  • the lymph node is placed into Dulbecco's minimal essential media which has been supplemented with 2 mM glutamine, 100 IU/ml penicillin and 100 ug/ml streptomycin.
  • the tissue is minced and the cells dispersed.
  • the cell suspension is allowed to settle for 10 min. on ice and the supernatant then centrifuged (1600 ⁇ m for 6-7 min.). The supernatant is discarded. An equal volume of medium is then added and the cells washed twice. Viable cells are counted using dye exclusion.
  • the cells are then fused with the p3U l myeloma cell line (Yelton et al. (1978) Curr. Top. Microbiol. Immunol. 81. 1 ). using standard techniques. Cells (2ml/well) are then pipetted into cell culture trays and placed in an incubator. Multiple subclonings are then carried out and the hybridomas screened by ELISA.
  • the assay is based on a modification of the Antigen Capture Assay (originally developed by Berson and colleagues - see Berson, S.A., Yalow, R.S., Bauman, A., Rothschild, M.A. and Newerly, K. (1956) Journal of Clinical Investigation 35, pages 170 to 190, and Yalow, R.S. and Berson, S.A. (1959) Nature 184, pages 1648-1649).
  • Anti-bisphosphonate binding protein monoclonal antibody is bound to the bottom of each well of a microtitre plate using standard procedures (approximately 2 ug/ml of antibody in PBS). Any remaining binding sites in the wells are then blocked with 3% BSA in PBS and a partially pure preparation of bisphosphonate binding protein is added to each well to specifically bind the bisphosphonate binding protein to the antibodies in the wells. Each well is then washed to remove excess unbound bisphosphonate binding protein and any other free contaminating proteins.
  • Radioiabeled (eg. ⁇ C or ⁇ H) bisphosphonate is then introduced into the wells.
  • the labeled bisphosphonate binds to the bisphosphonate binding protein which is attached to the antibody-coated wells.
  • the plates are incubated for a time sufficient to allow full binding and the wells are then washed to remove unbound bisphosphonate.
  • the amount of radiolabel in each well is then determined by washing each well with trichloroacetic acid followed by liquid scintillation counting of the resulting fluid.
  • the binding affinity of a range of different bisphosphonates can therefore be determined by comparing the binding characteristics of radioiabeled derivatives in the assay.
  • Anti-bisphosphonate binding protein monoclonal antibody is bound to the bottom of each well of a microtitre plate using standard procedures (approximately 2 ug/ml of antibody in PBS), and the microtitre plate treated as described in Example 11.
  • Example 13 Unleveled bisphosphonate binding assay The bisphosphonate binding protein is bound to a Pharmacia BIAcore 2000 chip using standard procedures. Small volumes of bisphosphonate solutions at various concentrations are then allowed to flow over the chip, and changes in the optical properties of the chip attendant on bisphosphonate binding are detected by the Pharmacia BIAcore apparatus. The results permit calculation of the binding constant for the binding of the bisphosphonate to the bisphosphonate binding protein.
  • Dictyostelium Ax-2 refered to herein, is deposited with the ATCC, (American Type Culture Collection, Rockville, Maryland, USA) prior to submission of this application and bears deposit number 24397, see the ATCC/NIH Repository Catalogue.
  • Figure 1 Affinity chromatography of a cell extract of Dictyostelium discoideum on an AHBuBP-affinity column.
  • the cell extract prepared by sonication of Dictyostelium cells in equilibration buffer at 4°C is loaded onto an AHBuBP-affinity column at a rate of 10 ml/hour. Peaks of non-specifically-bound proteins were eluted with equilibration buffer (1 ). equilibration buffer after overnight incubation (2) and equilibration buffer containing 0.1 M KCl (3). DPI is eluted when 5mM AHBuBp dissolved in equilibration buffer is used directly (4), after overnight incubation (5) and after a further incubation for a few hours (6)
  • Each peak from the column is lyophilised, desalted by dialysis and concentrated in a microsep microconcentrator. An aliquot of each fraction is electrophoresed on a 12% polyacrylamide gel in denaturing conditions (Laemmli, 1970) and the gel is stained with Commassie Blue. Lane 1 , molecular weight standards; Lane 2, elution with equilibration buffer; Lane 3, elution with equilibration buffer after overnight incubation; Lane 4, elution with equilibration buffer containing 0.1 M KCl; lane 5, elution with 5 mM AHBuBP in equilibration buffer.
  • the Nucleotide sequence shown is of the 986-bp cDNA of DPI (Discoidin II) from the wild-type Dictyostelium discoidium Ax-2.
  • the adenine base of the ATG initiation sequence is assigned as 1 in the numbering.
  • Nucleotides are numbered in the right margin and amino acids on the left.
  • An open reading frame of 771-bp encoding 257 amino acids is shown with a single letter code for the translated amino acids.
  • a termination codon (TAA) at the end of the coding sequence is marked with an asterisk
  • a putative signal peptides at the beginning of the amino acid sequence (1-20) is indicated in italics.
  • the nucleotides sequence of the PCR fragment XJ-450 used to screen the cDNA library is underlined. Multiple polyadenylation signal sequences (AAT AAA) are shown in italics.
  • Asterisks indicate positions of identity while dots (.) indicate positions of conservation.
  • Amino acid sequences obtained for peptides of DPI are underlined and the corresponding numbers are shown in (parentheses) underneath the sequences. Regions used for generating primers XJ-1 and XJ-2 are in italics. The alignment is performed by use of the multiple alignment program of CLUSTALV (Higgins et al, 1992).
  • FIG. 1 Southern Blot Analysis of DPI (discoidin II). Genomic DNA from the Ax-2 strain is subjected to digestion and the products were separated on a 1% agarose gel (a). After transfer to Hybond-N membrane, the samples were hybridized to the 32 P-labelled XJ-450 probe (b) or the discoidin IA probe (c). Note that in each lane there is only a single band of hybridization to the DPI (discoidin II) cDNA XJ- 450 whereas there were multiple bands for hybridization to the discoidin IA cDNA probe.
  • Lane identiifcation in (a) is as follows A Ax-2de ⁇ ved DNA, UD. undigested materials, and mw: Molecular weight marker Rest ⁇ ction enzymes used in the numbered lanes are: 1. EcoRI, 2. Bam 1. 3 V ⁇ /I. 4 Accl, and 5 Sau3Al. 0 (a) DPI ⁇ b> DP1
  • FIG. 7 Sequence Comparison Matrices for DPI (discoidin II) (horizontal) with (a) human coagulation factor V, (b) human coagulation factor VIII, (c) ORF7 (Yat7-Rhobl) linked to the Rhodopseudomonas blastica atp operon and (d) milk fat globule (MFG) protein.
  • the Matrices are plotted using MDM ⁇ g mutation dat matrix Pam250 as the scoring system (Schwartz and Dayhoff 1978), with a window size of 8, and a minimum score of 50%.
  • the profile is constructed using the Kyte-Doolittle algorithm with a sliding window of seven amino acids. Values above the zero axis correspond to hydrophilic segments. Secondary structure predictions based on algorithms of both Chou- Fasman (1978) (CH, shadowed boxes) and Robson-Gamier (Gamier et al. 1978) (RG, filled boxes) are shown in the lower half of the figure.
  • Figure 9 Prediction of a probable cleavage site for a putative signal peptide of DPI (discoidin II) by the method of von Heijne (1986) using the program MacProt (Markiewicz, 1991; Luttke, 1990). A window size of 15 residues of weight matrix for eukaryotic protein is used. Generally a protein having a signal petide has one segment scoring greater than +3.5 while cytosolic proteins have scores less than +3.5.
  • FIG. 1 Surface probability plot of the RGD-containing region of DPI (discoidin II).
  • a surface probability profile of residues 76-88 is constructed using the Janin et al. (1978) and Emini et al. (1985) algorithms. A probability value above 0.5 is assigned to a water-accessible, "exposed” sequence and values below 0.5 to buried segments.
  • Sequence ID# 8 Amino terminal amino acid sequence of r ⁇ t round spermatid 29,000 Mr protein (RSP-29)
  • a position where both sequences contained the same amino acid is indicated by an asterisk.
  • a colon indicates a position where the two sequences contain amino acids having similar properties whilst a dot indicates a position where the two sequences contain different amino acids.
  • Figure 12 Schematic representation for amplification of hDPl cDNA by the polymerase cahin reaction.
  • primers DJ1, DJ5, DJ9 (sense), and DJ10 were designed from a knowledge of peptide sequences of hDP 1.
  • the nucleotide sequence of the primer UAP (antisense) had been inco ⁇ orated into the first strand cDNA during the reverse transcription using primer oligo(dT) 17-AP.
  • the nucleotide sequence of hDPl cDNA isolated from a human testis cDNA library consists of 1160 base pairs.
  • the adenine base of the ATG initiation codon is assigned a 1 in the numbering.
  • Nucleotides are numbered in the right margin and amino acids on the left.
  • An open reading frame of 780-bp encoding 260 amino acids is shown with a single letter code for the translated amino acids.
  • a stop codon (TGA) at the terminus of the translation sequence is marked with an asterisk, the amino acid sequences of peptides 1 and 2 are underlined.
  • Figure 14 Northern blot analyses of hDPl in human tissues.
  • Figure 15 Southern analysis of hDPl on a zoo-blot.
  • the southern blot containing 8 ⁇ g og genomic DNA per lane from nine eucaryotic species is obtained from Clontech.
  • the DNA had been digested with EcoRI, run on a 0.7% agarose gel and transferred to a nylon membrane.
  • Hybridization is carried out using a [32p]- labelled PCR fragment of hDPl cDNA.
  • FIG. 16 Northern Blot analyses of hDPl homologues in Dictyostelium discoideum.
  • a [32p]-labelled PCR fragment of hDPl cDNA is hybridised to a northern blot containing 5 ⁇ g mRNA from Dictyostelium amoebae grown on acteria (lane 1), or grown axenically (lane 2).
  • FIG. 18 Sequence comparison of hDPl with aspartate aminotransferase (AAT) from chicken mitochondria. hDPl is shown in the upper line and DPI in the lower one. "
  • hydrophilic profile is constructed using the Kyte-Doolittle algorithm (Kyte and Doolittle, 1982) with a sliding window of seven amino acids. Values above the zero axis correspond to hydrophilic segments Secondary structure prediction based on the algorithms of Chou-Fasman (1978) (CH, shadowed boxes) and Robinson- Gamier (Gamier et al 1978) (RG, filled boxes) are shown in the lower half of the figure.
  • Figure 20 Amino acid sequence comparison of hDPl with DPI (discoidin II). hDPl is shown in the upper line and DPI in the lower one. "
  • Figure 21 Nucleotide sequence of the Wt.3 cDNA insert and the deduced amino acid sequence of Dd CyP2.
  • the nucleotide sequence shown is for the 629 bp cDNA of Dd CyP2 in clone Wt.3. Nucleotides are numbered on the left and amino acids on the right. An open reading frame of 540 bp encoding 180 amino acids is shown using the single letter code for the translated amino acids. The initiation codon (ATG) and the termination codon (TAA) are underlined. An RGD motif is shown in bold-type. The five N-terminal amino acid residues absent from the isolated Dd CyP2 protein are in italics. 1 50
  • Figure 22 Alignment of the amino acid sequence of Dd CyP2 and the sequences of selected members of the cyclophilin A family of proteins.
  • the sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group).
  • the sequences of the CyPs were derived from the following sources: (1) Dd CyP2; (2) Dd CyPl (Barisic et al, 1991); (3) Brassica napus (p24525, Gasser et al, 1990); (4) Saccharomyces cerevisiae (pl4832, Haendler et al, 1989); (5) human (p05092, Haendler et al, 1987).
  • Residues of human CyPA that are located in close contact with a tetrapeptide Ala- Ala-Pro- Ala substrate are marked with an asterisk (Kallen & Walkinshaw, 1992); residues of human CyPA that are in close contact with bound cyclosporin A are marked with a cross (Theriault et al, 1 93; Pflugl et al, 1993). 1 50
  • Figure 23 Alignment of the amino acid sequences of Dd CyP2 and four human cyclophilins.
  • the sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group) Fhe sequences of the CyPs were derived from the following sources: (1) human ( > PA (p05092, Haendler et al, 1987), (2) human CyPD (p30405, Bergsma d al . I *>l ); (3) Dd CyP2; (4) human CyPB (p23284, Price et al, 1991); (5) human ( ⁇ PC ( Schneider et al, 1994). 72
  • Figure 24 Alignment of the amino acid sequences of Dd CyP2 and CyPBs.
  • the sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group).
  • the sequences of the CyPBs were derived from the following sources: (1) human (p23284, Price et al, 1991); (2) mouse (p24369, Hasel et al, 1991); (3) chick (p24367, Caroni et al, 1991); (4) rat (p24368, Iwai and Inagami et al, 1990).
  • Dd CyP2 (XPl). Potential hydrophobic signal sequences of CyPBs are underlined. The RGD motifs are in bold-type.
  • Figure 25 Alignment of the amino acid sequences of two Dd CyPs and some plant CyPs.
  • the sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is pc _ ⁇ rmed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group).
  • the sequences for the CyPs were derived from the following sources: (I) Arabidopsis thaliana (114844, Lippuner et al, 1994) (2) tomato (m55019, Gasser et al, 1990); (3) Brassica napus (m55018.
  • MOLECULE TYPE DNA (genomic)
  • Trp Glu Leu Ty. Ala Leu Pro Val Lys Ser Tyr Ser Asn Pro Ser Val 210 215 220
  • Gin Thr Arg lie Gly Val Gin Pro Arg Asn He Thr Thr Lys Gly Phe 275 280 285
  • MOLECULE TYPE DNA (genomic)
  • MOLECULE TYPE DNA (genomic)
  • MOLECULE TYPE DNA (genomic)

Abstract

The present invention relates to proteins capable of binding bisphosphonate or bisphosphonate analogues, to methods for producing and identifying these proteins, to their corresponding genes and to various uses of the proteins, for example, in therapy and in the screening, isolation, synthesis, design and evaluation of bisphosphonate-based drugs.

Description

PHOSPHONATE BINDING PROTEINS
TECHNICAL FIELD
The present invention relates to proteins capable of binding bisphosphonate or bisphosphonate analogues, to methods for producing and identifying these proteins, to their corresponding genes and to various uses of the proteins, for example, in therapy and in the screening, isolation, synthesis, design and evaluation of bisphosphonate-based drugs.
BACKGROUND Bone pathology
A number of diseases are recognized which arise from bone destruction or disorders of bone metabolism. These diseases are of great clinical importance and have been the subject of intense scientific research for several decades.
Bone destruction can result from various cancers and from rheumatoid arthritis. Metabolic bone disorders commonly involve excessive bone resorption and include Paget's disease, hypercalcaemia (both tumour-induced and non-tumour induced), bone metastases and osteoporosis.
Paget's disease (a focal increase in bone turnover) is fairly common, and in some countries affects up to 5% of the population over 50 years of age. The disease may be caused by a slow virus infection, and leads to bone pain, deformities and fractures.
Bone metastases can induce bone destruction either through local invasion or via the secretion of bone-resorbing agents into the blood stream.
Hypercalcaemia can result either from an increase in the flow of calcium from bone or the intestine to the blood, or trom an increase in tubular reabsorption of calcium in the kidney. It can induce a Ai variety of physiological and/or pharmacological disturbances and can b lite threatening. Osteoporosis is characterized by a reduction in the quantity of bone. This loss of bone tissue often causes mechanical failure, and bone fractures frequently occur in the hip and spine of women suffering from postmenopausal osteoporosis. Kyphosis (abnormally increased curvature of the thoracic spine) is another common feature.
Two types of osteoporosis are recognized: primary and secondary. Secondary osteoporosis is the result of an identifiable disease process or agent, while primary osteoporosis (which constitutes about 90% of all cases) includes postmenopausal osteoporosis, age-associated osteoporosis (affecting a majority of individuals over the age of 70) and idiopathic osteoporosis affecting middle-aged and younger men and women.
The mechanism of bone loss in osteoporosis is believed to involve an imbalance of the process of "bone remodeling". Bone remodeling occurs throughout life, renewing the skeleton and maintaining the strength of bone. This remodeling involves the erosion and filling of discrete sites on the surface of bones, by an organized group of cells called "basic multicellular units" or "BMUs". BMUs primarily consist of osteoclasts, osteoblasts, and their cellular precursors. In the remodeling cycle, bone is resorbed at the site of an "activated" BMU by an osteoclast, forming a resorption cavity. This cavity is then filled with bone by osteoblasts.
Normally, in adults, the remodeling cycle results in a small deficit in bone, due to incomplete filling of the bone resorption cavity. Thus, even in healthy adults, age-related bone loss occurs. However, in many people, particularly in postmenopausal osteoporotics, there is an increase in the number of BMUs that are activated. This increased activation accelerates bone remodeling, resulting in abnormally high bone loss.
Many compositions and methods are described in the medical literature for the treatment of the above-described diseases and conditions, and most attempt to either slow the loss of bone or produce a net gain in bone mass.
Administration of oestrogen has been used as a means both to prevent and to treat osteoporosis in postmenopausal women. However, the use of oestrogen has been associated with certain side effects, such as uterine bleeding.
Other treatments are based on the administration of parathyroid hormone.
The hormone calcitonin has also been used to treat Paget's disease (and to a lesser extent tumour bone disease), and can be effective in decreasing bone turnover. However, the incidence of relapse is high, and side effects limit the therapeutic usefulness of calcitonin.
Perhaps one of the most successful class of drug for the treatment of the above diseases has proved to be the bisphosphonates. Bisphosphonates
Inorganic pyrophosphate has high affinity for bone mineral and is able to inhibit the precipitation and dissolution of calcium phosphate crystals in vitro and to inhibit bone mineralization in vivo. These activities are thought to arise from direct physicochemical effects (such as adsorption to hydroxy apatite, inhibition of dissolution of hydroxyapatite and crystal growth inhibition). Pyrophosphate has therefore found application as an antitartar agent for use in toothpastes.
However, inorganic pyrophosphate (PPi) is rapidly hydrolysed following administration (particularly oral administration) due to the presence of the relatively labile phosphorus-oxygen bond P-O-P. This severely limits their pharmaceutical utility, and has prompted a search for PPi analogues which exhibit similar physicochemical activities while resisting enzymatic hydrolysis in vivo.
Bisphosphonates, which are characterized by phosphorus-carbon (P-C-P) bonds, are stable analogues of naturally occurring inorganic pyrophosphates which to a great extent overcome the limitations associated with inorganic pyrophosphates. Bisphosphonates are resistant to chemical and enzymatic hydrolysis but retain the therapeutic activity of PPi.
Unlike pyrophosphates, however, bisphosphonates exhibit properties which extend beyond those attributable to purely physicochemical phenomena. In particular, bisphosphonates have been found to be inhibitors of osteoclast-mediated bone resorption in organ cultures of bone and in animal models. Bisphosphonates therefore have broader clinical utility than PPi, and have found widespread application in several main clinical areas, e.g., (a) as bone imaging agents for diagnostic purposes, usually in the form of " echnetium derivatives, (b) as anti- resorptive agents to combat bone loss associated with Paget's disease, hypercalcaemia associated with malignancy and bone metastases and osteoporosis, (c) as calcification inhibitors in patients with ectopic calcification and ossification, and (d) as antitartar agents for use in toothpastes.
Bisphosphonates are now among the most important therapeutic agents for the treatment of pathological disorders of bone metabolism, including osteoporosis. Moreover, since some bisphosphonates appear to have anti-inflammatory as well as anti-resorptive effects in vivo, they may also have utility in the treatment of inflammation and rheumatoid arth_-iιs. Bisphosphonates described in the literature generally have the structure:
Figure imgf000006_0001
Without wishing to be bound by any theory, bisphosphonates appear to contain a "bone binding moiety" (the P-C-P group) and a "bioactive moiety", R2. The "bone binding moiety" appears to endow the bisphosphonate compound with the ability to adsorb to bone and/or to hydroxyapatite (a model for bone), while the R2 bioactive moiety appears to determine the potency of the bisphosphonate.
Minor alterations to R2 can have a marked effect on potency, and this property is therefore specific to R2. Varying the R2 side chain has led to dramatic variations in potency (in some cases, 4 orders of magnitude) (Rogers et al., 1995, Mol. Pharm. 47:398-402).
R\ is a moiety that assists in binding to bone or in bioactivity (as R2), R3 is H or alkyl. Such compounds are described, for example, in U.S. Patent 5,391,743; Published European Patent application 186405 (published July 2, 1986); Published European Patent aplication 298553 (published January 1, 1989); published PCT applications WO 93/05044 (published December 9, 1993), WO 93/04469 (published December 9, 1993), WO 93/04979 (published December 9, 1993), WO 93/04978 (published December 9, 1993), and WO 93/04977 (published December 9, 1993), hereby incoφorated by reference.
The literature also describes bisphosphonate analogs, sometimeas also refered to as monophosphate analogues, where one phosphonate is replaced by a carboxylate, sulfonate or other acid or ester, such as in published applications WO 93/04993 (published December 9, 1993), and WO 93/04976 (published December 9, 1993), hereby incorporated by reference.
In, addition there may exist bisphosphonate "mimetics," which may have some, or none of the analogue's characteristics, yet bind to the bisphosphonate binding protein, and this may be useful in treating the same maladies as the bisphosphonates.
However, despite widespread recognition of the importance of bisphosphonates, the mechanism of action of these compounds has not been elucidated. It has been suggested that bisphosphonates may affect the differentiation and recruitment of osteoclast precursors or alter the capacity of mature osteoclasts to resorb bone by altering the permeability of the osteoclast membrane to small ions. Another hypothesis is that they act by affecting lysosomal enzyme production or cell metabolism or through toxic effects on osteoclasts (Carano et al., 1990,. J. Gin. Invest. 85:456-461). A further suggestion is that other cells in the bone microenvironment that regulate the activity of osteoclasts are involved in the antiresorptive mechanism.
There exists considerable opportunity for further innovation and development in this field. In addition, many further clinical applications may exist requiring different profiles of activity.
There is therefore a need to understand the mechanism of action of the bisphosphonates, to rapidly and efficiently screen potentially improved bisphosphonate drugs and to identify and create compounds having improved therapeutic activity.
The present inventors have now recognized that bisphosphonate drugs exert their effect (at least in part) via interaction with specific target proteins (hereinafter referred to as bisphosphonate binding proteins).
SUMMARY OF THE INVENTION
According to the present invention there is provided an isolated bisphosphonate binding protein(s), or homologues, fragments, muteins, equivalents or derivatives (e.g. fusion derivatives or synthetic peptides) thereof which substantially retain bisphosphonate binding activity.
The present invention also relates to bisphosphonate binding proteins that are useful in therapy and in the identification, isolation and/or design of novel drugs.
The invention also relates to isolated DNA encoding the bisphosphonate binding protein (or derivatives, fragments, etc.) thereof. The invention also relates to nucleic acid probes which are selectively hybridizable with the DNA of the invention.
The invention also relates to a method for producing a bisphosphonate binding protein.
The invention also relates to a method for designing and synthesizing a therapeutically active bisphosphonate or a mimetic thereof using a three-dimensional model of the bisphosphonate binding protein, or the bisphosphonate binding site of such protein
The invention also relates to the bisphosphonate binding protein, antibody thereto, mimetic or antagonist thereof for use in therapy, diagnosis or testing, both in vivo and in vitro.
The invention also relates to the protein of the invention for use in a method of screening bisphosphonates for therapeutic activity. Such screening contemplates test kits or screening kits, comprising the protein. The invention also relates to the protein of the invention which is labeled for use in the test kits of the invention.
Also contemplated by the invention is a host cell comprising the vector of the invention.
Further aspects of the invention will become apparent as the description proceeds.
BRIEF DESCRIPTION OF THE DRAWINGS
The invention is described in further detail by way of DRAWINGS, listed below.
Figure 1 : Affinity chromatography of a cell extract of Dictyostelium discoideum on an AHBuBP-affinity column.
Figure 2: Profile of eluted proteins from the AHBuBP-affinity column.
Figure 3 . SEP. ID NO. 1 and SEP. ID NO. 2.: Nucleotide and deduced amino acid sequence of the cDNA for wild-type DPI . Since this cDNA is identical to the sequence of discoidin II, it is clear that DPI is in fact discoidin II. The nucleotide sequence shown is that of the 968-bp cDNA of DPI from wild-type ∑ discoideum Ax-2. The adenine base of the ATG initiation sequence is assigned as 1 in the numbering. Nucleotides are numbered in the right margin and amino acids on the left. An open reading frame of 771-bp encoding 257 amino acids is shown with a single-letter code for the translated amino acids. A termination codon (TAA) at the end of the coding sequence is marked with an asterisk. A putative signal peptide at the beginning of the amino acid sequence is indicated in italics. The nucleotide sequence of the PCR fragment XJ-450 used to screen the cDNA library is underlined. Multiple polyadenylation signal sequences (AATAAA) are shown in italics.
Figure 4: Comparison of the deduced amino acid sequences for DPI (SEQ. ID NO. 3) and discoidin IA (DISC 1A) (SEQ. ID NO. 4). Asterisks indicate positions of identity while dots indicate positions of conservation. Amino acid sequences obtained for peptides of DPI are underlined and the corresponding numbers are shown in parentheses underneath the sequences. Regions used for generating primers XJ-1 (SEQ. ID NO » and XJ-2 (SEQ. ID NO. 6) are in italics. The alignment is performed by use ot the multiple alignment program of CLUSTALV (Higgins et al., 1992, d,mp irrl Biosci. 8: 189-191).
Figure 5: Northern blot anal ses ot D I mRNA expression in axenic strains. 0.5 ug samples of mRNA were fractionated on formaldehyde gels. After transfer to Hybond-N membrane (Amersham, Bucks. I . ). the samples were hybridized to 3 p. labeled DNA probes. In (a), mRNAs on the blot were subjected to successive hybridizations with 32P-labeled PCR fragments of DPI (XJ-450) from sequence ID #1, discoidin I A sequence ID #6 and Dd-tcpl (control). Lane 1 shows axenically- grown Ax-2, lane 2 shows bacterial ly-grown Ax-2. The columns on the right hand side represent the relative abundance of the hybridized mRNA transcripts in the corresponding lanes of the blots. In (b), hybridization to 32P-labeled DPI cDNA XJ- 450 of mRNA isolated from amoebae of strain Ax-2 harvested from axenic culture at low density (lxl 05 cells/ml) (A) and at high density (4xl06 cells/ml)(B).
Figure 6: Southern blot analysis of DPI. Genomic DNA from the wild-type Ax-2 strain is subjected to restriction digestion and the products were separated on a 1% agarose gel (a). After transfer to Hybond-N membrane, the samples were hybridized to the 32P-labeled XJ-450 probe (b) or the discoidin I A probe (c). Note that in each lane there is only a single band of hybridization to the DPI (discoidin II) cDNA XJ-450 whereas there were multiple bands for hybridization to the discoidin IA cDNA probe.
Figure 7: Sequence comparison matrices for DPI (horizontal) with (a) human coagulation factor V, Wood et al., Nature 312:330-337 (1984); Jenny, et al., Proc. Natl. Acad. Sci., USA 84:4846-4850 (1987), (b) human coagulation factor VIII, Wood et al., Nature 312:330-337 (1984), (c) ORF7 linked to the Rhodopseudomonas blastica atp operon (Yat7-Rhobl) Tybnlewicz et al., J. Mol. Biol. 179:185-214 (1984) and (d) milk fat globule (MFG) protein Larocca et al. 1991. The matrices were plotted using MDMγg mutation data matrix Pam.250 as the scoring system (Schwartz and Dayhoff, 1978, Atlas of Protein Sequence and Structure. 5: 353-358, National Biomedical Research Foundation, Washington DC), with a window size of 8 and a minimum score of 50%.
Figure 8: Hydrophilic plot and secondary structure prediction for DPI. The profile is constructed using the Kyte-Doolittle algorithm (Kyte and Doolittle, 1982, J. Mol. Biol. 157, 105-132) with a sliding window of seven amino acids. Values above the zero axis correspond to hydrophilic segments. Secondary structure predictions based on algorithms of both Chou-Fasman (Adv. Enzymol. Relat. Areas Mol. Bio., Al, 45-148, 1978) (CH, shadowed boxes) and Robson-Garnier (Gamier et al., 1978, J. Mol. Biol. 120, 97-120) (RG. filled boxes) are shown in the lower half of the Figure. Figure 9: Prediction of a probable cleavage site for a signal peptide of DPI by the method of van Heijne (1986) using the programme MacProt (Markiewicz, 1991, BioTechniques 10: 156-151 +160+162-1 >; Luttke, 1990, Comp. Method. Prog. Biomed. 31 :105-1 12). A window size of 15 residues of weight matrix for eukaryotic proteins is used. Generally, a protein having a signal peptide has one segment scoring greater than + 3.5 while cytosolic proteins have a score less than + 3.5.
Figure 10: Surface probability of the RGD-containing region of DPI . A surface probability profile of residues 76-88 is constructed using the Janin et al. (J. Mol. Biol. 125: 357-386, 1978) and Emini et al. (J. Virol. 55: 836-839, 1985) algorithms. A probability value above 0.5 is assigned to a water-accessible, "exposed" sequence and values below 0.5 to "buried" segments.
Figure 1 1. N-terminal amino acid sequence alignment of hDPl (SEQ. ID NO. 7) and the rat round spermatid 29,000 Mr protein (RSP-29) (SEQ. ID NO. 8) Onoda and Djaliew 1993 A. Asterisks indicate positions of identity, colons indicate positions of conservation and dots positions where the two sequences contain different amino acids.
Figure 12: Schematic representation of the amplification of hDPl cDNA by the polymerase chain reaction. Degenerate primers DJ1, DJ5, DJ9 (sense) and DJ10 (antisense) were designed from a knowledge of the peptide sequences of hDPl. The nucleotide sequence of the primer UAP (antisense) had been incoφorated into the first strand cDNA during reverse transcription using primer oligo(dT)17-AP.
Figure 13: Nucleotide sequence and deduced amino acid sequence of hDPl cDNA. The nucleotide sequence of hDPl (SEQ. ID NO. 9 and SEQ. ID NO 10) cDNA isolated from a human testis cDNA library consists of 1161 base pairs. The largest open reading frame consists of 936 bp and contains six ATG codons. The deduced amino acid sequence is shown in a single letter code. The mature hDPl protein appears to be encoded by the nucleotide sequence started from the second ATG codon and contains 260 amino acids (shown in bold type). The adenine base of this second ATG codon and the methionine encoded by this codon are assigned as 1 respectively in the numbering. Nucleotides are numbered in the right margin and amino acids in the left. A putative N-terminal sequence of 48 amino acids starting from the first ATG codon is shown in italics. A stop codon (TGA) at the terminus of the translation sequence is marked with an asterisk. The amino acid sequences of peptides 1 and 2 are underlined.
Figure 14: Northern bio* nalyses of hDPl in human tissues. Human multiple tissue blots (I and II) containing 2 ug poly(A )RNA from various tissues were purchased from Clontech (Palo Alto, California). Northern blots III and IV were prepared by running 1 ug of poly(A )RNA in a formaldehyde gel and blotting onto Hybond-N membrane. Hybridization is carried out using a [ P]-labeled PCR fragment of hDPl cDNA. A DNA fragment of 3-actιn is also labeled with [ P] and hybridized to the same blots. The height of the bar on top of each lane represents the relative abundance of hDPl mRNA in that tissue.
Figure 15: Southern analysis of hDPl on a zoo-blot. The Southern blot containing 8 ug of genomic DNA per lane from nine eukaryotic species is purchased from Clontech. The DNA had been digested with ECQRI, run on a 0.7% agarose gel and transferred to a nylon membrane. Hybridization is carried out using a [ P]- labeled PCR fragment of hDPl (SEQ. ID NO. 13) cDNA. Lanes 1-9 contain, in order, genomic DNA from yeast, chicken, rabbit, cow, dog, mouse, rat, monkey and man.
Figure 16: Northern blot analyses of hDPl homologues in Dictyostelium discoideum. A [ P]-labeled PCR fragment of hDPl cDNA is hybridized to a northern blot containing 5 ug poly(A )RNA isolated from Dictyostelium amoebae: Lane 1 , strain Ax-2 grown with bacteria; Lane 2 strain Ax-2 grown axenically.
Figure 17: Sequence alignment of hDPl. (a) and (b)- show matrix plots of hDPl (horizontal) against ORF3 linked to the Rhodopseudomonas blastica atp operon (ATPase-ORF3, vertical) and ORF1 linked to the genes for arginyl tRNA synthetase and ribonuclease H of Buchnera aphidicola (ATS-ORF1, vertical). The matrices were plotted using MDMγg mutation data matrix Pam250 as the scoring system (Schwartz and Dayhoff, 1978, Atlas of Protein Sequence and Structure, Vol. 5, 353-358, National Biomedical Research Foundation, Washington, DC), with a window size of 8 and a minimum score of 55%. In (c), the sequence alignment of hDPl with ATPase-ORF3 and ATS-ORF1 is shown. The alignment is performed by using multiple alignment program CLUSTALV (Higgins et al., 1992, Comp. Appl. Biosci. 8: 189-191). A position where three sequences contain the same amino acid is indicated by an asterisk. A dot indicates a position where the three sequences contain amino acids having similar properties.
Figure 1 . Sequence comparison of hDPl (SEQ. ID NO. 10) with aspartate aminotransferase (AAT) (SEQ. ID NO. 1 1 ) from chicken mitochondria. hDPl is shown in the upper line and DPI in the lower one. "|" indicates identity between aligned residues; ":" indicates similarity. The comparison is performed using the program "bestfit" of the GCG package ( Devereux et al., 1984, Nucelic Acids Res. 12: 387-395). Identity between the two proteins is 14.3% and the similarity is 44.1%. Figure 19: Hydrophilic plot and secondary structure prediction for hDPl . The hydrophilic profile is constructed using the Kyte-Doolittle algorithm (Kyte and Doolittle, 1982, J. Mol. Biol. 157, 105-132) with a sliding window of seven amino acids. Values above the zero axis correspond to hydrophilic segments. Secondary structure prediction is based on algorithms of both Chou-Fasman (Adv. Enzymol. Relat. Areas Mol. Biol. 47, 45-148, 1978) (CH, shadowed boxes) and Robson- Garnier (Gamier et al., 1978, J. Mol. Biol. 120, 97-120) (RG, filled boxes), shown in the lower half of the Figure.
Figure 20. Amino acid sequence comparison of hDPl (SEQ. ID NO. 10) with DPI (SEQ. ID NO. 3) (discoidin II). hDPl is shown in the upper line and DPI in the lower. "|" indicates identity between aligned residues; ":" indicates similarity. The comparison is performed using the program "bestfit" of the GCG package . (Devereux et al., 1984, Nucelic Acids Res. 12: 387-395). The identity between the two proteins is 12.8% and the similarity is 38.5%.
Figure 21. Nucleotide and predicted amino acid sequence of DdCyP2 cDNA (SEQ. ID NO. 12). The nucleotide sequence shown is a 685 bp cDNA of DdCyP2 (SEQ. ID NO. 13) isolated from a Dictyostelium discoideum strain Ax2 cDNA library. Nucleotides are numbered in the left margin and amino acids on the right. An open reading frame of 540 bp encoding 180 amino acids is shown using a single letter code for the translated amino acids. A start codon (ATG) and stop codon (TAA) are underlined. An RGD motif is in bold.
Figure 22. Alignment of the amino acid sequence of Dd CyP2 and the sequences of selected members of the cyclophilin A family of proteins. The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences of the CyPs were derived from the following sources: (1) Dd CyP2; (2) Dd CyPl (Barisic et al., 1991, Developmental Genetics 12:50-53); (3)
Brassica napus (p24525, Gasser et al., 1990, Proc. Natl Acad. Sci. USA 87: 9519- 9532); (4) Saccharomyces cerevisiae (pi 4832, Haendler et al., 1989, Gene 83: 39- 46); (5) human (p05092, Haendler et al., 1987, EMBO J. 6: 947-950). Residues of human CyPA that are located in close contact with a tetrapeptide Ala-Ala-Pro-Ala substrate are marked with an asterisk (Kallen & Walkinshaw, 1992, FEBS Lett. 300: 286-290); residues of human CyPA that are in close contact with bound cyclosporin A are marked with a cross (Theriault et al., 1993, Nature 361 : 88-91; Pflugl et al., 1993, Nature 361 : 91-94).
Figure 23. (SEQ. ID NO. 13)Alignment of the amino acid sequences of Dd CyP2 and four human cyclophilins. The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences of the CyPs were derived from the following sources: (1) human CyPA (p05092, Haendler et al., 1987, EMBO J. 6: 947-950), (2) human CyPD (p30405, Bergsma et al., 1991, J. Biol. Chem. 266: 23204-23214); (3) Dd CyP2; (4) human CyPB (p23284, Price et al., 1991, Proc. Natl. Acad. Sci. 88: 1903-1907); (5) human CyPC (Schneider et al., 19 '4, Biochemistry 33 : 8218-8224).
Figure 24. (SEQ. ID NO. 13)_Alignment of the amino acid sequences of Dd CyP2 and CyPBs. The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences of the CyPBs were derived from the following sources: ( 1 ) human (p23284, Price et al., 1991, Proc. Natl. Acad. Sci. 88: 1903-1907): (2) mouse (p24369, Hasel et al., 1991, Mol. and Cellular Bio. 11: 3484-3491); (3) chick (p24367, Caroni et al., 1991, J. Biol. Chem. 266: 10739-10742); (4) rat (p24368. Iwai and Inagami et al., 1990, Kidney Int. 37: 1460-1465). (5) Dd CyP2 (XPl ). Potential hydrophobic signal sequences of CyPBs are underlined. The RGD motifs are in bold-type.
Figure 25. (SEQ. ID NO. 13) Alignment of the amino acid sequences of two Dd CyPs and some plant CyPs. The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences for the CyPs were derived from the to I l win sources: (1) Arabidopsis thaliana (114844, Lippuner et al., 1994, J. Biol < hem 269: 7863-7868) (2) tomato (m55019, Gasser et al., 1990, Proc. Natl. Acad / / V< 87: 9510-9523); (3) Brassica napus (m55018, Gasser et al., 1990, Proc. \att W ad Sci. USA 87: 9510-9523); (4) onion (113365, Clark et al., 1993, direct submission of the onion cyclophilin to the GenBank); (5) maize (m55021, Gasser et al . 1 ^90. Proc. Natl. Acad. Sci. USA 87: 9510-9523); (6) Dd CyP2 (XPl); (7) Arabidopsis thaliana (x63616, Bartling et al., 1991, Plant Mol. Biol. 19: 529-530); (8) Arabidopsis thaliana (114845, nuclear- encoded chloroplast stromal, Lippuner et al., 1994, J. Biol. Chem. 269: 7863-7868); (9) Dd CyPl (Barisic et al., 1991, Developmental Genetics 12:50-53). The seven amino acid insertion in Dd CyP2 is in bold-type. The potential ATP/GTP binding sites are underlined.
DETAILED DESCRIPTION
The term "isolated" is used herein to indicate that the binding protein is substantially isolated with respect to the complex cellular milieu in which it naturally occurs. The absolute level of purity is not critical, and those skilled in the art can readily determine appropriate levels of purity depending upon the intended use for the protein. In many circumstances, the isolated bisphosphonate binding protein will form part of a composition, buffer system or pharmaceutical excipient, which may for example contain other components (including other proteins, such as albumin). In other circumstances, the bisphosphonate binding protein may be purified to essential homogeneity, for example as determined by PAGE or column chromatography (for example HPLC).
The term "bisphosphonate binding protein" is used herein to define a protein which can bind bisphosphonate and/or act (either directly, or indirectly) as a target for bisphosphonate(s) in vivo. The bisphosphonate binding proteins of the invention may bind specifically to bisphosphonates, or may exhibit broader binding affinity and bind to other molecules in addition to bisphosphonate (with either the same or with a different affinity).
Thus, as defined herein, the bisphosphonate binding proteins of the invention may be targets of the physiological and/or pharmacological action of bisphosphonates and may mediate the physiological and or pharmacological effects of bisphosphonates in vivo (such as growth inhibition in Dictyostelium discoideum and antiresoφtive action in humans), either alone or as part of a complex comprising other proteins and/or molecules. Accordingly, the bisphosphonate binding proteins of the invention may be bisphosphonate receptors, and may be involved in signal transduction during a cellular response to bisphosphonate. The bisphosphonate binding protein may also be an enzyme for which bisphosphonates are substrate analogues.
The term "bisphosphonate" is used herein in a broad sense to cover not onl> bisphosphonates sensu stricto. as defined by the literature above, but also bisphosphonate analogues. Bisphosphonate analogues are those ligands which can compete with osteoactive bisphosphonate for binding to the cellular targets of the osteoactive bisphosphonates, or which can compete in vitro (for example, on an affinity column) with osteoactive bisphosphonate (in either the free state or in the form of a derivative linked to an affinity column) for binding to the binding proteins of the invention. These include HOOC-C-PO H2, HO3S-C-PO3H2 RHO3P-C- PO3H2 and the like. Such compounds are disclosed for example in F. H. Ebetino, et al. U. S Patent 4,868,164, issued July 6, 1988, F. H. Ebetino, et al. Published European Patent application 87/0274158, published July 13, 1988, F. H. Ebetino, et al. U. S Patent 5,334,586, issued December 1, 1991, F. H. Ebetino, et al. U. S Patent 4,868,164, issued August 2, 1994, C. N. Yu, et al. Published European Patent application 92/918467.9, published March 3, 1993, F.H. Ebetino, et al. Published PCT Patent application 93/04976, published December 9, 1993, all incoφorated herein by reference. Bisphosphonates, and bisphosphonate analogues referred to throughout this application are:
Figure imgf000015_0001
where
R2 is a "bioactive moiety" as described above;
Ri is a moiety that assists binding to bone, or assists bioactivity; and each A is individually an acidic moiety, an ester or the like; or any moiety that binds to bone.
Specific examples of these include:
Figure imgf000015_0002
3PHEBP HEBP AHBuBP (Risedronate) (Etidronate) (Alendronate)
Figure imgf000016_0001
C12MBP AHPrBP PLP clodronate (pamidronate) (pyridoxal phosphate)
Figure imgf000016_0002
NE10788 Bisphosphonate analogues include, but are not limited to pyridoxal phosphate (PLP), O-phosphorylethanolamine, O-phosphorylcholine, phosphatidyl ethanolamine and phospholipid bisphosphonate analogues. Pyridoxal phosphate (PLP), O- phosphorylethanolamine, O-phosphorylcholine, phosphatidyl ethanolamine are known in the art.
"Osteoactive bisphosphonates" are those bisphosphonates which act as antiresoφtive drugs in vivo.
The term "Bisphosphonate mimetic" as defined herein, is a functional term that describes any molecule which performs similarly to a bisphosphonate in vivo, or in vitro. The structure of such molecules are secondary to their function.
In preferred embodiments of the invention, the bisphosphonate binding protein is a cyclophilin, a discoidin or a round spermatid protein (or homologue thereof).
The term "discoidin" is a term of the art, and defines a family of proteins sharing sequence similarity or homology with the discoidin proteins of Dictyostelium (particularly discoidin I) which are cytoplasmic components. The term discoidin covers discoidin homologues (e.g. mammalian homologues) which share conserved domains characteristic of the discoidin family.
The term "cyclophilin" (CyP) is also a term of the art and defines a family of proteins falling into at least two groups: the cyclophilin A group and the cyclophilin B group. There are many cyclophilins described, all showing sequence similarities and many characterized by their ability to bind to the immunomodulatory drug cyclosporin A (CsA). Most cyclophilins possess enzyme activity, being peptidyl prolyl cis-trans isomerases.
The term "round spermatid protein" (RSP) is used herein to define a family of proteins sharing sequence similarity and/or exhibiting homology with the rat RSP-29 protein (described by Oneda and Djakiew, 1993, Molecular and Cellular Endocrinology 93, 53-61) which is a protein of rat round spermatids. RSP-29 has homologues in many different species, including higher eukaryotes (monkey to chicken), lower eukaryotes .Dictyostelium discoideum) and even prokaryotes (Rhodopseudomonas blastica and Buchnera aphidicola). The round spermatid proteins of the invention may therefore control cell differentiation in the testes as well as in the bone, and may constitute a hitherto unrecognized family of factors involved in cell differentiation (including the differentiation of bone cells such as osteoclast progenitor cells).
The term "family" is used herein to indicate a group of proteins or genes which share substantial sequence similarities, either at the level of the primary sequence of the proteins themselves, or at the level of the DNA encoding them. The sequence similarities may extend over the entire protein/gene, or may be limited to particular regions or domains. Similarities may be based on nucleotide/amino acid sequence identity as well as similarity (for example, those skilled in the art recognize certain amino acids as similar, and identify substitutions of similar amino acids as conservative changes). Some members of a protein family may be related in the sense that they share a common evolutionary ancestry, and such related proteins are herein referred to as "homologues". The members of a protein family may not necessarily share the same biochemical properties or biological functions, though their similarities are often reflected in common functional features (such as effector binding sites and substrates). The criteria by which such families are recognized are well-known in the art, and include computer analysis of large collections of sequences at the level of DNA and protein as well as biochemical techniques such as hybridization analysis.
Without being bound by theory, we propose that the bisphosphonate binding domain of the binding proteins of the invention may be characterized by the amino acid sequence motifs RGD and or SEQ. ID. NO. 34. While other binding domains may occur, preferably, the bisphosphonate binding protein of the invention comprises the amino acid sequence RGD and/or SEQ. ID. NO. 34. Other binding proteins of the invention may have different binding domains characterized by different amino acid sequence motifs. For example, the binding protein of the invention may bind pyridoxal phosphate, O-phosphorylethanolamine, O- phosphorylcholine, phosphatidyl ethanolamine or phospholipid bisphosphonate analogues, and or comprise the C-terminal region of human DPI (hDPl) and/or the amino acid sequence RGD and/or the amino acid sequence SEQ. ID. NO. 34 and/or a pyridoxal phosphate-binding domain/pseudodomain and/or a phosphatidyl ethanolamine-binding domain/pseudodomain, and/or be a Dictyostelium discoideum bisphosphonate binding protein (or homologue thereof).
Dictyostelium discoideum bisphosphonate binding proteins are particularly preferred because the present inventors have found that this organism expresses bisphosphonate binding proteins while being relatively easy to grow in large quantities. Some Dictyostelium discoideum bisphosphonate binding proteins have homologues in other organisms (for example mammalian, e.g. human cells), and such homologues are also covered by the invention.
Also contemplated by the invention are fragments of the protein of the invention which comprise the bisphosphonate binding domain, for example, comprising the amino acid sequence RGD and/or SEQ. ID. NO. 34 the C-terminal region of human DPI (hDPl), a pyridoxal phosphate-binding domain/pseudodomain and/or a phosphatidyl ethanolamine-binding domain/pseudodomain, or any fragment substantially retains bisphosphonate binding activity.
The term "pseudodomain" is used herein to indicate that the domain is structurally and functionally related to its correlate domain, but binds a different (although usually structurally related) ligand.
Preferably, such fragments consist essentially of any bisphosphonate binding domain in the protein, preferably the bisphosphonate binding domain described above. The fragments may be fused to other peptide sequences preferably having, for example, enzyme activity or to antibodies (such as monoclonal antibodies or antibody fragments).
The bisphosphonate binding domain may be identified by any of a variety of methods known to those skilled in the art such as sequence analysis, protection analysis, affinity labeling (e.g. photoaiTinity labeling) and/or the generation of a collection of fragments via e.g. protease treatment. Such techniques are known in the art
One particularly preferred bisphosphonate binding protein is DPI (SEQ. ID NO. 3), or homologues, fragments, muteins. equivalents or derivatives (e.g. fusion derivatives or synthetic peptides) there t v.hιch substantially retain bisphosphonate binding activity.
Another particularly preferred bisphosphonate binding protein is hDPl (SEQ. ID NO. 9), or homologues, fragments, muteins, equivalents or derivatives (e.g. fusion peptide derivatives or synthetic peptides) thereof which substantially retain bisphosphonate binding activity.
Yet another particularly preferred bisphosphonate binding protein is a second Dictyostelium discoideum cyclophilin herein designated DdCyP2 (SEQ. ID NO. 13), or homologues (for example, human cyclophilin B), fragments, muteins, equivalents or derivatives (e.g. fusion derivatives or synthetic peptides) thereof which substantially retain bisphosphonate binding activity.
As used herein, the term "homologue" defines proteins which are related in the evolutionary sense to the proteins of the invention, and may for example define the equivalent protein from a different organism.
The term "mutein" is used herein to define proteins that are mutant forms of the proteins of the invention, i.e. proteins in which amino acids have been added, deleted or substituted.
The term "equivalent" as used herein and applied to the proteins of the invention defines proteins which exhibit substantially the same functions as those of the proteins of the invention while differing in structure (i.e. amino-acid sequence). Such equivalents may be generated for example by identifying sequences of functional importance, selecting an amino acid sequence on that basis and then synthesizing a peptide based on the selected amino acid sequence. Such synthesis can be achieved by any of many different methods known in the art, including solid phase peptide synthesis and the assembly (and subsequent cloning) of oligonucleotides.
The term "derivative" as applied herein to the binding proteins of the invention is used to define proteins which are modified versions of the binding proteins of the invention. Such derivatives may include fusion proteins, in which the proteins of the invention have been fused to one or more different proteins or peptides (for example an antibody or a protein domain conferring a biochemical activity, such as enzymic or conjugative activity, to act as a label, or to facilitate purification).
The homologues, fragments, muteins, equivalents or derivatives of the proteins of the invention may cross-react with antibodies to the proteins of the invention, and in particular may cross-react with antibodies directed against any or all of the proteins DPI, hDPl or DdCyP2.
In another aspect, the invention contemplates a method for producing a bisphosphonate binding protein. Preferably such a method comprises the steps of: (a) linking bisphosphonate to a chromatography column to produce an affinin column; (b) loading the binding protein ( for example in the form of a cell extract ) onto the affinity column such that it becomes bound thereto; and (c) selectively eluting the binding protein from the affinity column. Methods of preparation of affinity chromatography materials are known in the art.
The step of selectively eluting the binding protein need not result in the elution of the binding protein alone in completely pure form, but merely results in a net purification (in most circumstances, a substantial net purification) of the binding protein. Selective elution as applied in the method of the invention is a process by which the binding protein is more or less specifically eluted from the column, accompanied by varying amounts of contaminating proteins and/or other (macro)molecules. Preferably, the selective elution results in the binding protein eluting as the (or a) major protein component. Preferably, the elution conditions are such that the binding protein is washed off the column substantially free of all other proteins.
The affinity chromatography material preferably comprises a bisphosphonate linked to the column. The bisphosphonate preferably comprises one or more bisphosphonate species covalently linked to the column, for example via a primary amine or sulphydryl group. However, any linkage may be used, so long as the linked bisphosphonate can function as a ligand in the process of affinity chromatography.
The cell extract may be a cytosolic, nuclear or membrane fraction, and can be prepared by any of a large number of techniques known to those skilled in the art. Such techniques include sonication, enzymatic lysis, centrifugation, precipitation, osmotic rupture or mechanical rupture. The cell extract is preferably a cytosolic extract, but may optionally include other cellular fractions.
Preferably, the cell from which the extract is prepared is one which is sensitive to bisphosphonate. Examples of such cells include mammalian or amoebal (e.g. Entamoeba spp. or Dictyostelium spp.) cells. Particularly preferred are cells of Dictyostelium discoideum. Cell lines (for example mammalian, e.g. human) expressing or overexpressing the bisphosphonate binding protein of the present invention are also useful as a source of binding protein for purification.
The binding protein may be selectively eluted from the column by loading an excess of unlinked bisphosphonate onto the column. The unlinked bisphosphonate may be different from the bisphosphonate linked to the column, and may for example be an osteoactive bisphosphonate.
The unlinked bisphosphonate for elution is preferably selected from AHBuBP, 3PHEBP, a monophosphonate analogue of 3PHEBP or PLP. The bisphosphonate linked to the chromatography column in step (a) is preferably an osteoactive bisphosphonate. Particularly preferred are AHBuBP (Alendronate) and AHPrBP (Pamidronate).
In another aspect, the invention contemplates a method for producing a bisphosphonate binding protein, comprising the steps of: (a) providing a bisphosphonate-resistant Dictyostelium mutant (e.g. a Dictyostelium discoideum mutant) bearing a mutated bisphosphonate binding protein gene; (b) cloning the wild-type gene corresponding to that mutated in step (a) to produce a cloned gene encoding a bisphosphonate-binding protein, and either: (i) expressing the cloned gene encoding a bisphosphonate binding protein to produce the bisphosphonate binding protein, or (ii) preparing a probe which is selectively hybridizable to the cloned gene and identifying a further gene on the basis of its selective hybridization with the probe and expressing said further gene.
In the case where a probe is used to identify a gene encoding a bisphosphonate binding protein, this gene may be a heterologous gene, i.e. a corresponding gene from a different biological source, a mutant thereof, etc..
The mutant for use in the above-described method is conveniently provided by the step of continuously culturing wild- type Dictyostelium discoideum Ax-2 amoebae in the presence of bisphosphonate (e.g. AHBuBP), the bisphosphonate being present at a concentration sufficient to substantially prevent growth but not sufficient to cause immediate lysis (for example, at a concentration of from 50-100 uM for AHBuBP).
The invention as described above does not rely only on the generation of spontaneous mutants, and any of a wide variety of known mutagenesis techniques could also be used (for example those involving the treatment of the amoebae with various mutagens).
The invention also embraces bisphosphonate binding protein obtainable by the methods of the invention, as well as bisphosphonate binding proteins which have been obtained by the various methods of the invention.
In another aspect, the invention relates to isolated DNA encoding the bisphosphonate binding protein (or derivatives, fragments, and the like) thereof.
As used herein and applied to DNA, the term "isolated" indicates that the DNA is substantially isolated with respect to the complex cellular milieu in which it naturally occurs, or simply present in a different nucleic acid sequence context from that in which it occurs in nature (for example, when cloned or in the form of a restriction fragment). Thus, the DNA of the invention may be isolated in the sense used herein, yet present in any of a wide variety of vectors and in any of a wide variety of host cells (or other milieu, such as buffers, viruses or cellular extracts).
The DNA of the present invention embraces DNA having any sequence so long as it encodes the bisphosphonate binding protein, fragment, mutein, etc. of the invention. As a result of degeneracy in the genetic code, any particular amino acid sequence may be encoded by many different DNA sequences. Determining whether a sequence encodes the above material is well within the scope of the skilled artisan as is the designing of the DNA or RNA sequence given the amino acid sequence
In a particular embodiment, the isolated DNA of the invention has the sequence shown in Fig. 3 (SEQ. ID NO. 1), Fig. 13 (SEQ. ID NO. 9) or Fig. 21 (SEQ. ID NO. 12).
The invention also contemplates a recombinant expression vector comprising the DNA of the invention. The nature of the vector is not critical to the invention, and any vector may be used, including plasmid, virus, bacteriophage, transposon, minichromosome, liposome or mechanical carrier. As used herein, recombinant expression vector refers to a DNA construct used to express DNA which encodes a desired protein and which includes a transcriptional subunit comprising an assembly of 1) genetic elements having a regulatory role in gene expression, for example, promoters and/or enhancers, 2) a structural or coding sequence which is transcribed into mRNA and translated into protein, and 3) appropriate transcription and translation, initiation and termination sequences. Using methodology well known in the art, recombinant expression vectors of the present invention can be constructed. Possible vectors for use in the present invention include, but are not limited to: for mammalian cells, pJT4 (discussed further below), pcDNA-1 (Invitrogen, San Diego, CA) and pSV-SPORT 1 (Gibco-BRL, Gaithersburg, MD); for insect cells, pBlueBac III or pBlueBacHis baculovirus vectors (Invitrogen, San Diego, CA); and for bacterial cells, pET-3 (Novagen, Madison, WI). The DNA sequence coding for the bisphosphonate binding protein of the invention can be present in the vector operably linked to regulatory elements.
The vector may preferably comprise an expression element or elements operably linked to the DNA of the invention to provide for expression thereof at suitable levels. Any of a wide variety of expression elements may be used, and the expression element or elements may for example be selected from promoters, enhancers, ribosome binding sites, operators and activating sequences. Such expression elements may be regulatable, for example inducible e.g., via the addition of an inducer. As used herein, "operably linked" refers to a condition in which portions of a linear DNA sequence are capable of influencing the activity of other portions of the same linear DNA sequence. For example, DNA for a signal peptide (secretory leader) is operably linked to DNA for a polypeptide if it is expressed as a precursor which participates in the secretion of the polypeptide; a promoter is operably linked to a coding sequence if it controls the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit translation. Generally, operably linked means contiguous and, in the case of secretory leaders, contiguous in reading frame.
The vector of the invention can also be a viral vector, being for example based on simian virus 40, adenoviruses (e.g. human adenoviruses), retroviruses, and papillomavirus.
The vector may further comprise a positive selectable marker and/or a negative selectable marker. The use of a positive selectable marker facilitates the selection and/or identification of cells containing the vector.
Also contemplated by the invention is a host cell comprising the vector of the invention. Any suitable host cell may be used, including prokaryotic host cells (such as Escherichia coli and Bacillus subtilis). eukaryotic host cells (including mammalian cells, amoebal cells and yeast cells). Host cells may be stably transfected or transiently transfected within a recombinant expression plasmid or infected by a recombinant virus vector. Other host cells include permanent cell lines derived from insects such as Sf-9 and Sf-21. and permanent mammalian cell lines such as Chinese hamster ovary (CHO) and SV40-transformed African green monkey kidney cells (COS).
The invention also embraces a method for producing a bisphosphonate binding protein comprising the steps of: (a) culturing the host cell of the invention such that the bisphosphonate binding protein is expressed, and (b) isolating the bisphosphonate binding protein expressed in step (a).
Such recombinantly-produced bisphosphonate binding protein (recombinant bisphosphonate binding protein) can be produced relatively inexpensively in large quantities, and can be relatively easily pun tied, using the method of the invention.
In another aspect, the invention also contemplates a method for producing a bisphosphonate binding protein comprising the steps of: (a) probing a gene library with a nucleic acid probe which is sclec ely hybridizable with the DNA of the invention (for example having a sequence vvhich is comprised in a gene corresponding to any of the sequences shown in Fig. 3 (SEQ. ID NO. 1), Fig. 13 (SEQ. ID NO. 9) or Fig. 21 (SEQ. ID \< ) 1 2 >. to produce a signal which identifies a gene that selectively hybridizes to the probe, and (b) expressing the gene identified in step (a) (for example by cloning into a host cell) to produce bisphosphonate binding protein.
As used herein, the term "selectively hybridizable" indicates that the sequence of the probe is such that binding to a unique (or small class) of target sequences can be obtained under more or less stringent hybridization conditions. This method of the invention is not dependent on any particular hybridization conditions, which can readily be determined by the skilled worker (e.g. by routine methods or on the basis of thermodynamic considerations).
The invention also embraces a bisphosphonate binding protein obtainable by the above-described methods and other methods.
The invention also relates to a nucleic acid probe which is selectively hybridizable with the DNA of the invention. Again, the term "selectively hybridizable" indicates that the sequence of the probe is such that binding to a unique (or small class) of target sequences can be obtained under more or less stringent hybridization conditions.
Preferably, the nucleic acid probe is selectively hybridizable with (for example having a sequence which is comprised in) a gene corresponding to any of the sequences shown in Fig. 3 (SEQ. ID NO. 1), Fig. 13 (SEQ: ID NO. 9) or Fig. 21 (SEQ. ID NO. 12).
The protein of the invention may be labeled, for example with a fluorescent label, an antibody, a radioisotope or an enzyme. Such labeled proteins may be particularly suitable for use in the test kits of the invention.
The invention also covers various uses of the bisphosphonate binding proteins of the invention. For example, the invention contemplates a method for screening for therapeutically active bisphosphonates, new and potentially active bisphosphonates and their analogues, as well as entirely new classes, not broadly defined as bisphosphonates, comprising the steps of: (a) contacting a bisphosphonate with the bisphosphonate binding protein of the invention, and (b) determining whether binding between the bisphosphonate and binding protein occurs, wherein binding is indicative of a therapeutically active bisphosphonate.
The method described above is useful for screening large numbers of bisphosphonates and bisphosphonate like compounds for therapeutic activity. It may also be used to classify or differentiate potential osteoactive or anti-arthritic bisphosphonates according to their mode of action (e.g., by their cellular targets). In fact, it may be used in ranking efficacy of the compounds to be screened. Preferably, the method is employed in high throughput screening. Compounds identified by the method of the invention can be further modified or used directly as therapeutic compounds to activate or inhibit the natural functions of the binding protein encoded by the isolated DNA molecule, for example in the treatment of osteoporosis.
The invention also provides a method for evaluating the therapeutic activity of a bisphosphonate comprising the steps of: (a) contacting the bisphosphonate with a bisphosphonate binding protein of the invention, and (b) measuring the binding affinity of the bisphosphonate binding protein for the bisphosphonate.
For example, the method described above is useful for ranking the therapeutic activity of potential bisphosphonate drugs, bisphosphonate analogs, and "bisphosphonate mimetics" (acting in a similar manner to a bisphosphonate) drugs.
In another aspect, the invention provides a method for synthesizing a therapeutically active bisphosphonate comprising the steps of: (a) generating a three-dimensional model of the bisphosphonate binding site of the bisphosphonate binding protein of the invention, and (b) modeling the therapeutically active bisphosphonate with reference to the three-dimensional model generated in step (a).
Many different techniques exist for generating a three-dimensional model of the binding protein (or fragments/derivatives thereof) for use in the above-described methods, and all are suitable for use in the method of the invention. Conveniently, the three-dimensional model is generated by computer analysis of the amino-acid sequence of all or a portion of the bisphosphonate binding protein (for example the bisphosphonate binding domain). Alternatively, the three-dimensional model could be generated by X ray crystallography or by NMR of the binding protein (or fragments/derivatives thereof). These techniques could also be applied to the bisphosphonate binding protein-bisphosphonate complex, the results of which could also be used as the basis for the rational design of therapeutic agents.
The invention also embraces a therapeutically active bisphosphonate which has been screened, evaluated or synthesized by the methods described above.
Also contemplated by the present invention are test kits comprising the bisphosphonate binding protein of the invention. Such kits are useful, for example, in the screening and evaluating methods of the invention. The bisphosphonate binding proteins comprised in the kits of the invention are preferably bound to a solid support and may conveniently include a labeled (e.g. radioactively-labeled, fluorescently labeled, enzymatically labeled) bisphosphonate (for example for use in competitive binding assays or in displacement assays).
Also contemplated by the invention are antibodies which bind to the binding protein of the invention. Such antibodies can be prepared by employing standard techniques well known to those skilled in the art, using any of the bisphosphonate binding proteins of the invention as antigens for antibody production. These antibodies can be employed in assays, diagnostic applications, therapeutic applications, and the like. Preferably, and particularly for therapeutic applications, the antibodies are monoclonal antibodies.
The antibodies of the invention may advantageously bind specifically to the bisphosphonate binding proteins of the invention. Antibodies specific for the bisphosphonate binding site may act as bisphosphonate mimetics. Specific binding may be exploited in imaging techniques, for example to assess the extent to which bisphosphonate targets are available for bisphosphonate action, or to determine the degree of occupancy of bisphosphonate targets in patients undergoing bisphosphonate therapy. They may also be used to identify and isolate new bisphosphonate binding proteins.
The invention also contemplates antibody derivatives, including antibody fragments (e.g. Fab fragments), chimaeric antibodies (including humanized antibodies) and antibody derivatives (such as fusion derivatives comprising an antibody-derived variable region and a non-immunoglobulin peptide having for example enzyme or conjugative activity).
The invention also contemplates mimetics (for example, the antibodies of the invention described above) or antagonists of the bisphosphonate binding protein of the invention.
The bisphosphonate binding proteins of the invention find utility in a wide range of therapeutic applications, and in another aspect the invention contemplates the bisphosphonate binding protein, antibody thereto, mimetic or antagonist thereof for use in therapy.
The bisphosphonate binding proteins, antibodies thereto, mimetics or antagonists thereof can be administered in a clinical setting by intraperitoneal, intramuscular, intravenous, or subcutaneous injection, implant or transdermal modes of administration, and the like.
Such administration can be expected to provide therapeutic alteration of the activity of bone active agents, and in preferred embodiments, the therapy involves: (i) the regulation of bone metabolism, for example in the treatment of Paget's disease, hypercalcaemia (both tumour-induced and non-tumour induced), bone metastases and osteoporosis, or (ii) the regulation of sperm maturation, for example for contraception or fertility treatment, or ( iii) the regulation of bone metabolism via interaction with cyclosporin A, and related compounds that bind to immunophilin proteins. Knowledge of the hDP 1 gene sequence disclosed herein and its expression may be useful for understanding the spermatogenic process. In testis, germs cells are surrounded by Sertoli cells and both cell types interact physically and chemically via a unique and impressive array of structural features. Since hDPl appears to be one of the factors from round spermatids that modulate the Sertoli cell function, it provides a basis for the development of novel contraceptives, or agents that affect reproduction in other ways, such as infertility, by affecting sperm maturation.
The recognition of the role of cyclophilins in bisphosphonate binding and as targets for osteoactive bisphosphonate provides a new therapeutic use for cyclosporins (e.g. cyclosporin A), and related compounds as modulators of bone metabolism, in the treatment of bone metabolism disorders and as adjuncts to bisphosphonate treatments.
EXAMPLES
The invention is described in further detail by way of specific non-limiting examples. These examples are purely exemplary and are not intended to be limiting in any way. The skilled artisan will appreciate that these examples may be varied and used as guidance, in light of the art, to make and use the invention as claimed. The examples may refer to drawings and descriptions offered previously.
Example 1 : Isolation of DPI . a bisphosphonate-binding discoidin from
Dictyostelium Materials
The di- or tri-sodium salts of AHPrBP (Pamidronate) and AHBuBP (Alendronate) were from GENTILI S.P.A., Pisa, Italy; 3PHEBP (Risedronate) is obtained from Procter and Gamble Pharmaceuticals, Cincinnati, OH, USA, P- dCTP is purchased from Amersham. Unless stated otherwise, all other chemicals were from Sigma Chemical Co., Poole, UK. Construction of an immobilized bisphosphonate affinity column
Two bisphosphonate-affinity columns were prepared. One prepared using AHBuBP and the other using AHPrBP, were made using the same column matrix and the same coupling procedure. AHBuBP and AHPrBP each have a primary amino group available for the coupling to the matrix. Spectra/Gel MAS Beads (Spectrum, Los Angeles, CA, USA) were used as the matrix to which an aldehyde group is attached through a five atom hydrophilic spacer arm. The bisphosphonates were coupled to the aldehyde via their amino groups in the presence of sodium cyanoborohydride (NaCNBH_ι) as described in detail below.
A 1 M stock solution of sodium cyanoborohydride (NaCNBH4) is prepared and left at room temperature for 1 to 3 hours until the bubbling subsided before use. The gel is washed on a Buchner funnel with three volumes of coupling buffer (0.1 M phosphate buffer, pH 7.8). AHBuBP or AHPrBP is dissolved to a high concentration (400 mM for AHBuBP and 150 mM for AHPrBP) in the coupling buffer and added to the same volume of the gel. Sodium cyanoborohydride from the stock solution is added to a final concentration of 0.1 M and the suspension is agitated at room temperature overnight. The coupled gel is agitated with a small amount of sodium borohydride (NaBH4) for 2 hours at room temperature in order to reduce any unreacted aldehyde groups. The gel is then washed with 10 volumes of 0.5 M NaCl, followed by column equilibration buffer (30 mM MOPS, 0.1 mM EDTA and 0.5 mM DTT, pH 7.4). 15 ml of the gel were packed into a glass column and stored in the equilibration buffer containing 0.02% (w/v) sodium azide at 4°C.
After each use, the column is regenerated by washing with a minimum of 10 volumes of 0.5 M NaCl, 0.1 Tris-HCl, pH 8.5, followed by a minimum of 10 volumes of 0.5 M NaCl, 0.1 M Na acetate. pH 4.5. The column is cleared of any irreversibly bound protein by washing with 5 M urea. The column is stored at 4°C in the equilibration buffer containing 0.02% (w/v) sodium azide. Growth and harvest of Dictyostelium discoideum cells
Dictyostelium discoideum amoebae strain Ax-2, were grown as shaken cultures in HL5-glucose medium at 22°C (Watts and Ashworth, Biochem. J. 119: 171-174, 1970). Cells from two 700 ml cultures were harvested at a density of 5-10 x 10 cells/ml and disrupted by sonication in 20 ml 30 mM MOPS, 0.1 mM EDTA, 0.5 mM DTT, pH7.4 at 4°C. Cell debris is removed by ultracentrifugation at 184,000g at 4°C for 2 hours. Purification of bisphQsphQna.e-binding protein DPI
The supernatant, containing cytosolic proteins, is then loaded onto a pre- equilibrated affinity column at a rate of 6- 12 ml/hour. The column is washed first with equilibration buffer at a rate of 20 ml/hour until protein, monitored by absorbance at 280 nm, no longer appeared in the eluate. The ionic strength of the eluant is then increased by including 0.1 VI KC 1 in the equilibration buffer in order to wash away further non-specifically bound proteins. Finally, specific elution of proteins bound to immobilized bisphosphonate is achieved by eluting with buffer containing 5 mM bisphosphonate. In order to examine whether other compounds were also able to elute proteins which wer still bound to the column following a 0.1M KC1 wash, 5 mM inorganic phosphate. 5 mM pyrophosphate, 5 mM PLP or 300 mM D-galactose were added to the elution buffer. All procedures were carried out at 4°C.
Fractions eluted from the columns were lyophilised, resuspended in 2-3 ml distilled water, and dialysed against three changes of 50 mM ammonium bicarbonate. The dialysed samples were further concentrated approximately 50-fold by ultrafiltation in Microsep microconcentrators (Filtron, Northborough MA, USA) with a lOkDa cut-off filter. The concentrated samples were examined by SDS- PAGE analysis on 12% polyacrylamide gels with standard molecular weight markers (200kD, 97kD, 68kD, 43kD, 28kD and 18kD) (Gibco, Paisley, Scotland) and stained with Coomassie brilliant blue R-250 (Laemmli, Nature 227, 680-685, 1970).
A typical elution profile, following loading of cell-free extract onto the immobilized bisphosphonate affinity columns is shown in Fig. 1. A cell extract prepared by sonication of Dictyostelium cells in equilibration buffer at 4°C is loaded onto an AHBuBP-affinity column at a rate of 10 ml/hour. Fractions (each represented by a chromatography monitor peak) of non-specifically-bound proteins were eluted with equilibration buffer (peak 1), equilibration buffer after overnight incubation (peak 2) and equilibration buffer containing 0.1 M KC1 (peak 3), respectively (as described above). DPI is eluted when 5 mM AHBuBP dissolved in equilibration buffer is used directly (peak 4), after overnight incubation (peak 5) and after further incubation for a few hours (peak 6).
Each peak from the column is lyophilised, desalted by dialysis and concentrated in a Microsep microconcentrator. An aliquot from each fraction is electrophoresed on a 12% polyacrylamide gel in denaturing conditions (Laemmli, Nature 227, 680-685, 1970) and the gel is stained with Coomassie Blue. Elution with equilibration buffer, equilibration buffer after overnight incubation and with buffer containing 0.1 M KC1 eluted proteins which had no affinity for the immobilized ligand (Fig. 2, Lanes 2, 3 and 4 respectively). Subsequent washing with 5 mM bisphosphonate (either AHBuBP or AHPrBP) resulted in the elution of DPI alone, of about 28kDa (Fig. 2, lane 5). Lane 1 contains molecular weight standards.
DPI binds to both the AHBuBP and the AHPrBP affinity columns very strongly, since repeated overnight incubation of the columns with 5 mM bisphosphonate followed by elution the following day continued to elute DPI over a period of 10 days.
DPI is the only protein eluted from the affinity-column by bisphosphonates since it is the only protein that could be detected in a 3, 000-fold concentrate of the bisphosphonate eluate by SDS-PAGE analysis and staining with Coomassie blue. The high purity is later confirmed by matching all of the fragments obtained by peptide sequencing of protein in the bisphosphonate eluate with the predicted amino acid sequence of the DPI cDNA. Not all bisphosphonates elute DPI from the columns. Whereas AHBuBP, AHPrBP and 3PHEBP at a concentration of 5 mM appear to be equally effective, 5 mM HEBP, a bisphosphonate which is 12-fold less potent at inhibiting Dictyostelium growth than AHBuBP, is not able to elute DPI at all. This demonstrates that DPI appears to bind specifically to second and third generation, potent anti-resoφtive bisphosphonates.
Neither pyrophosphate, inorganic phosphate nor galactose is effective at eluting DPI. However, pyridoxal 5'-phosphate (PLP), at a concentration of 5 mM, eluted DPI very effectively, although some DPI is still retained on the column which could be eluted with 5 mM AHBuBP. Additionally, a number of other proteins were also eluted by PLP. Enzymatic cleavage and peptide sequencing
The purified DPI protein is digested with endoprotease Lys-C or endoprotease Asp-N in the presence of urea as denaturing agent to produce a collection of peptides for sequencing. These peptides were separated by HPLC and sequenced by Edman degradation (Edman, Ada Chem. Scand. 4: 283-293, 1950).
Nine peptide sequences were determined by Edman degradation (Table I). A search in data banks for similar sequences showed that peptides 2, 3, 4, 5, 6, 8 and 9 had considerable identity with peptides from discoidin I, a lectin isolated from Dictyostelium discoideum. By contrast, peptides 1 , and 7 had little homology with any peptides in discoidin I. This implies that DPI is related to discoidin I but is not discoidin I itself.
Table 1. Peptide Sequences from DPI
Figure imgf000030_0001
*FIG. 3 (peptide 9 is a deduced sequence since peptides isolated after Asp-N digestion of DPI showed that peptides 2 and 5 are contiguous
Endoproteinase Lys-C cleaves at the carboxyl side of a lysine peptide bond and Asp-N cleaves at the amino side of an Asparagine peptide bond. Lower case letters indicate a tentative sequence assignment. "X" indicates that no definite assignment could be made at that position. Bold- typed regions are those used for general" > oligonucleondes XJ-1 (peptide 5) and XJ-2 (peptide 6) in the chart. Example 2: Cloning and sequencing of DPI
Isolation of poly(A+) mRNA and synthesis of first strand cDNA
7 Messenger RNA is isolated from approximately 10 Dictyostelium amoebae using a Quick-Micro mRNA purification kit (Pharmacia, Piscataway NJ, USA). For the synthesis of first strand cDNA, 2.5 ug mRNA were precipitated with potassium acetate and ethanol, collected by centrifugation at 14,000 g for 5 minutes at 4°C, and resuspended in 50 ul DEPC-treated water. 12.5 ng oligo(dT) \ 7-adapter primer,
GACTCGAGTCGACATCGA(dT)i7 (SEQ. ID NO. 23), were added and the solution heated to 70°C for 10 minutes followed by quick chilling on ice for 3 minutes. One-fifth volume of 5 x reaction buffer (250 mM Tris-HCl, pH8.3, 0.375 mM KC1, 15 mM MgC ), 80 units RNasin (Promega, Madison WI, USA), 1.25 mM dNTP and 0.2 mM DTT were added and the mixture is prewarmed at 37°C for
2 minutes, followed by addition of 600 units Superscript RNase H" reverse transcriptase (Gibco, Paisley, UK) and incubation at 37°C for 1 hour. The cDNA is diluted and used for PCR.
Amplification of cDNA bv 3 '-RACE
Sequences from peptides 5 and 6 described above were used to design the degenerate oligonucleotide primers XJ-1 and XJ-2. Alignment of the peptides from DPI with the sequence of discoidin I suggested that peptide 5 would be closer than peptide 6 to the N-terminal end of DPI . Hence XJ-1 is expected to be 5' of XJ-2 in the mRNA sequence.
The cDNA coding for DPI is amplified by 3'-RACE (Frohman et al. Proc. Natl. Acad. Sci. USA 85, 8998-9002, 1988) using degenerate oligonucleotide primers:
(a) XJ-l (SEQ. ID NO. 5):
[TA(T/C)ACI(T/C)TIGA(T/C)AA(T/C)GTIAA(T/C)TGGGT]
(b) XJ-2 (SEQ. ID NO. 6):
[(C/A)G(T/A/G)(T/A)(C/G)TAT(T/C/A)GCIATICA(T/C)CC] in sequential PCRs. In the first round the reaction included 50 mM KC1, 10 mM Tris-Cl, pH9.0, 0.1% Triton X-100, 1.5 mM MgC_2, 0.2 mM dNTP, diluted cDNA, 0.2 M XJ-1 and XJ-ADA [GACTCGAGTCGACATCGA] (SEQ. ID NO. 24) primers, and 2.5 units Taq DNA polymerase (Promega, Madison, WI, USA). Each cycle is initiated by denaturation at 94°C for 1.5 minutes, and continued with annealing at 46°C for 1.5 minutes and extension at 72°C for 2 minutes. After 40 cycles, samples were incubated for a further 15 minutes at 72°C and then stored at 4°C. The resulting PCR products were diluted 1 : 1 ,000 with distilled water and used as the template for the second-round PCR reaction with the XJ-2 and XJ-ADA primers. Reaction conditions were identical to those used in the first round. PCR products were size fractionated on a 1% agarose gel, visualized and isolated (Sambrook J., Fritsch E. F., and Maniatis T. (eds): Molecular Cloning: A laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989). The PCR product is cloned into the pCRII vector (Invitrogen, San Diego CA, USA) according to the manufacturer's instructions. Those recombinant plasmids that contained inserts were sequenced (Sanger et al., Proc. Natl. Acad. Sci. USA 74: 5463-5467, 1977).
The primer that is used for reverse transcription of mRNA consisted of an oligo(dT)i7 region to anneal to the Poly(A) tail of the mRNA and an adapter region of 18 nucleotides (Froh an et al., Proc. Natl. Acad. Sci. USA 85, 8998-9002, 1988). Incoφoration of the adapter sequence into the first-strand cDNA allowed use of the primer XJ-ADA for the amplification of the 3' region of the DPI cDNA using PCR.
Initially, PCR is performed using XJ-1 and XJ-ADA as 5' and 3' primers respectively to amplify the first-strand cDNA. Two DNA fragments, with approximate sizes of 600bp and 500bp were detected on subsequent agarose gel electrophoresis. Sequencing of these two fragments showed that neither of them encoded any of the peptide sequences obtained from DPI. When primers XJ-2 and XJ-ADA were used to amplify the first-strand cDNA, one band of about 400bp is observed on an agarose gel. Sequencing of this band indicated that this cDNA is a fragment from the 3' end of discoidin IA cDNA. This is not suφrising since primer XJ-2 is derived from a peptide sequence for which there is a homologous sequence in discoidin I. However, when, in a second round of the PCR, the products of the reaction using primers XJ-1 and XJ-ADA were further amplified using primers XJ-2 and XJ-ADA, a DNA fragment of 450bp is obtained. Cloning and sequencing of this fragment showed that its corresponding amino acid sequence contained the sequences of peptides 3, 6, 7 and 8 of DPI (Fig. 3 (SEQ. ID NO. 2) and Fig. 4).
This DNA fragment, designated XJ-450, is used as a probe for the subsequent cDNA library screening and northern and Southern blot analyses. Preparation and screening of cDNA libraries
Poly(A+) RNA isolated from the wild-type Ax-2 strain of Dictyostelium is used as the template for cDNA synthesis for construction of libraries in lambda ZAP bacteriophage (Stratagene, San Diego, CA, USA). To isolate full-length clones, the libraries were screened with the PCR amplified cDNA fragment (XJ-450) of DPI isolated from the pCRII vector. Hybridisations were performed at 42°C for 24 to 48 h in 5 x SSPE - 1 x Denhardt's solution - 100 ug salmon testis DNA ml- 50% formamide. Membranes were washed in 0.25 x SSPE, 0.5% SDS at 55°C. Rescreening of positive plaques yielded full-length cDNAs for DPI from the wild- type Ax-2 library.
When 1.2 x 10 plaques from the cDNA library of the wild-type Ax2 strain of P. discoideum were screened with XJ-450, nine positive clones were obtained. Analysis of the sequence of the cDNAs (Fig. 3 ) (SEQ. ID NO. 1) revealed that the inserts comprise 968 bp containing an open reading frame of 771 bp which codes for 257 amino acids. The untranslated regions on each side of the coding region are rich in A+T and the 3' untranslated region also contains multiple consensus polyadenylation sequences (AATAAA) (Fig. 3) (SEQ. ID NO. 1). Northern and Southern analysis of DPI
For northern analysis, 0.5 ug of poly(A ) RNA isolated from Dictyostelium amoebae is size fractionated on a denaturing formaldehyde agarose gel, transferred onto Hybond-N membrane (Amersham, Bucks, UK), hybridized to the appropriate cDNA fragment and washed as described above.. To control for variations in loading and transfer of RNA, the northern blots were probed with a fragment of Dd-tcpl, a Dictyostelium homologue of the tcpl family of molecular chaperones.
Genomic DNA is isolated from the wild-type Ax-2 strain by using a modification of the methods described by Nellen et al. (Molecular Biology in Dictyostelium: Tools and Applications. In: Spudich JA ed: Methods in Cell Biology Vol. 28: 67-100. Academic Press, 1987) and Amau (Laboratory Products Technology I September 1993:20.). 2 x 10 Dictyostelium amoebae were harvested, washed with distilled water at 4°C and lysed in ice-cold HMN (0.01 M Mg acetate, 0.01 M NaCl, 0.03 M HEPES and 10% sucrose, pH 7.5) containing 0.5% NP- 40.Nuclei were separated from the cell ly sate by centrifugation at 10,000g for 10 min. at 4°C, resuspended in 1 ml extraction buffer (0.4 M KC1, 0.05 M EDTA, 1% Triton-XlOO) and treated with 3 μl RNase I (7.5 u/μl) (Promega, Madison, WI, USA) at 70°C for 15 min. Insoluble material is removed by centrifugation at lOOOOg for 2 min. at room temperature and the DNA in the supernatant is purified (Magic Miniprep DNA Purification System, Promega, Madison, WI, USA). Purified DNA is quantified by absorbance at OD βO and 10 μg samples were digested overnight with the required restriction enzymes. The DNA is then size fractionated on a 1% agarose gel. transferred to Hybond-N membrane and hybridized and washed as described aho . e Λ Southern blot of genomic DNA prepared by Clontech (Palo Alto. C.V I S \ ι is hybridized with the XJ-450 DPI DNA probe using low stringency for bπdι/jtιon (37°C) and washing (3 x SSPE, 0.5% SDS). Northern analysis of Poly(A+) RNA isolated from amoebae that had been grown either axenically to a density of about 5 x 10 cells/ml or to a lower density of 1 x 10 cells/ml, or with a bacterial substrate is shown in Fig. 5a. The probe hybridized to a mRNA transcript of about 1.2 kb from axenic Ax-2 amoebae harvested at a density of about 5 x 10 cells/ml. The expression of DPI mRNA in wild-type cells grown with a bacterial food source appeared to be very low, although the presence of the mRNA is confirmed by repeating the northern blot with a larger amount of poly(A ) RNA. DPI mRNA is also expressed in axenic wild-type amoebae harvested at a lower density of 1 x 10 cells/ml (Fig. 5b). For comparison, a PCR fragment of discoidin IA is labeled with 3? P-dCTP and hybridized to the same northern blots. Like DPI, discoidin I is also expressed poorly in bacterially- grown wild-type Ax-2 amoebae. (Fig. 5a).
Southern analysis of genomic DNA revealed that only a single band from each restriction enzyme digest hybridized to the XJ-450 probe for DPI (Fig. 6b). On the same blot, a discoidin IA probe hybridized to at least three bands in each DNA digest (Fig. 6c).
This is consistent with the claim that discoidin I is encoded by a family of genes (Rowekamp et al., Cell 20: 495-505, 1980; Poole et al., J. Mol. Biol. 153:273- 289, 1981) but suggests that there is only a single gene for DPI (discoidin II). Structural features of DPI
DNA and protein sequences were compared with updated releases from GenBank, Swissprot and Entrez, and analyzed by using the computer software MacVector (Intemational Biotechnologies, Inc.), GCG (The Wisconsin Genetic Computer Group (Devereux et al., Nucelic Acids Res. 12: 387-395, 1984) CLUSTALV (Higgins et al., Comp. Appl. Biosci. 8: 189-191, 1992), and MacProt (Markiewicz, BioTechniques 10: 756-757+760+762-763, 1991) 1991 ; Luttke, Comp. Method. Prog. Biomed. 31 :105-112, 1990 ).
The predicted amino acid sequence suggests that DPI is a neutral protein with an estimated pi of 6.77. Its calculated molecular mass is 28,573Da which is slightly larger than that of discoidin IA (28,258Da) or discoidin IC (28,391 Da) (Poole et al., J. Mol. Biol. 153: 273-289. 1981 ). The amino acid sequence predicted from the full-length cDNA includes all nine peptide sequences obtained by enzymatic digestion of DPI. The amino acid and cDNA sequences of DPI were later found to be identical to those of discoidin II. The homology between discoidin II and I is significant, with 49% identify in their predicted amino acid sequences (Fig. 4) and 61% identify in the nucleotide sequences of their open reading frames. In addition to regions of identity with discoidin I, DPI (discoidin II) shares partial homologies with several other proteins, including human coagulation factors V (Jenny et al.. Proc. Natl. Acad. Sci. USA 84:4846-4850, 1987) and VIII (Wood et al., Nature 312:330-337, 1984.), milk fat globule protein (Larocca et al, Cancer Res. 51 : 4994-4998, 1991) and open reading frame 7 linked to the Rhodopseudomonas blastica atp operon (Tybulewicz et al., J. Mol. Biol. 179: 185-214, 1984) (Fig. 7). Hydropathicity analysis by the Kyte-Doolittle (J. Mol. Biol. 157, 105-132, 1982) or the Hopp-Woods (Proc. Natl. Acad. Sci. USA 78: 3824-3828. 1981) algorithms showed that the first 20 amino acids at the amino terminus of DP 1 are hydrophobic (Fig. 8) and a signal peptide cleavage site, predicted by using the method of von Heijne (Nucleic Acids Res. 14: 4683-4690, 1986), appears to be present between two serines at residues 20 and 21 (Fig. 9). Secondary structure predictions by using the methods of Chou-Fasman (Adv. Enzymol. Relat. Areas Mol. Biol. 47, 45-148, 1978) and Garnier-Robson (J. Mol. Biol. 120, 97-120, 1978) suggest that DPI is mainly composed of b-pleated sheets (Fig. 8). An Arg-Gly-Asp (RGD) motif occurs at residues 81-83 and is also present in discoidin I. The amino acids flanking the RGD motif are also highly conserved between discoidin I. and DPI . Owing to its hydrophilicity, the RGD motif probably lies on the protein surface (Fig. 10). The motif may have the same role in DPI and in discoidin I.
Example 3: Recombinant Expression of DPI
In order to conveniently produce large quantities of DPI, the DNA encoding it (see Fig. 3 (SEQ. ID NO. 1)) is transferred into an appropriate expression vector and introduced into mammalian cells by standard genetic engineering techniques well-known to those skilled in the art.
Suitable vectors include pCD (Okayama et al.. Mol. Cell. Biol. 2: 161-170 (1982) and pJL3/4 (Gough elaL EMBO J.. 4: 645-653 (1985).
The resultant expression vectors are then transformed into appropriate host cells, which thereby express the DPI protein.
Those skilled in the art could also manipulate the sequence shown in Fig. 3 (SEQ. ID NO. 2) to introduce mammalian regulatory sequences and/or expression signals. Alternatively, it could be manipulated to introduce bacterial expression signals, and the resultant recombinant DNA could then be introduced into bacterial host cells (for example Escherichia coli) for expression therein.
Similar manipulations could be performed for the construction of insect vectors and yeast vectors. Expression in mammalian cells The cDNA inserts of the pBluescript SK(-) subclones described in Example 2 (above) are subcloned into the mammalian cell expression vector pMT2 using appropriate restriction enzymes followed by ligation. The structure of the resultant expression vector is confirmed by restriction mapping on agarose gels.
Plasmid DNA from the pMT2 subclone is then transfected into monkey COS-1 cells by the DEAE-dextran procedure (Sompayrac and Danna (1981) Proc. Natl. Acad. Sci. U.S.A., 78: 7575-7578; Luthman and Magnusson (1983), Nucl. Acids Res. 1 1 : 1295-1308). Serum-free 24 hour conditioned medium is then collected from the cells starting 40 to 70 hours post-transfection.
Example 4: Isolation of hDPl, a human bisphosphonate binding protein
The tri-sodium salts of AHBuBP (Alendronate) is from GENTILI Materials S.P.A., Pisa, Italy. Human multiple tissue blots and a zoo-blot were purchased from Clontech. Human poly(A ) RNA from testis, HeLa cells and HL60 cells is purchased from Clontech. A cDNA library of human testis is custom-made in lambda ZAP bacteriophage by Stratagene (La Jolla, CA, USA) using poly(A ) RNA purchased from Clontech (Palo Alto, CA, USA). The human leukocyte concentrate is obtained from Sheffield Blood Transfusion Services (Longley Lane, Sheffield, UK). All other chemicals were purchased from Sigma Chemical Co., Poole, UK. Construction of a bisphosphonate affinity column
A bisphosphonate-affinity column is constructed as described previously in Example 1. Preparation of human leukocytes
3-4 x 10 human white blood cells were washed twice with phosphate buffered saline (PBS; pH 7.4) and disrupted by sonication in 40 ml of column equilibration buffer. Insoluble material is removed by ultracentrifugation at 184,000 x g at 4°C for 2.0 hours to yield a supernatant. Purification of hDPl
The supernatant containing the cytosolic proteins is applied to the pre- equilibrated AHBuBP-affinity column at a rate of 6-12 ml/hour. The column is first eluted with equilibration buffer at a rate of 20 ml/hour until protein no longer appeared in the eluate. The ionic strength of the eluant is then increased by including 0.1 M KC1 to elute further non-specifically bound proteins until protein no longer appeared in the eluate. Finally, buffer containing 5 mM AHBuBP specifically eluted proteins bound to the immobilized AHBuBP on the column. Fractions eluted from the column were lyophilised and resuspended in 2-3 ml distilled water, followed by dialysis against 50 mM ammonium bicarbonate. The dialysed samples were further concentrated to 50-100 μl by ultrafiltration using Microsep microconcentrators (Filtron, Northborough MA, USA) having a 1 OkDa cut-off filter. Proteins in each concentrate were separated by SDS- PAGE (Laemmli, Nature 227, 680-685, 1970) and stained with Coomassie brilliant blue R-250. The 28kDa protein isolated from the AHBuBP-affinity chromatography is hDPl (SEQ. ID NO. 13). Protein sequencing
N-terminal amino acid sequencing of hDPl (SEQ. ID NO. 13) is performed on a concentrated solution of the AHBuBP eluate and also on the band of hDP 1 (SEQ. ID NO. 13) transferred onto PVDF membrane by protein blotting following SDS-PAGE. Sequence is determined by Edman degradation (Edman, Ada Chem. Scand. 4: 283-293, 1950) both directly from sample solution and from the (PVDF) membrane containing blotted protein. Internal peptides were generated by endoprotease Lys-C digestion, purified by HPLC, and sequenced.
Both approaches yielded the same N-terminal sequence: MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQ (SEQ. ID NO. 25) [peptide 10]
Cleavage of hDPl by endoprotease Lys-C produced an internal peptide having the sequence:
YXIGEPTVPsTLAEeFtYNpF (SEQ. ID NO. 26) [peptide 11] - where X indicates that no definite assignment is made for that residue, and a lower case letter indicates a tentative assignment for that residue.
The N-terminal amino acid sequence (peptide 10) (SEQ. ID NO. 25) is highly homologous to the N-terminal sequence of a rat spermatid (SEQ. ID NO. 8) 29,000 MW protein (Figure 11). No protein appeared to contain sequences having significant homology with peptide 1 1 (SEQ. ID NO. 26).
Example 5: Cloning and sequencing of hDPl Synthesis of degenerate oligonucleotides and RT-PCR
Since the N-terminal sequence of hDPl showed homology to a rat spermatid protein, poly(A ) RNA of human testis is used to prepare total cDNA that contained the hDPl cDNA.
Degenerate oligonucleotides containing 17-22 nucleotides were synthesized according to the peptide sequences of hDPl for the subsequent PCR to amplify the corresponding hDPl cDNA (Table 2). Synthesis of the first strand cDNA from human testis poly(A+) mRNA is carried out using a 3' RACE kit (Gibco, Paisley, UK) following the manufacturer's instructions. AP-oligo(dT) is used as the primer since this introduced the sequence of the universal amplification primer (UAP) into the cDNA sequence to facilitate the subsequent 3' RACE procedure (Frohman et al., Proc. Natl. Acad. Sci. USA 85, 8998-9002, 1988).
Table 2. Nucleotide sequences of primers used for the PCR reaction
Figure imgf000038_0001
"I" indicates inosine "N" indicateseither C, A, G or T "Y" indicates either C or T "R" indicates either G or A
The strategy for the amplification of hDPl (SEQ. ID NO. 15) cDNA is shown in Fig. 12. Initially, the PCR is performed in a volume of 50 μl in the presence of AmpliTaq reaction buffer. 1.5 mM MgCl2, 0.2 mM dNTP, diluted cDNA template, 2.5 units of AmpliTaq DNA polymerase (Perkin Elmer) and 0.2 μM of each primer DJ-1 (SEQ. ID NO. 27) and UAP (SEQ. ID NO. 32) (Table 2). The mixture is subjected to 40 cycles of amplification in a Perkin Elmer 9600 PCR machine. Each cycle involved four successive steps including denaturation at 94°C for 15 sec, annealing at 48°C and 60°C for 15 sec, and extension at 72°C for 15 sec. At the end of cycling the sample is further incubated for 10 min. at 72°C and then kept at 4°C. Second-round PCR is carried out by using the diluted reactions of the first-round as template and different combinations of internal primers, i.e. DJ-1 (SEQ. ID NO. 27) plus DJ-10 (SEQ. ID NO. 30), DJ-5 (SEQ. ID NO. 28) plus DJ- 10, DJ-9 (SEQ. ID NO. 29) plus UAP (SEQ. ID NO. 32) (Table 2). 10 μl of each reaction is run into a 1% low melting point agarose gel (Gibco, Paisley, UK). The desired fragment is purified using the Wizard PCR DNA purification system (Promega, Madison, WI, USA) and the y teld is quantified.
Use of primers DJ-1, which encoded part of peptide 1, and UAP (Table 2) in the polymerase chain reaction gave DN \ products that formed a faint smear on an agarose gel. This DNA is reused as the template for further amplifications with internal primers. In the second round ot the PCR, DNA fragments were amplified by two combinations of primers: (1 ) h.\ a fragment of 600bp from DJ-1 and DJ-10; (2) hB : a fragment of 400bp from I)J-^ and LAP. Fragments hA and hB, which were expected to be contiguous with ea h other (Fig. 12), added up to a size of l,000bp. It is found that when hA is used a.s the template for PCR amplification with primers DJ-5 and DJ-10, a fragment < hC ) of 570 bp is obtained. All these sequences were found within the full length sequence (SEQ. ID NO. 9). All these three fragments were from specific amplifications since leaving out either primer of a pair, or the template, eliminated the corresponding product. hC is used as the probe for screening the cDNA library and for the subsequent northern and Southern blotting. cDNA library screening and sequence of the full-length of hDPl cDNA
25 ng of a hDPl PCR fragment is labeled with 32P-dCTP using random primers (Feinberg and Vogelstein, Anal. Biochem. 132: 6-13, 1983) (Prime-It II kit, Stratagene) and is used as a probe to screen a human testis cDNA library in lambda ZAP bacteriophage (Short et al., Nucleic Acid Res. 16: 7853-7860, 1988). After three rounds of screening, bacteriophage plaques specifically hybridized to the hDPl probe were selected and the inserts were subcloned into pBluescript SK(-) plasmids by in vivo excision (Short et al., Nucleic Acid Res. 16: 7853-7860, 1988). The recombinant plasmids were subjected to restriction digestion and amplification using T3 and T7 primers which anneal to sequences on each side of the inserts. Recombinant plasmids, which contained a 1.2 kb hDPl cDNA insert, were identified by Southern blotting and sequenced using a Taq DyeDeoxy Terminator Cycle Sequencing kit (ABI) by the method of Sanger et al. (Proc. Natl. Acad. Sci. USA 74: 5463-5467, 1977).
When hC is used to screen a testis cDNA library constructed in lambda ZAP bacteriophage, three clones were isolated. All appeared to have the same nucleotide sequence. The largest open reading frame consisted of 936 bp and contained six ATG codons. The N-terminal sequence of hDPl corresponded with the deduced amino acid sequence from the cDNA only if the second ATG codon of the cDNA encoded the N-terminal methionine of hDPl . The cDNA nucleotide sequence then indicated that hDPl consisted of 260 amino acids and had a calculated molecular mass of 28,872 and an estimated isoelectric point of 7.0 (Fig. 13). The predicted molecular weight is in good agreement with that of the protein observed on a polyacrylamide gel.
If translation started from the first ATG codon, an additional polypeptide of 48 amino acids would be produced at the N-terminus of hDPl (Fig. 13) (SEQ. ID NO. 9) to give a protein of 34 kDa. This putative peptide is designated hDPl-rP. The full-length cDNA contains a 5' untranslated region of 157 nucleotides and a 3'- untranslated region of 265 nucleotides. However, neither a poly(A) tail nor a classical AATAAA polyadenylation signal is apparent in the cDNA towards the ' end, probably owing to restriction digestion beyond the polyadenylation site in construction of the cDNA library. Human tissue-type distribution and inter-species conservation of the hDPl gene A 3? P-dCTP labeled hDPl cDNA fragment is hybridized to human multiple tissue northern blots (Clontech) and to a northern blot containing mRNA isolated from cultured human cells THP1 (a monocytic cell line), MG63 (an osteoblast-like cell line) and primary bone cells. In addition, the hDPl probe is also hybridized to a northern blot which contained mRNA from Dictyostelium discoideum and to a zoo- blot which contained genomic DNA from different species (Clontech, Palo Alto, CA. USA). ,
Hybridization is performed at 42°C in the presence of 50% formamide (Sambrook et al. , Molecular Cloning: A laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989). Washing at high stringency is carried out at 42°C using 3 x 100 ml 0.25 x SSPE plus 0.5 % SDS for 3 x 15 min. A
2kb DNA fragment of human b-actin (Clontech, Palo Alto, CA, USA) is also labeled with 32 P-dCTP and is used as a positive control for hybridization. The hybridization signal is quantified with a BioRad Phosphoimager.
DNA is also prepared from a lambda gtl 1 cDNA library of normal human osteoclast-like multinucleated cells using the Lambda DNA Purification Kit (Stratagene, Cambridge, UK). After being digested by restriction enzymes, the bacteriophage DNA is separated on a 1% agarose gel, transferred to a Hybond-N membrane and hybridized to the radioiabeled hDPl probe.
Expression of hDPl in several human tissues is studied by northern analysis with the PCR fragment hC of hDPl cDNA as a probe. The hybridized bands, corresponding to 1.2 kb mRNA transcripts, suggest that the cDNA clone obtained for hDPl is almost full-length, even though it did not contain the region having the polyadenylation signal. Moreover, the hDPl cDNA fragment hybridized to mRNA from all the human tissues and cell lines studied (Fig. 14). These comprised heart, brain, placenta, liver, lung, skeletal muscle, kidney, pancreas, spleen, thymus, prostate, testis, ovary, small intestine, colon, peripheral blood leukocytes and human cell lines HeLa, HL60, THP-1, MG63 and primary bone cells. Among all these tissues and cells, testis, skeletal muscle and heart had the highest levels of hDPl mRNA expression. On the other hand, placenta, pancreas, spleen and peripheral blood leukocytes appear to be the tissues that contain least hDPl mRNA. In addition, a hybridized band is also observed on the Southern blot of DNA prepared from a cDNA library of human osteoclast-like multinucleated cells, suggesting the presence of the hDPl mRNA transcripts in osteoclasts.
Southern blots of EcoRI digests of genomic DNA from various organisms showed that hybridization at high stringency with the hC fragment produced one or more hybridized bands in all spe^es studied except yeast (Fig. 15). In addition, a hybridized band corresponding to a mRNA transcript with a size of 1.8 kb is also detected in a northern blot containing mRNA isolated from Dictyostelium amoebae growing under various conditions (Fig. 16). Computer assisted analyses on the DNA and protein sequences of hDPl
DNA and protein sequences were compared with the updated releases from GenBank, Swissprot and Entrez, and analyzed by using the computer software MacVector (Intemational Biotechnologies, Inc.), GCG (The Wisconsin Genetic Computer Group) (Devereux et al., Nucleic Acids Res. 12: 387-395, 1984), CLUSTALV (Higgins et al., Comp. Appl. Biosci. 8: 189-191, 1992) and FRODO (Jones, Method Enzymol. 115: 157-171, 1985).
The N-terminal amino acid sequence of hDPl shared striking homology with a rat 29 kDa round spermatid protein in which the N-terminal 32 amino acids had been determined by peptide sequencing (Fig. 11) (SEQ. ID NO. 8) (Onoda and Djakiew, Mol. Cell. Endocrinol. 93: 53-61 , 1993). A search through data banks with the complete deduced amino acid sequence of hDPl showed that there is no exact match to any known sequence. However, there is significant homology with the amino acid sequences predicted from the DNA sequences of two genes from prokaryotic organisms: open reading frame 3 linked to the Rhodopseudomonas blastica atp operon (ATPase-ORF3) (33% identity over the entire sequence) (Tybulewicz et al, J. Mol. Biol. 179, 185-214, 1984) (Fig. 17(a)) and open reading frame 1 linked to the genes of arginyl tRNA synthetase and ribonuclease H of Buchnera aphidicola (ATS-ORF1) (30% identity over the entire sequence) (Munson et al, Gene 137: 171-178, 1993) (Fig. 17(b)).
Among the proteins with known functions, the aspartate transaminase (AAT) of chicken mitochondria (14% identity and 44% similarity) showed some homology to hDPl (Jaussi et al, J. Biol. Chem. 260:16060-16063, 1985) (Fig. 18). However, AAT is larger than hDPl and the alignment of hDPl with the three-dimensional structure of AAT suggests that most of the conserved amino acid residues are probably in the internal region of the folded protein. It is interesting that residues around the PLP binding site are fairly well conserved although lysine does not exist in the same position in hDPl as in AAT.
A hydropathicity plot using the Kyte-Doolittle (Kyte and Doolittle, J. Mol. Biol. 157, 105-132) or the Hopp-Woods (Hopp and Woods, Proc. Natl. Acad. Sci. USA 78: 3824-3828. 1981) algorithms shows that the C-terminal part of the hDPl protein is quite hydrophilic with a high surface probability (Fig. 19). Use of the methods of Chou-Fasman (Adv. Enzymol Relat. Areas Mol. Biol. 47, 45-148, 1978) ) and Garnier-Robson (Gamier et al.. 1978. J. Mol. Biol. 120, 97-120) to predict 40 secondary structure suggests that this region mainly comprises a-helix. No obvious signal peptide cleavage site is predicted by the method of von Heijne (Nucleic Acids Res. 14: 4683-4690, 1986).
A search in the data bank with the sequence of the putative N-terminal peptide, hDPl-rP, showed no significant homology to any existing sequences. Prediction by the method of von Heijne (Nucleic Acids Res. 14: 4683-4690, 1986) suggested that there is a possible signal peptide cleavage site between alanine and arginine residues at positions -21 and -22 (Fig. 13). The scoring of this prediction, however, is not sufficiently high to allow a firm conclusion.
Several potential protein motifs have been suggested for hDPl by searching the database of protein motifs. These include a site of Ca 2+ -calmodulin dependent kinase II at residues 228-232 (REKTV) near the C-terminus and a part of a zinc- finger at residues 53-61 (LTTHH). Sequence comparison of hDPl and DPI
The protein sequence of hDPl is also compared with that of DPI . No striking homology is observed between the two proteins (with 13% identity and 38% similarity). One region of identity is seen between residues 44-47 (His-Gly-Val- Lys) (SEQ. ID NO. 10) of hDPl and residues 28-31 of DPI (SEQ ID NO. 3). This sequence is hydrophilic and is located at the surface of hDPl protein (Fig. 20). Example 6: Recombinant Expression of hDPl
Large quantities of hDPl can be obtained using procedures similar to those described above in Example 3. Expression in mammalian cells
The cDNA inserts of the pBluescript SK(-) subclones described in Example 5 (above) were subcloned into the mammalian cell expression vector pMT2 using appropriate restriction enzymes followed by ligation. The structure of the resultant expression vector is confirmed by restriction mapping on agarose gels.
Plasmid DNA from the pMT2 subclone is then transfected into monkey COS-1 cells by the DEAE-dextran procedure (Sompayrac and Danna (1981) PNAS, 78: 7575-7578; Luthman and Magnusson (1983), Nucl. Acids Res. 11 : 1295-1308). Serum-free 24 hr conditioned medium is then collected from the cells starting 40 to 70 hours post-transfection.
Example 7: Isolation of DdCyP2. a bisphosphonate binding cyclophilin from
Dictyostelium AHPrBP, AHBuBP and CI2MBP were from GENTILI S.P.A., Pisa, Italy. All other bisphosphonates and the monophosphonate (NE 10788) were from Procter and Gamble Pharmaceuticals, Cincinnati, OHIO, USA. PLP is from Sigma (Poole,
Dorset, UK).
Construction of a bisphosphonate affinity column
Two bisphosphonate-affinity columns were prepared as described previously in example 1. Growth and harvest of Dictyostelium discoideum cells
Dictyostelium discoideum strain Ax-2 is grown in HL-5 glucose medium (2 x 700 ml) on a shaker at 22°C to a density of 2-8 x 10 cells/ml. Cells were harvested and washed once with distilled water and once with equilibration buffer at 4°C. The cell pellet is resuspended in 3-4 volumes of equilibration buffer and disrupted by sonication for 4 x 20 seconds in a MSE Soniprep with an interval of 1 minute on ice between each sonication. The cell lysate is centrifuged at 45,000 φm for 2.0 hours at 4°C in a Beckman 50.2 Ti rotor. Isolation of the bisphosphonate-binding proteins from Dictyostelium
The supernatant, containing the cytosolic proteins is loaded onto the AHBuBP-affinity column, which had been pre-equilibrated with the equilibration buffer, at about 6-12 ml/hour. After loading, the column is washed with the equilibration buffer at a rate of 20 ml/hour to wash off unbound proteins and then washed with 0.1 M KC1 in the equilibration buffer to wash off further non- specifically bound proteins. After such washing, bisphosphonates or PLP at a concentration of 5 mM in equilibration buffer were used to elute any bisphosphonate-binding proteins. The eluates (about 25 ml) containing the proteins were lyophilised overnight and redissolved in 2-3 ml of 10 mM ammonium formate. They were dialysed against four changes of 2500 ml ammonium formate over 48 hours. The dialysed proteins were further concentrated by centrifugation in Microsep microconcentrators with a molecular weight cut-off of 10 kDa (Filtron, Northborough, MA, USA) at 5,000 g for 1-3 hours. All purification and concentration processes were carried out at 4°C.
Proteins in each concentrated fraction were separated by electrophoresis in denaturing conditions on a 12% polyacr lamide gel. Molecular weight markers (Gibco, Paisley, Scotland) were also run on the gel. The gels were stained either with silver or Coomassie blue .
PLP eluted two major proteins * uh molecular weights of 18 kDa and 28 kDa respectively. The 18 kDa protein i which is present in approximately the same amount as the 28 kDa protein) is later named DdCyP2 (SEQ. ID NO. 13). The 28 kDa protein is probably DPI . Peptide sequencing and peptide sequence anal is Following electrophoresis, the gel is equilibrated for 5 minutes in the transfer buffer [10 mM CAPS, 10% (v/v) methanol, pH 1 1.0] prior to blotting to remove electrophoresis buffer salts and detergents. The membrane is cut to the dimensions of the gel, wetted with 100% methanol, and then equilibrated with the transfer buffer for 15-30 minutes. Two filter papers (Whatman 3 MM) and fibre pads were completely saturated in the transfer buffer. The gel, membrane and pads were assembled into the holder cassette. The transfer is performed at constant voltage (50 V) for 1 hour at room temperature with the Bio-Ice cooling unit in the buffer chamber.
Proteins blotted onto ProBlott membrane were detected by Coomassie brilliant blue staining following the instructions supplied with the membrane. The blotted membrane is rinsed with deionised water and saturated with 100% methanol. The membrane is then stained with 0.1% (w/v) Coomassie blue R-250 in 40% (v/v) methanol, 1% (v/v) acetic acid. Protein bands appeared within one minute. The membrane is destained with 50% (v/v) methanol, and rinsed extensively with deionised water and air dried. The bands of interest were excised for protein sequencing.
The N-terminal sequence of DdCyP2 is determined three times directly after blotting onto ProBlott membrane using repeated cycles of Edman degradation. The consensus sequence obtained is: NH2 - GKDPKITNKV FFDE (SEQ ID NO. 33). AHPrBP-affinitv column
An AHPrBP-affinity column is also used for isolation of bisphosphonate- binding proteins by the same purification procedure as that used with the AHBuBP- affinity column. AHBuBP and PLP both eluted only the 28 kDa protein off this column No DdCyP2 could be detected in either the PLP fraction or the AHBuBP fraction on a SDS-PAGE gel. Experiments confirmed that DdCyP2 is never eluted from the AHPrBP column, even with 5 M urea. It is probable that DdCyP2 does not bind to the AHPrBP-affinity column. Homology analysis
DNA and protein sequences were extracted from the updated releases from the GenBank or Swissprot. The sequences were analyzed using the program GCG (The Wisconsin Genetic Computer Group) (Devereux et al., Nucleic Acids Res. 12: 387-395, 1984).
Homology analyses of the amino acid sequence with the protein database suggested DdCyP2 had strong homology to cyclophilins, especially cyclophilin Bs. The molecular weight of cyclophilins is usually 18-22 kDa, which is very similar to that ofDdCyP2. Example 8: Cloning and sequencing of DdCyP2
DdCyP2 (SEQ. ID NO. 12) is cloned and sequenced from a Dictyostelium discoideum strain Ax-2 cDNA library, using similar methods to those described in Example 2 (above). The nucleotide and predicted amino acid sequence are shown in Fig. 21 (SEQ. ID NO.12 and SEQ. ID NO.13).
The amino acid sequence of DdCyP2 is compared with that of the other bisphosphonate-binding protein DPI of Dictyostelium. There is little similarity in their amino acid sequences (17% amino acid identity with four gaps), except that they both contain the RGD motif.
Amino acids 85-87 in DdCyP2 form an Arg-Gly-Asp (RGD) tripeptide motif. This RGD motif is also present in the sequences of all known animal cyclophilin Bs (CyPBs), including human, mouse, rat and chick but not in other members of the cyclophilin family (e.g. CyPA). CyPBs are a type of CyP occurring in the endoplasmic reticulum (ER). The CyPB family has a N-terminal signal sequence, which directs it to the ER, and a conserved C-terminal undecapeptide extension which is not found in other CyPs. A comparison of the amino acid sequences of DdCyP2 and the CyPBs from various species showed about 60% identity with one gap introduced into the CyPBs. On the basis of the "RGD" motif, DdCyP2 is like a CyPB, but it does not have the extra N- and C-terminal sequences typical of CyPBs. DdCyP2 is therefore like a hybrid between CyPA and CyPB; no similar CyPs have been reported in other cells.
The amino acid sequence of DdCyP2 is superimposed on that of hCypA by using the program FRODO (Jones, Method Enzymol. 115: 157-171, 1985). DdCyP2 is probably very similar to human cyclophilin A (hCypA) in its three-dimensional structure. The RGD tripeptide and the seven amino acid insertion in DdCyP2 both appear on the surface of the structure.
Example 9: Recombinant expression of DdCyP2
Large quantities of DdCyP2 can be obtained using procedures similar to those described above in Example 3. Expression in mammalian cells
The cDNA for DdCyP2 described in Example 8 is subcloned into the mammalian cell expression vector pMT2 using appropriate restriction enzymes followed by ligation. The structure of the resultant expression vector is confirmed by restriction mapping on agarose gels.
Plasmid DNA from the pMT2 subclone is then transfected into monkey COS-1 cells by the DEAE-dextra" procedure (Sompayrac and Danna (1981) PNAS. 78: pages 7575-7578; Luthman and Magnusson (1983), Nucl. Acids Res. 11: pages 1295-1308). Serum-free 24 hr conditioned medium is then collected from the cells starting 40 to 70 hours post-transfection.
Example 10: Production of antibodies to bisphosphonate binding proteins Following the procedures set out in the "Guide to Protein Purification", Deutscher (Ed), Methods in Enzvmology. Academic Press, pages 663-679 and "Antibodies, Volume I, a practical approach", Catty (Ed), IRL Press (the contents of which are hereby incoφorated herein by reference), antibodies are produced as follows:.
A. Polyclonal antibodies
20-200 ug of a highly pure preparation of hDPl is emulsified with Freund's complete adjuvant (FCA) using two 3 ml Luer-Lock syringes with 18-gauge needles. The emulsified immunogen mixture is then injected intradermally into a shaved rabbit using a 22-gauge needle. A total volume of 0.5-1.0 ml is injected into 10-12 sites along the upper sides of the rabbit. Boosters are given after 21 days and 40 days.
After 40 days, the rabbit is bled and the blood allowed to clot for 2-4 hours at room temperature. The serum is then decanted and centrifuged at 1 OOOg for 10 min. to remove blood cells.
Immunoglobulin is then partially purified from the serum by DEAE- Sephacel ion-exchange chromatography. The serum is first dialyzed against phosphate buffered saline, pH 7.4, and then applied to the column. Anti-hDPl IgG antibody is then collected as it flows off the column.
B. Monoclonal antibodies
The procedure followed is based on that described by Kohler and Milstein (1975) Natϋie 256: 495.
50-100 ug purified hDPl in 0.2 ml complete Freund's adjuvant is injected into the hind foot pad of a mouse. After 10-12 days, the swollen popliteal lymph node in the hind leg is dissected from the surrounding fat pad.
The lymph node is placed into Dulbecco's minimal essential media which has been supplemented with 2 mM glutamine, 100 IU/ml penicillin and 100 ug/ml streptomycin. The tissue is minced and the cells dispersed. The cell suspension is allowed to settle for 10 min. on ice and the supernatant then centrifuged (1600 φm for 6-7 min.). The supernatant is discarded. An equal volume of medium is then added and the cells washed twice. Viable cells are counted using dye exclusion.
The cells are then fused with the p3U l myeloma cell line (Yelton et al. (1978) Curr. Top. Microbiol. Immunol. 81. 1 ). using standard techniques. Cells (2ml/well) are then pipetted into cell culture trays and placed in an incubator. Multiple subclonings are then carried out and the hybridomas screened by ELISA.
Example 1 1 : Bisphosphonate binding immunoassay
The assay is based on a modification of the Antigen Capture Assay (originally developed by Berson and colleagues - see Berson, S.A., Yalow, R.S., Bauman, A., Rothschild, M.A. and Newerly, K. (1956) Journal of Clinical Investigation 35, pages 170 to 190, and Yalow, R.S. and Berson, S.A. (1959) Nature 184, pages 1648-1649).
Anti-bisphosphonate binding protein monoclonal antibody is bound to the bottom of each well of a microtitre plate using standard procedures (approximately 2 ug/ml of antibody in PBS). Any remaining binding sites in the wells are then blocked with 3% BSA in PBS and a partially pure preparation of bisphosphonate binding protein is added to each well to specifically bind the bisphosphonate binding protein to the antibodies in the wells. Each well is then washed to remove excess unbound bisphosphonate binding protein and any other free contaminating proteins.
Radioiabeled (eg. ^C or ^H) bisphosphonate is then introduced into the wells. The labeled bisphosphonate binds to the bisphosphonate binding protein which is attached to the antibody-coated wells. The plates are incubated for a time sufficient to allow full binding and the wells are then washed to remove unbound bisphosphonate.
The amount of radiolabel in each well is then determined by washing each well with trichloroacetic acid followed by liquid scintillation counting of the resulting fluid. The binding affinity of a range of different bisphosphonates can therefore be determined by comparing the binding characteristics of radioiabeled derivatives in the assay.
Example 12: Bisphosphonate binding competitive immunoassay
Anti-bisphosphonate binding protein monoclonal antibody is bound to the bottom of each well of a microtitre plate using standard procedures (approximately 2 ug/ml of antibody in PBS), and the microtitre plate treated as described in Example 11.
The ability of an unleveled test bisphosphonate to compete with a labeled reference bisphosphonate for binding to the immobilized bisphosphonate binding protein is then determined by assaying a series of test mixtures having predetermined concentrations of labeled reference and unleveled test bisphosphonates using assay procedures similar to those described in Example 1 1. Example 13: Unleveled bisphosphonate binding assay The bisphosphonate binding protein is bound to a Pharmacia BIAcore 2000 chip using standard procedures. Small volumes of bisphosphonate solutions at various concentrations are then allowed to flow over the chip, and changes in the optical properties of the chip attendant on bisphosphonate binding are detected by the Pharmacia BIAcore apparatus. The results permit calculation of the binding constant for the binding of the bisphosphonate to the bisphosphonate binding protein.
The content of all references cited herein are hereby incoφorated by reference. Materials described as starting materials are made by known methods, or are known in the art. They are thus practicable to the skilled artisan without further guidance.
Though not required for enablement, the Dictyostelium Ax-2, refered to herein, is deposited with the ATCC, (American Type Culture Collection, Rockville, Maryland, USA) prior to submission of this application and bears deposit number 24397, see the ATCC/NIH Repository Catalogue.
Patent Figure 1
Figure imgf000049_0001
Fraction Number
Figure 1. Affinity chromatography of a cell extract of Dictyostelium discoideum on an AHBuBP-affinity column.
The cell extract prepared by sonication of Dictyostelium cells in equilibration buffer at 4°C is loaded onto an AHBuBP-affinity column at a rate of 10 ml/hour. Peaks of non-specifically-bound proteins were eluted with equilibration buffer (1 ). equilibration buffer after overnight incubation (2) and equilibration buffer containing 0.1 M KCl (3). DPI is eluted when 5mM AHBuBp dissolved in equilibration buffer is used directly (4), after overnight incubation (5) and after a further incubation for a few hours (6)
Figure imgf000050_0001
Figure 2. Profile of eluted proteins from the AHBuBP-affinity column.
Each peak from the column is lyophilised, desalted by dialysis and concentrated in a microsep microconcentrator. An aliquot of each fraction is electrophoresed on a 12% polyacrylamide gel in denaturing conditions (Laemmli, 1970) and the gel is stained with Commassie Blue. Lane 1 , molecular weight standards; Lane 2, elution with equilibration buffer; Lane 3, elution with equilibration buffer after overnight incubation; Lane 4, elution with equilibration buffer containing 0.1 M KCl; lane 5, elution with 5 mM AHBuBP in equilibration buffer.
TTTTATTAGTATTTAATTTATTTATATAATTCTTTTTAAAAAAAAACAAAACAAAACAAAATG M
TCC GTT CCA GCT GGT TCT GTT TCA TGT CTT GCT AAT GCA TTA TTA AAT S V P A G S V S C A N A L L N
TTA AGA TCA TCA ACT GAT TAT AAT GCT GAT CAT GGT GTA AAG AAT TCT L -J 5 S T D Y N A D H G V K N S
ATT TTA AAT TTT TCA AAT TCA AAG GAT GCT AGT AGA TTC GAC GGT AGT I L N F S N S D A S R F D G S
GAA TCA TGG TCA TCA TCA GTT TTG GAT AAG AAT CAA TTC ATT GTT GCC E S W S S S V L D K N Q F I V A
GGT AGT GAT TCT GTT AAA CAT TTC GTT GCA ATC TCA ACT CAA GGT CGT G S D S V K H F V A I S T Q G R
GGT GAT CAT GAT CAA TGG GTA ACT TCA TAC AAA TTA AGA TAC ACA CTT G D H D Q W V T S Y K R Y T L
GAT AAT GTA AAC TGG GTT GAA TAT AAC AAT GGT GAA ATA ATC AAT GCC D N V N V E Y N N G E I I N A
AAT AAA GAT AGA AAT TCA ATT GTT ACA ATC AAC TTT AAT CCA CCA ATT N K D R N S I V T I N - F N P P I
AAA GCT AGA TCT ATT GCC ATT CAT CCT CAA ACC TAT AAT AAT CAT ATT K A R S I A I H P Q T Y N N H I
TCA CTT CGT TGG GAA TTA TAT GCA TTA CCA GTT AAA AGT TAT TCA AAT S L R W E L Y A L P V K S Y S N
CCA TCA GTC CAA GTT GGT GAA GTT TCA ATT GGT GAT AGA TCT CTT AAC P S V Q V G E V S I G D R S L N
AGT GGT ACT GGT TCA CGT ACG ATT GTT CGT CAC GTT AAA TTC CCA GTG S G T G S R T I V R H V K F P V
GAA TTC CTT TCT GTT CCA ATC GTA TCA ATT GGT TGT AAA AAA GTT GAT E F L S V P I V S I G C K K V D
GCA CAT ACT GAT AAT GGT GAA ATG AGA TGG GAA GGT AAA TCT GAA. AAT A, H T D N G Q M R W E G K S E N
ATT ACT ACA AAA GGT TTT 3AT TTA ACT TTT ATT ACA TGG GGT AAT AAT I T T K G .- D L T F I T W G N N
GCA GTT TAT GAT TTA ACT TTT GAT TAT GTT GCT GTT GAA TTT AAT AAT A V Y D L T F D Y V A V E F N N
TAA ATAATTAAATAAΓAAAAΓAAAΓAAΛTAAATTTATTTGTTTTTATTTTATATTTTAAAAT AATTAAATAATTAAΓAAATTAAAAAAAAAAAAAAAAAAAAAATTTTAAAATTTTCCAGAAAAA
896 AAAAAAAAAAAA
908
Figure 3. Nucleotide and deduced amino acid sequence of the cDNA for wild- type DPI.
The Nucleotide sequence shown is of the 986-bp cDNA of DPI (Discoidin II) from the wild-type Dictyostelium discoidium Ax-2. The adenine base of the ATG initiation sequence is assigned as 1 in the numbering. Nucleotides are numbered in the right margin and amino acids on the left. An open reading frame of 771-bp encoding 257 amino acids is shown with a single letter code for the translated amino acids. A termination codon (TAA) at the end of the coding sequence is marked with an asterisk A putative signal peptides at the beginning of the amino acid sequence (1-20) is indicated in italics. The nucleotides sequence of the PCR fragment XJ-450 used to screen the cDNA library is underlined. Multiple polyadenylation signal sequences (AAT AAA) are shown in italics.
DISC IA QYIVAGCEVPRTFMCVALQGRGDADQWVTSYKIRYSLDNVSWFEYRNGAAVTGVTDRNTV DPI OFIVAGSDSVKHFVAISTOGRGDHDO VTSYKLRYΓLD_V^NIVVEY NGEIINANKDRNSI
* **** _ _ * _ __ ***** ******** ** **** * ** * * _ ***__
(Peptide 2) (Peptide 5)
DISC IA MSTQ-GLVQLLANAQCHLRTSTNYNGVHTQFNSALNYΪN-NGTNTIDGSEA CSSIVDTN DPI MSVPAGSVSCLANALLNLRSSTDYNADHGVKNSILNFSNSKDASRFDGSES SSSVLDKN
** * * ..** _**_**_** * * . **_ * ****.* **_ * *
(Peptide 1) (Peptide 4)
DISC IA QYIVAGCEVPRTFMCv*ALQGRGDADQ VTSYKIRYSLDNVS FEYRNGAAVTGVTDRNTV DPI OFIVAGSDSVl_HFVAISTOGRGDHDOWTSYKLRYrLD_JWJPv ΕYNNGEIINANKDRNSτ
* **** _ * _ __ ***** ******** ** **** * ** ** _ _ ***
(Peptide 2) (Peptide 5)
DISC IA VNHFFDTPIRARSIAIHPLTWNGHISLRCEFYTQPVQS SVTQVGADIYTGDNCALNT
DPI VTINFNPPIKAi-SJAJHPOTYN HISLRWELYALPVKSYSNPSVOVG-EVSIGDR-S NS
* *_ ** ******** * * ***** *.*. **.* *** __ ** .**.
(Peptide 6)
DISC IA GSGKREWVPVKFQFEFATLPKVALNFDQΓDCTDATNQTRIGVQPRNITTKGFDCVFYTW
DPI GTGSRTIVRHVKFPVEFLSVPIVSIGCKKVD--HTDNGOMR EGKSENITTKGFDLTFITW
★ * * _* *** ** * * * * * ******** * **
( Peptide 3 ) ( Peptide 7) ( Peptide 8)
DISC IA NEN VYSLRADYIATALE- DP1 GNNAVYDLTFDYVAVEFNN
* ** * ** _ *
Sequence ID # 3 deduced amino acid sequence of DPI Sequence ID # 4 deduced amino acid sequence of Discoidin IA
Figure 4. Pairwise comparison of the deduced amino acid sequences for DP and discoidin IA (DISC IA).
Asterisks (*) indicate positions of identity while dots (.) indicate positions of conservation. Amino acid sequences obtained for peptides of DPI are underlined and the corresponding numbers are shown in (parentheses) underneath the sequences. Regions used for generating primers XJ-1 and XJ-2 are in italics. The alignment is performed by use of the multiple alignment program of CLUSTALV (Higgins et al, 1992).
Figure imgf000054_0001
Figure 5. Northern Blot analyses of DPI (discoidin II) mRNA expression in axenic strains.
0.5mg samples of mRNA were fractionated on formaldehyde gels. After transfer to Hybond-N membrane, the samples were hybridized to successive hybridizations with 3 P-labeled PCR fragments of DPI (XJ450), discoidin I A and Dd-tcpl (control). Lane 1, axenically grown Ax-2; Lane 2, bacterially grown Ax-2. The columns on the right-hand side represent the relative abundance of the hybridized mRNA transcripts in the corresponding lanes of the blots, (b) Hybridization to 32p. labeled DPI cDNA XJ-450 of mRNA isolated from amoebae of strain AX-2 harvested from axenic cultures at low density (lxlO^cells/ml) (A) and at high density (4xl06cells/ml) (B).
<b). DPI
Figure imgf000055_0001
Figure 6. Southern Blot Analysis of DPI (discoidin II). Genomic DNA from the Ax-2 strain is subjected to digestion and the products were separated on a 1% agarose gel (a). After transfer to Hybond-N membrane, the samples were hybridized to the 32P-labelled XJ-450 probe (b) or the discoidin IA probe (c). Note that in each lane there is only a single band of hybridization to the DPI (discoidin II) cDNA XJ- 450 whereas there were multiple bands for hybridization to the discoidin IA cDNA probe.
Lane identiifcation in (a) is as follows A Ax-2deπved DNA, UD. undigested materials, and mw: Molecular weight marker Restπction enzymes used in the numbered lanes are: 1. EcoRI, 2. Bam 1. 3 Vσ/I. 4 Accl, and 5 Sau3Al.
Figure imgf000056_0001
0 (a) DPI <b> DP1
Figure imgf000056_0002
(C) DPI ( DP1
Figure 7. Sequence Comparison Matrices for DPI (discoidin II) (horizontal) with (a) human coagulation factor V, (b) human coagulation factor VIII, (c) ORF7 (Yat7-Rhobl) linked to the Rhodopseudomonas blastica atp operon and (d) milk fat globule (MFG) protein. The Matrices are plotted using MDMγg mutation dat matrix Pam250 as the scoring system (Schwartz and Dayhoff 1978), with a window size of 8, and a minimum score of 50%.
Figure imgf000057_0001
Figure 8. Hydrophobic plot and the secondary structure prediction for DPI (discoidin II).
The profile is constructed using the Kyte-Doolittle algorithm with a sliding window of seven amino acids. Values above the zero axis correspond to hydrophilic segments. Secondary structure predictions based on algorithms of both Chou- Fasman (1978) (CH, shadowed boxes) and Robson-Gamier (Gamier et al. 1978) (RG, filled boxes) are shown in the lower half of the figure.
Store + 9.4 Ffesidues S I S
Figure imgf000058_0001
Figure 9. Prediction of a probable cleavage site for a putative signal peptide of DPI (discoidin II) by the method of von Heijne (1986) using the program MacProt (Markiewicz, 1991; Luttke, 1990). A window size of 15 residues of weight matrix for eukaryotic protein is used. Generally a protein having a signal petide has one segment scoring greater than +3.5 while cytosolic proteins have scores less than +3.5.
Figure imgf000059_0001
72 74 76 78 8β 82 84 86 88 9β 92
H F V A I S T Q G R G D H D Q W V T S Y K
Figure 10. Surface probability plot of the RGD-containing region of DPI (discoidin II).
A surface probability profile of residues 76-88 is constructed using the Janin et al. (1978) and Emini et al. (1985) algorithms. A probability value above 0.5 is assigned to a water-accessible, "exposed" sequence and values below 0.5 to buried segments.
hDPl NH2 -M VEVLPALTDNYMYLVIDDETKEAAIVDPVQ
• • • # • •
RSP29 -kd NH2 -M iElLPALTDNYMYLiIDedTqxAAvVDPVQ
Sequence ID# 7 Amino terminal amino acid sequence of hDPl
Sequence ID# 8 Amino terminal amino acid sequence of r^t round spermatid 29,000 Mr protein (RSP-29)
Figure 11. N-terminal amino acid sequence alignment of hDPl and the rat round spermatid 29,000 Mr protein (RSP-29).
A position where both sequences contained the same amino acid is indicated by an asterisk. A colon indicates a position where the two sequences contain amino acids having similar properties whilst a dot indicates a position where the two sequences contain different amino acids.
Pepticfe sequences Pg pn e 1 Vepo& l mRNA _AAA lst stiani cDNA ^ ^_ "TTT-AP
Pnmeis DJ1 PJ5 DJ.
DJ10 UAP
600 bp _{__
570 tp _£.
100 bp ]£_
Figure 12. Schematic representation for amplification of hDPl cDNA by the polymerase cahin reaction.
Degenerate primers DJ1, DJ5, DJ9 (sense), and DJ10 (antisense were designed from a knowledge of peptide sequences of hDP 1. The nucleotide sequence of the primer UAP (antisense) had been incoφorated into the first strand cDNA during the reverse transcription using primer oligo(dT) 17-AP.
ACCGGCCCGGGTCATGGTGGTGGGCCGAGGGCTGCTCGGCCGCCGCAGCCTCGC - 93
CGCGCTGGGAGCCGCCTGCGCCCGCCGAGGCCTCGGTCCAGCCCTGCTGGGAGTTTTCTGCCA - 36
CACAGATTTGCGGAAGAACCTGACCGTGGACGAGGGCACCATG AAG GTA GAG GTG CTG 16
1 .' K V E V L
CCT GCC CTG ACC GAC AAC TAC ATG TAC CTG GTC ATT GAT GAT GAG ACC 64 7 P A L T D N Y M Y V I D D F T
AAG GAG GCT GCC ATT GTG GAT CCG GTG CAG CCC CAG AAG GTC GTG GAC 112 3 Y E A A I V n P V 0 P Q K V V D
GCG GCG AGA AAG CAC GGG GTG AAA CTG ACC ACA GTG CTC ACC ACC CAC 160 9 A A R K H G V K L T T V T T H
CAC CAC TGG GAC CAT GCT GGC GGG AAT GAG AAA CTG GTC AAG CTG GAC- 208 5 H H W D H A G G N E K V K L E
TCG GGA CTG AAG GTG TAC GGG GGT GAC GAC CGT ATC GGG GCC CTG ACT 256 1 S G L K V Y G G D D R I G A L T
CAC AAG ATC ACT CAC CTG TCC ACA CTG CAG GTG GGG TCT CTG AAC GTC 304 7 H K I T H L S T L Q V G S L N V
AAG TGC CTG GCG ACC CCG TGC CAC ACT TCA GGA CAC ATT TGT TAC TTC 352 3 K C L A T P C H T S G H I C Y F
GTG AGC AAG CCC GGA GGC TCG GAG CCC CCT GCC GTG TTC ACA GGT GAC 400 9 V S K P G G Ξ E P P A V F T G D
ACC TTG TTT GTG GCT GGC TGC GGG AAG TTC TAT GAA GGG ACT GCG GAT 448 35 T L F V A G C G K F Y E G T A D
GAG ATG TGT AAA GCT CTG CTG GAG GTC TTG GGC CGG CTC CCC CCG GAC 496 51 E M C K A L L E V G R L P P D
ACA AGA GTC TAC TGT GGC CAC GAG TAC ACC ATC AAC AAC CTC AAG TTT 544 67 T R V Y C G H E Y T I N N L K F
GCA CGC CAC GTG GAG CCC GCC AAT GCC GCC ATC CGG GAG AAG CTG GCC 592 83 A R H V E P A N A A I R E K A
TGG GCC AAG GAG AAG TAC AGC ATC GGG GAG CCC ACA GTG CCA TCC ACC 640 99 W A K E K Y S T G F P T V P S T
CTG GCA GAG GAG TTT ACC TAC AAC CCC TTC ATG AGA GTG AGG GAG AAG 688 15 __ E E E 1 1 E E M R V R E K
ACG GTG CAG CAG CAC GCA GGT GAG ACG GAC CCG GTG ACC ACC ATG CGG 736 31 T V Q Q H A G E T D P V T T M R
GCC GTG CGC AGG GAG AAG GAC CAG TTC AAG ATG CCC CGG GAC TGA GGC 784 47 A V R R E K D Q F K M P R D *
CGCCCTGCACCTTCAGCGGATTTGGGGATTAGGCTCTTTTAGGTAACTGGCTTTCCTGCTGGT 847
CCGTGCGGGAAATTCAGTCTTGATTTAACCTTAATTTTACAGCCCTTGGCTTGTGTTATCGGA 910
CGTTTTAATGCATATTTATAAGAGAAGTTTAACAAGTATTTATTCCCATAAAAAGGGGGGGGC 973
CGGTACCCAATTCGCCCTATAGTGAGTCG 1002
Sequence ID #9 cDNA sequence of HDPl
Sequence ID# 10 deduced amino acid sequence of HDPl Figure 13. Nucleotide sequence and deduced amino acid sequence of hDPl cDNA.
The nucleotide sequence of hDPl cDNA isolated from a human testis cDNA library consists of 1160 base pairs. The adenine base of the ATG initiation codon is assigned a 1 in the numbering. Nucleotides are numbered in the right margin and amino acids on the left. An open reading frame of 780-bp encoding 260 amino acids is shown with a single letter code for the translated amino acids. A stop codon (TGA) at the terminus of the translation sequence is marked with an asterisk, the amino acid sequences of peptides 1 and 2 are underlined.
RdBOTC _.
■ bα_da_ce * of hDPl
1I2m3l4H5 6l7i 8 ■ 9■ 10I 11I 12B 13II MI ISM 16 J 1U7 ItLM 19 H 29 21i 22
Figure imgf000064_0001
l.HMrl (HI tl|kιι (IH) I7.T_*κ (IV) 20. THF -1
2.Bral> lO.TkgKτt It.H-L- 21.MS63 a.p aai ll.PrMate i9._α_o 22 Pr_>_-rtα_-«tls . Lot 12Te-_ι
S.Mr 13. Ow.
6. »»____.«•_ l..fc _H__tα_M
7.K___rj IS. Oka
8.Pmcr__ tt. JV_πtkar_l»koi k*αcjtts
Figure 14. Northern blot analyses of hDPl in human tissues.
Human multiple tissue blots (I and II) counting 2 μg poly(A+)RNA from various tissues were purchased from Clontech. Northern blots III and IV were prepared by running 1 μg poly(A+)RNA in a formaldehyde gel and blotting onto Hybond-N membrane. Hybridization is carried out using a [32p]-labeled PCR fragment of hDPl cDNA. A DNA fragment of β-actin is also labeled with [32p] and hybridized to the same blots. The height of the bar on top of each lane represents the relative abundance of hDPl mRNA in that tissue.
1 2 3 4 5 6 7 8 9 M kb
Figure imgf000065_0001
— 4.4
— 2.3
— 2.0
1. Yeast 2. Chic_aαι 3. Rabbit 4. Cow 5. Dog
6. Mouse 7. Rat 8. Mαntey 9. Human M. ENAMaikei
Fς.7.13.
Figure imgf000065_0002
species was puichased fiαm Ckmtech. The DNAhad teen digested with EcoSi., ran ona 0.7% agaiose gel and t___t_feιιed to a ny n membιa_te. Hyli jdmbon was caiiied out usmg a pPJ-labefledPCR fragment of hDPl cDNA
Figure 15: Southern analysis of hDPl on a zoo-blot. The southern blot containing 8μg og genomic DNA per lane from nine eucaryotic species is obtained from Clontech. The DNA had been digested with EcoRI, run on a 0.7% agarose gel and transferred to a nylon membrane. Hybridization is carried out using a [32p]- labelled PCR fragment of hDPl cDNA.
Figure imgf000066_0001
Figure 16. Northern Blot analyses of hDPl homologues in Dictyostelium discoideum. A [32p]-labelled PCR fragment of hDPl cDNA is hybridised to a northern blot containing 5μg mRNA from Dictyostelium amoebae grown on acteria (lane 1), or grown axenically (lane 2).
Figure imgf000067_0001
hDPl hDP1
(a) (b)
Figure 17. Sequence comparison for hDPl.
Atrix plots of hDPl (horizontal against (a). ORF3 linked tho the Rhodopseudomonas blastica atp operon (ATPase ORF3, vertical) and (b). ORF1 linked to genes for arginyl tRNA synthetase and ribonuclease H of Buchnera adiphicola (ATSORFl, vertical). The matrices were plotted using MDMγg mutation data matrix Pam250 as the scoring system (Schwartz and Dayoff, 1978), with a window size of 8, and a minimum score of 55%
Sequence ID #9 cDNA sequence of HDPl
Sequence ID# 10 deduced amino acid sequence of HDPl
2 KVEVLPALTDNY MYLVID ... DETKEAAIVDPVQPQKWDAA 40
: : : : | : : | | : : | : : : : | : | |
33 PPDPI GVTEAFKRDTNSKKMNLGVGAYRDDNGKSYVLNCVRKAEAMIAA 82
41 RKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT 90
83 KKMD.KEYLPIAGLADFTRASAELALGENSEAFK.... SGRYVTV QGIS 126
91 HLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAG 140
127 GTGSLRVGANFLQRFFKFSRD VYLPKPSWGNHTPIFRDAG QLQA 171
141 CGKFYEGTADE CKALLEVLGRLPPDTRVY ... CGHEYTINNLKFARHVE 187
: : I : :| ::::| : |:|: | : : |
172 YRYYDPKTCSLDFTGAMEDISKIPEKSIILLHACAHNPTGVDPRQEQ KE 221
188 PANAAIREKL A AKEKYSIGEPTVPSTLAEEFTY ...NPFM 225
222 LASWKKR LLAYFDMAYQGFASGDINRDA ALRHFIEQGIDWLSQSYA 271
226 RVREKTVQQHAGETDPVTTMRAVRREKDQFKM 257
: : : :: I :| |:|:
272 KNMGLYGERAGAFTVICRDAEEAKRVESQLKI 303 sequence ID#9 amino acid sequence of HDPl sequence ID#11 amino acid sequence of aspartate amino transferase
Figure 18. Sequence comparison of hDPl with aspartate aminotransferase (AAT) from chicken mitochondria. hDPl is shown in the upper line and DPI in the lower one. "|" indicates identity between aligned residues; ":" indicates similarity. The comparison is performed using the program "bestfit" of the GCG package (Devereux et al, 1984). Identity between the two proteins is 14.3% and the similarity is 44.1%.
5 ββ
_ 4 ββ 3 βθ
3 2 ββ
: 1 θβ β ββ ., ___.v.>___jL_,i__
» -l ββ . "2 ββ
: -3 ββ -4 θθ -5 ββ
Figure imgf000069_0001
Figure 19. Hydrophilic plot and secondary structure prediction for hDPl.
The hydrophilic profile is constructed using the Kyte-Doolittle algorithm (Kyte and Doolittle, 1982) with a sliding window of seven amino acids. Values above the zero axis correspond to hydrophilic segments Secondary structure prediction based on the algorithms of Chou-Fasman (1978) (CH, shadowed boxes) and Robinson- Gamier (Gamier et al 1978) (RG, filled boxes) are shown in the lower half of the figure.
16 LVIDDETKEAAIVDPVQPQKWDAARKHGVKLTTVLTTHHH DHAGGNEK 65
1 SVPAGSVSCLANALLNLRSSTDYNADHGVKNSIL NFSNSKDASR 45
66 LVKLES.GLKVYGGDDRIGALTHKITHLSTLQ VGSLNVKC 104
: || : I : :: I I : |: : 1 1 =::
46 FDGSESWSSSVLDKNQFI AGSDSVKHFVAISTQGRGDHDQWVTSYKLRY 95
105 .LAT.PCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMC 153
96 TLD_TVN VEYNNGEIINANKDRNSIVTINFNPPIKARSIAIHPQTYNNHI 145
154 KALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPANAAIRE ... KLA A 200
146 SLRWELYA.LP.. KSYSNPSVQVGEVSIGDRSLNSGTGSRTIVRHVKFP 192
201 KEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRR 250
| |: : : : I I || :| : :| : :
193 VEFLSVPIVSIGCKKVDAHTDNGQMR.WEGKSENITTKGFDLTFI..T G 239
251 EKDQFKMPRD 260
240 N AVYDLTFD 249
Sequence ID# 10 amino acid sequence of HDPl
Sequence ID # 2 amino acid sequence of discoidin II
Figure 20. Amino acid sequence comparison of hDPl with DPI (discoidin II). hDPl is shown in the upper line and DPI in the lower one. "|" indicates identity between aligned residues; ":" indicates similarity. The comparison is performed using the program "bestfit" of the GCG package (Devereux et al, 1984). The identity between the two proteins is 12.8% and the similarity is 38.5%.
CATAATGAAAGTTATTTTCGTAGTTTTAGCCATTGTATTAGTTACATTATGGGCT
56 AISCCATCAGAAGCTGGTAAAGACCCAAAGATTACCAATAAAGTATTCTTTGATATAGAA
W P S E A G K D P K I T N K V F F D I E 20
116 ATTGATAATAAACCAGCAGGTAGAATTGTATTTGGTTTATATGGAAAGACAGTACCAAAA
I D N K P A G R I V F G L Y G K T V P K 40
176 ACAGTTGAAAACTTTAGAGCATTATGTACTGGTGAAAAAGGTTTAGGTACCAGTGGTAAA
T V E N F R A L C T G E K G L G T S G K 60
236 CCATTACATTATAAAGATAGTAAATTCCATCGTATCATTCCAAACTTTATGATTCAAGGT
P L H Y K D S K F H R I I P N F M I Q G 80
296 GGTGATTTCACAAGAGGTGATGGTACTGGTGGTGAATCAATTTATGGTAAAAAATTCAAT
G D F T R G D G T G G E S I Y G K K F N 100
356 GATGAAAACTTCAAAATTAAACACTCCAAACCAGGTCTTTTATCAATGGCTAACGCTGGT
D E N F K I K H S K P G L L S M A N A G 120
416 CCAAACACTAATGGTTCACAATTCTTTATTACTACCGTTGTTACTTCATGGTTAGATGGT
P N T N G S Q F F I T T V V T S W L D G 140
476 CGTCATACTGTTTTTGGTGAAGTTATTGAAGGTATGGATATTGTTAAACTCCTTGAATCC
R H T V F G E V I E G M D I V K L E S 160
536 ATTGGTTCCCAATCTGGAACACCAAGTAAAATTGCTAAAATCTCAAACTCTGGTGAATTA
I G S Q S G T P S K I A K I S N S G E L 180
596 TAAATAAAATAAAACCAAACCAAATAAAATAAAT
Sequence ID#12 nucleotide cDNA sequence of DdCyP2 Sequence ID#13 predicted amino acid sequence of DdCyP2
Figure 21. Nucleotide sequence of the Wt.3 cDNA insert and the deduced amino acid sequence of Dd CyP2.
The nucleotide sequence shown is for the 629 bp cDNA of Dd CyP2 in clone Wt.3. Nucleotides are numbered on the left and amino acids on the right. An open reading frame of 540 bp encoding 180 amino acids is shown using the single letter code for the translated amino acids. The initiation codon (ATG) and the termination codon (TAA) are underlined. An RGD motif is shown in bold-type. The five N-terminal amino acid residues absent from the isolated Dd CyP2 protein are in italics. 1 50
1 MPSEAGKDPK ITNKVFFDIE IDNKPAGRIV FGLYGKTVPK TVENFRALCT
2 .MTTVKPTSP ENPRVFFDIT IGGVEAGKW MELYA TVPK TAENFRALCT
3 M VNPKVFFDMT VGDKAAGRIV MELYADTVPE TAENFRALCT
4 SQVYFDVE ADGQPIGRW FKLYNDIVPK TAENFRALCT
5 VNPTVFFDIA VDGEPLGRVS FELFADKVPK TAENFRALST
51 100
1 GEKGLGTSGK PLHYKDSKFH RIITNFMIQG GDFTRGDGTG GESIYGKKFN
2 GEKGIGKSGK PLSYKGSSFH RVITNFMCQG GDFTMGNGTG GESIYGNKFA
3 GERGIGKSGK PLHYKGSAFH RVIPKFMCQG GDFTAGNGTG GESIYGMKFK
4 GEKGFG YAGSPFH RVIPDFMLQG GDFTAGNGTG GKSIYGGKFP
5 GEKGFG YKGSCFH RIIPGFMCQG GDFTRHNGTG GKSIYGEKFE
* * * *
+ + ++ + ++
101 150
1 DENFKIKHSK PGLLSMRNAG PNTNGSQFFI TTWTSWLDG RHTVFGEVIE
2 DENFKLKHFG QGTLSMANAG ANTNGSQFFI CVAPTDWLDG KHWFGFVTE
3 DENFVKKHTG PGILSMRNAG SNTNGSQFFI CTEKTS LDG KHWFGQWE
4 DENFKKHHDR PGLLSMANAG PNTNGSQFFI TTVPCP LDG KHWFGEWD
5 DENFILKHTG PGILSMANAG PNTNGSQFFI CTAKTEWLDG KHWFGKVKE
** * * * *
+++ + + ++ +
151 180 Identity with Dd CyP2
1 GMDIVKLLES IGSQSGTPSK IAKISNSGEL
2 GMDWKKMEA AGSQSGKTTK PWIANCGQL 65.0% in 175 aa overlap
3 GMDWRDIEK VGSDSGRTSK KWTCDCGQL 65.9% in 170 aa overlap
4 GYDIVKKVES LGSPSGATKA RIWAKSGEL 64.9% in 168 aa overlap
5 GMNIVEAMER FGSRNGKTSK KITIADCGQLE 61.8% in 170 aa overlap
Figure 22. Alignment of the amino acid sequence of Dd CyP2 and the sequences of selected members of the cyclophilin A family of proteins.
The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences of the CyPs were derived from the following sources: (1) Dd CyP2; (2) Dd CyPl (Barisic et al, 1991); (3) Brassica napus (p24525, Gasser et al, 1990); (4) Saccharomyces cerevisiae (pl4832, Haendler et al, 1989); (5) human (p05092, Haendler et al, 1987). Residues of human CyPA that are located in close contact with a tetrapeptide Ala- Ala-Pro- Ala substrate are marked with an asterisk (Kallen & Walkinshaw, 1992); residues of human CyPA that are in close contact with bound cyclosporin A are marked with a cross (Theriault et al, 1 93; Pflugl et al, 1993). 1 50
1 VNPTVFF
2 MLALRCGSRW LGLLSVPRSV PLRLPAARAC SKGSGDPSSS SSSGNPLVYL
3 PSEAGK DPKITNKVFF
4 MKVLLAAALI AGSVFFLLLP GPSAADEKKK GPKVTVKVYF
5 MG PGPRLLLPLV LCVGLGALVF SSGAEGFRKR GPSVTAKVFF
51 100
1 DIAVDGEPLG RVSFELFADK VPKTAENFRA LSTGEKGF GYKGS
2 DVDANGKPLG RWLELKADV VPKTAENFRA LCTGEKGF GYKGS
3 DIEIDNKPAG RIVFGLYGKT VPKTVENFRA LCTGEKGLGT SGKPLHYKDS
4 DLRIGDEDVG RVIFGLFGKT VPKTVDNFVA LATGEKGF GYKNS
5 DVRIGDKDVG RIVIGLFGKV VPKTVENFVA LATGEKGY GYKGS
101 150
1 CFHRIIPGFM CQGGDFTRHN GTGGKSIYGE KFEDENFILK HTGPGILSMA
2 TFHRVIPSFM CQAGDFTNHN GTGGKSIYGS RFPDENFTLK HVGPGVLSMA
3 KFHRIITNFM IQGGDFTRGD GTGGESIYGK KFNDENFKIK HSKPGLLSMR
4 KFHRVIKDFM IQGGDFTRGD GTGGKSIYGE RFPDENFKLK HYGPGWVSMA
5 KFHRVIKDFM IQGGDITTGD GTGGVSIYGE TFPDENFKLK HYGIG VSMA
151 200
1 NAGPNTNGSQ FFICTAKTE LDGKHWFGK VKEGMNIVEA MERFGSRNGK
2 NAGPNTNGSQ FFICTIKTD LDGKHWFGH VKEGMDWKK IESFGSKSGR
3 NAGPNTNGSQ FFITTWTS LDGRHTVFGE VIEGMDIVKL LESIGSQSGT
4 NAGKDTNGSQ FFITTVKTAW LDGKHWFGK VLEGMEWRK VESTKTDSRD
5 NAGPDTNGSQ FFITLTKPTW LDGKHWFGK VIDGMTWHS IELQATDGHD
201 Identity with Dd CyP2 (3)
1 TSKKITIADC GQLE 61.8% in 170 aa
2 TSKKIVITDC GQLS 61.4% in 166 aa
3 PSKIAKISNS GEL
4 KPLKDVIIAD CGKIEVEKPF AIAKE .. 60.6% in 180. aa
5 RPLTNCSIIN SGKIDVKTPF EIADW 60.7% in 166 aa
Figure 23. Alignment of the amino acid sequences of Dd CyP2 and four human cyclophilins.
The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group) Fhe sequences of the CyPs were derived from the following sources: (1) human ( > PA (p05092, Haendler et al, 1987), (2) human CyPD (p30405, Bergsma d al . I *>l ); (3) Dd CyP2; (4) human CyPB (p23284, Price et al, 1991); (5) human ( \ PC ( Schneider et al, 1994). 72
1 50 M VT.LAAALT Aπ.qyFFT. T.P GPSAADEKKK GPKVTVKVYF DLRIGDEDVG MKVI.FAAALI Vr_.qwpT.T.T.P GPSVANDKKK GPKVTVKVYF DLQIGDESVG M AT.VAATA . GPftT .T .P AASRADERKK GPKVTAKVFF DLRVGEEDAG MKVLFAAALI VG.qWFLLLP GPSVANDKKK GPKVTVKVYF DFQIGGRTCR MPSEAGK DPKITNKVFF DIEIDNKPAG
51 100 RVIFGLFGKT VPKTVDNFVA LATGEKGFG YKNS KFHRVIKDFM RWFGLFGKT VPKTVDNFVA LATGEKGFG YKNS KFHRVIKDFM RWIGLFGKT VPKTVENFVA LATGEKGFG FKGS KFHRVIKDFM TSDL TL KD CSKTVDNFVA LATGEKGFG YKNS KFHHMIKDFM RIVFGLYGKT VPKTVENFRA LCTGEKGLGT SGKPLHYKDS KFHRIITNFM
101 150 IQGGDFTRGD GTGGKSIYGE RFPDENFKLK HYGPGWVSMA NAGKDTNGSQ IQGGDFTRGD GTGGKSIYGE RFPDENFKLK HYGPGWVSMA NAGKDTNGSQ IQGGDFTRGD GTGGKSIYGD RFPDENFKLK HYGPGWVSMA NAGKDTNGSQ IQGGDFTRGD GTGGKSIYGE RFPDENFKLK HYGPGWVSMA NAGKDTNGSQ IQGGDFTRGD GTGGESIYGK KFNDENFKIK HSKPGLLSMR NAGPNTNGSQ
151 200 FFITTVKTAW LDGKHWFGK VLEGME RK VESTKTDSRD KPLKDVIIAD FFITTVKTSW LDGKHWFGK VLEGMDWRK VESTKTDSRD KPLKDVIIVD FFITTVKTAW LDGKHWFGK VLEGMDWRK VENTKTDSRD KPLKDVTIAD FFITTVKTSW LDGKHWFGK VLEGMDWRK VENTKTDSRD KPLKDVIIVD
5 FFITTWTSW LDGRHTVFGE VIEGMDIVKL LESIGSQS.G TPSKIAKISN
201 ' Identity with Dd CyP2
1 CGKIEVEKPF AIAKE 60.6% in 180 aa
2 SGKIEVEKPF AIAKE 61.9% in 181 aa
3 CGTIEVEKPF AIAKE 61.1% in 180 aa
4 CGKIEVEKPF AIAKE 54.9% in 182 aa
5 SGEL
Figure 24. Alignment of the amino acid sequences of Dd CyP2 and CyPBs.
The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is performed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences of the CyPBs were derived from the following sources: (1) human (p23284, Price et al, 1991); (2) mouse (p24369, Hasel et al, 1991); (3) chick (p24367, Caroni et al, 1991); (4) rat (p24368, Iwai and Inagami et al, 1990). (5) Dd CyP2 (XPl). Potential hydrophobic signal sequences of CyPBs are underlined. The RGD motifs are in bold-type. MAFPKVYFD MTIDGQPAGR IVMELYTDKT PRTAENFRAL MANPKVFFD LTIGGAPAGR WMELFADTT PKTAENFRAL PKVYFD MTVGDKAAGR IVMELYADTV PETAENFRAL MELFQDW PQTAENFRAL MANPRVFFD MTVGGAPAGR IVMELYANEV PKTAENFRAL ..MPSEAGKD PKITNKVFFD IEIDNKPAGR IVFGLYGKTV PKTVENFRAL MAHCFFD MTIGGQPAGR IIMELFPD.V PKTAENFRAL AEEEEVIEPQ AKVTNKVYFD VEIGGEVAGR IVMGLFGE PKTVENFRAL ...MTTVKPT SPENPRVFFD ITIGGVEAGK WMELYANTV PKTAENFRAL
51
1 CTGEKGVGGT GKPLHFKGSK FHRVIPNFMC QGGDFTAGNG TGGESIYGSK CTGEKGVGKM GKPLHYKGST FHRVIPGFMC QGGDFTAGNG TGGESIYGAK CTGERGIGKS GKPLHYKGSA FHRVIPKFMC QGGDFTAGNG TGGESIYGMK CTGEKGMGDR .KPLHYKGSS FHRVIPGFMC QGGDFTAGNG TGGESIYGAK
5 CTGEKGVGKS GKPLHYKGST FHRVIPEFMC QGGDFTRGNG TGGESIYGEK
6 CTGEKGLGTS GKPLHYKDSK FHRIIPNFMI QGGDFTRGDG TGGESIYGKK
7 CTGEKGIGPS GKKMTYEGSV FHRVIPKFML QGGDFTLGNG RGGESIYGAK
8 CTGEKKYG YKGSS FHRIIKDFMI QGGDFTEGNG TGGISIYGAK
9 CTGEKGIGKS GKPLSYKGSS FHRVITNFMC QGGDFTMGNG TGGESIYGNK
101
1 FEDENFERKH TGPGILSMAN AGANTNGSQF FICTVKTDWL DGKHWFGQV
2 FNDENFVKKH TGPGILSMAN AGPGTNGSQF FICTAKTEWL NGKHWFGQV
3 FKDENFVKKH TGPGILSMRN AGSNTNGSQF FICTEKTSWL DGKHWFGQV
4 FKDENFIKKH TGPGVLSMAN AGPGTNGSQF FICTEKTAWL DGKHWFGQV
5 FPDEKFVRKQ PAPGVLSMAN AGPNTNGSQF FICTVATPWL DGKHWFGQV
6 FNDENFKIKH SKPGLLSMAN AGPNTNGSQF FITTWTSWL DGRHTVFGEV
7 FADENFIHKH TTPGLLSMAN AGPGTNGSQF FITTVATPHL DGKHWFGKV
8 FEDENFTLKH TGPGILSMAN AGPNTNGSQF FICTVKTSWL DNKHWFGQV
9 FADENFKLKH FGQGTLSMAN AGANTNGSQF FICVAPTDWL DGKHWFGFV
151 Identity with Dd CyP2
1 VEGLDWKAI EKVGSSSG.K PTKPVWADC GQLS 67. .7% in 167 aa
2 VEGMDVIKKA EAVGSSSG.R CSKPWIADC GQL 67. .1% in 167 aa
3 VEGMDWRDI EKVGSDSG.R TSKKWTCDC GQL 65. .9% in 170 aa
4 VEGMDWRAI EKVGSQSG.Q TKKPVKIADC GQLS 67. .6% in 148 aa
5 VEGMDWKAI EKVGTRNG.S TSKWKVADC GQLS 67. .7% in 167 aa
6 IEGMDIVKLL ESIGSQSG.T PSKIAKISNS GEL
7 VEGMDWRKI EATQTDRGDK PLSEVKIAKC GQL 63. .3% in 166 aa
8 IEGMKLVRTL ESQETRAFDV PKKGCRIYAC GELPLDA 64 .9% in 174 aa
9 TEGMDWKKM EAAGSQSGKT TKPWIANCG QL 65. .0% in 175 aa
Figure 25. Alignment of the amino acid sequences of two Dd CyPs and some plant CyPs.
The sequences were extracted from the updated releases from GenBank and Swissprot. The alignment is pc _ ^rmed using "pileup" of the GCG program (the Wisconsin Genetic Computer Group). The sequences for the CyPs were derived from the following sources: (I) Arabidopsis thaliana (114844, Lippuner et al, 1994) (2) tomato (m55019, Gasser et al, 1990); (3) Brassica napus (m55018. Gasser et al, 1990); (4) onion (113365, Clark et al, 1993); (5) maize (m.55021, Gasser et al, 1990); (6) Dd CyP2 (XPl); (1) Arabidopsis thaliana (x63616, Bartling et al, 1991); (8) Arabidopsis thaliana (114845, nuclear-encoded chloroplast stromal, Lippuner et al, 1994); (9) Dd CyPl (Barisic et al, 1991). The seven amino acid insertion in Dd CyP2 is in bold-type. The potential ATP/GTP binding sites are underlined.
SEQUENCE LISTING
CD GENERAL INFORMATION:
(i) APPLICANT: COOK, JONATHAN S EBETINO, FRANK H IBBOTSON, KENNETH J JI, XIAOHUI ROGERS, MICHAEL J RUSSELL, ROBERT G XIONG, IAO UAN
(ii) TITLE OF INVENTION: BISPHOSPHONATE BINDING PROTEINS
(iii) NUMBER OF SEQUENCES: 34
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: THE PROCTER & GAMBLE COMPANY
(B) STREET: 11801 EAST MIAMI RIVER ROAD
(C) CITY: ROSS
(D) STATE: OHIO
(E) COUNTRY: USA
(F) ZIP: 45061
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentin Release #1.0, Version #1.30
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: HAKE, RICHARD A
(B) REGISTRATION NUMBER: 37,343
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (513) 627-0087
(B) TELEFAX: (513) 627-0260
(2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 968 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 61..831
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
TTTTATTAGT ATTTAATTTA TTTATATAAT TCTTTTTAAA AAAAAACAAA ACAAAACAAA 60
ATG TCC GTT CCA GCT GGT TCT GTT TCA TGT CTT GCT AAT GCA TTA TTA 108 Met Ser Val Pro Ala Gly Ser Val Ser Cys Leu Ala Asn Ala Leu Leu 1 5 10 15
AAT TTA AGA TCA TCA ACT GAT TAT AAT GCT GAT CAT GGT GTA AAG AAT 156 Asn Leu Arg Ser Ser Thr Asp Tyr Asn Ala Asp His Gly Val Lys Asn 20 25 30
TCT ATT TTA AAT TTT TCA AAT TCA AAG GAT GCT AGT AGA TTC GAC GGT 204 Ser He Leu Asn Phe Ser Asn Ser Lys Asp Ala Ser Arg Phe Asp Gly 35 40 45
AGT GAA TCA TGG TCA TCA TCA GTT TTG GAT AAG AAT CAA TTC ATT GTT 252 Ser Glu Ser Trp Ser Ser Ser Val Leu Asp Lys Asn Gin Phe He Val 50 55 60
GCC GGT AGT GAT TCT GTT AAA CAT TTC GTT GCA ATC TCA ACT CAA GGT 300 Ala Gly Ser Asp Ser Val Lys His Phe Val Ala He Ser Thr Gin Gly 65 70 75 80
CGT GGT GAT CAT GAT CAA TGG GTA ACT TCA TAC AAA TTA AGA TAC ACA 348 Arg Gly Asp His Asp Gin Trp Val Thr Ser Tyr Lys Leu Arg Tyr Thr 85 90 95
CTT GAT AAT GTA AAC TGG GTT GAA TAT AAC AAT GGT GAA ATA ATC AAT 396 Leu Asp Asn Val Asn Trp Val Glu Tyr Asn Asn Gly Glu He He Asn 100 105 110
GCC AAT AAA GAT AGA AAT TCA ATT GTT ACA ATC AAC TTT AAT CCA CCA 444 Ala Asn Lys Asp Arg Asn Ser He Val Thr He Asn Phe Asn Pro Pro 115 120 125
ATT AAA GCT AGA TCT ATT GCC ATT CAT CCT CAA ACC TAT AAT AAT CAT 492 He Lys Ala Arg Ser He Ala He His Pro Gin Thr Tyr Asn Asn His 130 135 140
ATT TCA CTT CGT TGG GAA TTA TAT GCA TTA CCA GTT AAA AGT TAT TCA 540 He Ser Leu Arg Trp Glu Leu Tyr Ala Leu Pro Val Lys Ser Tyr Ser 145 150 155 160
AAT CCA TCA GTC CAA GTT GGT GAA GTT TCA ATT GGT GAT AGA TCT CTT 588 Asn Pro Ser Vat Gin Vat Gly Glu Val Ser He Gly Asp Arg Ser Leu 165 170 175
AAC AGT GGT ACT GGT TCA CGT ACG ATT GTT CGT CAC GTT AAA TTC CCA 636 Asn Ser Gly Thr Gly Ser Arg Thr He Val Arg His Val Lys Phe Pro 180 185 190
GTG GAA TTC CTT TCT GTT CCA ATC GTA TCA ATT GGT TGT AAA AAA GTT 684 Val Glu Phe Leu Ser Val Pro He Val Ser He Gly Cys Lys Lys Val 195 200 205
GAT GCA CAT ACT GAT AAT GGT CAA ATG AGA TGG GAA GGT AAA TCT GAA 732 Asp Ala His Thr Asp Asn Gly Gin Met Arg Trp Glu Gly Lys Ser Glu 210 215 220
AAT ATT ACT ACA AAA GGT TTT GAT TTA ACT TTT ATT ACA TGG GGT AAT 780 Asn He Thr Thr Lys Gly Phe Asp Leu Thr Phe He Thr Trp Gly Asn 225 230 235 240
AAT GCA GTT TAT GAT TTA ACT TTT GAT TAT GTT GCT GTT GAA TTT AAT 828 Asn Ala Val Tyr Asp Leu Thr Phe Asp Tyr Val Ala Val Glu Phe Asn 245 250 255
AAT TAAATAATTA AATAATAAAA TAAATAAATA AATTTATTTG TTTTTATTTT 881
Asn
ATATTTTAAA ATAATTAAAT AATTAATAAA TTAAAAAAAA AAAAAAAAAA AAAATTTTAA 941 AATTTTCCAG AAAAAAAAAA AAAAAAA 968
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 257 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:.: Met Ser Val Pro Ala Gly Ser Val Ser Cys Leu Ala Asn Ala Leu Leu 1 5 10 15
Asn Leu Arg Ser Ser Thr Asp Tyr Asn Ala Asp His Gly Val Lys Asn 20 25 30
Ser He Leu Asn Phe Ser Asn Ser Lys Asp Ala Ser Arg Phe Asp Gly 35 40 45
Ser Glu Ser Trp Ser Ser Ser Val Leu Asp Lys Asn Gin Phe He Val 50 55 60
Ala Gly Ser Asp Ser Val Lys His Phe Val Ala He Ser Thr Gin Gly 65 70 75 80
Arg Gly Asp His Asp Gin Trp Val Thr Ser Tyr Lys Leu Arg Tyr Thr 85 90 95
Leu Asp Asn Val Asn Trp Val Glu Tyr Asn Asn Gly Glu He He Asn 100 105 110
Ala Asn Lys Asp Arg Asn Ser He Val Thr He Asn Phe Asn Pro Pro 115 120 125
He Lys Ala Arg Ser He Ala He His Pro Gin Thr Tyr Asn Asn His 130 135 140
He Ser Leu Arg Trp Glu Leu Tyr Ala Leu Pro Val Lys Ser Tyr Ser 145 150 155 160
Asn Pro Ser Val Gin Val Gly Glu Val Ser He Gly Asp Arg Ser Leu 165 170 175
Asn Ser Gly Thr Gly Ser Arg Thr He Val Arg His Val Lys Phe Pro 180 185 190
Val Glu Phe Leu Ser Vat Pro He Val Ser He Gly Cys Lys Lys Val 195 200 205
Asp Ala His Thr Asp Asn Gly Gin Met Arg Trp Glu Gly Lys Ser Glu 210 215 220
Asn He Thr Thr Lys Gly Phe Asp Leu Thr Phe He Thr Trp Gly Asn 225 230 235 240
Asn Ala Val Tyr Asp Leu Thr Phe Asp Tyr Val Ala Val Glu Phe Asn 245 250 255
Asn
(2) INFORMATION FOR SEQ ID N0:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 317 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: :
Gin Phe He Val Ala Gly Ser Asp Ser Val Lys His Phe Val Ala He 1 5 10 15
Ser Thr Gin Gly Arg Gly Asp His Asp Gin Trp Val Thr Ser Tyr Lys 20 25 30
Leu Arg Tyr Thr Leu Asp Asn Val Asn Trp val Glu Tyr Asn Asn Gly 35 40 45 Glu He He Asn Ala Asn Lys Asp Arg Asn Ser He Met Ser Val Pro 50 55 60
Ala Gly Ser Val Ser Cys Leu Ala Asn Ala Leu Leu Asn Leu Arg Ser 65 70 75 80
Ser Thr Asp Tyr Asn Ala Asp His Gly Val Lys Asn Ser He Leu Asn 85 90 95
Phe Ser Asn Ser Lys Asp Ala Ser Arg Phe Asp Gly Ser Glu Ser Trp 100 105 110
Ser Ser Ser Val Leu Asp Lys Asn Gin Phe He Val Ala Gly Ser Asp 115 120 125
Ser Val Lys His Phe Val Ala He Ser Thr Gin Gly Arg Gly Asp His 130 135 140
Asp Gin Trp Val Thr Ser Tyr Lys Leu Arg Tyr Thr Leu Asp Asn Val 145 150 155 160
Asn Trp Val Glu Tyr Asn Asn Gly Glu He He Asn Ala Asn Lys Asp 165 170 175
Arg Asn Ser He Val Thr He Asn Phe Asn Pro Pro He Lys Ala Arg 180 185 190
Ser He Ala He His Pro Gin Thr Tyr Asn Asn His He Ser Leu Arg 195 200 205
Trp Glu Leu Ty. Ala Leu Pro Val Lys Ser Tyr Ser Asn Pro Ser Val 210 215 220
Gin Val Gly Glu Val Ser He Gly Asp Arg Ser Leu Asn Ser Gly Thr 225 230 235 240
Gly Ser Arg Thr He Val Arg His Val Lys Phe Pro Val Glu Phe Leu 245 250 255
Ser Vat Pro He Val Ser He Gly Cys Lys Lys Val Asp Ala His Thr 260 265 270
Asp Asn Gly Gin Met Arg Trp Glu Gly Lys Ser Glu Asn He Thr Thr 275 280 285
Lys Gly Phe Asp Leu Thr Phe He Thr Trp Gly Asn Asn Ala Val Tyr 290 295 300
Asp Leu Thr Phe Asp Tyr Val Ala Val Glu Phe Asn Asn 305 310 315
(2) INFORMATION FOR SEQ ID N0:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 313 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
Gin Tyr He Val Ala Gly Cys Glu Val Pro Arg Thr Phe Met Cys Val 1 5 10 15
Ala Leu Gin Gly Arg Gly Asp Ala Asp Gin Trp Val Thr Ser Tyr Lys 20 ? 30
He Arg Tyr Ser Leu Asp Asn Val Ser Trp Phe Glu Tyr Arg Asn Gly 35 40 45 79
Ala Ala Val Thr Gly Val Thr Asp Arg Asn Thr Val Met Ser Thr Gin 50 55 60
Gly Leu Val Gin Leu Leu Ala Asn Ala Gin Cys His Leu Arg Thr Ser 65 70 75 80
Thr Asn Tyr Asn Gly Val His Thr Gin Phe Asn Ser Ala Leu Asn Tyr 85 90 95
Lys Asn Asn Gly Thr Asn Thr He Asp Gly Ser Glu Ala Trp Cys Ser 100 105 110
Ser He Val Asp Thr Asn Gin Tyr He Val Ala Gly Cys Glu Val Pro 115 120 125
Arg Thr Phe Met Cys Val Ala Leu Gin Gly Arg Gly Asp Ala Asp Gin 130 135 140
Trp Val Thr Ser Tyr Lys He Arg Tyr Ser Leu Asp Asn Val Ser Trp 145 150 155 160
Phe Glu Tyr Arg Asn Gly Ala Ala Val Thr Gly Val Thr Asp Arg Asn 165 170 175
Thr Val Vat Asn His Phe Phe Asp Thr Pro He Arg Ala Arg Ser He 180 185 190
Ala He His Pro Leu Thr Trp Asn Gly His He Ser Leu Arg Cys Glu 195 200 205
Phe Tyr Thr Gin Pro Val Gin Ser Ser Val Thr Gin Val Gly Ala Asp 210 215 220
He Tyr Thr Gly Asp Asn Cys Ala Leu Asn Thr Gly Ser Gly Lys Arg 225 230 235 240
Glu Val Val Vat Pro Val Lys Phe Gin Phe Glu Phe Ala Thr Leu Pro 245 250 255
Lys Val Ala Leu Asn Phe Asp Gin He Asp Cys Thr Asp Ala Thr Asn 260 265 270
Gin Thr Arg lie Gly Val Gin Pro Arg Asn He Thr Thr Lys Gly Phe 275 280 285
Asp Cys Val Phe Tyr Thr Trp Asn Glu Asn Lys Val Tyr Ser Leu Arg 290 295 300
Ala Asp Tyr He Ala Thr Ala Leu Glu 305 310
(2) INFORMATION FOR SEQ ID N0:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: TAYACYTGAY AAYGTAAYTG GGT 23
(2) INFORMATION FOR SEQ ID N0:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: MGDWSTATHG CATCAYCC 18
(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7:
Met Lys Val Glu Val Leu Pro Ala Leu Thr Asp Asn Tyr Met Tyr Leu 1 5 10 15
Val He Asp Asp Glu Thr Lys Glu Ala Ala He Val Asp Pro Val Gin 20 25 30
(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
Met Lys He Glu Leu Leu Pro Ala Leu Thr Asp Asn Tyr Met Tyr Leu 1 5 10 15
He He Asp Glu Asp Thr Gin Xaa Ala Ala Val Val Asp Pro Val Gin 20 25 30
(2) INFORMATION FOR SEQ ID N0:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1161 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 158..937
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9:
ACCGGCCCGG GTCATGGTGG TGGGCCGAGG GCTGCTCGGC CGCCGCAGCC TCGCCGCGCT 60
GGGAGCCGCC TGCGCCCGCC GAGGCCTCGG TCCAGCCCTG CTGGGAGTTT TCTGCCACAC 120
AGATTTGCGG AAGAACCTGA CCGTGGACGA GGGCACC ATG AAG GTA GAG GTG CTG 175 Met Lys Val Glu Val Leu 260
CCT GCC CTG ACC GAC AAC TAC ATG TAC CTG GTC ATT GAT GAT GAG ACC 223 Pro Ala Leu Thr Asp Asn Tyr Met Tyr Leu Val He Asp Asp Glu Thr 265 270 275
AAG GAG GCT GCC ATT GTG GAT CCG GTG CAG CCC CAG AAG GTC GTG GAC 271 Lys Glu Ala Ala He Val Asp Pro Val Gin Pro Gin Lys Val Val Asp 280 285 290 295
GCG GCG AGA AAG CAC GGG GTG AAA CTG ACC ACA GTG CTC ACC ACC CAC 319 Ala Ala Arg Lys His Gly Val Lys Leu Thr Thr Val Leu Thr Thr His 300 305 310
CAC CAC TGG GAC CAT GCT GGC GGG AAT GAG AAA CTG GTC AAG CTG GAG 367 His His Trp Asp His Ala Gly Gly Asn Glu Lys Leu Val Lys Leu Glu 315 320 325
TCG GGA CTG AAG GTG TAC GGG GGT GAC GAC CGT ATC GGG GCC CTG ACT 415 Ser Gly Leu Lys Val Tyr Gly Gly Asp Asp Arg He Gly Ala Leu Thr 330 335 340
CAC AAG ATC ACT CAC CTG TCC ACA CTG CAG GTG GGG TCT CTG AAC GTC 463 His Lys He Thr His Leu Ser Thr Leu Gin Val Gly Ser Leu Asn Val 345 350 355
AAG TGC CTG GCG ACC CCG TGC CAC ACT TCA GGA CAC ATT TGT TAC TTC 511 Lys Cys Leu Ala Thr Pro Cys His Thr Ser Gly His He Cys Tyr Phe 360 365 370 375
GTG AGC AAG CCC GGA GGC TCG GAG CCC CCT GCC GTG TTC ACA GGT GAC 559 Val Ser Lys Pro Gly Gly Ser Glu Pro Pro Ala Val Phe Thr Gly Asp 380 385 390
ACC TTG TTT GTG GCT GGC TGC GGG AAG TTC TAT GAA GGG ACT GCG GAT 607 Thr Leu Phe Val Ala Gly Cys Gly Lys Phe Tyr Glu Gly Thr Ala Asp 395 400 405
GAG ATG TGT AAA GCT CTG CTG GAG GTC TTG GGC CGG CTC CCC CCG GAC 655 Glu Met Cys Lys Ala Leu Leu Glu Val Leu Gly Arg Leu Pro Pro Asp 410 415 420
ACA AGA GTC TAC TGT GGC CAC GAG TAC ACC ATC AAC AAC CTC AAG TTT 703 Thr Arg Val Tyr Cys Gly His Glu Tyr Thr He Asn Asn Leu Lys Phe 425 430 435
GCA CGC CAC GTG GAG CCC GGC AAT GCC GCC ATC CGG GAG AAG CTG GCC 751 Ala Arg His Val Glu Pro Gly Asn Ala Ala He Arg Glu Lys Leu Ala 440 445 450 455
TGG GCC AAG GAG AAG TAC AGC ATC GGG GAG CCC ACA GTG CCA TCC ACC 799 Trp Ala Lys Glu Lys Tyr Ser He Gly Glu Pro Thr val Pro Ser Thr 460 465 470
CTG GCA GAG GAG TTT ACC TAC AAC CCC TTC ATG AGA GTG AGG GAG AAG 847 Leu Ala Glu Glu Phe Thr Tyr Asn Pro Phe Met Arg val Arg Glu Lys 475 480 485
ACG GTG CAG CAG CAC GCA GGT GAG ACG GAC CCG iτG ACC ACC ATG' CGG 895 Thr Val Gin Gin His Ala Gly Glu Thr Asp Pro .al '"r Thr Met Arg 490 495 OO
GCC GTG CGC AGG GAG AAG GAC CAG TTC AAG »T& :CC CSC GAC 937
Ala Val Arg Arg Glu Lys Asp Gin Phe Lys »»*t (>Ό AΓ Asp 505 510 V.
TGAGGCCGCC CTGCACCTTC AGCGGATTTG GGGATTAGGC 'C'T'AGGT AACTGGCTTT 997
CCTGCTGGTC CGTGCGGGAA ATTCAGTCTT GATTTAACCT t/ TTT CA GCCCTTGGCT 1057
TGTGTTATCG GACGTTTTAA TGCATATTTA TAAGAGAAGT TT^CAΛGTA TTTATTCCCA 1117
TAAAAAGGGG GGGGCCGGTA CCCAATTCGC CCTATAGTGA GTCG 1161 (2) INFORMATION FOR SEQ ID N0:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 260 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Met Lys Val Glu Val Leu Pro Ala Leu Thr Asp Asn Tyr Met Tyr Leu 1 5 10 15
Val He Asp Asp Glu Thr Lys Glu Ala Ala He Val Asp Pro Val Gin 20 25 30
Pro Gin Lys Val Val Asp Ala Ala Arg Lys His Gly Val Lys Leu Thr 35 40 45
Thr Val Leu Thr Thr His His His Trp Asp His Ala Gly Gly Asn Glu 50 55 60
Lys Leu Val Lys Leu Glu Ser Gly Leu Lys Val Tyr Gly Gly Asp Asp 65 70 75 80
Arg He Gly Ala Leu Thr His Lys He Thr His Leu Ser Thr Leu Gin 85 90 95
Val Gly Ser Leu Asn Val Lys Cys Leu Ala Thr Pro Cys His Thr Ser 100 105 110
Gly His He Cys Tyr Phe Val Ser Lys Pro Gly Gly Ser Glu Pro Pro 115 120 125
Ala Val Phe Thr Gly Asp Thr Leu Phe Val Ala Gly Cys Gly Lys Phe 130 135 140
Tyr Glu Gly Thr Ala Asp Glu Met Cys Lys Ala Leu Leu Glu Val Leu 145 150 155 160
Gly Arg Leu Pro Pro Asp Thr Arg Val Tyr Cys Gly His Glu Tyr Thr 165 170 175
He Asn Asn Leu Lys Phe Ala Arg His Val Glu Pro Gly Asn Ala Ala 180 185 190
He Arg Glu Lys Leu Ala Trp Ala Lys Glu Lys Tyr Ser He Gly Glu 195 200 205
Pro Thr Val Pro Ser Thr Leu Ala Glu Glu Phe Thr Tyr Asn Pro Phe 210 215 220
Met Arg Val Arg Glu Lys Thr Val Gin Gin His Ala Gly Glu Thr Asp 225 230 235 240
Pro Val Thr Thr Met Arg Ala Val Arg Arg Glu Lys Asp Gin Phe Lys 245 250 255
Met Pro Arg Asp 260
(2) INFORMATION FOR SEQ ID N0:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 271 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (0) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
Pro Pro Asp Pro He Leu Gly Val Thr Glu Ala Phe Lys Arg Asp Thr 1 5 10 15
Asn Ser Lys Lys Met Asn Leu Gly Val Gly Ala Tyr Arg Asp Asp Asn 20 25 30
Gly Lys Ser Tyr Val Leu Asn Cys Val Arg Lys Ala Glu Ala Met He 35 40 45
Ala Ala Lys Lys Met Asp Lys Glu Tyr Leu Pro He Ala Gly Leu Ala 50 55 60
Asp Phe Thr Arg Ala Ser Ala Glu Leu Ala Leu Gly Glu Asn Ser Glu 65 70 75 80
Ala Phe Lys Ser Gly Arg Tyr Val Thr Val Gin Gly He Ser Gly Thr 85 90 95
Gly Ser Leu Arg Val Gly Ala Asn Phe Leu Gin Arg Phe Phe Lys Phe 100 105 110
Ser Arg Asp Val Tyr Leu Pro Lys Pro Ser Trp Gly Asn His Thr Pro 115 120 125
He Phe Arg Asp Ala Gly Leu Gin Leu Gin Ala Tyr Arg Tyr Tyr Asp 130 135 140
Pro Lys Thr Cyg Ser Leu Asp Phe Thr Gly Ala Met Glu Asp He Ser 145 150 155 160
Lys He Pro Glu Lys Ser He He Leu Leu His Ala Cys Ala His Asn 165 170 175
Pro Thr Gly Val Asp Pro Arg Gin Glu Gin Trp Lys Glu Leu Ala Ser 180 185 190
Val Val Lys Lys Arg Asn Leu Leu Ala Tyr Phe Asp Met Ala Tyr Gin 195 200 205
Gly Phe Ala Ser Gly Asp He Asn Arg Asp Ala Trp Ala Leu Arg His 210 215 220
Phe He Glu Gin Gly He Asp Val Val Leu Ser Gin Ser Tyr Ala Lys 225 230 235 240
Asn Met Gly Leu Tyr Gly Glu Arg Ala Gly Ala Phe Thr Val He Cys 245 250 255
Arg Asp Ala Glu Glu Ala Lys Arg Val Glu Ser Gin Leu Lys He 260 265 270
(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 629 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 56.-595
(xi) SEQUENCE DESCRIPTION: SEQ ID « :12: CATAATGAAA GTTATTTTCG TAGTTTTAGC CATTGTATTA GTTACATTAT GGGCT ATG 58 Met
CCA TCA GAA GCT GGT AAA GAC CCA AAG ATT ACC AAT AAA GTA TTC TTT 106 Pro Ser Glu Ala Gly Lys Asp Pro Lys He Thr Asn Lys Val Phe Phe 265 270 275
GAT ATA GAA ATT GAT AAT AAA CCA GCA GGT AGA ATT GTA TTT GGT TTA 154 Asp He Glu He Asp Asn Lys Pro Ala Gly Arg He Val Phe Gly Leu 280 285 290
TAT GGA AAG ACA GTA CCA AAA ACA GTT GAA AAC TTT AGA GCA TTA TGT 202 Tyr Gly Lys Thr Val Pro Lys Thr Val Glu Asn Phe Arg Ala Leu Cys 295 300 305
ACT GGT GAA AAA GGT TTA GGT ACC AGT GGT AAA CCA TTA CAT TAT AAA 250 Thr Gly Glu Lys Gly Leu Gly Thr Ser Gly Lys Pro Leu His Tyr Lys 310 315 320 325
GAT AGT AAA TTC CAT CGT ATC ATT CCA AAC TTT ATG ATT CAA GGT GGT 298 Asp Ser Lys Phe His Arg He He Pro Asn Phe Met He Gin Gly Gly 330 335 340
GAT TTC ACA AGA GGT GAT GGT ACT GGT GGT GAA TCA ATT TAT GGT AAA 346 Asp Phe Thr Arg Gly Asp Gly Thr Gly Gly Glu Ser He Tyr Gly Lys 345 350 355
AAA TTC AAT GAT GAA AAC TTC AAA ATT AAA CAC TCC AAA CCA GGT CTT 394 Lys Phe Asn Asp Glu Asn Phe Lys He Lys His Ser Lys Pro Gly Leu 360 365 370
TTA TCA ATG GCT AAC GCT GGT CCA AAC ACT AAT GGT TCA CAA TTC TTT 442 Leu Ser Met Ala Asn Ala Gly Pro Asn Thr Asn Gly Ser Gin Phe Phe 375 380 385
ATT ACT ACC GTT GTT ACT TCA TGG TTA GAT GGT CGT CAT ACT GTT TTT 490 He Thr Thr Val Val Thr Ser Trp Leu Asp Gly Arg His Thr Val Phe 390 395 400 405
GGT GAA GTT ATT GAA GGT ATG GAT ATT GTT AAA CTC CTT GAA TCC ATT 538 Gly Glu Val He Glu Gly Met Asp He Val Lys Leu Leu Glu Ser He 410 415 420
GGT TCC CAA TCT GGA ACA CCA AGT AAA ATT GCT AAA ATC TCA AAC TCT 586 Gly Ser Gin Ser Gly Thr Pro Ser Lys He Ala Lys He Ser Asn Ser 425 430 435
GGT GAA TTA TAAATAAAAT AAAACCAAAC CAAATAAAAT AAAT 629
Gly Glu Leu 440
(2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
Met Pro Ser Glu Ala Gly Lys Asp Pro Lys He Thr Asn Lys Val Phe 1 5 10 15
Phe Asp He Glu He Asp Asn Lys Pro Ala Gly Arg He Val Phe Gly 20 25 30
Leu Tyr Gly Lys Thr Val Pro Lys Thr Val Glu Asn Phe Arg Ala Leu 35 40 45
Cys Thr Gly Glu Lys Gly Leu Gly Thr Ser Gly Lys Pro Leu His Tyr 50 55 60 Lys Asp Ser Lys Phe His Arg He He Pro Asn Phe Met He Gin Gly 65 70 75 80
Gly Asp Phe Thr Arg Gly Asp Gly Thr Gly Gly Glu Ser He Tyr Gly 85 90 95
Lys Lys Phe Asn Asp Glu Asn Phe Lys He Lys His Ser Lys Pro Gly 100 105 110
Leu Leu Ser Met Ala Asn Ala Gly Pro Asn Thr Asn Gly Ser Gin Phe 115 120 125
Phe He Thr Thr Val Val Thr Ser Trp Leu Asp Gly Arg His Thr Val 130 135 140
Phe Gly Glu Val He Glu Gly Met Asp He Val Lys Leu Leu Glu Ser 145 150 155 160
He Gly Ser Gin Ser Gly Thr Pro Ser Lys He Ala Lys He Ser Asn 165 170 175
Ser Gly Glu Leu 180
(2) INFORMATION FOR SEQ ID N0:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
Asn Ser He Leu Asn Phe Ser Asn Ser Lys 1 5 10
(2) INFORMATION FOR SEQ ID N0:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
His Phe Vat Xaa He Ser Thr Gin Gly Arg Gly Asp His Asp Gin Xaa 1 5 10 15
Val Thr Xaa Tyr 20
(2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0:16:
Gly Thr Gly Ser Arg Thr He Val 1 5
(2) INFORMATION FOR SEQ ID N0:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:17:
Asp Ala Ser Arg Phe Asp Gly Ser Trp Ser Ser Xaa Val Leu Asp Lys 1 5 10 15
(2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
Leu Arg Tyr Thr Leu Asp Asn Val Asn Trp Val Glu Tyr Asn Asn Gly 1 5 10 15
Glu He Asn Ala Asn Lys 20
(2) INFORMATION FOR SEQ ID N0: 19:
( i ) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE : amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: l inear
(xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 19:
Xaa Arg Ser He Ala He His Pro Thr Tyr Asn Asn Hi s He Ser He 1 5 10 15
Arg
(2) INFORMATION FOR SEQ ID NO:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
Asp Asn Gly Gin Met Arg Trp Glu Gly Lys Ser Glu Asn He 1 5 10
(2) INFORMATION FOR SEQ ID NO:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:
Asp Leu Thr Phe He Thr Trp Gly Asn Asn Ala Val Tyr 1 5 10
(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 48 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:
Asp Ser Val Lys His Phe Val Ala He Ser Thr Gin Gly Arg Gly Asp
1 5 10 15
His Asp Gin Trp Vat Thr Ser Tyr Lys Leu Arg Tyr Thr Leu Asp Asn
20 25 30
Val Asn Trp Val Glu Tyr Asn Asn Gly Glu He He Asn Ala Asn Lys
35 40 45
(2) INFORMATION FOR SEQ ID NO:23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D> TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: GACTCGAGTC GACATCGATT TTJTTTTTTT TTTTT 35
(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: GACTCGAGTC GACATCGA 18 (2) INFORMATION FOR SEQ ID NO:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D). TOPOLOGY: linear
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:
Met Lys Val Glu Val Leu Pro Ala Leu Thr Asp Asn Tyr Met Tyr Leu 1 5 10 15
Val He Asp Asp Glu Thr Lys Glu Ala Ala He Val Asp Pro Val Gin 20 25 30
(2) INFORMATION FOR SEQ ID N0:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
Tyr Xaa He Gly Glu Pro Thr Val Pro Ser Thr Leu Ala Glu Glu Phe 1 5 10 15
Thr Tyr Asn Pro Phe 20
(2) INFORMATION FOR SEQ ID N0:27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: GCYTACNGAY AAYTAYATGT A 21 (2) INFORMATION FOR SEQ ID N0:28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID N0:28: GAYGAYGARA CNAARGARGC 20 (2) INFORMATION FOR SEQ ID N0:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: ATHGGGARCC ACGTGG 16 (2) INFORMATION FOR SEQ ID NO:30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: TADCCCTYGG TGCAGG 16 (2) INFORMATION FOR SEQ ID NO:31 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: GGCCACGCGT CGACTAGTAC TTTTTTTTTT TTTTTTT 37 (2) INFORMATION FOR SEQ ID NO:32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:
CUACUACUAC UAGGCCACGC GTCGACTAGT AC 32
(2) INFORMATION FOR SEQ ID NO:33:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 14 amino acids (B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:
Gly Lys Asp Pro Lys He Thr Asn Lys Val Phe Phe Asp Glu 1 5 10
(2) INFORMATION FOR SEQ ID NO:34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:
His Gly Val Lys 1

Claims

What is claimed is:
1. A substantially pure bisphosphonate binding protein or a fragment or homologue or mutant thereof.
2. The protein of Claim 1 which mediates the physiological effects ofbisphosphonates in vivo.
3. The protein of Claim 1 which binds any or all of pyridoxal phosphate, O-phosphorylethanolamine, O-phosphorylcholine, phosphatidyl ethanolamine or phospholipid bisphosphonate analogues.
4. The protein of Claim 1 prepared by recombinant methods, which is conjugated to an antibody, to a fusion protein, to a mimetic or mimetic analogue or to a solid support..
5. The protein Claim 1 chosen from DPI (SEQ.ID. NO.2); hDPl (SEQ.ID. NO.10); the Dictyostelium homologue thereof (Dd-hDP 1 ); DdCyP2 (SEQ.ID. NO.13); or homologues thereof; fragments thereof; and proteins containing any of the preceding which substantially retain bisphosphonate binding activity.
6. A method for purifying a bisphosphonate binding protein of claim 1, comprising the steps of:
(a) linking bisphosphonate to a chromatography column to produce an affinity column;
(b) loading material containing impure bisphosphonate binding material onto the affinity column such that it becomes bound thereto; and
(c) selectively eluting the binding protein from the affinity column in a more purified form.
7. A method for producing a bisphosphonate binding protein of Claim 1 , comprising the steps of:
(a) providing a Dictyostelium mutant which expresses a mutated bisphosphonate-binding-protein gene;
(b) cloning the wild-type gene corresponding to that mutated in step (a) to produce a cloned bisphosphonate-binding-protein gene; and
(c) expressing the cloned bisphosphonate-binding-protein gene to produce the bisphosphonate binding protein.
8. The protein of Claim 1 comprised in a pharmaceutical excipient.
9. Isolated DNA, vector or host cell which encodes the proteins of Claim 1 comprising the groups consisting of SEQ.ID NO. 1; SEQ.ID NO. 9 and SEQ.ID NO. 12.
10. A method for evaluating the therapeutic activity of abisphosphonate comprising the steps of:
(a) contacting the bisphosphonate with the bisphosphonate binding protein of Claim 1 ; and
(b) measuring the binding affinity of the bisphosphonate binding protein for the bisphosphonate.
1 1. An antibody which binds to the binding protein of Claim 1, or a
Figure imgf000094_0001
thereof.
12. A test kit comprising (i) the bisphosphonate binding protein of Claim 1 bound to a solid support.
13. A method of diagnosing calcium metabolism disorders in a mammal using a bisphosphonate binding protein of claim 1, an antibody of the bisphosphonate binding protein, or an antagonist of a bisphosphonate binding protein.
14. A method of treating a calcium metabolism disorder using a bisphosphonate binding protein of Claim 1, antibody thereto, or antagonist wherein the treatment for the regulation of bone metabolism, hypercalcaemia, bone metastases and osteoporosis.
15. A method of treating a calcium metabolism disorder using a bisphosphonate binding protein of
Claim 1, antibody thereto, or antagonist wherein the therapy involves the regulation of bone metabolism via interaction with cyclosporin.
PCT/US1998/002709 1997-02-14 1998-02-13 Phosphonate binding proteins WO1998036064A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU66540/98A AU6654098A (en) 1997-02-14 1998-02-13 Phosphonate binding proteins

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US3973897P 1997-02-14 1997-02-14
US60/039,738 1997-02-14

Publications (2)

Publication Number Publication Date
WO1998036064A1 true WO1998036064A1 (en) 1998-08-20
WO1998036064A9 WO1998036064A9 (en) 1999-01-14

Family

ID=21907103

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1998/002709 WO1998036064A1 (en) 1997-02-14 1998-02-13 Phosphonate binding proteins

Country Status (2)

Country Link
AU (1) AU6654098A (en)
WO (1) WO1998036064A1 (en)

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
DALEY-YATES P.T. ET AL.: "Plasma protein binding of APD: role of calcium and transferrin", CHEM. BIOL. INTERACT., vol. 81, no. 1-2, January 1992 (1992-01-01), pages 79 - 89, XP002068193 *
FUKUZAWA M. & OCHIAI H.: "Molecular cloning and characterization of the cDNA for Discoidin II of D. discoideum", PLANT CELL PHYSIOL., vol. 37, no. 4, June 1996 (1996-06-01), pages 505 - 514, XP002068547 *
FUKUZAWA M. & OCHIAI H.: "Monoclonal antibodies against Discoidin I and Discoidin II of the cellular slime mold D. discoideum.", J. BIOCHEM., vol. 103, 1988, pages 884 - 888, XP002068194 *
HALL T.J. ET AL.: "The role of reactive oxygen intermediates in osteoclastic bone resorption.", BIOCHEM. BIOPHYS. RES. COMM., vol. 207, no. 1, 6 February 1995 (1995-02-06), pages 280 - 287, XP002068360 *
RIDDERSTRÖM M. ET AL.: "Molecular cloning, heterologous expression and chracterization of human glyoxalase II", J. BIOL. CHEM., vol. 271, no. 1, 5 January 1996 (1996-01-05), pages 319 - 323, XP002068197 *
ROGERS M.J. ET AL.: "Structure-activity relationship of new heterocycle-containing bisphosphonates as inhibitors of bone resorption and as inhibitors of D. discoideum amoebae", MOL. PHARM., vol. 47, no. 2, February 1995 (1995-02-01), pages 398 - 402, XP002068195 *
VITTÉ C. ET AL.: "Bisphosphonates induce osteoblasts to secrete an inhibitor of osteoclast-mediated resorption", ENDOCRINOLOGY, vol. 137, no. 6, June 1996 (1996-06-01), pages 2324 - 2333, XP002068196 *

Also Published As

Publication number Publication date
AU6654098A (en) 1998-09-08

Similar Documents

Publication Publication Date Title
CZ48296A3 (en) Protocadherine proteins and their application
US20020119517A1 (en) Leptin induced genes
EP1007537A1 (en) Receptor protein designated 2f1
US7176180B2 (en) Type 2 cytokine receptor and nucleic acids encoding same
IL182749A (en) Vwfa and/or ant_ig domain containing proteins
US6703487B2 (en) Human pellino polypeptides
JPH11225774A (en) Member of immunoglobulin gene superfamily, pigr-1
WO2000017345A1 (en) Ly6h gene
JPH10504714A (en) Human potassium channel 1 and 2 proteins
WO1998036064A1 (en) Phosphonate binding proteins
US6835381B1 (en) Methods for modulating angiogenesis by using the anti-angiogenic angiotensin-7 and polynucleotides encoding therefor
WO1998036064A9 (en) Phosphonate binding proteins
CA2332379A1 (en) Molecules associated with apoptosis
WO2001036631A1 (en) Human nogo-c polynucleotide and polypeptide and their uses
US20040109854A1 (en) Novel traf6 inhibiting protein
WO1999007845A1 (en) Adenosine nucleotide translocator
WO1999056778A1 (en) The cytokine family member, 2-19
WO2000017349A1 (en) A HUMAN Hsg III GENE
WO1998017688A1 (en) Osteometrin related peptides and nucleic acids and diagnosis and therapies using them
JPH11215989A (en) Member of immunoglobulin gene super family, pigr-2
EP1881065A1 (en) DINO polypetide
WO2001019864A1 (en) Polynucleotides encoding novel human angiotensin ii-1 receptor proteins and the method of preparation and its use
EP1373485A2 (en) Ikk2 variant, dino gene, lectin-like receptor gene, and proteins encoded thereby
WO2000061598A2 (en) Rat kidney specific gene profilin-3
JP2000083672A (en) Apoptosis-associated protein

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM GW HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
COP Corrected version of pamphlet

Free format text: PAGES 1-3, 8, 11, 16, 21, 23, 26, 28, 31, 36, 38, 41, 43, 53, 71, 73, 78, 81 AND 83, DESCRIPTION, REPLACED BY NEW PAGES 1-3, 8, 11, 16, 21, 23, 26, 28, 31, 36, 38, 41, 43, 53, 57, 71, 73, 78, 81 AND 83; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

NENP Non-entry into the national phase

Ref country code: JP

Ref document number: 1998535892

Format of ref document f/p: F

122 Ep: pct application non-entry in european phase