GB2372994A - Cysteine protease polypeptide - Google Patents

Cysteine protease polypeptide Download PDF

Info

Publication number
GB2372994A
GB2372994A GB0128025A GB0128025A GB2372994A GB 2372994 A GB2372994 A GB 2372994A GB 0128025 A GB0128025 A GB 0128025A GB 0128025 A GB0128025 A GB 0128025A GB 2372994 A GB2372994 A GB 2372994A
Authority
GB
United Kingdom
Prior art keywords
polypeptide
sumo
sequence
leu
pro
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB0128025A
Other versions
GB0128025D0 (en
Inventor
Jingwen Chen
Barbara Susan Ink
Alan Peter Lewis
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Glaxo Group Ltd
Original Assignee
Glaxo Group Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Glaxo Group Ltd filed Critical Glaxo Group Ltd
Publication of GB0128025D0 publication Critical patent/GB0128025D0/en
Publication of GB2372994A publication Critical patent/GB2372994A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/48Hydrolases (3) acting on peptide bonds (3.4)
    • C12N9/50Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
    • C12N9/64Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue
    • C12N9/6421Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue from mammals
    • C12N9/6472Cysteine endopeptidases (3.4.22)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P13/00Drugs for disorders of the urinary system
    • A61P13/12Drugs for disorders of the urinary system of the kidneys
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P15/00Drugs for genital or sexual disorders; Contraceptives
    • A61P15/08Drugs for genital or sexual disorders; Contraceptives for gonadal disorders or for enhancing fertility, e.g. inducers of ovulation or of spermatogenesis
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • A61P25/28Drugs for disorders of the nervous system for treating neurodegenerative disorders of the central nervous system, e.g. nootropic agents, cognition enhancers, drugs for treating Alzheimer's disease or other forms of dementia
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • A61P25/30Drugs for disorders of the nervous system for treating abuse or dependence
    • A61P25/32Alcohol-abuse
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/12Antivirals
    • A61P31/14Antivirals for RNA viruses
    • A61P31/18Antivirals for RNA viruses for HIV
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/12Antivirals
    • A61P31/20Antivirals for DNA viruses
    • A61P31/22Antivirals for DNA viruses for herpes viruses
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P5/00Drugs for disorders of the endocrine system
    • A61P5/06Drugs for disorders of the endocrine system of the anterior pituitary hormones, e.g. TSH, ACTH, FSH, LH, PRL, GH
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00Drugs for disorders of the cardiovascular system
    • A61P9/12Antihypertensives

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Medicinal Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Animal Behavior & Ethology (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Virology (AREA)
  • Biomedical Technology (AREA)
  • Neurosurgery (AREA)
  • Molecular Biology (AREA)
  • Neurology (AREA)
  • Biotechnology (AREA)
  • Communicable Diseases (AREA)
  • Oncology (AREA)
  • Reproductive Health (AREA)
  • Endocrinology (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Addiction (AREA)
  • Psychiatry (AREA)
  • Genetics & Genomics (AREA)
  • Cardiology (AREA)
  • Hospice & Palliative Care (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Microbiology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • AIDS & HIV (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Urology & Nephrology (AREA)
  • Gynecology & Obstetrics (AREA)

Abstract

An isolated cysteine proteinase polypeptide comprising <SL> <LI>(i) the amino acid sequence of SEQ ID NO:2 or <LI>(ii) a variant thereof which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO or <LI>(iii) a fragment of (i) or (ii) which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO. </SL> The polypeptide may be used in a method to identify a substance useful in the treatment of diseases such as Cushing's disease, Addison's disease, hypertension, infertility, HBV, HIV, HSV, cancer, Huntington's, obesity and alcoholism.

Description

NEW PROTEIN
Field of the Invention The present invention relates to cysteine proteinase polypeptides.
Background of the Invention There are 4 main catalytic types of peptidases ; serine (which includes threonine peptidases) cysteine, aspartic and metallo. The serine, threonine and cysteine peptidases are catalytically very different from the aspartic and metallopeptidases in that the nucleophile of the catalytic site is part of an amino acid, whereas it is an activated water molecule in the other groups.
Peptidases in which the nucleophile is the sulphydryl group of a cysteine residue are known as cysteine-type peptidases. The catalytic mechanism is similar to that of the serine-type peptidases in that a nucleophile and a proton donor or general base is required, and the proton donor in all cysteine peptidases (in which it has been identified) is a histidine residue. Typical examples of cysteine peptidases include the cathepsins (excluding cathepsin G) and caspases.
Human ubiquitin-like protein specific protease (Ulp) is a deconjugating enzyme of SUMO/Sentrin, a ubiquitin-like protein, and its target protein. It is a cysteine protease that is unrelated to other known deubiquitinating enzymes, but shows similarity to certain viral proteases. The Ulps cleave at a glycine-glycine-X cleavage site that is similar to the consensus sites of adenovirus, African swine fever virus, and certain poxviruses. Ulps also cleave the SUMO precursor to generate the mature form of SUMO.
The human ULPs show homology to the yeast ULP1 and ULP2 genes. Yeast ULP1 cleaves the yeast homolog of SUMO, SMT3, from target proteins. ULP1 is important throughout the cell cycle and is essential for the G2/M phase. ULP2 also cleaves SMT3 from target proteins, although the target proteins of ULP2 appear to be different from ULP1. ULP2 is important for the recovery of cells from checkpoint arrest induced by DNA damage, inhibition of DNA replication, or defects in spindle assembly. Deletions of ULP2 in yeast show a phenotype of temperaturesensitive growth, abnormal cell morphology, decreased plasmid and chromosome stability, and a severe sporulation defect.
Summary of the Invention A novel cysteine proteinase, referred to herein as HIPHUM 134, is now provided. HIPHUM 134 is shown to be expressed in all tissues at various levels. It is highly expressed in adrenal, cerebellum, jejunum, lung, rectum, testis, tonsil and urinary bladder. It is also expressed at significant levels in fetal brain, ovary, thymus, thyroid and utreus. In addition, HIPHUM 134 is expressed in T cells, peripheral blood mononuclear cells (PBMCs), monocytes and dendritic cells. The novel cysteine proteinase is a screening target for the identification and development of novel pharmaceutical agents, including modulators of cysteine proteinase activity.
These agents may be used in the treatment and/or prophylaxis of disorders such as Cushing's disease, Addison's disease, hypertension, hyperkalemia, hypokalemia, polycystic ovary, infertility, hirustism and other metabolic diseases, HBV infection, HIV infection, HSV infection, breast cancer, Huntington's disease, central areolar choroidal dystrophy, lung oedemas, obesity, alcoholism and juvenille cystic kidneys.
Accordingly, the present invention provides an isolated cysteine proteinase polypeptide comprising (i) the amino acid sequence of SEQ ID NO: 2; (ii) a variant thereof which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO; or (iii) a fragment of (i) or (ii) which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO.
According to another aspect of the invention there is provided a polynucleotide encoding a polypeptide of the invention which polynucleotide includes a sequence comprising: (a) the nucleic acid sequence of SEQ ID NO: 1 and/or a sequence complementary thereto; (b) a sequence which hybridises under stringent conditions to a sequence
as defined in (a) ; (c) a sequence that is degenerate as a result of the genetic code to a sequence as defined in (a) or (b); or
(d) a sequence having at least 60% identity to a sequence as defined in (a), (b) or (c).
The invention also provides: an expression vector which comprises a polynucleotide of the invention and which is capable of expressing a polypeptide of the invention; a host cell comprising an expression vector of the invention; a method of producing a polypeptide of the invention which method comprises maintaining a host cell of the invention under conditions suitable for obtaining expression of the polypeptide and isolating the said polypeptide; an antibody specific for a polypeptide of the invention; a method for identification of a substance that modulates cysteine proteinase activity and/or expression, which method comprises contacting a polypeptide, polynucleotide, expression vector or host cell of the invention with a test substance and determining the effect of the test substance on the activity and/or expression of the said polypeptide or the polypeptide encoded by the said polynucleotide, thereby to determine whether the test substance modulates cysteine proteinase activity and/or expression; a compound which or modulates cysteine proteinase activity and which is identifiable by the method referred to above; a method of treating a subject having a disorder that is responsive to cysteine proteinase stimulation or modulation, which method comprises administering to said subject an effective amount of substance of the invention; and use of a substance that stimulates or modulates cysteine proteinase activity in the manufacture of a medicament for the treatment or prophylaxis of a disorder that is responsive to stimulation or modulation of cysteine proteinase activity.
Preferably the disorder is selected from Cushing's disease, Addison's disease, hypertension, hyperkalemia, hypokalemia, polycystic ovary, infertility, hirustism, HBV infection, HIV infection, HSV infection, breast cancer, Huntington's disease, central areolar choroidal dystrophy, lung oedemas, obesity, alcoholism and juvenille cystic kidneys.
Brief Description of the Drawings
Figure 1 shows the expression of HIPHUM 134 in various normal tissues.
Figure 2 shows the expression of HIPHUM 134 in colon, lung, breast, frontal cortex, cerebellum and in various brain diseases.
Figure 3 shows the expression of HIPHUM 134 in normal and diseased lung, in various cells, in normal and osteothritic cartlige and in rheumatoid arthritis.
Figure 3 shows the expression of HIPHUM 134 in various viral infections.
Brief Description of the Sequences SEQ ID NO: 1 shows the nucleotide and amino acid sequences of human protein HIPHUM 134.
SEQ ID NO: 2 is the amino acid sequence alone of HIPHUM 134.
Detailed Description of the Invention Throughout the present specification and the accompanying claims the words
"comprise"and"include"and variations such as"comprises","comprising", "includes"and"including"are to be interpreted inclusively. That is, these words are intended to convey the possible inclusion of other elements or integers not specifically recited, where the context allows.
The present invention relates to a human cysteine proteinase, referred to herein as HIPHUM 134, and variants thereof. Sequence information for HIPHUM 134 is provided in SEQ ID NO: 1 (nucleotide and amino acid) and in SEQ ID NO: 2.
A polypeptide of the invention thus consists essentially of the amino acid sequence of SEQ ID NO: 2 or of a variant of that sequence, or of a fragment of either thereof.
Polypeptides of the invention may be in a substantially isolated form. It will be understood that the polypeptide may be mixed with carriers or diluents which will not interfere with the intended purpose of the polypeptide and still be regarded as substantially isolated. A polypeptide of the invention may also be in a substantially purified form, in which case it will generally comprise the polypeptide in a preparation in which more than 50%, e. g. more than 80%, 90%, 95% or 99%, by weight of the polypeptide in the preparation is a polypeptide of the invention.
Routine methods, can be employed to purify and/or synthesise the proteins according to the invention. Such methods are well understood by persons skilled in the art, and
include techniques such as those disclosed in Sambrook et al, Molecular Cloning : a Laboratory Manual, 2nd Edition, CSH Laboratory Press, 1989, the disclosure of which is included herein in its entirety by way of reference.
The term"variant"refers to a polypeptide which has a same essential character or basic biological functionality as HIPHUM 134. The essential character of HIPHUM 134 can be defined as follows: HIPHUM 134 is a cysteine proteinase.
Preferably a variant polypeptide is one which binds to the same a target protein as HIPHUM 134. Preferably the polypeptide is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO. Preferably the polypeptide is capable of cleaving the SUMO precursor to its mature and active form.
Preferably the polypeptide comprises a catalytic domain which comprises a catalytic triad. The catalytic triad is typically composed of a cysteine residue, histidine residue, aspartate residue and a glutamin residue which together form the oxyanion hole at the active site. Preferably the polypeptide comprises one or more nuclear localisation signals. The polypeptide may comprise a peroxisomal targeting signal, preferably RLPPRWSQL. The polypeptide may comprise a vacuolar targeting motif, preferably TLPN. A polypeptide having a same essential character as HIPHUM 134 may be identified by monitoring for a function the cysteine proteinase selected from cleavage of SUMO from a target protein, activation of SUMO and alteration in activity or subcellular localisation of a target protein. Typical target proteins include transcription factors, such as IKBANF-KB, p53, C-jun and HIPK2, proteins involved in intracellular transport, such as RanGAPl, GLUT I and GLUT4, proteins involved in the intracellular response to DNA damage and repair, such as Topoisomerase 1, Topoisomerase 2, MDM2 and WRN, proteins involved in nuclear localisation, such as PML and Sp 100, and viral proteins such as IE2. Other target proteins may be identified, for example using the yeast-2-hybrid assay. Such target proteins that have been identified using the yeast-2-hybrid screen include TNF and FAS receptors, CEN, and rad 52.
SUMO binds to various transcription factors and is involved in their regulation. Removal of SUMO from IKB (X would allow for activation of NF-KB and produce an inflammatory response. SUMO conjugation increases the transcriptional activity of p53, therefore, deconjugation of SUMO may lower the transcriptional
activity ofp53. C-jun modification by SUMO decreases its transcriptional activity. Therefore, Ulp proteases may regulate tumorsuppressor activity with a role in cell cycle arrest, apoptosis, and cellular responses to mitogens, stress, or inflammatory stimuli.
SUMO conjugates to a nuclear kinase, HIPK2, and is needed for the formation of nuclear bodies containing HIPK2. HIPK2 can act as a transcriptional corepressor for homeoproteins. Therefore, deconjugation of SUMO by the protease may regulate the activity of homeodomain transcription factors and play a role in development.
SUMO modification of RanGAP 1 targets the complex to the nuclear pore complex from the cytosol. Ulp protease may be involved in the regulation of the levels of the Ran GAP 1 within the cytosol and pore complex.
SUMO also modifies GLUT1 and GLUT4 that are involved in insulin stimulated glucose uptake. Deconjugation of SUMO would regulate the amount of glucose uptake in the cell.
The Ulp protease may regulate the cellular response to DNA damage and DNA repair. SUMO conjugates to TOPI and TOP2 when treated with DNA damaging agents. MDM2 is also conjugated to SUMO to prevent self-ubiquitination
ofmdm2, thus enhancing E3 ligase activity ofp53. A Ulp protease may regulate the conjugation of SUMO to MDM2, thereby affecting stability ofp53. SUMO also conjugates to WRN that functions as a DNA helicase, exonuclease, and ATPase.
Deletions of WRN which are found in Werner's syndrome impair its nuclear localization. It is possible that cleavage of SUMO from WRN would alter the localization of WRN and contribute to its dysregulation during disease.
SUMO modification of PML is needed for its localization into nuclear bodies containing DAXX. PML is involved in many cellular functions such as the interferonresponse, immune surveillance, apoptosis, and as a tumor suppressor. It appears that the function of PML is to meditate the regulation of transcription. Mice lacking PML have shown altered transcription in cellular differentiation by way of nuclear receptors, apoptosis mediated by DAXX and p53, inhibition of growth mediated by pRb and p53, and the immune response through interferons. Therefore, regulating the conjugation-deconjugation of SUMO to PML through Ulp protease may affect these pathways.
Although the function of the interferon-induced protein, SplOO, is unknown, conjugation to SUMO takes place in the nucleus. The Ulp protease may affect the activity of Sp 100 in response to interferon.
SUMO conjugates to the human cytomegalovirus protein, IE2. IE2 functions as a strong transactivator of both viral and cellular promoters. Overexpression of SUMO reduces the transactivation activity of IE2. Therefore, the Ulp protease may function to control the level of IE2 sumoylation, thus, affecting both the virus life cycle and host interactions to virus infection.
Preferably a polypeptide of the invention is located in the nucleus when expressed in a cell. More preferably a polypeptide of the invention is capable of binding DNA.
In another aspect of the invention, a variant is one which does not show the same activity as HIPHUM 134 but is one which inhibits a basic function of HIPHUM 134. For example, a variant polypeptide is one which inhibits protease activity of HIPHUM 134, for example by binding to a target protein to prevent a target protein binding to HIPHUM 134.
Typically, polypeptides with more than about 65% identity preferably at least 80% or at least 90% and particularly preferably at least 95% at least 97% or at least 99% identity, with the amino acid sequences of SEQ ID NO: 2, are considered as variants of the proteins. Such variants may include allelic variants and the deletion, modification or addition of single amino acids or groups of amino acids within the protein sequence, as long as the peptide maintains a basic biological functionality of the HIPHUM 134 receptor.
Amino acid substitutions may be made, for example from 1,2 or 3 to 10,20 or 30 substitutions. The modified polypeptide generally retains activity as a cysteine proteinase. Conservative substitutions may be made, for example according to the following Table. Amino acids in the same block in the second column and preferably in the same line in the third column may be substituted for each other.
ALIPHATIC Non-polar GAP ILV
Polar-uncharged C S T M N Q Polar-charged D E K R AROMATIC H F W Y Shorter polypeptide sequences are within the scope of the invention. For example, a peptide of at least 20 amino acids or up to 50,60, 70,80, 100,150, 200, 300 or 400 amino acids in length is considered to fall within the scope of the invention as long as it demonstrates a basic biological functionality of HIPHUM 134.
In particular, but not exclusively, this aspect of the invention encompasses the situation when the protein is a fragment of the complete protein sequence and may represent a catalytic domain or substrate binding domain. Preferred fragments include the C-terminal catalytic domain, the N-terminal domain which is not conserved between different members of the Ulp family of proteases, the transmembrane domain, the peroxisome localisation signal or the vacuolar targeting motif. Such fragments can be used to construct chimeric proteases preferably with another protease, more preferably with another member of the family of cysteine proteinases. Such chimeric proteases may comprise different domains from different cysteine proteinases. For example, a fragment comprising a C-terminal catalytic domain of a polypeptide of the invention may be fused to a N-terminal domain of a different cysteine proteinase.
Fragments of HIPHUM 134 or a variant thereof can also be used to raise anti HIPHUM 134 antibodies. In this embodiment the fragment may comprise an epitope of the HIPHUM 134 polypeptide and may otherwise not demonstrate the catalytic, substrate binding or other properties of HIPHUM 134.
Polypeptides of the invention may be chemically modified, e. g. posttranslationally modified. For example, they may be glycosylated or comprise modified amino acid residues. They may also be modified by the addition of histidine residues to assist their purification or by the addition of a signal sequence to
promote insertion into the cell membrane. Such modified polypeptides fall within the scope of the term"polypeptide"of the invention.
The invention also includes nucleotide sequences that encode for HIPHUM 134 or variant thereof as well as nucleotide sequences which are complementary thereto. The nucleotide sequence may be RNA or DNA including genomic DNA, synthetic DNA or cDNA. Preferably the nucleotide sequence is a DNA sequence and most preferably, a cDNA sequence. Nucleotide sequence information is provided in SEQ ID NO: 1. Such nucleotides can be isolated from human cells or synthesised according to methods well known in the art, as described by way of example in Sambrook et al, 1989.
Typically a polynucleotide of the invention comprises a contiguous sequence of nucleotides which is capable of hybridizing under selective conditions to the coding sequence or the complement of the coding sequence of SEQ ID NO: 1.
A polynucleotide of the invention can hydridize to the coding sequence or the complement of the coding sequence of SEQ ID NO: 1 at a level significantly above background. Background hybridization may occur, for example, because of other cDNAs present in a cDNA library. The signal level generated by the interaction between a polynucleotide of the invention and the coding sequence or complement of the coding sequence of SEQ ID NO: 1 is typically at least 10 fold, preferably at least 100 fold, as intense as interactions between other polynucleotides and the coding sequence of SEQ ID NO: 1. The intensity of interaction may be measured, for example, by radiolabelling the probe, e. g. with 32p. Selective hybridisation may typically be achieved using conditions of medium to high stringency. However, such hybridisation may be carried out under any suitable conditions known in the art (see Sambrook et al, 1989. For example, if high stringency is required suitable conditions include from 0.1 to 0.2 x SSC at 60 De up to 65 C. If lower stringency is required suitable conditions include 2 x SSC at 60 C.
The coding sequence of SEQ ID NO: 1 may be modified by nucleotide substitutions, for example from 1,2 or 3 to 10,25, 50 or 100 substitutions. The polynucleotide of SEQ ID NO: 1 may alternatively or additionally be modified by one or more insertions and/or deletions and/or by an extension at either or both ends.
A polynucleotide may include one or more introns, for example may comprise genomic DNA. Additional sequences such as signal sequences which may assist in
insertion of the polypeptide in a cell membrane may also be included. The modified polynucleotide generally encodes a polypeptide which has a HIPHUM 134 activity. Alternatively, a polynucleotide encodes a catalytic or substrate-binding portion of a polypeptide or a polypeptide which inhibits HIPHUM 134 activity.
Degenerate substitutions may be made and/or substitutions may be made which would result in a conservative amino acid substitution when the modified sequence is translated, for example as shown in the Table above.
A nucleotide sequence which is capable of selectively hybridizing to the complement of the DNA coding sequence of SEQ ID NO: 1 will generally have at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the coding sequence of SEQ ID NO: 1 over a region of at least 20, preferably at least 30, for instance at least 40, at least 60, more preferably at least 100, for instance at least 200 or at least 500 contiguous nucleotides or most preferably over the full length of SEQ ID NO: 1.
For example the UWGCG Package provides the BESTFIT program which can be used to calculate homology (for example used on its default settings) (Devereux et al (1984) Nucleic Acids Research 12, p387-395). The PILEUP and BLAST algorithms can be used to calculate homology or line up sequences (typically on their default settings), for example as described in Altschul (1993) J. Mol. Evol.
36: 290-300; Altschul et al (1990) J. Mol. Bid. 215: 403-10.
Software for performing BLAST analyses is publicly available through the National Centre for Biotechnology Information (http : //www. ncbi. nim. nih. gov/).
This algorithm involves first identifying high scoring sequence pair (HSPs) by identifying short words of length W in the query sequence that either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighbourhood word score threshold (Altschul et al, 1990). These initial neighbourhood word hits act as seeds for initiating searches to find HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Extensions for the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached.
The BLAST algorithm parameters W, T and X determine the sensitivity and speed of the alignment. The BLAST program uses as defaults a word length (W) of 11, the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1992) Proc. Natl. Acad.
Sci. USA 89 : 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands.
The BLAST algorithm performs a statistical analysis of the similarity between two sequences ; see e. g. , Karlin and Altschul (1993) Proc. Natl. Acad. Sci.
USA 90 : 5873-5787. One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P (N) ), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a sequence is considered similar to another sequence if the smallest sum probability in comparison of the first sequence to the second sequence is less than about 1, preferably less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
Any combination of the above mentioned degrees of sequence identity and minimum sizes may be used to define polynucleotides of the invention, with the more stringent combinations (i. e. higher sequence identity over longer lengths) being preferred. Thus, for example a polynucleotide which has at least 90% sequence identity over 25, preferably over 30 nucleotides forms one aspect of the invention, as does a polynucleotide which has at least 95% sequence identity over 40 nucleotides.
The nucleotides according to the invention have utility in production of the proteins according to the invention, which may take place in vitro, in vivo or ex vivo.
The nucleotides may be involved in recombinant protein synthesis or indeed as therapeutic agents in their own right, utilised in gene therapy techniques. Nucleotides complementary to those encoding HIPHUM 134, or antisense sequences, may also be used in gene therapy.
Polynucleotides of the invention may be used as a primer, e. g. a PCR primer, a primer for an alternative amplification reaction, a probe e. g. labelled with a revealing label by conventional means using radioactive or non-radioactive labels, or the polynucleotides may be cloned into vectors.
Such primers, probes and other fragments will preferably be at least 10, preferably at least 15 or at least 20, for example at least 25, at least 30 or at least 40 nucleotides in length. They will typically be up to 40,50, 60,70, 100 or 150
nucleotides in length. Probes and fragments can be longer than 150 nucleotides in length, for example up to 200, 300, 400, 500, 600, 700, 800, 900, 1000,1100, 1200,1300, 1400 or 1500 nucleotides in length, or even up to a few nucleotides, such as five or ten nucleotides, short of the coding sequence of SEQ ID NO: 1.
The present invention also includes expression vectors that comprise nucleotide sequences encoding the proteins or variants thereof of the invention. Such expression vectors are routinely constructed in the art of molecular biology and may for example involve the use of plasmid DNA and appropriate initiators, promoters, enhancers and other elements, such as for example polyadenylation signals which may be necessary, and which are positioned in the correct orientation, in order to allow for protein expression. Other suitable vectors would be apparent to persons skilled in the art. By way of further example in this regard we refer to Sambrook et al. 1989.
Polynucleotides according to the invention may also be inserted into the vectors described above in an antisense orientation in order to provide for the production of antisense RNA. Antisense RNA or other antisense polynucleotides may also be produced by synthetic means. Such antisense polynucleotides may be used as test compounds in the assays of the invention or may be useful in a method of treatment of the human or animal body by therapy.
Preferably, a polynucleotide of the invention or for use in the invention in a vector is operably linked to a control sequence which is capable of providing for the expression of the coding sequence by the host cell, i. e. the vector is an expression vector. The term"operably linked"refers to a juxtaposition wherein the components described are in a relationship permitting them to function in their intended manner.
A regulatory sequence, such as a promoter,"operably linked"to a coding sequence is positioned in such a way that expression of the coding sequence is achieved under conditions compatible with the regulatory sequence.
The vectors may be for example, plasmid, virus or phage vectors provided with a origin of replication, optionally a promoter for the expression of the said polynucleotide and optionally a regulator of the promoter. The vectors may contain one or more selectable marker genes, for example an ampicillin resistance gene in the case of a bacterial plasmid or a resistance gene for a fungal vector. Vectors may
be used in vitro, for example for the production of DNA or RNA or used to transfect or transform a host cell, for example, a mammalian host cell. The vectors may also be adapted to be used in vivo, for example in a method of gene therapy.
Promoters and other expression regulation signals may be selected to be compatible with the host cell for which expression is designed. For example, yeast promoters include S. cerevisiae GAL4 and ADH promoters, S. pombe nmtl and adh promoter. Mammalian promoters include the metallothionein promoter which can be induced in response to heavy metals such as cadmium. Viral promoters such as the SV40 large T antigen promoter or adenovirus promoters may also be used. All these promoters are readily available in the art.
Mammalian promoters, such as-actin promoters, may be used. Tissuespecific promoters are especially preferred. Viral promoters may also be used, for example the Moloney murine leukaemia virus long terminal repeat (MMLV LTR), the rous sarcoma virus (RSV) LTR promoter, the SV40 promoter, the human cytomegalovirus (CMV) IE promoter, adenovirus, HSV promoters (such as the HSV IE promoters), or HPV promoters, particularly the HPV upstream regulatory region (URR). Viral promoters are readily available in the art.
The vector may further include sequences flanking the polynucleotide giving rise to polynucleotides which comprise sequences homologous to eukaryotic genomic sequences, preferably mammalian genomic sequences, or viral genomic sequences. This will allow the introduction of the polynucleotides of the invention into the genome of eukaryotic cells or viruses by homologous recombination. In particular, a plasmid vector comprising the expression cassette flanked by viral sequences can be used to prepare a viral vector suitable for delivering the polynucleotides of the invention to a mammalian cell. Other examples of suitable viral vectors include herpes simplex viral vectors and retroviruses, including lentiviruses, adenoviruses, adeno-associated viruses and HPV viruses. Gene transfer techniques using these viruses are known to those skilled in the art. Retrovirus vectors for example may be used to stably integrate the polynucleotide giving rise to the polynucleotide into the host genome. Replication-defective adenovirus vectors by contrast remain episomal and therefore allow transient expression.
The invention also includes cells that have been modified to express the HIPHUM 134 polypeptide or a variant thereof. Such cells include transient, or
preferably stable higher eukaryotic cell lines, such as mammalian cells or insect cells, using for example a baculovirus expression system, lower eukaryotic cells, such as yeast or prokaryotic cells such as bacterial cells.-Particular examples of cells which may be modified by insertion of vectors encoding for a polypeptide according to the invention include mammalian HEK293T, CHO, HeLa and COS cells.
Preferably the cell line selected will be one which is not only stable, but also allows for mature glycosylation of a polypeptide. Expression may be achieved in transformed oocytes. A polypeptide of the invention may be expressed in cells of a transgenic non-human animal, preferably a mouse. A transgenic non-human animal expressing a polypeptide of the invention is included within the scope of the invention. A polypeptide of the invention may also be expressed in Xenopus laevis oocytes.
A polypeptide of the invention may be over-expressed in bacterial cells, such as E. Coli, and isolated from the bacterial culture.
According to another aspect, the present invention also relates to antibodies, specific for a polypeptide of the invention. Such antibodies are for example useful in purification, isolation or screening methods involving immunoprecipitation techniques or, indeed, as therapeutic agents in their own right.
Antibodies may be raised against specific epitopes of the polypeptides according to the invention. Such antibodies may be used to block substrate binding to the receptor. An antibody, or other compound,"specifically binds"to a protein when it binds with preferential or high affinity to the protein for which it is specific but does substantially bind not bind or binds with only low affinity to other proteins.
A variety of protocols for competitive binding or immunoradiometric assays to determine the specific binding capability of an antibody are well known in the art (see for example Maddox et al, J. Exp. Med. 158,1211-1226, 1993). Such immunoassays typically involve the formation of complexes between the specific protein and its antibody and the measurement of complex formation.
Antibodies of the invention may be antibodies to human polypeptides or fragments thereof. For the purposes of this invention, the term"antibody", unless specified to the contrary, includes fragments which bind a polypeptide of the invention. Such fragments include Fv, F (ab') and F (ab') 2 fragments, as well as
single chain antibodies. Furthermore, the antibodies and fragment thereof may be chimeric antibodies, CDR-grafted antibodies or humanised antibodies.
Antibodies may be used in a method for detecting polypeptides of the invention in a biological sample, which method comprises: I providing an antibody of the invention ; II incubating a biological sample with said antibody under conditions which allow for the formation of an antibody-antigen complex; and III determining whether antibody-antigen complex comprising said antibody is formed.
A sample may be for example a tissue extract, blood, serum and saliva.
Antibodies of the invention may be bound to a solid support and/or packaged into kits in a suitable container along with suitable reagents, controls, instructions, etc.
Antibodies may be linked to a revealing label and thus may be suitable for use in methods of in vivo HIPHUM 134 imaging.
Antibodies of the invention can be produced by any suitable method. Means for preparing and characterising antibodies are well known in the art, see for example Harlow and Lane (1988)"Antibodies : A Laboratory Manual", Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. For example, an antibody may be produced by raising antibody in a host animal against the whole polypeptide or a fragment thereof, for example an antigenic epitope thereof, herein after the "immunogen".
A method for producing a polyclonal antibody comprises immunising a suitable host animal, for example an experimental animal, with the immunogen and isolating immunoglobulins from the animal's serum. The animal may therefore be inoculated with the immunogen, blood subsequently removed from the animal and the IgG fraction purified.
A method for producing a monoclonal antibody comprises immortalising cells which produce the desired antibody. Hybridoma cells may be produced by fusing spleen cells from an inoculated experimental animal with tumour cells (Kohler and Milstein (1975) Nature 256,495-497).
An immortalized cell producing the desired antibody may be selected by a conventional procedure. The hybridomas may be grown in culture or injected intraperitoneally for formation of ascites fluid or into the blood stream of an
allogenic host or immunocompromised host. Human antibody may be prepared by in vitro immunisation of human lymphocytes, followed by transformation of the lymphocytes with Epstein-Barr virus.
For the production of both monoclonal and polyclonal antibodies, the experimental animal is suitably a goat, rabbit, rat or mouse. If desired, the immunogen may be administered as a conjugate in which the immunogen is coupled, for example via a side chain of one of the amino acid residues, to a suitable carrier.
The carrier molecule is typically a physiologically acceptable carrier. The antibody obtained may be isolated and, if desired, purified.
An important aspect of the present invention is the use of polypeptides according to the invention in screening methods. The screening methods may be used to identify substances that bind to cysteine proteinase and in particular which bind to HIPHUM 134 such as a substrate for the enzyme. Screening methods may also be used to identify agonists or antagonists which may modulate cysteine proteinase activity, inhibitors or activators of HIPHUM 134 activity, and/or agents which up-regulate or down-regulate HIPHUM 134 expression.
Any suitable format may be used for the assay. In general terms such screening methods may involve contacting a polypeptide of the invention with a test substance and monitoring for binding of the test substance to the polypeptide or measuring protease activity. A polypeptide of the invention may be incubated with a test substance. Modulation of cysteine proteinase activity may be determined. In a preferred aspect, the assay is a cell-based assay. Preferably the assay may be carried out in a single well of a microtitre plate. Assay formats which allow high throughput screening are preferred.
A typical assay for determining whether a test substance acts as an inhibitor or activator of HIPHUM 134 activity comprises contacting a fluorescent or colourimetric substrate with a polypeptide of the invention and a test substance and monitoring protease activity by monitoring any change in the fluorescence or light emission. Any changes in the fluorescence of a substrate as a result of its proteolytic degradation by a polypeptide of the invention may be detected using a fluorescence plate reader. Colourimetic changes may be measured using a spectrophotometer.
The inhibitory or stimulatory activity of a test substance may be determined by comparing any fluorescent or colourimetric changes observed in the presence of a
test substance to any changes observed in the absence of a test substance and/or in the presence of a known inhibitor of HIPHUM 134 activity.
Modulator activity can be determined by contacting cells expressing a polypeptide of the invention with a substance under investigation and by monitoring an effect mediated by the polypeptide. The cells expressing the polypeptide may be in vitro or in vivo. The polypeptide of the invention may be naturally or recombinantly expressed. Preferably, the assay is carried out in vitro using cells expressing recombinant polypeptide. Preferably, control experiments are carried out on cells which do not express the polypeptide of the invention to establish whether the observed responses are the result of activation of the polypeptide.
The binding of a test substance to a polypeptide of the invention can be determined directly. For example, a radiolabelled test substance can be incubated with the polypeptide of the invention and binding of the test substance to the polypeptide can be monitored. Typically, the radiolabelled test substance can be incubated with cell membranes containing the polypeptide until equilibrium is reached. The membranes can then be separated from a non-bound test substance and dissolved in scintillation fluid to allow the radioactive content to be determined by scintillation counting. Non-specific binding of the test substance may also be determined by carrying out a competitive binding assay.
Substances that inhibit the interaction of a polypeptide of the invention with a HIPHUM 134 substrate or with another protease may also be identified through a yeast 2-hybrid assay or other protein interaction assay such as a coimmunoprecipitation or an ELISA based technique.
Assays may be carried out using cells expressing HIPHUM 134, and incubating such cells with the test substance optionally in the presence of a HIPHUM 134 substrate. The results of the assay are compared to the results obtained using the same assay in the absence of the test substance. Cells expressing HIPHUM 134 constitutively may be provided for use in assays for HIPHUM 134 function.
Additional test substances may be introduced in any assay to look for inhibitors or activators of substrate binding or inhibitors or activators of protease activity.
Assays may also be carried out to identify substances which modify HIPHUM 134 expression, for example substances which up-or down-regulate expression. Such assays may be carried out for example by using antibodies for
HIPHUM 134 to monitor levels of HIPHUM 134 expression. Other assays which can be used to monitor the effect of a test substance on HIPHUM 134 expression include using a reporter gene construct driven by the HIPHUM 134 regulatory sequences as the promoter sequence and monitoring for expression of the reporter polypeptide.
Additional control experiments may be carried out.
Suitable test substances which can be tested in the above assays include combinatorial libraries, defined chemical entities and compounds, peptide and peptide mimetics, oligonucleotides and natural product libraries, such as display (e. g. phage display libraries) and antibody products.
Typically, organic molecules will be screened, preferably small organic molecules which have a molecular weight of from 50 to 2500 daltons. Candidate products can be biomolecules including, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof. Candidate agents are obtained from a wide variety of sources including libraries of synthetic or natural compounds. Known pharmacological agents may be subjected to directed or random chemical modifications, such as acylation, alkylation, esterification, amidification, etc. to produce structural analogs.
Test substances may be used in an initial screen of, for example, 10 substances per reaction, and the substances of these batches which show inhibition or
activation tested individually. Test substances may be used at a concentration of from InM to 10mM, preferably from, 100nM to lOOOM or from le. M to lOOM, more preferably from luM to 1OM. Preferably, the activity of a test substance is compared to the activity shown by a known activator or inhibitor. A test substance which acts as an inhibitor may produce a 50% inhibition of activity of the receptor.
Alternatively a test substance which acts as an activator may produce 50% of the maximal activity produced using a known activator.
Another aspect of the present invention is the use of polynucleotides encoding the HIPHUM 134 polypeptides of the invention to identify mutations in HIPHUM 134 genes which may be implicated in human disorders. Identification of such mutations may be used to assist in diagnosis or susceptibility to such disorders and in assessing the physiology of such disorders. Polynucleotides may also be used in hybridisation studies to monitor for up-or down-regulation of HIPHUM 134
expression. Polynucleotides such as SEQ ID NO : 1 or fragments thereof may be used to identify allelic variants, genomic DNA and species variants.
The present invention provides a method for detecting variation in the expressed products encoded by HIPHUM 134 genes. This may comprise determining the level of an HIPHUM 134 expressed in cells or determining specific alterations in the expressed product. Sequences of interest for diagnostic purposes include, but are not limited to, the conserved portions as identified by sequence similarity and conservation of intron/exon structure. The diagnosis may be performed in conjunction with kindred studies to determine whether a mutation of interest cosegregates with disease phenotype in a family.
Diagnostic procedures may be performed on polynucleotides isolated from an individual or alternatively, may be performed in situ directly upon tissue sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections, such that no nucleic acid purification is necessary. Appropriate procedures are described in, for example, Nuovo, G. J., 1992,"PCR In Situ Hybridization: Protocols And Applications", Raven Press, NY). Such analysis techniques include, DNA or RNA blotting analyses, single stranded conformational polymorphism analyses, in situ hybridization assays, and polymerase chain reaction analyses. Such analyses may reveal both quantitative aspects of the expression pattern of a HIPHUM 134, and qualitative aspects of HIPHUM 134 expression and/or composition.
Alternative diagnostic methods for the detection of HIPHUM 134 nucleic acid molecules may involve their amplification, e. g. by PCR (the experimental embodiment set forth in U. S. Patent No. 4,683, 202), ligase chain reaction (Barany, 1991, Proc. Natl. Acad. Sci. USA 88: 189-193), self sustained sequence replication (Guatelli et al., 1990, Proc. Natl. Acad. Sci. USA 87: 1874-1878), transcriptional amplification system (Kwoh et al., 1989, Proc. Natl. Acad. Sci. 15 USA 86: 11731177), Q-Beta Replicase (Lizardi et al., 1988, Bio/Technology 6: 1197) or any other nucleic acid amplification method, followed by the detection of the amplified molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers.
Particularly suitable diagnostic methods are chip-based DNA technologies such as those described by Hacia et al., 1996, Nature Genetics 14: 441-447 and
Shoemaker et al., 1996, Nature Genetics 14 : 450-456. Briefly, these techniques involve quantitative methods for analyzing large numbers of nucleic acid sequence targets rapidly and accurately. By tagging with oligonucleotides or using fixed probe arrays, one can employ chip technology to segregate target molecules as high density arrays and screen these molecules on the basis of hybridization.
Following detection, the results seen in a given patient may be compared with a statistically significant reference group of normal patients and patients that have HIPHUM 134 related pathologies. In this way, it is possible to correlate the amount or kind of HIPHUM 134 encoded product detected with various clinical states or predisposition to clinical states.
Another aspect of the present invention is the use of the substances that have been identified by screening techniques referred to above in the treatment of disease states, which are responsive to regulation of cysteine proteinase activity. The treatment may be therapeutic or prophylactic. The condition of a patient suffering from such a disease state can thus be improved.
In particular, such substances may be used in the treatment of Cushing's disease, Addison's disease, hypertension, hyperkalemia, hypokalemia, polycystic ovary, infertility, hirustism, HBV infection, HIV infection, HSV infection, breast cancer, Huntington's disease, central areolar choroidal dystrophy, lung oedemas, obesity, alcoholism and juvenille cystic kidneys.
Additional disease states that may be treated include Alzheimer's disease, parasupranuclear palsey, ALS, colon cancer, autoimmune diseases and inflammatory diseases.
Substances that act as inhibitors of HIPHUM 134 activity may be used in the treatment of disease states in which HIPHUM 134 expression is up-regulated such as HIV and HBV infection. Substances that act as activators of HIPHUM 134 activity may be used in the treatment of disease states in which expression of HIPHUM 134 is down-regulated such as Huntington's disease, breast cancer and HSV infection.
Substances identified according to the screening methods outlined above may be formulated with standard pharmaceutically acceptable carriers and/or excipients as is routine in the pharmaceutical art. For example, a suitable substance may be dissolved in physiological saline or water for injections. The exact nature of a formulation will depend upon several factors including the particular substance to be
administered and the desired route of administration. Suitable types of formulation are fully described in Remington's Pharmaceutical Sciences, Mack Publishing Company, Eastern Pennsylvania, 17th Ed. 1985, the disclosure of which is included herein of its entirety by way of reference.
The substances may be administered by enteral or parenteral routes such as via oral, buccal, anal, pulmonary, intravenous, intra-arterial, intramuscular, intraperitoneal, topical or other appropriate administration routes.
A therapeutically effective amount of a modulator is administered to a patient. The dose of a modulator may be determined according to various parameters, especially according to the substance used; the age, weight and condition of the patient to be treated; the route of administration; and the required regimen. A physician will be able to determine the required route of administration and dosage for any particular patient. A typical daily dose is from about 0. 1 to 50 mg per kg of body weight, according to the activity of the specific modulator, the age, weight and conditions of the subject to be treated, the type and severity of the degeneration and the frequency and route of administration. Preferably, daily dosage levels are from 5 mg to 2 g.
Nucleic acid encoding HIPHUM 134 or a variant thereof which inhibits HIPHUM 134 activity may be administered to the mammal. In particular, a nucleic acid encoding a polypeptide with HIPHUM 134 activity may be administered to a subject suffering from a condition in which HIPHUM 134 expression is downregulated, such as Huntington's disease, breast cancer and HSV infection. A nucleic acid encoding a variant of HIPHUM 134 that inhibits HIPHUM 134 activity may be administered to a patient suffering from a condition in which HIPHUM 134 expression is up-regulated such as HIV and HBV infection. Nucleic acid, such as RNA or DNA, and preferably, DNA, is provided in the form of a vector, such as the polynucleotides described above, which may be expressed in the cells of the mammal.
Nucleic acid encoding the polypeptide may be administered by any available technique. For example, the nucleic acid may be introduced by needle injection, preferably intradermally, subcutaneously or intramuscularly. Alternatively, the nucleic acid may be delivered directly across the skin using a nucleic acid delivery device such as particle-mediated gene delivery. The nucleic acid may be
administered topically to the skin, or to mucosal surfaces for example by intranasal, oral, intravaginal or intrarectal administration.
Uptake of nucleic acid constructs may be enhanced by several known transfection techniques, for example those including the use of transfection agents.
Examples of these agents includes cationic agents, for example, calcium phosphate and DEAE-Dextran and lipofectants, for example, lipofectam and transfectam. The dosage of the nucleic acid to be administered can be altered. Typically the nucleic acid is administered in the range of Ipg to Img, preferably to Ipg to lOg nucleic acid for particle mediated gene delivery and 1011g to Img for other routes.
The following Examples illustrate the invention.
Example 1: Characterisation of the sequence A cysteine proteinase, designated as HIPHUM 134 has been identified. The nucleotide and amino acid sequences of the receptor have been determined. These are set out below in SEQ ID NOs: I and 2. Suitable primers and probes were designed and used to analyse tissue expression. HIPHUM 134 was found to be primarily expressed in all tissues at various levels (Fig. 1). It is highly expressed in adrenal, cerebellum, jejunum, lung, rectum, testis, tonsil and urinary bladder. It is also expressed at significant levels in fetal brain, ovary, thymus, thyroid and utreus.
HIPHUM 134 is expressed in T cells, peripheral blood mononuclear cells (PBMCs), monocytes and dendritic cells.
The chromosomal localization was also mapped. Human HIPHUM 134 has been mapped to 17pl3. 3-pl3. 1.
Example 2: Screening for substances which exhibit protein modulating activity Preparations of a purified polypeptide of the invention are generated for screening purposes. 96 and 384 well plate, high throughput screens (HTS) are employed using fluorescence or colourimetric indicator molecules. Secondary screening involves the same technology. Tertiary screens involve the study of modulators in rat, mouse and guinea-pig models of disease relevant to the target.
A brief screening assay protocol is as follows : A polypeptide of the invention is expressed in E. Coli, purified and refolded by direct dilution in assay buffer (200mM NaCl, 50mM Tris, 5mM Cal2, 10uM ZnSO4, 0. 01% Brij 35, pH 7.5). Test substances are provided in pools of 10 at 5mM for each test substance for high throughput screening or are serially diluted in dose
response assays. The screening assay is run on an automated system incorporating an OCRA rail to move the plates, a Tecan liquid handler, a Multimek liquid handler and a Titertek liquid handler. (The assay may also be run on manually or in combination with any suitable liquid handling equipment.) 60 III of serially diluted test substance is added to 96 well plates (4 control wells with no inhibitor), 60 III of a solution of a polypeptide of the invention (0.5 uM to 150 uM) is then added to each well and 50 mM EDTA (20 gl of 0. 5M EDTA) is added to 4 inhibited control wells. In a fluorescence assay, the plates are read in a Fluostar (SLT) fluorescence plate reader or equivalent at an excitation of 343 nm and an emission of 450 nm. In a colourimetric assay the plates are read continuously in a SLT spectrophotometer at 405 nm for 3 minutes.
Percentage inhibition is calculated for each concentration (unknown values = U) in the dose response based on the range determined using the value of the
difference of the control (no test substance) (mean of 4 = Cl) and the EDTA treated wells (mean of 4 = C2) using the equation 100* (1- (U-C2)/ (C1-C2).
SEQUENCE LISTING < 110 > GLAXO GROUP LIMITED < 120 > NOVEL PROTEIN < 130 > QG1036 (P80211) < 160 > 2 < 170 > PatentIn version 3. 0 < 210 > 1 < 211 > 2231 < 212 > DNA < 213 > Homo sapiens < 220 > < 221 > CDS < 222 > (73).. (1797)
< 400 > 1 gggaagcttg aggccggaga cgcccgcctt cgggcccgtc cgcccggctt ccccgctccc 60 gggtactgga ag atg aaa gag act ata caa ggg ace ggg tee tgg ggg cct III Met Lys Glu Thr Ile Gln Gly Thr Gly Ser Trp Gly Pro 1 5 10 gag cct cct gga ccc ggc ata ccc cca get tac tea agt ccc agg egg 159 Glu Pro Pro Gly Pro Gly Ile Pro Pro Ala Tyr Ser Ser Pro Arg Arg 15 20 25 gag cgt ctt cgt tgg ccc cca cct ccc aaa ccc cga etc aag tea ggt 207 Glu Arg Leu Arg Trp Pro Pro Pro Pro Lys Pro Arg Leu Lys Ser Gly 30 35 40 45 gga ggg ttt ggg cca gat cct ggg tea ggg ace aca gtg cca gcc aga 255 Gly Gly Phe Gly Pro Asp Pro Gly Ser Gly Thr Thr Val Pro Ala Arg 50 55 60 cgc etc cct gtc ccc cga ccc tct ttt gat gcc tea gca agt gaa gag 303 Arg Leu Pro Val Pro Arg Pro Ser Phe Asp Ala Ser Ala Ser Glu Glu 65 70 75 gag gaa gaa gag gag gag gag gag gat gaa gat gaa gag gag gaa gtg 351 Glu Glu Glu Glu Glu Glu Glu Glu Asp Glu Asp Glu Glu Glu Glu Val 80 85 90 gca get tgg agg ctg ccc cca aga tgg agt cag ctg gga ace tee cag 399 Ala Ala Trp Arg Leu Pro Pro Arg Trp Ser Gln Leu Gly Thr Ser Gln 95 100 105 egg ccc cgc cct tee cgc ccc act cat cga aaa ace tgc tea cag cgc 447 Arg Pro Arg Pro Ser Arg Pro Thr His Arg Lys Thr Cys Ser Gln Arg 110 115 120 125 cgc cgc cga gcc atg aga gcc ttc egg atg ctg etc tac tea aaa age 495
Arg Arg Arg Ala Met Arg Ala Phe Arg Met Leu Leu Tyr Ser Lys Ser 130 135 140 ace teg ctg aca ttc cac tgg aag ctt tgg ggg cgc cac egg ggc. egg 543 Thr Ser Leu Thr Phe His Trp Lys Leu Trp Gly Arg His Arg Gly Arg 145 150 155 egg egg ggc etc gca cac ccc aag aac cat ctt tea ccc cag caa ggg 591 Arg Arg Gly Leu Ala His Pro Lys Asn His Leu Ser Pro Gln Gln Gly 160 165 170 ggt gcg acg cca cag gtg cca tee ccc tgt tgt cgt ttt gac tee ccc 639 Gly Ala Thr Pro Gln Val Pro Ser Pro Cys Cys Arg Phe Asp Ser Pro 175 180 185 egg ggg cca cct cca ccc egg ctg ggt ctg cta ggt get etc atg get 687 Arg Gly Pro Pro Pro Pro Arg Leu Gly Leu Leu Gly Ala Leu Met Ala 190 195 200 205 gag gat ggg gtg aga ggg tct cca cca gtg ccc tct ggg ccc ccc atg 735 Glu Asp Gly Val Arg Gly Ser Pro Pro Val Pro Ser Gly Pro Pro Met 210 215 220 gag gaa gat gga etc agg tgg act cca aag tct cct ctg gac cct gac 783 Glu Glu Asp Gly Leu Arg Trp Thr Pro Lys Ser Pro Leu Asp Pro Asp 225 230 235 teg ggc etc ctt tea tgt act ctg ccc aac ggt ttt ggg gga caa tct 831 Ser Gly Leu Leu Ser Cys Thr Leu Pro Asn Gly Phe Gly Gly Gln Ser 240 245 250 ggg cca gaa ggg gag cgc age ttg gca ccc cct gat gcc age ate etc 879 Gly Pro Glu Gly Glu Arg Ser Leu Ala Pro Pro Asp Ala Ser Ile Leu 255 260 265 ate age aat gtg tgc age ate ggg gac cat gtg gcc cag gag ctt ttt 927 Ile Ser Asn Val Cys Ser Ile Gly Asp His Val Ala Gln Glu Leu Phe 270 275 280 285 cag ggc tea gat ttg ggc atg gca gaa gag gca gag agg cct ggg gag 975 Gln Gly Ser Asp Leu Gly Met Ala Glu Glu Ala Glu Arg Pro Gly Glu 290 295 300 aaa gcc ggc cag cac age ccc ctg cga gag gag cat gtg ace tgc gta 1023 Lys Ala Gly Gln His Ser Pro Leu Arg Glu Glu His Val Thr Cys Val 305 310 315 cag age ate ttg gac gaa ttc ctt caa acg tat ggc age etc ata ccc 1071 Gln Ser Ile Leu Asp Glu Phe Leu Gln Thr Tyr Gly Ser Leu Ile Pro 320 325 330 etc age act gat gag gta gta gag aag ctg gag gac att ttc cag cag 1119 Leu Ser Thr Asp Glu Val Val Glu Lys Leu Glu Asp Ile Phe Gln Gln 335 340 345 gag ttt tee ace cct tee agg aag ggc ctg gtg ttg cag ctg ate cag 1167 Glu Phe Ser Thr Pro Ser Arg Lys Gly Leu Val Leu Gln Leu Ile Gln
350 355 360 365 tct tac cag egg atg cca ggc aat gcc atg gtg agg ggc ttc cga gtg 1215 Ser Tyr Gln Arg Met Pro Gly Asn Ala Met Val Arg Gly Phe Arg Val 370 375 380 get tat aag egg cac gtg ctg ace atg gat gac ttg ggg ace ttg tat 1263 Ala Tyr Lys Arg His Val Leu Thr Met Asp Asp Leu Gly Thr Leu Tyr 385 390 395 gga cag aac tgg etc aat gac cag gtg atg aac atg tat gga gac ctg 1311 Gly Gln Asn Trp Leu Asn Asp Gln Val Met Asn Met Tyr Gly Asp Leu 400 405 410 gtc atg gac aca gtc cct gaa aag gtg cat ttc ttc aat agt ttc ttc 1359 Val Met Asp Thr Val Pro Glu Lys Val His Phe Phe Asn Ser Phe Phe 415 420 425 tat gat aaa etc cgt ace aag ggt tat gat ggg gtg aaa agg tgg ace 1407 Tyr Asp Lys Leu Arg Thr Lys Gly Tyr Asp Gly Val Lys Arg Trp Thr 430 435 440 445 aaa aac gtg gac ate ttc aat aag gag cta ctg cta ate ccc ate cac 1455 Lys Asn Val Asp Ile Phe Asn Lys Glu Leu Leu Leu Ile Pro Ile His 450 455 460 ctg gag gtg cat tgg tee etc ate tct gtt gat gtg agg cga cgc ace 1503 Leu Glu Val His Trp Ser Leu Ile Ser Val Asp Val Arg Arg Arg Thr 465 470 475 ate ace tat ttt gac teg cag cgt ace cta aac cgc cgc tgc cct aag 1551 Ile Thr Tyr Phe Asp Ser Gln Arg Thr Leu Asn Arg Arg Cys Pro Lys 480 485 490 cat att gcc aag tat cta cag gca gag gcg gta aag aaa gac cga ctg 1599 His Ile Ala Lys Tyr Leu Gln Ala Glu Ala Val Lys Lys Asp Arg Leu 495 500 505 gat ttc cac cag ggc tgg aaa ggt tac ttc aaa atg aat gtg gcc agg 1647 Asp Phe His Gln Gly Trp Lys Gly Tyr Phe Lys Met Asn Val Ala Arg 510 515 520 525 cag aat aat gac agt gac tgt ggt get ttt gtg ttg cag tac tgc aag 1695 Gln Asn Asn Asp Ser Asp Cys Gly Ala Phe Val Leu Gln Tyr Cys Lys 530 535 540 cat ctg gcc ctg tct cag cca ttc age ttc ace cag cag gac atg ccc 1743 His Leu Ala Leu Ser Gln Pro Phe Ser Phe Thr Gln Gln Asp Met Pro 545 550 555 aaa ctt cgt egg cag ate tac aag gag ctg tgt cac tgc aaa etc act 1791 Lys Leu Arg Arg Gln Ile Tyr Lys Glu Leu Cys His Cys Lys Leu Thr 560 565 570 gtg tga gcctcgtacc ccagacccca agcccataaa tgggaaggga gacatgggag 1847 Val
tcccttccca agaaactcca gttcctttcc tctcttgcct cttcccactc acttcccttt 1907 ggtttttcat atttaaatgt ttcaatttct gtattttttt ttctttgaga gaatacttgt 1967 tgatttctga tgtgcagggg gtggctacag aaaagcccct ttcttcctct gtttgcaggg 2027 gagtgtggcc ctgtggcctg ggtggagcag tcatcctccc ccttccccgt gcagggagca 2087 ggaaatcagt gctgggggtg gtgggcggac aataggatca ctgcctgcca gatcttcaaa 2147 cttttatata tatatatata tatataaaaa tgccacggtc ctgctctggt caataaagga 2207 tcctttgttg atacgtaaaa aaaa 2231 < 210 > 2 < 211 > 574 < 212 > PRT < 213 > Homo sapiens < 400 > 2
Met Lys Glu Thr Ile Gln Gly Thr Gly Ser Trp Gly Pro Glu Pro Pro 1 5 10 15 Gly Pro Gly Ile Pro Pro Ala Tyr Ser Ser Pro Arg Arg Glu Arg Leu 20 25 30 Arg Trp Pro Pro Pro Pro Lys Pro Arg Leu Lys Ser Gly Gly Gly Phe 35 40 45 Gly Pro Asp Pro Gly Ser Gly Thr Thr Val Pro Ala Arg Arg Leu Pro 50 55 60
Val Pro Arg Pro Ser Phe Asp Ala Ser Ala Ser Glu Glu Glu Glu Glu 65 70 75 80 Glu Glu Glu Glu Glu Asp Glu Asp Glu Glu Glu Glu Val Ala Ala Trp 85 90 95 Arg Leu Pro Pro Arg Trp Ser Gln Leu Gly Thr Ser Gln Arg Pro Arg 100 105 110 Pro Ser Arg Pro Thr His Arg Lys Thr Cys Ser Gln Arg Arg Arg Arg 115 120 125 Ala Met Arg Ala Phe Arg Met Leu Leu Tyr Ser Lys Ser Thr Ser Leu 130 135 140
Thr Phe His Trp Lys Leu Trp Gly Arg His Arg Gly Arg Arg Arg Gly 145 150 155 160 Leu Ala His Pro Lys Asn His Leu Ser Pro Gln Gln Gly Gly Ala Thr 165 170 175 Pro Gln Val Pro Ser Pro Cys Cys Arg Phe Asp Ser Pro Arg Gly Pro 180 185 190 Pro Pro Pro Arg Leu Gly Leu Leu Gly Ala Leu Met Ala Glu Asp Gly 195 200 205 Val Arg Gly Ser Pro Pro Val Pro Ser Gly Pro Pro Met Glu Glu Asp 210 215 220 Gly Leu Arg Trp Thr Pro Lys Ser Pro Leu Asp Pro Asp Ser Gly Leu 225 230 235 240 Leu Ser Cys Thr Leu Pro Asn Gly Phe Gly Gly Gln Ser Gly Pro Glu 245 250 255 Gly Glu Arg Ser Leu Ala Pro Pro Asp Ala Ser Ile Leu Ile Ser Asn 260 265 270 Val Cys Ser Ile Gly Asp His Val Ala Gln Glu Leu Phe Gln Gly Ser 275 280 285 Asp Leu Gly Met Ala Glu Glu Ala Glu Arg Pro Gly Glu Lys Ala Gly 290 295 300 Gln His Ser Pro Leu Arg Glu Glu His Val Thr Cys Val Gln Ser Ile 305 310 315 320 Leu Asp Glu Phe Leu Gln Thr Tyr Gly Ser Leu Ile Pro Leu Ser Thr 325 330 335 Asp Glu Val Val Glu Lys Leu Glu Asp Ile Phe Gln Gln Glu Phe Ser 340 345 350 Thr Pro Ser Arg Lys Gly Leu Val Leu Gln Leu Ile Gln Ser Tyr Gln 355 360 365
Arg Met Pro Gly Asn Ala Met Val Arg Gly Phe Arg Val Ala Tyr Lys 370 375 380 Arg His Val Leu Thr Met Asp Asp Leu Gly Thr Leu Tyr Gly Gln Asn 385 390 395 400 Trp Leu Asn Asp Gln Val Met Asn Met Tyr Gly Asp Leu Val Met Asp 405 410 415 Thr Val Pro Glu Lys Val His Phe Phe Asn Ser Phe Phe Tyr Asp Lys 420 425 430 Leu Arg Thr Lys Gly Tyr Asp Gly Val Lys Arg Trp Thr Lys Asn Val 435 440 445
Asp Ile Phe Asn Lys Glu Leu Leu Leu Ile Pro Ile His Leu Glu Val 450 455 460 His Trp Ser Leu Ile Ser Val Asp Val Arg Arg Arg Thr Ile Thr Tyr 465 470 475 480 Phe Asp Ser Gln Arg Thr Leu Asn Arg Arg Cys Pro Lys His Ile Ala 485 490 495 Lys Tyr Leu Gln Ala Glu Ala Val Lys Lys Asp Arg Leu Asp Phe His 500 505 510 Gln Gly Trp Lys Gly Tyr Phe Lys Met Asn Val Ala Arg Gln Asn Asn 515 520 525
Asp Ser Asp Cys Gly Ala Phe Val Leu Gln Tyr Cys Lys His Leu Ala 530 535 540 Leu Ser Gln Pro Phe Ser Phe Thr Gln Gln Asp Met Pro Lys Leu Arg 545 550 555 560 Arg Gln Ile Tyr Lys Glu Leu Cys His Cys Lys Leu Thr Val 565 570

Claims (16)

1. An isolated cysteine proteinase polypeptide comprising (i) the amino acid sequence of SEQ ID NO: 2 or (ii) a variant thereof which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO or (iii) a fragment of (i) or (ii) which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO.
2. A polypeptide according to claim 1 wherein the variant (ii) has at least 80% identity to the amino acid sequence of SEQ ID NO: 2.
3. A polynucleotide encoding a polypeptide according to claim 1 or 2.
4. A polynucleotide according to claim 3 which is a cDNA sequence.
5. A polynucleotide encoding a cysteine proteinase polypeptide which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO which polynucleotide comprises: (a) the nucleic acid sequence of SEQ ID NO: 1 and/or a sequence complementary thereto; (b) a sequence which hybridises under stringent conditions to a sequence as defined in (a); (c) a sequence that is degenerate as a result of the genetic code to a sequence as defined in (a) or (b); or (d) a sequence having at least 60% identity to a sequence as defined in (a), (b) or (c).
6. An expression vector comprising a polynucleotide according to any one of claims 3 to 5.
7. A host cell comprising an expression vector according to claim 6.
8. An antibody specific for a polypeptide according to claim 1 or 2.
9. A method for the identification of a substance that modulates cysteine proteinase activity and/or expression, which method comprises: (i) contacting a test substance and a polypeptide according to claim 1 or 2, a polynucleotide according to any one of claims 3 to 5, an
expression vector according to claim 6 or a host cell according to claim 7, and (ii) determining the effect of the test substance on the activity and/or expression of the said polypeptide or the polypeptide encoded by said polynucleotide, thereby to determine whether the test substance modulates cysteine proteinase activity and/or expression.
10. A method according to claim 9 wherein the polypeptide is in a substantially isolated form.
11. A substance which modulates cysteine proteinase activity and which is identifiable by a method according to claim 9 or 10.
12. A method of treating a subject having a disorder that is responsive to cysteine proteinase modulation, which method comprises administering to said subject an effective amount of a substance according to claim 11.
13. A method according to claim 12 wherein the disorder is selected from Cushing's disease, Addison's disease, hypertension, hyperkalemia, hypokalemia, polycystic ovary, infertility, hirustism, HBV infection, HIV infection, HSV infection, breast cancer, Huntington's disease, central areolar choroidal dystrophy, lung oedemas, obesity, alcoholism and juvenille cystic kidneys.
14. Use of a substance as defined in claim 11 in the manufacture of a medicament for treatment or prophylaxis of a disorder that is responsive to stimulation or modulation of cysteine proteinase activity.
15. A use according to claim 14 wherein the disorder is selected from Cushing's disease, Addison's disease, hypertension, hyperkalemia, hypokalemia, polycystic ovary, infertility, hirustism, HBV infection, HIV infection, HSV infection, breast cancer, Huntington's disease, central areolar choroidal dystrophy, lung oedemas, obesity, alcoholism and juvenille cystic kidneys.
16. A method of producing a polypeptide according to claim 1 or 2, which method comprises maintaining a host cell as defined in claim 7 under conditions suitable for obtaining expression of the polypeptide and isolating the said polypeptide.
GB0128025A 2000-11-24 2001-11-22 Cysteine protease polypeptide Withdrawn GB2372994A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GBGB0028761.5A GB0028761D0 (en) 2000-11-24 2000-11-24 New protein

Publications (2)

Publication Number Publication Date
GB0128025D0 GB0128025D0 (en) 2002-01-16
GB2372994A true GB2372994A (en) 2002-09-11

Family

ID=9903863

Family Applications (2)

Application Number Title Priority Date Filing Date
GBGB0028761.5A Ceased GB0028761D0 (en) 2000-11-24 2000-11-24 New protein
GB0128025A Withdrawn GB2372994A (en) 2000-11-24 2001-11-22 Cysteine protease polypeptide

Family Applications Before (1)

Application Number Title Priority Date Filing Date
GBGB0028761.5A Ceased GB0028761D0 (en) 2000-11-24 2000-11-24 New protein

Country Status (1)

Country Link
GB (2) GB0028761D0 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001009292A2 (en) * 1999-07-31 2001-02-08 Board Of Regents, The University Of Texas System Sentrin-specific human proteases senp1-3

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001009292A2 (en) * 1999-07-31 2001-02-08 Board Of Regents, The University Of Texas System Sentrin-specific human proteases senp1-3

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Gene, Vol.248, May 2000, Yeh, E. T. H. et al., "Ubiquitin-like proteins...", pp.1-14 & related Genbank accession number AY008763. *

Also Published As

Publication number Publication date
GB0028761D0 (en) 2001-01-10
GB0128025D0 (en) 2002-01-16

Similar Documents

Publication Publication Date Title
JP2006325596A (en) Sialoadhesin family member-2 (saf-2)
JPH1175868A (en) New compound
KR20010081109A (en) Novel g protein-coupled receptor protein, its dna and ligand thereof
JPH11151093A (en) Human disintegrin metalloprotease related to drosophila kuz gene
US20020127692A1 (en) Protein
JPH11217399A (en) Kringle-relating clone hthbz47
JP2003526322A (en) Secreted cysteine-rich protein-6 (scrp-6)
US20030165818A1 (en) Novel protein
US6410689B1 (en) Caspase recruitment domain polypeptides
GB2371303A (en) Receptor polypeptides with immunomodulatory activity
GB2372994A (en) Cysteine protease polypeptide
US20040072268A1 (en) Methods for regulating BRCA1-BRCA2-containing complex activity
JPH1142093A (en) Sialoadhesin family member-3
JP2003024086A (en) Integlin ligand, human mindin
JPH11164692A (en) Human rce1
GB2371801A (en) Cysteine protease polypeptides
US20020076800A1 (en) Protein
GB2382078A (en) Protease polypeptide
US20020115205A1 (en) Polypeptide
WO2001016596A2 (en) Screening for modulators of gaba b receptor activity
US20020076758A1 (en) Polypeptide
JP2004041214A (en) SIALOADHESIN FAMILY 4 (SAF-4) cDNA
GB2376234A (en) Trypsin-like serine protease
GB2371800A (en) Amino acid transporter polypeptides
JPH11103867A (en) Epo primary response gene1, eprg1

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)