GB2372504A - Cysteine protease polypeptide - Google Patents

Cysteine protease polypeptide Download PDF

Info

Publication number
GB2372504A
GB2372504A GB0127017A GB0127017A GB2372504A GB 2372504 A GB2372504 A GB 2372504A GB 0127017 A GB0127017 A GB 0127017A GB 0127017 A GB0127017 A GB 0127017A GB 2372504 A GB2372504 A GB 2372504A
Authority
GB
United Kingdom
Prior art keywords
polypeptide
sumo
sequence
expression
hiphum
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB0127017A
Other versions
GB0127017D0 (en
Inventor
Barbara Susan Ink
Alan Peter Lewis
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Glaxo Group Ltd
Original Assignee
Glaxo Group Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Glaxo Group Ltd filed Critical Glaxo Group Ltd
Publication of GB0127017D0 publication Critical patent/GB0127017D0/en
Publication of GB2372504A publication Critical patent/GB2372504A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/48Hydrolases (3) acting on peptide bonds (3.4)
    • C12N9/50Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
    • C12N9/64Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue
    • C12N9/6421Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from animal tissue from mammals
    • C12N9/6472Cysteine endopeptidases (3.4.22)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • A61P25/28Drugs for disorders of the nervous system for treating neurodegenerative disorders of the central nervous system, e.g. nootropic agents, cognition enhancers, drugs for treating Alzheimer's disease or other forms of dementia
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/12Antivirals
    • A61P31/14Antivirals for RNA viruses
    • A61P31/18Antivirals for RNA viruses for HIV
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents

Abstract

An isolated cysteine proteinase polypeptide comprising <SL> <LI>(i) the amino acid sequence of SEQ ID NO: 2; or <LI>(ii) a variant thereof which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO; or <LI>(iii) a fragment of (i) or (ii) which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO. </SL> The polypeptide may be used in a method to identify substances which may be of use in treating HIV infection, lung cancer, inflammatory disease, hepatitis B and brain disease.

Description

1 2372504
NEW PROTEIN
Field of the Invention
The present invention relates to cysteine proteinase polypeptides.
Background of the Invention
There are 4 main catalytic types of peptidases; serine (which includes threonine peptidases) cysteine, aspartic and metallo. The serine, threonine and cysteine peptidases are catalytically very different from the aspartic and 10 metallopeptidases in that the nucleophile of the catalytic site is part of an amino acid, whereas it is an activated water molecule in the other groups.
Peptidases in which the nucleophile is the sulphydryl group of a cysteine residue are known as cysteine-type peptidases. The catalytic mechanism is similar to that of the serine-type peptidases in that a nucleophile and a proton donor or general 5 base is required, and the proton donor in all cysteine peptidases (in which it has been identified) is a histidine residue. Typical examples of cysteine peptidases include the cathepsins (excluding cathepsin G) and caspases.
Human ubiquitin-like protein specific protease (ULP). Ulp is a deconjugating enzyme of SUMO/Sentrin, a ubiquitin-like protein, and its target protein. It is a 20 cysteine protease that is unrelated to other known deubiquitinating enzymes, but shows similarity to certain viral proteases. The Ulps cleave at a glycine-glycine-X cleavage site that is similar to the consensus sites of adenovirus, African swine fever virus, and certain poxviruses. Ulps also cleave the SUMO precursor to generate the mature form of SUMO.
2s The human ULPs show homology to the yeast ULP1 and ULP2 genes. Yeast ULP 1 cleaves the yeast homolog of SUMO, SMT3, from target proteins. ULP 1 is important throughout the cell cycle and is essential for the G2/M phase. ULP2 also cleaves SMT3 from target proteins, although the target proteins of ULP2 appear to be different from ULP1. ULP2 is important for the recovery of cells from checkpoint 30 arrest induced by DNA damage, inhibition of DNA replication, or defects in spindle assembly. Deletions of ULP2 in yeast show a phenotype of temperature-sensitive growth, abnormal cell morphology, decreased plasmid and chromosome stability,
and a severe sporulation defect.
Summary of the Invention
A novel cysteine proteinase, referred to herein as HIPHUM 119, is now s provided. HIPHUM 1 19 is shown to be expressed in all tissues at various levels. It is highly expressed in adrenal, cerebellum, rectum, testis, thyroid and urinary bladder.
It is also expressed at significant levels in lung, fetal brain, skeletal muscle, tonsil and uterus. In addition, it is expressed in T cells, peripheral blood mononucleocytes (PBMNCs), monocytes and dendritic cells. The novel cysteine proteinase is a lo screening target for the identification and development of novel pharmaceutical agents, including modulators of cysteine proteinase activity. These agents may be used in the treatment and/or prophylaxis of disorders such as HIV infection, lung cancer, inflammatory disease such as asthma, Hepatitis B and brain diseases such as Alzheimer's disease, parasupranuclear palsey and Huntington's disease.
Accordingly, the present invention provides an isolated cysteine proteinase polypeptide comprising: (i) the amino acid sequence of SEQ ID NO: 2; (ii) a variant thereof which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the 20 active form of SUMO; or (iii) a fragment of (i) or (ii) which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO.
According to another aspect of the invention there is provided a 2s polynucleotide encoding a polypeptide of the invention which polynucleotide includes a sequence comprising: (a) the nucleic acid sequence of SEQ ID NO: 1 and/or a sequence complementary thereto; (b) a sequence which hybridises under stringent conditions to a sequence 30 as defined in (a); (c) a sequence that is degenerate as a result of the genetic code to a sequence as defined in (a) or (b); or
(d) a sequence having at least 60% identity to a sequence as defined in (a), (b) or (c).
The invention also provides: - an expression vector which comprises a polynucleotide of the invention and s which is capable of expressing a polypeptide of the invention; - a host cell comprising an expression vector of the invention; - a method of producing a polypeptide of the invention which method comprises maintaining a host cell of the invention under conditions suitable for obtaining expression of the polypeptide and isolating the said polypeptide; lo - an antibody specific for a polypeptide of the invention; - a method for identification of a substance that modulates cysteine proteinase activity and/or expression, which method comprises contacting a polypeptide, polynucleotide, expression vector or host cell of the invention with a test substance and determining the effect of the test substance on the activity 5 and/or expression of the said polypeptide or the polypeptide encoded by the said polynucleotide, thereby to determine whether the test substance modulates cysteine proteinase activity and/or expression; - a compound which or modulates cysteine proteinase activity and which is identifiable by the method referred to above; 20 - a method of treating a subject having a disorder that is responsive to cysteine proteinase stimulation or modulation, which method comprises administering to said subject an effective amount of substance of the invention; and - use of a substance that stimulates or modulates cysteine proteinase activity in 2s the manufacture of a medicament for the treatment or prophylaxis of a disorder that is responsive to stimulation or modulation of cysteine proteinase activity. Preferably the disorder is selected from HIV infection, lung cancer, inflammatory disease such as asthma, Hepatitis B and brain diseases such as 30 Alzheimer's disease, parasupranuclear palsey and Huntington's disease.
Brief Description of the Figures
Figure 1 shows the relative expression levels of HIPHUM I 19 in human tissues. Figure 2 shows the relative expression of HIPHUM 1 19 in normal and tumor 5 tissue from the colon, lung and breast.
Figure 3 shows the relative expression of HIPHUM 1 19 in a normal brain and brain tissue from Alzheimer's disease (Alz), Huntington's disease (Hunt), myotonic dystrophy (MD), parasupranuclear palsey (PSP) and ALS.
Figure 4 shows the relative expression of HIPHUM 1 19 in normal lung and lo tissue from asthmatic and chronic obstructive pulmonary disease (COPD) lung; in control endothelial cells, VEGF treated endothelial cells and bFGF treated endothelial cells; in stimulated and unstimulated bone marrow; in normal knee cartilage and cartilage from osteoarthritis knee; in HS-1 and HS-2 synovium in rheumatoid arthritis; and in differentiated and undifferentiated osetoclasts.
5 Figure 5 shows the relative expression levels of HIPHUM 1 19 in various blood cells and in HSV, HBV and HIV/PBL infected cells.
Brief Description of the Sequences
SEQ ID NO: 1 shows the nucleotide and amino acid sequences of human 20 protein HIPHUM 1 19.
SEQ ID NO: 2 is the amino acid sequence alone of HIPHUM 119.
Detailed Description of the Invention
Throughout the present specification and the accompanying claims the words
2s "comprise" and "include" and variations such as "comprises", "comprising", "includes" and "including" are to be interpreted inclusively. That is, these words are intended to convey the possible inclusion of other elements or integers not specifically recited, where the context allows.
The present invention relates to a human cysteine proteinase, referred to so herein as HIPHUM 119, and variants thereof. Sequence information for HIPHUM 119 is provided in SEQ ID NO: 1 (nucleotide and amino acid) and in SEQ ID NO: 2.
polypeptide of the invention thus consists essentially of the amino acid sequence
of SEQ ID NO: 2 or of a variant of that sequence, or of a fragment of either thereof.
Polypeptides of the invention may be in a substantially isolated form. It will be understood that the polypeptide may be mixed with carriers or diluents which will not interfere with the intended purpose of the polypeptide and still be regarded as s substantially isolated. A polypeptide of the invention may also be in a substantially purified form, in which case it will generally comprise the polypeptide in a preparation in which more than 50%, e.g. more than 80%, 90%, 95% or 99%, by weight of the polypeptide in the preparation is a polypeptide of the invention.
Routine methods, can be employed to purify and/or synthesise the proteins according l o to the invention. Such methods are well understood by persons skilled in the art, and include techniques such as those disclosed in Sambrook et al, Molecular Cloning: a Laboratory Manual, 2nd Edition, CSH Laboratory Press, 1989, the disclosure of
which is included herein in its entirety by way of reference.
The term "variant" refers to a polypeptide which has a same essential s character or basic biological functionality as HIPHUM 119. The essential character of HIPHUM 119 can be defined as follows: HIPHUM 119 is a cysteine proteinase.
Preferably a variant polypeptide is one which binds to the same a target protein as HIPHUM 119. Preferably the polypeptide is capable of cleaving SUMO from a target protein and/or cleaving the precursor boron of SUMO to release the active form of 20 SUMO. Preferably the polypeptide is capable of cleaving the SUMO precursor to its mature and active form.
Preferably the polypeptide comprises a catalytic domain which comprises a catalytic triad. The catalytic triad is typically composed of a cysteine residue, histidine residue, aspartate residue and a glutamine residue which together form the 2s oxyanion hole at the active site. Preferably the polypeptide comprises an endoplasmic reticulum (ER) retention signal. Preferably the ER retention signal comprises the four amino acids, TKKR. Preferably the polypeptide comprises a leucine zipper pattern. Preferably the leucine zipper pattern is LSYMDSLLRQSDVSLLDPPSWL.
30 A polypeptide having a same essential character as HIPHUM 119 may be identified by monitoring for a function the cysteine proteinase selected from cleavage of SUMO from a target protein, activation of SUMO and alteration in activity or
subcellular localisation of a target protein. Typical target proteins include transcription factors, such as IKBa/NF-,cB, pS3, cjun and HIPK2, proteins involved in intracellular transport, such as RanGAP1, GLUT1 and GLUT4, proteins involved in the intracellular response to DNA damage and repair, such as Topoisomerase 1, 5 Topoisomerase 2, MDM2 and WRN, proteins involved in nuclear localization, such as PML and Spl 00, and viral proteins such as bovine papillomavirus E1 and human cytomegalovirus E2. Other target proteins may be identified, for example using the yeast2-hybrid assay. Such target proteins that have been identified using the yeast 2-hybrid screen include TIFF and FAS receptors, CEN, and red 52.
0 SUMO binds to various transcription factors and is involved in their regulation. Removal of SUMO from IKBa would allow for activation of NF- cB and produce an inflammatory response. SUMO conjugation increases the transcriptional activity of p53, therefore, deconjugation of SUMO may lower the transcriptional activity of p53. Cjun modification by SUMO decreases its transcriptional activity.
5 Therefore, Ulp proteases may regulate tumorsuppressor activity with a role in cell cycle arrest, apoptosis, and cellular responses to mitogens, stress, or inflammatory stimuli. SUMO conjugates to a nuclear kinase, HIPK2, and is needed for the formation of nuclear bodies containing HIPK2. HIPK2 can act as a transcriptional 20 compressor for homeoproteins. Therefore, deconjugation of SUMO by the protease may regulate the activity of homeodomain transcription factors and play a role in development. SUMO modification of RanGAP1 targets the complex to the nuclear pore complex from the cytosol. Ulp protease may be involved in the regulation of the 2s levels of the Ran GAP 1 within the cytosol and pore complex.
SUMO also modifies GLUT1 and GLUT4 that are involved in insulin stimulated glucose uptake. Deconjugation of SUMO would regulate the amount of glucose uptake in the cell.
The Ulp protease may regulate the cellular response to DNA damage and 30 DNA repair. SUMO conjugates to TOP1 and TOP2 when treated with DNA damaging agents. MDM2 is also conjugated to SUMO to prevent selfubiquitination of mdm2, thus enhancing E3 ligase activity of pS3. A Ulp protease may regulate the
conjugation of SUMO to MDM2, thereby affecting stability of p53. SUMO also conjugates to WRN that functions as a DNA helicase, exonuclease, and ATPase.
Deletions of WRN which are found in Werner's syndrome impair its nuclear localization. It is possible that cleavage of SUMO from WRN would alter the s localization of WRN and contribute to its dysregulation during disease.
SUMO modification of PML is needed for its localization into nuclear bodies containing DAXX. PML is involved in many cellular functions such as the interferonresponse, immune surveillance, apoptosis, and as a tumor suppressor. It appears that the function of PML is to meditate the regulation of transcription. Mice lo lacking PML have shown altered transcription in cellular differentiation by way of nuclear receptors, apoptosis mediated by DAXX and p53, inhibition of growth mediated by pRb and pS3, and the immune response through interferons. Therefore, regulating the conjugation-deconjugation of SUMO to PML through Ulp protease may affect these pathways.
5 Although the function of the interferon-induced protein, SplOO, is unknown, conjugation to SUMO takes place in the nucleus. The Ulp protease may affect the activity of Spl 00 in response to interferon.
SUMO conjugates to the human cytomegalovirus protein, IE2. IE2 functions as a strong transactivator of both viral and cellular promoters. Overexpression of 20 SUMO reduces the transactivation activity of IE2. Therefore, the Ulp protease may function to control the level of IE2 sumoylation, thus, affecting both the virus life cycle and host interactions to virus infection.
SUMO conjugates to the bovine papalomavirus E1 protein which functions as the iniatiator of replication. Mutants which cannot conjugate SUMO are not able to :5 localise to the correct nuclear subdomain. The Ulp protease may control the level of sumylation of E1, thus affecting both the virus life cycle and host reactions to viral infection. Preferably a polypeptide of the invention is located in the endoplasmic reticulum when expressed in a cell. More preferably a polypeptide of the invention is 30 capable of cycling between the endoplasmic reticulum and other intracellular compartments. In another aspect of the invention, a variant is one which does not show the
same activity as HIPHUM I 19 but is one which inhibits a basic function of HIPHUM 1 19. For example, a variant polypeptide is one which inhibits protease activity of HIPHUM 1 19, for example by binding to a target protein to prevent a target protein binding to HIPHUM 1 19.
s Typically, polypeptides with more than about 65% identity preferably at least 80% or at least 90% and particularly preferably at least 95% at least 97% or at least 99% identity, with the amino acid sequences of SEQ ID NO: 2, are considered as variants of the proteins. Such variants may include allelic variants and the deletion, modification or addition of single amino acids or groups of amino acids within the to protein sequence, as long as the peptide maintains a basic biological functionality of the HIPHUM 1 19 receptor.
Amino acid substitutions may be made, for example from 1, 2 or 3 to 10, 20 or 30 substitutions. The modified polypeptide generally retains activity as a cysteine proteinase. Conservative substitutions may be made, for example according to the 5 following Table. Amino acids in the same block in the second column and preferably in the same line in the third column may be substituted for each other.
_. ALIPHATIC Non-polar G A P I L V Polar-uncharged C S T M N (Z Polarcharged O li AROMATIC H F W Y
Shorter polypeptide sequences are within the scope of the invention. For 20 example, a peptide of at least 20 amino acids or up to 50, 60, 70, 80, 100, 150, 200, 300 or 400 amino acids in length is considered to fall within the scope of the
invention as long as it demonstrates a basic biological functionality of HIPHUM 1 19.
In particular, but not exclusively, this aspect of the invention encompasses the situation when the protein is a fragment of the complete protein sequence and may represent a catalytic domain or substrate binding domain. Preferred fragments 5 include the C-terminal catalytic domain, the N-terminal domain which is not conserved between different members of the Ulp family of proteases, the leucine zipper domain or the ER membrane retention signal. Such fragments can be used to construct chimeric proteases preferably with another protease, more preferably with another member of the family of cysteine proteinases. Such chimeric proteases may lo comprise different domains from different cysteine proteinases. For example, a fragment comprising an N-terminal domain of a polypeptide of the invention may be fused to a C-terminal catalytic domain of a different cysteine proteinase.
Fragments of HIPHUM 119 or a variant thereof can also be used to raise anti-
HIPHUM 119 antibodies. In this embodiment the fragment may comprise an epitope of the HIPHUM 119 polypeptide and may otherwise not demonstrate the catalytic, substrate binding or other properties of HIPHUM 119.
Polypeptides of the invention may be chemically modified, e.g. post-
translationally modified. For example, they may be glycosylated or comprise modified amino acid residues. They may also be modified by the addition of histidine 20 residues to assist their purification or by the addition of a signal sequence to promote insertion into the cell membrane. Such modified polypeptides fall within the scope of the term "polypeptide" of the invention.
The invention also includes nucleotide sequences that encode for HIPHUM 1 19 or variant thereof as well as nucleotide sequences which are complementary 25 thereto. The nucleotide sequence may be RNA or DNA including genomic DNA, synthetic DNA or cDNA. Preferably the nucleotide sequence is a DNA sequence and most preferably, a cDNA sequence. Nucleotide sequence information is provided in SEQ ID NO: 1. Such nucleotides can be isolated from human cells or synthesised according to methods well known in the art, as described by way of example in 30 Sambrook et al, 1989.
Typically a polynucleotide of the invention comprises a contiguous sequence of nucleotides which is capable of hybridizing under selective conditions to the
coding sequence or the complement of the coding sequence of SEQ ID NO: 1.
A polynucleotide of the invention can hydridize to the coding sequence or the complement of the coding sequence of SEQ ID NO: 1 at a level significantly above background. Background hybridization may occur, for example, because of other
5 cDNAs present in a cDNA library. The signal level generated by the interaction between a polynucleotide of the invention and the coding sequence or complement of the coding sequence of SEQ ID NO: 1 is typically at least 10 fold, preferably at least 100 fold, as intense as interactions between other polynucleotides and the coding sequence of SEQ ID NO: 1. The intensity of interaction may be measured, for lo example, by radiolabelling the probe, e.g. with 3P Selective hybridization may typically be achieved using conditions of medium to high stringency. However, such hybridization may be carried out under any suitable conditions known in the art (see Sambrook et a/, 1989. For example, if high stringency is required suitable conditions include from 0.1 to 0.2 x SSC at 60 C up to 65 C. If lower stringency is required suitable conditions include 2 x SSC at 60 C.
The coding sequence of SEQ ID NO: 1 may be modified by nucleotide substitutions, for example from l, 2 or 3 to 10, 25, 50 or 100 substitutions. The polynucleotide of SEQ ID NO: l may alternatively or additionally be modified by one or more insertions and/or deletions and/or by an extension at either or both ends.
20 A polynucleotide may include one or more introns, for example may comprise genomic DNA. Additional sequences such as signal sequences which may assist in insertion of the polypeptide in a cell membrane may also be included. The modified polynucleotide generally encodes a polypeptide which has a HIPHUM 119 activity.
Alternatively, a polynucleotide encodes a catalytic or substrate-binding portion of a 25 polypeptide or a polypeptide which inhibits HIPHUM 119 activity. Degenerate substitutions may be made andlor substitutions may be made which would result in a conservative amino acid substitution when the modified sequence is translated, for example as shown in the Table above.
A nucleotide sequence which is capable of selectively hybridizing to the 30 complement of the DNA coding sequence of SEQ ID NO: l will generally have at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98% or at least 99% sequence identity to the coding sequence of SEQ ID NO: 1 over a region
of at least 20, preferably at least 30, for instance at least 40, at least 60, more preferably at least 100 contiguous nucleotides or most preferably over the full length of SEQ ID NO: 1.
For example the UWGCG Package provides the BESTFIT program which 5 can be used to calculate homology (for example used on its default settings) (Devereux et al (1984) Nucleic Acids Research 12, p387-395). The PILEUP and BLAST algorithms can be used to calculate homology or line up sequences (typically on their default settings), for example as described in Altschul (1993) J. Mol. Evol.
36:290-300; Altschul et al (1990) J. Mol. Biol. 215:403-10.
10 Software for performing BLAST analyses is publicly available through the National Centre for Biotechnology Information (http://www.ncbi.nlm. nih.gov/).
This algorithm involves first identifying high scoring sequence pair (HSPs) by identifying short words of length W in the query sequence that either match or satisfy some positive-valued threshold score T when aligned with a word of the same length 15 in a database sequence. T is referred to as the neighbourhood word score threshold (Altschul et al, 1990). These initial neighbourhood word hits act as seeds for initiating searches to find HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Extensions for the word hits in each direction are halted when: the 20 cumulative alignment score falls off by the quantity X from its maximum achieved value, the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached.
The BLAST algorithm parameters W. T and X determine the sensitivity and speed of the alignment. The BLAST program uses as defaults a word length (W) of 11, the 2s BLOSUM62 scoring matrix (see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89: 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands.
The BLAST algorithm performs a statistical analysis of the similarity between two sequences; see e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci. 30 USA 90: 5873-5787. One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would
occur by chance. For example, a sequence is considered similar to another sequence if the smallest sum probability in comparison of the first sequence to the second sequence is less than about 1, preferably less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
s Any combination of the above mentioned degrees of sequence identity and minimum sizes may be used to define polynucleotides of the invention, with the more stringent combinations (i.e. higher sequence identity over longer lengths) being preferred. Thus, for example a polynucleotide which has at least 90% sequence identity over 25, preferably over 30 nucleotides forms one aspect of the invention, as lo does a polynucleotide which has at least 95% sequence identity over 40 nucleotides.
The nucleotides according to the invention have utility in production of the proteins according to the invention, which may take place in vitro, in vivo or ex vivo.
The nucleotides may be involved in recombinant protein synthesis or indeed as therapeutic agents in their own right, utilised in gene therapy techniques. Nucleotides is complementary to those encoding HIPHUM 119, or antisense sequences, may also be used in gene therapy.
Polynucleotides of the invention may be used as a primer, e.g. a PCR primer, a primer for an alternative amplification reaction, a probe e.g. labelled with a revealing label by conventional means using radioactive or non-radioactive labels, or 20 the polynucleotides may be cloned into vectors.
Such primers, probes and other fragments will preferably be at least 10, preferably at least 15 or at least 20, for example at least 25, at least 30 or at least 40 nucleotides in length. They will typically be up to 40, 50, 60, 70, 100 or 150 nucleotides in length. Probes and fragments can be longer than 150 nucleotides in 25 length, for example up to 200, 300, 400, 500 or 600 nucleotides in length, or even up to a few nucleotides, such as five or ten nucleotides, short of the coding sequence of SEQ ID NO: 1.
The present invention also includes expression vectors that comprise nucleotide sequences encoding the proteins or variants thereof of the invention. Such 30 expression vectors are routinely constructed in the art of molecular biology and may for example involve the use of plasmid DNA and appropriate initiators, promoters, enhancers and other elements, such as for example polyadenylation signals which
may be necessary, and which are positioned in the correct orientation, in order to allow for protein expression. Other suitable vectors would be apparent to persons skilled in the art. By way of further example in this regard we refer to Sambrook et al. 1989.
5 Polynucleotides according to the invention may also be inserted into the vectors described above in an antisense orientation in order to provide for the production of antisense RNA. Antisense RNA or other antisense polynucleotides may also be produced by synthetic means. Such antisense polynucleotides may be used as test compounds in the assays of the invention or may be useful in a method lo of treatment of the human or animal body by therapy.
Preferably, a polynucleotide of the invention or for use in the invention in a vector is operably linked to a control sequence which is capable of providing for the expression of the coding sequence by the host cell, i.e. the vector is an expression vector. The term "operably linked" refers to a juxtaposition wherein the components 5 described are in a relationship permitting them to function in their intended manner.
A regulatory sequence, such as a promoter, "operably linked" to a coding sequence is positioned in such a way that expression of the coding sequence is achieved under conditions compatible with the regulatory sequence.
The vectors may be for example, plasmid, virus or phage vectors provided So with a origin of replication, optionally a promoter for the expression of the said polynucleotide and optionally a regulator of the promoter. The vectors may contain one or more selectable marker genes, for example an ampicillin resistance gene in the case of a bacterial plasmid or aresistance gene for a fungal vector. Vectors may be used in vitro, for example for the production of DNA or RNA or used to transfect or :5 transform a host cell, for example, a mammalian host cell. The vectors may also be adapted to be used in viva, for example in a method of gene therapy.
Promoters and other expression regulation signals may be selected to be compatible with the host cell for which expression is designed. For example, yeast promoters include S. cerevisiae GAL4 and ADH promoters, S. pombe nmtl and adh 30 promoter. Mammalian promoters include the metallothionein promoter which can be induced in response to heavy metals such as cadmium. Viral promoters such as the SV40 large T antigen promoter or adenovirus promoters may also be used. All these
promoters are readily available in the art.
Mammalian promoters, such as p-actin promoters, may be used. Tissue-
specific promoters are especially preferred. Viral promoters may also be used, for example the Moloney murine leukaemia virus long terminal repeat (MMLV LTR), 5 the rous sarcoma virus (RSV) LTR promoter, the SV40 promoter, the human cytomegalovirus (CMV) IE promoter, adenovirus, HSV promoters (such as the HSV IE promoters), or HPV promoters, particularly the HPV upstream regulatory region (URR). Viral promoters are readily available in the art.
The vector may further include sequences flanking the polynucleotide giving 0 rise to polynucleotides which comprise sequences homologous to eukaryotic genomic sequences, preferably mammalian genomic sequences, or viral genomic sequences. This will allow the introduction of the polynucleotides of the invention
into the genome of eukaryotic cells or viruses by homologous recombination. In particular, a plasmid vector comprising the expression cassette flanked by viral 15 sequences can be used to prepare a viral vector suitable for delivering the polynucleotides of the invention to a mammalian cell. Other examples of suitable viral vectors include herpes simplex viral vectors and retroviruses, including lentiviruses, adenoviruses, adeno-associated viruses and HPV viruses. Gene transfer techniques using these viruses are known to those skilled in the art. Retrovirus 20 vectors for example may be used to stably integrate the polynucleotide giving rise to the polynucleotide into the host genome. Replication-defective adenovirus vectors by contrast remain episomal and therefore allow transient expression.
The invention also includes cells that have been modified to express the HIPHUM 119 polypeptide or a variant thereof. Such cells include transient, or 25 preferably stable higher eukaryotic cell lines, such as mammalian cells or insect cells, using for example a baculovirus expression system, lower eukaryotic cells, such as yeast or prokaryotic cells such as bacterial cells. Particular examples of cells which may be modified by insertion of vectors encoding for a polypeptide according to the invention include mammalian HEK293T, CHO, HeLa and COS cells. Preferably the 30 cell line selected will be one which is not only stable, but also allows for mature glycosylation of a polypeptide. Expression may be achieved in transformed oocytes.
A polypeptide of the invention may be expressed in cells of a transgenic non-human
animal, preferably a mouse. A transgenic non-human animal expressing a polypeptide of the invention is included within the scope of the invention. A polypeptide of the invention may also be expressed in Xenopus laevis oocytes.
A polypeptide of the invention may be overexpressed in bacterial cells, such 5 as E.Coli, and isolated from the bacterial culture.
According to another aspect, the present invention also relates to antibodies, specific for a polypeptide of the invention. Such antibodies are for example useful in purification, isolation or screening methods involving immunoprecipitation techniques or, indeed, as therapeutic agents in their own right.
lo Antibodies may be raised against specific epitopes of the polypeptides according to the invention. Such antibodies may be used to block substrate binding to the receptor. An antibody, or other compound, "specifically binds" to a protein when it binds with preferential or high affinity to the protein for which it is specific but does substantially bind not bind or binds with only low affinity to other proteins.
5 A variety of protocols for competitive binding or immunoradiometric assays to determine the specific binding capability of an antibody are well known in the art (see for example Maddox et al, J. Exp. Med. 158, 1211-1226, 1993). Such immunoassays typically involve the formation of complexes between the specific protein and its antibody and the measurement of complex formation.
20 Antibodies of the invention may be antibodies to human polypeptides or fragments thereof. For the purposes of this invention, the term "antibody", unless specified to the contrary, includes fragments which bind a polypeptide of the invention. Such fragments include Fv, F(ab') and F(ab')2 fragments, as well as single chain antibodies. Furthermore, the antibodies and fragment thereof may be chimeric 25 antibodies, CDRgrafted antibodies or humanised antibodies.
Antibodies may be used in a method for detecting polypeptides of the invention in a biological sample, which method comprises: I providing an antibody of the invention; II incubating a biological sample with said antibody under conditions which 30 allow for the formation of an antibodyantigen complex; and III determining whether antibody-antigen complex comprising said antibody is formed.
A sample may be for example a tissue extract, blood, serum and saliva.
Antibodies of the invention may be bound to a solid support and/or packaged into kits in a suitable container along with suitable reagents, controls, instructions, etc. Antibodies may be linked to a revealing label and thus may be suitable for use in s methods of in viva HIPHUM 1 19 imaging.
Antibodies of the invention can be produced by any suitable method. Means for preparing and characterizing antibodies are well known in the art, see for example Harlow and Lane (1988) "Antibodies: A Laboratory Manual", Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. For example, an antibody may be lo produced by raising antibody in a host animal against the whole polypeptide or a fragment thereof, for example an antigenic epitope thereof, herein after the "immunogen". A method for producing a polyclonal antibody comprises immunising a suitable host animal, for example an experimental animal, with the immunogen and 5 isolating immunoglobulins from the animal's serum. The animal may therefore be inoculated with the immunogen, blood subsequently removed from the animal and the IgG fraction purified.
A method for producing a monoclonal antibody comprises immortalizing cells which produce the desired antibody. Hybridoma cells may be produced by 20 fusing spleen cells from an inoculated experimental animal with tumour cells (Kohler and Milstein (1975) Nature 256, 495-497).
An immortalized cell producing the desired antibody may be selected by a conventional procedure. The hybridomas may be grown in culture or injected intraperitoneally for formation of ascites fluid or into the blood stream of an 25 allogenic host or immunocompromised host. Human antibody may be prepared by in vitro immunization of human lymphocytes, followed by transformation of the lymphocytes with Epstein-Barr virus.
For the production of both monoclonal and polyclonal antibodies, the experimental animal is suitably a goat, rabbit, rat or mouse. If desired, the so immunogen may be administered as a conjugate in which the immunogen is coupled, for example via a side chain of one of the amino acid residues, to a suitable carrier.
The carrier molecule is typically a physiologically acceptable carrier. The antibody
obtained may be isolated and, if desired, purified.
An important aspect of the present invention is the use of polypeptides according to the invention in screening methods. The screening methods may be used to identify substances that bind to cysteine proteinase and in particular which 5 bind to HIPHUM 119 such as a substrate for the enzyme. Screening methods may also be used to identify agonists or antagonists which may modulate cysteine proteinase activity, inhibitors or activators of HIPHUM 119 activity, andlor agents which up-regulate or down-regulate HIPHUM 1 19 expression.
Any suitable format may be used for the assay. In general terms such lo screening methods may involve contacting a polypeptide of the invention with a test substance and monitoring for binding of the test substance to the polypeptide or measuring protease activity. A polypeptide of the invention may be incubated with a test substance. Modulation of cysteine proteinase activity may be determined. In a preferred aspect, the assay is a cell-based assay. Preferably the assay may be carried out in a single well of a microtitre plate. Assay formats which allow high throughput screening are preferred.
A typical assay for determining whether a test substance acts as an inhibitor or activator of HIPHUM 1 19 activity comprises contacting a fluorescent or colourimetric substrate with a polypeptide of the invention and a test substance and 20 monitoring protease activity by monitoring any change in the fluorescence or light emission. Any changes in the fluorescence of a substrate as a result of its proteolytic degradation by a polypeptide of the invention may be detected using a fluorescence plate reader. Colourimetic changes may be measured using a spectrophotometer.
The inhibitory or stimulatory activity of a test substance may be determined by 25 comparing any fluorescent or colourimetric changes observed in the presence of a test substance to any changes observed in the absence of a test substance and/or in the presence of a known inhibitor of HIPHUM 119 activity.
Modulator activity can be determined by contacting cells expressing a polypeptide of the invention with a substance under investigation and by monitoring 30 an effect mediated by the polypeptide. The cells expressing the polypeptide may be in vitro or in viva. The polypeptide of the invention may be naturally or recombinantly expressed. Preferably, the assay is carried out in vitro using cells
expressing recombinant polypeptide. Preferably, control experiments are carried out on cells which do not express the polypeptide of the invention to establish whether the observed responses are the result of activation of the polypeptide.
The binding of a test substance to a polypeptide of the invention can be 5 determined directly. For example, a radiolabelled test substance can be incubated with the polypeptide of the invention and binding of the test substance to the polypeptide can be monitored. Typically, the radiolabelled test substance can be incubated with cell membranes containing the polypeptide until equilibrium is reached. The membranes can then be separated from a non-bound test substance and 0 dissolved in scintillation fluid to allow the radioactive content to be determined by scintillation counting. Non-specific binding of the test substance may also be determined by carrying out a competitive binding assay.
Substances that inhibit the interaction of a polypeptide of the invention with a HIPHUM 1 19 substrate or with another protease may also be identified through a 5 yeast 2-hybrid assay or other protein interaction assay such as a co-
immunoprecipitation or an ELISA based technique.
Assays may be carried out using cells expressing HIPHUM 1 19, and incubating such cells with the test substance optionally in the presence of a HIPHUM 1 19 substrate. The results of the assay are compared to the results obtained using the 20 same assay in the absence of the test substance. Cells expressing HIPHUM 119 constitutively may be provided for use in assays for HIPHUM 1 19 function.
Additional test substances may be introduced in any assay to look for inhibitors or activators of substrate binding or inhibitors or activators of protease activity.
Assays may also be carried out to identify substances which modify 25 HIPHUM 119 expression, for example substances which up- or down- regulate expression. Such assays may be carried out for example by using antibodies for HIPHUM 1 19 to monitor levels of HIPHUM 1 19 expression. Other assays which can be used to monitor the effect of a test substance on HIPHUM 1 19 expression include using a reporter gene construct driven by the HIPHUM 1 19 regulatory 30 sequences as the promoter sequence and monitoring for expression of the reporter polypeptide. Additional control experiments may be carried out
Suitable test substances which can be tested in the above assays include combinatorial libraries, defined chemical entities and compounds, peptide and peptide mimetics, oligonucleotides and natural product libraries, such as display (e.g. phage display libraries) and antibody products.
5 Typically, organic molecules will be screened, preferably small organic molecules which have a molecular weight of from 50 to 2500 daltons. Candidate products can be biomolecules including, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof. Candidate agents are obtained from a wide variety of sources including libraries of synthetic or lo natural compounds. Known pharmacological agents may be subjected to directed or random chemical modifications, such as acylation, alkylation, esterification, amidification, etc. to produce structural analogs.
Test substances may be used in an initial screen of, for example, 10 substances per reaction, and the substances of these batches which show inhibition or 5 activation tested individually. Test substances may be used at a concentration of from lnM to lOmM, preferably from, lOOnM to lOOOpM or from 1pM to lOO,uM, more preferably from 1 AM to 1 O,uM. Preferably, the activity of a test substance is compared to the activity shown by a known activator or inhibitor. A test substance which acts as an inhibitor may produce a 50% inhibition of activity of the receptor.
So Alternatively a test substance which acts as an activator may produce 50% of the maximal activity produced using a known activator.
Another aspect of the present invention is the use of polynucleotides encoding the HIPHUM 119 polypeptides of the invention to identify mutations in HIPHUM 119 genes which may be implicated in human disorders. Identification of :5 such mutations may be used to assist in diagnosis or susceptibility to such disorders and in assessing the physiology of such disorders. Polynucleotides may also be used in hybridization studies to monitor for up- or down-regulation of HIPHUM 1 19 expression. Polynucleotides such as SEQ ID NO: 1 or fragments thereof may be used to identify allelic variants, genomic DNA and species variants.
30 The present invention provides a method for detecting variation in the expressed products encoded by HIPHUM 1 19 genes. This may comprise
determining the level of an HIPHUM 119 expressed in cells or determining specific alterations in the expressed product. Sequences of interest for diagnostic purposes include, but are not limited to, the conserved portions as identified by sequence similarity and conservation of intron/exon structure. The diagnosis may be performed 5 in conjunction with kindred studies to determine whether a mutation of interest co-
segregates with disease phenotype in a family.
Diagnostic procedures may be performed on polynucleotides isolated from an individual or alternatively, may be performed in situ directly upon tissue sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections, such that lo no nucleic acid purification is necessary. Appropriate procedures are described in, for example, Nuovo, G. J., 1992, "PCR In Situ Hybridization: Protocols And Applications", Raven Press, NY). Such analysis techniques include, DNA or RNA blotting analyses, single stranded confirmational polymorphism analyses, in situ hybridization assays, and polymerase chain reaction analyses. Such analyses may 5 reveal both quantitative aspects ofthe expression pattern of a HIPHUM 119, and qualitative aspects of HIPHUM 119 expression and/or composition.
Alternative diagnostic methods for the detection of HIPHUM 119 nucleic acid molecules may involve their amplification, e.g. by PCR (the experimental embodiment set forth in U.S. Patent No. 4,683,202), ligase chain reaction (Barany, 20 1991, Proc. Natl. Acad. Sci. USA 88:189-193), self sustained sequence replication (Guatelli et al., 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et al., 1989, Proc. Natl. Acad. Sci. 15 USA 86: 1173 1177), Q-Beta Replicase (Lizard) et al., 1988, Bio/Technology 6:1197) or any other nucleic acid amplification method, followed by the detection of the amplified 25 molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers.
Particularly suitable diagnostic methods are chip-based DNA technologies such as those described by Hacia et al., 1996, Nature Genetics 14:441-447 and 30 Shoemaker et al., 1996, Nature Genetics 14:450-456. Briefly, these techniques involve quantitative methods for analyzing large numbers of nucleic acid sequence targets rapidly and accurately. By tagging with oligonucleotides or using fixed probe
arrays, one can employ chip technology to segregate target molecules as high density arrays and screen these molecules on the basis of hybridization. l Following detection, the results seen in a given patient may be compared with a statistically significant reference group of normal patients and patients that have 5 HIPHUM 119 related pathologies. In this way, it is possible to correlate the amount or kind of HIPHUM 119 encoded product detected with various clinical states or predisposition to clinical states.
Another aspect of the present invention is the use of the substances that have been identified by screening techniques referred to above in the treatment of disease o states, which are responsive to regulation of cysteine proteinase activity. The treatment may be therapeutic or prophylactic. The condition of a patient suffering from such a disease state can thus be improved.
In particular, such substances may be used in the treatment of HIV infection, lung cancer, inflammatory disease such as asthma, Hepatitis B and brain diseases 5 such as Alzheimer's disease, parasupranuclear palsey and Huntington's disease.
Additional disease states that may be treated include chronic obstructive pulmonary disease (COPD), breast cancer, neuronal ceroid lipofuscinosis, Badet-
Biedl syndrome, multiple sclerosis and allergic encephalomyelitis.
Substances that act as inhibitors of HIPHUM 1 19 activity may be used in the 20 treatment of disease states in which HIPHUM 119 expression is upregulated such as stimulated bone marrow, HIV infection and HIV/PBL infection. Substances that act as activators of HIPHUM 1 19 activity may be used in the treatment of disease states in which expression of HIPHUM 119 is down-regulated such as brain diseases (Alzheimer's disease, parasupranuclear palsey, and Huntington's disease), lung 25 tumors, lung asthma and Hepatitis B infection.
Substances identified according to the screening methods outlined above may be formulated with standard pharmaceutically acceptable carriers and/or excipients as is routine in the pharmaceutical art. For example, a suitable substance may be dissolved in physiological saline or water for injections. The exact nature of a 30 formulation will depend upon several factors including the particular substance to be administered and the desired route of administration. Suitable types of formulation are fully described in Remington's Pharmaceutical Sciences, Mack Publishing
Company, Eastern Pennsylvania, 1 7th Ed. 1985, the disclosure of which is included
herein of its entirety by way of reference.
The substances may be administered by enteral or parenteral routes such as via oral, buccal, anal, pulmonary, intravenous, intra-arterial, intramuscular, s intraperitoneal, topical or other appropriate administration routes.
A therapeutically effective amount of a modulator is administered to a patient. The dose of a modulator may be determined according to various parameters, especially according to the substance used; the age, weight and condition of the patient to be treated; the route of administration; and the required regimen. A o physician will be able to determine the required route of administration and dosage for any particular patient. A typical daily dose is from about 0.1 to 50 mg per kg of body weight, according to the activity of the specific modulator, the age, weight and conditions of the subject to be treated, the type and severity of the degeneration and the frequency and route of administration. Preferably, daily dosage levels are from 5 5 mgto2g.
Nucleic acid encoding HIPHUM 1 19 or a variant thereof which inhibits HIPHUM 1 19 activity may be administered to the mammal. In particular, a nucleic acid encoding a polypeptide with HIPHUM 1 19 activity may be administered to a subject suffering from a condition in which HIPHUM 119 expression is down-
20 regulated, such as Alzheimer's disease, parasupranuclear palsey, Huntington's disease, lung tumor, lung asthma and Hepatitis B infection. A nucleic acid encoding a variant of HIPHUM 1 19 that inhibits HIPHUM 1 19 activity may be administered to a patient suffering from a condition in which HIPHUM 1 19 expression is up-
regulated such as HIV infection and HIV/PBL infection. Nucleic acid, such as RNA 2s or DNA, and preferably, DNA, is provided in the form of a vector, such as the polynucleotides described above, which may be expressed in the cells of the mammal. Nucleic acid encoding the polypeptide may be administered by any available technique. For example, the nucleic acid may be introduced by needle injection, so preferably intradermally, subcutaneously or intramuscularly. Alternatively, the nucleic acid may be delivered directly across the skin using a nucleic acid delivery device such as particle-mediated gene delivery. The nucleic acid may be
administered topically to the skin, or to mucosal surfaces for example by intranasal, oral, intravaginal or intrarectal administration.
Uptake of nucleic acid constructs may be enhanced by several known transfection techniques, for example those including the use of transfection agents.
5 Examples of these agents includes cationic agents, for example, calcium phosphate and DEAE-Dextran and lipofectants, for example, lipofectam and transfectam. The dosage of the nucleic acid to be administered can be altered. Typically the nucleic acid is administered in the range of lpg to lmg, preferably to lpg to long nucleic acid for particle mediated gene delivery and long to lmg for other routes.
lo The following Examples illustrate the invention.
Example 1: Characterisation of the sequence A cysteine proteinase, designated as HIPHUM 119 has been identified. The nucleotide and amino acid sequences of the receptor have been determined. These are set out below in SEQ ID NOs: 1 and 2. Suitable primers and probes were 15 designed and used to analyse tissue expression. HIPHUM 119 was found to be expressed in all tissues at various levels (Figure 1). It is highly expressed in adrenal, cerebellum, rectum, testis, thyroid and urinary bladder. It is also expressed at significant levels in lung, fetal brain, skeletal muscle, tonsil and uterus. In addition, it is expressed in T cells, peripheral blood mononucleocytes (PBMNCs), monocytes 20 and dendritic cells.
HIPPIHUM 1 19 expression levels are down regulated in lung tumors, lung asthma, and Hepatitis B infection (Figure 2).
HIPPIHUM 119 expression levels are down regulated in Alzheimer's disease, parasupranuclear palsey, Huntington's disease, lung tumors, lung asthma, and 25 Hepatitis B infection (Figure 3).
HIPPIHUM 119 expression levels are elevated in stimulated bone marrow (Figure 4), HIV infection and HIV/PBL infection (Figure 5).
The chromosomal localization was also mapped. Human HIPHUM 119 has been mapped to 15q22.
30 Example 2: Screening for substances which exhibit protein modulating activity Preparations of a purif ed polypeptide of the invention are generated for
screening purposes. 96 and 384 well plate, high throughput screens (HTS) are employed using fluorescence or colourimetric indicator molecules. Secondary screening involves the same technology. Tertiary screens involve the study of modulators in rat, mouse and guinea-pig models of disease relevant to the target.
5 A brief screening assay protocol is as follows: A polypeptide of the invention is expressed in E. Coli, purified and refolded by direct dilution in assay buffer (200mM NaCl, 50mM Tris, 5mM CaCl2, lO,uM ZnSO4, 0.01% Brij 35, pH 7.5). Test substances are provided in pools of 10 at 5mM for each test substance for high throughput screening or are serially diluted in dose lo response assays. The screening assay is run on an automated system incorporating an OCRA rail to move the plates, a Tecan liquid handler, a Multimek liquid handler and a Titertek liquid handler. (The assay may also be run on manually or in combination with any suitable liquid handling equipment.) 60 Ill of serially diluted test substance is added to 96 well plates (4 control 5 wells with no inhibitor) , 60 al of a solution of a polypeptide of the invention (0.5 EM to 150 EM) is then added to each well and 50 mM EDTA (20 ill of 0.5M EDTA) is added to 4 inhibited control wells. In a fluorescence assay, the plates are read in a Fluostar (SLT) fluorescence plate reader or equivalent at an excitation of 343 nm and an emission of 450 nm. In a colourimetric assay the plates are read continuously in a So SLT spectrophotometer at 405 nm for 3 minutes.
Percentage inhibition is calculated for each concentration (unknown values = U) in the dose response based on the range determined using the value of the difference of the conko1 (no test substance) (mean of 4 = C1) and the EDTA treated wells (mean of 4 = C2) using the equation 100*(1 - (U-C2)/(C 1 -C2).
SEQUENCE LISTING
5 <110> GLAXO GRPUP LIMITED
<120> New Protein <130> QG1034(p80209) <160> 2
<170> PatentIn version 3.0 <210> 1
<211> 639
<212> DNA
<213> Homo sapiens <220> <221> CDS
<222> (1)..(636)
25 <400> 1
atg gac ccc gta gtc ttg agt tac atg gac agt cta ctg cgg caa lea 48 Met Asp Pro Val Val Leu Ser Tyr Met Asp Ser Leu Leu Arg Gln Ser 1 5 10 15
30 get gtc lea cta ttg get ccg cca agc tgg ctc eat gac cat att att 96 Asp Val Ser Leu Leu Asp Pro Pro Ser Trp Leu Asn Asp His Ile Ile 20 25 30
999 ttt gcg ttt gag tac ttt gcc aac agt cag ttt cat gac tgc let 144 35 Gly Phe Ala Phe Glu Tyr Phe Ala Asn Ser Gln Phe His Asp Cys Ser 35 40 45
get cac gtc agt ttc ale agc cct gaa gtc acc cag ttc ale aag tgc 192 Asp His Val Ser Phe Ile Ser Pro Glu Val Thr Gln Phe Ile Lys Cys 40 50 55 60
act agc aac cca gca gag att gcc atg ttc ctt gaa cca ctg gac ctc 240 Thr Ser Asn Pro Ala Glu Ile Ala Met Phe Leu Glu Pro Leu Asp Leu 65 70 75 80
ccc aac aag age gtt gta ttt tta gcc ale eat get aac tcc aac cag 288 Pro Asn Lys Arg Val Val Phe Leu Ala Ile Asn Asp Asn Ser Asn Gln 85 90 95
50 gca gct gga gga acc cac tgg agt tta ttg gtc tac ctc caa get aaa 336 Ala Ala Gly Gly Thr His Trp Ser Leu Leu Val Tyr Leu Gln Asp Lys 100 105 110
eat agc tft ttt cat tat get tcc cat agc agg agc aac lea gtt cac 384 55 Asn Ser Phe Phe His Tyr Asp Ser His Ser Arg Ser Asn Ser Val His 115 120 125
gca aeg cag gta gca gag aaa ctg gag gct ttc tta ggc age aaa gga 432 Ala Lys Gln Val Ala Glu Lys Leu Glu Ala Phe Leu Gly Arg Lys Gly 130 135 140
5 gac aaa ctg gcc ttt gtg gaa gag aaa gcc cct gcc caa caa aac agc 480 AspLys Leu Ala Phe Val Glu Glu Lys Ala Pro Ala Gln Gln Asn Ser 145 150 155 160
tat gac tgt ggg atg tac gtg ata tgt aac act gag gcc ttg tgt cag 528 IO Tyr Asp Cys Gly Met Tyr Val Ile Cys Asn Thr Glu Ala Leu Cys Gln 165 170 175
aac ttc ttt agg caa cag ace gaa lea ctg ctg cag cta ctc acc cct 576 Asn Phe Phe Arg Gln Gln Thr Glu Ser Leu Leu Gln Leu Leu Thr Pro 15 180 185 190
gca tac ale ace aag aag agg gga gaa tgg aaa get ctc att gcc ace 624 Ala Tyr Ile Thr Lys Lys Arg Gly Glu Trp Lys Asp Leu Ile Ala Thr 195 200 205
ctt gct aaa aag tag 639 Leu Ala Lys Lys <210> 2
<211> 212
<212> PRT
<213> Homo sapiens <400> 2
Met Asp Pro Val Val Leu Ser Tyr Met Asp Ser Leu Leu Arg Gln Ser 1 5 10 15
Asp Val Ser Leu Leu Asp Pro Pro Ser Trp Leu Asn Asp His Ile Ile 20 25 30
Gly Phe Ala Phe Glu Tyr Phe Ala Asn Ser Gln Phe His Asp Cys Ser 40 35 40 45
Asp His Val Ser Phe Ile Ser Pro Glu Val Thr Gln Phe Ile Lys Cys 50 55 60
45 Thr Ser Asn Pro Ala Glu Ile Ala Met Phe Leu Glu Pro Leu Asp Leu 65 70 75 80
Pro Asn Lys Arg Val Val Phe Leu Ala Ile Asn Asp Asn Ser Asn Gln 85 90 95
Ala Ala Gly Gly Thr His Trp Ser Leu Leu Val Tyr Leu Gln Asp Lys 100 105 110
Asn Ser Phe Phe His Tyr Asp Ser His Ser Arg Ser Asn Ser Val His 55 115 120 125
Ala Lys Gln Val Ala Glu Lys Leu Glu Ala Phe Leu Gly Arg Lys Gly
130 135 140
Asp Lys Leu Ala Phe Val Glu Glu Lys Ala Pro Ala Gln Gln Asn Ser 5 145 150 155 160
Tyr Asp Cys Gly Met Tyr Val Ile Cys Asn Thr Glu Ala Leu Cys Gln 165 170 175
IO Asn Phe Phe Arg Gln Gln Thr Glu Ser Leu Leu Gln Leu Leu Thr Pro 180 185 190
Ala Tyr Ile Thr Lys Lys Arg Gly Glu Trp Lys Asp Leu Ile Ala Thr 195 200 205
Leu Ala Lys Lys

Claims (16)

1. An isolated cysteine proteinase polypeptide comprising (i) the amino acid sequence of SEQ ID NO: 2; or 5 (ii) a variant thereof which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO; or (iii) a fragment of (i) or (ii) which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release lo the active form of SUMO.
2. A polypeptide according to claim 1 wherein the variant (ii) has at least 80% identity to the amino acid sequence of SEQ ID NO: 2.
3. A polynucleotide encoding a polypeptide according to claim 1 or 2.
4. A polynucleotide according to claim 3 which is a cDNA sequence.
5. A polynucleotide encoding a cysteine proteinase polypeptide which is capable of cleaving SUMO from a target protein and/or cleaving the precursor form of SUMO to release the active form of SUMO which polynucleotide comprises: (a) the nucleic acid sequence of SEQ ID NO: 1 and/or a sequence complementary thereto; 20 (b) a sequence which hybridises under stringent conditions to a sequence as defined in (a); (c) a sequence that is degenerate as a result of the genetic code to a sequence as defined in (a) or (b); or (d) a sequence having at least 60% identity to a sequence as defined in 25 (a), (b) or (c).
6. An expression vector comprising a polynucleotide according to any one of claims 3 to 5.
7. A host cell comprising an expression vector according to claim 6.
8. An antibody specific for a polypeptide according to claim 1 or 2.
30
9. A method for the identification of a substance that modulates cysteine proteinase activity and/or expression, which method comprises: (i) contacting a test substance and a polypeptide according to claim I or
2, a polynucleotide according to any one of claims 3 to 5, an expression vector according to claim 6 or a host cell according to claim 7, and (ii) determining the effect of the test substance on the activity and/or 5 expression of the said polypeptide or the polypeptide encoded by said polynucleotide, thereby to determine whether the test substance modulates cysteine proteinase activity and/or expression.
10. A method according to claim 9 wherein the polypeptide is in a substantially isolated form.
l o
1 1. A substance which modulates cysteine proteinase activity and which is identifiable by a method according to claim 9 or 10.
12. A method of treating a subject having a disorder that is responsive to cysteine proteinase modulation, which method comprises administering to said subject an effective amount of a substance according to claim 11.
is
13. A method according to claim 12 wherein the disorder is selected from HIV infection, lung cancer, inflammatory disease, Hepatitis B and brain disease.
14. Use of a substance as defined in claim 11 in the manufacture of a medicament for treatment or prophylaxis of a disorder that is responsive to stimulation or modulation of cysteine proteinase activity.
20
15. A use according to claim 14 wherein the disorder is selected from HIV infection, lung cancer, inflammatory disease, Hepatitis B and brain disease.
16. A method of producing a polypeptide according to claim 1 or 2, which method comprises maintaining a host cell as defined in claim 7 under conditions suitable for obtaining expression of the polypeptide and isolating the said 25 polypeptide.
GB0127017A 2000-11-15 2001-11-09 Cysteine protease polypeptide Withdrawn GB2372504A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GBGB0027905.9A GB0027905D0 (en) 2000-11-15 2000-11-15 New protein

Publications (2)

Publication Number Publication Date
GB0127017D0 GB0127017D0 (en) 2002-01-02
GB2372504A true GB2372504A (en) 2002-08-28

Family

ID=9903240

Family Applications (2)

Application Number Title Priority Date Filing Date
GBGB0027905.9A Ceased GB0027905D0 (en) 2000-11-15 2000-11-15 New protein
GB0127017A Withdrawn GB2372504A (en) 2000-11-15 2001-11-09 Cysteine protease polypeptide

Family Applications Before (1)

Application Number Title Priority Date Filing Date
GBGB0027905.9A Ceased GB0027905D0 (en) 2000-11-15 2000-11-15 New protein

Country Status (2)

Country Link
US (1) US20020127692A1 (en)
GB (2) GB0027905D0 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE60015559T2 (en) * 1999-07-31 2005-10-27 Board of Regents, The University of Texas System, Austin SENTRIN-SPECIFIC HUMAN PROTEASE SENP1
US8133733B2 (en) 2003-10-24 2012-03-13 Gencia Corporation Nonviral vectors for delivering polynucleotides to target tissues
US20090123468A1 (en) 2003-10-24 2009-05-14 Gencia Corporation Transducible polypeptides for modifying metabolism
US8062891B2 (en) 2003-10-24 2011-11-22 Gencia Corporation Nonviral vectors for delivering polynucleotides to plants
JP4838722B2 (en) 2003-10-24 2011-12-14 ゲンシア コーポレーション Methods for delivering polynucleotides and compositions for delivery
US8507277B2 (en) 2003-10-24 2013-08-13 Gencia Corporation Nonviral vectors for delivering polynucleotides
WO2012064897A2 (en) * 2010-11-09 2012-05-18 Yuan Chen Bicyclic and tricyclic inhibitors of sumoylation enzymes and methods for their use

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001023558A2 (en) * 1999-09-28 2001-04-05 Incyte Genomics, Inc. Human secretory molecules
WO2002020756A2 (en) * 2000-09-05 2002-03-14 Incyte Genomics, Inc. Secretory molecules

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001023558A2 (en) * 1999-09-28 2001-04-05 Incyte Genomics, Inc. Human secretory molecules
WO2002020756A2 (en) * 2000-09-05 2002-03-14 Incyte Genomics, Inc. Secretory molecules

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Genbank Accession Number AF308450, submitted 22/09/2000, Wang, Y. -G., "Homo sapiens cysteine protease..." *
Genbank Accession Number AY008293, submitted 26/09/2000, Gong, L. et al., "Homo sapiens sentrin/SUMO-specific protease..." *

Also Published As

Publication number Publication date
GB0027905D0 (en) 2000-12-27
US20020127692A1 (en) 2002-09-12
GB0127017D0 (en) 2002-01-02

Similar Documents

Publication Publication Date Title
JP2003527067A (en) ACRP30R1L, a homologue of ACRP30 (30 KD adipocyte complement-related protein)
JP2006325596A (en) Sialoadhesin family member-2 (saf-2)
JP2004508825A (en) New compound
EP0887414A2 (en) Human serine proteases HGBAB90
US6515108B2 (en) Human Wnt7a polypeptides
GB2372504A (en) Cysteine protease polypeptide
JPH11217399A (en) Kringle-relating clone hthbz47
JP2003526322A (en) Secreted cysteine-rich protein-6 (scrp-6)
JPH11183A (en) Polynucleotide of rat cathepsin k and polypeptide sequence
US20030165818A1 (en) Novel protein
US7057021B2 (en) Genes encoding caspase recruitment domain polypeptides
US20040072268A1 (en) Methods for regulating BRCA1-BRCA2-containing complex activity
JPH1142093A (en) Sialoadhesin family member-3
GB2382078A (en) Protease polypeptide
JPH11164692A (en) Human rce1
GB2372994A (en) Cysteine protease polypeptide
GB2371801A (en) Cysteine protease polypeptides
US20020076800A1 (en) Protein
JP2002521057A (en) Human protein kinase H2LAU20
JP2004041214A (en) SIALOADHESIN FAMILY 4 (SAF-4) cDNA
US20040072227A1 (en) Integrin ligand, human mindin
GB2376234A (en) Trypsin-like serine protease
US20020076758A1 (en) Polypeptide
JPH11103867A (en) Epo primary response gene1, eprg1
US20010009905A1 (en) ACRP30R1: A homolog of ACRP30 (30 kd adipocyte complement-related protein)

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)