AU777343B2 - Polynucleotides and polypeptides encoded thereby distantly homologous to heparanase - Google Patents

Polynucleotides and polypeptides encoded thereby distantly homologous to heparanase Download PDF

Info

Publication number
AU777343B2
AU777343B2 AU52448/00A AU5244800A AU777343B2 AU 777343 B2 AU777343 B2 AU 777343B2 AU 52448/00 A AU52448/00 A AU 52448/00A AU 5244800 A AU5244800 A AU 5244800A AU 777343 B2 AU777343 B2 AU 777343B2
Authority
AU
Australia
Prior art keywords
leu
seq
nucleic acid
polynucleotide
arg
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU52448/00A
Other versions
AU5244800A (en
Inventor
Hanan Itzhaki
Israel Michal
Iris Pecker
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Insight Biopharmaceuticals Ltd
Original Assignee
Insight Biopharmaceuticals Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Insight Biopharmaceuticals Ltd filed Critical Insight Biopharmaceuticals Ltd
Publication of AU5244800A publication Critical patent/AU5244800A/en
Assigned to INSIGHT BIOPHARMACEUTICALS LTD. reassignment INSIGHT BIOPHARMACEUTICALS LTD. Amend patent request/document other than specification (104) Assignors: INSIGHT STRATEGY & MARKETING LTD.
Application granted granted Critical
Publication of AU777343B2 publication Critical patent/AU777343B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y302/00Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
    • C12Y302/01Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
    • C12Y302/01166Heparanase (3.2.1.166)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/24Hydrolases (3) acting on glycosyl compounds (3.2)
    • C12N9/2402Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)

Description

WO 01/00643 PCT/IL00/00358 POLYNUCLEOTIDES AND POLYPEPTIDES ENCODED THEREBY DISTANTLY HOMOLOGOUS TO HEPARANASE FIELD AND BACKGROUND OF THE INVENTION The present invention relates to novel polynucleotides encoding polypeptides distantly homologous to heparanase, nucleic acid constructs including the polynucleotides, genetically modified cells expressing same, recombinant proteins encoded thereby and which may have heparanase or other glycosyl hydrolase activity, antibodies recognizing the recombinant proteins, oligonucleotides and oligonucleotide analogs derived from the polynucleotides and ribozymes including same.
Citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention.
Glycosaminoglycans (GAGs) GAGs are polymers of repeated disaccharide units consisting of uronic acid and a hexosamine. Biosynthesis of GAGs except hyaluronic acid is initiated from a core protein. Proteoglycans may contain several GAG side chains from similar or different families. GAGs are synthesized as homopolymers which may subsequently be modified by N-deacetylation and N-sulfation, followed by C5-epimerization of glucuronic acid to iduronic acid and O-sulfation. The chemical composition of GAGs from various tissues varies highly.
The natural metabolism of GAGs in animals is carried out by hydrolysis. Generally, the GAGs are degraded in a two step procedure.
First the proteoglycans are internalized in endosomes, where initial depolymerization of the GAG chain takes place. This step is mainly hydrolytic and yields oligosaccharides. Further degradation is carried out after fusion with lysosome, where desulfation and exolytic depolymerization to monosaccharides take place (42).
The only mammalian GAG degrading endolytic enzymes characterized so far are the hyaluronidases. The hyaluronidases are a family of 1-4 endoglucosaminidases that depolymerize hyaluronic acid and chondroitin sulfate. The cDNAs encoding sperm associated (Hyal3), and the lysosomal hyaluronidases Hyal 1 and Hyal2 were cloned and published These enzymes share an overall homology of 40 and have different tissue specificities, cellular localizations and PH optima.
WO 01/00643 PCT/IL0/00358 2 Exolytic hydrolases are better characterized, among which are Pglucoronidase, a-L-iduronidase, and P-N-acetylglucosaminidase. In addition to hydrolysis of the glycosidic bond of the polysaccharide chain, GAG degradation involves desulfation, which is catalyzed by several lysosomal sulfatases such as N-acetylgalactosamine-4-sulfatase, iduronate- 2-sulfatase and heparin sulfamidase. Deficiency in any of lysosomal GAG degrading enzymes results in a lysosomal storage disease, mucopolysaccharidosis.
Glycosyl hydrolases: Glycosyl hydrolases are a widespread group of enzymes that hydrolyze the o-glycosidic bond between two or more carbohydrates or between a carbohydrate and a noncarbohydrate moiety. The enzymatic hydrolysis of glycosidic bond occurs by using major one or two mechanisms leading to overall retention or inversion of the anomeric configuration. In both mechanisms catalysis involves two residues: a proton donor and a nucleophile. Glycosyl hydrolyses have been classified into 58 families based on amino acid similarities. The glycosyl hydrolyses from families 1, 2, 5, 10, 17, 30, 35, 39 and 42 act on a large variety of substrates, however, they all hydrolyze the glycosidic bond in a general acid catalysis mechanism, with retention of the anomeric configuration.
The mechanism involves two glutamic acid residues, which are the proton donors and the nucleophile, with an aspargine always preceding the proton donor. Analyses of a set of known 3D structures from this group revealed that their catalytic domains, despite the low level of sequence identity, adopt a similar 8 fold with the proton donor and the nucleophile located at the C-terminal ends of strands p4 and p7, respectively.
Mutations in the functional conserved amino acids of lysosomal glycosyl hydrolases were identified in lysosomal storage diseases.
Lysosomal glycosyl hydrolases including P-glucuronidase, Pmanosidase, p-glucocerebrosidase, p-galactosidase and a-L-iduronidase, are all exo-glycosyl hydrolases, belong to the GH-A clan and share a similar catalytic site. However, many endo-glucanases from various organisms, such as bacterial and fungal xylenases and cellulases share this catalytic domain Heparan sulfate proteoglycans (HSPGs) HSPGs are ubiquitous macromolecules associated with the cell surface and extracellular matrix (ECM) of a wide range of cells of vertebrate and invertebrate tissues The basic HSPG structure WO 01/00643 PCT/IL00/00358 3 consists of a protein core to which several linear heparan sulfate chains are covalently attached. The polysaccharide chains are typically composed of repeating hexuronic and D-glucosamine disaccharide units that are substituted to a varying extent with N- and O-linked sulfate moieties and N-linked acetyl groups Studies on the involvement of ECM molecules in cell attachment, growth and differentiation revealed a central role of HSPGs in embryonic morphogenesis, angiogenesis, metastasis, neurite outgrowth and tissue repair The heparan sulfate (HS) chains, which are unique in their ability to bind a multitude of proteins, ensure that a wide variety of effector molecules cling to the cell surface (6- HSPGs are also prominent components of blood vessels In large vessels they are concentrated mostly in the intima and inner media, whereas in capillaries they are found mainly in the subendothelial basement membrane where they support proliferating and migrating endothelial cells and stabilize the structure of the capillary wall. The ability of HSPGs to interact with ECM macromolecules such as collagen, laminin and fibronectin, and with different attachment sites on plasma membranes suggests a key role for this proteoglycan in the self-assembly and insolubility of ECM components, as well as in cell adhesion and locomotion. Cleavage of HS may therefore result in disassembly of the subendothelial ECM and hence may play a decisive role in extravasation of normal and malignant blood-borne cells HS catabolism is observed in inflammation, wound repair, diabetes, and cancer metastasis, suggesting that enzymes which degrade HS play important roles in pathologic processes.
Heparanase Heparanase is a glycosylated enzyme that is involved in the catabolism of certain glycosaminoglycans. It is an endoglucouronidase that cleaves heparan sulfate at specific intrachain sites (12-15). Interaction of T and B lymphocytes, platelets, granulocytes, macrophages and mast cells with the subendothelial extracellular matrix (ECM) is associated with degradation of heparan sulfate by heparanase activity Connective tissue activating peptide III (CTAP), a c-chemokine, was found to have heparanase-like activity. Placenta heparanase acts as an adhesion molecule or as a degradative enzyme depending on the pH of the microenvironvent (17).
Heparanase is released from intracellular compartments lysosomes, specific granules) in response to various activation signals WO 01/00643 PCT/IL0/00358 4 thrombin, calcium ionophores, immune complexes, antigens and mitogens), suggesting its regulated involvement in inflammation and cellular immunity responses (16).
It was also demonstrated that heparanase can be readily released from human neutrophils by 60 minutes incubation at 4 C in the absence of added stimuli (18).
Gelatinase, another ECM degrading enzyme which is found in tertiary granules of human neutrophils with heparanase, is secreted from the neutrophils in response to phorbol 12-myristate 13-acetate (PMA) io treatment (19-20).
In contrast, various tumor cells appear to express and secrete heparanase in a constitutive manner in correlation with their metastatic potential (21).
Degradation of heparan sulfate by heparanase results in the release of heparin-binding growth factors, enzymes and plasma proteins that are sequestered by heparan sulfate in basement membranes, extracellular matrices and cell surfaces (22-23).
Heparanase activity has been described in a number of cell types including cultured skin fibroblasts, human neutrophils, activated rat Tlymphocytes, normal and neoplastic murine B-lymphocytes, human monocytes and human umbilical vein endothelial cells, SK hepatoma cells, human placenta and human platelets.
A procedure for purification of natural heparanase was reported for SK hepatoma cells and human placenta Pat. No. 5,362,641) and for human platelets derived enzymes (62).
Cloning and expression of the heparanase gene A purified fraction of heparanase isolated from human hepatoma cells was subjected to tryptic digestion. Peptides were separated by high pressure liquid chromatography (HPLC) and micro sequenced. The sequence of one of the peptides was used to screen data bases for homology to the corresponding back translated DNA sequence. This procedure led to the identification of a clone containing an insert of 1020 base pairs (bp) which included an open reading frame of 963 bp followed by 27 bp of 3' untranslated region and a poly A tail. The new gene was designated hpa. Cloning of the missing 5' end of hpa was performed by Marathon RACE from placenta cDNA composite. The joined hpa cDNA (also referred to as phpa) fragment contained an open reading frame, which encodes a polypeptide of 543 amino acids with a calculated WO 01/00643 PCT/IL00/00358 molecular weight of 61,192 daltons The cloning procedures are described in length in U.S. Pat. application Nos. 08/922,170, 09/109,386, and 09/258,892, the latter is a continuation-in-part of PCT/US98/17954, filed August 31, 1998, all of which are incorporated herein by reference.
The genomic locus which encodes heparanase spans about 40 kb. It is composed of 12 exons separated by 11 introns and is localized on human chromosome 4.
The ability of the hpa gene product to catalyze degradation of heparan sulfate (HS) in vitro was examined by expressing the entire open o1 reading frame of hpa in High five and Sf21 insect cells, and the mammalian human 293 embryonic kidney cell line expression systems.
Extracts of infected or transfected cells were assayed for heparanase catalytic activity. For this purpose, cell lysates were incubated with sulfate labeled, ECM-derived HSPG (peak followed by gel filtration analysis (Sepharose 6B) of the reaction mixture. While the substrate alone consisted of high molecular weight material, incubation of the HSPG substrate with lysates of cells infected or transfected with hpa containing vectors resulted in a complete conversion of the high molecular weight substrate into low molecular weight labeled heparan sulfate degradation fragments (see, for example, U.S. Pat. application No. 09/071,618, which is incorporated herein by reference.
In other experiments, it was demonstrated that the heparanase enzyme expressed by cells infected with a pFhpa virus is capable of degrading HS complexed to other macromolecular constituents fibronectin, laminin, collagen) present in a naturally produced intact ECM (see U.S. Pat. application No. 09/109,386, which is incorporated herein by reference), in a manner similar to that reported for highly metastatic tumor cells or activated cells of the immune system 8).
Preferential expression of the hpa gene in human breast and hepatocellular carcinomas Semi-quantitative RT-PCR was applied to evaluate the expression of the hpa gene by human breast carcinoma cell lines exhibiting different degrees of metastasis. A marked increase in hpa gene expression is observed which correlates to metastatic capacity of non-metastatic MCF-7 breast carcinoma, moderately metastatic MDA 231 and highly metastatic MDA 435 breast carcinoma cell lines. Significantly, the differential pattern of the hpa gene expression correlated with the pattern of heparanase activity.
WO 01/00643 PCT/IL00/00358 6 Expression of the hpa gene in human breast carcinoma was demonstrated by in situ hybridization to archival paraffin embedded human breast tissue. Hybridization of the heparanase antisense riboprobe to invasive duct carcinoma tissue sections resulted in a massive positive staining localized specifically to the carcinoma cells. The hpa gene was also expressed in areas adjacent to the carcinoma showing fibrocystic changes. Normal breast tissue derived from reduction mammoplasty failed to express the hpa transcript. High expression of the hpa gene was also observed in tissue sections derived from human hepatocellular carcinoma o0 specimens but not in normal adult liver tissue. Furthermore, tissue specimens derived from adenocarcinoma of the ovary, squamous cell carcinoma of the cervix and colon adenocarcinoma exhibited strong staining with the hpa RNA probe, as compared to a very low staining of the hpa mRNA in the respective non-malignant control tissues A preferential expression of heparanase in human tumors versus the corresponding normal tissues was also noted by immunohistochemical staining of paraffin embedded sections with monoclonal anti-heparanase antibodies. Positive cytoplasmic staining was found in neoplastic cells of the colon carcinoma and in dysplastic epithelial cells of a tubuloyillous adenoma found in the same specimen while there was little or no staining of the normal looking colon epithelium located away from the carcinoma.
Of particular significance was an intense immunostaining of colon adenocarcinoma cells that had metastasized into the liver, as compared to the surrounding normal liver tissue.
Latent and active forms of the heparanase protein The apparent molecular size of the recombinant enzyme produced in the baculovirus expression system was about 65 kDa. This heparanase polypeptide contains 6 potential N-glycosylation sites. Following deglycosylation by treatment with peptide N-glycosidase, the protein appeared as a 57 kDa band. This molecular weight corresponds to the deduced molecular mass (61,192 daltons) of the 543 amino acid polypeptide encoded by the full length hpa cDNA after cleavage of the predicted 3 kDa signal peptide. No further reduction in the apparent size of the N-deglycosylated protein was observed following concurrent Oglycosidase and neuraminidase treatment. Deglycosylation had no detectable effect on enzymatic activity.
Unlike the baculovirus enzyme, expression of the full length heparanase polypeptide in mammalian cells 293 kidney cells, CHO) WO 01/00643 PCT/IL00/00358 7 yielded a major protein of about 50 kDa and a minor about 65 kDa protein in cell lysates. Preferential release of the about 65 kDa form into the culture medium was noted in some of the transfected CHO clones.
Comparison of the enzymatic activity of the two forms, using a semiquantitative gel filtration assay, revealed that the 50 kDa enzyme is about 100-fold more active than the 65 kDa form. A similar difference was observed when the specific activity of the recombinant 65 kDa baculovirus enzyme was compared to that of the 50 kDa heparanase preparations purified from human platelets, SK-hep-1 cells, or placenta. These results 1o suggest that the 50 kDa protein is a mature processed form of a latent heparanase precursor. Amino terminal sequencing of the platelet heparanase indicated that cleavage occurs between amino acids glu 15 7 lys 1 5 8 As indicated by the hydropathic plot of heparanase, this site is located within a hydrophillic peak which is likely to be exposed and hence accessible to proteases.
Involvement of Heparanase in Tumor Cell Invasion and Metastasis Circulating tumor cells arrested in the capillary beds often attach at or near the intercellular junctions between adjacent endothelial cells. Such attachment of the metastatic cells is followed by rupture of the junctions, retraction of the endothelial cell borders and migration through the breach in the endothelium toward the exposed underlying base membrane (BM) Once located between endothelial cells and the BM, the invading cells must degrade the subendothelial glycoproteins and proteoglycans of the BM in order to migrate out of the vascular compartment. Several cellular enzymes collagenase IV, plasminogen activator, cathepsin B, elastase, etc.) are thought to be involved in degradation of BM Among these enzymes is heparanase that cleaves HS at specific intrachain sites (16, 11). Expression of a HS degrading heparanase was found to correlate with the metastatic potential of mouse lymphoma (26), fibrosarcoma and melanoma (21) cells. Moreover, elevated levels of heparanase were detected in sera from metastatic tumor bearing animals and melanoma patients (21) and in tumor biopsies of cancer patients (12).
The inhibitory effect of various non-anticoagulant species of heparin on heparanase was examined in view of their potential use in preventing extravasation of blood-borne cells. Treatment of experimental animals with heparanase inhibitors markedly reduced 90 the incidence of lung metastases induced by B16 melanoma, Lewis lung WO 01/00643 PCT/IL00/00358 8 carcinoma and mammary adenocarcinoma cells (12, 13, 28). Heparin fractions with high and low affinity to anti-thrombin III exhibited a comparable high anti-metastatic activity, indicating that the heparanase inhibiting activity of heparin, rather than its anticoagulant activity, plays a role in the anti-metastatic properties of the polysaccharide (12).
The direct role of heparanase in cancer metastasis was demonstrated by two experimental systems. The murine T-lymphoma cell line Eb has no detectable heparanase activity. Whether the introduction of the hpa gene into Eb cells would confer a metastatic behavior on these o0 cells was investigated. To this purpose, Eb cells were transfected with a full length human hpa cDNA. Stable transfected cells showed high expression of the heparanase mRNA and enzyme activity. These hpa and mock transfected Eb cells were injected subcutaneously into DBA/2 mice and mice were tested for survival time and liver metastases. All mice (n=20) injected with mock transfected cells survived during the first 4 weeks of the experiment, while 50% mortality was observed in mice inoculated with Eb cells transfected with the hpa cDNA. The liver of mice inoculated with hpa transfected cells was infiltrated with numerous Eb lymphoma cells, as was evident both by macroscopic evaluation of the liver surface and microscopic examination of tissue sections. In contrast, metastatic lesions could not be detected by gross examination of the liver of mice inoculated with mock transfected control Eb cells. Few or no lymphoma cells were found to infiltrate the liver tissue. In a different model of tumor metastasis, transient transfection of the heparanase gene into low metastatic B16-F1 mouse melanoma cells followed by i.v.
inoculation, resulted in a 4- to 5-fold increase in lung metastases.
Finally, heparanase externally adhered to B16-F1 melanoma cells increased the level of lung metastases in C57BL mice as compared to control mice (see U.S. Pat. application No. 09/260,037, entitled INTRODUCING A BIOLOGICAL MATERIAL INTO A PATIENT, which is a continuation in part of U.S. Pat. application No. 09/140,888, and is incorporated herein by reference.
Possible involvement of heparanase in tumor angiogenesis Fibroblast growth factors are a family of structurally related polypeptides characterized by high affinity to heparin They are highly mitogenic for vascular endothelial cells and are among the most potent inducers of neovascularization (29-30). Basic fibroblast growth factor (bFGF) has been extracted from a subendothelial ECM produced in WO 01/00643 PCT/ILO0/00358 9 vitro (31) and from basement membranes of the cornea suggesting that ECM may serve as a reservoir for bFGF. Immunohistochemical staining revealed the localization of bFGF in basement membranes of diverse tissues and blood vessels Despite the ubiquitous presence of bFGF in normal tissues, endothelial cell proliferation in these tissues is usually very low, suggesting that bFGF is somehow sequestered from its site of action. Studies on the interaction of bFGF with ECM revealed that bFGF binds to HSPG in the ECM and can be released in an active form by HS degrading enzymes (33, 32, 34). It was demonstrated that heparanase jo activity expressed by platelets, mast cells, neutrophils, and lymphoma cells is involved in release of active bFGF from ECM and basement membranes suggesting that heparanase activity may not only function in cell migration and invasion, but may also elicit an indirect neovascular response. These results suggest that the ECM HSPG provides a natural storage depot for bFGF and possibly other heparin-binding growth promoting factors (36,37). Displacement of bFGF from its storage within basement membranes and ECM may therefore provide a novel mechanism for induction of neovascularization in normal and pathological situations.
Recent studies indicate that heparin and HS are involved in binding of bFGF to high affinity cell surface receptors and in bFGF cell signaling (38, 39). Moreover, the size of HS required for optimal effect was similar to that of HS fragments released by heparanase Similar results were obtained with vascular endothelial cells growth factor (VEGF) (41), suggesting the operation of a dual receptor mechanism involving HS in cell interaction with heparin-binding growth factors. It is therefore proposed that restriction of endothelial cell growth factors in ECM prevents their systemic action on the vascular endothelium, thus maintaining a very low rate of endothelial cells turnover and vessel growth. On the other hand, release of bFGF from storage in ECM as a complex with HS fragment, may elicit localized endothelial cell proliferation and neovascularization in processes such as wound healing, inflammation and tumor development (36,37).
The involvement of heparanase in other physiological processes and its potential therapeutic applications Apart from its involvement in tumor cell metastasis, inflammation and autoimmunity, mammalian heparanase may be applied to modulate bioavailability of heparin-binding growth factors; cellular responses to heparin-binding growth factors bFGF, VEGF) and cytokines (IL-8) WO 01/00643 PCT/IL00/00358 (44, 41); cell interaction with plasma lipoproteins cellular susceptibility to certain viral and some bacterial and protozoa infections (45-47); and disintegration of amyloid plaques (48).
Viral Infection: The presence of heparan sulfate on cell surfaces have been shown to be the principal requirement for the binding of Herpes Simplex (45) and Dengue (46) viruses to cells and for subsequent infection of the cells. Removal of the cell surface heparan sulfate by heparanase may therefore abolish virus infection. In fact, treatment of cells with bacterial heparitinase (degrading heparan sulfate) or heparinase (degrading o0 heparan) reduced the binding of two related animal herpes viruses to cells and rendered the cells at least partially resistant to virus infection There are some indications that the cell surface heparan sulfate is also involved in HIV infection (47).
Neurodegenerative diseases: Heparan sulfate proteoglycans were identified in the prion protein amyloid plaques of Genstmann-Straussler Syndrome, Creutzfeldt-Jakob disease and Scrape Heparanase may disintegrate these amyloid plaques which are also thought to play a role in the pathogenesis of Alzheimer's disease.
Restenosis and Atherosclerosis: Proliferation of arterial smooth muscle cells (SMCs) in response to endothelial injury and accumulation of cholesterol rich lipoproteins are basic events in the pathogenesis of atherosclerosis and restenosis Apart from its involvement in SMC proliferation as a low affinity receptor for heparin-binding growth factors, HS is also involved in lipoprotein binding, retention and uptake It was demonstrated that HSPG and lipoprotein lipase participate in a novel catabolic pathway that may allow substantial cellular and interstitial accumulation of cholesterol rich lipoproteins The latter pathway is expected to be highly atherogenic by promoting accumulation of apoB and apoE rich lipoproteins LDL, VLDL, chylomicrons), independent of feed back inhibition by the cellular cholesterol content. Removal of SMC HS by heparanase is therefore expected to inhibit both SMC proliferation and lipid accumulation and thus may halt the progression of restenosis and atherosclerosis.
Pulmonary diseases: The data obtained from the literature suggests a possible role for GAGs degrading enzymes, such as, but not limited to, heparanases, connective tissue activating peptide, heparinases, hyluronidases, sulfatases and chondroitinases, in reducing the viscosity of sinuses and airway WO 01/00643 PCT/IL00/00358
II
secretions with associated implications on curtailing the rate of infection and inflammation. The sputum from CF patients contains at least 3 GAGs, thus contributing to its volume and viscous properties.
Recombinant heparanase has been shown to reduce viscosity of sputum of CF patients (see, U.S. Pat. application No. 09/046,475).
In summary, heparanase may thus prove useful for conditions such as wound healing, angiogenesis, restenosis, atherosclerosis, inflammation, neurodegenerative diseases and viral infections. Mammalian heparanase can be used to neutralize plasma heparin, as a potential replacement of protamine. Anti-heparanase antibodies may be applied for immunodetection and diagnosis of micrometastases, autoimmune lesions and renal failure in biopsy specimens, plasma samples, and body fluids.
There is thus a widely recognized need for, and it would be highly advantageous to have, additional molecules with glycosyl hydrolase activity, because such molecules may exhibit greater specific activity toward certain substrates or different substrate specificity than the known heparanase.
SUMMARY OF THE INVENTION According to one aspect of the present invention there is provided an isolated nucleic acid comprising a polynucleotide hybridizable with SEQ ID NOs:1, 4, 6 or portions thereof at 68 °C in 6 x SSC, 1 SDS, 5 x Denharts, 10 dextran sulfate, 100 pg/ml salmon sperm DNA, and 3 2 p labeled probe and wash at 68 °C with 3 x SSC and 0.1 SDS.
According to another aspect of the present invention there is provided an isolated nucleic acid comprising a polynucleotide hybridizable with SEQ ID NOs:l, 4, 6 or portions thereof at 68 °C in 6 x SSC, 1 SDS, 5 x Denharts, 10 dextran sulfate, 100 pg/ml salmon sperm DNA, and 3 2 p labeled probe and wash at 68 °C with 1 x SSC and 0.1 SDS.
According to still another aspect of the present invention there is provided an isolated nucleic acid comprising a polynucleotide hybridizable with SEQ ID NOs:1, 4, 6 or portions thereof at 68 °C in 6 x SSC, 1 SDS, 5 x Denharts, 10 dextran sulfate, 100 pg/ml salmon sperm DNA, and 3 2 p labeled probe and wash at 68 °C with 0.1 x SSC and 0.1 SDS.
According to yet another aspect of the present invention there is provided an isolated nucleic acid comprising a polynucleotide at least identical with SEQ ID NOs: 4, 6 or portions thereof as determined using the Bestfit procedure of the DNA sequence analysis software WO 01/00643 PCT/IL00/00358 12 package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 50, gap extension penalty 3).
According to still another aspect of the present invention there is provided an isolated nucleic acid comprising a polynucleotide encoding a polypeptide being at least 60 homologous with SEQ ID NOs:3, 5, 7 or portions thereof as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty o1 gap extension penalty 3).
According to further features in preferred embodiments of the invention described below, the polynucleotide is as set forth in SEQ ID NOs: 1, 4, 6 or portions thereof.
According to an additional aspect of the present invention there is provided a recombinant protein comprising a polypeptide encoded by the polynucleotides herein described.
According to yet an additional aspect of the present invention there is provided a recombinant protein comprising a polypeptide at least 60 homologous with SEQ ID NOs:3, 5, 7 or portions thereof as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 50, gap extension penalty 3).
According to further features in preferred embodiments of the invention described below, the polypeptide is as set fourth in SEQ ID NOs:3, 5, 7 or portions thereof.
According to still an additional aspect of the present invention there is provided a nucleic acid construct comprising the isolated nucleic acid herein described.
According to a further aspect of the present invention there is provided a nucleic acid construct comprising a polynucleotide encoding the recombinant protein herein described.
According to still a further aspect of the present invention there is provided a host cell comprising a polynucleotide or construct and/or expressing a recombinant protein as herein described.
According to yet a further aspect of the present invention there is provided an antisense oligonucleotide or nucleic acid construct comprising a polynucleotide or a polynucleotide analog of at least 10 bases being WO 01/00643 PCT/ILOO/00358 13 hybridizable in vivo, under physiological conditions, with a portion of a polynucleotide strand encoding a polypeptide at least 60 homologous with SEQ ID NOs:3, 5, 7 or portions thereof as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 50, gap extension penalty or (ii) a portion of a polynucleotide strand at least 60 identical with SEQ ID NOs:1, 4, 6 or portions thereof as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 50, gap extension penalty 3).
According to another aspect of the present invention there is provided a ribozyme comprising the antisense oligonucleotide herein described and a ribozyme sequence.
The present invention provides polynucleotides and polypeptides belonging to a class of asp-glu glycosyl hydrolases of the GH-A clan, probably, based on homology to heparanase, GAG degrading enzymes.
BRIEF DESCRIPTION OF THE DRAWINGS The invention is herein described, by way of example only, with reference to the accompanying drawings, wherein: FIG. I shows the nucleotide sequence (SEQ ID NOs:1-2) and the deduced amino acid sequence (SEQ ID NOs:2-3) of hnhpl; FIG. 2 is a comparison of the deduced amino acid sequences of hnhpl (SEQ ID NOs:2-3) and of heparanase (SEQ ID NO:9). Comparison was performed using the Gap program of the GCG package (gap creation penalty 50, gap extension penalty 3); FIG. 3 illustrates variability of hnhpl transcripts. Hnhpl was amplified from placenta and from testis marathon ready cDNA libraries, using the gene specific primers pn9-312u (SEQ ID NO:14) and hnll-230 (SEQ ID NO: 11); FIG. 4 shows a zoo blot. Ten micrograms of genomic DNA from various species were digested with EcoRI and separated on 0.7 agarose TBE gel. Following electrophoresis, the gel was treated with HCI and then with NaOH and the DNA fragments were downward transferred to a nylon membrane (Hybond Amersham) with 0.4 N NaOH. The membrane was hybridized with a 1.7 Kb DNA probe that contained the hnhpl cDNA (clone pn9). Lane order: H Human; M Mouse; Rt Rat; P WO 01/00643 PCT/IL00/00358 14 Pig; Cw Cow; Hr Horse; S Sheep; Rb Rabbit; D Dog; Ch Chicken; F Fish. Size markers (Lambda Bstel) are shown on the left; FIG. 5 illustrates cross hybridization between hpa and hnhpl. Hpa was amplified by PCR from marathon ready placenta cDNA library.
Hnhpl was amplified from testis marathon ready cDNA library. PCR products were run on agarose gel in duplicates and transferred to a nylon membrane. One membrane was probed with 3 2 p labeled hpa cDNA and the other with hnhpl, clone pn9.
FIG. 6 is a comparison of the hydropathic profiles of heparanase o0 and hnhpl. The curves were calculated according to the Kyte and Dulittle method over a window of 17 amino acids.
FIG. 7 shows a Western blot analysis of recombinant hnhpl expressed in human embryonal kidney 293 cells. A control heparanase- FLAG precursor, B-D 293 cells trasfected with a control pSI vector pSI-pn6 and pSI-pn9 Cell extracts were separated by SDS- PAGE, transferred onto Immobilon-P nylon membrane (Millipore).
Membrane was incubated with anti-FLAG Flag antibody 1:1000 (Kodak anti Flag M2 cat: IB13025).
DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention is of novel polynucleotides encoding polypeptides distantly homologous to heparanase, nucleic acid constructs including the polynucleotides, genetically modified cells expressing same, recombinant proteins encoded thereby and which may have heparanase or other glycosyl hydrolase activity, antibodies recognizing the recombinant proteins, oligonucleotides and oligonucleotide analogs derived from the polynucleotides and ribozymes including same.
The principles and operation of the present invention may be better understood with reference to the drawings and accompanying descriptions.
Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of the components set forth in the following description or illustrated in the drawings. The invention is capable of other embodiments or of being practiced or carried out in various ways. Also, it is to be understood that the phraseology and terminology employed herein is for the purpose of description and should not be regarded as limiting.
WO 01/00643 PCT/IL00/00358 While reducing the present invention to practice the human EST database was screened for homologous sequences using the entire amino acid sequence of human heparanase (SEQ ID NO:9). A distantly homologous fragment was pooled out, accession number AI222323, IMAGE clone number 1843155 from Soares_NFL_T_GBC_SI Homo Sapiens cDNA library prepared from testis B-cells and fetal lungs. The clone contained an insert of 560 bp (SEQ ID NO:23) of which the 3' region was homologous to the human hpa gene encoding human heparanase. Primers derived from the newly identified clone were used to o0 isolate several cDNAs including several open reading frames which reflect in frame alternative splicing, the longest of which, pn6, appears in Figure 1 (SEQ ID NOs:1, 2 and 3) is 2060 nucleotide long and it contains an open reading frame of 1776 nucleotides, which encodes a polypeptide of 592 amino acids, with a calculated molecular weight of 66.5 kDa. The newly cloned gene was designated hnhpl. Two shorter forms, pn9 and pn5 and their deduced amino acid sequences are set forth in SEQ ID NOs:4 and 6 and SEQ ID NO:5 and 7, respectively, and are further described in the Examples section that follows. Comparison between the amino acid sequence of hnhpl and heparanase is shown in Figure 3. The homology between the two proteins is 52.8 or 55.3 depending on the software employed. No cross hybridzation was detected between hpa and hnhpl, even under very moderate wash conditions (Figure Zoo blot analysis demonstrated that the hnhpl gene and other related genes, perhaps forming a new gene familly, are present in genomes of other organisms including mammals and avians. The chromosome localization of hnhpl was determined using G3 radiation hybrid panel to be on human chromosome next to the marker SHGC-57721. The results also indicated a possibility of a second copy of the gene or of a related gene. The hnhpl gene is expressed in low levels in lymph nodes, spleen, colon and ovary; in slightly higher levels in prostate and small intestine; and in yet more pronouced level in testis. No expression was detected under the assay employed in bone marrow, liver, thymus, tonsil or leukocytes. Screening of the mouse EST database with the amino acid sequence of heparanase as well as of hnhpl pooled out a mouse EST clone (clone 1378452 accession number AI019269 from mouse thymus, SEQ ID NO:8). However, this clone includes two frame shift mutations which hamper its open reading frame.
WO 01/00643 PCT/IL00/00358 16 The overall homology between the amino acid sequence of hnhpl and heparanase suggest that these two proteins share similar function. The homology between the two proteins is concentrated at several regions.
These may represent functional domains of the protein. The variability may suggest potential difference in substrate recognition, cellular localization and parameters of activity.
Despite the lack of an overall homology between the heparanase and other glycosyl hydrolases, the amino acid couple asp-glu (NE, SEQ ID NO: 13), which is characteristic of the proton donor of glycosyl hydrolyses o0 of the GH-A clan, was found at positions 224, 225 of heparanase. As in other clan members, this NE couple is located at the end of a 1 strand. As shown in Figure 2, the region surrounding the NE couple is conserved in the predicted amino acid sequence of hnhpl. This suggests that hnhpl product is a glycosyl hydrolase. This definition may include any polysaccharide degrading enzyme, either exo or endo glycosidase and based on the similarity to heparanase it is likely that it encodes a GAG degrading enzyme.
In addition, superimposition of the hydropathic profiles of heparanase and hnhpl (Figure 6) indicates an overlapping pattern along the proteins. The amino acid sequence characteristic of glycosyl hydrolases is located within a hydrophilic peak and at the same position in the aligned proteins. A remarkable difference in the hydropathic pattern is noticed around amino acids 157, 158 of heparanase, which constitute the processing site of the enzyme. While in heparanase, this site is located at the tip of a hydrophilic peak, the equivalent region of hnhpl is rather not hydrophilic. The peak around amino acid 110 of heparanase appears also, around amino acid 130 of hnhpl. Cleavage of heparanase at this region was shown to result in enzyme activation. The equivalent region of hnhpl might be a potential processing site.
Heparanase has a potential signal peptide at the N-terminus of the 67 kDa form. The homology between the two proteins is low at the Ntermini and no signal peptide was identified in hnhpl polypeptide.
According to one aspect of the present invention there is provided an isolated nucleic acid comprising a polynucleotide hybridizable with SEQ ID NOs:1, 4, 6 or portions thereof at 68 °C in 6 x SSC, 1 SDS, 5 x Denharts, 10 dextran sulfate, 100 pg/ml salmon sperm DNA, and 3 2 p labeled probe and wash at 68 °C with 3 x SSC, 1 x SSC or 0.1 x SSC and 0.1 SDS.
WO 01/00643 PCT/IL00/00358 17 As used herein in the specification and in the claims section that follows, the term "portion" or "portions" refer to a consequtive stretch of nucleic or amino acids. Such a portion may include, for example, at least nucleotides (equivalent to at least 30 amino acids), at least 120 nucleotides (equivalent to at least 40 amino acids), at least 150 nucleotides (equivalent to at least 50 amino acids), at least 180 nucleotides (equivalent to at least 60 amino acids), at least 210 nucleotides (equivalent to at least amino acids), at least 300 nucleotides (equivalent to at least 100 amino acids), at least 600 nucleotides (equivalent to at least 200 amino acids), at least 900 nucleotides (equivalent to at least 300 amino acids), at least 1,200 nucleotides (equivalent to at least 400 amino acids), at least 1,500 nucleotides (equivalent to at least 500 amino acids), or more.
According to another aspect of the present invention there is provided an isolated nucleic acid comprising a polynucleotide at least preferably at least 65 more preferably at least 70 still preferably at least 75 yet preferably at least 80 more preferably at least 85 more preferably at least 90 most preferably at least 95 100 identical with SEQ ID NOs: 1, 4, 6 or portions thereof as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 50, gap extension penalty 3).
According to still another aspect of the present invention there is provided an isolated nucleic acid comprising a polynucleotide encoding a polypeptide being at least 60 preferably at least 65 more preferably at least 70 still preferably at least 75 yet preferably at least 80 more preferably at least 85 more preferably at least 90 most preferably at least 95 100 homologous with SEQ ID NOs:3, 5, 7 or portions thereof as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty gap extension penalty 3).
As used herein in the specification and in the claims section that follows, the term "homologous" refers to identical similar.
According to an additional aspect of the present invention there is provided a recombinant protein comprising a polypeptide encoded by the polynucleotides herein described.
WO 01/00643 PCT/ILOO/00358 18 The necleic acid according to the present invention can be a complementary polynucleotide sequence, genomic polynucleotide sequence or a composite polynucleotide sequence.
As used herein the phrase "complementary polynucleotide sequence" includes sequences which originally result from reverse transcription of messenger RNA using a reverse transcriptase or any other RNA dependent DNA polymerase. Such sequences can be subsequently amplified in vivo or in vitro using a DNA dependent DNA polymerase.
As used herein the phrase "genomic polynucleotide sequence" includes sequences which originally derive from a chromosome and reflect a contiguous portion of a chromosome.
As used herein the phrase "composite polynucleotide sequence" includes sequences which are at least partially complementary and at least partially genomic. A composite sequence can include some exonal sequences required to encode a polypeptide, as well as some intronic sequences interposing therebetween. The intronic sequences can be of any source, including of other genes, and typically will include conserved splicing signal sequences. Such intronic sequences may further include cis acting expression regulatory elements.
Thus, this aspect of the present invention encompasses (i) polynucleotides as set forth in SEQ ID NOs:1, 4 and 6; (ii) fragments or portions thereof; (iii) sequences hybridizable therewith; (iv) sequences homologous thereto; genomic and composite sequences coresponding thereto; (vi) sequences encoding similar polypeptides with different codon usage; and (vii) altered sequences characterized by mutations, such as deletion, insertion or substitution of one or more nucleotides, either naturally occurring or man induced, either randomly or in a targeted fashion.
According to yet an additional aspect of the present invention there is provided a recombinant protein comprising a polypeptide at least 60 preferably at least 65 more preferably at least 70 still preferably at least 75 yet preferably at least 80 more preferably at least 85 more preferably at least 90 most preferably at least 95 100 homologous with SEQ ID NOs:3, 5, 7 or portions thereof, as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 50, gap extension penalty 3).
WO 01/00643 PCT/IL00/00358 19 According to still an additional aspect of the present invention there is provided a nucleic acid construct comprising the isolated nucleic acid herein described.
According to a preferred embodiment of the present invention the nucleic acid construct further comprising a promoter for regulating the expression of the isolated nucleic acid in a sense or antisense orientation.
Such promoters are known to be cis-acting sequence elements required for transcription as they serve to bind DNA dependent RNA polymerase which transcribes sequences present downstream thereof. Such down stream sequences can be in either one of two possible orientations to result in the transcription of sense RNA which is translatable by the ribozyme machinery or antisense RNA which typically does not contain translatable sequences, yet can duplex or triplex with endogenous sequences, either mRNA or chromosomal DNA and hamper gene expression, all as further detailed hereinunder.
While the isolated nucleic acid described herein is an essential element of the invention, it is modular and can be used in different contexts. The promoter of choice that is used in conjunction with this invention is of secondary importance, and will comprise any suitable promoter. It will be appreciated by one skilled in the art, however, that it is necessary to make sure that the transcription start site(s) will be located upstream of an open reading frame. In a preferred embodiment of the present invention, the promoter that is selected comprises an element that is active in the particular host cells of interest. These elements may be selected from transcriptional regulators that activate the transcription of genes essential for the survival of these cells in conditions of stress or starvation, including, but not limited to, the heat shock proteins.
A construct according to the present invention preferably further includes an appropriate selectable marker. In a more preferred 3o embodiment according to the present invention the construct further includes an origin of replication. In another most preferred embodiment according to the present invention the construct is a shuttle vector, which can propagate both in E. coli (wherein the construct comprises an appropriate selectable marker and origin of replication) and be compatible for propagation in cells, or integration in the genome, of an organism of choice. The construct according to this aspect of the present invention can be, for example, a plasmid, a bacmid, a phagemid, a cosmid, a phage, a virus or an artificial chromosome.
WO 01/00643 PCT/IL00/00358 Alternatively, the nucleic acid construct according to this aspect of the present invention further includes a positive and a negative selection markers and may therefore be employed for selecting for homologous recombination events, including, but not limited to, homologous recombination employed in knock-in and knock-out procedures. One ordinarily skilled in the art can readily design a knock-out or knock-in constructs including both positive and negative selection genes for efficiently selecting transfected embryonic stem cells that underwent a homologous recombination event with the construct. Such cells can be 0o introduced into developing embryos to generate chimeras, the offspring thereof can be tested for carrying the knock-out or knock-in constructs.
Knock-out and/or knock-in constructs according to the present invention can be used to further investigate the functionality of the new gene. Such constructs can also be used in somatic and/or germ cells gene therapy to destroy activity of a defective, gain of function allele or to replace the lack of activity of a silent allele in an organism, thereby to down or upregulate activity, as required. Further detail relating to the construction and use of knock-out and knock-in constructs can be found in Fukushige, S. and Ikeda, Trapping of mammalian promoters by Cre-lox site-specific recombination. DNA Res 3 (1996) 73-80; Bedell, Jenkins, N.A. and Copeland, Mouse models of human disease. Part I: Techniques and resources for genetic analysis in mice. Genes and Development 11 (1997) 1-11; Bermingham, Scherer, O'Connell, Arroyo, Kalla, Powell, F.L. and Rosenfeld, Tst-l/Oct-6/SCIP regulates a unique step in peripheral myelination and is required for normal respiration. Genes Dev 10 (1996) 1751-62, which are incorporated herein by reference.
According to yet another aspect of the present invention there is provided a host cell or animal comprising a nucleic acid construct or a portion thereof as described herein. Methods of transforming host cells, both prokaryotes and eukaryotes, and organisms with nucleic acid constructs and selection of transformants transformed cells or transgenic animals) are well known to those of skills in the art. In addition, once transfected, such cells and organisms can be designed to direct the production of ample amounts of a recombinant protein which can then be purfied by known methods, including, but not limited to, various chromatography and gel electrophoresis methods. Such a purified recombinant protein can serve for elicitation of antibodies as further WO 01/00643 PCT/ILOO/00358 21 detailed hereinunder. Methods of transformation of cells and organism are described in detail in reference 43, whereas methods of recombinant protein purification are described in detail in reference 52, both are incorporated herein by reference.
According to still another aspect of the present invention there is provided an oligonucleotide of at least 17, at least 18, at least 19, at least at least 22, at least 25, at least 30 or at least 40, bases specifically hybridizable with the isolated nucleic acid described herein.
Hybridization of shorter nucleic acids (below 200 bp in length, e.g.
17-40 bp in length) is effected by stringent, moderate or mild hybridization, wherein stringent hybridization is effected by a hybridization solution of 6 x SSC and 1 SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 1 mM EDTA (pH 0.5 SDS, 100 pg/ml denatured salmon sperm DNA and 0.1 nonfat dried milk, hybridization temperature of 1 1.5 °C below the Tm, final wash solution of 3 M TMACI, 0.01 M sodium phosphate (pH 1 mM EDTA (pH 0.5 SDS at 1 1.5 OC below the Tm; moderate hybridization is effected by a hybridization solution of 6 x SSC and 0.1 SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 1 mM EDTA (pH 0.5 SDS, 100 pg/ml denatured salmon sperm DNA and 0.1 nonfat dried milk, hybridization temperature of 2 2.5 °C below the Tm, final wash solution of 3 M TMACI, 0.01 M sodium phosphate (pH 1 mM EDTA (pH 0.5 SDS at 1 1.5 °C below the Tm, final wash solution of 6 x SSC, and final wash at 22 whereas mild hybridization is effected by a hybridization solution of 6 x SSC and I SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 1 mM EDTA (pH 0.5 SDS, 100 pg/ml denatured salmon sperm DNA and 0.1 nonfat dried milk, hybridization temperature of 37 final wash solution of6 x SSC and final wash at 22 oC.
According to an additional aspect of the present invention there is provided a pair of oligonucleotides each independently of at least 17, at least 18, at least 19, at least 20, at least 22, at least 25, at least 30 or at least 40 bases specifically hybridizable with the isolated nucleic acid described herein in an opposite orientation so as to direct exponential amplification of a portion thereof in a nucleic acid amplification reaction, such as a polymerase chain reaction. The polymerase chain reaction and other nucleic acid amplification reactions are well known in the art and require no further description herein. The pair of oligonucleotides WO 01/00643 PCT/IL00/00358 22 according to this aspect of the present invention are preferably selected to have compatible melting temperatures melting temperatures which differ by less than that 7 preferably less than 5 more preferably less than 4 most preferably less than 3 ideally between 3 C and zero Consequently, according to yet an additional aspect of the present invention there is provided a nucleic acid amplification product obtained using the pair of primers described herein. Such a nucleic acid amplification product can be isolated by gel electrophoresis or any other size based separation technique. Alternatively, such a nucleic acid l0 amplification product can be isolated by affinity separation, either strandness affinity or sequence affinity. In addition, once isolated, such a product can be further genetically manipulated by restriction, ligation and the like, to serve any one of a plurality of applications associated with up and/or down regulation of activity.
is According to still an additional aspect of the present invention there is provided an antisense oligonucleotide comprising a polynucleotide or a polynucleotide analog of at least 10 bases, preferably between 10 and more preferably between 50 and 20 bases, most preferably, at least 17, at least 18, at least 19, at least 20, at least 22, at least 25, at least 30 or at least 40 bases being hybridizable in vivo, under physiological conditions, with a portion of a polynucleotide strand encoding a polypeptide at least preferably at least 65 more preferably at least 70 still preferably at least 75 yet preferably at least 80 more preferably at least 85 more preferably at least 90 most preferably at least 95 100 homologous to SEQ ID NOs:3, 5, 7 or portions thereof as determined using the as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 50, gap extension penalty or (ii) a portion of a polynucleotide strand at least 60 preferably at least 65 more preferably at least 70 still preferably at least 75 yet preferably at least 80 more preferably at least 85 more preferably at least 90 most preferably at least 95 100 identical with SEQ ID NOs:l, 4, 6 or portions thereof as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 12, gap extension penalty 4).
WO 01/00643 PCT/ILOO/00358 23 Such antisense oligonucleotides can be used to downregulate gene expression as further detailed hereinunder. Such an antisense oligonucleotide is readily synthesizable using solid phase oligonucleotide synthesis.
The ability of chemically synthesizing oligonucleotides and analogs thereof having a selected predetermined sequence offers means for down modulating gene expression. Three types of gene expression modulation strategies may be considered.
At the transcription level, antisense or sense oligonucleotides or to analogs that bind to the genomic DNA by strand displacement or the formation of a triple helix, may prevent transcription. At the transcript level, antisense oligonucleotides or analogs that bind target mRNA molecules lead to the enzymatic cleavage of the hybrid by intracellular RNase H. In this case, by hybridizing to the targeted mRNA, the oligonucleotides or oligonucleotide analogs provide a duplex hybrid recognized and destroyed by the RNase H enzyme. Alternatively, such hybrid formation may lead to interference with correct splicing. As a result, in both cases, the number of the target mRNA intact transcripts ready for translation is reduced or eliminated. At the translation level, antisense oligonucleotides or analogs that bind target mRNA molecules prevent, by steric hindrance, binding of essential translation factors (ribosomes), to the target mRNA, a phenomenon known in the art as hybridization arrest, disabling the translation of such mRNAs.
Thus, antisense sequences, which as described hereinabove may arrest the expression of any endogenous and/or exogenous gene depending on their specific sequence, attracted much attention by scientists and pharmacologists who were devoted at developing the antisense approach into a new pharmacological tool.
For example, several antisense oligonucleotides have been shown to arrest hematopoietic cell proliferation, growth, entry into the S phase of the cell cycle, reduced survival and prevent receptor mediated responses.
For efficient in vivo inhibition of gene expression using antisense oligonucleotides or analogs, the oligonucleotides or analogs must fulfill the following requirements sufficient specificity in binding to the target sequence; (ii) solubility in water; (iii) stability against intra- and extracellular nucleases; (iv) capability of penetration through the cell membrane; and when used to treat an organism, low toxicity.
WO 01/00643 PCT/ILOO/00358 24 Unmodified oligonucleotides are typically impractical for use as antisense sequences since they have short in vivo half-lives, during which they are degraded rapidly by nucleases. Furthermore, they are difficult to prepare in more than milligram quantities. In addition, such oligonucleotides are poor cell membrane penetraters.
Thus it is apparent that in order to meet all the above listed requirements, oligonucleotide analogs need to be devised in a suitable manner. Therefore, an extensive search for modified oligonucleotides has been initiated.
For example, problems arising in connection with double-stranded DNA (dsDNA) recognition through triple helix formation have been diminished by a clever "switch back" chemical linking, whereby a sequence of polypurine on one strand is recognized, and by "switching back", a homopurine sequence on the other strand can be recognized.
Also, good helix formation has been obtained by using artificial bases, thereby improving binding conditions with regard to ionic strength and pH.
In addition, in order to improve half-life as well as membrane penetration, a large number of variations in polynucleotide backbones have been done, nevertheless with little success.
Oligonucleotides can be modified either in the base, the sugar or the phosphate moiety. These modifications include, for example, the use of methylphosphonates, monothiophosphates, dithiophosphates, phosphoramidates, phosphate esters, bridged phosphorothioates, bridged phosphoramidates, bridged methylenephosphonates, dephospho internucleotide analogs with siloxane bridges, carbonate bridges, carboxymethyl ester bridges, carbonate bridges, carboxymethyl ester bridges, acetamide bridges, carbamate bridges, thioether bridges, sulfoxy bridges, sulfono bridges, various "plastic" DNAs, a-anomeric bridges and borane derivatives.
International patent application WO 89/12060 discloses various building blocks for synthesizing oligonucleotide analogs, as well as oligonucleotide analogs formed by joining such building blocks in a defined sequence. The building blocks may be either "rigid" containing a ring structure) or "flexible" lacking a ring structure). In both cases, the building blocks contain a hydroxy group and a mercapto group, through which the building blocks are said to join to form oligonucleotide analogs. The linking moiety in the oligonucleotide WO 01/00643 PCT/ILOO/00358 analogs is selected from the group consisting of sulfide sulfoxide and sulfone (-SO 2 International patent application WO 92/20702 describe an acyclic oligonucleotide which includes a peptide backbone on which any selected chemical nucleobases or analogs are stringed and serve as coding characters as they do in natural DNA or RNA. These new compounds, known as peptide nucleic acids (PNAs), are not only more stable in cells than their natural counterparts, but also bind natural DNA and RNA 50 to 100 times more tightly than the natural nucleic acids cling to each other.
o0 PNA oligomers can be synthesized from the four protected monomers containing thymine, cytosine, adenine and guanine by Merrifield solidphase peptide synthesis. In order to increase solubility in water and to prevent aggregation, a lysine amide group is placed at the C-terminal region and may be pegylated.
Thus, antisense technology requires pairing of messenger RNA with an oligonucleotide to form a double helix that inhibits translation.
The concept of antisense-mediated gene therapy was already introduced in 1978 for cancer therapy. This approach was based on certain genes that are crucial in cell division and growth of cancer cells. Synthetic fragments of genetic substance DNA can achieve this goal. Such molecules bind to the targeted gene molecules in RNA of tumor cells, thereby inhibiting the translation of the genes and resulting in dysfunctional growth of these cells. Other mechanisms has also been proposed. These strategies have been used, with some success in treatment of cancers, as well as other illnesses, including viral and other infectious diseases. Antisense oligonucleotides are typically synthesized in lengths of 13-30 nucleotides.
The life span of oligonucleotide molecules in blood is rather short. Thus, they have to be chemically modified to prevent destruction by ubiquitous nucleases present in the body. Phosphorothioates are very widely used modification in antisense oligonucleotide ongoing clinical trials. A new generation of antisense molecules consist of hybrid antisense oligonucleotide with a central portion of synthetic DNA while four bases on each end have been modified with 2'O-methyl ribose to resemble RNA.
In preclinical studies in laboratory animals, such compounds have demonstrated greater stability to metabolism in body tissues and an improved safety profile when compared with the first-generation unmodified phosphorothioate. Dosens of other nucleotide analogs have also been tested in antisense technology.
WO 01/00643 PCT/IL00/00358 26 RNA oligonucleotides may also be used for antisense inhibition as they form a stable RNA-RNA duplex with the target, suggesting efficient inhibition. However, due to their low stability RNA oligonucleotides are typically expressed inside the cells using vectors designed for this purpose.
This approach is favored when attempting to target a mRNA that encodes an abundant and long-lived protein.
Recent scientific publications have validated the efficacy of antisense compounds in animal models of hepatitis, cancers, coronary artery restenosis and other diseases. The first antisense drug was recently approved by the FDA. This drug Fomivirsen, developed by Isis, is indicated for local treatment of cytomegalovirus in patients with AIDS who are intolerant of or have a contraindication to other treatments for CMV retinitis or who were insufficiently responsive to previous treatments for CMV retinitis (Pharmacotherapy News Network).
Several antisense compounds are now in clinical trials in the United States. These include locally administered antivirals, systemic cancer therapeutics. Antisense therapeutics has the potential to treat many lifethreatening diseases with a number of advantages over traditional drugs.
Traditional drugs intervene after a disease-causing protein is formed.
Antisense therapeutics, however, block mRNA transcription/translation and intervene before a protein is formed, and since antisense therapeutics target only one specific mRNA, they should be more effective with fewer side effects than current protein-inhibiting therapy.
A second option for disrupting gene expression at the level of transcription uses synthetic oligonucleotides capable of hybridizing with double stranded DNA. A triple helix is formed. Such oligonucleotides may prevent binding of transcription factors to the gene's promoter and therefore inhibit transcription. Alternatively, they may prevent duplex unwinding and, therefore, transcription of genes within the triple helical structure.
Thus, according to a further aspect of the present invention there is provided a pharmaceutical composition comprising the antisense oligonucleotide described herein and a pharmaceutically acceptable carrier. The pharmaceutically acceptable carrier can be, for example, a liposome loaded with the antisense oligonucleotide. Formulations for topical administration may include, but are not limited to, lotions, ointments, gels, creams, suppositories, drops, liquids, sprays and powders.
Conventional pharmaceutical carriers, aqueous, powder or oily bases, WO 01/00643 PCT/IL00/00358 27 thickeners and the like may be necessary or desirable. Compositions for oral administration include powders or granules, suspensions or solutions in water or non-aqueous media, sachets, capsules or tablets. Thickeners, diluents, flavorings, dispersing aids, emulsifiers or binders may be desirable. Formulations for parenteral administration may include, but are not limited to, sterile aqueous solutions which may also contain buffers, diluents and other suitable additives.
According to still a further aspect of the present invention there is provided a ribozyme comprising the antisense oligonucleotide described o0 herein and a ribozyme sequence fused thereto. Such a ribozyme is readily synthesizable using solid phase oligonucleotide synthesis.
Ribozymes are being increasingly used for the sequence-specific inhibition of gene expression by the cleavage of mRNAs encoding proteins of interest. The possibility of designing ribozymes to cleave any specific target RNA has rendered them valuable tools in both basic research and therapeutic applications. In the therapeutics area, ribozymes have been exploited to target viral RNAs in infectious diseases, dominant oncogenes in cancers and specific somatic mutations in genetic disorders.
Most notably, several ribozyme gene therapy protocols for HIV patients are already in Phase 1 trials. More recently, ribozymes have been used for transgenic animal research, gene target validation and pathway elucidation.
Several ribozymes are in various stages of clinical trials. ANGIOZYME was the first chemically synthesized ribozyme to be studied in human clinical trials. ANGIOZYME specifically inhibits formation of the VEGFr (Vascular Endothelial Growth Factor receptor), a key component in the angiogenesis pathway. Ribozyme Pharmaceuticals, Inc., as well as other firms have demonstrated the importance of anti-angiogenesis therapeutics in animal models. HEPTAZYME, a ribozyme designed to selectively destroy Hepatitis C Virus (HCV) RNA, was found effective in decreasing Hepatitis C viral RNA in cell culture assays (Ribozyme Pharmaceuticals, Incorporated WEB home page).
According to still another aspect of the present invention there is provided an antibody comprising an immunoglobulin specifically recognizing and binding a polypeptide at least 60 preferably at least more preferably at least 70 still preferably at least 75 yet preferably at least 80 more preferably at least 85 more preferably at least 90 most preferably at least 95 100 homologous (identical similar) to SEQ ID NOs:3, 5, 7 or portions thereof using as determined WO 01/00643 PCT/IL00/00358 28 using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin (gap creation penalty 50, gap extension penalty According to a preferred embodiment of this aspect of the present invention the antibody specifically recognizing and binding the polypeptides set forth in SEQ ID NOs:3, 5, 7 or portions thereof.
The present invention can utilize serum immunoglobulins, polyclonal antibodies or fragments thereof, immunoreactive derivative of an antibody), or monoclonal antibodies or fragments thereof.
Monoclonal antibodies or purified fragments of the monoclonal antibodies having at least a portion of an antigen binding region, including such as Fv, F(abl)2, Fab fragments (Harlow and Lane, 1988 Antibody, Cold Spring Harbor), single chain antibodies Patent 4,946,778), chimeric or humanized antibodies and complementarily determining regions (CDR) may be prepared by conventional procedures. Purification of these serum immunoglobulins antibodies or fragments can be accomplished by a variety of methods known to those of skill including, precipitation by ammonium sulfate or sodium sulfate followed by dialysis against saline, ion exchange chromatography, affinity or immunoaffinity chromatography as well as gel filtration, zone electrophoresis, etc. (see Goding in, Monoclonal Antibodies: Principles and Practice, 2nd ed., pp. 104-126, 1986, Orlando, Fla., Academic Press). Under normal physiological conditions antibodies are found in plasma and other body fluids and in the membrane of certain cells and are produced by lymphocytes of the type denoted B cells or their functional equivalent. Antibodies of the IgG class are made up of four polypeptide chains linked together by disulfide bonds.
The four chains of intact IgG molecules are two identical heavy chains referred to as H-chains and two identical light chains referred to as Lchains. Additional classes includes IgD, IgE, IgA, IgM and related proteins.
Methods for the generation and selection of monoclonal antibodies are well known in the art, as summarized for example in reviews such as Tramontano and Schloeder, Methods in Enzymology 178, 551-568, 1989.
A recombinant protein of the present invention may be used to generate antibodies in vitro. More preferably, the recombinant protein of the present invention is used to elicit antibodies in vivo. In general, a suitable host animal is immunized with the recombinant protein of the present invention. Advantageously, the animal host used is a mouse of an inbred WO 01/00643 PCT/IL00/00358 29 strain. Animals are typically immunized with a mixture comprising a solution of the recombinant protein of the present invention in a physiologically acceptable vehicle, and any suitable adjuvant, which achieves an enhanced immune response to the immunogen. By way of example, the primary immunization conveniently may be accomplished with a mixture of a solution of the recombinant protein of the present invention and Freund's complete adjuvant, said mixture being prepared in the form of a water in oil emulsion. Typically the immunization may be administered to the animals intramuscularly, intradermally, l0 subcutaneously, intraperitoneally, into the footpads, or by any appropriate route of administration. The immunization schedule of the immunogen may be adapted as required, but customarily involves several subsequent or secondary immunizations using a milder adjuvant such as Freund's incomplete adjuvant. Antibody titers and specificity of binding to the recombinant protein can be determined during the immunization schedule by any convenient method including by way of example radioimmunoassay, or enzyme linked immunosorbant assay, which is known as the ELISA assay. When suitable antibody titers are achieved, antibody producing lymphocytes from the immunized animals are obtained, and these are cultured, selected and cloned, as is known in the art. Typically, lymphocytes may be obtained in large numbers from the spleens of immunized animals, but they may also be retrieved from the circulation, the lymph nodes or other lymphoid organs. Lymphocytes are then fused with any suitable myeloma cell line, to yield hybridomas, as is well known in the art. Alternatively, lymphocytes may also be stimulated to grow in culture, and may be immortalized by methods known in the art including the exposure of these lymphocytes to a virus, a chemical or a nucleic acid such as an oncogene, according to established protocols.
After fusion, the hybridomas are cultured under suitable culture conditions, for example in multiwell plates, and the culture supernatants are screened to identify cultures containing antibodies that recognize the hapten of choice. Hybridomas that secrete antibodies that recognize the recombinant protein of the present invention are cloned by limiting dilution and expanded, under appropriate culture conditions. Monoclonal antibodies are purified and characterized in terms of immunoglobulin type and binding affinity.
WO 01/00643 PCT/IL00/00358 Additional objects, advantages, and novel features of the present invention will become apparent to one ordinarily skilled in the art upon examination of the following examples, which are not intended to be limiting. Additionally, each of the various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below finds experimental support in the following examples.
EXAMPLES
Reference is now made to the following examples, which together with the above descriptions, illustrate the invention in a non limiting fashion.
Generally, the nomenclature used herein and the laboratory procedures in recombinant DNA technology described below are those well known and commonly employed in the art. Standard techniques are used for cloning, DNA and RNA isolation, amplification and purification.
Generally enzymatic reactions involving DNA ligase, DNA polymerase, restriction endonucleases and the like are performed according to the manufacturers' specifications. These techniques and various other techniques are generally performed according to Sambrook et al., molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989), which is incorporated herein by reference. Other general references are provided throughout this document. The procedures therein are believed to be well known in the art and are provided for the convenience of the reader. All the information contained therein is incorporated herein by reference.
Materials and Experimental Methods The following protocols and experimental details are referenced in the Examples that follow: Primers list: hnlll 16 5'-GGAGAGCAAGTCTGTGTTGATTC-3' hn 11230 5'-CACTGGTAGCCATGAGTGTGAG-3' hnlu350 5'-TTGGTCATCCCTCCAGTCACCA-3' pn9-312u 5'-CTTGCCTGTAGACAGAGCTGCAG-3' hpu-685 5'-GAGCAGCCAGGTGAGCCCAAGA-3' hpl967 5'-TCAGATGCAAGCAGCAACTTTGGC-3' mnlul 18 5'-CACCCTGATGTCATGCTGGAG-3' mnll563 5'-CATCTAGGAGAGCAATGACGTTC-3' (SEQ (SEQ ID NO:11) (SEQ ID NO:12) (SEQ ID NO:14) (SEQ ID NO:16) (SEQ ID NO:17) (SEQ ID NO:18) (SEQ ID NO:19) WO 01/00643 PCT/ILO/00358 31 Apl 5'-CCATCCTAATACGACTCACTATAGGGC-3' (SEQ ID Ap2 5'-ACTCACTATAGGGCTCGAGCGGC-3' (SEQ ID NO:21) Southern analysis: Genomic DNA was extracted from animal or from human blood using Blood and cell culture DNA maxi kit (Qiagene). DNA was digested with EcoRI, separated by gel electrophoresis and transferred to a nylon membrane Hybond N+ (Amersham). PCR products underwent a similar procedure. Hybridization was performed at 680 C in 6 x SSC, 1 SDS, x Denharts, 10 dextran sulfate, 100 gg/ml salmon sperm DNA, and 3 2 p o0 labeled probe. Pn9, a 1.7 kb fragment, which contain the entire open reading frame except for a deletion of 162 nucleotides (del:473-634, SEQ ID NO:1) was used as a probe. Following hybridization, the membrane was washed with 3 x SSC, 0.1 SDS, at 68 °C and exposed to X-ray film for 3 days. Membranes were then washed with 0.1 x SSC, 0.1 SDS, at 68 oC and were re-exposed for 4 days.
R T-PCR: RNA was prepared using TRI-Reagent (Molecular research center Inc.) according to the manufacturer instructions. 1.25 gg were taken for reverse transcription reaction using SuperScriptII Reverse transcriptase (Gibco BRL) and Oligo (dT) 15 primer (SEQ ID NO:22), (Promega).
Amplification of the resultant first strand cDNA was performed with Taq polymerase (Promega) or with Expand high fidelity (Boehringer Mannheim).
cDNA Sequence analysis: Sequence determinations were performed with vector specific and gene specific primers, using an automated DNA sequencer (Applied Biosystems, model 373A). Each nucleotide was read from at least two independent primers. Computation and sequence analysis and alignments were done using the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the university of Wisconsin.
Alignments of two sequences were performed using Bestfit (gap creation penalty 12, gap extension penalty 4) or with Gap program (gap creation penalty 50, gap extension penalty 3).
Tissue distribution: Tissue distribution of the hnhpl transcript was determined by semiquantitative PCR. cDNA panels were obtained from Clontech. PCR was performed with the gene specific primers hnlu350 (SEQ ID NO:12) and hnllll6 (SEQ ID NO:10). PCR program was as follows: 94 3 WO 01/00643 PCT/IL00/00358 32 minutes, followed by 40 cycles of 94 45 seconds, 64 1 minute, 72 1 minute. Samples were taken for further analysis following 25, and 40 cycles.
Chromosome localization: Chromosome localization of hnhpl was performed using the radiation hybrid panel Stanford G3. This panel was provided by the human genome center at the Weizmann Institute. A 225 bp genomic fragment of hnhpl gene was amplified using the gene specific primers hnlu350 (SEQ ID NO:12) and hnllll6 (SEQ ID NO:10). PCR program to was as follows: 94 3 minutes, followed by 39 cycles of 94 °C seconds, 64 1 minute, 72 1 min. Analysis of results was done through the RH server at the Stanford human genome center.
EXAMPLE 1 Cloning an ESTfor a novel heparanase gene The entire amino acid sequence of human heparanase (SEQ ID NO:9) was used to screen human EST database for homologous sequences. Screening was performed using the BLAST 2.0 server at the NCBI, basic BLAST search, tblastn program.
A distantly homologous fragment was pooled out, accession number AI222323, IMAGE clone number 1843155 from Soares NFL_T_GBCSI Homo Sapiens cDNA library prepared from testis B-cells and fetal lungs. The search values for this sequence were as follows: Score 38.3 bits Expect 0.15 Identities 16/36 (44 Positives 22/36 (60 The sequence of accession number AI222323 contains 378 nucleotides of the 3' of clone 1843155 (complementary to nucleotides 165-543 of SEQ ID NO:23).
This clone was purchased from the IMAGE consortium. It contained an insert of 560 bp (SEQ ID NO:23). The entire nucleotide sequence was determined and compared to the hpa cDNA encoding human heparanase. The homology between clone 1843155 and hpa cDNA was restricted to the 3' region of the cDNA clone. There was 59 homology between nucleotides 99-275 of clone 1843155 (SEQ ID NO:23), and 1532-1708 of hpa (SEQ ID NO:24). The deduced amino acid sequence of this region had 60 homology (identical similar) to amino acids 488-542 (SEQ ID NO:9) of human heparanase. The downstream sequence (nucleotides 276-560, SEQ ID NO:23) represents a 3' untranslated region and a poly A tail. The upstream sequence, nucleotides WO 01/00643 PCT/IL00/00358 33 1-98 (SEQ ID NO:23) was unrelated to heparanase. This unrelated sequence was found to be identical to a different cDNA clone from the same library. Therefore, the human EST clone 1843155, obtained from the IMAGE consortium is assumed to be a chimera, which contains two unrelated partial cDNAs ligated to a single vector.
EXAMPLE 2 Cloning a cDNA for a novel heparanase gene In order to isolate the entire cDNA, three primers were designed according to the sequence of clone 1843155. The cDNA was amplified from placenta cDNA by Marathon RACE (rapid amplification of cDNA ends) (Clontech, Palo Alto, California) according to the manufacturer instructions. The first cycle was performed with the gene specific primer hn1116 (SEQ ID NO:10) and the universal primer Apl (SEQ ID The second cycle was performed with the gene specific primer hn11230 (SEQ ID NO:11) and the universal primer Ap2 (SEQ ID NO:21).
Following amplification, a difused band of approximately 1.7 kb was obtained. This cDNA amplification product was subcloned into pGEM Teasy (Promega, Madison, WI) and the nucleotide sequences of three independent clones pn5, pn6 and pn9 were determined. The consensus sequence of the longest cDNA, pn6, appears in Figure 1 (SEQ ID NOs: 1, 2 and It is 2060 nucleotide long and it contains an open reading frame of 1776 nucleotides, which encodes a polypeptide of 592 amino acids, with a calculated molecular weight of 66.5 kDa. The newly cloned gene was designated hnhpl. The two shorter forms, pn9 and pn5 and their deduced amino acid sequences are set forth in SEQ ID NOs:4 and 6 and SEQ ID and 7, respectively. Pn9 and pn5 were identical to pn6, however each one of then contained an in frame deletion as a result of alternative splicing. Pn9 contains a deletion of 162 nucleotides, 473-634 of SEQ ID NO:1, which correspond to amino acids 150-203 of SEQ ID NO:3. As a result pn9 encodes a putative polypeptide of 538 amino acids (SEQ ID having a calculated molecular weight of 60.4 kDa. Pn5 contains a deletion of 336 nucleotides, 473-808 of SEQ ID NO:1, which correspond to amino acids 150-261 of SEQ ID NO:3, thus, it encodes a putative polypeptides of 480 amino acids (SEQ ID NO:7) having a calculated molecular weight of 53.9 kDa. The 11 t h amino acid residue of SEQ ID NO:3 is methionine. It is generally accepted that the first methionine serves as a translation start site in mammals, however, the nucleotides WO 01/00643 PCT/IL00/00358 34 surrounding the second ATG fit better with the Kozak consensus sequence for translation start site. Translation may thus start at the second methionine and produce a protein of 581 amino acids with calculated molecular weight of 65.4 kDa. The presence of transcripts of variable length was confirmed by PCR amplification of the hnlhp cDNA using two gene specific primers: pn9-312u (SEQ ID NO:14) which is located close to the 5' end and hn11230 (SEQ ID NO:11) which overlaps the stop codon at the 3' end of the open reading frame. Amplification was performed from Marathon ready cDNA prepared from placenta and from testis. The o0 PCR products are shown in figure 3. Four bands were obtained from placenta: two major bands of 1.45 and 1.6 kb, similar to pn9 and pn6 and two minor bands, one of 1.35 kb, similar to pn5 and a second one of 1.8 kb. The sequence of the latter has not yet been determined. Amplification of testis cDNA resulted in a different pattern. Four bands of 1.35, 1.65, 1.85 and 2.05 kb were observed and a minor one of 1.5 kb. The various forms appear to represent products of alternative splicing. Since the deletions characterized so far retain an open reading frame, the translation products of the various cDNAs may constitute a protein family. The comparison between the amino acid sequence of hnhpl and heparanase is shown in Figure 3. Using the gap program of the GCG package which aligns the entire amino acid sequences, the homology between the two proteins is 45.5 identity and 7.3 similarity, total homology of 52.8 (gap creation penalty 50, gap extension penalty The BestFit program defines the region of the best homology between the two sequences. Using this program, the homology between the two amino acid sequences starts at position 63 of hnlhpl (SEQ ID NO:3) and position 41 of heparanase (SEQ ID NO:9) and is 47.5 identity and 7.8 similarity, i.e. homology of 55.3 The homology between the nucleotide sequences of hnhpl and hpa is 57 as calculated by the BestFit program. The homologous region is located between nucleotides 638-1812 of hnhpl (SEQ ID NO:1) and nucleotides 564-1708 of hpa (SEQ ID NO:24). Using the Gap program the homology is 51 over the entire sequence gap creation penalty 50, gap extension penalty 3.
EXAMPLE 3 Zoo blot Hnhpl cDNA was used as a probe to detect homologous sequences in human DNA and in DNA of various animals. The autoradiogram of the WO 01/00643 PCT/IL00/00358 Southern analysis is presented in Figure 4. Several bands were detected in human DNA. Several intense bands were detected in all mammals, while faint bands were detected in chicken. This correlates with the phylogenetic relation between human and the tested animals. The intense bands indicate that hnhpl is conserved among mammals as well as in more genetically distant organisms. The multiple bands patterns suggest that in all animals, hnhpl locus occupies a large genomic region. Several specific bands disappeared after stringent wash. These may represent homologous sequences and suggest the existence of a gene family, which can be to isolated based on their homology to the human hnhpl reported here.
EXAMPLE 4 comparison to heparanase via cross hybridization In order to check the capability of hpa and hnhpl to cross hybridize under low stringency conditions, the entire coding region of the human hpa and hnhpl were amplified by PCR. Human hpa was amplified from platelets mRNA by RT-PCR using the primers hpu-685 (SEQ ID NO:16) and hp1967 (SEQ ID NO:17), and hnhpl was amplified from testis using the primers hn11230 (SEQ ID NO:11) and pn9-312u (SEQ ID NO:14). The products were quantified and samples of 100 pg and 1 ng were run on agarose gel and subjected to Southern hybridization. The membranes were probed with 32p labeled hpa cDNA and with hnhpl cDNA. No cross hybridization was observed (Figure 5) even after over exposure for 5 days. Since hpa is the most similar sequence known today to that of hnhpl, this experiment indicates that the bands detected in the autoradiograph of Figure 4 are of the hnhpl gene or of yet unknown sequences homologous thereto, which might constitute a gene family.
This further indicated that such sequences are isolatable using the hnhpl as a probe to screen the relevant libraries, or using hnhpl derived PCR primers to amplify the relevant cDNA or DNA sequences.
EXAMPLE Chromosome localization The chromosome localization of hnhpl was determined using G3 radiation hybrid panel. Hnhpl was amplified from 83 human/mouse radiation hybrids. The results were analyzed by the RH server and the hnhpl gene was mapped to chromosome 10, next to the marker SHGC- WO 01/00643 PCT/IL00/00358 36 57721. The results also indicated a possibility of a second copy of the gene.
EXAMPLE 6 Expression Pattern ofhnhpl The tissue distribution of hnhpl transcripts was determined using calibrated human cDNA panels (Clontech, Palo Alto, Ca). The results are shown in Table 1 below. Expression level is generally low. PCR products were clearly observed only after 40 cycles of amplification.
TABLE 1 Tissue hnl (40 cycles) Bone marrow Liver Lymph node Leukocytes Spleen Thymus Tonsil Colon Ovary Prostate Small intestine Testis EXAMPLE 7 cloning of a Mouse homologue Screening of the mouse EST database with the amino acid sequence of heparanase as well as of hnhpl pooled out a mouse EST clone, which shares distant homology with heparanase and a remarkably high homology with hnhpl. The EST clone 1378452 accession number AI019269 from mouse thymus was 351 nucleotide long and it is set forth in SEQ ID NO:8.
It has 61-63 identity over 161 nucleotides (191-351, SEQ ID NO:8) to the human (SEQ ID NO:24) and mouse (SEQ ID NO:15) hpa nucleotide sequences, and 93 to hnhpl nucleotide sequence (SEQ ID NO: 1) using the BestFit program of the GCG package. The nucleotide sequence of this clone did not contain an open reading frame. Two frame shifts were identified in the sequence found in the EST database, as compared to the WO 01/00643 PCT/IL0O/00358 37 hnhpl sequence. This frame shifts were later confirmed by nucleotide sequence analysis of this clone as well as by isolation of this fragment from BL6 mouse melanoma cells and determination of its nucleotide sequence. This mouse gene is transcribed at very low levels. Low levels of expression were indicated as no amplification products were obtained following 40 cycles of PCR from mouse cDNA panel (Clontech, Palo Alto, Ca) which included cDNA from mouse heart, brain, spleen, lung, liver, skeletal muscle, kidney, testis and embryos of 7, 11,15, and 17 days.
The amplification was performed using the gene specific primers mnlu118 0o (SEQ ID NO:18) and mn11563 (SEQ ID NO:19).
EXAMPLE 8 Expression of hnhpl in mammalian cells A mammalian expression vector was constructed in order to overexpress hnhpl in human cells. To enable detection of the Hnhpl translation product, the hnhpl expression vector was designed to encode a C-terminal tagged hnl protein. A DNA sequence, which encodes eight amino acids FLAG (Kodak), was fused to the 3' end of the hnhpl open reading frame.
Fusion of the FLAG sequence to the hnhpl coding sequence was generated by PCR amplification using the primer: hnl-c-flag: A-3' (SEQ ID NO:25) and the primer: pn9-312u (SEQ ID NO:14). The PCR program was as follows: 94 3 min followed by 5 cycles of: 94 oC, 45 seconds, 50 45 seconds and 72 2 minutes, and then 32 cycles of 94 45 seconds, 64 oC, 45 seconds and 72 2 min.
The amplification product was subcloned into pGEM-T-easy, and the sequence was verified. The resulting plasmids were designated pGEMpn6F and pGEM-pn9F.
Two constructs were generated in pSI mammalian expression vector (Promega): the first contained the complete hnhpl sequence (pn6) and the second contained the alternative splice form (pn9). The pSI-pn6 expression vector was constructed by triple ligation of the following fragments: an EcoRI BamHI fragment, which contains the 5' end of hnlpn6, excised from pGem-T-easy-pn9, a BamHI NotI fragment which contains the 3' FLAG tagged hnhpl, excised from pGEM-pn6F and pSI digested with EcoRI NotI.
WO 01/00643 PCT/IL0/00358 38 The pSI-pn9 expression vector was constructed similarly, by triple ligation of the following fragments: an EcoRI SspI fragment, which contains the 5' end of hnhpl-pn6, excised from pGem-T-easy-pn9, an SspI -NotI fragment, which contains the 3' FLAG tagged hnhpl, excised from pGem-pn6F and pSI digested with EcoR I Not I.
The resulting plasmids were transfected into human embryonal kidney 293 cells, using the Fugene transfection reagent (Boehringer Mannheim). Forty-eight hours following transfection cells were harvested and proteins were analysed by western blot. Cell lysates of 2.5xl0 s were 0o separated by SDS-PAGE, transferred onto a nylon membrane and incubated with anti FLAG antibody 1:1000 dilution (Kodak anti FLAG M2 cat: IB 13025, final concentration 10 gg/ml). Proteins of approximately 65 kDa and 60 kDa were detected in cells transfected with pSI-pn6F and pSI-pn9F respectively. These proteins are similar in size to those predicted by the calculated molecular weight for the translation products of corresponding open reading frames. It is demonstrated that both the entire hnhpl cDNA and the pn9 splice form are successfully transcribed and translated in human 293 cells. However, unlike heparanase the Hnhpl protein products do not undergo major processing in these cells.
Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art.
Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims. All publications cited herein are incorporated by reference in their entirety.
WO 01/00643 PCT/IL00/00358 39 List of References 1. Durand, Lehn, Callebaut, Fabrega, Henrissat, B. and Mornon, J.P. (1997). Active-site motifs of lysosomal acid hydrolases: invariant features of clan GH-A glycosyl hydrolases deduced from hydrophobic cluster analysis. Glycobiology, 277- 284.
2. Vlodavsky, Friedmann, Elkin, Aingorn, Atzmon, R., Ishai-Michaeli, Bitan, Pappo, Peretz, Michal, 1., Spector, L and Pecker I. (1999). Mammalian heparanase: gene cloning, expression and function in tumor progression and metastasis.
Nature Medicine (In press) 3. Wight, Kinsella, and Qwamstromn, E.E. (1992). The role of proteoglycans in cell adhesion, migration and proliferation.
Curr. Opin. Cell Biol., 4, 793-801.
4. Jackson, Busch, and Cardin, A.L. (1991).
Glycosaminoglycans: Molecular properties, protein interactions and role in physiological processes. Physiol. Rev., 71, 481-539.
Wight, T.N. (1989). Cell biology of arterial proteoglycans.
Arteriosclerosis, 9, 1-20.
6. Kjellen, and Lindahl, U. (1991). Proteoglycans: structures and interactions. Annu. Rev. Biochem., 60, 443-475.
7. Ruoslahti, and Yamaguchi, Y. (1991). Proteoglycans as modulators of growth factor activities. Cell, 64, 867-869.
8. Vlodavsky, Bar-Shavit, Korer, and Fuks, Z. (1993).
Extracellular matrix-bound growth factors, enzymes and plasma proteins. In Basement membranes: Cellular and molecular aspects (eds. D.H. Rohrbach and R. Timpl), pp 327-343. Academic press Inc., Orlando, Fl.
WO 01/00643 PCT/IL00/00358 9. Vlodavsky, Bar-Shavit, Komer, and Fuks, Z. (1993).
Extracellular matrix-bound growth factors, enzymes and plasma proteins. In Basement membranes: Cellular and molecular aspects (eds. D.H. Rohrbach and R. Timpl), pp 327-343. Academic press Inc., Orlando, Fl.
Vlodavsky, Eldor, Haimovitz-Friedman, Matzner, Ishai- Michaeli, Levi, Bashkin, Lider, Naparstek, Cohen, and Fuks, Z. (1992). Expression of heparanase by platelets and circulating cells of the immune system: Possible involvement in diapedesis and extravasation. Invasion Metastasis, 12, 112-127.
11. Nakajima, Irimura, and Nicolson, G.L. (1988). Heparanase and tumor metastasis. J. Cell. Biochem., 36, 157-167.
12. Vlodavsky, I. et al. Invasion Metastasis 1995, 14:290-302.
13. Nakagima, M. et al J. Cell. Biochem. 1988, 36:157-167.
14. Vlodavsky, I. et al. Cancer res. 1983, 43:2704-2711.
Vlodavsky, I. et al. J. Med 1988, 24:464-470.
16. Vlodavsky, I. et al. Invasion and Metastasis 12:112-127.
17. Gilat,D. etal. J. Exp. Med. 1995, 181:1929-1934.
18. Matzneretal. 1985, J. Clin. Invest. 10:1306-1313.
19. Mollinedo, F. et al. Biochem. J. 1997, 327:917-923.
Murphy, G. etal. Biochem. J. 1990, 192:517-525.
21. Nakajima, M. et al. J. Cell. Biochem. 1988, 36(2):157-167.
22. Ishai-Michaeli R. wt al. Cell Reg. 1990, 1:833-842.
23. Cardon-Cardo C. et al. Lab. Inrest. 1990, 63:832-840.
WO 01/00643 PCT/ILOO/00358 41 24. Nicolson, G.L. (1988). Organ specificity of tumor metastasis: Role of preferential adhesion, invasion and growth of malignant cells at specific secondary sites. Cancer Met. Rev., 7, 143-188.
Liotta, Rao, and Barsky, S.H. (1983). Tumor invasion and the extracellular matrix. Lab. Invest., 49, 639-649.
26. Vlodavsky, Fuks, Bar-Ner, Ariav, and Schirrmacher, V.
(1983). Lymphoma cell mediated degradation of sulfated proteoglycans in the subendothelial extracellular matrix: Relationship to tumor cell metastasis. Cancer Res., 43, 2704-2711.
27. Sun L, Feusi E, Sibalic A, Beck-Schimmer B, Wuthrich RP (1998).
Expression profile ofhyaluronidase mRNA transcripts in the kidney and in renal cells. Kidney Blood Press Res;21(6):413-8 28. Parish, Coombe, Jakobsen, and Underwood, P.A.
(1987). Evidence that sulphated polysaccharides inhibit tumor metastasis by blocking tumor cell-derived heparanase. Int. J.
Cancer, 40, 511-517.
29. Burgess, and Maciag, T. (1989). The heparin-binding (fibroblast) growth factor family of proteins. Annu. Rev. Biochem., 58, 575-606.
Folkman, and Klagsbrun, M. (1987). Angiogenic factors.
Science, 235, 442-447.
31. Vlodavsky, Folkman, Sullivan, Fridman, Ishai-Michaelli, Sasse, and Klagsbrun, M. (1987). Endothelial cell-derived basic fibroblast growth factor: Synthesis and deposition into subendothelial extracellular matrix. .Proc. Natl. Acad. Sci. USA, 84, 2292-2296.
32. Folkman, Klagsbrun, Sasse, Wadzinski, Ingber, and Vlodavsky, 1. (1980). A heparin-binding angiogenic protein basic WO 01/00643 PCT/IL00/00358 42 fibroblast growth factor is stored within basement membrane. Am.
J. Pathol., 130, 393-400.
33. Bashkin, Doctrow, Klagsbrun, Svahn, Folkman, J., and Vlodavsky, I. (1989). Basic fibroblast growth factor binds to subendothelial extracellular matrix and is released by heparitinase and heparin-like molecules. Biochemistry, 28, 1737-1743.
34. Ishai-Michaeli, Svahn, Chajek-Shaul, Korner, Ekre, and Vlodavsky, I. (1992). Importance of size and sulfation of heparin in release of basic fibroblast factor from the vascular endothelium and extracellular matrix. Biochemistry, 31, 2080-2088.
Ishai-Michaeli, Eldor, and Vlodavsky, I. (1990). Heparanase activity expressed by platelets, neutrophils and lymphoma cells releases active fibroblast growth factor from extracellular matrix.
Cell Reg., 1, 833-842.
36. Vlodavsky, Bar-Shavit, Ishai-Michaeli, Bashkin, and Fuks, Z. (1991). Extracellular sequestration and release of fibroblast growth factor: a regulatory mechanism? Trends Biochem. Sci., 16, 268-271.
37. Vlodavsky, Bar-Shavit, Korner, and Fuks, Z. (1993).
Extracellular matrix-bound growth factors, enzymes and plasma proteins. In Basement membranes: Cellular and molecular aspects (eds. D.H. Rohrbach and R. Timpl), pp327-343. Academic press Inc., Orlando, Fl.
38. Yayon, Klagsbrun, Esko, Leder, and Omitz, D.M.
(1991). Cell surface, heparin-like molecules are required for binding of basic fibroblast growth factor to its high affinity receptor. Cell, 64, 841-848.
39. Spivak-Kroizman, Lemmon, Dikic, Ladbury, J.E., Pinchasi, Huang, Jaye, Crumley, Schlessinger, and Lax, I. (1994). Heparin-induced oligomerization of FGF molecules WO 01/00643 PCTIL00/00358 43 is responsible for FGF receptor dimerization, activation, and cell proliferation. Cell, 79, 1015-1024.
Omitz, Herr, Nilsson, West, Svahn, and Waksman, G. (1995). FGF binding and FGF receptor activation by synthetic heparan-derived di- and trisaccharides. Science, 268, 432- 436.
41. Gitay-Goren, Soker, Vlodavsky, and Neufeld, G. (1992).
Cell surface associated heparin-like molecules are required for the binding of vascular endothelial growth factor (VEGF) to its cell surface receptors. J. Biol. Chem., 267, 6093-6098.
42. Ernst S, Langer R, Cooney CL, Sasisekharan R. (1995). Enzymatic degradation of glycosaminoglycans. Crit Rev Biochem Mol Biol, 30(5):387-444.
43. Current protocols in molecular biology (1994-1999). Ausubel, F.M., Brent, Kingston, Moore Seidman, Smith, J.A., Struhl, K. Eds. JohnWiley Sons, Inc.
44. Rapraeger, Krufka, and Olwin, B.R. (1991). Requirement of heparan sulfate for bFGF-mediated fibroblast growth and myoblast differentiation. Science, 252, 1705-1708.
Shieh, Wundunn, Montgomery, Esko, and Spear, P.G. J. (1992). Cell surface receptors for herpes simplex virus are heparan sulfate proteoglycans. J Cell Biol., 116, 1273-1281.
46. Chen, Maguire, Hileman, Fromm, Esko, J.D., Linhardt, and Marks, R.M. (1997). Dengue virus infectivity depends on envelope protein binding to target cell heparan sulfate.
Nature Medicine 3, 866-871.
47. Putnak, Kanesa-Thasan, and Innis, B.L. (1997). A putative cellular receptor for dengue viruses. Nature Medicine 3, 828-829.
WO 01/00643 PCT/ILOO/00358 44 48. Narindrasorasak, Lowery, Gonzalez-DeWhitt, Poorman, Greenberg, Kisilevsky, R. (1991). High affinity interactions between the Alzheimer's beta-amyloid precursor protein and the basement membrane form oftheparan sulfate proteoglycan. J.
Biol. Chem., 266, 12878-83.
49. Eisenberg, Sehayek, Olivecrona, and Vlodavsky, I. (1992).
Lipoprotein lipase enhances binding of lipoproteins to heparan sulfate on cell surfaces and extracellular matrix. J. Clin. Invest., 90, 2013- 2021.
Ross, R. (1993). The pathogenesis of atherosclerosis: a perspective for the 1990s. Nature 362:801-809.
51. Zhong-Sheng, Walter, Brecht, Miranda, Mahmood Hussain, Innerarity, T.L. and Mahley, W.R. (1993). Role of heparan sulfate proteoglycans in the binding and uptake of apolipoprotein E-enriched remnant lipoproteins by cultured cells. J.
Biol. Chem., 268, 10160-10167.
52. Current protocols in protein science (1996-1999). Coligan, J.E., Dunn, Ploegh, Speicher, D.W. and Wingfield, P.T. Eds.
John Wiley Sons Inc.
EDITORIAL NOTE APPLICATION NUMBER 52448/00 The following Sequence Listing pages 1-11 is part of the description.
The claims pages follow on pages 45-47 WO 01/00643 PCT/1L0000358 GENERAL INFORMATION: APPLICANT: Ir (ii) TITLE OF INVENTION: PO (iii) NUMBER OF SEQUENCES: 24 (iv) CORRESPONDENCE ADDRESS: ADDRESSEE: Sol Sh STREET: 2001 J CITY: Arling STATE: Virgin COUNTRY: United ZIP: 22202 COMPUTER READABLE FORM: MEDIUM TYPE: 1.
COMPUTER: Tw OPERATING SYSTEM: MS Wi SOrWARE: Wo file (vi) CURRENT APPLICATION DATA: APPLICATION NUMBER: FILING DATE:
CLASSIFICATION:
(vii) PRIOR APPLICATION DATA: APPLICATION NUMBER: 60/1.
FILING DATE: June (viii) ATTORNEY/AGENT INFORMATION:
NAME:
REGISTRATION NUMBER: REFERENCE/DOCKET NUMBER: (ix) TELECOMMUNICATION INFORMATION:
TELEPHONE:
TELEFAX:
TELEX:
NGG
is Pecker et al.
LYNUCLEOTIDES AND POLYPEPTIDES ENCODED THEREBY einbein c/o Anthony Castorina efferson Davis Highway, Suite 207 ton ia States of America 44 megabyte, 3.5' microdisk inhead* Slimnote-B9OTX DOS version 6.2, ndows version 3.11 rd for Windows version converted to an ASCI 40,801 25, 1999 Sheinbein, Sol 25,457 20105 972-3-6127676 972-3-6127575 INFORMATION FOR SEQ ID NO:1: SEQUENCE CHARACTERISTICS: LENGTH: 2060 TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: CGCTTAATTC TAGAAGAGGG ATTGAATGAG GGTGCTTTGT GCCTTC AAGCCATGCC CTCCAGCAAC TCCCGCCCCC CCGCGTGCCT AGCCCC GCTCTCTACT TGGCTCTGTT GCTCCATCTC TCCCTTTCCT CCCAGG AGACAGGAGA CCCTTGCCTG TAGACAGAGC TGCAGGTTTG AAGGAA CCCTGATTCT ACTTGATGTG AGCACCAAGA ACCCAGTCAG GACAGT GAGAACTTCC TCTCTCTGCA GCTGGATCCG TCCATCATTC ATGATG GCTCGATTTC CTAAGCTCCA AGCGCTTGGT GACCCTGGCC CGGGGA CGCCCGCCTT TCTGCGCTTC GGGGGCAAAA GGACCGACTT CCTGCA CAGAACCTGA GGAACCCGGC GAAAAGCCGC GGGGGCCCGG GCCCGG CTATCTCAAA AACTATGAGG ATGACATTGT TCGAAGTGAT GTTGCC ATAAACAGAA AGGCTGCAAG ATTGCCCAGC ACCCTGATGT TATGCT CTCCAAAGGG AGAAGGCAGC TCAGATGCAT CTGGTTCT'TC TAAAGG ATTCTCCAAT ACTTACAGTA ATCTCATATT AACAGCCAGG TCTCTA AACTTTATAA CTTTGCTGAT TGCTCTGGAC TCCACCTGAT ATTTGC AATGCACTGC GTCGTAATCC CAATAACTCC TGGAACAGTT CTAGTG GAGTCTGTTG AAGTACAGCG CCAGCAAAAA GTACAACATT TCTTGG TGGGTAATGA GCCAAATAAC TATCGGACCA TGCATGGCCG GGCAGT GGCAGCCAGT TGGGAAAGGA TTACATCCAG CTGAAGAGCC TGTTGC CATCCGGATT TATTCCAGAG CCAGCTTATA TGGCCCTAAT ATTGGG CGAGGAAGAA TGTCATCGCC CTCCTAGATG GATTCATGAA GGTGGC
CCTG
GGGG
CTGG
AAGA
CAAT
GCTG
CTTT
GTTC
ATTA
TTAG
GGAG
AGCA
GACA
TCTA
CCCT
GAAC
AAAT
AGCC
CGGC
AGGA
AGTACAGTAG
GGTCAAGGTG
ACCAGATTAG
AAGATTTGGC
TCTATCCGAT
TGCTGGCCAA
CATGGATACA
CTGGCTCTCT
TGCATGTGGC
GACAAACTAA
CGTTCGTGGG
AGAAAATCAA
CTGCTGCAGC
GAATGGCCAG
AGCCCCGCCC
ATGGGCTTTT
ATGCAGTTAC CTGGCAACAT TGCTACATTG ATGGCCGGGT ATGGACTTCC TGAAAACTCG CCTGTTAGAC ACACTCTCTG GAAAATTCAG AAAGTGGTTA ATACATACAC TCCAGGAAAG TTGAAGGTGT GGTGACCACC TCAGCTGGAG GCACAAACAA TCCTATGCTG CAGGATTCTT ATGGTTGAAC ACTTTAGGAA TCAGGGCATT GATGTCGTGA TACGGCACTC ATTTTTTGAC ATCACCTCGT GGACCAGAAT TTTAACCCAT TACCAGACTA CTCCTCTACA AGCGCCTGAT CGGCCCCAAA GTCTTGGCTG TGGGCTCCAG CGGAAGCCAC GGCCTGGCCG AGTGATCCGG GGATTTATGC TCACTGCACA AACCACCACA ACCACAACTA TCCATTACAC TTTTTATCAT CAACTTGCAT CGATCAAGAA GCTGGCTGGG ACTCTCAGAG ACAAGCTGGT TCACCAGTAC CCTATGGGCA GGAGGGCCTA AAGTCCAAGT CAGTGCAACT CCCTTAGTGA TGGTGGACGA CGGGACCCTC CCAGAATTGA CCTTCGGGCC GGCCGGACAT TGGTCATCCC TCCAGTCACC TTGTGGTCAA GAATGTCAAT GCTTTGGCCT GCCGCTACCG WO 01/00643 PCT1L410100358 2 ATAAGCTATC CTCACACTCA TGGCTACCAG TGGGCCTGCT GGGCTGCTTC 1850 CACTCCTCCA CTCCAGTAGT ATCCTCTGTT TTCAGACATC CTAGCAACCA 1900 GCCCCTGCTG CCCCATCCTG CTGGAATCAA CACAGACTTG CTCTCCAAAG 1950 AGACTAAATG TCATAGCGTG ATCTTAGCCT AGGTAGGCCA CATCCATCCC 2000 AAAGGAAAAT GTAGACATCA CCTGTACCTA TATAAGGATA AAGGCATGTG 2050 TATAGAGCAA 2060 INFORMATION FOR SEQ ID NO:2: SEQUENCE CHARACTERISTICS: LENGTH: 2060 TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: C GCT TAA TTC TAG AAG AGG SAT TGA ATG AGS GTG CTT TGT GCC TTC CCT GAA GCC ATG CCC TCC AGC AAC Met Arg Val Leu Cys Ala Phe Pro Glu Ala Met Pro Ser Ser Asn 5 10 TCC CGC CCC CCC GCG TGC CTA GCC CCG GGG GCT CTC TAC TTG GCT 115 Ser Arg Pro Pro Ala Cys Leu. Ala Pro Gly Ala Leu Tyr Leu Ala 25 CTG TTG CTC CAT CTC TCC CTT TCC TCC CAG GCT GGA GAC AGG AGA 160 Leu Leu Leu His Leu Ser Leu Ser Ser Gln Ala Gly Asp Arg Arg .35 40 CCC TTG CCT GTA GAC AGA GCT GCA GGT TTG AAG GAA AAG ACC CTG 205 Pro Leu Pro Val Asp Arg Ala Ala Giy Leu Lys Siu Lys Thr Leu 55 ATT CTA CTT GAT GTG AGC ACC AAG AAC CCA STC AGG ACA GTC AAT 250 Ile Leu Leu Asp Val Ser Thr Lys Asn Pro Val Arg Thr Val Asn 65 70 GAG AAC TTC CTC TCT CTG CAG CTG GAT CCG TCC ATC ArT CAT GAT 295 Glu Asn Phe Leu Ser Lou Gln Leu Asp Pro Ser Ile Ile His Asp 85 GGC TGG CTC GAT TTC CTA ASC TCC AAS CGC TTG GTG ACC CTG GCC 340 Gly Trp Leu Asp Phe Leu Ser Ser Lys Arg Leu Val Thr Leu Ala 95 100 105 CGG GSA CTT TCG CCC GCC TTT CTG CGC TTC GGG GGC AAA AGS ACC 385 Arg Gly Leu Ser Pro Ala Phe Leu Arg Phe Gly Gly Lys Arg Thr 110 115 120 GAC TTC CTG CAG TTC CAG AAC CTG AGG AAC CCG GCG AAA AGC CGC 430 Asp Phe Leu Gln Phe Gin Asn Leu Arg Asn Pro Ala Lys Ser Arg 125 130 135 GGS SGC CCG GGC CCG GAT TAC TAT CTC AAA AAC TAT GAG GAT GAC 475 Giy Gly Pro Gly Pro Asp Tyr Tyr Leu Lys Asn Tyr Giu Asp Asp 140 145 150 ATT GTT CGA AGT GAT GTT GCC TTA GAT AAA CAG AAA GGC TGC AAG 520 Ile Val Arg Ser Asp Val Ala Leu Asp Lys Gin Lys Giy Cys Lys 155 160 165 ATT GCC CAG CAC CCT GAT GTT ATG CTG GAG CTC CAA AGG GAG AAG 565 Ile Ala Gin His Pro Asp Val Met Leu Giu Leu Gin Arg Glu Lys 170 175 180 GCA GCT CAG ATG CAT CTG GTT CTT CTA AAG GAG CAA TTC TCC AAT 610 Ala Ala Gin Net His Leu Val Leu Leu Lys Giu Gin Phe Ser Asn 185 190 195 ACT TAC AGT AAT CTC ATA TTA ACA GCC AGG TCT CTA GAC AAA CTT 655 Thr Tyr Ser Asn Leu Ile Leu Thr Ala Arg Ser Leu Asp Lys Leu 200 205 210 TAT AAC TTT SCT GAT TGC TCT GGA CTC CAC CTG ATA TTT GCT CTA 700 Tyr Asn Phe Ala Asp Cys Ser Gly Leu His Leu Ile Phe Ala Leu 215 220 225 AAT GCA CTG CGT CGT AAT CCC AAT AAC TCC TGG AAC AST TCT AGT 745 Asn Ala Leu Arg Arg Asn Pro Asn Asn Ser Trp Asn Ser Ser Ser 230 235 240 GCC CTG AGT CTG TTG AAS TAC AGC GCC AGC AAA AAG TAC AAC ATT 790 Ala Leu Ser Leu Leu Lys Tyr Ser Ala Ser Lys Lys Tyr Asn Ile 245 250 255 TCT TGG GAA CTG GGT AAT GAG CCA AAT AAC TAT CGG ACC ATG CAT 835 Ser Trp Siu Leu Sly Asn Glu Pro Asn Asn Tyr Arg Thr Met His 260 265 270 GSC CSG SCA GTA AAT SGC AGC CAS TTG GSA AAG SAT TAC ATC CAG 880 Sly Arg Ala Val Asn Sly Ser Gin Leu Gly Lys Asp Tyr Ile Gin 275 280 285 CTS AAS ASC CTS TTS CAG CCC ATC CSS AT? TAT TCC AGA GCC ASC 925 Leu Lys Ser Leu Leu Sin Pro Ile Arg Ile Tyr Ser Arg Ala Ser 290 295 300 TTA TAT SSC CCT AAT ATT GGG CSG CCS ASS AAG AAT GTC ATC SCC 970 Leu Tyr Sly Pro Asn Ile Sly Arg Pro Arg Lys Asn Val Ile Ala 305 310 315 CTC CTA SAT GSA TTC ATS AAS STS SCA GSA AST ACA STA SAT SCA 1015 Leu Leu Asp Sly Phe Met Lys Val Ala Sly Ser Thr Val Asp Ala WO 01M~643 PCT1L00/00358 ACC TGG C1 Thr Trp GI GAC TTC C1 Asp Phe L( AGG AAA Al Arg Lys I] ATT TGG C1 Ile Trp Lt AAT CTA T( Asn Leu S( TTA GGA Al Leu Gly M( TCA TTT T7.
Ser Phe P) AAC CCA T7 Asn Pro Li ATC GGC C( Ile Gly Pi AAG CCA C( Lys Pro A: GCT CAC T( Ala His C, ATT ACA C', Ile Thr Li AAG CTG G( Lys Leu A] CTG CAG C( Leu Gin Pi CTG AAT G( Leu Asn G: GAA TTG A) Giu Leu L, CCT CCA G' Pro Pro V.
TTG GCC T( Leu Aia C, AGT GGG C( CTC TGT T' TGC TGG Al TAG CGT GJ ATG TAG A( AGA GCA A TGC TAC Cys Tyr ACT CGC Thr Arg AAA GTG Lys Vai GGT GTG Gly Val TCC TAT Ser Tyr GCC AAT Ala Asn CAT GGA Hi-Js Giy GAC TAC Asp Tyr GTC TTG Val Leu GGC CGA Gly Arg AAC CAC Asn His ATC ATC Ile Ile ACT CTC Thr Leu GGG CAG Gly Gin CCC TTA Pro Leu CGC CCC Arg Pro ATG GGC Met Giy TAC CGA Tyr Arg GGG CTG ACA TCC CAC AGA AGC CTA CCT GTA 325 ATT GAT' GGC Ile Asp Giy 340 CTG TTA GAC Leu Leu Asp 355 GTT AAT ACA Val Asn Thr 370 GTG ACC ACC Vai Thr Thr 385 GCT GCA GGA Ala Ala Gly 400 CAG GGC ATT Gin Gly Ile 415 TAC PAT CAC Tyr ASn His 430 TGG CTC TCT Trp Leu Ser 445 GCT GTG CAT Ala Val His 460 GTG ATC CGG Val Ile Arg 475 CAC AAC CAC His Asn His 490 AAC TTG CAT Asn Leu His 505 AGA GAC AAG Arg Asp Lys 520 GAG GGC CTA Glu Gly Leu 535 GTG ATG GTG Val Met Val 550 CTT CGG GCC Leu Arg Ala 565 TTT TTT GTG Phe Phe Val 580 TAA GCT ATC CGG GTG Arg Val ACA CTC Thr Leu TAC ACT Tyr Thr TCA GCT Ser Ala TTC TTA Phe Leu GAT GTC ASP Val CTC GTG Leu Val CTC CTC Leu Leu GTG GCT Val Ala GAC AAA Asp Lys AAC TAC Asn Tyr CGA TCA Arg Ser CTG GTT Leu Val AAG TCC Lys Ser GAC GAC Asp ASP GGC CGG Giy Arg GTC AAG Val Lys CTC ACA GAC CAG Asp Gin TAC AAG Tyr Lys GGG CTC Gly Leu CTA AGG Leu Arg GTT CGT Val Arg AGA AAG Arg Lys CAC CAG His Gin AAG TCA Lys Ser GGG ACC Gly Thr ACA TTG Thr Leu PAT GTC Asn Val CTC ATG 1060 1105 1150 1195 1240 1285 1330 1375 1420 1465 1510 1555 1600 1645 1690 1735 1780 1825 1870 1915 1960 2005 2050 2060 CTT CCA CTC CTC TAG CPA CCA GCC CTT GCT CTC CPA GGT AGG CCA CAT CCT ATA TAA GGA CAC TCC CCT GCT AGA GAC CCA TCC TPA AGG INFORMATION FOR SEQ ID NO:3: SEQUENCE CHARACTERISTICS: LENGTH: 592 TYPE: amino acid STRANOEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: Met Arg Val Leu Cys Ala Phe Pro Glu Ala Met Pro Ser 10 Ser Arg Pro Pro Ala Cys Leu Ala Pro Gly Ala Leu Tyr 25 Leu Leu Leu His Leu Ser Leu Ser Ser Gin Ala Gly Asp 40 Pro Leu Pro Val Asp Arg Ala Ala Gly Leu Lys Giu Lys 55 Ile Leu Leu Asp Val Ser Thr Lys Asn Pro Val Arg Thr 70 Glu Asn Phe Leu Ser Leu Gin Leu Asp Pro Ser Ile Ile Ser Asn Leu Ala Arg Arg Thr Leu Val Asn His Asp WO 01/00643 PCT/IL00/00358 Trp Leu Asp Phe Leu Ser Gly Leu Ser Pro Ala Phe 110 Phe Leu Gin Phe Gin Asn 125 Gly Pro Gly Pro Asp Tyr 140 Val Arg Ser Asp Val Ala 155 Ala Gin His Pro Asp Val 170 Ala Gin Met His Leu Val 185 Tyr Ser Asn Leu Ile Leu 200 Asn Phe Ala Asp Cys Ser 215 Ala Leu Arg Arg Asn Pro 230 Leu Ser Leu Leu Lys Tyr 245 Trp Glu Leu Gly Asn Glu 260 Arg Ala Val Asn Gly Ser 275 Lys Ser Leu Leu Gln Pro 290 Tyr Gly Pro Asn Ile Gly 305 Leu Asp Gly Phe Met Lys 320 Thr Trp Gin His Cys Tyr 335 Asp Phe Leu Lys Thr Arg 350 Arg Lys Ile Gln Lys Val 365 Ile Trp Leu Glu Gly Val 380 Asn Leu Ser Asp Ser Tyr 395 Leu Gly Met Leu Ala Asn 410 Ser Phe Phe Asp His Gly 425 Asn Pro Leu Pro Asp Tyr 440 Ile Gly Pro Lys Val Leu 455 Lys Pro Arg Pro Gly Arg 470 Ala His Cys Thr Asn His 485 Ile Thr Leu Phe Ile Ile 500 Lys Leu Ala Gly Thr Leu 515 Leu Gin Pro Tyr Gly Gin 530 Leu Asn Gly Gin Pro Leu 545 Glu Leu Lys Pro Arg Pro 560 Pro Pro Val Thr Met Gly 575 Leu Ala Cys Arg Tyr Arg 590 Lys Arg 100 Arg Phe 115 Arg Asn 130 Leu Lys 145 Asp Lys 160 Leu Glu 175 Leu Lys 190 Ala Arg 205 Leu His 220 Asn Ser 235 Ala Ser 250 Asn Asn 265 Leu Gly 280 Arg Ile 295 Pro Arg 310 Ala Gly 325 Asp Gly 340 Leu Asp 355 Asn Thr 370 Thr Thr 385 Ala Gly 400 Gly Ile 415 Asn His 430 Leu Ser 445 Val His 460 Ile Arg 475 Asn His 490 Leu His 505 Asp Lys 520 Gly Leu 535 Met Val 550 Arg Ala 565 Phe Val 580 Val Thr Leu Ala 105 Gly Lys Arg Thr 120 Ala Lys Ser Arg 135 Tyr Glu Asp Asp 150 Lys Gly Cys Lys 165 Gln Arg Glu Lys 180 Gin Phe Ser Asn 195 Leu Asp Lys Leu 210 Ile Phe Ala Leu 225 Asn Ser Ser Ser 240 Lys Tyr Asn Ile 255 Arg Thr Met His 270 Asp Tyr Ile Gin 285 Ser Arg Ala Ser 300 Asn Val Ile Ala 315 Thr Val Asp Ala 330 Val Val Lys val 345 Leu Ser Asp Gln 360 Thr Pro Gly Lys 375 Ala Gly Gly Thr 390 Leu Trp Leu Asn 405 Val Val Ile Arg 420 Val Asp Gin Asn 435 Leu Tyr Lys Arg 450 Ala Gly Leu Gin 465 Lys Leu Arg Ile 480 Tyr Val Arg Gly 495 Ser Arg Lys Lys 510 Val His Gin Tyr 525 Ser Lys Ser Val 540 Asp Gly Thr Leu 555 Arg Thr Leu Val 570 Lys Asn Val Asn 585 INFORMATION FOR SEQ ID NO:4: SEQUENCE CHARACTERISTICS: LENGTH: 1898 TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: CGCTTAATTC TAGAAGAGGG ATTGAATGAG GGTGCTTTGT GCCTTCCCTG AAGCCATGCC CTCCAGCAAC TCCCGCCCCC CCGCGTGCCT AGCCCCGGGG GCTCTCTACT TGGCTCTGTT GCTCCATCTC TCCCTTTCCT CCCAGGCTGG WO 01/00643 PCTILOOIOO358 AGACAGGAGA CCCTTGCCTG CCCTGATTCT ACTTGATGTG GAGAACTTCC TCTCTCTGCA GCTCGATTTC CTAAGCTCCA CGCCCGCCTT TCTGCGCTTC CAGAACCTGA GGAACCCGGC CTATCTCAAA AACTATGAGG TTGCTGATTG CTCTGGACTC CGTAATCCCA ATAACTCCTG GTACAGCGCC AGCAAAAAGT CAAATAACTA TCGGACCATG GGAAAGGATT ACATCCAGCT TTCCAGAGCC AGCTTATATG TCATCGCCCT CCTAGATGGA GCAGTTACCT GGCAACATTG GGACTTCCTG AAAACTCGCC AAATTCAGAA AGTGGTTAAT GAAGGTGTGG TGACCACCTC CTATGCTGCA GGATTCTTAT AGGGCATTGA TGTCGTGATA CACCTCGTGG ACCAGAATTT CCTCTACAAG CGCCTGATCG GGCTCCAGCG GAAGCCACGG ATTTATGCTC ACTGCACAAA CATTACACTT TTTATCATCA TGGCTGGGAC TCTCAGAGAC TATGGGCAGG AGGGCCTAAA CTTAGTGATG GTGGACGACG TTCGGGCCGG CCGGACATTG GTGGTCAAGA ATGTCAATGC CACACTCATG GCTACCAGTG CCAGTAGTAT CCTCTGTTTT CCATICCTGCT GGAATCAACA TAGACAGAGC TGCAGGTTTG AAGGAAAAGA AGCACCAAGA ACCCAGTCAG GACAGTCAAT GCTGGATCCG TCCATCATTC ATGATGGCTG AGCGCTTGGT GACCCTGGCC CGGGGACTTT GGGGGCAAAA GGACCGACTT CCTGCAGTTrC GAAAAGCCGC GGGGGCCCGG GCCCGGATTA ATGCCAGGTC TCTAGACAAA CTTTATAACT CACCTGATAT TTGCTCTAAA TGCACTGCGT GAACAGTTCT AGTGCCCTGA GTCTGTTGAA ACAACATTTC TTGGGAACTG GGTAATGAG:C CATGGCCGGG CAGTAAATGG CAGCCAGTTG GAAGAGCCTG TTGCAGCCCA TCCGGATTTA GCCCTAATAT TGGGCGGCCG AGGAAGAATG TTCATGAAGG TGGCAGGAAG TACAGTAGAT CTACATTGAT GGCCGGGTGG TCAAGGTGAT TGTTAGACAC ACTCTCTGAC CAGATTAGGA ACATACACTC CAGGAAAGAA GATTTGGCTT AGCTGGAGGC ACAAACAATC TATCCGATTC GGTTGAACAC TTTAGGAATG CTGGCCAATC CGGCACTCAT TTT'rTGACCA TGGATACAAT TAACCCATTA CCAGACTACT GGCTCTCTCT GCCCCAAAGT CTTGGCTGTG CATGTGGCTG CCTGGCCGAG TGATCCGGGA CAAACTAAGG CCACCACAAC CACAACTACG TTCGTGGGTC ACTTGCATCG ATCAAGAAAG AAAATCAAGC AAGCTGGTTC ACCAGTACCT GCTGCAGCCC GTCCAAGTCA GTGCAACTGA ATGGCCAGCC GGACCCTCCC AGAATTGAAG CCCCGCCCCC GTCATCCCTC CAGTCACCAT GGGCTTTTTT TTTGGCCTGC CGCTACCGAT AAGCTATCCT GGCCTGCTGG GCTGCTTCCA CTCCTCCACT CAGACATCCT AGCAACCAGC CCCTGCTGCC CAGACTTGCT CTCCAAAGAG ACTAAATGTC 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950 1000 1050 1100 1150 1200 1250 1300 1350 1400 1450 1500 1550 1600 1650 1700 1750 1800 1850 1898 ATAGCGTGAT CTTAGCCTAG GTAGGCCACA TCCATCCCAA AGGAAAATGT AGACATCACC TGTACCTATA TAAGGATAAA GGCATGTGTA TAGAGCAA 2) INFORMATION FOR SEQ ID Wi SEQUENCE CHARACTERISTICS: LENGTH: 538 TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID Met Arg Val Leu Ser Arg Pro Pro Leu Leu Leu His Pro Leu Pro Val Ile Leu Leu Asp Glu Asn Phe Leu Gly Trp Leu Asp Arg Gly Leu Ser Asp Phe Leu Gln Gly Gly Pro Gly Arg Ser Leu Asp His Leu Ile Phe Ser Trp Asn Ser Ser Lys Lys Tyr Asn Tyr Arg Thr Gly Lys Asp Tyr Ile Tyr Ser Arg Arg Lys Asn Val Cys Ala Ala Cys Leu Ser Asp Arg Val Ser Ser Leu Phe Leu Pro Ala 110 Phe Gln 125 Pro Asp 140 Lys Leu 155 Ala Leu 170 Ser Ser 185 Asn Ile 200 Met His 215 Ile Gln 230 Ala Ser 245 Ile Ala Phe Pro Glu Ala Met Pro Ser Ser Asn 10 Leu Ala Pro Gly Ala Leu Tyr Leu Ala 25 Leu Ser Ser Gin Ala Gly Asp Arg Arg 40 Ala Ala Gly Leu Lys Glu Lys Thr Leu 55 Thr Lys Asn Pro Val Arg Thr Val Asn 70 Gin Leu Asp Pro Ser Ile Ile His Asp 85 Ser Ser Lys Arg Leu Vai Thr Leu Ala 100 105 Phe Leu Arg Phe Gly Gly Lys Arg Thr 115 120 Asn Leu Arg Asn Pro Ala Lys Ser Arg 130 135 Tyr Tyr Leu Lys Asn Tyr Glu Asp Ala 145 150 Tyr Asn Phe Ala Asp Cys Ser Gly Leu 160 165 Asn Ala Leu Arg Arg Asn Pro Asn Asn 175 180 Ala Leu Ser Leu Leu Lys Tyr Ser Ala 190 195 Scr Trp Glu Leu Gly Asn Glu Pro Asn 205 210 Gly Arg Ala Val Asn Gly 5cr Gin Leu 220 225 Leu Lys Ser Leu Leu Gln Pro Ile Arg 235 240 Leu Tyr Gly Pro Asn Ile Gly Arg Pro 250 255 Leu Leu Asp Gly Phe Met Lys Val Ala WO 01/00643 PCT/iLOOIOO358 Gly Ser Thr Gly Arg Val Asp Thr Leu Thr Tyr Thr Thr Ser Ala Gly Phe Leu Ile Asp Val His Leu Val Ser Leu Leu His Vai Ala Arg Asp Lys His Asn Tyr His Arg Ser Lys Leu Val Leu Lys Ser Val Asp Asp Ala Giy Arg Val Val Lys 260 Asp Ala Val 275 Lys Val Met 290 Ala Gin Ile 305 Gly Lys Lys 320 Gly Thr Asn 335 Leu Asn Thr 350 Ile Arg His 365 Gin Asn Phe 380 Lys Arg Leu 395 Leu Gin Arg 410 Arg Ile Tyr 425 Arg Giy Ser 440 Lys Lys Ile 455 Gin Tyr Leu 410 Ser Val Gin 485 Thr Leu Pro 500 Leu Val Ile 515 Val Asn Ala 530 6 265 Thr Trp Gin His Cys Tyr 280 Asp Phe Leu Lys Thr Arg 295 Arg Lys Ile Gin Lys Val 310 Ile Trp Leu Glu Gly Val 325 Asn Leu Ser Asp Ser Tyr 340 Leu Gly Met Leu Ala Asn 355 Ser Phe Phe Asp His Gly 370 Asn Pro Leu Pro Asp Tyr 385 Ile Gly Pro Lys Val Leu 400 Lys Pro Arg Pro Gly Arg 415 Ala His Cys Thr Asn His 430 Ile Ihr Leu Phe Ile Ile 445 Lys Leu Aia Gly Thr Leu 460 Leu Gin Pro Tyr Gly Gin 475 Leu Asn Gly Gin Pro Leu 490 Glu Leu Lys Pro Arg Pro 505 Pro Pro Val Thr Met Gly 520 Leu Ala Cys Arg Tyr Arg 535 270 Ile Asp 285 Leu Leu 300 Vai Asn 315 Val Thr 330 Ala Aia 345 Gin Gly 360 Tyr Asn 375 Trp Leu 390 Ala Val 405 Val Ile 420 His Asn 435 Asn Leu 450 Arg Asp 465 Glu Gly 480 Val Met 495 Leu Arg Phe Phe 525 2) INFORMATION FOR SEQ ID NO:6: SEQUENCE CHARACTERISTICS: LENGTH: 1724 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: CGCTTAATTC TAGAAGAGGG ATTGAATGAG GGTGCTTTGT GCCTTCCCTG AAGCCATGCC CTCCAGCAAC TCCCGCCCCC CCGCGTGCCT AGCCCCGGGG GCTCTCTACT TGGCTCTGTT GCTCCATCTC TCCCTTTCCT CCCAGGCTGG AGACAGGAGA CCCTTGCCTG TAGACAGAGC TGCAGGTTTG AAGGAAAAGA CCCTGATTCT ACTTGATGTG AGCACCAAGA ACCCAGTCAG GACAGTCAAT GAGAACTTCC TCTCTCTGCA GCTGGATCCG TCCATCATTC ATGATGGC!TG GCTCGATTTC CTAAGCTCCA AGCGCTTGGT GACCCTGGCC CGGGGACTTT CGCCCGCCTT TCTGCGCTTC GGGGGCAAAA GGACCGACTT CCTGCAGTTC CAGAACCTGA GGAACCCGGC GAAAAGCCGC GGGGGCCCGG GCCCGGATTA CTATCTCAAA AACTATGAGG ATGAGCCAAA TAACTATCGG ACCATGCATG GCCGGGCAGT AAATGGCAGC CAGTTGGGAA AGGATTACAT CCAGCTGAAG AGCCTGTTGC AGCCCATCCG GATTTATTCC AGAGCCAGCT TATATGGCCC TAATATTGGG CGGCCGAGGA AGAATGTCAT CGCCCTCCTA GATGGATTCA TGAAGGTGGC AGGAAGTACA GTAGATGCAG TTACCTGGCA ACATTGCTAC ATTGATGGCC GGGTGGTCAA GGTGATGGAC TTCCTGAAAA CTCGCCTGTT AGACACACTC TCTGACCAGA TTAGGAAAAT TCAGAAAGTG GTTAATACAT ACACTCCAGG AAAGAAGATT TGGCTTGAAG GTGTGGTGAC CACCTCAGCT GGAGGCACAA ACAATCTATC CGATTCCTAT
GAACACTTTA
ACTCATTTTT
CCATTACCAG
CAAAGTCTTG
GCCGAGTGAT
CACAACCACA
GCATCGATCA
TGGTTCACCA
AAGTCAGTGC
CCTCCCAGAA
TCCCTCCAGT
GCCTGCCGCT
TGCTGGGCTG
CATCCTAGCA
CTTGCTCTCC
GCCACATCCA
GATAAAGGCA
GGAATGCTGG CCAATCAGGG TGACCATGGA TACAATCACC ACTACTGGCT CTCTCTCCTC GCTGTGCATG TGGCTGGGCT CCGGGACAAA CTAAGGATTT ACTACGTTCG TGGGTCCATT AGAAAGAAAA TCAAGCTGGC GTACCTGCTG CAGCCCTATG AACTGAATGG CCAGCCCTTA TTGAAGCCCC GCCCCCTTCG CACCATGGGC TTTTTTGTGG ACCGATAAGC TATCCTCACA CTTCCACTCC TCCACTCCAG ACCAGCCCCT GCTGCCCCAT AAAGAGACTA AATGTCATAG TCCCAAAGGA AAATGTAGAC TGTGTATAGA GCAA GCTGCAGGAT TCTTATGGTT CATTGATGTC GTGATACGGC TCGTGGACCA GAATTTTAAC TACAAGCGCC TGATCGGCCC CCAGCGGAAG CCACGGCCTG ATGCTCACTG CACAAACCAC ACACTTTTTA TCATCAACTT TGGGACTCTC AGAGACAAGC GGCAGGAGGG CCTAA.AGTCC GTGATGGTGG ACGACGGGAC GGCCGGCCGG ACATTGGTCA TCAAGAATGT CAATGCTTTG CTCATGGCTA CCAGTGGGCC TAGTATCCTC TGTTTTCAGA CCTGCTGGAA TCAACACAGA CGTGATCTTA GCCTAGGTAG ATCACCTGTA CCTATATAAG 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 650 900 950 1000 1050 1100 1150 1200 1250 1300 1350 1400 1450 1500 1550 1600 1650 1700 1724 WO 01/00643 PCTIL00/00358 7 INFORMATION FOR SEQ ID NO:7: SEQUENCE CHARACTERISTICS: LENGTH: 480 TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: Met Arg Val Leu Cys Ala Phe Pro Glu Ala Met Pro Ser Ser Asn 10 Ser Arg Pro Pro Ala Cys Leu Ala Pro Gly Ala Leu Tyr Leu Ala 25 Leu Leu Leu His Leu Ser Leu Ser Ser Gin Ala Gly Asp Arg Arg 40 Pro Leu Pro Val Asp Arg Ala Ala Gly Leu Lys Glu Lys Thr Leu 55 Ile Leu Leu Asp Val Ser Thr Lys Asn Pro Val Arg Thr Val Asn 70 Glu Asn Phe Leu Ser Leu Gln Leu Asp Pro Ser Ile Ile His Asp 85 Gly Trp Leu Asp Phe Leu Ser Ser Lys Arg Leu Val Thr Leu Ala 100 105 Arg Gly Leu Ser Pro Ala Phe Leu Arg Phe Gliy Gly Lys Arg Thr 110 115 120 Asp Phe Leu Gin Phe Gin Asn Leu Arg Asn Pro Ala Lys Ser Arg 125 130 135 Gly Gly Pro Gly Pro Asp Tyr Tyr Leu Lys Asn Tyr Glu Asp Glu 140 145 150 Pro Asn Asn Tyr Arg Thr Met His Gly Arg Ala Val Asn Gly Ser 155 160 165 Gin Leu Gly Lys Asp Tyr Ile Gin Leu Lys Ser Leu Leu Gln Pro 170 175 180 Ile Arg Ile Tyr Ser Arg Ala Ser Leu Tyr Gly Pro Asn Ile Gly 185 190 195 Arg Pro Arg Lys Asn Val Ile Ala Leu Leu Asp Gly Phe Met Lys 200 205 210 Val Ala Gly Ser Thr val Asp Ala Val Thr Trp Gin His Cys Tyr 215 220 225 Ile Asp Gly Arg Val Val Lys Val Met Asp Phe Leu Lys Thr Arg 230 235 240 Leu Leu Asp Thr Leu Ser Asp Gin Ile Arg Lys Ile Gin Lys Val 245 250 255 Val Asn Thr Tyr Thr Pro Gly Lys Lys Ile Trp Leu Glu Gly Val 260 265 270 Val Thr Thr Ser Ala Gly Gly Thr Asn Asn Leu Ser Asp Ser Tyr 275 280 285 Ala Ala Gly Phe Leu Trp Leu Asn Thr Leu Gly Met Leu Ala Asn 290 295 300 Gin Gly Ile Asp Val Val Ile Arg His Ser Phe Phe Asp His Gly 305 310 315 Tyr Asn His Leu Val Asp Gin Asn Phe Asn Pro Leu Pro Asp Tyr 320 325 330 Trp Leu Ser Leu Leu Tyr Lys Arg Leu Ile Gly Pro Lys Val Leu 335 340 345 Ala Val His Val Ala Gly Leu Gin Arg Lys Pro Arg Pro Gly Arg 350 355 360 Val Ile Arg Asp Lys Leu Arg Ile Tyr Ala His Cys Thr Asn His 365 370 375 His Asn His Asn Tyr Val Arg Gly Ser Ile Thr Leu Phe Ile Ile 380 385 390 Asn Leu His Arg Ser Arg Lys Lys Ile Lys Leu Ala Gly Thr Leu 395 400 405 Arg Asp Lys Leu Val His Gln Tyr Leu Leu Gin Pro Tyr Gly Gin 410 415 420 Glu Gly Leu Lys Ser Lys Ser Val Gin Leu Asn Gly Gin Pro Leu 425 430 435 Val Met Val Asp Asp Gly Thr Leu Pro Glu Leu Lys Pro Arg Pro 440 445 450 Leu Arg Ala Gly Arg Thr Leu Val Ile Pro Pro Val Thr Met Gly 455 460 465 Phe Phe Val Val Lys Asn Val Asn Ala Leu Ala Cys Arg Tyr Arg 470 475 480 INFORMATION FOR SEQ ID NO:8: SEQUENCE CHARACTERISTICS: LENGTH: 351 TYPE: amino acid STRANDEDNESS: double (0D) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: WO 01/00643 PCTILOO/00358 GTTCGGCAGA GGATCATGTC TGATGTACAG AGACATTGTC CGGAGTGATG TTGCCTTGGA CAAGCAGAAA GGCTGTAAGA TTGGCCAGCA CCCTGATGTC 100 ATGCTGGAGC TCCAGAGAGA GAAGGCATCC AGACTGTCTG GTTCTTCTGA 150 AGGAGCAATA CTCCAATACT TACAGTAACC TCATATTAAC AGGTCTCTAG 200 ACAAACTTTA TAACTTTGCT GATTGCTCTG GACTCCACCT GATATTTGCT 250 CTAAATGCAC TGCGTCGTAA 'rCCCAATAAC TCCTGGAACA GTTCTAGTGC 300 CCTGAGCCTG TTGAAGTACA GTGCCAGCAA AAAGTACAAC ATTTCTTGGG 350 A 351 INFORMATION FOR SEQ ID NO:9: SEQUENCE CHARACTERISTICS: LENGTH: 543 TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro Pro Le 10 Leu Leu Giy Pro Leu Gly Pro Leu Ser Pro Gly Ala Le 25 Ala Gin Ala Gin Asp Val Val Asp Leu Asp Phe Phe Thi 40 4 Leu His Leu Val Ser Pro Ser Phe Leu Ser Val Thr 11 55 Leu Ala Thr Asp Pro Arg Phe Leu Ile Leu Leu Gly Se '70 75 Arg Thr Leu Ala Arg Gly Leu Ser Pro Ala Tyr Leu Ar 90 Thr Lys Thr Asp Phe Leu Ile Phe Asp Pro Lys Lys GI 100 105 Glu Giu Arg Ser Tyr Trp Gin Ser Gin Val Asn Gin As 115 120 12 ryr Gly Ser Ile Pro Pro Asp Val Giu Glu Lys Leu Ar 130 135 140 Pro Tyr Gin Giu Gin Leu Leu Leu Arg Giu His Tyr Gl 145 150 155 Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Le 165 170 Pkia Asn Cys Ser Gly Leu Asp Leu Ile Phe Gly Leu As 180 185 krg Thr Ala Asp Leu Gin Trp Asn Ser Ser Asn Ala Gi 195 200 20 k.sp Tyr Cys Ser Ser Lys Gly Tyr Asn Ile Ser Trp Gl 210 215 220 3lu Pro Asn Ser Phe Leu Lys Lys Ala Asp Ile Phe Ii 230 235 ;in Leu Gly Glu Asp Tyr Ile Gin Leu His Lys Leu Lei 245 250 rhr Phe Lys Asn Ala Lys Leu Tyr Gly Pro Asp Val Gi' 260 265 krg Lys Thr Ala Lys Met Leu Lys Ser Phe Leu Lys Al 275 280 28~ lal Ile Asp Ser Val Thr Trp His His Tyr Tyr Leu As, 290 295 300 kla Thr Arg Glu Asp Phe Leu Asn Pro Asp Val Leu Asj 305 310 315 Ser Val Gin Lys Val Phe Gin Val Val Giu Ser Th~ 325 330 ,ys Lys Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gi! 340 345 Iro Leu Leu Ser Asp Thr Phe Ala Ala Gly Phe Met Tr) 355 360 36! ,eu Gly Leu Ser Ala Arg Met Gly Ile Giu Val Val Met 370 375 380 'he Phe Gly Ala Gly Asn Tyr His Leu Val Asp Giu Asr 185 390 395 .eu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Let 405 410 .ys Val Leu Met Ala Ser Val Gin Gly Ser Lys Arg Arg 420 425 7ai Tyr Leu His Cys Thr Asn Thr Asp Asn Pro Arg Tyr 435 440 445 Lsp Leu Thr Leu Tyr Ala Ile Asn Leu His Asn Val Thr 450 455 460 Lrg Leu Pro Tyr Pro Phe Ser Asn Lys Gin Val Asp Lys 470 475 Lrg Pro Leu Gly Pro His Gly Leu Leu Ser Lys Ser Val 485 490 ,ly Leu Thr Leu Lys Met Val Asp Asp Gin Thr Leu Pro 500 505 u Met Leu Leu Pro Arg Pro *r Gin Glu Pro e Asp Ala Asn Pro Lys Leu g Phe Gly Gly u Ser Thr Phe 110 p Ile Cys Lys g Leu Glu Trp n Lys Lys Phe 160 Tyr Thr Phe 175 n Ala Leu Leu i190 n Leu Leu Leu Leu Gly Asn Asn Gly Ser 240 Arg Lys Ser 255 y' Gin Pro Arg 270 a Giy Gly Giu ni Gly Arg Thr ple Phe Ile 320 r Arg Pro Giy 335 y' Giy Gly Ala 350 3Leu Asp Lys Arg Gin Val Phe Asp Pro 400 'Val Gly Thr 415 1Lys Leu Arg 430 Lys Giu Gly Lys Tyr Leu Tyr Leu Leu 480 Gin Leu Asn 495 Pro Leu Met 510 WO 01/00643 PCT/IL0/00358 9 Glu Lys Pro Leu Arg Pro Gly Ser Ser Leu Gly Leu Pro Ala Phe Ser 515 520 525 Tyr Ser Phe Phe Val Ile Arg Asn Ala Lys Val Ala Ala Cys Ile 530 535 540 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 23 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID GGAGAGCAAG TCTGTGTTGA TTC 23 INFORMATION FOR SEQ ID NO:11: SEQUENCE CHARACTERISTICS: LENGTH: 22 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: CACTGGTAGC CATGAGTGTG AG 22 INFORMATION FOR SEQ ID NO:12: SEQUENCE CHARACTERISTICS: LENGTH: 22 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: TTGGTCATCC CTCCAGTCAC CA 22 INFORMATION FOR SEQ ID NO:13: SEQUENCE CHARACTERISTICS: LENGTH: 2 TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13 Asp Glu INFORMATION FOR SEQ ID NO:14: SEQUENCE CHARACTERISTICS: LENGTH: 23 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: CTTGCCTGTA GACAGAGCTG CAG 23 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 2396 TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID TTTCTAGTTG CTTTTAGCCA ATGTCGGATC AGGTTTTTCA AGCGACAAAG AGATACTGAG ATCCTGGGCA GAGGACATCC TAGCTCGGTC AGATTTGGGC 100 AGGCTCAAGT GACCAGTGTC TTAAGGCAGA AGGGAGTCGG GGTAGGGTCT 150 GGCTGAACCC TCAACCGGGG CTTTTAACTC AGGGTCTAGT CCTGGCGCCA 200 AATGGATGGG ACCTAGAAAA GGTGACAGAG TGCGCAGGAC ACCAGGAAGC 250 TGGTCCCACC CCTGCGCGGC TCCCGGGCGC TCCCTCCCCA GGCCTCCGAG 300 GATCTTGGAT TCTGGCCACC TCCGCACCCT TTGGATGGGT GTGGATGATT 350 TCAAAAGTGG ACGTGACCGC GGCGGAGGGG AAAGCCAGCA CGGAAATGAA 400 AGAGAGCGAG GAGGGGAGGG CGGGGAGGGG AGGGCGCTAG GGAGGGACTC 450 CCGGGAGGGG TGGGAGGGAT GGAGCGCTGT GGGAGGGTAC TGAGTCCTGG 500 CGCCAGAGGC GAAGCAGGAC CGGTTGCAGG GGGCTTGAGC CAGCGCGCCG 550 GCTGCCCCAG CTCTCCCGGC AGCGGGCGGT CCAGCCAGGT GGGATGCTGA 600 GGCTGCTGCT GCTGTGGCTC TGGGGGCCGC TCGGTGCCCT GGCCCAGGGC 650 GCCCCCGCGG GGACCGCGCC GACCGACGAC GTGGTAGACT TGGAGTTTTA 700 CACCAAGCGG CCGCTCCGAA GCGTGAGTCC CTCGTTCCTG TCCATCACCA 750 TCGACGCCAG CCTGGCCACC GACCCGCGCT TCCTCACCTT CCTGGGCTCT 800 CCAAGGCTCC GTGCTCTGGC TAGAGGCTTA TCTCCTGCAT ACTTGAGATT 850 TGGCGGCACA AAGACTGACT TCCTTATTTT TGATCCGGAC AAGGAACCGA 900 CTTCCGAAGA AAGAAGTTAC TGGAAATCTC AAGTCAACCA TGATATTTGC 950 AGGTCTGAGC CGGTCTCTGC TGCGGTGTTG AGGAAACTCC AGGTGGAATG 1000 GCCCTTCCAG GAGCTGTTGC TGCTCCGAGA GCAGTACCAA AAGGAGTTCA 1050 WO 01/00643 PCTILOOIOO358
AGAACAGCAC
AAGTGCTCGG
CCCAGACTTA
GCTCTTCCA.A
AGTTrcTGGA
AGACTTTGTG
CAAAACTCTA
CTGCTGAGGA
ATGGCATCAC
TGAGCTCTGA
AAGGTCACTA
GAGCTCAGCT
CTGGCTTTAT
GAAGTCGTGA
GGATGAAAAC
AGAAACTGGT
AGGAGCAAAC
ATATCAGGAA
CCAAGCACTT
TACCTTCTGA
ACTGAACGGT
TGACAGAAAA
TCCTATGGTT
AAAATAAAAG
TTCATAAAAC
GAGCTTCGGG
CTCTCTAAGA
CTACTCAAGA AGCTCAGTGG ACATGCTCTA GGTTAGACCT GATCTTTGGT CTAAATGCGT CGGTGGAACA GcTCCAACGC CCAGCTTCTC GGGTTATAAC ATcTCCTGGG AACTGGGCAA AGAAAGCTCA CATTCTCATC GATGGGTTGC GAGTTGCATA AACTTcTACA AAGGTCAGCT TGGTCCTGAC ATCGGTCAGC CTCGAGGGAA GTTTCCTGAA GGCTGGCGGA GAAGTGATCG TATTACTTGA ATGGACGCAT CGCTACCAAA TGCGCTGGAC ACTTTTATTC TCTCTGTGCA AAGAGATCAC ACCTGGCAAG AAGGTCTGGT TACGGTGGCG GTGCACCCTT GCTGTCCAAC GTGGCTGGAT AAATTGGGCC TGTCAGCCCA TGAGGCAGGT GTTCTTCGGA GCAGGCAACT TTTGAGCCTT TACCTGATTA CTGGCTCTCT AGGTCCCAGG GTGTTACTGT CAAGAGTGAA TCCGAGTGTA TCTCCACTGC ACTAACGTCT GGAGATCTAA CTCTGTATGT CCTGAACCTC GAAGGTACCG CCTCCGTTGT TCAGGAAACC AGCCTTCGGG GCCGGATGGA TTACTTTCCA CAAATTCTGA AGATGGTGGA TGAGCAGACC ACCTCTCCCC GCAGGAAGTG CACTAAGCCT TTTTTGTCAT AAGAAATGCC AAAATCGCTG GCATACGGTA CCCCTGAGAC AAAAGCCGAG AAP.ACCCTAG TTTAGGAGGC CACCTCCTTG AGGGTGGGGT ACACTTCAGT ATTACATTCA AGAATACTGC AGGTGGTGAC AGTTAATAGC
CAGTTTTGCC
TACTACGAAC
CTTGACTACT
TGAGCCCAAC
AGTTAGGAGA
TTCCA.AAATG
GACAGTTAAA
ACTCTCTTAC
GAAGATTTTC
AAAAATTCTG
TGGGAGAGAC
ACCTTTGCAG
GATGGGCATA
ACCACTTAGT
CTTCTGTTCA
AGGCCCAGAC
ATCACCCACG
CATAATGTCA
AGTGGATACG
AATCTGTCCA
CTGCCAGCTT
GCCTGCCTTT
CTTGTATATG
GGGGGTGTTA
CCGAGTTCCA
GTGTGGTGTT
ACTGTG
1100 1150 1200 1250 1300 1350 1400 1450 1500 1550 1600 1650 1700 1750 1800 1850 1900 1950 2000 2050 2100 2150 2200 2250 2300 2350 2396 INFORMATION FOR SEQ ID NO:16: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 22 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: GAGCAGCCAG GTGAGCCCAA GA 22 INFORMATION FOR SEQ ID NO-17: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 24 TYPE: nucleic acid STRANOEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: TCAGATGCAA GCAGCAACTT TGGC 24 INFORMATION FOR SEQ ID NO:18: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 21 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: CACCCTGATG TCATGCTGGA G 21 INFORMATION FOR SEQ ID NO:19: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 23 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: CATCTAGGAG AGCAATGACG TTC 23 INFORMATION FOR SEQ ID Wi SEQUENCE CHARACTERISTICS: LENGTH: 27 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID CCATCCTAAT ACGACTCACT ATAGGGC 27 INFORMATION FOR SEQ ID NO:21: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 23 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear WO 01/00643 PCT/ILOO/00358 I I (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: ACTCACTATA GGGCTCGAGC GGC 23 INFORMATION FOR SEQ ID NO:22: Ci) SEQUENCE CHARACTERISTICS: LENGTH: TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: TTTTTTTTTT TTTTT INFORMATION FOR SEQ ID NO:23 i) SEQUENCE CHARACTERISTICS: LENGTH: 560 TYPE: nucleic acid STRANDEDNESS: double CD) TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23 GGCACGAGGC TAGTGGAGAG ACTGACAAGC AGTCAGCTCA GCGGTCACAA TACTGTGTGA CAGGAGCTGA GATCCAAGAA GTACTGGGTC CTGTGGGAGC 100 ACCCCTGACT TGAAGGACAA GTCAGTGCAA CTGAATGGCC AGCCCTTAGT 150 GATGGTGGAC GACGGGACCC TCCCAGAATT GAAGCCCCGC CCCCTTCGGG 200 CCGGCCGGAC ATTGGTCATC CCTCCAGTCA CCATGGGCTT TTTTGTGGTC 250 AAGAATGTCA ATGCTTTGGC CTGCCGCTAC CGATAAGCTA TCCTCACACT 300 CATGGCTACC AGTGGGCCTG CTGGGCTGCT TCCACTCCTC CACTCCAGTA 350 GTATCCTCTG TTTTCAGACA TCCTAGCAAC CAGCCCCTGC TGCCCCATCC 400 TGCTGGAATC AACACAGACT TGCTCTCCAA AGAGACTAAA TGTCATAGCG 450 TGATCTTAGC CTAGGTAGGC CACATCCATC CCAAAGGAAA ATGTAGACAT 500 CACCTGTACC TATATAAGGA TAAAGGCATG TGTATAGAGC AAAAAAAAAA 550 AAAAAAAAAA 560 INFORMATION FOR SEQ ID NO:24: SEQUENCE CHARACTERISTICS: CA) LENGTH: 1721 TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: CTAGAGCTTT CGACTCTCCG CTGCGCGGCA GCTGGCGGGG GGAGCAGCCA GGTGAGCCCA AGATGCTGCT GCGCTCGAAG CCTGCGCTGC CGCCGCCGCT GATGCTGCTG CTCCTGGGGC CGCTGGGTCC CCTCTCCCCT GGCGCCCTGC CCCGACCTGC GCAAGCACAG GACGTCGTGG ACCTGGACTT cTTCACCCAG GAGCCGCTGC ACCTGGTGAG CCCCTCGTTC CTGTCCGTCA CCATTGACGC CAACCTGGCC ACGGACCCGC GGTTCCTCAT CCTCCTGGGT TCTCCAAAGC TTCGTACCTT GGCCAGAGGC TTGTCTCCTG CGTACCTGAG GTTTGGTGGC ACCAAGACAG ACTTCCTAAT TTTCGATCCC AAGAAGGAAT CAACCTTTGA AGAGAGAAGT TACTGGCAAT CTCAAGTCAA CCAGGATATT TGCAAATATG GATCCATCCC TCCTGATGTG GAGGAGAAGT TACGGTTGGA ATGGCCCTAC CAGGAGCAAT TGCTACTCCG AGAACACTAC CAGAAAAAGT TCAAGAACAG CACCTACTCA AGAAGCTCTG TAGATGTGCT ATACACTTTT GCAAACTGCT CAGGACTGGA CTTGATCTTT GGCCTAAATG CGTTATTAAG AACAGCAGAT TTGCAGTGGA ACAGTTCTAA TGCTCAGTTG CTCCTGGACT ACTGCTCTTC CAAGGGGTAT AACATTTCTT GGGAACTAGG CAATGAACCT AACAGTTTCC TTAAGAAGGC TGATATTTTC ATCAATGGGT CGCAGTTAGG AGAAGATTAT ATTCAATTGC ATAAACTTCT AAGAAAGTCC ACCTTCAAAA ATGCAAAACT CTATGGTCCT GATGTTGGTC AGCCTCGAAG AAAGACGGCT AAGATGCTGA AGAGCTTCCT GAAGGCTGGT GGAGAAGTGA TTGATTCAGT TACATGGCAT CACTACTATT TGAATGGACG GACTGCTACC AGGGAAGATT TTCTAAACCC TGATGTATTG GACATTTTTA TTTCATCTGT GCAAAAAGTT TTCCAGGTGG TTGAGAGCAC CAGGCCTGGC AAGAAGGTCT GGTTAGGAGA AACAAGCTCT GCATATGGAG GCGGAGCGCC CTTGCTATCC GACACCTTTG CAGCTGGCTT TATGTGGCTG GATAAATTGG GCCTGTCAGC CCGAATGGGA ATAGAAGTGG 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1721 TGATGAGGCA AGTATTCTTT CTTTACCTGA TTATTGGCTA TGGCAAGCGT GCAAGGTTCA CTGACAATCC AAGGTATAAA TCACCAAGTA CTTGCGGTTA TAAGACCTTT GGGACCTCAT TAAAGATGGT GGATGATCAA GTTCACTGGG CTTGCCAGCT CTGCTTGCAT CTGAAAATAA
GGAGCAGGAA
TCTCTTCTGT
AAGAGAAGGA
GAAGGAGATT
CCCTATCCTT
GGATTACTTT
ACCTTGCCAC
TTCTCATATA
AATATACTAG
ACTACCATTT AGTGGATGAA AACTTCGATC TCAAGAAATT GGTGGGCACC AAGGTGTTAA AGCTTCGAGT ATACCTTCAT TGCACAAACA TAACTCTGTA TGCCATAAAC CTCCATAACG TTTCTAACAA GCAAGTGGAT AAATACCTTC CCAAATCTGT CCAACTCAAT GGTCTAACTC CTTTAATGGA AAAACCTCTC CGGCCAGGAA GTTTTTTTGT GATAAGAAAT GCCAAAGTTG TCCTGACACT G INFORMATION FOR SEQ ID Ci) SEQUENCE CHARACTERISTICS: LENGTH: TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: CTTACTTGTC ATCGTCGTCC TTGTAGTCTC GGTAGCGGCA GGCCA

Claims (7)

1. An isolated nucleic acid comprising a polynucleotide hybridizable with any one of SEQ ID NOs:1, 4 and 6 at 68 'C in 6 x SSC, 1 SDS, 5 x Denharts, 10 dextran sulfate, 100 jig/ml salmon sperm DNA, and 3 2 p labeled probe and wash at 68 "C with 3 x SSC and 0.1 SDS, said polynucleotide comprises a sequence encoding a polypeptide which comprises a conserved glycosyl hydrolase domain.
2. An isolated nucleic acid comprising a polynucleotide at least 80 identical with any one of SEQ ID NOs:1, 4 and 6 as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the University of Wisconsin (gap creation penalty 50, gap extension penalty said polynucleotide comprises a sequence encoding a polypeptide which comprises a conserved glycosyl hydrolase domain.
3. The isolated nucleic acid of claim 2, wherein said polynucleotide is as set forth in any one of SEQ ID NOs: 1, 4 and 6.
4. An isolated nucleic acid comprising a polynucleotide encoding a polypeptide being at least 80 homologous with any one of SEQ ID NOs:3, 5 and 7 as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the University of Wisconsin (gap creation penalty 50, gap extension penalty wherein said polypeptide comprises a conserved glycosyl hydrolase domain. *.25 5. A recombinant protein comprising a polypeptide encoded by the polynucleotide of claim 1. *00
6. A recombinant protein comprising a polypeptide encoded by the 0* polynucleotide of claim 2.
7. A recombinant protein comprising a polypeptide encoded by the polynucleotide of claim 3. S
44.4 4*4* 44 4 4* 4 4 4* 4*44 *4*4 44 8. A recombinant protein comprising a polypeptide encoded by the polynucleotide of claim 4. 9. A recombinant protein comprising a polypeptide at least 80 homologous with any one of SEQ ID NOs:3, 5 and 7 as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the University of Wisconsin (gap creation penalty 50, gap extension penalty wherein said polypeptide comprises a conserved glycosyl hydrolase domain. 10. The recombinant protein of claim 9, wherein said polypeptide is as set fourth in any one of SEQ ID NOs:3, 5 and 7. 11. A nucleic acid construct comprising the isolated nucleic acid of claim 1. 12. A nucleic acid construct comprising the isolated nucleic acid of claim 2. 13. A nucleic acid construct comprising the isolated nucleic acid of claim 3. 14. A nucleic acid construct comprising the isolated nucleic acid of claim 4. A host cell comprising the nucleic acid construct of claim 11. 16. A host cell comprising the nucleic acid construct of claim 12. 17. A host cell comprising the nucleic acid construct of claim 13. 18. A host cell comprising the nucleic acid construct of claim 14. 19. An antisense oligonucleotide comprising a polynucleotide or a polynucleotide analog of at least 20 bases being hybridizable in vivo, under physiological conditions, with: a portion of a polynucleotide strand encoding a polypeptide at least 80 homologous with any one of SEQ ID NOs:3, 5 and 7 as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the University of Wisconsin (gap creation penalty 50, gap extension penalty wherein said polypeptide comprises a conserved glycosyl hydrolase domain; or (ii) a portion of a polynucleotide strand at least 80 identical with any one of SEQ ID NOs:1, 4 and 6 as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the University of Wisconsin (gap creation penalty 50, gap extension penalty said polynucleotide comprises a sequence encoding a polypeptide which comprises a conserved glycosyl hydrolase domain. A ribozyme comprising the antisense oligonucleotide of claim 19 and a ribozyme sequence. 21. An antisense nucleic acid construct comprising a promoter sequence and a polynucleotide sequence directing the synthesis of an antisense RNA sequence of at least bases being hybridizable in vivo, under physiological conditions, with: a portion of a polynucleotide strand encoding a polypeptide at least 80 homologous with any one of SEQ ID NOs:3, 5 and 7 as determined using the Bestfit procedure of the DNA sequence analysis software package developed by the Genetic Computer Group (GCG) at the University of Wisconsin (gap creation penalty 50, gap extension penalty wherein said polypeptide comprises a conserved glycosyl hydrolase domain; or (ii) a portion of a polynucleotide strand at least 80 identical with any one of SEQ ID NOs:l, 4 and 6 as determined using the Bestfit procedure of the DNA 25 sequence analysis software package developed by the Genetic Computer Group (GCG) at the University of Wisconsin (gap creation penalty 50, gap extension penalty 3),_said polynucleotide comprises a sequence encoding a polypeptide which comprises a conserved glycosyl hydrolase domain. 22. The isolated nucleic acid of claim 4, wherein said polypeptide is as set forth in any one of SEQ ID NOs: 3, 5 and 7. DATED this 24 h day of August 2004 InSight Biopharmaceuticals Ltd. By their Patent Attorneys CULLEN CO.
AU52448/00A 1999-06-25 2000-06-19 Polynucleotides and polypeptides encoded thereby distantly homologous to heparanase Ceased AU777343B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US14080199P 1999-06-25 1999-06-25
US60/140801 1999-06-25
PCT/IL2000/000358 WO2001000643A2 (en) 1999-06-25 2000-06-19 Polynucleotides and polypeptides encoded thereby distantly homologous to heparanase

Publications (2)

Publication Number Publication Date
AU5244800A AU5244800A (en) 2001-01-31
AU777343B2 true AU777343B2 (en) 2004-10-14

Family

ID=22492846

Family Applications (1)

Application Number Title Priority Date Filing Date
AU52448/00A Ceased AU777343B2 (en) 1999-06-25 2000-06-19 Polynucleotides and polypeptides encoded thereby distantly homologous to heparanase

Country Status (12)

Country Link
EP (1) EP1212341A4 (en)
JP (1) JP2003503070A (en)
KR (1) KR20020028906A (en)
CN (1) CN1370178A (en)
AU (1) AU777343B2 (en)
CA (1) CA2377498A1 (en)
HU (1) HUP0300901A2 (en)
IL (1) IL146527A0 (en)
MX (1) MXPA01011708A (en)
NO (1) NO20015526L (en)
PL (1) PL362599A1 (en)
WO (1) WO2001000643A2 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1157118A4 (en) * 1999-03-01 2002-07-17 Insight Strategy & Marketing Polynucleotide encoding a polypeptide having heparanase activity and expression of same in genetically modified cells
EP1214423A1 (en) * 1999-09-23 2002-06-19 MERCK PATENT GmbH Heparanase-2, a member of the heparanase protein family
CA2393855A1 (en) * 1999-12-22 2001-06-28 Oxford Glycosciences (Uk) Ltd. Homologues of human heparanase and splice variants thereof
KR20020062375A (en) * 1999-12-23 2002-07-25 쉐링 악티엔게젤샤프트 Human Heparanase-Related Polypeptide and Nucleic Acid
GB0014447D0 (en) * 2000-06-13 2000-08-09 Smithkline Beecham Plc Novel compounds
AU2001287610A1 (en) * 2000-07-12 2002-01-21 Vlaams Interuniversitair Instituut Voor Biotechnologie Vzw A second human heparanase, and splice variants thereof, with a predominant expression in skeletal muscle, heart and pancreas
IL162276A0 (en) * 2004-06-01 2005-11-20 Hadasit Med Res Service Nucleic acid molecules as heparanase potent inhibitors, compositions and methods of use thereof
FI20085308A0 (en) * 2008-04-11 2008-04-11 Polysackaridforskning I Uppsal Heparanase-free non-human mammals
JP5622451B2 (en) * 2010-06-21 2014-11-12 小川香料株式会社 Tea extract
JP6321506B2 (en) * 2014-09-22 2018-05-09 株式会社 資生堂 Whitening method with heparanase inhibitor and method for evaluating substances having whitening effect
CN116064544A (en) * 2016-01-26 2023-05-05 日产化学株式会社 Single-stranded oligonucleotides

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999021975A1 (en) * 1997-10-28 1999-05-06 The Australian National University Isolated nucleic acid molecule encoding mammalian endoglucuronidase and uses therefor

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6177545B1 (en) * 1997-09-02 2001-01-23 Insight Strategy & Marketing Ltd. Heparanase specific molecular probes and their use in research and medical applications

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999021975A1 (en) * 1997-10-28 1999-05-06 The Australian National University Isolated nucleic acid molecule encoding mammalian endoglucuronidase and uses therefor

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
EMBL ACCESSION NO A1 222323 *

Also Published As

Publication number Publication date
HUP0300901A2 (en) 2003-07-28
KR20020028906A (en) 2002-04-17
CA2377498A1 (en) 2001-01-04
IL146527A0 (en) 2002-07-25
MXPA01011708A (en) 2003-09-10
NO20015526L (en) 2001-12-18
NO20015526D0 (en) 2001-11-12
EP1212341A4 (en) 2002-11-27
WO2001000643A3 (en) 2005-02-24
WO2001000643A2 (en) 2001-01-04
CN1370178A (en) 2002-09-18
EP1212341A1 (en) 2002-06-12
AU5244800A (en) 2001-01-31
JP2003503070A (en) 2003-01-28
PL362599A1 (en) 2004-11-02

Similar Documents

Publication Publication Date Title
US20090275106A1 (en) Polynucleotide encoding A polypeptide having heparanase activity and expression of same in genetically modified cells
Vlodavsky et al. Mammalian heparanase: gene cloning, expression and function in tumor progression and metastasis
JP4187408B2 (en) Polynucleotide encoding a polypeptide having heparanase activity and expression of the polypeptide in transfected cells
US6348344B1 (en) Genetically modified cells and methods for expressing recombinant heparanase and methods of purifying same
AU777343B2 (en) Polynucleotides and polypeptides encoded thereby distantly homologous to heparanase
US20020194625A1 (en) Transgenic animals expressing heparanase and its uses
US6699672B1 (en) Heparanase specific molecular probes and their use research and medical applications
US7101706B1 (en) Polynucleotides and polypeptides encoded thereby distantly homologous to heparanase
IL146527A (en) Polynucleotides and polypeptides encoded thereby distantly homologous to heparanase
US6677137B2 (en) Avian and reptile derived polynucleotide encoding a polypeptide having heparanase activity
KR100549408B1 (en) Polynucleotide encoding a polypeptide having heparanase activity and expression of same in transduced cells
AU772311B2 (en) Polynucleotide encoding a polypeptide having heparanase activity and expression of same in transduced cells
CZ2000754A3 (en) Polynucleotide encoding polypeptide exhibiting heparanase activity and its expression in transformed cells
MXPA00002106A (en) Polynucleotide encoding a polypeptide having heparanase activity and expression of same in transduced cells

Legal Events

Date Code Title Description
TC Change of applicant's name (sec. 104)

Owner name: INSIGHT BIOPHARMACEUTICALS LTD.

Free format text: FORMER NAME: INSIGHT STRATEGY AND MARKETING LTD.