AU2005205754B2 - Secreted and transmembrane polypeptides and nucleic acids encoding the same - Google Patents
Secreted and transmembrane polypeptides and nucleic acids encoding the same Download PDFInfo
- Publication number
- AU2005205754B2 AU2005205754B2 AU2005205754A AU2005205754A AU2005205754B2 AU 2005205754 B2 AU2005205754 B2 AU 2005205754B2 AU 2005205754 A AU2005205754 A AU 2005205754A AU 2005205754 A AU2005205754 A AU 2005205754A AU 2005205754 B2 AU2005205754 B2 AU 2005205754B2
- Authority
- AU
- Australia
- Prior art keywords
- sequence identity
- polypeptide
- sequence
- nucleic acid
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Landscapes
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Description
o SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC ACIDS ENCODING THE
SAME
FIELD OF THE INVENTION SThe present invention relates generally to the identification and isolation of novel DNA and to the recombinant production of novel polypeptides.
BACKGROUND OF THE INVENTION tt All references, including any patents or patent applications, cited in this specification are hereby Sincorporated by reference. No admission is made that any reference constitutes prior art. The discussion of the references states what their authors assert, and the applicants reserve the right to challenge the accuracy and *n pertinency of the cited documents. It will be clearly understood that, although a number of prior art publications are S referred to herein, this reference does not constitute an admission that any of these documents forms part of the common general knowledge in the art, in Australia or in any other country.
Extracellular proteins play important roles in, among other things, the formation, differentiation and maintenance of multicellular organisms. The fate of many individual cells, proliferation, migration, differentiation, or interaction with other cells, is typically governed by information received from other cells and/or the immediate environment. This information is often transmitted by secreted polypeptides (for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins. These secreted polypeptides or signaling molecules normally pass through the cellular secretory pathway to reach their site of action in the extracellular environment.
Secreted proteins have various industrial applications, including as pharmaceuticals, diagnostics, biosensors and bioreactors. Most protein drugs available at present, such as thrombolytic agents, interferons, interleukins, erythropoietins, colony stimulating factors, and various other cytokines, are secretory proteins. Their receptors, which are membrane proteins, also have potential as therapeutic or diagnostic agents. Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. Examples of screening methods and techniques are described in the literature [see, for example, Klein et al., Proc. Natl. Acad. Sci. 93:7108-7113 (1996); U.S. Patent No. 5,536,637)].
Membrane-bound proteins and receptors can play important roles in, among other things, the formation, differentiation and maintenance of multicellular organisms. The fate of many individual cells, proliferation, migration, differentiation, or interaction with other cells, is typically governed by information received from other cells and/or the immediate environment. This information is often transmitted by secreted polypeptides (for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins. Such membrane-bound proteins and cell receptors include, but are not limited to, cytokine receptors, receptor kinases, receptor phosphatases, receptors involved in cell-cell interactions, and cellular adhesin molecules like selectins and integrins. For instance, transduction of signals that regulate cell growth and differentiation is regulated in part by phosphorylation of various cellular proteins. Protein tyrosine kinases, enzymes that catalyze that process, can also act as growth factor receptors. Examples include fibroblast growth factor receptor and nerve growth factor receptor.
-la- Membrane-bound proteins and receptor molecules have various industrial applications, including as O pharmaceutical and diagnostic agents. Receptor immunoadhesins, for instance, can be employed as therapeutic agents to block receptor-ligand interactions. The membrane-bound proteins can also be employed for screening Sof potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction.
Efforts are being undertaken by both industry and academia to identify new, native receptor or Ce 5 membrane-bound proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel receptor or membrane-bound proteins.
S1. PR01484 Adipose differentiation is accompanied by changes in cellular morphology, a dramatic accumulation of S 10 intracellular lipid and activation of a specific program of gene expression (Liang et al., J. Biol. Chem.
O 271:10697-10703 (1996)). Adipose complement-related protein is a protein whose expression is highly induced Ci during adipocyte differentiation and which shares significant homology with subunits of complement factor C lq, collagen alpha l(x) and the brain-specific facto cerebellin (Scherer et al., J. Biol. Chem. 270:26746-26749 (1995)). While the function of adipocyte complement-related protein is presently unknown, the tissue-specific expression thereof suggests that this protein functions as a novel signaling molecule for adipose tissue. As such, there is significant interest in identifying and characterizing novel polypeptides having homology to adipocyte complement-related protein. We herein describe the identification and characterization of novel polypeptides having homology to adipocyte complement-related protein, designated herein as PRO 1484 polypeptides.
2. PR04334 Plasma cell membrane glycoprotein PC-I is of interest. The cloning of PC-1 is described in the art, see Buckley, et al., J. Biol. Chem., 265(29):17506-11 (1990) and W09519570-A. W09519570-A describes the human insulin receptor tyrosine kinase inhibitor PC-1. It is reported that PC-I is useful in the diagnosis and treatment of diseases involving inappropriate insulin receptor tyrosine kinase inhibitor expression such as non-insulin dependent mellitus. Thus, proteins having homology to PC-1 are of interest.
3. PRO1122 It has been reported that the cytokine interleukin 17 (IL-17) stimulates epithelial, endothelial, and fibroblastic cells to secrete cytokines such as IL-6, IL-8, and granulocyte-colony-stimulating factor, as well as prostaglandin E2. Moreover, it has been shown that when cultured in the presence of IL-17, fibroblasts could sustain proliferation of CD34+ preferential maturation into neutrophils. Thus it has been suggested that IL-17 constitutes an early initiator of the T cell-dependent inflammatory reaction and/or an element of the cytokine network that bridges the immune system to hematopoiesis. See, Yao, et al., J. Immunol., 155(12):5483-5486 (1995); Fossiez, et al., J. Exp. Med., 183(6):2593-2603 (1996); Kennedy, et al., J. Interferon Cvtokine Res., 16(8):611-617 (1996). Thus, proteins related to IL-17, including CTLA-8, which has been mistaken for IL-17 (see Kennedy, supra) are of interest.
n 4. PR01889 O E48 antigen protein is a cysteine-rich GPI-anchored membrane protein that belongs to the LY-6 family of human proteins (see, WO 96/35808). The E48 antigen serves as a marker for squamous cells, exhibits biological activity of cell-cell/cell-matrix adhesion and is a target for antibody-based immunotherapy. The amino acid sequence of the E48 antigen protein has previously been deduced from a cDNA clone which was obtained from a squamous cell carcinoma of the head and neck. As such, the E48 antigen serves as a potential target for the treatment of squamous cell cancer.
We herein describe the identification and characterization of novel polypeptides having homology to E48 antigen protein, designated herein as PRO1889 polypeptides.
l 10 5. PR01890 0 The recognition of carbohydrates by lectins has been found to play an important role in various aspects 0 of eukaryotic physiology. A number of different animal and plant lectin families exist, but it is the calcium dependent, or type C, lectins that have recently garnered the most attention. For example, the recognition of carbohydrate residues on either endothelial cells or leukocytes by the selectin family of calcium dependent lectins has been found to be of profound importance to the trafficking of leukocytes to inflammatory sites. Lasky, L., Ann, Rev. Biochem., 4 113-139 (1995). The biophysical analysis of these adhesive interactions has suggested that lectin-carbohydrate binding evolved in this case to allow for the adhesion between leukocytes and the endothelium under the high shear conditions of the vasculature. Thus, the rapid on rates of carbohydrate recognition by such lectins allows for a hasty acquisition of ligand, a necessity under the high shear of the vascular flow. The physiological use of type C lectins in this case is also supported by the relatively low affinities of these interactions, a requirement for the leukocyte rolling phenomenon that has been observed to occur at sites of acute inflammation. The crystal structures of the mannose binding protein (Weis et al., Science 254, 1608- 1615 [1991]; Weis et al., Nature 360 127-134 [1992]) and E-selectin (Graves et al., Nature 367(6463), 532-538 [1994]), together with various mutagenesis analyses (Erbe et al., J. Cell. Biol. 119(1), 215-227 [1992]; Drickamer, Nature 360, 183-186 [1992]; lobst et al., J. Biol. Chem. 169(22), 15505-15511 [1994]; Kogan et at., J. Biol. Chem, 270(23), 14047-14055 [1995]), is consistent with the supposition that the type C lectins are, in general, involved with the rapid recognition of clustered carbohydrates. Together, these data suggest that type C lectins perform a number of critical physiological phenomena through the rapid, relatively low affinity recognition of carbohydrates.
Given the obvious importance of the lectin proteins in numerous biological processes, efforts are currently being made to identify novel lectin proteins or proteins having sequence homology to lectin proteins.
We herein describe the identification and characterization of novel polypeptides having homology to a lectin protein, designated herein as PRO 1890 polypeptides.
6. PR01887 Enzymatic proteins play important roles in the chemical reactions involved in the digestion of foods, the biosynthesis of macromolecules, the controlled release and utilization of chemical energy, and other processes 1 necessary to sustain life. Enzymes have also been shown to play important roles in combating various diseases O and disorders. For example, liver carboxylesterases have been reported to assist in sensitizing human tumor cells to the cancer prodrugs. Danks et al., report that stable expression of the cDNA encoding a carboxylesterase in human rhabdomyosarcoma cells increased the sensitivity of the cells to the CPT-11 cancer prodrug 8.1-fold. Cancer Res. (1998) 58(1):20-22. The authors propose that this prodiug/enzyme combination could Cn 5 be exploited therapeutically in a manner analogous to approaches currently under investigation with the combinations of ganciclovir/herpes simplex virus thymidine kinase and 5-fluorocytosine/cytosine deaminase.
7" van Pelt et al. demonstrated that a 55 kD human liver carboxylesterase inhibits the invasion of Plasmodium 1.7 falciparum malaria sporozoites into primary human hepatocytes in culture. J Hepatol (1997) 27(4):688-698.
O Carboxylesterases have also been found to be of importance in the detoxification of drugs, pesticides Ci 10 and other xenobiotics. Purified human liver carboxylesterases have been shown to be involved in the metabolism O of various drugs including cocaine and heroin. Prindel et al. describe the purification and cloning of a broad
O
cI substrate specificity human liver carboxylesterase which catalyzes the hydrolysis of cocaine and heroin and which may play an important role in the degradation of these drugs in human tissues. J. Biol. Chem. (1997) 6:272(23):14769-14775. Brzenzinski et al. describe a spectrophotometric competitive inhibition assay used to identify drug or environmental esters that are metabolized by carboxylesterases. Drug Metab Dispos (1997) 25(9):1089-1096.
As additional background information on carboxylesterases, Kroetz et al. (Biochemistry, (1993) 32(43): 11606-17) reported the cDNA cloning and characterization of human liver carboxylesterases. Aida et al.
(Biochim Biophys Acta (1993) 1174(1):72-4) reported the cDNA cloning and characterization of a male-predominant carboxylesterase in mouse liver carboxylesterases.
In light of the important physiological roles played by carboxylesterases, efforts are being undertaken by both industry and academia to identify new, native carboxylesterase homologs. We herein describe the identification and characterization of a novel polypeptide having homology to carboxylesterase.
7. PR01785 Antioxidant enzymes are thought to play a crucial role in the survival of the parasite, Schistosoma mansoni, during its migration through the tissues of a definitive host. Recently, one such enzyme, glutathione peroxidase was cloned. Roche, et al., Gene, 138:149-152 (1994), accession number GSHC SCHMA.
Glutathione perxodiases are further described in FR2689906-A. Thus, glutathione peroxidases, and the nucleic acids which encode them are useful as diagnostic reagents, vaccines and in assays to find modulators of antioxidant enzymes.
8. PR04353 Semaphorins comprise a large family of proteins implicated in axonal guidance during development.
Semaphorin Y may be used to inhibit peripheral nerve growth. Semaphorin Z is useful as a central nerve extension inhibitor. Semaphorin Z inhibitors can be used as promoters of central nerve regeneration. Thus semaphorins and regulators of semaphorins are of great interest. Kikuchi, et al., Brain Res Mol Brain Res., T 51(1-2):229-37 (1997); Shoji, et al., Development, 125(7):1275-83 (1998).
O
9. PR04357 SEfforts are being undertaken by both industry and academia to identify new, native secreted proteins, Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the C 5 coding sequences for novel secreted proteins. We herein describe the identification and characterization of novel secreted polypeptides, designated herein as PR04357 polypeptides.
PR04405 SEfforts are being undertaken by both industry and academia to identify new, native transmembrane Ci 10 receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to O identify the coding sequences for novel transmembrane receptor proteins. We herein describe the identification 0 C, .and characterization of novel transmembrane polypeptides, designated herein as PR04405 polypeptides.
11. PR04356 Glycosylphosphatidylinositol (GPI) anchored proteoglycans are generally localized to the cell surface and are thus known to be involved in the regulation of responses of cells to numerous growth factors, cell adhesion molecules and extracellular matrix components. The metastasis-associated GPI-ahchored protein (MAGPIAP) is one of these cell surface proteins which appears to be involved in metastasis. Metastasis is the form of cancer wherein the transformed or malignant cells are traveling and spreading the cancer from one site to another. Therefore, identifying the polypeptides related to metastasis and MAGPIAP is of interest.
12. PR04352 Cadherins are a large family of transmembrane proteins. Cadherins comprise a family of calciumdependent glycoproteins that function in mediating cell-cell adhesion in virtually all solid tissues of multicellular organisms. At least cadherins 1-13 as well as types B, E, EP, M, N, P and R have been characterized. Among the functions cadherins are known for, with some exceptions, cadherins participate in cell aggregation and are associated with cell-cell adhesion sites. Recently, it has been reported that while all cadherins share multiple repeats of a cadherin specific motif believed to correspond to folding of extracellular domains, members of the cadherin superfamily have divergent structures and, possibly, functions. In particular it has been reported that members of the cadherin superfamily are involved in signal transduction. See, Suzuki, J. Cell Biochem., 61(4):531-542 (1996). Cadherins are further described in Tanihara, et al., J. Cell Sci., 107(6):1697-1704 (1994), Aberle, et al., J. Cell Biochem., 61(4):514-523 (1996), Obata, etal., Cell Adhes. Commun., 6(4):323- 33 (1998) and Tanihara, et al., Cell Adhes. Commun., 2(1):15-26 (1994).
Protocadherins are members of the cadherin superfamily which are highly expressed in the brain. In some studies, protocadherins have shown cell adhesion activity. See, Sano, et al., EMBO 12(6):2249-2256 (1993). However, studies have also shown that some protocadherins, such as protocadherin 3 (also referred to as Pcdh3 or pc3), do not show strong calcium dependent cell aggregation activity. See, Sago, et al., Genomics, S29(3):631-640 (1995) for this study and further characteristics of Pcdh3. Molecules related to pc3 are thus of Sgreat interest. Also of great interest is the subtype of desmosomal cadherin described in Koch, et al., Differentiation, 47(1):29-36 (1991).
SOf particular interest are proteins having a sequence with homology to that described in Amagai, et al., Cell, 67(5):869-77 (1991). This study describes antibodies against a novel epithelial cadherin in pemphigus C 5 vulgaris, a disease of cell adhesion. Also of interest are full-length cadherins. Additionally, proteins having homology to the fat tumor suppressor gene which are novel cadherins are of interest.
13. PRO4380 SEfforts are being undertaken by both industry and academia to identify new, native secreted proteins.
C 10 Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of novel Ci secreted polypeptides, designated herein as PR04380 polypeptides.
14. PR04354 Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of novel secreted polypeptides, designated herein as PR04354 polypeptides.
15. PRO4408 Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterizationof novel secreted polypeptides, designated herein as PR04408 polypeptides.
16. PR05737 Interleukin-1 refers to two proteins (IL-la and IL-lp) which play a key role early in the inflammatory response (for a review, see Dinarello, Blood, 87: 2095-2147 (1996) and references therein). Both proteins are made as intracellular precursor proteins which are cleaved upon secretion to yield mature carboxy-terminal 17 kDa fragments which are biologically active. In the case of IL-Ip, this cleavage involves an intracellular cysteine protease, known as ICE, which is required to release the active fragment from the inactive precursor.
The precursor of IL-la is active.
These two proteins act by binding to cell surface receptors found on almost all cell types and triggering a range of responses either alone or in concert with other secreted factors. These range from effects on proliferation fibroblasts, T cells) apoptosis A375 melanoma cells), cytokine induction of TNF, IL-I, IL-8), receptor activation E-selectin), eicosanoid production PGE2) and the secretion of degradative enzymes collagenase). To achieve these effects, IL-I activates transcription factors such as o NF-KB and AP-1. Several of the activities of IL-I action on target cells are believed to be mediated through Oactivation of kinase cascades that have also been associated with cellular stresses, such as the stress activated n MAP kinase JNK/SAPK and p38.
SA third member of the IL-1 family was subsequently discovered which acts as a natural antagonist of IL-la and IL-lp by binding to the IL-I receptor but not transducing an intracellular signal or a biological response. The protein is called IL-IRa (for IL-1 receptor antagonist) or IRAP (for IL-I receptor antagonist protein). At least three alternatively spliced forms of IL-IRa exist: one encodes secreted protein, and the other two encode intracellular proteins. IL-la, IL-lp and IL-IRa exhibit approximately 25-30% sequence identity with each other and share a similar three dimensional structure consisting of twelve P-strands folded into a O P-barrel, with an internal thrice repeated structural motif.
There are three known IL-1 receptor subunits. The active receptor complex consists of the type I Sreceptor and IL-1 accessory protein (IL-IRAcP). The type I receptor is responsible for binding of the IL-la, Cl IL-lp and IL-IRa ligands, and is able to do so in the absence of the IL-iRAcP. However, signal transduction requires the interaction of IL-la or IL-1p with the IL-IRAcP. IL-IRa does not interact with the IL-lRAcP and hence cannot induce signal transduction. A third receptor subunit, the type II receptor, binds IL-la and IL-lp but cannot transduce signal due its lack of an intracellular domain. Instead, the type [I receptor either acts as a decoy in its membrane bound form or as an IL-I antagonist in a processed, secreted form, and hence inhibits IL-I activity. The type II receptor weakly binds to IL-IRa.
Many studies using IL-IRa, soluble IL-1R derived from the extracellular domain of the type I IL-I receptor, antibodies to IL-1 a or IL-10, and transgenic knockout mice for these genes have shown that IL-1 plays a role in a number of pathophysiologies (for a review, see Dinarello, Blood, 87: 2095-2147 (1996)). For example, IL-1Ra has been shown to be effective in animal models of septic shock, rheumatoid arthritis, graft-versus-host disease (GVHD), stroke, cardiac ischemia, psoriasis, inflammatory bowel disease, and asthma.
In addition, IL-lRa has demonstrated efficacy in clinical trials for rheumatoid arthritis and GVHD, and is also in clinical trials for inflammatory bowel disease, asthma and psoriasis.
More recently, interleukin-18 (IL-18) was placed in the IL-1 family (for a review, see Dinarello et al, J. Leukocyte Biol., 63: 658-664 (1998)). IL-18 shares the p-pleated, barrel-like form of IL-la and IL-1p. In addition, IL-18 is the natural ligand for the IL-1 receptor family member formerly known as IL-lR-related protein (IL-lRrp) (now known as the IL-18 receptor (IL-18R)). IL-18 has been shown to initiate the inflammatory cytokine cascade in a mixed population of peripheral blood mononuclear cells (PBMCs) by triggering the constitutive IL-18 receptors on lymphocytes and NK cells, inducing TNF production in the activated cells. TNF, in turn, stimulates IL-I and IL-8 production in CD14+ cells. Because of its ability to induce TNF, IL-1, and both C-C and C-X-C chemokines, and because IL-18 induces Fas ligand as well as nuclear translocation of nuclear factor 6B (NF-6B), IL-18 ranks with other pro-inflammatory cytokines as a likely contributor to systemic and local inflammation.
17. PR04425 O Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the Scoding sequences for novel secreted proteins. We herein describe the identification and characterization of novel secreted polypeptides, designated herein as PR04425 polypeptides.
tcf 18. PR05990 The secretogranin proteins secretogranin I and II) are found in secretory granules in a variety of endocrine cells and neurons. Schimmel, A. et al., FEBS Lett. 314(3):375-80 (1992); Gerdes, H. H. et al., L o Biol. Chem. 264(20): 12009-15. A possible function of the secretogranin proteins is the packaging of secretory CK 10 products, including regulatory peptides. Chanat, E. et al., FEBS Lett. 351(2):225-30 (1994); Rosa, P. et al., OJ. Cell. Biol. 101(5):1999-2011 (1985); Gorr, S. U. et al., Am. J. Physiol. 257(2):E247-54 (1989).
Ci Secretogranins have been successfully used as biological markers in a number of contexts. For example, secretogranin II has gained importance as an immunohistochemical marker for endocrine neoplasms. See Fischer-Colbrie, R. et al., J. Biol. Chem. 265(16):9208-13 (1990). Eder et al. found that the ratio of secretogranin II to chromogranins was remarkably constant within patient populations, and suggest that the ratio may be used as a parameter to standardize CSF levels of other peptides, such as neuropeptides. Eder, et al., J. Neural Transm., 105(1):39-51 (1998).
We herein describe the identification and characterization of novel polypeptides having sequence similarity to secretogranin, designated herein as PR05990 polypeptides.
19. PR06030 Efforts are being undertaken by both industry and academia to identify new, native transmembrane receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane receptor proteins. We herein describe the identification and characterization of novel transmembrane polypeptides, designated herein as PR06030 polypeptides.
PR04424 Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of novel secreted polypeptides, designated herein as PR04424 polypeptides.
21. PR04422 Lysozyme is a protein which is widely distributed in several human tissues and secretions including milk, tears and saliva. It has been demonstrated to hydrolyze linkages between N-acetylglucosamines. It has been demonstrated to be an inhibitor of chemotaxis and of the production of toxic oxygen free radicals and may also have some role in the calcification process. As such, there is substantial interest in identifying novel Spolypeptides having homology to lysozyme. Nakano and Graf, Biochim. Biophys Acta, 1090(2):273-6 (1991).
O
22. PR04430 SEfforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the Cn 5 coding sequences for novel secreted proteins. We herein describe the identification and characterization of novel secreted polypeptides, designated herein as PR04430 polypeptides.
23. PRO4499 Efforts are being undertaken by both industry and academia to identify new, native transmembrane Ci 10 receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to O identify the coding sequences for novel transmembrane receptor proteins. We herein describe the identification and characterization of novel transmembrane polypeptides, designated herein as PR04499 polypeptides.
SUMMARY OF THE INVENTION 1. PR01484 A cDNA clone (DNA44686-1653) has been identified, having homology to nucleic acid encoding adipocyte complement-related protein that encodes a novel polypeptide, designated in the present application as "PRO1484".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1484 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR01484 polypeptide having the sequence of amino acid residues from about 1 or about 23 to about 246, inclusive of Figure 2 (SEQ ID NO:2), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01484 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 77 or about 143 and about 814, inclusive, of Figure 1 (SEQ ID NO:1). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203581 (DNA44686-1653) or the complement of the nucleic acid molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203581 (DNA44686-1653).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA Sencoding a polypeptide having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues I or about 23 to about 246, inclusive of Figure 2 (SEQ ID NO:2), or the complement of the DNA of Ce 5 In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 600 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PRO1484 polypeptide having the sequence of amino acid residues from I or about 23 to about 246, inclusive of Figure 2 (SEQ ID NO:2), or the complement of the DNA molecule of and, if o the DNA molecule has at least about an 80 sequence identity, prefereably at least about an 85% sequence Ci 10 identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence Sidentity to or isolating the test DNA molecule.
C. In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01484 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 22 in the sequence of Figure 2 (SEQ ID NO:2).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues I or about 23 to about 246, inclusive of Figure 2 (SEQ ID NO:2), or the complement of the DNA of Another embodiment is directed to fragments of a PRO1484 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 1 (SEQ ID NO:1).
In another embodiment, the invention provides isolated PRO1484 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01484 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 23 to about 246 of Figure 2 (SEQ ID NO:2).
In another aspect, the invention concerns an isolated PRO 1484 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to the sequence of amino acid residues I or about 23 to about 246, inclusive of Figure 2 (SEQ ID NO:2).
In a further aspect, the invention concerns an isolated PR01484 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least 1T about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence O of residues I or about 23 to about 246, inclusive of Figure 2 (SEQ ID NO:2).
In yet another aspect, the invention concerns an isolated PRO 1484 polypeptide, comprising the sequence Sof amino acid residues I or about 23 to about 246, inclusive of Figure 2 (SEQ ID NO:2), or a fragment thereof sufficient to provide a binding site for an anti-PRO 1484 antibody. Preferably, the PRO1484 fragment retains eCn 5 a qualitative biological activity of a native PRO1484 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PRO1484 polypeptide having the Ssequence of amino acid residues from about 1 or about 23 to about 246, inclusive of Figure 2 (SEQ ID NO: S3), or the complement of the DNA molecule of and if the test DNA molecule has at least about an "K 10 sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 0sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell C-i comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
2. PR04334 A cDNA clone (DNA59608-2577) has been identified that encodes a novel polypeptide having homology to PC-I and designated in the present application as "PR04334".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04334 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04334 polypeptide having the sequence of amino acid residues from 1 or about 23 to about 440, inclusive of Figure 4 (SEQ ID NO:9), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04334 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 150 and about 1403, inclusive, of Figure 3 (SEQ ID NO:8). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203870 (DNA59608-2577), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203870 (DNA59608-2577).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 4 T" identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence Sidentity to the sequence of amino acid residues from about 23 to about 440, inclusive of Figure 4 (SEQ ID NO:9), or the complement of the DNA of SIn a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule Cn 5 under stringent conditions with a DNA molecule encoding a PR04334 polypeptide having the sequence of amino acid residues from about 23 to about 440, inclusive of Figure 4 (SEQ ID NO:9), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90 sequence identity, most preferably Sat least about a 95 sequence identity to or isolating the test DNA molecule.
(c 10 In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA O encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more c-i preferably at least about 90 positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 23 to about 440, inclusive of Figure 4 (SEQ ID NO:9), or the complement of the DNA of Another embodiment is directed to fragments of a PR04334 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR04334 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04334 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 23 through 440 of Figure 4 (SEQ ID NO:9).
In another aspect, the invention concerns an isolated PR04334 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 23 to about 440, inclusive of Figure 4 (SEQ ID NO:9).
In a further aspect, the invention concerns an isolated PR04334 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90 positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 23 through 440 of Figure 4 (SEQ IDNO:9).
In yet another aspect, the invention concerns an isolated PRO4334 polypeptide, comprising the sequence of amino acid residues 23 to about 440, inclusive of Figure 4 (SEQ ID NO:9), or a fragment thereof sufficient to provide a binding site for an anti-PR04334 antibody. Preferably, the PR04334 fragment retains a qualitative biological activity of a native PRO04334 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04334 polypeptide having the sequence of amino acid residues from about 23 to about 440, inclusive of Figure 4 (SEQ ID NO:9), or the Scomplement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the Cn 5 polypeptide from the cell culture.
3. PRO1122 17- A cDNA clone (DNA62377-1381) has been identified, having sequence identity with CTLA-8 that encodes a novel polypeptide, designated in the present application as "PR01122." C 10 In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding O a PR01122 polypeptide.
CI In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding a PROI 122 polypeptide having the sequence of amino acid residues from I or about 19 to about 197, inclusive of Figure 6 (SEQ ID NO: 11), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO1122 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 104 and about 640, inclusive, of Figure 5 (SEQ ID NO:10). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203552 (DNA62377-1381), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203552 (DNA62377-1381).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 19 to about 197, inclusive of Figure 6 (SEQ ID NO: or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR01122 polypeptide having the sequence of amino acid residues from about 19 to about 197, inclusive of Figure 6 (SEQ ID NO: 11), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80 sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a Ssequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
SIn another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the C 5 amino acid sequence of residues 19 to about 197, inclusive of Figure 6 (SEQ ID NO: 11), or the complement of the DNA of In another embodiment, the invention provides isolated PRO 1122 polypeptide encoded by any of the Sisolated nucleic acid sequences hereinabove defined.
SIn a specific aspect, the invention provides isolated native sequence PRO 1122 polypeptide, which in C" 10 one embodiment, includes an amino acid sequence comprising residues 19 through 197 of Figure 6 (SEQ ID SNO:ll).
C, In another aspect, the invention concerns an isolated PRO1122 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 19 to about 197, inclusive of Figure 6 (SEQ ID NO:11).
In a further aspect, the invention concerns an isolated PRO1122 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 19 through 197 of Figure 6 (SEQ ID NO:11).
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PRO1122 polypeptide having the sequence of amino acid residues from about 19 to about 197, inclusive of Figure 6 (SEQ ID NO: 11), or the complement of the DNA molecule of and if the test-DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
4. PR01889 A cDNA clone (DNA77623-2524) has been identified, having homology to nucleic acid encoding E48 antigen protein, that encodes a novel polypeptide, designated in the present application as "PRO1889".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1889 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding a PRO1889 polypeptide having the sequence of amino acid residues from about 1 or about 21 to about 97, inclusive of Figure 8 (SEQ ID t NO: 16), or the complement of the DNA molecule of O In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1889 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 39 or about 99 and about 329, inclusive, of Figure 7 (SEQ ID NO:15). Preferably, hybridization occurs under stringent hybridization and wash conditions.
CM 5 In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203546 o (DNA77623-2524) or the complement of the nucleic acid molecule of In a preferred embodiment, the (C 10 nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in SATCC Deposit No. 203546 (DNA77623-2524).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 21 to about 97, inclusive of Figure 8 (SEQ ID NO:16), or the complement of the DNA of In a further aspect,. the invention concerns an isolated nucleic acid molecule having at least 315 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR01889 polypeptide having the sequence of amino acid residues from I or about 21 to about 97, inclusive of Figure 8 (SEQ ID NO: 16), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01889 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position I to about amino acid position 20 in the sequence of Figure 8 (SEQ ID NO:16).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 1 or about 21 to about 97, inclusive of Figure 8 (SEQ ID NO: 16), or the complement of the DNA of Another embodiment is directed to fragments of a PRO 1889 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 1T from the nucleotide sequence shown in Figure 7 (SEQ ID SIn another embodiment, the invention provides isolated PRO 1889 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
SIn a specific aspect, the invention provides isolated native sequence PR01889 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues I or about 21 to about 97 of Figure S 5 8 (SEQ ID NO: 16).
In another aspect, the invention concerns an isolated PRO 1889 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the o sequence of amino acid residues I or about 21 to about 97, inclusive of Figure 8 (SEQ ID NO: 16).
CI 10 In a further aspect, the invention concerns an isolated PRO 1889 polypeptide, comprising an amino acid 0sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least C-i about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 1 or.about 21 to about 97, inclusive of Figure 8 (SEQ ID NO:16).
In yet another aspect, the invention concerns an isolated PRO 1889 polypeptide, comprising the sequence of amino acid residues 1 or about 21 to about 97, inclusive of Figure 8 (SEQ ID NO:16), or a fragment thereof sufficient to provide a binding site for an anti-PR01889 antibody. Preferably, the PR01889 fragment retains a qualitative biological activity of a native PRO 1889 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PRO1889 polypeptide having the sequence of amino acid residues from about 1 or about 21 to about 97, inclusive of Figure 8 (SEQ ID NO: 16), or the complement of the DNA molecule of and if the test DNA molecule has at least about an sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In another embodiment, the invention concerns a method for determining the presence of a PRO1889 polypeptide comprising exposing a cell suspected of containing the polypeptide to an anti-PRO 1889 antibody and determining binding of the antibody to the cell.
In yet another embodiment, the present invention relates to a method of diagnosing the presence of a cancerous cell in a mammal, comprising detecting the level of expression of a gene encoding a PR01889 polypeptide in a test sample of tissue cells obtained from the mammal, and in a control sample of known normal tissue cells of the same cell type, wherein a higher expression level in the test sample indicates the presence of a cancerous cell, particularly a cancerous squamous cell, in the mammal.
A further embodiment is a method for identifying a compound capable of inhibiting the expression and/or activity of a PR01889 polypeptide by contacting a candidate compound with a PR01889 polypeptide under conditions and for time sufficient to allow these two compounds to interact. In a specific aspect, either the candidate compound or the PRO 1889 polypeptide is immobilized on a solid support. In another aspect, the 1t non-immobilized component carries a detectable label.
O
O In a further embodiment, the present invention provides a-method of diagnosing tumor in a mammal, comprising detecting the level of expression of a gene encoding a PRO1889 polypeptide in a test sample of Stissue cells obtained from the mammal, and in a control sample of known normal tissue cells of the same cell type, wherein a higher expression level in the test sample indicates the presence of tumor in the mammal from n 5 which the test tissue cells were obtained.
In another embodiment, the present invention provides a method of diagnosing tumor in a mammal, Scomprising contacting an anti-PRO1889 antibody with a test sample of the tissue cells obtained from the 17 mammal, and detecting the formation of a complex between the anti-PRO1889 and the PRO1889 polypeptide Sin the test sample. The detection may be qualitative or quantitative, and may be performed in comparison with Ci 10 monitoring the complex formation in a control sample of known normal tissue cells of the same cell type. A 0 larger quantity of complexes formed in the test sample indicates the presence of tumor in the mammal from S-which the test tissue cells were obtained. The antibody preferably carries a detectable label. Complex formation can be monitored, for example, by light microscopy, flow cytometry, fluorimetry, or other techniques known in the art. Preferably, the test sample is obtained from an individual mammal suspected to have neoplastic cell growth or proliferation cancerous cells).
In another embodiment, the present invention provides a cancer diagnostic kit, comprising an anti-PR01889 antibody and a carrier a buffer) in suitable packaging. The kit preferably contains instructions for using the antibody to detect the PRO1889 polypeptide.
In yet another embodiment, the invention provides a method for inhibiting the growth of tumor cells comprising exposing a cell which overexpresses a PRO1889 polypeptide to an effective amount of an agent inhibiting the expression and/or activity of the PRO 1889 polypeptide. The agent preferably is an anti-PR01889 polypeptide, a small organic and inorganic peptide, phosphopeptide, antisense or ribozyme molecule, or a triple helix molecule. In a specific aspect, the agent, anti-PRO1889 antibody induces cell death. In a further aspect, the tumor cells are further exposed to radiation treatment and/or a cytotoxic or chemotherapeutic agent.
In a further embodiment, the invention concerns an article of manufacture, comprising: a container; a label on the container, and a composition comprising an active agent contained within the container; wherein the composition is effective for inhibiting the growth of tumor cells, the label on the container indicates that the composition can be used for treating conditions characterized by overexpression of a PR01889 polypeptide, and the active agent in the composition is an agent inhibiting the expression and/or activity of the PRO1889 polypeptide. In a preferred aspect, the active ageni is an anti-PRO1889 antibody.
PR01890 A cDNA clone (DNA79230-2525) has been identified, having homology to nucleic acid encoding a lectih protein that encodes a novel polypeptide, designated in the present application as "PRO1890".
^t In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding Sa PR01890 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PRO 1890 polypeptide having C 5 the sequence of amino acid residues from about 1 or about 22 to about 273, inclusive of Figure 10 (SEQ ID NO: 18), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01890 Spolypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 378 o or about 441 and about 1196, inclusive, of Figure 9 (SEQ ID NO:17). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having C1 at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203549 (DNA79230-2525) or the complement of the nucleic acid molecule of In a pieferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203549 (DNA79230-2525).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 22 to about 273, inclusive of Figure 10 (SEQ ID NO:18), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 475 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR01890 polypeptide having the sequence of amino acid residues from 1 or about 22 to about 273, inclusive of Figure 10 (SEQ ID NO: 18), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80 sequence identity, prefereably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01890 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 21 in the sequence of Figure 10 (SEQ ID NO:18). The transmembrane domain has been tentatively identified as extending from about amino acid position 214 to about amino acid position 235 in the PRO1890 amino acid sequence (Figure 10, SEQ ID NO:18).
o) In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA Sencoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 0.0) preferably at least about 90% positives, most preferably at least about 95% positives when compared with the ;Z amino acid sequence of residues I or about 22 to about 273, inclusive of Figure 10 (SEQ ID NO:18), or the complement of the DNA of c 5 Another embodiment is directed to fragments of a PRO1890 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived O from the nucleotide sequence shown in Figure 9 (SEQ ID NO:17).
In another embodiment, the invention provides isolated PRO1890 polypeptide encoded by any of the Sisolated nucleic acid sequences hereinabove identified.
C1 In a specific aspect, the invention provides isolated native sequence PR01890 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues I or about 22 to about 273 of Figure (SEQ ID NO: I8).
In another aspect, the invention concerns an isolated PRO1890 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues I or about 22 to about 273, inclusive of Figure 10 (SEQ ID NO: 18).
In a further aspect, the invention concerns an isolated PRO1 890 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 22 to about 273, inclusive of Figure 10 (SEQ ID NO: 18).
In yet another aspect, the invention concerns an isolated PRO 1890 polypeptide, comprising the sequence of amino acid residues 1 or about 22 to about 273, inclusive of Figure 10 (SEQ ID NO:18), or a fragment thereof sufficient to provide a binding site for an anti-PRO1 890 antibody. Preferably, the PR01890 fragment retains a qualitative biological activity of a native PR01890 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR01890 polypeptide having the sequence of amino acid residues from about 1 or about 22 to about 273, inclusive of Figure 10 (SEQ ID NO:18), or the complement of the DNA molecule of and if the test DNA molecule has at least about an sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
S6. PR01887 O A cDNA clone (DNA79862-2522) has been identified that encodes a novel polypeptide having homology to carboxylesterases and designated in the present application as "PR01887".
;Z In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding _a PR01887 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding a PR01887 polypeptide having the sequence of amino acid residues from I or about 28 to about 571, inclusive of Figure 12 (SEQ ID NO:23), O or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01887 o polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 87 and C"l about 1718, inclusive, of Figure 11 (SEQ ID NO:22). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203550 (DNA79862-2522), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203550 (DNA79862-2522).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 28 to about 571, inclusive of Figure 12 (SEQ ID NO:23), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PRO1887 polypeptide having the sequence of amino acid residues from about 28 to about 571, inclusive of Figure 12 (SEQ ID NO:23), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01887 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 27 in the sequence of Figure 12 (SEQ ID NO:23). The transmembrane Sdomain has been tentatively identified as extending from about amino acid position 226 to about amino acid
O
O position 245 in the PR01887 amino acid sequence (Figure 12, SEQ ID NO:23).
0In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 28 to about 571, inclusive of Figure 12 (SEQ ID NO:23), or the complement of the DNA of Another embodiment is directed to fragments of a PRO 1887 polypeptide coding sequence that may find Suse as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, Spreferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1887 polypeptide encoded by any of the Ci isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1887 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 28 to 571 of Figure 12 (SEQ ID NO:23).
In another aspect, the invention concerns an isolated PRO1887 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 28 to about 571, inclusive of Figure 12 (SEQ ID NO:23).
In a further aspect, the invention concerns an isolated PRO1887 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 28 to 571 of Figure 12 (SEQ ID NO:23).
In yet another aspect, the invention concerns an isolated PRO1887 polypeptide, comprising the sequence of amino acid residues 28 to about 571, inclusive of Figure 12 (SEQ ID NO:23), or a fragment thereof sufficient to provide a binding site for an anti-PRO1887 antibody. Preferably, the PRO1887 fragment retains a qualitative biological activity of a native PRO1887 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR01887 polypeptide having the sequence of amino acid residues from about 28 to about 571, inclusive of Figure 12 (SEQ ID NO:23), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
7. PRO17S O A cDNA clone (DNA80136-2503) has been identified that encodes a novel polypeptide having sequence ci identity with peroxidases and designated in the present application as "PR01785." SIn one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1785 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding a PRO 1785 polypeptide having Sthe sequence of amino acid residues from 1 or about 32 to about 209, inclusive of Figure 14 (SEQ ID NO:29), o or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01785 O polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 95 and Cl about 628, inclusive, of Figure 13 (SEQ ID NO:28). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203541 (DNA80136-2503), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203541 (DNA80136-2503).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 32 to about 209, inclusive of Figure 14 (SEQ ID NO:29), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PRO1785 polypeptide having the sequence of amino acid residues from about 32 to about 209, inclusive of Figure 14 (SEQ ID NO:29), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01785 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position I through about amino acid position 31 in the sequence of Figure 14 (SEQ ID NO:29). The transmembrane o domain has been tentatively identified as extending from about amino acid position 18 through about amino acid O position 37 in the PR01785 amino acid sequence (Figure 14, SEQ ID NO:29).
SIn another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 32 to about 209, inclusive of Figure 14 (SEQ ID NO:29), or the complement of the DNA of Another embodiment is directed to fragments of a PRO1785 polypeptide coding sequence that may find Suse as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, Spreferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 10 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
O In another embodiment, the invention provides isolated PRO 1785 polypeptide encoded by any of the Cl .isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1785 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 32 through 209 of Figure 14 (SEQ ID NO:29).
In another aspect, the invention concerns an isolated PR01785 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to the sequence of amino acid residues 32 to about 209, inclusive of Figure 14 (SEQ ID NO:29).
In a further aspect, the invention concerns an isolated PRO1785 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 32 through 209 of Figure 14 (SEQ ID NO:29).
In yet another aspect, the invention concerns an isolated PRO 1785 polypeptide, comprising the sequence of amino acid residues 32 to about 209, inclusive of Figure 14 (SEQ ID NO:29), or a fragment thereof sufficient to provide a binding site for an anti-PRO1785 antibody. Preferably, the PR01785 fragment retains a qualitative biological activity of a native PR01785 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR01785 polypeptide having the sequence of amino acid residues from about 32 to about 209, inclusive of Figure 14 (SEQ ID NO:29), or (b) the complement of the DNA molecule of(a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence.identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
8. PR04353 SA cDNA clone (DNA80145-2594) has been identified that encodes a novel polypeptide having homology 0. to semaphorin Z and designated in the present application as "PR04353".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding Sa PR04353 polypeptide.
c 5 In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04353 polypeptide having the sequence of amino acid residues from 1 or about 26 to about 888, inclusive of Figure 16 (SEQ ID O or the complement of the DNA molecule of in 10 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04353 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 94 and Sabout 2682, inclusive, of Figure 15 (SEQ ID NO:34). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 204-PTA (DNA80145-2594), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 204-PTA (DNA80145-2594).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about.85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 26 to about 888, inclusive of Figure 16 (SEQ ID NO:35), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04353 polypeptide having the sequence of amino acid residues from about 26 to about 888, inclusive of Figure 16 (SEQ ID NO:35), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04353 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 25 in the sequence of Figure 16 (SEQ ID NO:35). The transmembrane o domains have been tentatively identified as extending from about amino acid positions 318-339 and 598-617 C- (Figure 16, SEQ ID In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA ;Z encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the
C
c 5 amino acid sequence of residues 26 to about 888, inclusive of Figure 16 (SEQ ID NO:35), or the complement of the DNA of n1" Another embodiment is directed to fragments of a PR04353 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in O length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
SIn another embodiment, the invention provides isolated PR04353 polypeptide encoded by any of the Ci isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04353 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 26 through 888 of Figure 16 (SEQ ID In another aspect, the invention concerns an isolated PR04353 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to the sequence of amino acid residues 26 to about 888, inclusive of Figure 16 (SEQ ID In a further aspect, the invention concerns an isolated PR04353 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 26 through 888 of Figure 16 (SEQ ID In yet another aspect, the invention concerns an isolated PR04353 polypeptide, comprising the sequence of amino acid residues 26 to about 888, inclusive of Figure 16 (SEQ ID NO:35), or a fragment thereof sufficient to provide a binding site for an anti-PR04353 antibody. Preferably, the PR04353 fragment retains a qualitative biological activity of a native PR04353 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04353 polypeptide having the sequence of amino acid residues from about 26 to about 888, inclusive of Figure 16 (SEQ ID NO:35), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
9. PR04357 O A cDNA clone (DNA84917-2597) has been identified that encodes a novel polypeptide having homology to "BK158_1" and designated in the present application as "PR04357".
;Z In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding _a PR04357 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04357 polypeptide having the sequence of amino acid residues from i or about 18 to about 502, inclusive of Figure 18 (SEQ ID Sor the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04357 O polypeptide comprising DNA hybridizing to the' complement of the nucleic acid between about residues 337 and Cabout 1791, inclusive, of Figure 17 (SEQ ID NO:39). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203863 (DNA84917-2597), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203863 (DNA84917-2597).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 18 to about 502, inclusive of Figure 18 (SEQ ID NO:40), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04357 polypeptide having the sequence of amino acid residues from about 18 to about 502, inclusive of Figure 18 (SEQ ID NO:40), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 18 to about 502, inclusive of Figure 18 (SEQ ID NO:40), or the complement of the DNA of o Another embodiment is directed to fragments of a PR04357 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in b length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through ;Z about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
_In another embodiment, the invention provides isolated PR04357 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04357 polypeptide, which in n one embodiment, includes an amino acid sequence comprising residues 18 through 502 of Figure 18 (SEQ ID O In another aspect, the invention concerns an isolated PR04357 polypeptide, comprising an amino acid S 10 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more Spreferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the CI sequence of amino acid residues 18 to about 502, inclusive of Figure 18 (SEQ ID In a further aspect, the invention concerns an isolated PR04357 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 18 through 502 of Figure 18 (SEQ ID In yet another aspect, the invention concerns an isolated PR04357 polypeptide, comprising the sequence of amino acid residues 18 to about 502, inclusive of Figure 18 (SEQ ID NO:40), or a fragment thereof sufficient to provide a binding site for an anti-PR04357 antibody. Preferably, the PR04357 fragment retains a qualitative biological activity of a native PR04357 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04357 polypeptide having the sequence of amino acid residues from about 18 to about 502, inclusive of Figure 18 (SEQ ID NO:40), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
10. PR04405 A cDNA clone (DNA84920-2614) has been identified that encodes a novel polypeptide designated in the present application as "PR04405".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04405 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding a PR04405 polypeptide having o the sequence of amino acid residues from 1 or about 35 to about 310, inclusive of Figure 20 (SEQ ID Sor the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04405 ;Z polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 181 and _about 1008, inclusive, of Figure 19 (SEQ ID NO:44). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule Sencoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203966 10 (DNA84920-2614), or the complement of the DNA molecule of(a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC C. Deposit No. 203966 (DNA84920-2614).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 35 to about 310, inclusive of Figure 20 (SEQ ID or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04405 polypeptide having the sequence of amino acid residues from about 35 to about 310, inclusive of Figure 20 (SEQ ID NO:45), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04405 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 34 in the sequence of Figure 20 (SEQ ID NO:45). The transmembrane domain has been tentatively identified as extending from about amino acid position 58 through about amino acid position 76 in the PR04405 amino acid sequence (Figure 20, SEQ ID NO:45) and may be a type II transmembrane domain.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 35 to about 3 10, inclusive of Figure 20 (SEQ ID NO:45), or the complement of the DNA of t Another embodiment is directed to fragments of a PR04405 polypeptide coding sequence that may find C use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in Slength, preferably from about 20 through about 60 nucleoddes in length, more preferably from about 20 through ;Z about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
_In another embodiment, the invention provides isolated PR04405 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04405 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 35 through 310 of Figure 20 (SEQ ID SIn another aspect, the invention concerns an isolated PR04405 polypeptide, comprising an amino acid tt 10 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more o preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the ,1 sequence of amino acid residues 35 to about 310, inclusive of Figure 20 (SEQ ID In a further aspect, the invention concerns an isolated PR04405 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 35 through 310 of Figure 20 (SEQ ID In yet another aspect, the invention concerns an isolated PRO4405 polypeptide, comprising the sequence of amino acid residues 35 to about 310, inclusive of Figure 20 (SEQ ID NO:45), or a fragment thereof sufficient to provide a binding site for an anti-PR04405 antibody. Preferably, the PR04405 fragment retains a qualitative biological activity of a native PRO4405 polypeptide.
SIn a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04405 polypeptide having the sequence of amino acid residues from about 35 to about 310, inclusive of Figure 20 (SEQ ID NO:45), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
11. PR04356 A cDNA clone (DNA86576-2595) has been identified that encodes a novel polypeptide having homology to MAGPIAP and designated in the present application as "PR04356".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO4356 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04356 polypeptide having o the sequence of amino acid residues from 1 or about 20 to about 251, inclusive of Figure 22 (SEQ ID or the complement of the DNA molecule of OIn another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04356 ;Z polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 112 and Sabout 807, inclusive, of Figure 21 (SEQ ID NO:49). Preferably, hybridization occurs under stringent C 5 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule Sencoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203868 tn 10 (DNA86576-2595), or the complement of the DNA molecule of In a preferred embodiment, the nucleic o acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC C(l Deposit No. 203868 (DNA86576-2595).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 20 to about 251, inclusive of Figure 22 (SEQ ID or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04356 polypeptide having the sequence of amino acid residues from about 20 to about 251, inclusive of Figure 22 (SEQ ID NO:50), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90 sequence identity, most preferably at least about a 95 sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04356 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 19 in the sequence of Figure 22 (SEQ ID NO:50). The transmembrane domain has been tentatively identified as extending from about amino acid position 233 through about amino acid position 251 in the PR04356 amino acid sequence (Figure 22, SEQ ID In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 20 to about 251, inclusive of Figure 22 (SEQ ID NO:50), or the complement of the DNA of SAnother embodiment is directed to fragments of a PR04356 polypeptide coding sequence that may find O use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in Slength, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through Sabout 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR04356 polypeptide encoded by any of the S 5 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04356 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 20 through 251 of Figure 22 (SEQ ID SIn another aspect, the invention concerns an isolated PR04356 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more O preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the C-l sequence of amino acid residues 20 to about 251, inclusive of Figure 22 (SEQ ID In a further aspect, the invention concerns an isolated PR04356 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 20 through 251 of Figure 22 (SEQ ID In yet another aspect, the invention concerns an isolated PR04356 polypeptide, comprising the sequence of amino acid residues 20 to about 251, inclusive of Figure 22 (SEQ ID NO:50), or a fragment thereof sufficient to provide a binding site for an anti-PR04356 antibody. Preferably, the PR04356 fragment retains a qualitative biological activity of a native PR04356 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04356 polypeptide having the sequence of amino acid residues from about 20 to about 251, inclusive of Figure 22 (SEQ ID NO:50), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
12. PR04352 A cDNA clone (DNA87976-2593) has been identified that encodes a novel polypeptide having homology to protocadherin pc3 designated in the present application as "PR04352".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04352 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04352 polypeptide having d the sequence of amino acid residues from 1 or about 27 to about 800, inclusive of Figure 24 (SEQ ID NO:52), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04352 ;Z polypeptide comprising' DNA hybridizing to the complement of the nucleic acid between about residues 257 and _about 2578, inclusive, of Figure 23 (SEQ ID NO:51). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having n at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule Sencoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203888 Ittr 10 (DNA87976-2593), or the complement of the DNA molecule of In a preferred embodiment, the nucleic o acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Ci .Deposit No. 203888 (DNA87976-2593).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 27 to about 800, inclusive of Figure 24 (SEQ ID NO:52), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04352 polypeptide having the sequence of amino acid residues from about 27 to about 800, inclusive of Figure 24 (SEQ ID NO:52), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04352 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position I through about amino acid position 26 in the sequence of Figure 24 (SEQ ID NO:52). The transmembrane domain has been tentatively identified as extending from about amino acid position 687 through about amino acid position 711 in the PR04352 amino acid sequence (Figure 24, SEQ ID NO:52).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 27 to about 800, inclusive of Figure 24 (SEQ ID NO:52), or the complement of the DNA of o Another embodiment is directed to fragments of a PR04352 polypeptide coding sequence that may find O use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in Slength, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through Sabout 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR04352 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04352 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 27 through 800 of Figure 24 (SEQ ID NO:52).
o In another aspect, the invention concerns an isolated PR04352 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more O preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the C"l sequence of amino acid residues 27 to about 800, inclusive of Figure 24 (SEQ ID NO:52).
In a further aspect, the invention concerns an isolated PR04352 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 27 through 800 of Figure 24 (SEQ ID NO:52).
In yet another aspect, the invention concerns an isolated PRO4352 polypeptide, comprising the sequence of amino acid residues 27 to about 800, inclusive of Figure 24 (SEQ ID NO:52), or a fragment thereof sufficient to provide a binding site for an anti-PR04352 antibody. Preferably, the PR04352 fragment retains a qualitative biological activity of a native PR04352 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04352 polypeptide having the sequence of amino acid residues from about 27 to about 800, inclusive of Figure 24 (SEQ ID NO:52), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
13. PR04380 A cDNA clone (DNA92234-2602) has been identified that encodes a novel polypeptide designated in the present application as "PR04380".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04380 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04380 polypeptide having Sthe sequence of amino acid residues from 1 or about 27 to about 507, inclusive of Figure 26 (SEQ ID NO:57), O or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04380 Spolypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 279 and about 1721, inclusive, of Figure 25 (SEQ ID NO:56). Preferably, hybridization occurs under stringent en 5 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least 1 about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule Sencoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203948 (DNA92234-2602), or the complement of the DNA molecule of In a preferred embodiment, the nucleic Sacid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC C Deposit No. 203948 (DNA92234-2602).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 27 to about 507, inclusive of Figure 26 (SEQ ID NO:57), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04380 polypeptide having the sequence of amino acid residues from about 27 to about 507, inclusive of Figure 26 (SEQ ID NO:57), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80 sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04380 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position I through about amino acid position 26 in the sequence of Figure 26 (SEQ ID NO:57). The transmembrane domain has been tentatively identified as extending from about amino acid position 273 through about amino acid position 292 in the PR04380 amino acid sequence (Figure 26, SEQ ID NO:57).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at.least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 27 to about 507, inclusive of Figure 26 (SEQ ID NO:57), or the complement of the DNA of o Another embodiment is directed to fragments of a PRO4380 polypeptide coding sequence that may find o use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in (3length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through ;Z about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR04380 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04380 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 27 through 507 of Figure 26 (SEQ ID SNO:57).
O In another aspect, the invention concerns an isolated PR04380 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more O preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to the Cl sequence of amino acid residues 27 to about 507, inclusive of Figure 26 (SEQ ID NO:57).
In a further aspect, the invention concerns an isolated PR04380 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 27 through 507 of Figure 26 (SEQ ID NO:57).
In yet another aspect, the invention concerns an isolated PR04380 polypeptide, comprising the sequence of amino acid residues 27 to about 507, inclusive of Figure 26 (SEQ ID NO:57), or a fragment thereof sufficient to provide a binding site for an anti-PR04380 antibody. Preferably, the PR04380 fragment retains a qualitative biological activity of a native PR04380 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04380 polypeptide having the sequence of amino acid residues from about 27 to about 507, inclusive of Figure 26 (SEQ ID NO:57), or (b) the complement of the DNA molecule of(a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
14. PR04354 A cDNA clone (DNA92256-2596) has been identified that encodes a novel polypeptide designated in the present application as "PR04354".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04354 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04354 polypeptide having o the sequence of amino acid residues from about 22 to about 248, inclusive of Figure 28 (SEQ ID NO:59), or -i the complement of the DNA molecule of 00 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04354 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 171 and about 851, inclusive, of Figure 27 (SEQ ID NO:58). Preferably, hybridization occurs under stringent cf 5 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule Sencoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203891 tf 10 (DNA92256-2596), or the complement of the DNA molecule of(a). In a preferred embodiment, the nucleic o acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC SDeposit No. 203891 (DNA92256-2596).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 22 to about 248, inclusive of Figure 28 (SEQ ID NO:59), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04354 polypeptide having the sequence of amino acid residues from about 22 to about 248, inclusive of Figure 28 (SEQ ID NO:59), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80 sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 22 to about 248, inclusive of Figure 28 (SEQ ID NO:59), or the complement of the DNA of Another embodiment is directed to fragments of a PR04354 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length, In another embodiment, the invention provides isolated PR04354 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04354 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 22 through 248 of Figure 28 (SEQ ID SNO:59).
In another aspect, the invention concerns an isolated PR04354 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more ;Z preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 22 to about 248, inclusive of Figure 28 (SEQ ID NO:59).
In a further aspect, the invention concerns an isolated PR04354 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 22 through 248 of Figure 28 (SEQ ID NO:59).
SIn yet another aspect, the invention concerns an isolated PR04354 polypeptide, comprising the sequence I/ 10 of amino acid residues 22 to about 248, inclusive of Figure 28 (SEQ ID NO:59), or a fragment thereof sufficient o to provide a binding site for an anti-PR04354 antibody. Preferably, the PR04354 fragment retains a qualitative i biological activity of a native PR04354 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04354 polypeptide having the sequence of amino acid residues from about 22 to about 248, inclusive of Figure 28 (SEQ ID NO:59), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
PR04408 A cDNA clone (DNA92274-2617) has been identified that encodes a novel polypeptide designated in the present application as "PR04408".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04408 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding a PR04408 polypeptide having the sequence of amino acid residues from about 23 to about 223, inclusive of Figure 30 (SEQ ID NO:61), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04408 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 155 and about 757, inclusive, of Figure 29 (SEQ ID NO:60), Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least O about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203971 (DNA92274-2617), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203971 (DNA92274-2617).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence
I.
1 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 23 to about 223, inclusive of Figure 30 (SEQ ID SNO:61), or the complement of the DNA of ltr 10 In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about Snucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule Ci under stringent conditions with a DNA molecule encoding a PR04408 polypeptide having the sequence of amino acid residues from about 23 to about 223, inclusive of Figure 30 (SEQ ID NO:61), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 23 to about 223, inclusive of Figure 30 (SEQ ID NO:61), or the complement of the DNA of Another embodiment is directed to fragments of a PR04408 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR04408 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04408 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 23 through 223 of Figure 30 (SEQ ID NO:61).
In another aspect, the invention concerns an isolated PR04408 polypeptide; comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 23 to about 223, inclusive of Figure 30 (SEQ ID NO:61).
In a further aspect, the invention concerns an isolated PR04408 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence o of residues 23 through 223 of Figure 30 (SEQ ID NO:61).
C In yet another aspect, the invention concerns an isolated PR04408 polypeptide, comprising the sequence of amino acid residues 23 to about 223, inclusive of Figure 30 (SEQ ID NO:61), or a fragment thereof sufficient ;Z to provide a binding site for an anti-PR04408 antibody. Preferably, the PR04408 fragment retains a qualitative biological activity of a native PR04408 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04408 polypeptide having the Ssequence of amino acid residues from about 23 to about 223, inclusive of Figure 30 (SEQ ID NO:61), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence Sidentity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 10 identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising Sthe test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the C polypeptide from the cell culture.
16. PRO5737 A cDNA clone (DNA92929-2534) has been identified that encodes a novel polypeptide having homology to IL-1 and is designated in the present application as "PR05737".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO5737 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR05737 polypeptide having the sequence of amino acid residues from 1 or about 18 to about 134, inclusive of Figure 32 (SEQ ID NO:63), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR05737 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 96 or about 147 and about 497, inclusive, of Figure 31 (SEQ ID NO:62). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203586 (DNA92929-2534), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203586 (DNA92929-2534).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence o identity to the sequence of amino acid residues from about 18 to about 134, inclusive of Figure 32 (SEQ ID C- NO:63), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about ;Z nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule _under stringent conditions with a DNA molecule encoding a PR05737 polypeptide having the sequence of c 5 amino acid residues from about 18 to.about 134, inclusive of Figure 32 (SEQ ID NO:63), or the complement of the DNA molecule of(a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or isolating the test DNA molecule.
SIn another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA Vtt 10 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more o preferably at least about 90% positives, most preferably at least about 95 positives when compared with the Samino acid sequence of residues 18 to about 134, inclusive of Figure 32 (SEQ ID NO:63), or the complement of the DNA of Another embodiment is directed to fragments of a PR05737 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR05737 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR05737 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 18 through 134 of Figure 32 (SEQ ID NO:63).
In another aspect, the invention concerns an isolated PR05737 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to the sequence of amino acid residues 18 to about 134, inclusive of Figure 32 (SEQ ID NO:63).
In a further aspect, the invention concerns an isolated PR05737 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 18 through 134 of Figure 32 (SEQ ID NO:63).
In yet another aspect, the invention concerns an isolated PR05737 polypeptide, comprising the sequence of amino acid residues 18 to about 134, inclusive of Figure 32 (SEQ ID NO:63), or a fragment thereof sufficient to provide a binding site for an anti-PR05737 antibody. Preferably, the PRO5737 fragment retains a qualitative biological activity of a native PR05737 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR05737 polypeptide having the sequence of amino acid residues from about 18 to about 134, inclusive of Figure 32 (SEQ ID NO:63), or (b) O the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence C- identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
m 17. PR04425 A cDNA clone (DNA93011-2637) has been identified that encodes a novel polypeptide having homology to a protein in GenBank, accession number HGS RE295, and designated in the present application as o "PR04425".
tr 10 In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding oa PR04425 polypeptide.
Ci In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04425 polypeptide having the sequence of amino acid residues from I OR about 20 to about 136, inclusive of Figure 34 (SEQ ID or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04425 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 84 and about 434, inclusive, of Figure 33 (SEQ ID NO:64). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. (DNA93011-2637), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 20-PTA (DNA93011-2637).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80 sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to the sequence of amino acid residues from about 20 to about 136, inclusive of Figure 34 (SEQ ID or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04425 polypeptide having the sequence of amino acid residues from about 20 to about 136, inclusive of Figure 34 (SEQ ID NO:65), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably O at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably C1 at least about a 95 sequence identity to or isolating the test DNA molecule.
00) In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA ;Z encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 positives, most preferably at least about 95 positives when compared with the c 5 amino acid sequence of residues 20 to about 136, inclusive of Figure 34 (SEQ ID NO:65), or the complement of the DNA of SAnother embodiment is directed to fragments of a PR04425 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in O length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through n) 10 about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
SIn another embodiment, the invention provides isolated PR04425 polypeptide encoded by any of the C" isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04425 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 20 through 136 of Figure 34 (SEQ ID In another aspect, the invention concerns an isolated PR04425 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to the sequence of amino acid residues 20to about 136, inclusive of Figure 34 (SEQ ID In a further aspect, the invention concerns an isolated PR04425 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 20 through 136 of Figure 34 (SEQ ID In yet another aspect, the invention concerns an isolated PR04425 polypeptide, comprising the sequence of amino acid residues 20 to about 136, inclusive of Figure 34 (SEQ ID NO:65), or a fragment thereof sufficient to provide a binding site for an anti-PR04425 antibody. Preferably, the PR04425 fragment retains a qualitative biological activity of a native PR04425 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04425 polypeptide having the sequence of amino acid residues from about 20 to about 136, inclusive of Figure 34 (SEQ ID NO:65), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity; most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
o 18. PR05990 O A cDNA clone (designated herein as DNA96042-2682) has been identified that has homology to nucleic acid encoding secretogranin and that encodes a novel polypeptide, designated in the present application as S"PR05990".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising a nucleotide sequence that encodes a PR05990 polypeptide.
In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about sequence identity, preferably at least about 81 sequence identity, more preferably at least about 82% Ssequence identity, yet more preferably at least about 83 sequence identity, yet more preferably at least about O 84 sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87 sequence identity, yet more preferably at O least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably C at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93 sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to a DNA molecule encoding a PR05990 polypeptide having the sequence of amino acid residues from about I or about 22 to about 468, inclusive, of Figure 36 (SEQ ID NO:67), or the complement of the DNA molecule of In another aspect, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a PR05990 polypeptide having the sequence of amino acid residues from about 1 or about 22 to about 468, inclusive, of Figure 36 (SEQ ID NO:67), or the complement of the nucleotide sequence of In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83 sequence identity, yet more preferably at least about 84 sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87 sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet more preferably at least about 94 sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99 sequence identity to a DNA molecule having the sequence of nucleotides from about 265 or about 328 to about 1668, inclusive, of Figure 35 (SEQ ID NO:66), or the complement of the DNA molecule of In another aspect,.the isolated nucleic acid molecule comprises the nucleotide sequence of from about 265 or about 328 to about 1668, inclusive, of Figure 35 (SEQ ID NO:66), or the complement of the o nucleotide sequence of O In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide Csequence having at least about 80 sequence identity, preferably at least about 81% sequence identity, more ;Z spreferably at least about 82 0% sequence identity, yet more preferably at least about 83 sequence identity, yet more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84 sequence identity, yet more preferably at least about 85% sequence identity, identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence sequence identity, yet more preferably at least about 088% sequence identity, yet more preferably at least about 89% 91% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 17 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least o about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95% sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably Sat least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more Cpreferably at least about 99% sequence identity to a DNA molecule that encodes the same mature polypeptide encoded by the human protein cDNA deposited with the ATCC on July 20, 1999 under ATCC Deposit No. 382- PTA (DNA96042-2682) or the complement of the DNA molecule of In a preferred embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding the same mature polypeptide encoded by the human protein cDNA deposited with the ATCC on July 20, 1999 under ATCC Deposit No. 382- PTA (DNA96042-2682) or the complement of the nucleotide sequence of In another aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82 sequence identity, yet more preferably at least about 83 sequence identity, yet more preferably at least about 84 sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about.87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91 sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96 sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to the full-length polypeptide coding sequence of the human protein cDNA deposited with the ATCC on July 20, 1999 under ATCC Deposit No. 382-PTA (DNA96042- 2682) or the complement of the nucleotide sequence of In a preferred embodiment, the isolated nucleic acid molecule comprises the full-length polypeptide coding sequence of the DNA deposited with the ATCC on July 20, 1999 under ATCC Deposit No. 382-PTA (DNA96042-2682) or the complement of the nucleotide sequence of In another aspect, the invention concerns an isolated nucleic acid molecule which encodes an active PR05990 polypeptide as defined below comprising a nucleotide sequence that hybridizes to the complement of a nucleic acid sequence that encodes amino acids 1 or about 22 to about 468, inclusive, of Figure 36 (SEQ ID f NO:67). Preferably, hybridization occurs under stringent hybridization and wash conditions.
SIn yet another aspect, the invention concerns an isolated nucleic acid molecule which encodes an active b4p PR05990 polypeptide as defined below comprising a nucleotide sequence that hybridizes to the complement of ;Z the nucleic acid sequence between about nucleotides 265 or about 328 and about 1668, inclusive, of Figure (SEQ ID NO:66). Preferably, hybridization occurs under stringent hybridization and wash conditions.
C 5 In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 1301 nucleotides and which is produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR05990 polypeptide having the sequence of amino acid residues from about 1 or Sabout 22 to about 468, inclusive, of Figure 36 (SEQ ID NO:67), or the complement of the DNA molecule o of and, if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 81 sequence identity, more preferably at least about an 82% sequence identity, yet more preferably at least 0about an 83% sequence identity, yet more preferably at least about an 84% sequence identity, yet more preferably at least about an 85 sequence identity, yet more preferably at least about an 86 sequence identity, yet more preferably at least about an 87% sequence identity, yet more preferably at least about an 88% sequence identity, yet more preferably at least about an 89% sequence identity, yet more preferably at least about a 90 sequence identity, yet more preferably at least about a 91 sequence identity, yet more preferably at least about a 92% sequence identity, yet more preferably at least about a 93 sequence identity, yet more preferably at least about a 94 sequence identity, yet more preferably at least about a 95 sequence identity, yet more preferably at least about a 96% sequence identity, yet more preferably at least about a 97% sequence identity, yel more preferably at least about a 98% sequence identity and yet more preferably at least about a 99% sequence identity to or and isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide sequence encoding a polypeptide scoring at least about 80% positives, preferably at least about 81% positives, more preferably at least about 82% positives, yet more preferably at least about 83% positives, yet more preferably at least about 84 positives, yet more preferably at least about 85 positives, yet more preferably at least about 86% positives, yet more preferably at least about 87 positives, yet more preferably at least about 88% positives, yet more preferably at least about 89% positives, yet more preferably at least about positives, yet more preferably at least about 91 positives, yet more preferably at least about 92% positives, yet more preferably at least about 93 positives, yet more preferably at least about 94 positives, yet more preferably at least about 95% positives, yet more preferably at least about 96% positives, yet more preferably at least about 97 positives, yet more preferably at least about 98 positives and yet more preferably at least about 99% positives when compared with the amino acid sequence of residues about 1 or about 22 to 468, inclusive, of Figure 36 (SEQ ID NO:67), or the complement of the nucleotide sequence of In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR05990 polypeptide without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 21 in the sequence of Figure 36 (SEQ ID NO:67). It is noted, however, that the C-terminal boundary of the signal peptide may vary, but most likely o by no more than about 5 amino acids on either side of the signal peptide C-terminal boundary as initially C identified herein, wherein the C-terminal boundary of the signal peptide may be identified pursuant to criteria J) routinely employed in the art for identifying that type of amino acid sequence element Nielsen et al., Prot.
;Z En. 10:1-6 (1997) and von Heinje el al., Nucl. Acids. Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal sequence from a secreted polypeptide is not entirely uniform, c 5 resulting in more than one secreted species. These polypeptides, and the polynucleotides encoding them, are contemplated by the present invention. As such, for purposes of the present application, the signal peptide of the PR05990 polypeptide shown in Figure 36 (SEQ ID NO:67) extends from amino acids 1 to X of Figure 36 (SEQ ID NO:67), wherein X is any amino acid from 16 to 26 of Figure 36 (SEQ ID NO:67). Therefore, mature O forms of the PR05990 polypeptide which are encompassed by the present invention include those comprising 10 amino acids X to 468 of Figure 36 (SEQ ID NO:67), wherein X is any amino acid from 16 to 26 of Figure 36 o (SEQ ID NO:67) and variants thereof as described below. Isolated nucleic acid molecules encoding these Spolypeptides are also contemplated.
Another embodiment is directed to fragments of a PR05990 polypeptide coding sequence that may find use as, for example, hybridization probes or for encoding fragments of a PR05990 polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PRO5990 antibody. Such nucleic acid fragments are usually at least about 20 nucleotides in length, preferably at least about 30 nucleotides in length, more preferably at least about 40 nucleotides in length, yet more preferably at least about 50 nucleotides in length, yet more preferably at least about 60 nucleotides in length, yet more preferably at least about nucleotides in length, yet more preferably at least about 80 nucleotides in length, yet more preferably at least about 90 nucleotides in length, yet more preferably at least about 100 nucleotides in length, yet more preferably at least about 110 nucleotides in length, yet more preferably at least about 120 nucleotides in length, yet more preferably at least about 130 nucleotides in length, yet more preferably at least about 140 nucleotides in length, yet more preferably at least about 150 nucleotides in length, yet more preferably at least about 160 nucleotides in length, yet more preferably at least about 170 nucleotides in length, yet more preferably at least about 180 nucleotides in length, yet more preferably at least about 190 nucleotides in length, yet more preferably at least about 200 nucleotides in length, yet more preferably at least about 250 nucleotides in length, yet more preferably at least about 300 nucleotides in length, yet more preferably at least about 350 nucleotides in length, yet more preferably at least about 400 nucleotides in length, yet more preferably at least about 450 nucleotides in length, yet more preferably at least about 500 nucleotides in length, yet more preferably at least about 600 nucleotides in length, yet more preferably at least about 700 nucleotides in length, yet more preferably at least about 800 nucleotides in length, yet more preferably at least about 900 nucleotides in length and yet more preferably at least about 1000 nucleotides in length, wherein in this context the term "about" means the referenced nucleotide sequence length plus or minus 10% of that referenced length. In a preferred embodiment, the nucleotide sequence fragment is derived from any coding region of the nucleotide sequence shown in Figure 35 (SEQ ID NO;66). It is noted that novel fragments of a PR05990 polypeptide-encoding nucleotide sequence may be determined in a routine manner by aligning the PR05990 polypeptide-encoding nucleotide sequence with other known nucleotide sequences using any of a number of well known sequence alignment programs and determining o which PR05990 polypeptide-encoding nucleotide sequence fragment(s) are novel. All of such PR05990 Spolypeptide-encoding nucleotide sequences are contemplated herein and can be determined without undue experimentation. Also contemplated are the PR05990 polypeptide fragments encoded by these nucleotide ;Z molecule fragments, preferably those PR05990 polypeptide fragments that comprise a binding site for an anti- _PR05990 antibody.
cf 5 In another embodiment, the invention provides isolated PR05990 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR05990 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues from about I or about 22 to about S468 of Figure 36 (SEQ ID NO:67).
In another aspect, the invention concerns an isolated PR05990 polypeptide, comprising an amino acid o sequence having at least about 80% sequence identity, preferably at least about 81 sequence identity, more C, preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84% sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91 sequence identity, yet more preferably at least about 92 sequence identity, yet more preferably at least about 93% sequence identity, yet more preferably at least about 94 sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96 sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to the sequence of amino acid residues from about 1 or about 22 to about 468, inclusive, of Figure 36 (SEQ ID NO:67).
In a further aspect, the invention concerns an isolated PR05990 polypeptide comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 81 sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84% sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet more preferably at least about 94 sequence identity, yet more preferably at least about 95% sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to an amino acid sequence encoded by the human protein cDNA deposited with the ATCC on July 20, 1999 under ATCC Deposit No. 382-PTA (DNA96042-2682). In a preferred embodiment, the isolated PR05990 polypeptide comprises an amino acid sequence encoded by the human protein cDNA deposited with the ATCC on July 20, 1999 under ATCC Deposit No. 382-PTA S(DNA96042-2682).
O In a further aspect, the invention concerns an isolated PR05990 polypeptide comprising an amino acid 0JT sequence scoring at least about 80% positives, preferably at least about 81% positives, more preferably at least ;Z about 82% positives, yet more preferably at least about 83% positives, yet more preferably at least about 84 positives, yet more preferably at least about 85% positives, yet more preferably at least about 86% positives, yet more preferably at least about 87% positives, yet more preferably at least about 88% positives, yet more preferably at least about 89% positives, yet more preferably at least about 90% positives, yet more preferably at least about 91 positives, yet more preferably at least about 92% positives, yet more preferably at least about Slsa93% positives, yet more preferably at least about 94% positives, yet more preferably at least about O positives, yet more preferably at least about 96% positives, yet more preferably at least about 97% positives, yet more preferably at least about 98% positives and yet more preferably at least about 99% positives when compared with the amino acid sequence of residues from about 1 or about 22 to about 468, inclusive, of Figure C<i 36 (SEQ ID NO:67).
In a specific aspect, the invention provides an isolated PR05990 polypeptide without the N-terminal signal sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding nucleic acid molecule under conditions suitable for expression of the PR05990 polypeptide and recovering the PR05990 polypeptide from the cell culture.
In yet another aspect, the invention concerns an isolated PR05990 polypeptide, comprising the sequence of amino acid residues from about I or about 22 to about 468, inclusive, of Figure 36 (SEQ ID NO:67), or a fragment thereof which' is biologically active or sufficient to provide a binding site for an anti-PR05990 antibody, wherein the identification of PR05990 polypeptide fragments that possess biological activity or provide a binding site for an anti-PR05990 antibody may be accomplished in a routine manner using techniques which are well known in the art. Preferably, the PR05990 fragment retains a qualitative biological activity of a native PR05990 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR05990 polypeptide having the sequence of amino acid residues from about I or about 22 to about 468, inclusive, of Figure 36 (SEQ ID NO:67), or the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 81 sequence identity, more preferably at least about an 82% sequence identity, yet more preferably at least about an 83 sequence identity, yet more preferably at least about an 84% sequence identity, yet more preferably at least about an 85% sequence identity, yet more preferably at least about an 86% sequence identity, yet more preferably at least about an 87 sequence identity, yet more preferably at least about an 88% sequence identity, yet more preferably at least about an 89% sequence identity, yet more preferably at least about a 90% sequence identity, yet more preferably at least about a 91 sequence identity, yet more preferably at least about a 92% sequence identity, yet more preferably at least about a 93% sequence identity, yet more preferably at least about a 94 sequence identity, yet more preferably at least WO 00/56889 PCT/USOO/05601 O about a 95% sequence identity, yet more preferably at least about a 96% sequence identity, yet more preferably Ci at least about a 97% sequence identity, yet more preferably at least about a 98% sequence identity and yet more preferably at least about a 99% sequence identity to or (ii) culturing a host cell comprising the test DNA Smolecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
Another embodiment of the present invention is directed to the use of a PR05990 polypeptide, or an agonist or antagonist thereof as herein described, or an anti-PR05990 antibody, for the preparation of a l^ medicament useful in the treatment of a condition which is responsive to the PR05990 polypeptide, an agonist or antagonist thereof or an anti-PR05990 antibody.
tn 10 19. PRO6030 SA cDNA clone (designated herein as DNA96850-2705) has been identified that encodes a novel Ci polypeptide, designated in the present application as "PR06030".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising a nucleotide sequence that encodes a PR06030 polypeptide.
In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about sequence identity, preferably at least about 81 sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83 sequence identity, yet more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least aboul 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more preferably at least about 92 sequence identity, yet more preferably at least about 93 sequence identity, yet more preferably at least about 94 sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to a DNA molecule encoding a PR06030 polypeptide having the sequence of amino acid residues from about 1 or about 27 to about 322, inclusive, of Figure 38 (SEQ ID NO:72), or the complement of the DNA molecule of In another aspect, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a PR06030 polypeptide having the sequence of amino acid residues from about 1 or about 27 to about 322, inclusive, of Figure 38 (SEQ ID NO:72), or the complement of the nucleotide sequence of In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83 sequence identity, yet more preferably at least about 84 sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87 sequence identity, yet more preferably at least about 88 sequence identity, yet more preferably at least about 89 sequence identity, yet more preferably o at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more O preferably at least about 92% sequence identity, yet more preferably at least about 93 sequence identity, yet (3j) more preferably at least about 94% sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99%
C
rc 5 sequence identity to a DNA molecule having the sequence of nucleotides from about 60 or about 138 to about 1025, inclusive, of Figure 37 (SEQ ID NO:71), or the complement of the DNA molecule of In another aspect, the isolated nucleic acid molecule comprises the nucleotide sequence of from about 60 or about 138 to about 1025, inclusive, of Figure 37 (SEQ ID NO:71 or the complement of the nucleotide Ssequence of In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide o sequence having at least about 80% sequence identity, preferably at least about 81 sequence identity, more Ci preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84 sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91 sequence identity, yet more preferably at least about 92 sequence identity, yet more preferably at least about 93 sequence identity, yet more preferably at least about 94 sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96 sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to a DNA molecule that encodes the same mature polypeptide encoded by the human protein cDNA deposited with the ATCC on August 3, 1999 under ATCC Deposit No.
479-PTA (DNA96850-2705) or the complement of the DNA molecule of In a preferred embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding the same mature polypeptide encoded by the human protein cDNA deposited with the ATCC on August 3, 1999 under ATCC Deposit No.
479-PTA (DNA96850-2705) or the complement of the nucleotide sequence of In another aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide sequence having at least about 80% sequence identity, preferably at least about 81 sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84 sequence identity, yet more preferably at least about 85 Sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91 sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93 sequence identity, yet more preferably at least about 94 sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more O preferably at least about 99 sequence identity to the full-length polypeptide coding sequence of the human Sprotein cDNA deposited with the ATCC on August 3, 1999 under ATCC Deposit No. 479-PTA (DNA96850- 2705) or the complement of the nucleotide sequence of In a preferred embodiment, the isolated nucleic ;Z acid molecule comprises ihe full-length polypeptide coding sequence of the DNA deposited with the ATCC on August 3, 1999 under ATCC Deposit No. 479-PTA (DNA96850-2705) or the cbmplement of the nucleotide sequence of In another aspect, the invention concerns an isolated nucleic acid molecule which encodes an active V) PR06030 polypeptide as defined below comprising a nucleotide sequence that hybridizes to the complement of a nucleic acid sequence that encodes amino acids 1 or about 27 to about 322, inclusive, of Figure 38 (SEQ ID O NO:72). Preferably, hybridization occurs under stringent hybridization and wash conditions.
C) 10 In yet another aspect, the invention concerns an isolated nucleic acid molecule which encodes an active SPR06030 polypeptide as defined below comprising a nucleotide sequence that hybridizes to the complement of the nucleic acid sequence between about nucleotides 60 or about 138 and about 1025, inclusive, of Figure 37 (SEQ ID NO:71). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 528 nucleotides and which is produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR06030 polypeptide having the sequence of amino acid residues from about 1 or about 27 to about 322, inclusive, of Figure 38 (SEQ ID NO:72), or the complement of the DNA molecule of and, if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 81 sequence identity, more preferably at least about an 82% sequence identity, yet more preferably at least about an 83% sequence identity, yet more preferably at least about an 84% sequence identity, yet more preferably at least about an 85 sequence identity, yet more preferably at least about an 86% sequence identity, yet more preferably at least about an 87 sequence identity, yet more preferably at least about an 88% sequence identity, yet more preferably at least about an 89% sequence identity, yet more preferably at least about a sequence identity, yet more preferably at least about a 91 sequence identity, yet more preferably at least about a 92 sequence identity, yet more preferably at least about a 93 sequence identity, yet more preferably at least about a 94 sequence identity, yet more preferably at least about a 95 sequence identity, yet more preferably at least about a 96% sequence identity, yet more preferably at least about a 97% sequence identity, yet more preferably at least about a 98% sequence identity and yet more preferably at least about a 99% sequence identity to or and isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide sequence encoding a polypeptide scoring at least about 80% positives, preferably at least about 81% positives, more preferably at least about 82% positives, yet more preferably at least about 83% positives, yet more preferably at least about 84% positives, yet more preferably at least about 85% positives, yet more preferably at least about 86 positives, yet more preferably at least about 87 positives, yet more preferably at least about 88% positives, yet more preferably at least about 89% positives, yet more preferably at least about positives, yet more preferably at least about 91% positives, yet more preferably at least about 92% positives, yet more preferably at least about 93% positives, yet more preferably at least about 94% positives, yet more o preferably at least about 95% positives, yet more preferably at least about 96% positives, yet more preferably at least about 97% positives, yet more preferably at least about 98% positives and yet more preferably at least (about 99% positives when compared with the amino acid sequence of residues about 1 or about 27 to 322, inclusive, of Figure 38 (SEQ ID NO:72), or the complement of the nucleotide sequence of In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding cf 5 a PR06030 polypeptide without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 26 in the sequence of Figure 38 (SEQ ID NO:72). It is noted, however, that the C-terminal boundary of the signal peptide may vary, but most likely O by no more than about 5 amino acids on either side of the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary of the signal peptide may be identified pursuant to criteria routinely employed in the art for identifying that type of amino acid sequence element Nielsen et al., Prot.
Cl Eng. 10:1-6 (1997) and von Heinje et al., Nucl. Acids. Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, insomecases, cleavageof a signal sequence from a secreted polypeptide is not entirely uniform, resulting in more than one secreted species. These polypeptides, and the polynucleotides encoding them, are contemplated by the present invention. As such, for purposes of the present application, the signal peptide of the PR06030 polypeptide shown in Figure 38 (SEQ ID NO:72) extends from amino acids 1 to X of Figure 38 (SEQ ID NO:72), wherein X is any amino acid from 21 to 31 of Figure 38 (SEQ ID NO:72). Therefore, mature forms of the PR06030 polypeptide which are encompassed by the present invention include those comprising amino acids X to 322 of Figure 38 (SEQ ID NO:72), wherein X is any amino acid from 21 to 31 of Figure 38 (SEQ ID NO:72) and variants thereof as described below. Isolated nucleic acid molecules encoding these polypeptides are also contemplated.
Another aspect the invention provides an isolated nucleic acid molecule comprising a nucleotide sequence encoding a PR06030 polypeptide which is either transmembrane domain-deleted or transmembrane domain-inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane domain has been tentatively identified as extending from about amino acid position 142 to about amino acid position 158 in the sequence of Figure 38 (SEQ ID NO:72). Therefore, soluble extracellular domains of the herein described PR06030 polypeptides are contemplated.
In this regard, another aspect of the present invention is directed to an isolated nucleic acid molecule which comprises a nucleotide sequence having at least about 80% sequence identity, preferably at least about 81 sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95% sequence identity, yet more preferably at least about 96% sequence t identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to a DNA molecule encoding &JQ amino acids 1 to X of Figure 38 (SEQ ID NO:72), where X is any amino acid from 137 to 147 of Figure 38 (SEQ ID NO:72), or the complement of the DNA molecule of In a specific aspect, the isolated nucleic acid molecule comprises a nucleotide sequence which encodes amino acids I to X of Figure 38 (SEQ ID cf 5 NO:72), where X is any amino acid from 137 to 147 of Figure 38 (SEQ ID NO:72), or is the complement of the DNA molecule of In yet another aspect of the present invention, the isolated nucleic acid molecule encodes a polypeptide scoring at least about 80% positives, preferably at least about 81 positives, more preferably at least O about 82% positives, yet more preferably at least about 83 positives, yet more preferably at least about 84% tn 10 positives, yet more preferably at least about 85% positives, yet more preferably at least about 86% positives, 'n yet more preferably at least about 87% positives, yet more preferably at least about 88% positives, yet more Spreferably at least about 89% positives, yet more preferably at least about 90 positives, yet more preferably at least about 91 positives, yet more preferably at least about 92% positives, yet more preferably at least about 93% positives, yet more preferably at least about 94% positives, yet more preferably at least about positives, yet more preferably at least about 96 positives, yet more preferably at least about 97% positives, yet more preferably at least about 98% positives and yet more preferably at least about 99% positives when compared with the amino acid sequence of residues about 1 to X of Figure 38 (SEQ ID NO:72), where X is any amino acid from 137 to 147 of Figure 38 (SEQ ID NO:72), or is the complement of the DNA molecule of Another embodiment is directed to fragments of a PR06030 polypeptide coding sequence that may find use as, for example, hybridization probes or for encoding fragments of a PR06030 polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PR06030 antibody. Such nucleic acid fragments are usually at least about 20 nucleotides in length, preferably at least about 30 nucleotides in length, more preferably at least about 40 nucleotides in length, yet more preferably at least about 50 nucleotides in length, yet more preferably at least about 60 nucleotides in length, yet more preferably at least about nucleotides in length, yet more preferably at least about 80 nucleotides in length, yet more preferably at least about 90 nucleotides in length, yet more preferably at least about 100 nucleotides in length, yet more preferably at least about 110 nucleotides in length, yet more preferably at least about 120 nucleotides in length, yet more preferably at least about 130 nucleotides in length, yet more preferably at least about 140 nucleotides in length, yet more preferably at least about 150 nucleotides in length, yet more preferably at least about 160 nucleotides in length, yet more preferably at least about 170 nucleotides in length, yet more preferably at least about 180 nucleotides in length, yet more preferably at least about 190 nucleotides in length, yet more preferably at least about 200 nucleotides in length, yet more preferably at least about 250 nucleotides in length, yet more preferably at least about 300 nucleotides in length, yet more preferably at least about 350 nucleotides in length, yet more preferably at least about 400 nucleotides in length, yet more preferably at least about 450 nucleotides in length, yet more preferably at least about 500 nucleotides in length, yet more preferably at least about 600 nucleotides in length, yet more preferably at least about 700 nucleotides in length, yet more preferably at least about 800 O nucleotides in length, yet more preferably at least about 900 nucleotides in length and yet more preferably at least CI about 1000 nucleotides in length, wherein in this context the term "about" means the referenced nucleotide sequence length plus or minus 10% of that referenced length. In a preferred embodiment, the nucleotide ;Z sequence fragment is derived from any coding region of the nucleotide sequence shown in Figure 37 (SEQ ID NO:71). It is noted that novel fragments of a PR06030 polypeptide-encoding nucleotide sequence may be determined in a routine manner by aligning the PR06030 polypeptide-encoding nucleotide sequence with other known nucleotide sequences using any of a number of well known sequence alignment programs and determining which PR06030 polypeptide-encoding nucleotide sequence fragment(s) are novel. All of such PR06030 polypeptide-encoding nucleotide sequences are contemplated herein and can be determined without undue Sexperimentation. Also contemplated are the PR06030 polypeptide fragments encoded by these nucleotide 10 molecule fragments, preferably those PRO6030 polypeptide fragments that comprise a binding site for an anti- SPR06030 antibody.
In another embodiment, the invention provides isolated PR06030 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR06030 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues from about 1 or about 27 to about 322 of Figure 38 (SEQ ID NO:72).
In another aspect, the invention concerns an isolated PR06030 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 81 sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83 sequence identity, yet more preferably at least about 84 sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91 sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93 sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95 sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to the sequence of amino acid residues from about 1 or about 27 to about 322, inclusive, of Figure 38 (SEQ ID NO:t2).
In a further aspect, the invention concerns an isolated PR06030 polypeptide comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83 sequence identity, yet more preferably at least about 84 sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91 sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least o about 93% sequence identity, yet more preferably at least about 94 sequence identity, yet more preferably at O least about 95 sequence identity, yet more preferably at least about 96 sequence identity, yet more preferably b) at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more ;Z preferably at least about 99% sequence identity to an amino acid sequence encoded by the human protein cDNA Sdeposited with the ATCC on August 3, 1999 under ATCC Deposit No. 479-PTA (DNA96850-2705). In a preferred embodiment, the isolated PR06030 polypeptide comprises an amino acid sequence encoded by the human protein cDNA deposited with the ATCC on August 3, 1999 under ATCC Deposit No. 479-PTA T" (DNA96850-2705).
I> In a further aspect, the invention concerns an isolated PR06030 polypeptide comprising an amino acid o sequence scoring at least about 80% positives, preferably at least about 81 positives, more preferably at least about 82% positives, yet more preferably at least about 83% positives, yet more preferably at least about 84 0positives, yet more preferably at least about 85% positives, yet more preferably at least about 86% positives, Cp yet more preferably at least about 87% positives, yet more preferably at least about 88% positives, yet more preferably at least about 89% positives, yet more preferably at least about 90% positives, yet more preferably at least about 91 positives, yet more preferably at least about 92 positives, yet more preferably at least about 93% positives, yet more preferably at least about 94% positives, yet more preferably at least about positives, yet more preferably at least about 96% positives, yet more preferably at least about 97% positives, yet more preferably at least about 98% positives and yet more preferably at least about 99% positives when compared with the amino acid sequence of residues from about 1 or about 27 to about 322, inclusive, of Figure 38 (SEQ ID NO:72).
In a specific aspect, the invention provides an isolated PR06030 polypeptide without the N-terminal signal sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding nucleic acid molecule under conditions suitable for expression of the PR06030 polypeptide and recovering the PR06030 polypeptide from the cell culture.
Another aspect the invention provides an isolated PR06030 polypeptide which is either transmembrane domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein described, wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding nucleic acid molecule under conditions suitable for expression of the PR06030 polypeptide and recovering the PR06030 polypeptide from the cell culture.
As such, one aspect of the present invention is directed to an isolated soluble PR06030 polypeptide which comprises an amino acid sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84% sequence identity, yet more preferably at least about 85 sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87 sequence identity, yet more preferably at least about 88 sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more o preferably at least about 91 sequence identity, yet more preferably at least about 92 sequence identity, yet more preferably at least about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95% sequence identity, yet more preferably at least about 96% sequence ;Z identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to amino acids I to X of Figure 38 (SEQ ID NO:72), where X is any amino acid from 137 to 147 of Figure 38 (SEQ ID NO:72). [na preferred aspect, the isolated soluble PR06030 polypeptide. comprises amino acids I to X of Figure 38 (SEQ ID NO:72), where X is any amino acid from 137 to 147 of Figure 38 (SEQ ID NO:72).
I> In yet another aspect of the present invention, the isolated soluble PR06030 polypeptide comprises an O amino acid sequence which scores at least about 80% positives, preferably at least about 81 positives, more V' 0 peeal tlataot8%psiieytmr rfrbya es bot8%pstvs e oepeeal 1prfbyat least about 82 positives, yet more preferably at least about 83 positives yet more preferably es bu 860oiieytmr rfrbya es bu 7 oiieytmr rfrbya es bu 8 o t estau 8%positives, yet more preferably at least about 8 5% positives, yet more preferably at least about %pstvs C86poie 1 yet more preferably at least about 87 positives, yet more preferably at least about 88%vs ytmr preferably at least about 939% positives, yet more preferably at least about 90 positives,yemoepfral ytmrpfrbyat least about 91% positives, yet more preferably at least about 92% positives, yet moreprfabytlesaou 97% positives, yet more preferably at least about 98% positives and yet more preferably at least about 99% positives when compared with the amino acid sequence of residues about I to X of Figure 38 (SEQ ID NO:72), where X is any amino acid fromn 137 to 147 of Figure 38 (SEQ ID NO:72).
In yet another aspect, the invention concerns an isolated PR06030 polypeptide, comprising the sequence of amino acid residues from about I or about 27 to about 322, inclusive, of Figure 38 (SEQ ID NO:72), or a fragment thereof which is biologically active or sufficient to provide a binding site for an anti-PR06030 antibody, wherein the identification of PR06030 polypeptide fragments that possess biological activity or provide a binding site for an anti-PR06030 antibody may be accomplished in a routine manner using techniques which are well known in the art. Preferably, the PR06030 fragment retains a qualitative biological activity of a native PR06030 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR06030 polypeptide having the sequence of amino acid residues from about I or about 27 to about 322, inclusive, of Figure 38 (SEQ ID NO:72), or the complement of the DNA molecule of and if the test DNA molecule has at least about an sequence identity, preferably at least about an 81% sequence identity, more preferably at least about on 82% sequence identity, yet more preferably at least about an 83 sequence identity, yet more preferably at least about an 84 sequence identity, yet more preferably at least about an 85 sequence identity, yet more preferably at least about an 86 sequence identity, yet more preferably at least about an 87 sequence identity, yet more preferably at least about an 88 sequence identity, yet more preferably at least about an 89 sequence identity, yet more preferably at least about a 90% sequence identity, yet more preferably at least about a 91 sequence identity, yet more preferably at least about a 92% sequence identity, yet more preferably at least about Sa 93% sequence identity, yet more preferably at least about a 94% sequence identity, yet more preferably at least 0 about a 95% sequence identity, yet more preferably at least about a 96% sequence identity, yet more preferably bJ) at least about a 97% sequence identity, yet more preferably at least about a 98% sequence identity and yet more preferably at least about a 99% sequence identity to or (ii) culturing a host cell comprising the test DNA _molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from c 5 the cell culture.
In a still further embodiment, the invention concerns a composition of matter comprising a PR06030 polypeptide, or an agonist or antagonist of a PR06030 polypeptide as herein described, or an anti-PR06030 antibody, in combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier.
O Another embodiment of the present invention is directed to the use of a PR06030 polypeptide, or an tn 10 agonist or antagonist thereof as herein described, or an anti-PR06030 antibody, for the preparation of a o medicament useful in the treatment of a condition which is responsive to the PR06030 polypeptide, an agonist C or antagonist thereof or an anti-PR06030 antibody.
PR04424 A cDNA clone (DNA96857-2636) has been identified that encodes a novel polypeptide having homology to a protein in GenBank, accession number HGS_A 135 and designated in the present application as "PR04424".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04424 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04424 polypeptide having the sequence of amino acid residues from I or about 29 to about 221, inclusive of Figure 40 (SEQ ID NO:74), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04424 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 136 and about 714, inclusive, of Figure 39 (SEQ ID NO:73). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 17-PTA (DNA96857-2636), or the complement of the DNA molecule of(a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 17-PTA (DNA96857-2636).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence o identity to the sequence of amino acid residues from about 29 to about 221, inclusive of Figure 40 (SEQ ID 0 NO:74), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04424 polypeptide having the sequence of c n 5 amino acid residues from about 29 to about 221, inclusive of Figure 40 (SEQ ID NO:74), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90 sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule, SIn another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more Spreferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 29 to about 221, inclusive of Figure 40 (SEQ ID NO:74), or the complement of the DNA of Another embodiment is directed to fragments of a PR04424 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR04424 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04424 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 29 through 221 of Figure 40 (SEQ ID NO:74).
In another aspect, the invention concerns an isolated PR04424 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 29 to about 221, inclusive of Figure 40 (SEQ ID NO:74).
In a further aspect, the invention concerns an isolated PR04424 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 29 through 221 of Figure 40 (SEQ ID NO:74).
In yet another aspect, the invention concerns an isolated PR04424 polypeptide, comprising the sequence of amino acid residues 29 to about 221, inclusive of Figure 40 (SEQ ID NO:74), or a fragment thereof sufficient to provide a binding site for an anti-PR04424 antibody. Preferably, the PR04424 fragment retains a qualitative biological activity of a native PR04424 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04424 polypeptide having the sequence of amino acid residues from about 29 to about 221, inclusive of Figure 40 (SEQ ID NO:74), or (b) Sthe complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence O identity, preferably at least about an 85% sequence identity, more-preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
21. PR04422 A cDNA clone (DNA96867-2620) has been identified that encodes a novel polypeptide having homology 17- to lysozyme g and designated in the present application as "PR04422".
o In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding C 10 a PR04422 polypeptide.
SIn one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 sequence identity to a DNA molecule encoding a PR04422 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 194, inclusive of Figure 42 (SEQ ID NO:76), or the complement of the DNA molecule of In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04422 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 375 and about 899, inclusive, of Figure 41 (SEQ ID NO:75). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203972 (DNA96867-2620), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203972 (DNA96867-2620).
in a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 20 to about 194, inclusive of Figure 42 (SEQ ID NO:76), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04422 polypeptide having the sequence of amino acid residues from about 20 to about 194, inclusive of Figure 42 (SEQ ID NO:76), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preterably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably o at least about a 95 sequence identity to or isolating the test DNA molecule.
CI In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more ;Z preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 20 to about 194, inclusive of Figure 42 (SEQ ID NO:76), or the complement c 5 of the DNA of Another embodiment is directed to fragments of a PR04422 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through 0 about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
S 10 In another embodiment, the invention provides isolated PR04422 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
Cl In a specific aspect, the invention provides isolated native sequence PR04422 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 20 through 194 of Figure 42 (SEQ ID NO:76).
In another aspect, the invention concerns an isolated PR04422 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 20 to about 194, inclusive of Figure 42 (SEQ ID NO:76).
In a further aspect, the invention concerns an isolated PR04422 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 20 through 194 of Figure 42 (SEQ ID NO:76).
In yet another aspect, the invention concerns an isolated PR04422 polypeptide, comprising the sequence of amino acid residues 20 to about 194, inclusive of Figure 42 (SEQ ID NO:76), or a fragment thereof sufficient to provide a binding site for an anti-PR04422 antibody. Preferably, the PR04422 fragment retains a qualitative biological activity of a native PR04422 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04422 polypeptide having the sequence of amino acid residues from about 20 to about 194, inclusive of Figure 42 (SEQ ID NO:76), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
22. PRO4430 O A cDNA clone (DNA96878-2626) has been identified that encodes a novel polypeptide having homology to a protein in GenBank, accession number MMHC213L3_9, and designated in the present application as "PRO4430".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding c 5 a PR04430 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 1 preferably at least about 95 sequence identity to a DNA molecule encoding a PR04430 polypeptide having 0 the sequence of amino acid residues from 1 or about 19 to about 125, inclusive of Figure 44 (SEQ ID NO:78), S 10 or the complement of the DNA molecule of o In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04430 Cl polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 110 and about 430, inclusive, of Figure 43 (SEQ ID NO:77). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80%.sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 23-PTA (DNA96878-2626), or the complement of the DNA molecule of In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 23-PTA (DNA96878-2626).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 19 to about 125, inclusive of Figure 44 (SEQ ID NO:78), or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04430 polypeptide having the sequence of amino acid residues from about 19 to about 125, inclusive of Figure 44 (SEQ ID NO:78), or the complement of the DNA molecule of and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 19 to about 125, inclusive of Figure 44 (SEQ ID NO:78), or the complement 0 of the DNA of 0 Another embodiment is directed to fragments of a PRO4430polypeptide coding sequence that may find Suse as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR04430 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
SIn a specific aspect, the invention provides isolated native sequence PR04430 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 19 through 125 of Figure 44 (SEQ ID o NO:78).
in 10 In another aspect, the invention concerns an isolated PR04430 polypeptide, comprising an amino acid o sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more C1 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 19 to about 125, inclusive of Figure 44 (SEQ ID NO:78).
In a further aspect, the invention concerns an isolated PR04430 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 19 through 125 of Figure 44 (SEQ ID NO:78).
In yet another aspect, the invention concerns an isolated PR04430 polypeptide, comprising the sequence of amino acid residues 19 to about 125, inclusive of Figure 44 (SEQ ID NO:78), or a fragment thereof sufficient to provide a binding site for an anti-PR04430 antibody. Preferably, the PR04430 fragment retains a qualitative biological activity of a native PR04430 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04430 polypeptide having the sequence of amino acid residues from about 19 to about 125, inclusive of Figure 44 (SEQ ID NO:78), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
23. PRO4499 A cDNA clone (DNA96889-2641) has been identified that encodes a novel polypeptide and designated in the present application as "PR04499".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR04499 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most O preferably at least about 95% sequence identity to a DNA molecule encoding a PR04499 polypeptide having Sthe sequence of amino acid residues from 1 or about 31 to about 339, inclusive of Figure 46 (SEQ ID Sor the complement of the DNA molecule of ;Z In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR04499 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 275 and about 1201, inclusive, of Figure 45 (SEQ ID NO:79). Preferably, hybridization occurs under stringent hybridization and wash conditions.
lt In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least Sabout 90% sequence identity, most preferably at least about 95% sequence identity to a DNA molecule t 10 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 119-PTA o (DNA96889-2641), or the complement of the DNA molecule of In a preferred embodiment, the nucleic Sacid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 119-PTA (DNA96889-2641).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85 sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 31 to about 339, inclusive of Figure 46 (SEQ ID or the complement of the DNA of In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PR04499 polypeptide having the sequence of amino acid residues from about 31 to about 339, inclusive of Figure 46 (SEQ ID NO:80), or the complement of the DNA molecule of(a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 sequence identity, more preferably at least about a 90 sequence identity, most preferably at least about a 95% sequence identity to or isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid moleculecomprising DNA encoding a PR04499 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 30 in the sequence of Figure 46 (SEQ ID NO:80). The transmembrane domain has been tentatively identified as extending from about amino acid position 171 through about amino acid position 190 in the PR04499 amino acid sequence (Figure 46, SEQ ID In another aspect, the invention concerns an isolated nucleic acid molecule comprising DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 31 to about 339, inclusive of Figure.46 (SEQ ID NO:80), or the complement of the DNA of O Another embodiment is directed to fragments of a PR04499 polypeptide coding sequence that may find c.i use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in 0length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR04499 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR04499 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 31 through 339 of Figure 46 (SEQ ID OIn another aspect, the invention concerns an isolated PR04499 polypeptide, comprising an amino acid tf 10 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more o preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 'I sequence of amino acid residues 31 to about 339, inclusive of Figure 46 (SEQ ID In a further aspect, the invention concerns an isolated PR04499 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 positives, most preferably at least about 95 positives when compared with the amino acid sequence of residues 31 through 339 of Figure 46 (SEQ ID In yet another aspect, the invention concerns an isolated PR04499 polypeptide, comprising the sequence of amino acid residues 31 to about 339, inclusive of Figure 46 (SEQ ID NO:80), or a fragment thereof sufficient to provide a binding site for an anti-PR04499 antibody. Preferably, the PR04499 fragment retains a qualitative biological activity of a native PR04499 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by hybridizing a test DNA molecule under stringent conditions with a DNA molecule encoding a PRO4499 polypeptide having the sequence of amino acid residues from about 31 to about 339, inclusive of Figure 46 (SEQ ID NO:80), or (b) the complement of the DNA molecule of and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 sequence identity to or (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
24. Additional Embodiments In other embodiments of the present invention, the invention provides vectors comprising DNA encoding any of the herein described polypeptides. Host cell comprising any such vector are also provided. By way of example, the host cells may be CHO cells, E. coil, or yeast. A process for producing any of the herein described polypeptides is further provided and comprises culturing host cells under conditions suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell culture.
In other embodiments, the invention provides chimeric molecules comprising any of the herein described polypeptides fused to a heterologous polypeptide or amino acid sequence. Example of such chimeric molecules o3 comprise any of the herein described polypeptides fused to an epitope tag sequence or a Fc region of an C immunoglobulin.
In another embodiment, the invention provides an antibody which specifically binds to any of the above ;Z or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized antibody, antibody fragment or single-chain antibody.
c 5 In yet other embodiments, the invention provides oligonucleotide probes useful for isolating genomic and cDNA nucleotide sequences or as antisense probes, wherein those probes may be derived from any of the above or below described nucleotide sequences.
In other embodiments, the invention provides an isolated nucleic acid molecule comprising a nucleotide O sequence that encodes a PRO polypeptide.
tn 10 In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about o 80 nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, alternatively C. at least about 82% nucleic acid sequence identity, alternatively at least about 83 nucleic acid sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87 nucleic acid sequence identity, alternatively at least about 88 nucleic acid sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid sequence identity, alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 97 nucleic acid sequence identity, alternatively at least about 98% nucleic acid sequence identity and alternatively at least about 99% nucleic acid sequence identity to a DNA molecule encoding a PRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any other specifically defined fragment of the full-length amino acid sequence as disclosed herein, or the complement of the DNA molecule of In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about nucleic acid sequence identity, alternatively at least about 81 nucleic acid sequence identity, alternatively at least about 82 nucleic acid sequence identity, alternatively at least about 83 nucleic acid sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93 nucleic acid sequence identity, alternatively at least about 94 nucleic acid sequence identity, alternatively at least about 95 nucleic acid sequence identity, alternatively at least about 96 nucleic acid sequence identity, alternatively at least about 97 nucleic acid sequence identity, O alternatively at least about 98% nucleic acid sequence identity and alternatively at least about 99% nucleic acid (Ci sequence identity to a DNA molecule comprising the coding sequence of a full-length PRO polypeptide cDNA as disclosed herein, the coding sequence of a PRO polypeptide lacking the signal peptide as disclosed herein, the coding sequence of an extracellular domain of a transmembrane PRO polypeptide, with or without the signal peptide, as disclosed herein or the coding sequence of any other specifically defined fragment of the full-length ¢Cf 5 amino acid sequence as disclosed herein, or the complement of the DNA molecule of In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 81 nucleic acid sequence identity, alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83 O nucleic acid sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at tV 10 least about 85 nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, o alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid C.i sequence identity, alternatively at least about 89 nucleic acid sequence identity, alternatively at least about 90 nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid sequence identity, alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 96 nucleic acid sequence identity, alternatively at least about 97 nucleic acid sequence identity, alternatively at least about 98 nucleic acid sequence identity and alternatively at least about 99% nucleic acid sequence identity to a DNA molecule that encodes the same mature polypeptide encoded by any of the human protein cDNAs deposited with the ATCC as disclosed herein, or (b) the complement of the DNA molecule of Another aspect the invention provides an isolated nucleic acid molecule comprising a nucleotide sequence encoding a PRO polypeptide which is either transmembrane domain-deleted or transmembrane domaininactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane domain(s) of such polypeptide are disclosed herein. Therefore, soluble extracellular domains of the herein described PRO polypeptides are contemplated.
Another embodiment is directed to fragments of a PRO polypeptide coding sequence, or the complement thereof, that may find use as, for example, hybridization probes, for encoding fragments of a PRO polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PRO antibody or as antisense oligonucleotide probes. Such nucleic acid fragments are usually at least about 20 nucleotides in length, alternatively at least about 30 nucleotides in length, alternatively at least about 40 nucleotides in length, alternatively at least about 50 nucleotides in length, alternatively at least about 60 nucleotides in length, alternatively at least about 70 nucleotides in length, alternatively at least about 80 nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least about 100 nucleotides in length, alternatively at least about 110 nucleotides in length, alternatively at least about 120 nucleotides in length, alternatively at least about 130 nucleotides in length, alternatively at least about 140 nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least about 160 nucleotides in length, alternatively at least about 170 nucleotides in length, alternatively at least about 180 nucleotides in length, o alternatively at least about 190 nucleotides in length, alternatively at least about 200 nucleotides in length, Salternatively at least about 250 nucleotides in length, alternatively at least about 300 nucleotides in length, c-I) alternatively at least about 350 nucleotides in length, alternatively at least about 400 nucleotides in length, alternatively at least about 350 nucleotides in length, alternatively at least about 400 nucleotides in length, alternatively at least about 450 nucleotides in length, alternatively at least about 500 nucleotides in length, 5 alternatively at least about 600 nucleotides in length, alternatively at least about 700 nucleotides in length, Cf 5 alternatively at least about 800 nucleotides in length, alternatively at least about 900 nucleotides in length and alternatively at least about 1000 nucleotides in length, wherein in this context the term "about" means the referenced nucleotide sequence length plus or minus 10% of that referenced length. It is noted that novel 1> fragments of a PRO polypeptide-encoding nucleotide sequence may be determined in a routine manner by Saligning the PRO polypeptide-encoding nucleotide sequence with other known nucleotide sequences using any 10 of a number of well known sequence alignment programs and determining which PRO polypeptide-encoding Snucleotide sequence fragment(s) are novel. All of such PRO polypeptide-encoding nucleotide sequences are C. contemplated herein. Also contemplated are the PRO polypeptide fragments encoded by these nucleotide molecule fragments, preferably those PRO polypeptide fragments that comprise a binding site for an anti-PRO antibody.
In another embodiment, the invention provides isolated PRO polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least about 83% amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively at least about 85% amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, alternatively at least about 87% amino acid sequence identity, alternatively at least about 88% amino acid sequence identity, alternatively at least about 89% amino acid sequence identity, alternatively at least about amino acid sequence identity, alternatively at least about 91 amino acid sequence identity, alternatively at least about 92% amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, alternatively at least about 94% amino acid sequence identity, alternatively at least about 95% amino acid sequence identity, alternatively at least about 96% amino acid sequence identity, alternatively at least about 97 amino acid sequence identity, alternatively at least about 98 amino acid sequence identity and alternatively at least about 99% amino acid sequence identity to a PRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any other specifically defined fragment of the full-length amino acid sequence as disclosed herein.
In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid sequence having at least about 80% amino acid sequence identity, alternatively at least about 81 amino acid sequence identity, alternatively at least about 82 amino acid sequence identity, alternatively at least about 83 amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively at least about 85% amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, o alternatively at least about 87% amino acid sequence identity, alternatively at least about 88% amino acid 0, sequence identity, alternatively at least about 89% amino acid sequence identity, alternatively at least about amino acid sequence identity, alternatively at least about 9189 amino acid sequence identity, alternatively at least about 92% amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, alternatively at least about 94% amino acid sequence identity, alternatively at least about 95% amino acid c n 5 sequence identity, alternatively at least about 96% amino acid sequence identity, alternatively at least about 97 amino acid sequence identity, alternatively at least about 98% amino acid sequence identity and alternatively at least about 99% amino acid sequence identity to an amino acid sequence encoded by any of the human protein cDNAs deposited with the ATCC as disclosed herein.
O In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid l 10 sequence scoring at least about 80% positives, alternatively at least about 81% positives, alternatively at least S about 82% positives, alternatively at least about 83% positives, alternatively at least about 84% positives, alternatively at least about 85 positives, alternatively at least about 86% positives, alternatively at least about 87% positives, alternatively at least about 88% positives, alternatively at least about 89% positives, alternatively at least about 90% positives, alternatively at least about 91% positives, alternatively at least about 92% positives, alternatively at least about 93 positives, alternatively at least about 94 positives, alternatively at least about positives, alternatively at least about 96% positives, alternatively at least about 97% positives, alternatively at least about 98% positives and alternatively at least about 99% positives when compared with the amino acid sequence of a PRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any other specifically defined fragment of the full-length amino acid sequence as disclosed herein.
In a specific aspect, the invention provides an isolated PRO polypeptide without the N-terminal signal sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and recovering the PRO polypeptide from the cell culture.
Another aspect the invention provides an isolated PRO polypeptide which is either transmembrane domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein described, wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and recovering the PRO polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO polypeptide as defined herein. In a particular embodiment, the agonist or antagonist is an anti-PRO antibody or a small molecule.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a PRO polypeptide which comprise contacting the PRO polypeptide with a candidate molecule and monitoring a o biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native PRO C polypeptide.
In a still further embodiment, the invention concerns a composition of matter comprising a PRO polypeptide, or an agonist or antagonist of a PRO polypeptide as herein described, or an anti-PRO antibody, in _combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier.
c 5 Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an agonist or antagonist thereof as hereinbefore described, or an anti-PRO antibody, for the preparation of a medicament useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or antagonist I thereof or an anti-PRO antibody.
I 10 BRIEF DESCRIPTION OF THE DRAWINGS o Figure 1 shows a nucleotide sequence (SEQ ID NO: 1) of a native sequence PRO1484 cDNA, wherein Ci SEQ ID NO:1 is a clone designated herein as "DNA44686-1653".
Figure 2 shows the amino acid sequence (SEQ ID NO:2) derived from the coding sequence of SEQ ID NO:1 shown in Figure 1.
Figure 3 shows a nucleotide sequence (SEQ ID NO:8) of a native sequence PR04334 cDNA, wherein SEQ ID NO:8 is a clone designated herein as "DNA59608-2577".
Figure 4 shows the amino acid sequence (SEQ ID NO:9) derived from the coding sequence of SEQ ID NO:8 shown in Figure 3.
Figure 5 shows a nucleotide sequence (SEQ ID NO:10) of a native sequence PRO1122 cDNA, wherein SEQ ID NO:10 is a clone designated herein as "DNA62377-1381".
Figure 6 shows the amino acid sequence (SEQ ID NO: 11) derived from the coding sequence of SEQ ID NO:10 shown in Figure Figure 7 shows a nucleotide sequence (SEQ ID NO: 15) of a native sequence PRO1889 cDNA, wherein SEQ ID NO: 15 is a clone designated herein as "DNA77623-2524".
Figure 8 shows the amino acid sequence (SEQ ID NO: 16) derived from the coding sequence of SEQ ID NO: 15 shown in Figure 7.
Figure 9 shows a nucleotide sequence (SEQ ID NO: 17) of a native sequence PRO 1890 cDNA, wherein SEQ ID NO: 17 is a clone designated herein as "DNA79230-2525".
Figure 10 shows the amino acid sequence (SEQ ID NO: 18) derived from the coding sequence of SEQ ID NO;17 shown in Figure 9.
Figure 11 shows a nucleotide sequence (SEQ ID NO:22) of a native sequence PRO1887 cDNA, wherein SEQ ID NO:22 is a clone designated herein as "DNA79862-2522".
Figure 12 shows the amino acid sequence (SEQ ID NO:23) derived from the coding sequence of SEQ ID NO:22 shown in Figure 11.
Figure 13 shows anucleotide sequence (SEQ ID NO:28) of a native sequence PR01785 cDNA, wherein SEQ ID NO:28 is a clone designated herein as "DNA80136-2503".
o Figure 14 shows the amino acid sequence (SEQ ID NO:29) derived from the coding sequence of SEQ ID NO:28 shown in Figure 13.
S) Figure 15 shows a nucleotide sequence (SEQ ID NO:34) of a native sequence PR04353 cDNA, wherein SEQ ID NO:34 is a clone designated herein as "DNA80145-2594".
Figure 16 shows the amino acid sequence (SEQ ID NO;35) derived from the coding sequence of SEQ ID NO:34 shown in Figure Figure 17 shows a nucleotide sequence (SEQ ID NO:39) of a native sequence PRO4357 cDNA, wherein SEQ ID NO:39 is a clone designated herein as "DNA84917-2597".
Figure 18 shows the amino acid sequence (SEQ ID NO:40) derived from the coding sequence of SEQ SID NO:39 shown in Figure 17.
tn 10 Figure 19shows a nucleotide sequence (SEQ ID NO:44) of a native sequence PR04405 cDNA, wherein SSEQ ID NO:44 is a clone designated herein as "DNA84920-2614".
i Figure 20 shows the amino acid sequence (SEQ ID NO:45) derived from the coding sequence of SEQ ID NO:44 shown in Figure 19.
Figure 21 shows a nucleotide sequence (SEQ ID NO:49) of a native sequence PR04356 cDNA, wherein SEQ ID NO:49 is a clone designated herein as "DNA86576-2595".
Figure 22 shows the amino acid sequence (SEQ ID NO:50) derived from the coding sequence of SEQ ID NO:49 shown in Figure 21.
Figure 23 shows a nucleotide sequence (SEQ ID NO:51) ofa native sequence PR04352 cDNA, wherein SEQ ID NO:51 is a clone designated herein as "DNA87976-2593".
Figure 24 shows the amino acid sequence (SEQ ID NO:52) derived from the coding sequence of SEQ ID NO:51 shown in Figure 23.
Figure 25 shows a nucleotide sequence (SEQ ID NO:56) of a native sequence PR04380 cDNA, wherein SEQ ID NO:56 is a clone designated herein as "DNA92234-2602".
Figure 26 shows the amino acid sequence (SEQ ID NO:57) derived from the coding sequence of SEQ ID NO:56 shown in Figure Figure 27 shows a nucleotide sequence (SEQ ID NO:58) of a native sequence PR04354 cDNA, wherein SEQ ID NO:58 is a clone designated herein as "DNA92256-2596".
Figure 28 shows the amino acid sequence (SEQ ID NO:59) derived from the coding sequence of SEQ ID NO:58 shown in Figure 27.
Figure 29 shows a nucleotide sequence (SEQ ID NO:60) of a native sequence PR04408 cDNA, wherein SEQ ID NO:60 is a clone designated herein as "DNA92274-2617".
Figure 30 shows the amino acid sequence (SEQ ID NO:61) derived from the coding sequence of SEQ ID NO:60 shown in Figure 29.
Figure 31 shows a nucleotide sequence (SEQ ID NO:62) of a native sequence PR05737 cDNA, wherein SEQ ID NO:62 is a clone designated herein as "DNA92929-2534".
Figure 32 shows the amino acid sequence (SEQ ID NO:63) derived from the coding sequence of SEQ ID NO:62 shown in Figure 31.
Vt Figure 33 shows a nucleotide sequence (SEQ ID NO:64) of a native sequence PR04425 cDNA, wherein 0 0 SEQ ID NO:64 is a clone designated herein as "DNA93011-2637".
Figure 34 shows the amino acid sequence (SEQ ID NO:65) derived from the coding sequence of SEQ ID NO:64 shown in Figure 33.
Figure 35 shows a nucleotide sequence (SEQ ID NO:66) of a native sequence PR05990 cDNA, wherein SEQ ID NO:66 is a clone designated herein as "DNA96042-2682".
Figure 36 shows the amino acid sequence (SEQ ID NO:67) derived from the coding sequence of SEQ ID NO:66 shown in Figure I. Figure 37 shows a nucleotide sequence (SEQ ID NO:71) of a native sequence PR06030 cDNA, wherein S SEQ ID NO:71 is a clone designated herein as "DNA96850-2705".
0 Figure 38 shows the amino acid sequence (SEQ ID NO:72) derived from the coding sequence of SEQ ID S NO:71 shown in Figure 37.
O Figure 39 shows a nucleotide sequence (SEQ ID NO:73) of a native sequence PR04424 cDNA, wherein C] SEQ ID NO:73 is a clone designated herein as "DNA96857-2636".
Figure 40 shows the amino acid sequence (SEQ ID NO:74) derived from the coding sequence of SEQ ID NO:73 shown in Figure 39.
Figure 41 shows a nucleotide sequence (SEQ ID NO:75) of a native sequence PR04422 cDNA, wherein SEQ ID NO:75 is a clone designated herein as "DNA96867-2620".
Figure 42 shows the amino acid sequence (SEQ ID NO:76) derived from the coding sequence of SEQ ID shown in Figure 41.
Figure 43 shows a nucleotide sequence (SEQ ID NO:77) of a native sequence PRO4430 cDNA, wherein SEQ ID NO:77 is a clone designated herein as "DNA96878-2626".
Figure 44 shows the amino acid sequence (SEQ ID NO:78) derived from the coding sequence of SEQ ID NO:77 shown in Figure 43.
Figure 45 shows a nucleotide sequence (SEQ ID NO:79) of a native sequence PR04499 cDNA, wherein SEQ ID NO:79 is a clone designated herein as "DNA96889-264I".
Figure 46 shows the amino acid sequence (SEQ ID NO:80) derived from the coding sequence of SEQ ID NO:79 shown in Figure DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS I. Definitions "In the claims of this application and in the description of the invention, except where the context requires otherwise due to express language or necessary implication, the words "comprise" or variations such as "comprises" or "comprising" are used in an inclusive sense, i.e. to specify the presence of tie stated features but not to preclude the presence or addition of further features in various embodiments of the invention".
The terms "PRO polypeptide" and "PRO" as used herein and when immediately followed by a numerical designation refer to various polypeptides, wherein the complete designation PRO/number) refers to specific polypeptide sequences as described herein. The terms "PRO/number polypeptide" and "PRO/number" wherein the term "number" is provided as an actual numerical designation as used herein encompass native sequence polypeptides and polypeptide variants (which are further defined herein). The PRO polypeptides described herein may be isolated from a variety of sources, such as from human tissue types or from another source, or prepared by recombinant or synthetic methods.
tV O A "native sequence PRO polypeptide' comprises a polypeptide having the same amino acid sequence Cl as the corresponding PRO polypeptide derived from nature. Such native sequence PRO polypeptides can be isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence PRO polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific PRO polypeptide an extracellular domain sequence), naturally-occurring variant forms alternatively spliced forms) and naturally-occurring allelic variants of the polypeptide. In various embodiments of the invention, the native sequence PRO polypeptides disclosed herein are mature or full-length native sequence tl polypeptides comprising the full-length amino acids sequences shown in the accompanying figures. Start and stop codons are shown in bold font and underlined in the figures. However, while the PRO polypeptide o disclosed in the accompanying figures are shown to begin with methionine residues designated herein as amino I 10 acid position 1 in the figures, it is conceivable and possible that other methionine residues located either upstream O or downstream from the amino acid position I in the figures may be employed as the starting amino acid residue for the PRO polypeptides.
The PRO polypeptide "extracellular domain" or "ECD" refers to a form of the PRO polypeptide which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide ECD will have less than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% of such domains. It will be understood that any transmembrane domains identified for the PRO polypeptides of the present invention are identified pursuant to criteria routinely employed in the art for identifying that type of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by no more than about 5 amino acids at either end of the domain as initially identified herein. Optionally, therefore, an extracellular domain of a PRO polypeptide may contain from about 5 or fewer amino acids on either side of the transmembrane domain/extracellular domain boundary as identified in the Examples or specification and such polypeptides, with or without the associated signal peptide, and nucleic acid encoding them, are comtemplated by the present invention.
The approximate location of the "signal peptides" of the various PRO polypeptides disclosed herein are shown in the present specification and/or the accompanying figures. It is noted, however, that the C-terminal boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on either side of the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary of the signal peptide may be identified pursuant to criteria routinely employed in the art for identifying that type of amino acid sequence element Nielsen et al., Prot. Eng. 10:1-6 (1997) and von Heinje et al., Nucl. Acids, Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal sequence from a secreted polypeptide is not entirely uniform, resulting in more than one secreted species. These mature polypeptides, where the signal peptide is cleaved within no more than about 5 amino acids on either side of the C-terminal boundary of the signal peptide as identified herein, and the polynucleotides encoding them, are contemplated by the present invention.
"PRO polypeptide variant" means an active PRO polypeptide as defined above or below having at least about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular o domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed herein. Such PRO polypeptide variants include, for instance, PRO polypeptides wherein one or more amino acid residues are added, or deleted, at the N- or C- ;Z terminus of the full-length native amino acid sequence. Ordinarily, a PRO polypeptide variant will have at least about 80% amino acid sequence identity, alter-natively at least about 81 amino acid sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least about 83% amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively at least about amino acid sequence identity, alternatively at least about 86 amino acid sequence identity, alternatively at least about 87% amino acid sequence identity, alternatively at least about 88% amino acid sequence identity, O alternatively at least about 89% amino acid sequence identity, alternatively at least about 90% amino acid tn 1 euneiettatraieyatlataot9 mn cdsqec dettatraieya es bu 2 tnamn 10i sequence identity, alternatively at least about 9 amino acid sequence identity, alternatively at least92 Ci about 94 amino acid sequence identity, alternatively at least about 95 amino acid sequence identity, alternatively at least about 96% amino acid sequence identity, alternatively at least about 97% amino acid sequence identity, alternatively at leas about 98% amino acid sequence identity and alternatively at least about 99% amino acid sequence identity to a full-length native sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any other specifically defined fragment of a full -length PRO polypepride sequence as disclosed herein. Ordinarily, PRO variant polypeptides are at least about 10 amino acids in length, alternatively at least about 20 amino acids in length, alternatively at least about 30 amino acids in length, alternatively at least about 40 amino acids in length, alternatively at least about amino acids in tength, alternatively at least about 60 amino acids in length, alternatively at least about 70 amino acids in length, alternatively at least about 80 amino acids in length, alternatively at least about 90 amino acids in length, alternatively at least about 100 amino acids in length, alternatively at least about 150 amino acids in length, alternatively at least about 200 amino acids in length, alternatively at least about 300 amino acids in length, or more.
"Percent amino acid sequence identity" with respect to the PRO polypeptide sequences identified herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the specific PRO polypeptide sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid sequence identity can be achieved in various ways that are within the skil in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software, 'Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. For purposes herein, however, amino acid sequence identity values are generated using the sequence comparison computer program ALIGN-2, wherein the complete source code for the AL[GN-2 program is provided in Table I below. The ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and the source code shown o in Table 1 below has been filed with user documentation in the U.S. Copyright Office, Washington 20559, where it s registered under U.S. Copyright Registration No. TXU5 10087. The ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, California or may be compiled from the source code ;Z provided in Table 1 below. The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX V4.01D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary.
In situations where ALIGN-2 is employed for amino acid sequence comparisons, the amino acid sequence identity of a given amnino acid sequence A to, with, or against a given amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain amino acid O sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: V'n o 100 times the fraction X/Y where X is the number of amino acid residues scored as identical matches by the sequence alignment program ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid sequence B, the amino acid sequence identity of A to B will niot equal the amino acid sequence identity of B to A. As examples of amino acid sequence identity calculations using this method, Tables 2 and 3 demonstrate how to calculate the amino acid sequence identity of the amino acid sequence designated "Comparison Protein" to the amino acid sequence designated "PRO", wherein "PRO' represents the amino acid sequence of a hypothetical PRO polypeptide of interest, "Comparison Protein" represents the amino acid sequence of a polypeptide against which the "PRO" polypeptide of interest is being compared, and and each represent different hypothetical amino acid residues.
Unless specifically stated otherwise, all amino acid sequence identity values used herein are obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program. However, amino acid sequence identity values may also be obtained as described below by using the WU-BLAST-2 computer program (Altschul et al., Methods in Enzvoloav 266:460-480 (1996)). Most of the WU-BLAST-2 search parameters are set to the default values. Those not set to default values, the adjustable parameters, are set with the following values: overlap span 1, overlap fraction =0.t125, word threshold (TI) 11, and scoring matrix BLOSUM62. When WU-BLAST-2 is employed, a amino acid sequence identity value is determined by dividing t number of matching identical amino acid residues between the amino acid sequence of the PRO polypeptide of interest having a sequence derived from the native PRO polypeptide and the comparison amino acid sequence of interest the sequence against which the PRO polypeptide of interest is being compared which may be a PRO variant polypeptide) as determined by WU-BLAST-2 by the total number of amino acid residues of the PRO polypeptideof interest. For example, in the statement "a polypeptide comprising an the amino acid sequence A which has or having at least 80% amino acid sequence identity to the amino acid sequence the amino acid sequence A is the comparison amino acid sequence of interest and the amino acid sequence B is the amino acid sequence of the PRO polypeptide of interest.
o Percent amino acid sequence identity may also be determined using the sequence comparison program C- NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence comparison program may be downloaded from http://www.ncbi.nlm,nih.gov or otherwise obtained from the ;Z National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, wherein all of those search parameters are set to default values including, for example, unmask yes, strand all, expected c 5 occurrences 10, minimum low complexity length 15/5, multi-pass e-value 0.01, constant for multi-pass 25, dropoff for final gapped alignment 25 and scoring matrix BLOSUM62.
In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the amino acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B O (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain amino 'n 10 acid sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: -100 times the fraction X/Y where X is the number of amino acid residues scored as identical matches by the sequence alignment program NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid sequence B, the amino acid sequence identity of A to B will not equal the amino acid sequence identity of B to A.
"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid molecule which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length native sequence PRO polypeptide sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed herein.
Ordinarily, a PRO variant polynucleotide will have at least about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83 nucleic acid sequence identity, alternatively at least about 84 nucleic acid sequence identity, alternatively at least about 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 91 nucleic acid sequence identity, alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93 nucleic acid sequence identity, alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 97 nucleic acid sequence identity, alternatively at least about 98 nucleic acid sequence identity and alternatively at least about 99% nucleic acid sequence identity with a nucleic acid sequence encoding a full-length native sequence PRO polypeptide sequence as disclosed herein, a full-length o native sequence PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain o of a PRO polypeptide, with or without the signal sequence, as disclosed herein or any other fragment of a full- OJ4 length PRO polypeptide sequence as disclosed herein. Variants do not encompass the native nucleotide ;Z sequence.
Ordinarily, PRO variant polynucleotides are at least about 30 nucleotides in length, alternatively at least
C
c 5 about 60 nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least about 120 nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least about 180 nucleotides in length, alternatively at least about 210 nucleotides in length, alternatively at least about 240 nucleotides in length, alternatively at least about 270 nucleotides in length, alternatively at least about 300 Snucleotides in length, alternatively at least about 450 nucleotides in length, alternatively at least about 600 n- 10 nucleotides in length, alternatively at least about 900 nucleotides in length, or more.
O "Percent nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences C"l identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. For purposes herein, however, nucleic acid sequence identity values are generated using the sequence comparison computer program ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided in Table 1 below. The ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and the source code shown in Table 1 below has been filed with user documentation in the U.S.
Copyright Office, Washington 20559, where it is registered under U.S. Copyright Registration No.
TXU510087. The ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, California or may be compiled from the source code provided in Table 1 below. The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary.
In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the nucleic acid sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 100 times the fraction W/Z where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence D, the nucleic acid sequence identity of C to D will not equal the nucleic acid sequence identity of D to C. As examples of nucleic acid sequence identity calculations, Tables 4 and 5, demonstrate how to calculate o the nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic o acid sequence designated "PRO-DNA", wherein "PRO-DNA" represents a hypothetical PRO-encoding nucleic acid sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule against which the "PRO-DNA" nucleic acid molecule of interest is being compared, and and each represent different hypothetical nucleotides.
C^ 5 Unless specifically stated otherwise, all nucleic acid sequence identity values used herein are obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program. However, nucleic acid sequence identity values may also be obtained as described below by using the WU-BLAST-2 computer program (Altschul et al., Methods in Enzvmology 266:460-480 (1996)). Most of the WU-BLAST-2 O search parameters are set to the default values. Those not set to default values, the adjustable parameters, are set with the following values: overlap span i, overlap fraction 0.125, word threshold 11, and Sscoring matrix BLOSUM62. When WU-BLAST-2 is employed, a nucleic acid sequence identity value (C"l is determined by dividing the number of matching identical nucleotides between the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid molecule of interest having a sequence derived from the native sequence PRO polypeptide-encoding nucleic acid and the comparison nucleic acid molecule of interest the sequence against which the PRO polypeptide-encoding nucleic acid molecule of interest is being compared which may be a variant PRO polynucleotide) as determined by WU-BLAST-2 by the total number of nucleotides of the PRO polypeptide-encoding nucleic acid molecule of interest. For example, in the statement "an isolated nucleic acid molecule comprising a nucleic acid sequence A which has or having at least 80% nucleic acid sequence identity to the nucleic acid sequence the nucleic acid sequence A is the comparison nucleic acid molecule of interest and the nucleic acid sequence B is the nucleic acid sequence of the PRO polypeptideencoding nucleic acid molecule of interest.
Percent nucleic acid sequence identity may also be determined using the sequence comparison program NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained from the National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, wherein all of those search parameters are set to default values including, for example, unmask yes, strand all, expected occurrences 10, minimum low complexity length 15/5, multi-pass e-value 0.01, constant for multi-pass 25, dropoff for final gapped alignment 25 and scoring matrix BLOSUM62.
In situations where NCBI-BLAST2 is employed for sequence comparisons, the nucleic acid sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 100 times the fraction W/Z where W is the number of nucleotides scored as identical matches by the sequence alignment program NCBI- BLAST2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will o be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence C D, the nucleic acid sequence identity of C to D will not equal the nucleic acid sequence identity of D to 02)1 C.
;Z In other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an active PRO polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash cf 5 conditions, to nucleotide sequences encoding a full-length PRO polypeptide as disclosed herein. PRO variant polypeptides may be those that are encoded by a PRO variant polynucleotide.
The term "positives*, in the context of sequence comparison performed as described above, includes residues in the sequences compared that are not identical but have similar properties as a result of Sconservative substitutions, see Table 6 below). For purposes herein, the value of positives is determined by dividing the number of amino acid residues scoring a positive value between the PRO polypeptide amino acid o sequence of interest having a sequence derived from the native PRO polypeptide sequence and the comparison Cl amino acid sequence of interest the amino acid sequence against which the PRO polypeptide sequence is being compared) as determined in the BLOSUM62 matrix of WU-BLAST-2 by the total number of amino acid residues of the PRO polypeptide of interest.
Unless specifically stated otherwise, the value of positives is calculated as described in the immediately preceding paragraph. However, in the context of the amino acid sequence identity comparisons performed as described for ALIGN-2 and NCBI-BLAST-2 above, includes amino acid residues in the sequences compared that are not only identical, but also those that have similar properties. Amino acid residues that score a positive value to an amino acid residue of interest are those that are either identical to the amino acid residue of interest or are a preferred substitution (as defined in Table 6 below) of the amino acid residue of interest.
For amino acid sequence comparisons using ALIGN-2 or NCBI-BLAST2, the value of positives of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain positives to, with, or against a given amino acid sequence B) is calculated as follows: 100 times the fraction X/Y where X is the number of amino acid residues scoring a positive value as defined above by the sequence alignment program ALIGN-2 or NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid sequence B, the positives of A to B will not equal the positives of B to A.
"Isolated,' when used to describe the various polypeptides disclosed herein, means polypeptide that has been identified and separated and/or recovered from a component of its natural environment. Contaminant components of its natural environment are materials that would typically interfere with diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes. In preferred embodiments, the polypeptide will be purified to a degree sufficient to obtain at least o 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenaror, or to Shomogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, preferably, silver stain. Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least one ;Z component of the PRO polypeptide natural environment will not be present. Ordinarily, however, isolated polypeptide will be prepared by at least one purification step.
C
c r 5 An "isolated" PRO polypeptide-encoding nucleic acid or other polypeptide-encoding nucleic acid is a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An isolated polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found in nature.
OIsolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the specific polypeptidet- 10 encoding nucleic acid molecule as it exists in natural cells. However, an isolated polypeptide-encoding nucleic o acid molecule includes polypeptide-encoding nucleic acid molecules contained in cells that ordinarily express the Spolypeptide where, for example, the nucleic acid molecule is in a chromosomal location different from that of natural cells.
The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers.
Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice.
The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-PRO monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PRO antibody compositions with polyepitopic specificity, single chain anti-PRO antibodies, and fragments of anti-PRO antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of substantially homogeneous antibodies, the individual antibodies comprising the population are identical except for possible naturally-occurring mutations that may be present in minor amounts.
"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration.
In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when complementary strands are present in an environment below their melting temperature. The higher the degree 79 o) of desired homology between the probe and hybridizable sequence, the higher the relative temperature which -i can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions bJ) more stringent, while lower temperatures less so. For additional details and explanation of stringency of hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience Publishers, (1995).
C 5 "Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those that: employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium citrate/0. sodium dodecyl sulfate at 50C; employ during hybridization a 1" denaturing agent, such as formamide, for example, 50% formamide with 0,1% bovine serum O albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM sodium tr 10 chloride, 75 mM sodium citrate at 42°C; or employ 50% formamide, 5 x SSC (0.75 M NaCI, 0.075 M o sodium citrate), 50 mM sodium phosphate (pH 0.1% sodium pyrophosphate, 5 x Denhardt's solution, CI sonicated salmon sperm DNA (50 4g/ml), 0.1% SDS, and 10% dextran sulfate at 42"C, with washes at 42°C in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55 C, followed by a high-stringency wash consisting of 0.1 x SSC containing EDTA at "Moderately stringent conditions" may be identified as described by Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and hybridization conditions temperature, ionic strength and %SDS) less stringent that those described above. An example of moderately stringent conditions is overnight incubation at 37"C in a solution comprising: 20% formamide, 5 x SSC (150 mM NaCI, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, followed by washing the filters in 1 x SSC at about 37-50"C. The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like.
The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against which an antibody can be made, yet is short enough such that it does not interfere with activity of the polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino acid residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino acid residues).
As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the desired binding specificity which is other than the antigen recognition and binding site of an antibody is "heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any immunoglobulin, such as IgG-I, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD Sor IgM.
O "Active" or "activity" for the purposes herein refers to form(s) of a PRO polypeptide which retain a biological and/or an immunological activity of native or naturally-occurring PRO, wherein "biological" activity Srefers to a biological function (either inhibitory or stimulatory) caused by a native or naturally-occurring PRO other than the ability to induce the production of an antibody against an antigenic epitope possessed by a native Ce 5 or naturally-occurring PRO and an "immunological" activity refers to the ability to induce the production of an antibody against an antigenic epitope possessed by a native or naturally-occurring PRO.
The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully Sblocks, inhibits, or neutralizes a biological activity of a native PRO polypeptide disclosed herein. In a similar Smanner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological C 10 activity of a native PRO polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically Sinclude agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of c.1 native PRO polypeptides, peptides, antisense oligonucleotides, small organic molecules, etc. Methods for identifying agonists or antagonists of a PRO polypeptide may comprise contacting a PRO polypeptide with a candidate agonist or antagonist molecule and measuring a detectable change in one or more biological activities normally associated with the PRO polypeptide.
"Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of treatment include those already with the disorder as well as those prone to have the disorder or those in whom the disorder is to be prevented.
"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time.
"Intermittent" administration is treatment that is not consecutively done without interruption, but rather is cyclic in nature.
"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, domestic and farm animals, and zoo, sports, orpet animals, such as dogs, cats, cattle, horses, sheep, pigs, goats, rabbits, etc. Preferably, the mammal is human.
Administration "in combination with" one or more further therapeutic agents includes simultaneous (concurrent) and consecutive administration in any order.
"Carriers' as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; saltforming counterions such as sodium; and/or nonionic surfactants such as TWEEN
T
polyethylene glycol (PEG), o and PLURONICS".
o "Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', and Fv fragments; diabodies; linear antibodies (Zapata et al., Protein Eng. 8(10): 1057-1062 [1995]); single-chain antibody molecules; and multispecific antibodies formed from antibody fragments.
Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" fragments, each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the ability to crystallize readily. Pepsin treatment yields an fragment that has two antigen-combining sites and is still capable of cross-linking antigen.
S"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non-covalent O association. It is in this configuration that the three CDRs of each variable domain interact to define an antigenl binding site on the surface of the V,-VL dimer. Collectively, the six CDRs confer antigen-binding specificity to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding site.
The Fab fragment also contains the constant domain of the light chain and the first constant domain (CH1) of the heavy chain. Fab fragments differ from Fab' fragments by the addition of a few residues at the carboxy terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear a free thiol group. antibody fragments originally were produced as pairs of Fab' fragments which have hinge cysteines between them. Other chemical couplings of antibody fragments are also known.
The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to one of two clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant domains.
Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several of these may be further divided into subclasses (isotypes), IgGI, lgG2, lgG3, IgG4, IgA, and IgA2.
"Single-chain Pv" or 'sFv" antibody fragments comprise the V, and VL domains of antibody, wherein these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a polypeptide linker between the and VL domains which enables the sFv to form the desired structure for antigen binding. For a review of sFv, see Pluckihun in The Pharmacology of Monoclonal Antibodies, vol. 113, Rosenburg and Moore eds., Springer-Verlag, New York, pp. 269-315 (1994).
The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which fragments comprise a heavy-chain variable domain connected to a light-chain variable domain in the same polypeptide chain By using a linker that is too short to allow pairing between the two domains on the same chain, the domains are forced to pair with the complementary domains of another chain and create o two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/11161; and SHollinger et al., Proc. Natl, Acad. Sci. USA, 90:6444-6448 (1993).
An "isolated" antibody is one which has been identified and separated and/or recovered from a ;Z tcomponent of its natural environment. Contaminant components of its natural environment are materials which would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and ¢Cf 5 other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) to greater than 95% by weight of antibody as determined by the Lowry method, and most preferably more than 99% by weight, to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, or to homogeneity by SDS-PAGE under reducing or O nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the antibody in situ within recombinant cells since at least one component of the antibody's natural environment will not be o present. Ordinarily, however, isolated antibody will be prepared by at least one purification step.
Cl The word "label" when used herein refers to a detectable compound or composition which is conjugated directly or indirectly to the antibody so as to generate a "labeled" antibody. The label may be detectable by itself radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical alteration of a substrate compound or composition which is detectable.
By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass controlled pore glass), polysaccharides agarose), polyacrylamides, polystyrene, polyvinyl alcohol and silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an assay plate; in others it is a purification column an affinity chromatography column). This term also includes a discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 4,275,149.
A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant which is useful for delivery of a drug (such as a PRO polypeptide or antibody thereto) to a mammal. The components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement of biological membranes.
A "small molecule" is defined herein to have a molecular weight below about 500 Daltons.
Table 1 C~ 1* 0 C-C increased from 32 to Z is average of EQ B is average of ND match with stop is stop-stop 0; 1 (joker) match 0 (In *1 #derine _M -8 /4 value of a match with a stop fI int -day[26][26] In ISA B C D E F G H I J K L M N 0 P Q R S T U V W X Y Z*I A *1 2 0 2 0, 1, 1, I, 0, 0).
B 1 0. 3, 0 0, 1,0,0, 0, 1), O* C 0, C- 15 P D 4, 1, 1-2, 0, 2, 0,0, 0,4, 2), I* PEV 3, 0, 1-2, 0, 0, 0. 0.4, 3), OP F 1, 2, 0, 0, OPG l 1. 0, 0), /l IH/ 1, 0, 0, 2), i*1*1 5, 2, 0, 0, 0, 0.0 00,0. 0, 0, 0, 0 0,M, 0, 0. 0,0 0, 0, 0, 0, 0), 4
K
4 1 0, 0, 0, 1. 3 0. 0, 0).
LI 2, 6, 0.
*M
t 2, 0. ,04, 0, 4
N
4 2, 0, 1,0, 1, 0, 1), _M,_M.MM._MM MM,_M,
P
4 I 6, 0. 0, 1, 0, 0),
/'Q
t '10, 2, 0, I.M, 0, 4, 3), P R *1 0, 0, 0, 1, 6, 2, s 4 0, 0, 2, 1, 0), T 0. 0,M. 1, 3, 0, 0), U 0,0 0. 00,0,0. 0, 0,0.0, 0, M,0,0. 0, 0.0.0, 0 0.0, 00), V 4, 2, 0, 0, P W 0,-6,17, 0, *1f 00, 0, 0, 0. 00,0. 0. 0. 0,0,00,-k 0M. 0, 0,0, 0. 0 0, 0, 0, 0, 0).
I/ 0. 0,I0.-4).
P Z 4 2, 02,-2. 0, I 0. 3, 0. 0, 0, 4)
;Z
Table 1 (cont') #indlude <stdio.h> include <crypeli> #definc #defme #defmne #deine #define #define #dcflne #detbie #derine #deflne
MAXJMP
MAXOAP
JMPS
NIX
DMAT
DMIS
DJNSO
DINS I
PINSO
PINS I 16 /P max jumps in a diag 24 P' don't continue to penalize gaps larger than ibis/ 1024 1* max jmps in an path 4/ 4 P save if there's at least MX-1I bases since last jmp 4 3 P value of matching bases 0 P8 penalty for mismatched bases 8 P penalty for agap 1 /8 penalty per base 8 8 P* penalty for a gap 4 P* penalty per residue n[MAXJMPJ; size of jmp (neg for dely) *I x[MAXJMP:. base no. of jmp in seq x/ /8 limits seq to 2^ 16 -1 struct jmnp short unsigned short struct diag ilt long short Struct jmp score; offset;, ijmp; jp: /8 score at last jmp /8 offset of prey block 8 /8 current imp. index 8 I* list of jmps *f struct path int $PC, I* number or leading space~s short nIJ MPS]; P size of jmp (gap) fit x[JMPSJ;/* loc of jmp (last elem before gap) char char char char it ilt mlt tnt tnt int in( int int long struct struct *ofile' 8 rmmex[2],, t prog, 8 seqx[2J: dinax; dmax0; dua; endgaps; gapx, gapy; lenO, len]; ngapx, ogapy; smax; *xhm; offset; *dx: pp[ 2 1:, output file name /8 seq names: getseqs() /8 prog name for err msgs /8 seqs: gecseqso /P best diag: nwQ /8 final diag *J P* set if dna: main() *1 f* set if penalizing end gaps /8 total gaps in seqs 8 /8 seq lens total size of gaps P* max score: nw() *i P* bitmp for matching *1 P* current offset injmp file 8 /4 holds diagonals 8/ I* holds Path for seqs *1 char char t callocO. *malloco, "'Indexo, *strcpy0; *getseqo, 8 gcalloco, STable 1 (cont') 0 Needleman-Wunsch alignment program usage: progs filel file2 where filel and file2 are two dna or two protein sequences.
The sequences can be in upper- or lower-case an may contain ambiguity Any lines beginning with or are ignored C Max file length is 65535 (limited by unsigned short x in the jmp struct) A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA Output is in the file "align.out" n The program may create a Imp file in I/mp to hold info about traceback.
Original version developed under BSD 4.3 on a vax 8650 1 #include "nw.h" N 15 #include "day.h" Sstatic dbval(26] O.14,2,13,0.0,4.11.0,0, 12,0,3.15,0,0,0,5.6,8,8,7,9,0,1O,0 static _pbval[26] 1, 21(1< 4, 8, 16, 32, 64, 128, 256, OxPFFFFFF, 1<<10, I<<l11, l<<13, 1<14, I 15, 16, I<<17, 1< <18, 1 19, l< <20, 1<<21, <22, 1 <23, l<<24, 1 <25j(1< main(ac, av) main int ac; char *av[lj; prog av[0]; itf (ac 1= 3)( fprintf(stderr,"usage: %s filel file2\n", prog); fprintf(stderr,"where filel and file2 are two dna or two protein sequences.\n"); fprintf(stderr,"The sequences can be in upper- or lower-case\n"); fprintf(stderr,'Any lines beginning with or are ignored\n"): fprintf(stderr,"Output is in the file \'align.out\"kn"); exit(1); namex(0] av[l]; namex[1] av[2J; seqx[0] getseq(namex[0], &lenO); seqx[l] getseq(namex[lJ, &lenl); xbm (dna)? dbval: _pbval; endgaps 0; I to penalize endgaps ofile "align.out"; output file nw,: fill in the matrix, get the possible jmps readjmps); get the actual jmps *1 print()o; print stats, alignment cleanup(0); /P unlink any tnp files
;Z
CA
Table 1 (coat') do the alignment, return best score: main() dna:- values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 pro:- PAM 250 values When scores are equal, we prefer mismatches to any gap, prefer a new gap to extending an ongoing gap, and prefer a gap in seqx *to a gap in seq y.
nwO in( int register register register register *lx, t py.
*ndcly. *dely; ndelx, dclx; *Imp; mis; insO, insi; id; *Colo, $coI 1; xx, yy; seqs and ptrs keep track of deiy P* keep track of deix *I for swapping rowO, rowl score for each type insertion penalties diagonal index 4
I
t jmp index 4/ score for curr, last row /4 index into seqs dx (struct diag caltocQ'to get diags". IenrO+lent 1, slzeof(struct diag)): ndety =(tnt t )gcalloc("to get ndely". tenlI 1, sizeof(int)); dety (int t )g_caltoc("co get dely", lentI 1. slneof(iiit)); Coto (int calloec(to get colD", lent 1, sizeof(Int)): coil (int 4 )g_calloc("to get coil", lentl 1, sizeof(int));, insO (dna)? DINSO :PINSO: insl (dna)? DINSI :PlNSI; sinax =-10000; If (endgaps) for (colO[0J= dety[Ol -insO, yy 1; yy t eni; yy+ colO[yy] dely[yyl colO[yy-1]j ins 1; ndety[yy] yy:
I
colO[0] 0: I* Waterman Bull Math Riot 84 *1 for (yy 1; yy lentI; yy++) dely[yy] -imaO; tilt in match matrix for (px seqx[0], xx 1; xx leno; px+ xx initialize first entry in cot if' (endgaps)( if (xx cot 1(0] delx -(inO+insl); cot 1 [0J dclx =cotO[0] i nsl; ndelX Xx; else{ coll[0] 0; delx -insO; ndelx 0; iabltI(cnt' Clfor se qx yy 1; yy tenl; py+ yy+ 0.0 mis colO[yy-lJ; it (dna) mis (xbmfpx-'Au]&xbm[*py-'AI)? DMAT: else mis -day[t*x-'A]*py-Au); I* update penalty for del in x seq; favor new del over ongong del ignore MAXOAP if weighting codgaps if (endgaps I ndely[yyl MAXGAP){ o if (coIO[yy] inso dely[yyj) Cl 15. dely(yy) colO[yy] (insO +ins 1); ndely[yy] 1; o }else o dely[yy] insi; ndely[yy] )else if (colO[yyj (insO+insl) dely[yyj){ dely~yyJ colO[yy] (insO +ins 1); ndely[yy] 1 }else ndely[yy] update penialty for del in y seq; *favor new del over ongong del if' (endgaps fl deix MAXOAP)( if (col[I[yy-lI] insO deix){ deix coil Ijyy-lI] (insO inst); ndelx =I )else delx instI; ndelx4 4; )else{ if (cotlI yy-lJ (insO +insl1) deix){ deix collI[yy-l] (insO +insl1); ndeix 1,, ele ndelx P pick the maximum score;, we're favoring *mis over any del and deix over dely o Table 1 (cont') OJ~id xx yy lenI 1: .n ;Z ~if (mis =deix mis dely[yy) ekse if (delx =delyfyy]) COMMl =deyj ii dx[id].ijmp; if (dx[id).jp~n[O] (!daI (ndelx =MAXIMP &&xx dx~id].jp.x~ijl4MX) mis dx[idJ score+ DINSO)){ kn 10 dx[idtijmp+ N-, r- ~if =MAXJMP)( wriiejmps(id); o ij dx[idJ.ijmp =0; dx[idJ.offsec offset; q 15 offset slzeof(ntruct imp) sizeo~offset): C] dx[idj~jp.nlij] ndeix; dxlid].jp.x[ijl xx; dx[idJ.score deix; else{ col I yyj -delylyy]; ij dx[id].ijmp; if (dx[idjjp.n[O] (Idna I I (ndciy[yy) MAXIMP xx dxfid].jp.xjijJ+ MX) I I mis dx[idj.score+DINSO)){ dx[idiijmp+ 4-; if +ij MAXIM?)( wriiejmps(id); Uj dx[idflijmp =0, dx[id].offsec offset; offset sizeof(strucl jmp) sizeo ((offset); dx[id].jp.n~ij] -ndely[yyj; dx[idj.jp.x[ij) xx; dx[idlbscore delylyyj; if (xx ==lenO yy leni)( last Col if (endgaps) coll[yy] -=insO+insl t (lenl-yy); if (coili [yy] smax) smax col [yyl; if (endgaps xx lenO) coll[yy-I) imsO+insi'(enO-xx); if (Col I yy-l] swnax){( smax coll[yy-l]; dmax id; 551 imp Colo; Colo Coil; Coil mp; (void) free((char *)zydely); (void) free((char *)dely); (void) free((char ')colO); (void) free((cbar *)ColI)-, o Table licont') print() only routine visible outside this module *getmato trace back best path, count matches: princo *pr align() print al ignment of described in array print() *dumpblock0 dump a block of lines with numbers, stars: pr aligno *nuMSQ put out a number line: dumpblock() 10 puclineQ) put out a line (namne, (nurn], scq, [anmfl: dumtpblock() scars() -put a line of stars: dumpblockQ stripnane() strip any path and prefix from a seqname 15 #include "nw.hw o#deine SPC 3 ci#dermeP PLINE 256 maximum output line *1 #define PSPC 3 I* space between name or now and seq extent day[2 61126]; int Olen; seA output line length FILE *fx. output file *1 printO print tnt lx, ly, firstgap. lasigap; overlap If ((fx fopen(ofile, fprinrf(stderr,*%9s: can't write %skn', prog, ofile); cleanup( I);
I
fprintf(fx, <first sequence: %s (length namcx[03, lenO); Cprintf(fx. "<second sequence: %s (length namex[l], lenl); olen Ix IenO; ly ten]; firstgap lastgap =0; if (dmax lent 1) /4 leading gap in x pp[O],spc =firsrgap lecM dmax 1: ly PPf0]spc; else if (dinax lent I) leading gap in y pp[I].spc firstgap dniax (lent 1); lx -ppl.spc; If (dmaxo lenO 1) trailing gap in x Iastgap lenO dmax0 -I1: Ix lastgap: else If (dmax0 lenO 1) trailing gap in y 4 lastgap dmax0 (lenO 1); ly lastgap; getmat(lx, ly. firstgap, lastgap); praligno;, o Table 1 (cont') Of) 'trace back the best path, counc matches Static getrnut(ix, ly. firstgap. lastgap) getmiat ian Ix, ly; I* "core' (minus eudgaps) *f tnt firstgap, lastgap; leading trailing overlap nt n 10. ii, sizo, sizi; ~fldouble pct; register no, ni1; oregister cha *pO, *pl; C] 15 l* get total matches, score o 10=I1= sizO skzi 0; p0 seqx[WJ pp~lbspc:.
pt seqx[lJ pp[O].spc; no pp[ll.spc +l; ni pp[0J.spc 1; run =0; while('*p0&& *p if (sizO) else if sizl){ P0 no si-: else( if (xbrn[*p0-'AhJ&xbm[*pI 'A1) rnm I; If' pp(0).x[iOJ) sL-0 pp[0].n[iO+ +1: if (n -=pp[l)x[IlJ) sit ppIIJ.niI p0+ 4; 451 I* Pet homology: if penalizing endgaps. base is the shorter seq else, knock off overhangs andi rake shorter core If (endgaps) Ix (lenO lent)? lenO len I; else ix =(lx ly)l Ix ly:.
pct j00.*(double)n/(double)x frrintf(fx. frrintflfx, %d match%s in an overlap of %.2f percent sirnhlarity\n*, nun, (rn Q=1? lx, pet); c Table I (coil') ffrintf(fx. <gaps in first sequence: gapx): if (gapx) (void) sprintftoutx. ngapx, (dna)? *base": "residue", (ogapx 1 fprintffx,"9bs', OUIX); getmat fprintf(fx, gaps in second sequence: gapy); if (gapy) if (dna) else if (endga
I
(void) sprintf(ouix, ngapy, (dna)? 'base":"residue", (ngapy fprinf(fx,"97s", outx); fprintf(fx, \n <score: %d (match mismatch gap penalty %d %d per base)\n".
smax, DMAT, DMIS, DINSO, DINSI); fprinrf(fx, "\nscore: %d (Dayhoff PAM 250 matrix, gap penalty %d %d per residue)n", smax, PINSO, PINS1); ps) fprintf(fx.
endgaps penalized. left endgap: %d right endgap: %d %s%sXn', firsigap (dna)? "bas": "residue', (firstgap s Iastgap. (dna)? "base": "residue". (lasigap [printf(fx, "<endgaps not penalized\n); static static static static static static static char static char Static char static char ri; Imax: ij[21; nc[2J; ni[2]: siz[21; *ps[21.
*po 2 cutjZJPLINE]; star[PLINE]; matches in core for checking lengths of stripped fle names *l jmp index for a path 1* number a[ start of current line 'I current elem number for gapping fI ptr to current element I/ I ptr to next output char slot *1 output line 4/ set by star50 *1 print alignment of described in struct path ppi] +1 stalk pralign( pralign rn; char count *I tnt more; register i; for (i 0. Imax 0: i 2: nn stripname(nanex[iJ); If (nn Imax) imax nn; nc[i] i; nitil= I: siz) ij[il 0; pslij seqx[iJ: poill GUI~il: 0 Table I (cont') ;Z ~~~for (an nrn 0,nmore ;rmore; .){7alg for more i 2; i do we have more of this sequence? if (l'ps(ij) continue; In 10 more+ iI' (pp~jibspc) I leading space o *po[iI++ C] pp[i].spc--;, In o else iI' (siz[ij1) in a gap '1 o *po[i C] siziil--; else /we're putting a seq element *f *POPi] 'Ps0il: if (islower('psliJ)) *psli coupper(*ps[i]); po~i] ps[il++; *are we at next gap for this seq?
I
if (null pp[iJ.x[ijrijl){ *we need io merge all gaps at this location siziJ pp~ij.n[Jjij+I-4; while (nlffl w= pptilIx1ijiiI) siz[iJ ppli].n(ij[i]4- I-; ni[iJ if +nn==olenh !more dumnphlockQ; for (i -0 i 2,i po(i] outli]; nn. 0; *dump a block of lines, including numbers, stars: pr aign() static dwnpblock() dumpblock register i: for (i =0:iC<2; 1+ Table 1 (corn') dumpbiock (void) pucc('\n'. fx), for (i 2; i+ if (*OO~j I I *(poii) D 21 if (i nums(i); If (i 0 &&*ou[1ID starsQ; put] ine(i);, if l =0 *out[lI]) fpriniffX. star); nums(i): *put out a number line: durnpblock() static nms(i x) it ix; index in out[] holding seq line *f char nline[P LINE]: register 2, j; register char *Pn, t px,, t py: for (pn nline. i i lmax PSPC:. i pn+ *pn for (i nc[ix], py =out[ixj; *py: pn+ t pnelse({ if (i%I10 0 110 I= ncfix] 0 OP A i:.
ror (px p; jj/= 10, px--) if(i 0) %0+V nu111ms else i+ *pn *pn nclix] i: for (pn Wilne; *pn; pn (void) putc(*pn. Cx);, (void) putc(f\n'. fx); t* *put out a line (name. Inumi. seq. fournd): dumpbiocko static puaineoix) bIt ix: putline 0 Table 1 (cant') putline ;Z it cr1register char *px; enfor (px namex[ix), i =O0; px &A t *px= (void) putc(*px. fx):, for i lmaxt+P SPC; i+ (void) putc(' P~ these count from 1: niuj is current element (from 1) ln for (px outlixi: *px: px4- -4) o (void) putc(*px&0x7P. fx):, o (void) putc('\n', fx); 2put a line of scars (seqs always in oul(O]. our[tJ):- durnpblnck() static starsQ stars f in( t register char *PO cx, *px; if(!*out[O) I I 4 out[O] 2 (po[OJ) I l~outl] I I e*outtll1 return: px Star; for (i Imax +P_SPC; i; -p for(po= out[O], pi outil; 'PpO&& *p1; pO+ if (isalpha(pO) isalpha( 2 p1)) itf(xbm[*-A'I&xbml p cx nm else if (dna -day( t pO-'A i 0) cx else cx else c cx:, 1 0 Table 1 (cant') 011) *strip path or prefix from pn. return len: pi-aign()
;ZV
static stripame(pni) stnipname char *pn; /2 file name (may be path) 'I register char *pjx, 4 py; py p;*x x+ for (px x++ if (*px 7) 0py -pX4+-I: i p)(void) surcpy(pn, py): o return(sirlen(pn)); 00 0 Table 1 (cout') 011) cleanup() cleanup any tmp file gecseq() read in seq. set dna, len, maxler, cr4 &.cal lnc() calloc() with error checkin 5 '~rradjinps() get the good jmps. from tmnp file if necessary wrilejmps() write a filled array of jmps to a tnp file: nwO #lnclude 'nw.hh lndcude sys/file.h I)char *jnamne "/cmnp/horngXXXXXX"; I* snip file for jnips *1 I>FILE *Q; CO tnt cleantupO; cleanup imp ile *1 long lseekQ; 0 remove any snip file if we blow -cleanup(i) cleanup int i if (0j) (void) uniiiikoflafle); exic(i); read, return pzr to seq. set dna, len, maxlen skiplinesstring withj or'>' seqt in upper or lower case char getseq(file, len) getseq char *file; file name imt 'Hen: seq len char line[10241, *pseq; register char *px, *py; Jut 'naige. (ten; FILE 4 p: if ((Ij fopen(fle,"r')) 0){ fbrintf(siderr, can't read prog, file); ei() flen ntalge 0 while (fgetsline, 10O4, 1) if (*line I I *line line continue; for (px line; pxI px if (isuppcr(*px) I I islower('px)) tlen+ if ((pseq malloc((unsigned)(tlen+6))) fprintCqstderr."%s: mallDo( failed ro get %d bytes for %sAn". prog. rlen+6, file); exit(l);, pseqi0l pseql pstq[21 =pseq(31 1' 0 Table 1 (cont') getscq ;Z py =pseq 4; *len den; en while (I'geas(line. 1024, fp))( if(*line I I *line~ line continue:.
far cpx line; *px px tfto1 if (isupprQ*px)) *px; else ii' (islower( t px)) o~P py Iouppwr*px): cif i(Wnex("ATGCU natgc+ py V (void) McoseQfp); dna natgc (lea); retur-n(pseq+4); char gcalloc(msg, ax, sz) g..calloc char *msg;. I* program, caling routine1 int in, sz; number and size of elements char t px. kcallocfr, if ((px calloc((unsigned)nx, (unaigned)sr)) 0)( If (*msg) fjninrf(stdcrr. g_calloc() failed %s sz= prog. msg. rin, sz); return(px); *get final jmps from dx[J or imp tile, sel pp[l. reset drnax: niain() readjmpsO readjmps int fd= -i; tnt siz. iD, i I; register i, j, xx; (void) ficlose(l); if openoname. 0_R DONLY, 0){ fixintfstdenr, can't openO prog, jname); cleanup( I): fo (i O ii I= 0. dm'.A0 dna x, xx =lernO; while for dxldmaxj.ijmp; j 0 dxfdmaxbjp.xbjJ xx; c-i Table 1 (cont') redjmps ;Z if (j 0 dx[dnax].offsec f3) I (void) Iseekffd, dx[dmax] offset, 0); (void) read(fd, (char *)&dx[dinaxl.jp. slzeof(struci jmp)): (void) read(fd, (char *)&dx[dniaxl.offSec, sizeof(dxiumax].offsei)): dxldrnax].jmp MlAXJMP-l; else break;.
'n ifQ(i>=JMP'S)( fprinif(stderr. too many gaps in alignmenun", prog); o cleanup( I); rgo>=O)( o siz dx[drmx].jp.ntj]; o xx dx[dniaxjjp.xj; Cl dmax sir; if (sir 0) I' gap in second seq *1 pprlb.nlilI -siz; xx siz; id xx yy lent -I *1 pp[I.x~ilI xx -dinax lent 1: gapy+ ngapy sz ft ignore MAXGAP when doing endgaps 1/ sir (-sir MAXCAP Iendgaps)? -sir MAXGAI'; 301 else if (siz 0)1( gap in first seq pplOjn[iOI sir; pp[O].x(iO] xx; gapx ngapx sir; ignore MAXCAP when doing endgaps sir (siz MAXOAP Iendgaps)? siz: MAXOAP: jO 401 else revere the order of jmps for Ci 0. j Oi= ppTO.nUj]: pp(O]-nU) pp[0J.n[iO]; pp[0n[iO] i prp[O].x$] pp(0l-xljJ pp[0J.xliJ(); ppLOJ.xli01 i for 0 0, il-;j il; i=ppIll.nijl; pp[I.nUI pp( I].n[ilJ; pp lJ-n(i IJ i; 1= ppIUALxjl pp~l].xUl pp1l].xlill; Pppl].xilJ I; If fd =0) (void) close(fd); If (void) uniinkojname): I) 0: offset 0; o Table 1 (cont') ;Z wite a filled jmp sn-ict offset of the prey one (if any): nw0( 5 writejmps(ix) writejmps in( ix; char *mkcemp0; if( fj) (mk-tempojnarm) 0){f fprinrftslderr, can't nemp( prog. jnamc); o cleanup(1): 0 ffin((tdrr 1 :an't wite Tos\n", prog. ine); (void) fwiie((char 4 )&dxfi xj.jp. sizeof(struct jmp), 1, 0j; vold) fwrite((char offset. sizeof(dx[ix]. offset), 1. fi); Table 2 to
PRO
Comparison Protein
XXXXXXXXXXXXXXX
XXXXXYYYYYYY
(Length 15 amino acids) (Length 12 amino acids) amino acid sequence identity (the number of identically matching amino acid residues between the two polypeptide sequences as determined by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) 5 divided by 15 33.3% Table 3 to
PRO
Comparison Protein
XXXXXXXXXX
XXXXXYYYYYYZZYZ
(Length 10 amino acids) (Length 15 amino acids) amino acid sequence identity (the number of identically matching amino acid residues between the two polypeptide sequences as determined by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) 5 divided by 10
O
O
Ci en Table 4
PRO-DNA
Comparison DNA
NNNNNNNNNNNNNN
NNNNNNLLLLLLLLLL
(Length 14 nucleotides) (Length 16 nucleotides) nucleic acid sequence identity (the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) 6 divided by 14 42.9% Table to ;Z3
PRO-DNA
Comparison DNA
NNNNNNNNNNNN
NNNNLLLVV
(Length 12 nucleotides) (Lentgth 9 nucleotides) nucleic acid sequence identity (the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) 4 divided by 12 33.3%
O
SII. Compositions and Methods of the Invention A. Full-Length PRO Polyveptides The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides referred to in the present application as PRO polypeptides. In particular, cDNAs encoding various PRO eC polypeptides have been identified and isolated, as disclosed in further detail in the Examples below. It is noted that proteins produced in separate expression rounds may be given different PRO numbers but the UNQ number is unique for any given DNA and the encoded protein, and will not be changed. However, for sake of 7 simplicity, in the present specification the protein encoded by the full length native nucleic acid molecules o disclosed herein as well as all further native homologues and variants included in the foregoing definition of CPRO, will be referred to as "PRO/number", regardless of their origin or mode of preparation.
As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The 0 actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of the deposited clone using routine methods in the art. The predicted amino acid sequence can be determined from the nucleotide sequence using routine skill. For the PRO polypeptides and encoding nucleic acids described herein, Applicants have identified what is believed to be the reading frame best identifiable with the sequence information available at the time.
1. Full-length PRO1484 Polypeptides Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PRO1484 (shown in Figure 2 and SEQ ID NO:2) has certain amino acid sequence identity with a portion of the mouse adipocyte complement related protein (ACR3 MOUSE). Accordingly, it is presently believed that PRO1484 disclosed in the present application is a newly identified adipocyte complement-related protein homolog and may possess activity typical of that protein.
2. Full-length PR04334 Polopeptides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR04334 (shown in Figure 4 and SEQ ID NO:9) has certain amino acid sequence identity with PC-I.
Accordingly, it is presently believed that PRO4334 disclosed in the present application is a newly identified member of the PC-I family and shares similar mechanisms.
3. Full-length PRO1122 Polvneptides The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides referred to in the present application as PRO 1122. In particular, Applicants have identified and isolated cDNA encoding a PR01122 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA sequence alignment computer programs, Applicants found that the PROI 122 polypeptide has sequence identity with CTLA-8. The amino acid sequence shows a region having sequence identity with IL-17.
Acco. Jingly, it is presently believed that PRO1122 polypeptide disclosed in the present application is a novel cytokine and thus may be involved in inflammation responses.
O
4. Full-length PR01889 Polveptides SUsing the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PRO1889 (shown in Figure 8 and SEQ ID NO:16) has certain amino acid sequence identity with a portion of the human E48 antigen protein (HSE48ATGN_I). Accordingly, it is presently believed Cthat PRO1889 disclosed in the present application is a newly identified E48 homolog and may possess activity or properties typical of the E48 protein.
Full-length PRO1890 Polvpeptides SUsing the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the c full-length native sequence PRO1890 (shown in Figure 10 and SEQ ID NO: 18 has certain amino acid sequence 0 10 identity with a portion of the layilin protein (AF093673_1).
0 6. Full-length PRO1887 Polvpeptides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PRO1887 (Figure 12; SEQ ID NO:23) has certain amino acid sequence identity with a mouse liver carboxylesterase precursor identified on the Dayhoffdatabase as "ESTM_MOUSE". Accordingly, it is presently believed that PRO 1887 disclosed in the present application is a newly identified member of the carboxylesterase family and may possess enzymatic activity typical of carboxylesterases.
7. Full-length PR01785 Polvyeptides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01785 (shown in Figure 14 and SEQ ID NO:29) has certain amino acid sequence identity with glutathione peroxidase. Accordingly, it is presently believed that PRO1785 disclosed in the present application is a newly identified member of the peroxidase family and may possess antioxidant enzyme activity. Regulation of antioxidant activity is of interest in the treatment of cancer and aging.
8. Full-length PR04353 Polypeptides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR04353 (shown in Figure 16 and SEQ ID NO:35) has certain amino acid sequence identity with semaphorin Z. Accordingly, it is presently believed that PR04353 disclosed in the present application is a newly identified member of the semaphorin Z family and is involved in inhibition of nerve growth. PR04353 can be used in assays to identify modulators of semaphorin Z, particularly inhibitors to promote central nerve regeneration.
9. Full-leneth PR04357 Polvpeptides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR04357 (shown in Figure 18 and SEQ ID NO:40) has certain amino acid sequence identity with 289 amino acids in accession number P W48804. However, PR04357 has an additional 213 amino acids at the Nc-i terminus.
10. Full-lentth PR04405 Polypeptides As far as is known, the DNA84920-2614 sequence encodes a novel fa"tor designated herein as r PR04405; using WU-BLAST2 sequence alignment computer programs, limited sequence identities to known proteins were revealed.
t1% 11. Full-lenrth PR04356 Polvpeptldes o Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR04356 (shown in Figure 22 and SEQ ID NO:50) has certain amino acid sequence identity with S 10 metastasis associated GPI-anchored protein. Accordingly, it is presently believed that PR04356 disclosed in 0 the present application is a newly identified member of this family and shares similar mechanisms.
12. Full-length PR04352 Polvpeptides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR04352 (shown in Figure 24 and SEQ ID NO:52) has certain amino acid sequence identity with protocadherin pc3 and protocadherin pc4. Accordingly, it is believed that PR04352 is involved in cell adhesin and can be used in treatments regarding differentiation disorders, cell adhesin, neural receptor or skin disorders.
Moreover, it can be used in screens to identify agonists and antagonists to treat such disorders.
13. Full-length PR04380 &LPolvpeptides As far as is known, the DNA92234-2602 sequence encodes a novel factor designated herein as PR04380: using WU-BLAST2 sequence alignment compuier programs, limited sequence identities to proteins with known functions were revealed.
14. Full-length PR04354 Polvpeptides As far as is known, the DNA92256-2596 sequence encodes a novel factor designated herein as PR04354; using WU-BLAST2 sequence alignment computer programs, limited sequence identities to proteins with known functions were revealed.
15. Full-length PR04408 Polvypeptides As far as is known, the DNA92274-2617 sequence encodes a novel factor designated herein as PR04408; using WU-BLAST2 sequence alignment computer programs, limited sequence identities to known proteins were revealed.
16. Full-length PROS737 Polvpentides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR05737 (shown in Figure 32 and SEQ ID NO:63) has certain amino acid sequence identity with IL- 1
O
O
ci and/or IL-IRa. Accordingly, it is presently believed that PR05737 disclosed in the present application is a 0. newly identified member of this family and shares similar mechanisms.
17. Full-length PR04425 Polypeptides
C
As far as is known, the DNA93011-2637 sequence encodes a novel factor designated herein as PR04425; using WU-BLAST2 sequence alignment computer programs, PR04425 showed homology to a protein in GenBank, accession number HGS RE295, but is not identical.
tn o 18. Full-length PR05990 Polypeptides Using the ALIGN-2 sequence alignment computer program referenced above, it has been found that 0 10 the full-length native sequence PRO5990 (shown in Figure 36 and SEQ ID NO:67) has certain amino acid c"i sequence identity with Secretogranin II (Dayhoff No. GEN14673). Accordingly, it is presently believed that the PR05990 polypeptide disclosed in the present application is a newly identified member of the secretogranin protein family and may possess one or more biological and/or immunological activities or properties typical of that protein family.
19. Full-length PR06030 Polvyeptides The DNA96850-2705 clone was isolated from a human library as described in the Examples below.
As far as is known, the DNA96850-2705 nucleotide sequence encodes a novel factor designated herein as PR06030; using the ALIGN-2 sequence alignment computer program, no significant sequence identities to any known proteins were revealed.
Full-length PR04424 Polypeptides As far as is known, the DNA96857-2636 sequence encodes a novel factor designated herein as PR04424: using WU-BLAST2 sequence alignment computer programs, PR04424 showed homology to aprotein in GenBank, accession number HGS A135, but is not identical thereto.
21. Full-length PR04422 Polvyeptides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR04422 (shown in Figure 42 and SEQ ID NO:76) has certain amino acid sequence identity with lysozyme g. Accordingly, it is presently believed that PR04422 disclosed in the present application is a newly identified member of the lysozyme family and may have lysozyme activy.
22. Full-length PR04430 Polypeptides Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR04430 (shown in Figure 44 and SEQ ID NO:78) has certain amino acid sequence identity with the protein in GenBank accession number MMHC213L3_9. Accordingly, it is presently believed that PR04430 disclosed in the present application is related to the GenBank protein and may share at least one similar 0, mechanism.
23. Full-length PRO4499 Polvpeptides As far as is known, the DNA96889-2641 sequence encodes a novel factor designated herein as en PR04499; using WU-BLAST2 sequence alignment computer programs, limited sequence identities to known proteins were revealed.
B. PRO Polvneptide Variants o In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated that PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide changes into S 10 the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the art will appreciate that 0i amino acid changes may alter post-translational processes of the PRO, such as changing the number or position of glycosylation sites or altering the membrane anchoring characteristics.
Variations in the native full-length sequence PRO or in various domains of the PRO described herein, can be made, for example, using any of the techniques and guidelines for conservative and non-conservative mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the PRO as compared with the native sequence PRO. Optionally the variation is by substitution of at least one amino acid with any other amino acid in one or more of the domains of the PRO. Guidance in determining which amino acid residue may be inserted, substituted or deleted without adversely affecting the desired activity may be found by comparing the sequence of the PRO with that of homologous known protein molecules and minimizing the number of amino acid sequence changes made in regions of high homology. Amino acid substitutions can be the result of replacing one amino acid with another amino acid having similar structural.
and/or chemical properties, such as the replacement of a leucine with a serine, conservative amino acid replacements. Insertions or deletions may optionally be in the range of about I to 5 amino acids. The variation allowed may be deterniined by systematically making insertions, deletions or substitutions of amino acids in the sequence and testing the resulting variants for activity exhibited by the full-length or mature native sequence.
PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus or C-terminus, or may lack internal residues, for example, when compared with a full length native protein.
Certain fragments lack amino acid residues that are not essential for a desired biological activity of the PRO polypeptide.
PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by enzymatic digestion, by treating the protein with an enzyme known to cleave proteins at sites defined by particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding a desired polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired termini of the DNA fragment are employed at the 5' and 3' primers in the PCR. Preferably, PRO polypeptide
O
0 ci e-n fragments share at least one biological and/or immunological activity with the native PRO polypeptide disclosed herein.
In particular embodiments, conservative substitutions of interest are shown in Table 6 under the heading of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial changes, denominated exemplary substitutions in Table 6, or as further described below in reference to amino acid classes, are introduced and the products screened.
Table 6 Original Residue Ala (A) Arg (R) Asn (N) Asp (D) Cys (C) Gin (Q) Glu (E) Gly (G) His (H) lie (I) Leu (L) Lys (K) Met (M) Phe (F) Pro (P) Ser (S) Thr (T) Trp (W) Tyr (Y) Val (V) Exemolary Substitutions val; leu; ile lys; gin; asn gin; his; lys; arg glu ser asn asp pro: ala asn; gin; lys; arg leu; val; met; ala; phe; norleucine norleucine; ile; val; met; ala; phe arg; gin; asn leu; phe; ile leu; val; ile; ala; tyr ala thr ser tyr; phe trp; phe; thr; ser ile; leu; met; phe; ala; norleucine Preferred Substitutions val lys gin glu ser asn asp ala arg leu ile arg leu leu ala thr ser tyr phe leu Substantial modifications in function or immunological identity of the PRO polypeptide are accomplished by selecting substitutions that differ significantly in their effect on maintaining the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, the charge or hydrophobicity of the molecule at the target site, or the bulk of the side chain. Naturally occurring residues are divided into groups based on common side-chain properties: hydrophobic: norleucine, met, ala, val, leu, ile; neutral hydrophilic: cys, ser, thr; acidic: asp, glu; basic: asn, gin, his, lys, arg; residues that influence chain orientation: gly, pro; and aromatic: trp, tyr, phe.
O
O Non-conservative substitutions will entail exchanging a member of one of these classes for another class.
Such substituted residues also may be introduced into the conservative substitution sites or, more preferably, into Z the remaining (non-conserved) sites.
The variations can be made using methods known in the art such as oligonucleotide-mediated (site- Cf directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al., Nucl.
Acids Res.. 13:4331 (1986); Zoller et al., Nucl. Acids Res., 10:6487 (1987)], cassette mutagenesis (Wells et al., Gene, 34:315 (1985)], restriction selection mulagenesis [Wells et al., Philos, Trans. R. Soc. London SerA, F[ 317:415 (1986)) or other known techniques can be performed on the cloned DNA to produce the PRO variant ifl DNA.
O
Ci Scanning amino acid analysis can also be employed to identify one or more amino acids along a S 10 contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such O amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the mainchain conformation of the variant [Cunningham and Wells, Science, 244: 1081-1085 (1989)]. Alanine is also typically preferred because it is the most common amino acid. Further, it is frequently found in both buried and exposed positions (Creighton, The Proteins, Freeman Co., Chothia, J. _ol. Biol.. 150:1 (1976)]. If alanine substitution does not yield adequate'amounts of variant, an isoteric amino acid can be used.
C. Modifications of PRO Covalent modifications of PRO are included within the scope of this invention. One type of covalent modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic derivatizing agent that is capable of reacting with selected side chains or the N- or C- terminal residues of the PRO.
Derivatization with bifunctional agents is useful, for instance, for crosslinking PRO to a water-insoluble support matrix or surface for use in the method for purifying anti-PRO antibodies, and vice-versa. Commonly used crosslinking agents include, I, l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate.
Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the a-amino groups of lysine, arginine, and histidine side chains Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman Co., San Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group.
Another type of covalent modification of the PRO polypeptide included within the scope of this invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties found in native sequence PRO (either by removing the underlying glycosylation site or by deleting the glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are not present in the
O
0 native sequence PRO. In addition, the phrase includes qualitative changes in the glycosylation of the native 00) proteins, involving a change in the nature and proportions of the various carbohydrate moieties present.
Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino acid sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more serine C or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino acid sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA encoding the PRO polypeptide at preselected bases such that codons are generated that will translate into the desired amino acids.
O Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art. e.g., S 10 in WO 87/05330 published I I September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259- 306 (1981).
Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically or enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets for glycosylation. Chemical deglycosylation techniques are known in the ar and described, for instance, by Hakimuddin, et al., Arch. Biochem. Biohys., 259:52 (1987) and by Edge et al., Anal. Biochem., 118:131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzvmol., 138:350 (1987).
Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a variety of nonproteinaceous polymers, polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in themannerset forth in U.S. Patent Nos. 4,640,835; 4.496,689; 4,301,144; 4,670,417; 4 7 9 1,192 or 4 ,1 7 9 33 7 The PRO of the present invention may also be modified in a way to form a chimeric molecule comprising PRO fused to another, heterologous polypeptide or amino acid sequence.
In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed at the amino- or carboxyl- terminus of the PRO. The presence of such epitope-tagged forms of the PRO can be detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the PRO to be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective antibodies are well known in the an. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol., 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E1 0 antibodies thereto [Evan et Molecular and Cellular Biology, 1:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al., Protein Engineering, 2(6):547- 553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., BioTechnologv, 6:1204-1210 (1988)1; the KT3 epitope peptide [Martin et al., Science, 255:192-194 (1992)]; an a-tubulin epitope peptide [Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag (Lutz- Freyermuth et al., Proc. Natl, Acad, Sci. USA, 87:6393-6397 (1990)].
O
0c In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule (also referred to as an "immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a PRO n polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred embodiment, the immunoglobulin fusion includes the hinge, CH2 and CR3, or the hinge, CHI, CH2 and CH3 regions of an IgG1 molecule. For the production of immunoglobulin fusions see also US Patent No. 5,428,130 issued June 27, 1995.
i D. Preparation of PRO The description below relates primarily to production of PRO by culturing cells transformed or transfected with a vector containing PRO nucleic acid. It is, of course, contemplated that alternative methods, .which are well known in the art, may ne employed to prepare PRO. For instance, the PRO sequence, or portions thereof, may be produced by direct peptide synthesis using solid-phase techniques [see, Stewart et al., Solid-Phase Pentide Synthesis, W.H. Freeman Co., San Francisco, CA (1969); Merrifield, J. Am, Chem.
Soc., 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or by automation. Automated synthesis may be accomplished, for instance, using an Applied Biosystems Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the PRO may be chemically synthesized separately and combined using chemical or enzymatic methods to produce the full-length
PRO.
1. Isolation of DNA Encoding PRO DNA encoding PRO may be obtained from a cDNA library prepared from tissue believed to possess the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be conveniently obtained from a cDNA library prepared from human tissue, such as described in the Examples. The PROencoding gene may also be obtained from a genomic library or by known synthetic procedures automated nucleic acid synthesis).
Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at least about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA or genomic library with the selected probe may be conducted using standard procedures, such as described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989). An alternative means to isolate the gene encoding PRO is to use PCR methodology [Sambrook et al., supra; Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)].
The Examples below describe techniques for screening acDNA library. The oligonucleotide sequences selected as probes should be ofsufficient length and sufficiently unambiguous that false positives are minimized.
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library being screened. Methods of labeling are well known in the an, and include the use of radiolabels like "P-labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high
O
0, stringency, are provided in Sambrook et al., sunra.
00) Sequences identified in such library screening methods can be compared and aligned to other known sequences deposited and available in public databases such as GenBank or other private sequence databases.
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across Sthe full-length sequence can be determined using methods known in the art and as described herein.
Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 17 conventional primer extension procedures as described in Sambrook et al., supra, to detect precursors and o processing intermediates of mRNA that may not have been reverse-transcribed into cDNA.
O 10 2. Selection and Transformation of Host Cells Host cells are transfected or transformed with expression or cloning vectors described herein for PRO production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting, transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media, temperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general, principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in Mammalian Cell Biotechnology: a Practical Approach, M. Butler, ed. (IRL Press, 1991) and Sambrook et al., sunra.
Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily skilled artisan, for example, CaCI,, CaPO,, liposome-mediated and electroporation. Depending on the host cell used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally used for prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as described by Shaw et al., Gene, 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virolog.
52:456-457 (1978) can be employed. General aspects of mammalian cell host system transfections have been described in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the method of Van Solingen et al., Bact., 130:946 (1977) and Hsiao et al., Proc. Natl Acad Sc. (USA, 76:3829 (1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection, electroporation, bacterial protoplast fusion with intact cells, or polycations, polybrene, polyomithine, may also be used. For various techniques for transforming mammalian cells, see Keown et al., Methods in Enzvmologv, 185:527-537 (1990) and Mansour et al., Nature, 336:348-352 (1988).
Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative or Gram-positive organisms, for example, Enterobacteriaceae such as E, coli. Various E. coli strains are publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. colt strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include Enterobacteriaceae such as Escherichia, E. coll, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella,
O
o Salmonella typhimurium, Serratia, Serratia marcescans, and Shigella, as well as Bacilli such as B.
0. subtilis and B. licheniformis B. licheniformis 41P disclosed in DD 266,710 published 12 April 1989), Pseudomonas such as P. aeruginosa, and Streptomyces. These examples are illustrative rather than limiting.
Strain W3110 is one particularly preferred host or parent host because it is a common host strain for recombinant CDNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For example, strain W3110 may be modified to effect a genetic mutation in the genes encoding proteins endogenous to the host, with examples of such hosts including E. coil W3110 strain 1A2. which has the complete genotype lonA E. coil W3110 strain 9E4, which has the complete genotype tonA Ipr3; E. coli W3110 strain 27C7 t (ATCC 55,244), which has the complete genotype tonA ptr3phoA E15 (argF-lac)169 degP ompTkan'; E. coli SW3110 strain 37D6, which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompT rbs7 ilvG kan'; E. coli W3110 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion o mutation; and anE. coli strain having mutant periplasmicprotease disclosed in U.S. Patent No. 4,946,783 issued 7 August 1990. Alternatively, in vitro methods of cloning, PCR or other nucleic acid polymerase reactions, are suitable.
In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower eukaryotic host microorganism. Others include Schizosaccharomycespombe (Beach and Nurse, Nature, 290: 140 (1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts Patent No. 4,943,529; Fleer et al., Bio/Technologv, 9:968-975 (1991)) such as, K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt et al., J. Bacteriol., 154(2):737-742 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045). K.
wickeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al., Bio/Technology, 8:135 (1990)), K. thermorolerans, and K. marxianus; yarrowia (EP 402,226); Pichia pastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol., 28:265-278 [1988]); Candida; Trichoderma reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad Sci USA, 76:5259-5263 [19791); Schwanniomyces such as Schwanniomyces occidentalis (EP 394,538 published 31 October 1990); and filamentous fungi such as, Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 January 1991), and Aspergillus hosts such as A. nidulans (Ballance et al., Biochem. Bioophvs. Res. Commun., 112:284-289 [1983]; Tilbur et al., Gene, 26:205-221 [1983]; Yeltonet al., Proc. Natl. Acad. Sci. USA. 81: 1470-1474 [1984]) and A. niger(Kelly and Hynes, EMBO 4:475-479 [1985]). Methylotropic yeasts are suitable herein and include, but are not limited to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, Candida, Kloeckera, Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list of specific species that are exemplary of this class of yeasts may be found in C. Anthony, The Biochemistry of Methylotrophs, 269 (1982).
Suitable host cells for the expression of glycosylated PRO are derived from multicellular organisms.
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells.
More specific examples include monkey kidney CVI line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., L Gen Virol., 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin,.Proc, Nal. Acad.
O
Sci. USA, 77:4216(1980)); mouse sertoli cells (TM4. Mather, Biol Reprod., 23:243-251 (1980)); human lung 1 cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary tumor (MMT S060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill in the an.
n 3. Selection and Use of a Reolicable Vector The nucleic acid cDNA or genomic DNA) encoding PRO may be inserted into a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The vector may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate nucleic acid Ssequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an C appropriate restriction endonuclease site(s) using techniques known in the art. Vector components generally O 10 include, but are not limited to, one or more of a signal sequence, an origin of replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of suitable vectors containing one or more of these components employs standard ligation techniques which are known to the skilled artisan.
The PRO may be produced recombinantly not only directly, but also as a fusion polypeptide with a heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of the vector, or it may be a par of the PRO-encoding DNA that is inserted into the vector. The signal sequence may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces a-factor leaders, the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP 362,179 published 4 April 1990), or the signal described in WO 90/13646 published 15 November 1990. In mammalian cell expression, mammalian signal sequences may be used to direct secretion of the protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral secretory leaders.
Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses.
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2 p plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in mammalian cells.
Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker.
Typical selection genes encode proteins that confer resistance to antibiotics or other toxins, ampicillin, neomycin, methotrexate, or tetracycline, complement auxotrophic deficiencies, or supply critical nutrients not available from complex media, the gene encoding D-alanine racemase for Bacilli.
An example of suitable selectable markers for mammalian cells are those that enable the identification of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity,
O
O
c prepared and propagated as described by Urlaubet al., Proc. Natl. Acad. Sci. USA, 77:4216 (1980). A suitable selection gene for use in yeast is the trpl gene present in the yeast piasmid YRp7 [Stinchcomb et.al.. Nature, 282:39 (1979); Kingsman et al., Gene, 7:141 (1979); Tschemper et al., Qgne, 10:157 (1980)]. The trpl gene provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, C ATCC No. 44076 or PEP4-1 [Jones, Genetics, 85:12 (1977)).
Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well known. Promoters suitable for use with prokaiyotic hosts include the p-lactamase and lactose promoter systems O [Chang et al., Nature, 275:615 (1978); Goeddel et al., Nature, 281:544 (1979)], alkaline phosphatase, a tryptophan (trp) promoter system [Goeddel, Nucleic Acids Res., 8:4057 (1980); EP 36,7761, and hybrid O 10 promoters such as the tac promoter [deBoer et al., Proc. Natl, Acad. Sci. USA, 80:21-25 (1983)]. Promoters Sfor use in bacterial systems also will contain a Shine-Dalgamo sequence operably linked to the DNA encoding PRO.
Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3phosphoglycerate kinase [Hitzeman et al.. J. Biol. Chem., 255:2073 (1980)1 or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg., 7:149 (1968); Holland, Biochemistry, 17:4900 (1978)], such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose- 6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase.
Other yeast promoters, which are inducible promoters having the additional advantage of transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters for use in yeast expression are further described in EP 73,657.
PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 July 1989), adenovirus (such as Adenovirus bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, the actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are compatible with the host cell systems.
Transcription of a DNA encoding the PRO by higher eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from mammalian genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5' or 3' to the PRO coding sequence, but is preferably located at a site 5' from the promoter.
O
0 Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated 00) cells from other multicellular organisms) will also contain sequences necessary for the termination of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, occasionally untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding PRO.
Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PRO in recombinant vertebrate cell culture are described in Gething et al., Nature, 293:620-625 (1981); Mantei et al., SNature, 281:40-46 (1979); EP 117,060; and EP 117,058.
Ci 4. Detecting Gene Amplification/Exoression O 10 Gene amplification and/or expression may be measured in a sample directly, for example, by c conventional Southern blotting, Northern blotting to quantitare the transcription of mRNA [Thomas, Proc. Nail, Acad. Sci. USA, 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be detected.
Gene expression, alternatively, may be measured by immunological methods, such as immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based on the DNA sequences provided herein or against exogenous sequence fused to PRO DNA and encoding a specific antibody epitope.
Purification of Polvpeptide Forms of PRO may be recovered from culture medium or from host cell lysates. If membrane-bound, it can be released from the membrane using a suitable detergent solution Triton-X 100) or by enzymatic cleavage. Cells employed in expression of PRO can be disrupted by various physical or chemical means, such as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing agents.
It may be desired to purify PRO from recombinant cell proteins or polypeptides. The following procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal chelating columns to bind epitope-tagged forms of the PRO. Various methods of protein purification may be employed and such methods are known in the an and described for example in Deutscher, Methods in Enzvmoloev, 182 (1990);
O
O
c, Scopes, Protein Purification: Principles and Practice, Springer-Verlag, New York (1982). The purification step(s) selected will depend, for example, on the nature of the production process used and the particular PRO Sproduced.
E. Uses for PRO Nucleotide sequences (or their complement) encoding PRO have various applications in the art of molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the generation of anti-sense RNA and DNA. PRO nucleic acid will also be useful for the preparation of PRO O polypeptides by the recombinant techniques described herein.
The full-length native sequence PRO gene, or portions thereof, may be used as hybridization probes for a cDNA library to isolate the full-length PRO cDNA or to isolate still other cDNAs (for instance, those (C encoding naturally-occurring variants of PRO or PRO from other species) which have a desired sequence identity to the native PRO sequence disclosed herein. Optionally, the length of the probes will be about 20 to about bases. The hybridization probes may be derived from at least partially novel regions of the full length native nucleotide sequence wherein those regions may be determined without undue experimentation or from genomic sequences including promoters, enhancer elements and introns of native sequence PRO. By way of example, a screening method will comprise isolating the coding region of the PRO gene using the known DNA sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of labels, including radionucleotides such as "P or or enzymatic labels such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of the PRO gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to determine which members of such libraries the probe hybridizes to. Hybridization techniques are described in further detail in the Examples below.
Any EST sequences disclosed in the present application may similarly be employed as probes, using the methods disclosed herein.
Other useful fragments of the PRO nucleic acids include antisense or sense oligonucleotides comprising a singe-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target PRO mRNA (sense) or PRO DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the present invention, comprise a fragment of the coding region of PRO DNA. Such a fragment generally comprises at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example, Stein and Cohen (Cancer Res, 48:2659, 1988) and van der Krol et al. (BioTechniques 6:958, 1988).
Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation of duplexes that block transcription or translation of the target sequence by one of several means, including enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means.
The antisense oligonucleotides thus may be used to block expression of PRO proteins. Antisense or sense oligonucleotides further comprise oligonucleotides having modified sugar-phosphodiester backbones (or other sugar linkages, such as those described in WO 91/06629) and wherein such sugar linkages are resistant to
O
c endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable in vivo capable of 0. resisting enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide sequences.
Other examples of sense or antisense oligonucleotides include those oligonucleotides which are covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases Saffinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target nucleotide sequence.
O Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid sequence by any gene transfer method, including, for example, CaPO,-mediated DNA transfection, O 10 electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an ci antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target nucleic acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a retrovirus derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCTSC (sce WO 90/13641).
Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines.
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell.
Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase.
Antisense or sense RNA or DNA molecules are generally at least about 5 bases in length, about bases in length, about 15 bases in length, about 20 bases in length, about 25 bases in length, about 30 bases in length, about 35 bases in length, about 40 bases in length, about 45 bases in length, about 50 bases in length, about 55 bases in length, about 60 bases in length, about 65 bases in length, about 70 bases in length, about bases in length, about 80 bases in length, about 85 bases in length, about 90 bases in length, about 95 bases in length, about 100 bases in length, or more.
The probes may also be employed in PCR techniques to generate a pool of sequences for identification of closely related PRO coding sequences.
Nucleotide sequences encoding a PRO can also be used to construct hybridization probes for mapping the gene which encodes that PRO and for the genetic analysis of individuals with genetic disorders. The nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers.
and hybridization screening with libraries.
0 c-I When the coding sequences for PRO encode a protein which binds to another protein (example, where (3J1 the PRO is a receptor), the PRO can be used in assays to identify the other proteins or molecules involved in the binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be identified.
Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the binding interaction. Also, the receptor PRO can be used to isolate correlative ligand(s).
Screening assays can be designed to find lead compounds that mimic the biological activity of a native PRO or a receptor for PRO. Such screening assays will include assays amenable to high-throughput screening of chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small O molecules contemplated include synthetic organic or inorganic compounds. The'assays can be performed in a variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and o 10 cell based assays, which are well characterized in the art.
CNucleic acids which encode PRO or its modified forms can also be used to generate either transgenic animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically useful reagents. A transgenic animal a mouse or rat) is an animal having cells that contain a transgene.
which transgene was introduced into the animal or an ancestor of the animal at a prenatal, an embryonic stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal develops. In one embodiment, cDNA encoding PRO can be.used to clone genomic DNA encoding PRO in accordance with established techniques and the genomic sequences used to generate transgenic animals that contain cells which express DNA encoding PRO. Methods for generating transgenic animals, particularly animals such as mice or rats, have become conventional in the an and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for PRO transgene incorporation with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding PRO introduced into the germ line of the animal at an embryonic stage can be used to examine the effect of increased expression of DNA encoding PRO. Such animals can be used as tester animals for reagents thought to confer protection from, for example, pathological conditions associated with its overexpression. In accordance with this facet of the invention, an animal is treated with the reagent and a reduced incidence of the pathological condition, compared to untreated animals bearing the transgene, would indicate a potential therapeutic intervention for the pathological condition.
Alternatively, non-human homologues of PRO can be used to construct a PRO "knock out' animal which has a defective or altered gene encoding PRO as a result of homologous recombination between the endogenous gene encoding PRO and altered genomic DNA encoding PRO introduced into an embryonic stem cell of the animal. For example, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in accordance with established techniques. A portion of the genomic DNA encoding PRO can be deleted or replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included in the vector [see Thomas and Capecchi, Cell, 51:503 (1987) for a description of homologous recombination vectors]. The vector is introduced into an embryonic stem cell line by electroporation) and cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected [see
O
O
C1 Li et al., Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal a mouse or rat) to form aggregation chimeras [see Bradley, in Teratocarcinomas and Embryonic Stem Cells:A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-1521. A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be identified by standard techniques and used to breed animals in which all cells of the animal contain the Shomologously recombined DNA. Knockout animals can be characterized for instance, for their ability to defend against certain pathological conditions and for their development of pathological conditions due to absence of O the PRO polypeptide.
r Nucleic acid encoding the PRO polypeptides may also be used in gene therapy. In gene therapy O 10 applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective Cl genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA.
Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell membrane. (Zamecnik et al.. Proc. Natl. Acad. Sci. USA 83:4143-4146 [1986]). The oligonucleotides can be modified to enhance their uptake, e.g. by substituting their negatively charged phosphodiester groups by uncharged groups.
There are a variety of techniques available for introducing nucleic acids into viable cells. The techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro include the use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium phosphate precipitation method, etc. The currently preferred in vivo gene transfer techniques include transfection with viral (typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau et al., Trends in Biotechnology 11, 205-210 [1993]). In some situations it is desirable to provide the nucleic acid source with an agent that targets the target cells, such as an antibody specific for a cell surface membrane protein or the target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins which bind to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to facilitate uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies for proteins which undergo internalization in cycling, proteins that target intracellular localization and enhance intracellular half-life.
The technique of receptor-mediated endocytosis is described, for example, by Wu et al., J. Biol. Chem. 262, 4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414 (1990). For review of gene marking and gene therapy protocols see Anderson et al., Science 256, 808-813 (1992).
The PRO polypeptides described herein may also be employed as molecular weight markers for protein electrophoresis purposes and the isolated nucleic acid sequences may be used for recombinantly expressing those markers.
O
c" The nucleic acid molecules encoding the PRO polypeptides or fragments thereof described herein are Suseful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently available. Each PRO nucleic acid molecule of the present invention can be used as a chromosome marker.
n The PRO polypeptides and nucleic acid molecules of the present invention may also be used for tissue typing, wherein the PRO polypeptides of the present invention may be differentially expressed in one tissue as compared to another. PRO nucleic acid molecules will find use for generating probes for PCR, Northern analysis, Southern analysis and Western analysis.
o The PRO polypeptides described herein may also be employed as therapeutic agents. The PRO polypeptides of the present invention can be formulated according to known methods to prepare pharmaceutically O 10 useful compositions, whereby the PRO product hereof is combined in admixture with a pharmaceutically Sacceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing the active ingredient having the desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers (Remington's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form of lyophilized formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the dosages and concentrations employed, and include buffers such as phosphate, citrate and other organic acids; antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEENTM,
PLURONICS
T M or PEG.
The formulations to be used for in vivo administration must be sterile. This is readily accomplished by filtration through sterile filtration membranes, prior to or following lyophilization and reconstitution.
Therapeutic compositions herein generally are placed into a container having a sterile access port, for example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle.
The route of administration is in accord with known methods, e.g. injection or infusion by intravenous, intraperitoneal, intracerebral, intramusculhr, intraocular, intraarterial or intralesional routes, topical administration, or by sustained release systems.
Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may vary depending on the particular use envisioned. The determination of the appropriate dosage or route of administration is well within the skill of an ordinary physician. Animal experiments provide reliable guidance for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be performed following the principles laid down by Mordenti, J. and Chappell, W. "The use of interspecies scaling in toxicokinetics" In Toxicokinetics and New Drug Development, Yacobi et al., Eds., Pergamon Press, New York 1989, pp. 42-96.
When in vivo administration of a PRO polypeptide or agonist or antagonist thereof is employed, normal dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per day, WO 00/56889 PCT/US00/05601 C" preferably about 1 pg/kg/day to 10 mg/kg/day, depending upon the route of administration. Guidance as to 0. particular dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos.
4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different treatment compounds and different disorders, that administration targeting one organ or tissue, for example, may necessitate delivery in a manner different from that to another organ or tissue.
Where sustained-release administration of a PRO polypeptide is desired in a formulation with release Scharacteristics suitable for the treatment of any disease or disorder requiring administration of the PRO polypept de, microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant O proteins for sustained release has been successfully performed with human growth hormone (rhGli), interferonr (rhIFN-), interleukin-2, and MN rgpl20. Johnson et al., Nat. Med., 2:795-799 (1996); Yasuda, Biomed.
S 10 Ther., 27:1221-1223 (1993); Hora etal., Bio/Technologv. 8:755-758 (1990); Cleland, "Design and Production of Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in Vaccine Design: The Subunit and Adiuvant Atproach, Powell and Newman, eds, (Plenum Press: New York, 1995), pp. 439-462; WO 97/03092, WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010.
The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid (PLGA) polymer due to its biocompatibility and wide range of biodegradable properties. The degradation products of PLGA, lactic and glycolic acids, can be cleared quickly within the human body. Moreover, the degradability of this polymer can be adjusted from months to years depending on its molecular weight and composition. Lewis, "Controlled release of bioactive agents from lactide/glycolide polymer," in: M. Chasin and R. Langer Biodegradable Polymers as Drug Delivery Systems (Marcel Dekker: New York, 1990), pp. 1-41.
This invention encompasses methods of screening compounds to identify those that.mimic the PRO polypeptide (agonists) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for antagonist drug candidates are designed to identify compounds that bind or complex with the PRO polypeptides encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides with other cellular proteins. Such screening assays will include assays amenable to high-throughput screening of chemical libraries, making them particularly suitable for identifying small molecule drug candidates.
The assays can be performed in a variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art.
All assays for antagonists are common in that they call for contacting the drug candidate with a PRO polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these two components to interact.
In binding assays, the interaction is binding and the complex formed can be isolated or detected in the reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified hereinor the drug candidate is immobilized on a solid phase, on a microtiter plate, by covalent or non-covalent attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of the PRO polypeptide and drying. Alternatively, an immobilized antibody, a monoclonal antibody, specific for the PRO polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed
O
0 by adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized component, the coated surface containing the anchored component. When the reaction is complete, the non-reacted components are removed, by washing, and complexes anchored on the solid surface are detected. When the originally non-immobilized component carries a detectable label, the detection of label immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized component does not carry a label, complexing can be detected, for example, by using a labeled antibody specifically binding the immobilized complex.
SIf the candidate compound interacts with but does not bind to a particular PRO polypeptide encoded by Sa gene identified herein, its interaction with that polypeptide can be assayed by mcthods well known for detecting C protein-protein interactions. Such assays include traditional approaches, such as, cross-linking, coimmunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, proteinprotein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers (Fields and Song, Nature (London), 340:245-246 (1989); Chien et al., Proc. Natl. Acad. Sci. USA, 88:9578- 9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl, Acad. Sci. USA, 89: 5789-5793 (1991). Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast expression system described in the foregoing publications (generally referred to as the "two-hybrid system') takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to the DNA-binding domain of GAL4, and another, in which candidate activating proteins are fused to the activation domain. The expression of a GAL1-lacZ reporter gene under control of a GAIA-activated promoter depends on reconstitution of GALA activity via protein-protein interaction. Colonies containing interacting polypeptides are detected with a chromogenic substrate for p-galactosidase. A complete kit (MATCHMAKER) for identifying protein-protein interactions between two specific proteins using the twohybrid technique is commercially available from Clontech. This system can also be extended to map protein domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for these interactions.
Compounds that interfere with the interaction of a gene encoding a PRO polypeptide identified herein and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared containing the product of the gene and the intra- or extracellular component under conditions and for a time allowing for the interaction and binding of the two products. To test the ability of a candidate compound to inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation) between the test compound and the intra- or extracellular coinponent present in the mixture is monitored as described hereinabove. The formation of a complex in the control reaction(s) but not in the reaction mixture containing the test compound indicates that the test compound interferes with the interaction of the test compound and its reaction partner.
To assay for antagonists, the PRO polypeptide may be added to a cell along with the compound to be screened for a particular activity and the ability of the compound to inhibit the activity of interest in the presence
O
Cl of the PRO polypeptide indicates that the compound is an antagonist to the PRO polypeptide. Alternatively, antagonists may be detected by combining the PRO polypeptide and a potential antagonist with membrane-bound r PRO polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition assay. The PRO polypeptide can be labeled, such as by radioactivity, such that the number of PRO polypeptide molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene encoding the receptor can be identified by numerous methods known to those of skill in the art, for example, 1 ligand panning and FACS sorting. Coligan et al., Current Protocols in Immun., Chapter 5 (1991).
Preferably, expression cloning is employed wherein polyadenylated RNA is prepared from a cell responsive to O the PRO polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS cells or other cells that are not responsive to the PRO polypeptide. Transfected cells that are grown on glass O 10 slides are exposed to labeled PRO polypeptide. The PRO polypeptide can be labeled by a variety of means Cl •including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and Sincubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a single clone that encodes the putative receptor.
As an alternative approach for receptor identification, labeled PRO polypeptide can be photoaffinitylinked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material is resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained from micro- sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA library to identify the gene encoding the putative receptor.
In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor would be incubated with labeled PRO polypeptide in the presence of the candidate compound. The ability of the compound to enhance or block this interaction could then be measured.
More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of immunoglobulin with PRO polypeptide, and, in particular antibodies including, without limitation, poly- and monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments.
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the PRO polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the PRO polypeptide.
Another potential PRO polypeptide antagonist is an antisense RNA or DNA construct prepared using antisense technology, where, an antisense RNA or DNA molecule acis to block directly the translation of mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are based on binding of a polynucleotide to DNA or RNA. For example, the 5'coding portion of the polynucleotide sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be
O
Scomplementary to a region of the gene involved in transcription (triple helix see Lee et al., Nucl. Acids Res., 6:3073 (1979); Cooney et al., Science, 241: 456 (1988); Dervan et al., Science, 251:1360 (1991)), thereby Z preventing transcription and the production of the PRO polypeptide. The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the PRO polypeptide C (antisense Okano, Neurochem., 56:560 (1991); Olieodeoxvnucleotides as Antisense Inhibitors of Gene Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can also be delivered to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the PRO 17- polypeptide. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiation site, S between about -10 and +10 positions of the target gene nucleotide sequence, are preferred.
l Potential antagonists include small molecules that bind to the active site, the receptor binding site, or growth factor or other relevant binding site of the PRO polypeptide, thereby blocking the normal biological 0 activity of the PRO polypeptide. Examples of small molecules include, but are not limited to, small peptides or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic compounds.
Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA.
Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by known techniques. For further details see, Rossi, Current Biology, 4:469-471 (1994), and PCT publication No. WO 97/33551 (published September 18, 1997).
Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of purines or pyrimidines on one strand of a duplex. For further details see, PCT publication No. WO 97/33551, supra.
These small molecules can be identified by any one or more of the screening assays discussed hereinabove and/or by any other screening techniques well known for those skilled in the art.
Uses of the herein disclosed molecules may also be based upon the positive functional assay hits disclosed and described below.
F. Anti-PRO Antibodies The present invention further provides anti-PRO antibodies. Exemplary antibodies include polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies.
1. Polvclonal Antibodies The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The
O
l immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to conjugate the immunizing agent to a protein known to be immunogenic in the mammal being immunized. Examples of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate).
The immunization protocol may be selected by one skilled in the art without undue experimentation.
2. Monoclonal Antibodies O The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be t" prepared using hybridoma methods, such as those described by Kohler and Milstein, Name, 256:495 (1975).
o 10 In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an l immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically .bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro.
The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof.
Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 5 9 103 Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT). the culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells.
Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production of human monoclonal antibodies [Kozbor J. Immunol., 133:3001 (1984); Brodeur et al., Monoclonal Antibody Production Techniques and Alications, Marcel Dekker, Inc., New York, (1987) pp. 51-631.
The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980).
O
0 After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 1 procedures and grown by standard methods [Goding, supral. Suitable culture media for this purpose include, Sfor example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma cells may be grown in vivo as ascites in a mammal.
Cn The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography.
The monoclonal antibodies may also be made by recombinant DNA methods, such as those described o in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated C"i and sequenced using conventional procedures by using oligonucleotide probes that are capable of binding 0 10 specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the c" invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting the coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences Patent No. 4,816,567; Morrison et al., supraj or by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 'polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody.
The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well known in the art. For example, one method involves recombinant expression of immunoglobulin light chain and modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid residue or are deleted so as to prevent crosslinking.
In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known in the art.
3. Human and Humanized Antibodies The anti-PRO antibodies of the invention may further comprise humanized antibodies or human antibodies. Humanized forms of non-human murine) antibodies are chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', or other antigen-binding subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin.
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In
O
O
Cl some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region typically that of a human immunoglobulin [Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-329 (1988); and Presta, Curr. Op, O Struct. BioL, 2:593-596 (1992)].
r f Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized o 10 antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- NC human amino acid residues are often referred to as "import" residues, which are typically taken from an "import" variable domain. Humanization can be essentially performed following the method of Winter and co-workers [Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., Science, 232:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies Patent No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by the corresponding sequence from a non-human species. In practice, humanized antibodies are typically human antibodies in which some CDR residues and possibly some FR residues are substituted by residues from analogous sites in rodent antibodies.
Human antibodies can also be produced using various techniques known in the an, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); Marks et al., J, Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and Boerner et al., J. Immunol., 147(1):86-95 (1991)]. Similarly, human antibodies can be made by introducing of human immunoglobulin loci into transgenic animals, mice in which the endogenous immunoglobulin genes have been partially or completely inactivated. Upon challenge, human antibody production is observed, which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: Marks et al., Bio/Technologv 779-783(1992); Lonberg etal., Nature 368 856-859 (1994): Morrison, Nature 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 (1996); Neuberger, Nature Biotechnology 4, 826 (1996); Lonberg and Huszar, Intern. Rev. Immunol. 1165-93 (1995).
4. Bispecific Antibodies Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding specificities for at least two different antigens. In the present case, one of the binding specificities is for the PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor
O
Ssubunil.
Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different specificities [Milstein and Cuello, Nature, 305:537-539(1983)].
CBecause of the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a potential mixture of ten different antibody molecules, of which only one has the correct bispecific stricture. The purification of the correct molecule is usually accomplished by affinity chromatography steps.
Similar procedures are disclosed in WO 93/08829, published 13 May 1993, a id in Traunecker et al., EMBO SJ., 1:3655-3659 (1991).
C Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can S 10 be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin Sheavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region (CH 1) containing the site necessary for light-chain binding present in at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable host organism. For further details of generating bispecific antibodies see, for example, Suresh et al., Methods in Enzvmologv. 121:210 (1986).
According to another approach described in WO 96/27011, the interface between a pair of antibody molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody constant domain.
In this method, one or more small amino acid side chains from the interface of the first antibody molecule are replaced with larger side chains tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large amino acid side chains with smaller ones alanine or threonine). This provides a mechanism for increasing the yield of the heterodimer over other unwanted end-products such as homodimers.
Bispecific antibodies can be prepared as full length antibodies or antibody fragments F(ab'), bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been described in the literature. For example, bispecific antibodies can be prepared can be prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to generate fragments. These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies produced can be used as agents for the selective immobilization of enzymes.
Fab' fragments may be directly recovered from E. coli and chemically coupled to form bispecific antibodies. Shalaby et al., J. Exp, Med. 175:217-225 (1992) describe the production of a fully humanized bispecific antibody molecule. Each Fab' fragment was separately secreted from E. coli and subjected O O Cl to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic Sactivity of human cytotoxic lymphocytes against human breast tumor targets.
Various technique for making and isolating bispecific antibody fragments directly from recombinant cell culture have also been described. For example, bispecific antibodies have been produced using leucine zippers.
Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers.
O This method can also be utilized for the production of antibody homodimers. The "diabody" technology Sdescribed by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative o 10 mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain variable domain connected to a light-chain variable domain (V
L
by a linker which is tob short to allow pairing between the two domains on the same chain. Accordingly, the V, and VL domains of one fragment are forced to pair with the complementary V, and domains of another fragment, thereby forming two antigen-binding sites. Another strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, Gruber et al., J. Immunol. 152:5368 (1994).
Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared.
Tutt e al., J. Immunol. 147:60 (1991).
Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypeptide herein.
Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a triggering molecule on a leukocyte such as a T-cell receptor molecule CD2, CD3, CD28, or B7), or Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and FeyRIll (CD16) so as to focus cellular defense mechanisms to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies possess a PRO-binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds tissue factor
(TF).
Heteroconiugate Antibodies Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed to target immune system cells to unwanted cells Patent No. 4,676,980], and fortreatment of HIV infection [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro using known methods in synthetic protein chemistry, including those involving crosslinking agents. For example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond.
Examples of suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980.
O
c 6. Effector Function Engineering S) It may be desirable to modify the antibody of the inventioni with respect to effector function, so as to enhance, the effectiveness of the antibody in treating cancer. For example, cystcine residue(s) may be _introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The homodimeric antibody thus generated may have improved internalization capability and/or increased complementmediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Ex Med., 176: 1191-1195 (1992) and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as described in Wolff et al.
o Cancer Research, 3: 2560-2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al., Anti-Cancer S 10 Drug Design, 3: 219-230 (1989).
0 7. Immunoconiueates The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin an enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope a radioconjugate).
Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above.
Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins; Phytolaca americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin. enomycin, and the tricothecenes. A variety of radionuclides are available for the production of radioconjugated antibodies. Examples include 2 2 Bi, I 31 and '"Re.
Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bisdiazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in Vitena etal., Science, 238: 1098 (1987). Carbon-14-labeled 1isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See W094/11026.
In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed by removal of unbound conjugate from the circulation using a clearing agent and then administration of a "ligand" avidin) that is conjugated to a cytotoxic agent a radionucleotide).
O
l 8. Immunoliposomes The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing the antibody are prepared by methods known in the art, such as described in Epstein et al., Proc. Natl. Acad.
Sci. SA, 82: 3688 (1985); Hwang er al., Proc. Natl Acad. Sci. USA, 77: 4030 (1980); and U.S. Pat. Nos.
4,485,045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No.
5,013,556.
1 Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized phosphatidylethanolamine (PEG- O PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter.
l Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin o 10 et al., J. Biol. Chem., 257: 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic agent Cl (such as Doxorubicin) is optionally contained within tie liposome. See Gabizon et al., J. National Cancer Inst., S81(19): 1484 (1989).
9. Pharmaceutical Compositions of Antibodies Antibodies specifically binding a PRO polypeptide identified herein, as well as other molecules identified by the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders in the form of pharmaceutical compositions.
If the PRO polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that specifically binds to the binding domain of the target protein is preferred. For example, based upon the variableregion sequences of an antibody, peptide molecules can be designed that retain the ability to bind the target protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA technology. See, Marasco e al.. Proc. Natl. Acad. Sci. USA, 90: 7889-7893 (1993). The formlation herein may also contain more than one active compound as recessary for the particular indication being treated, preferably those with complementary activities that do not adversely affect each other. Alternatively, or in addition, the composition may comprise an agent that enhances its function, such as. for example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are suitably present in combination in amounts that are effective for the purpose intended.
The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsions.
Such techniques are disclosed in Remington's Pharmaceutical Sciences, supra.
The formulations to be used for in vivo administration must be sterile. This is readily accomplished by filtration through sterile filtration membranes.
O
O
Cl Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations .0 include semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in the form of shaped articles, films, or microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)),polylactides (U.S.
Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT TM (injectable Smicrospheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release O of molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When encapsulated Santibodies remain in the body for a long time, they may denature or aggregate as a result of exposure to moisture o 10 at 37"C, resulting in a loss of biological activity and possible changes in immunogenicity. Rational strategies l can be devised for stabilization depending on the mechanism involved. For example, if the aggregation mechanism is discovered to be intermolecular S-S bond formation through thio-disulfide interchange, stabilization may be achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, controlling moisture content, using appropriate additives, and developing specific polymer matrix compositions.
G. Uses for anti-PRO Antibodies The anti-PRO antibodies of the invention have various utilities. For example, anti-PRO antibodies may be used in diagnostic assays for PRO, detecting its expression in specific cells, tissues, or serum. Various diagnostic assay techniques known in the an may be used, such as competitive binding assays, direct or indirect sandwich assays and immunoprecipitation assays conducted in either heterogeneous or homogeneous phases [Zola, Monoclonal Antibodies: A Manual of Techniques, CRC Press, Inc. (1987) pp. 147-158]. The antibodies used in the diagnostic assays can be labeled with a detectable moiety. The detectable moiety should be capable of producing, either directly or indirectly, a detectable signal. For example, the detectable moiety may be a radioisotope, such as 4 C, 3P, or 1251, a fluorescent or chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for conjugating the antibody to the detectable moiety may be employed, including those methods described by Hunter et al., ature, 144:945 (1962); David et al., Biochemistry, 13:1014 (1974); Pain et al., J Immunol. Meth., 40:219 (1981); and Nygren, J.HistQchem, and Cytochem., 30:407 (1982).
Anti-PRO antibodies also are useful for the affinity purification of PRO from recombinant cell culture or natural sources. In this process, the antibodies against PRO are immobilized on a suitable support, such a Sephadex resin or filter paper, using methods well known in the art. The immobilized antibody then is contacted with a sample containing the PRO to be purified, and thereafter the support is washed with a suitable solvent that will remove substantially all the material in the sample except the PRO, which is bound to the immobilized antibody. Finally, the support is washed with another suitable solvent that will release the PRO from the antibody.
O
rcl The following examples are offered for illustrative purposes only, and are not intended to limit the scope 0. of the present invention in any way.
All patent and literature references cited in the present specification are hereby incorporated by reference in their entirety.
EXAMPLES
Commercially available reagents referred to in the examples were used according to manufacturer's instructions unless otherwise indicated. The source of those cells identified in the following examples, and O throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas,
SVA.
C] EXAMPLE 1: Extraceilular Domain Homology Screenine to Identify Novel Polvnelnides and cDNA Encodin Therefor The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from about 950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. The EST databases included public databases Dayhoff, GenBank), and proprietary databases LIFESEQ
TM
Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST-2 (Altschul et al., Methods in Enzvmoloav 266:460-480 (1996)) as a comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons with a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, WA).
Using this extracellular domain homology screen, consensus DNA sequences were assembled relative to the other identified EST sequences using phrap. In addition, the consensus DNA sequences obtained were often (but not always) extended using repeated cycles of BLAST or BLAST-2 and phrap to extend the consensus sequence as far as possible using the sources of EST sequences discussed above.
Based upon the consensus sequences obtained as described above, oligonucleotides were then synthesized and used to identify by PCR a cDNA library that contained the sequence of interest and for use as probes to isolate a clone of the full-length coding sequence for a PRO polypeptide. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100- 1000 bp in length. The probe sequences are typically 40-55 bp in length. In some cases, additional oligonuclecotides are synthesized when the consensus sequence is greater than about I-I.Skbp. In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and one of the primer pairs.
The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a Nod site, linked with blunt to Sail hemikinased adaptors, cleaved with Notl, sized appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as
O
O pRKB or pRKD; pRK5B is a precursor ofpRK5D that does not contain the Sfil site; see, Holmes et al., Science, 253:1278-1280 (1991)) in the unique XhoI and Nol sites.
EXAMPLE 2: Isolation of cDNA clones by Amylase Screening S1. Preparation of oligo dT primed cDNA library mRNA was isolated from a human tissue of interest using reagents and protocols from Invitrogen, San Diego, CA (Fast Track This RNA was used to generate an oligo dT primed cDNA library in the vector
S
pRK5D using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System).
V In this procedure, the double stranded :DNA was sized to greater than 1000 bp and the Sall/NotI linkered cDNA was cloned into Xhol/Notl cleaved vector. pRKSD is a cloning vector that has an sp6 transcription initiation site followed by an Sfil restriction enzyme site preceding the XhoolNtl cDNA cloning sites.
0 2. Preparationof random primed cDNA library A secondary cDNA library was generated in order to preferentially represent the 5' ends of the primary cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was used to generate a random primed cDNA library in the vector pSST-AMY.0 using reagents and protocols from Life Technologies (Super Script Plasmid System, referenced above). In this procedure the double stranded cDNA was sized to 500-1000 bp, linkered with blunt to NotI adaptors, cleaved with Sfil, and cloned into Sfil/Notl cleaved vector. pSST-AMY.0 is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion signal) followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, cDNAs cloned into this vector that are fused in frame with amylase sequence will lead to the secretion of amylase from appropriately transfected yeast colonies.
3. Transformation and Detection DNA from the library described in paragraph 2 above was chilled on ice to which was added electrocompetent DHO1B bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then electroporated as recommended by the manufacturer. Subsequently, SOC media (Life Technologies, I ml) was added and the mixture was incubated at 37"C for 30 minutes. The transformants were then plated onto standard 150 mm LB plates containing ampicillin and incubated for 16 hours Positive colonies were scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g. CsCIgradient. The purified DNA was then carried on to the yeast protocols below.
The yeast methods were divided into three categories: Transformation of yeast with the plasmid/cDNA combined vector; Detection and isolation of yeast clones secreting amylase; and PCR amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and further analysis.
The yeast strain used was HD56-5A (ATCC-90785). This strain has the following genotype: MAT alpha, ura3-52, leu2-3, leu2-112, his3-11, his3-15, MAL*, SUC 4 GAL*. Preferably, yeast mutants can be 0 employed that have deficient post-translational pathways. Such mutants may have translocation deficient alleles in sec71, sec72, sec62, with truncated sec71 being most preferred. Alternatively, antagonists (including antisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins implicated in this post translation pathway SEC61p, SEC72p, SEC62p, SEC63p, TDJlp or SSAlp-4p) c or the complex formation of these proteins may also be preferably employed in combination with the amylaseexpressing yeast.
Transformation was performed based on the protocol outlined by Gietz et al., Nuct. Acid. Res., 20:1425 (1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml) o and grown overnight at 30*C. The YEPD broth was prepared as described in Kaiser et al., Methods in Yeast Genetics, Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 207 (1994). The overnight culture was then O 10 diluted to about 2 x 106 cells/ml (approx. OD,=0.1) into fresh YEPD broth (500 ml) and regrown to 1 x 107 0i cells/ml (approx. ODw 0.4-0.5).
The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in a Sorval GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile water, and centrifuged again in 50 ml falcon tubes at 3,500 rpm in a Beckman GS-6KR centrifuge. The supernatant was discarded and the cells were subsequently washed with LiAc/TE (10 ml, 10 mM Tris-HCI, 1 mM EDTA pH 7:5, 100 mM Li,OOCCH3), and resuspended into LiAc/TE (2.5 mi).
Transformation took place by mixing the prepared cells (100 p1) with freshly denatured single stranded salmon testes DNA (Lofstrand Labs, Gaithersburg, MD) and transforming DNA (1 pg, vol. 10 pl) in microfuge tubes. The mixture was mixed briefly by vortexing, then 40% PEG/TE (600 pl, 40% polyethylene glycol-4000, 10 mM Tris-HCI, I mM EDTA, 100 mM Li,OOCCH,, pH 7.5) was added. This mixture was gently mixed and incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42"C for 15 minutes, and the reaction vessel centrifuged in a microfuge at 12,000 rpm for 5-10 seconds, decanted and resuspended into TE (500 10 mM Tris-HC1, I mM EDTA pH 7.5) followed by recentrifugation. The cells were then diluted into TE (1 ml) and aliquots (200 pl) were spread onto the selective media previously prepared in 150 mm growth plates (VWR).
Alternatively, instead of multiple small reactions, the transformation was performed using asingle, large scale reaction, wherein reagent amounts were scaled up accordingly.
The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as described in Kaiser et al., Methods in Yeast Genetics, Cold Spring Harbor Press, Cold Spring Harbor, NY, p.
208-210 (1994). Transformants were grown at 30*C for 2-3 days.
The detection of colonies secreting amylase was performed by including red starch in the selective growth media. Starch was coupled to the red dye (Reactive Red-120, Sigma) as per the procedure described by Biely et al., Anal. Biochem., 172:176-179 (1988). The coupled starch was incorporated into the SCD-Ura agar plates at a final concentration of 0.1' and was buffered with potassium phosphate to a pH of 7.0 100 mM final concentration).
The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase
O
secretion were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies were n determined by their ability to break down starch resulting in a clear halo around the positive colony visualized Sdirectly.
S4. Isolation of DNA by PCR Amplification When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile waler (30 pl) in a 96 well plate. At this time, the positive colonies were either frozen and stored for subsequent analysis or immediately amplified. An aliquot of cells (5 pi) was used as a template for the PCR reaction in a i 25 pl volume containing: 0.5 il Klentaq (Clontech, Palo Alto, CA); 4.0 pl 10 mM dNTP's (Perkin Elmer- Ci Cetus); 2.5 pl Kentaq buffer (Clontech); 0.25 pl forward oligo I; 0.25 pi reverse oligo 2; 12.5 pl distilled water.
S 10 The sequence of the forward oligonucleotide 1 was: o 5'-TGTAAAACGACGGCCAGTTAAATAGACCTGCAATTATTAATCT-3' (SEQ ID NO:3) The sequence of reverse oligonucleotide 2 was: 5'-CAGGAAACAGCTATGACCACCTGCACACCTGCAAATCCATT-3' (SEQ ID NO:4) PCR was then performed as follows: a. Denature 92"C, 5 minutes b. 3 cycles of: Denature 92*C, 30 seconds Anneal 59 0 C, 30 seconds Extend 72"C, 60 seconds c. 3 cycles of: Denature 92"C, 30 seconds Anneal 57"C, 30 seconds Extend 72"C, 60 seconds d. 25 cycles of: Denature 92C, 30 seconds Anneal 55"C, 30 seconds Extend 72 0 C, 60 seconds e. Hold 4 C The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase region, respectively, and amplified a 307 bp region from vector pSST-AMY.0 when no insert was present.
Typically, the first 18 nucleotides of the 5' end of these oligonucleotides contained annealing sites for the sequencing primers. Thus, the total product of the PCR reaction from an empty vector was 343 bp. However, signal sequence-fused cDNA resulted in considerably longer nucleotide sequences.
Following the PCR, an aliquot of the reaction (5 pul) was examined by agarose gel electrophoresis in a 1 agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al., supra.
Clones resulting in a single strong PCR product larger than 400 bp were further analyzed by DNA sequencing after purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc., Chatsworth, CA).
O
SEXAMPLE 3: Isolation of cDNA Clones Using Signal Algorithm Analysis Various polypeptide-encoding nucleic acid sequences were identified by applying a proprietary signal sequence finding algorithm developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as _clustered and assembled EST fragments from public GenBank) and/or private (LIFESEQ*, Incyte SPharmaceuticals, Inc., Palo Alto, CA) databases. The signal sequence algorithm computes a secretion signal score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine codon(s) (ATG) at the 5'-end of the sequence or sequence fragment under consideration. The nucleotides following ,he first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first o ATG has the required amino acids, the second is not examined. If neither meets the requirement, the candidate sequence is not scored. In order to determine whether the EST sequence contains an authentic signal sequence, O 10 the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven C- sensors (evaluation parameters) known to be associated with secretion signals. Use of this algorithm resulted in the identification of numerous polypeptide-encoding nucleic acid sequences.
EXAMPLE 4: Isolation of cDNA clones Encoding Human PROI484 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA39616. Based on the DNA39616 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1484.
PCR primers (forward and reverse) were synthesized: forward PCR primer (39616.fl) 5'-GCAACAATGGAGCCACTGGTCATG-3' (SEQ ID reverse PCR primer (39616.rl 5'-GCAAAGGTGGAGAAGCGTTGOTGG-3' (SEQ ID NO:6) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39616 sequence which had the following nucleotide sequence hybridization probe (39616.pl) 5'-CCCACTrCAGCAATCAGAACAGTGGGATTATCTTTCAGCAGTGTTTGAGACC-3' (SEQ ID NO:7) In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above, A positive library was then used to isolate clones encoding the PRO 1484 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PROI484 (designated herein as DNA44686-1653 [Figure I, SEQ ID NO: (UNQ753) and the derived protein sequence for PROI484.
The entire nucleotide sequence of DNA44686-1653 is shown in Figure I (SEQ ID NO:1). Clone DNA44686-1653 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 77-79 and ending at the stop codon at nucleotide positions 815-817 (Figure The predicted polypeptide precursor is 246 amino acids long (Figure The full-length PROI484 protein shown in Figure
O
S2 has an estimated molecular weight of about 26,994 daltons and a pi of about 6.43. Analysis of the full-length PRO1484 sequence shown in Figure 2 (SEQ ID NO:2)evidences the presence of the following: a signal peptide from about amino acid I to about amino acid 22, a Clq domain signature sequence from about anino acid 137 to about amino acid 167 and various amino acid sequence blocks having homology to CIq domain-containing eC proteins as shown in Figure 2. Clone DNA44686-1653 has been deposited with ATCC on January 12, 1999 and is assigned ATCC deposit no. 203581.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 2 (SEQ ID NO:2), evidenced significant homology between the PR01484 amino acid sequence and the following Dayhoff sequences: P_W09108, CA1A HUMAN, CIQC_HUMAN, HUMC1QB21, COLE LEPMA, MMU32107_1, CAS4_EPHMU, S 10 A57131, A41207 and CERL RAT.
EXAMPLE 5: Isolation of cDNA clones Encoding Human PR04334 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. -This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymolov 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56421.
In light of an observed sequence homology between the DNA5642 sequence and an EST sequence contained within the Incyte EST clone no. 3347532, ite Incyte clone was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 3 and is herein designated as DNA59608-2577.
The full length clone shown in Figure 3 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 83-85 and ending at the stop codon found at nucleotide positions 1404-1406 (Figure 3; SEQ ID NO:8). The predicted polypeptide precursor (Figure 4, SEQ ID NO:9) is 440 amino acids long. PR04334 has a calculated molecular weight of approximately 50,211 daltons and an estimated pi of approximately 8.29.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 4 (SEQ ID NO:9), revealed homology between the PR04334 amino acid sequence and the following Dayhoff sequences incorporated herein: AB020686 1, PCI HUMAN, P R79148. PCIMOUSE,RNU78788_1, RATPDIB_1, PW75859, AC005587_1, PR86595 and PPDI BOVIN.
Clone DNA59608-2577 was deposited with the ATCC on March 23, 1999 and is assigned ATCC deposit no. 203870.
O
O EXAMPE 6: Isolation of cDNA clones Encoding Human PROI 122 Anexpressed sequence tag (EST) DNA database (LIFESEQ T M Incyte Pharmaceuticals, Palo Alto, CA) Swas searched and an EST was identified. The EST was Incyte 1347523 which is also called herein DNA49665.
Based on the DNA49665 sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library Sthat contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1122.
PCR primers (forward and reverse) were synthesized: n forward PCR primer 5'-ATCCACAGAAGCTGGCCTTCGCCG-3' (SEQ ID NO:12); and n reverse PCR primer 5'-GGGACGTGGATGAACTCGGTGTGG-3' (SEQ ID NO:13).
0 Additionally, a synthetic oligonucleotide hybridization probe was constructed which had the following nucleotide sequence: O -hybridization probe S'-TATCCACAGAAGCTGGCCTTCGCCGAGTGCCTGTGCAGAG-3' (SEQ ID NO:14) In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PRO1122 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB228).
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO1122 [herein designated as DNA62377-1381] (SEQ ID NO:10) and the derived protein sequence for PRO 1122.
The entire nucleotide sequence of DNA62377-1381 is shown in Figure 5 (SEQ ID NO:10). Clone DNA62377-1381 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 50-52 and ending at the stop codon at nucleotide positions 641-643 of SEQ ID NO: 10 (Figure The predicted polypeptide precursor is 197 amino acids long (Figure The full-length PRO1122 protein shown in Figure 6 has an estimated molecular weight of about 21,765 daltons and a pl of about 8.53. Clone DNA62377-1381 has been deposited with ATCC on December 22, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing techniques.
Analysis of the amino acid sequence of the full-length PROl 122 polypeptide suggests that it possesses similarity with CTLA-8 and IL-17, thereby indicating that PRO1122 may be a novel cytokine. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology between the PROI 122 amino acid sequence and the following Dayhoff sequences, P-W13651, VGI3_HSVSA and CEF25DIl1.
EXAMPLE 7: Isolation of cDNA clones Encoding Human PRO 1889 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA49310. Based up an observed homology between the DNA49310 consensus sequence and an EST contained within the Incyte EST clone no.
O
0 2779436, Incyte EST clone no. 2779436 was purchased and its insert obtained and sequenced. The sequence Sof that insert is shown in Figure 7 and is herein designated DNA77623-2524.
The entire nucleotide sequence of DNA77623-2524 is shown in Figure 7 (SEQ ID NO: 15). Clone DNA77623-2524 contains a single open reading frame with an apparent translational initiation site at nucleotide Cn positions 39-41 and ending at the stop codon at nucleotide positions 330-332 (Figure The predicted polypeptide precursor is 97 amino acids long (Figure The full-length PR01889 protein shown in Figure 8 has an estimated molecular weight of about 10,160 daltons and a pl of about 6.56. Analysis of the full-length 17 PRO1889 sequence shown in Figure 8 (SEQ ID NO:16) evidences the presence of the following: a signal peptide o from about amino acid 1 to about amino acid 20, potential N-myristolation sites from about amino acid 6 to about C amino acid 11 and from about amino acid 33 to about amino acid 38 and prokaryotic membrane tipoprotein lipid S 10 attachment sites from about amino acid 24 to about amino acid 34 and from about amino acid 78 to about amino acid 88. Clone DNA77623-2524 has been deposited with ATCC on December 22, 1998 and is assigned ATCC deposit no. 203546.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 8 (SEQ ID NO:16), evidenced significant homology between the PRO1889 amino acid sequence and the following Dayhoff sequences: HSE48ATGN_ 1, P_W06292, AB012293_1, THYBMOUSE,P R70984, CHKSCA2A_, PW61628, 148639, BMBUNGKP4 and UPAR HUMAN.
EXAMPLE 8: Isolation of cDNA clones Encoding Human PRO1890 A consensus DNA sequence was assembled relative to other EST sequences using repeated cycles of BLAST and phrap as described in Example 1 above. This consensus sequence is herein designated DNA52162.
Based on the DNA52162 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1890.
PCR primers (forward and reverse) were synthesized: forward PCR primer (52162.fl) 5'-CACCAACCAACTGCCAATCCTGGC-3' (SEQ ID NO:19) reverse PCRprimer (52162.r1 5'-ACCACATTCTGATGGGTGTCTCCTGG-3' (SEQ ID Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA52162 sequence which had the following nucleotide sequence hybridization probe (52162.pll 5'-GGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCAC-3' (SEQ ID NO:21) RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO1890 ldesignated herein as DNA79230-2525 [Figure 9, SEQ ID NO: 17]; (UNQ872) and the derived protein sequence for PR01890.
The entire nucleotide sequence of DNA79230-2525 is shown in Figure 9 (SEQ ID NO: 17). Clone DNA79230-2525 contains a single open reading frame with an apparent translational initiation site at nucleotide
O
0 positions 378-380 and ending at the stop codon at nucleotide positions 1197-1199 (Figure The predicted 0. polypeplide precursor is 273 amino acids long (Figure 10). The full-length PRO 890 protein shown in Figure has an estimated molecular weight of about 30,431 daltons and a pi of about 6.79. Analysis of the full-length PR01890 sequence shown in Figure 10 (SEQ ID NO:18) evidences the presence of the following: a signal peptide from about amino acid I to about amino acid 21, a transmembrane domain from about amino acid 214 to about amino acid 235, potential N-glycosylation sites from about amino acid 86 to about amino acid 89 and from about amino acid 255 to about amino acid 258, a cAMP- and cGMP-dependent protein kinase phosphorylation site from about amino acid 266 to about amino acid 269 and potential N-myristolation sites from Sabout amino acid 27 to about amino acid 32, from about amino acid 66 to about amino acid 71, from about 0 amino acid 91 to about amino acid 96. from about amino acid 93 to about amino acid 98, from about amino acid 102 to about amino acid 107, from about amino acid 109 to about amino acid 114, from about amino acid 140 O to about amino acid 145 and from about amino acid 212 to about amino acid 217. Clone DNA79230-2525 has been deposited with ATCC on December 22, 1998 and is assigned ATCC deposit no. 203549.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 10 (SEQ ID NO:18), evidenced significant homology between the PROl890 amino acid sequence and the following Dayhoff sequences: AF093673_1, P W44118, AB014609_1, AC005254_1, AF026547_1, LEC2 MEGRO, PGCV_HUMAN, GEN12667, P R06331 and CELF52E1 9.
EXAMPLE 9: Isolation of cDNA clones Encoding Human PRO1887 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA43041". Based on this consensus sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1887.
PCR primers (forward and reverse) were synthesized: forward PCR primer: 5'-GCAAAGCTCTGCCTCCTTGGCC-3' (SEQ ID NO:24); and reverse PCR primers: 5'-GGGTGGACTGTGCTCTAATGGACGC-3' (SEQ ID NO:25), and 5'-CGTGGCACTGGGTTGATC-3' (SEQ ID NO:26).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA43041 sequence which had the following nucleotide sequence: hybridization probe: 5'-GATGCAGTTCTGGTCAGAGACGCTCCCCAGCAAGATACAACAGTG-3'
(SEQ
ID NO:27).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PROI88 7 gene using the probe oligonucleotide and one o 'he PCR prirr"-s. RNA for construction of the cDNA libraries was isolated from human bone marrow.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01887, designated herein as "DNA79862-2522' (Figure 11; SEQ ID NO:22), and the derived protein
O
O
cI sequence for PRO1887.
b DNA79862-2522 is shown in Figure I (SEQ ID NO:22). Clone DNA79862-2522 contains a single Sopen reading frame with an apparent translational initiation site at nucleotide positions 6-8, and an apparent stop codon at nucleotide positions 1719-1721. The predicted polypeptide precursor is 571 amino acids long. The c full-length PR01887 protein shown in Figure 12 has an estimated molecular weight of about 62,282 daltons and a pl of about 5.56. Additional features of the PRO1887 protein include a signal peptide at about amino acids 1-27; a transmembrane domain at about amino acids 226-245; a potential N-glyzosylation site at about amino acids 105-108; N-myristoylation sites at about amino acids 10-15, 49-54, 62-67, 86-91, 150-155, 155-160, o 162-167, 217-222, 227-232, 228-231, 232-237.262-267, 257-362, and 461-466; a prolaryotic membrane c lipoprotein lipid attachment site at about amino acids 12-22; and a carboxylesterases type-B serine active site at 0 10 about amino acids 216-231.
0 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 12 (SEQ ID NO:23), revealed significant homology between the PRO 1887 amino acid sequence and Dayhoff sequence ESTM_MOUSE. Homology was also found between the PR01887 amino acid sequence and the following additional Dayhoff sequences: D50579 1, 161085, EST1 HUMAN, GENI2405, P W39078, GEN13248, P_R58980, A31800 1, and P R45189.
Clone DNA79862-2522 was deposited with the ATCC on December 22, 1998, and is assigned ATCC deposit no. 203550.
EXAMPLE 10: Isolation of cDNA clones Encoding Human PRO1785 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA35718". Based on the DNA35718 consensus sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1785.
PCR primers (forward and reverse) were synthesized: forward PCR primer: 5'-ATCCTCCAACATGGAGCCTCTTGC-3' (SEQ ID NO:30); forward PCR primer: 5'-GTATCTTGTCAACCCTGAGG-3' (SEQ ID NO:31); and reverse PCR primer: 5'-TAACCAGAGCTGCTATGTCAGGCC-3' (SEQ ID NO:32); Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35718 sequence which had the following nucleotide sequence: hybridization probe: 5'-AGGCAAAGTfTCACTAGTTGTAAACGTGGCCAGTGACTGCCAACTCACAG-3' (SEQ ID NO:33).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PRO 1785 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human aortic endothelial cells.
O
o DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01785 (designated herein as DNA80136-2503 [Figure 13, SEQ ID NO:28]; and the derived proteinsequence Sfor PR01785.
The entire coding sequence of PRO 1785 is shown in Figure 13 (SEQ ID NO:28). Clone DNA80136- 2503 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 2-4 and an apparent stop codon at nucleotide positions 629-631 of SEQ ID NO:28. The predicted polypeptide precursor is 209 amino acids long. There is a signal peptide at about amino acids 1-31, a transmembrane domain at about amino acids 18-37 and a glutathione peroxidase signature at about amino acids 104-111 of SEQ ID SNO:29. Clone DNA80136-2503 has been deposited with the ATCC and is assigned ATCC deposit no. 203541.
Ci The full-length PRO 785 protein shown in Figure 14 has an estimated molecular weight of about 23,909 daltons and a pl of about 9.68.
SAn analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 14 (SEQ ID NO:29), revealed sequence identity between the PRO 1785 amino acid sequence and the following Dayhoff sequences: GSHC SCHMA, P_R44988, AB0123951, GSHH HUMAN, AC004151_3, BTUEECOLI, GSHC_HUMAN, P_R89910, PWU88907 1 and D37916 1.
EXAMPLE Is: slation of cDNA clones Encoding Human PR04353 A consensus DNA sequence was assembled relative to other EST sequences using repeated cycles of BLAST and phrap as described in Example 1 above. This consensus sequence is designated herein "DNA39482". Based on the DNA39482 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR04353.
PCR primers (forward and reverse) were synthesized: forward PCR primer: 5'-GAGGACCTACCGGCCGGACAG-3' (SEQ ID NO:36) and reverse PCR primer: 5'-ATACACCCCGAGTACTGCTGGCAG-3' (SEQ ID NO:37).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39482 sequence which had the following nucleotide sequence: hybridization probe: 5'-AGACAGGGCAGCGGCTGCTGAGCTTGGAGCTGGACGCAGCTT-3' (SEQ ID NO:38).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR04353 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human aortic endothelial cells.
DNA sequencing of the clcnes isolated as described above gave the full-length DNA sequence for PRO4353 (designated herein as DNA80145-2594 [Figure 15, SEQ ID NO:34]: and the derived protein sequence for PR04353.
O
o The entire coding sequence of PR04353 is shown in Figure 15 (SEQ ID NO:34). Clone DNA80145- S2594 contains a single open reading frame with an apparent translational initiation site at nucleotide positions S19-21, and an apparent stop codon at nucleotide positions 2683-2685. The predicted polypeptide precursor is 888 amino acids long. Clone DNA80145-2594 has been deposited with ATCC and is assigned ATCC deposit Cno. 204-PTA. The full-length PR04353 protein shown in Figure 16 has an estimated molecular weight of about 95,285 daltons and a pi of about 8.89.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), uing a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 16 (SEQ ID NO:34), revealed homology between Sthe PRO4353 amino acid sequence and the following Dayhoff sequences (sequences and related text incorporated Ci herein): PW19857, AB000776_1, P_W57260, JH0798, P_R71382, CEY54E5B_I, 148747, MUSCI 1, S 10 P_R71383 and P_W63748.
0 EXAMPLE 12: Isolation of cDNA clones Encoding Human PR04357 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA80155". Based on the DNA80155 consensus sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR04357.
PCR primers (forward and reverse) were synthesized: forward PCR primer: 5'-GAAGGTGGAAATTAAATTCCAAGGC-3' (SEQ ID NO:41) and reverse PCR primer: 5'-CGATAAGCTGCTACAGTGCCATCG-3' (SEQ ID NO:42).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA80155 sequence which had the following nucleotide sequence: hybridization probe: 5'-GTGACTGTCCTCTGCAAGATAGTGCAGCCTGGCTACGGGA-3' (SEQ ID NO:43).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PRO4357 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human aortic endothelial cells.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR04357; and the derived protein sequence for PR04357.
The entire coding sequence of PR04357 is shown in Figure 17 (SEQ ID NO:39). Clone DNA84917- 2597 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 286-288, and an apparent stop codon at nucleotide positions 1792-1794. The predicted polypeptide precursor is 502 amino acids long. Clone DNA84917-2597 has been deposited with ATCC and is assigned ATCC deposit no. 203863. The full-length PRO4357 protein shown in Figure 18 has an estimated molecular weight of about 58,043 daltons and a pl of about 7.94.
O
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence )alignment analysis of the full-length sequence shown in Figure 18 (SEQ ID NO:40), revealed homology between the PR04357 amino acid sequence and the following Dayhoff sequences: P_W48804, AF003534 66, ATAC00466519, LPSA_BACNO, GELA DICDI, EHU70560_1, AF089841_1, ABP2_HMAN, PW19349 Sand A49551.
EXAMPLE 13: Isolation ofcDNA clones Encoding Human PRO4405 A consensus DNA sequence was assembled relative to other EST sequences using repeated cycles of O BLAST and phrap. This consensus sequence is designated herein "DNA80170". Based on the DNA80170 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained O 10 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 0 (i PR04405.
PCR primers (forward and reverse) were synthesized: forward PCR primer: 5'-CGGGACTTTCGCTACCTGTTGC-3' (SEQ ID NO:46) and reverse PCR primer: 5'-CATCATATTCCACAAAATGCTTTGGG-3' (SEQ ID NO:47).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus sequence which had the following nucleotide sequence: hybridization probe: 5'-CCTTCGGGGATTCTCCCGGCTCCCGTTCGTTCCTCTG-3' (SEQ ID NO:48).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR04405 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR04405 (designated hereinas DNA84920-2614 [Figure 19, SEQ ID NO:44]; and the derived protein sequence for PRO4405.
The entire coding sequence of PR04405 is shown in Figure 19 (SEQ ID NO:44). Clone DNA84920- 2614 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 79-81, and an apparent stop codon at nucleotide positions 1009-1011. The predicted polypeptide precursor is 310 amino acids long. Clone DNA84920-2614 has been deposited with ATCC and is assigned ATCC deposit no. 203966. The full-length PR04405 protein shown in Figure 20 has an estimated molecular weight of about 33.875 daltons and a pl of about 7.08.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 20 (SEQ ID NO:45), revealed homology between the PR04405 amino acid sequence and the following Dayhoff sequences (sequences and related text incorporated herein): YA93_SCHPO, S62432, YJG2 YEAST, AC0044723, AB004539_7, S64782, CELC27AI2_8, AF109219 1, AF086791 10, and PW75859.
O
0 EXAMPLE 14: Isolation of cDNA clones Encoding Human PR04356 )A consensus DNA sequence was assembled relative to other EST sequences using phrap.asdescribed Sin Example 1 above. This consensus sequence is designated herein "DNA80200". Based upon an observed homology between the DNA80200 consensus sequence and an EST sequence contained within Merck EST clone S248287, Merck EST clone 248287 was purchased and its insert obtained and sequenced, thereby providing DNA86576-2595.
The entire coding sequence of PR04356 is shown in Figure 21'(SEQ ID NO:49). Clone DNA86576- 2595 contains a single open reading frame with an apparent translational initiation site at nucleotide positions o 55-57, and an apparent stop codon at nucleotide positions 808-810. The predicted polypeptide precursor is 251 i amino acids long. Clone DNA86576-2595 has been deposited with ATCC and is assigned ATCC deposit no.
O 10 203868. The full-length PR04356 protein shown in Figure 22 has an estimated molecular weight of about 26,935 daltons and a pi of about 7.42.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 22 (SEQ ID NO:50), revealed homology between the PR04356 amino acid sequence and the following Dayhoff sequences incorporated herein: RNMAGPIAN_ 1, UPAR BOVIN, 542152, AF007789_1, UPAR_RAT, UPAR_MOUSE, P_W31165, P_W31168, P_R44423 and P W26359.
EXAMPLE 15: Isolation of cDNA clones Encoding Human PR04352 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA83397". Based on the DNA83397 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR04352.
PCR primers (forward and reverse) were synthesized: forward PCR primer: 5'-CTGGGGAGTGTCCTTGGCAGGTTC-3' (SEQ ID NO:53) and reverse PCR primer: 5'-CAGCATACAGGGCTCTTTAGGGCACAC-3' (SEQ ID NO:54).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA83397 sequence which had the following nucleotide sequence: hybridization probe: 5'-CGGTGACTGAGGAAACAGAGAAAGGATCCTTTGTGGTCAATCTGGC-3' (SEQ ID In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PRO4352 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal brain.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR04352 (designated herein as DNA87976-2593 [Figure 23, SEQ ID NO:511; and the derived protein sequence for PR04352.
O
O The entire coding sequence of PR04352 is shown in Figure 23 (SEQ ID NO:51). Clone DNA87976- 2593 contains a single open reading frame with an apparent translational initiation site at nucleotide positions S179-181, and an apparent stop codon at nucleotide positions 2579-2581 of SEQ ID NO:51. The predicted polypeptide precursor is 800 amino acids long. Clone DNA87976-2593 has been deposited with ATCC and is assigned ATCC deposit no. 203888. The full-length PR04352 protein shown in Figure 24 has an estimated molecular weight of about 87,621 daltons and a pi of about 4.77.
An analysis of thi: Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 24 (SEQ ID NO:52), revealed homology between the PR04352 amino acid sequence and the following Dayhoff sequences: P_R86865, P_R86866, RATPCDH 1, AB0111601, MMU88549 1, D86917_1, AB008179_ P_R58907, HSHFATPRO 1. and AF031572 I.
c to 0 EXAMPLE 16: Isolation of cDNA clones Encoding Human PR04380 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enymologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases orgreater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA79132. In light of DNA79132, DNA92234-2602 was identified.
The full length clone shown in Figure 25 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 201-203 and ending at the stop codon found at nucleotide positions 1722-1724 (Figure 25; SEQ ID NO:56). The predicted polypeptide precursor (Figure 26, SEQ ID NO:57) is 507 amino acids long. PRO4380 has a calculated molecular weight of approximately 56,692 daltons and an estimated pl of approximately 5.22.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequenceshown in Figure 26 (SEQ ID NO:57), revealed homology between the PR04380 amino acid sequence and the following Dayhoff sequences (sequences and related text incorporated herein): CERIIH6 1, S56299, D89150_1, G70870, S43914, LM034616_5, LLU78036_1, AF055904_2, P W79066 and ARGE ECOLI.
Clone DNA92234-2602 was deposited with the ATCC and is assigned ATCC deposit no. 203948.
EXAMPLE 17: Isolation of cDNA clones Encoding Human PR04354 Use of the signal sequence algorithm described in Example 3 above all,'ved identification of an EST cluster (92909) sequence designated herein as DNA 10195. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing
O
O homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The Sconsensus sequence obtained therefrom is herein designated as DNA56063. In light of DNA56063, DNA92256- 2596 was identified.
The full length clone shown in Figure 27 contained a single open reading frame with an apparent
S
translational initiation site at nucleotide positions 108-110 and ending at the stop codon found at nucleotide Spositions 852-854 (Figure 27; SEQ ID NO:58). The predicted polypeptide precursor (Figure 28, SEQ ID NO:59) is 248 amino acids long. PR04354 has a calculated molecular weight of approximately 28,310 daltons and an estimated pl of approximately 4.63.
O An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 28 (SEQ ID NO:59), revealed homology between the PR04354 amino acid sequence and the following Dayhoff sequences incorporated herein: HGS_RF300, CEVK04GII_2, CECIIHI_7, HSU80744_, CEF09E8 2, RNAJ2967_1, DDICOiI, AB020648_1, P W33887 and A64319.
Clone DNA92256-2596 was deposited with the ATCC on March 30, 1999 and is assigned ATCC deposit no.203891.
EXAMPLE 18: Isolation of cDNA clones Encoding Human PR04408 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmolo 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA79298. In light of DNA79298, DNA92274-2617 was identified and sequenced in full.
The full length clone shown in Figure 29 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 89-91 and ending at the stop codon found at nucleotide positions 758-760 (Figure 29; SEQ ID NO:60). The predicted polypeptide precursor (Figure 30, SEQ ID NO:61) is 223 amino acids long. PRO4408 has a calculated molecular weight of approximately 25,402 daltons and an estimated Ip of apprc.:;mately 8.14.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 30 (SEQ ID NO:61). revealed homology between the PR04408 amino acid sequence and the following Dayhoff sequences: P R27897, P_R49942, PBP RAT,
O
0 CELF40A3 3, DIONCVO, PC4214, OV16 ONCVO, P_R27718, GEN10789, and Clone DNA92274-2617 was deposited with the ATCC and is assigned ATCC deposit no. 203971.
EXAMPLE 19: Isolation of cDNA clones Encoding Human PRQ5737 c An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) was searched with a human interleukin-1 receptor antagonist (hlL-IRa) sequence, and an EST sequence, designated herein as 1433156 was identified, which showed homology with the hIL-IRa known protein. EST clone 1433156 was purchased from Incyte Pharmaceuticals (Palo Alto, CA) and the cDNA insert was obtained o and sequenced in its entirety, giving the DNA92929-2534 sequence..
C The entire nucleotide sequence of DNA92929-2534 is shown in Figure 31 (SEQ ID NO:62). Clone O 10 DNA92929-2534 contains a single open reading frame with an apparent translational initiation site at nucleotide ,i positions 96-98 and a stop codon at nucleotide positions 498-500 (Figure 31; SEQ ID NO:62). The predicted polypeptide precursor (hIL-1 Ra2) is 134 amino acids long. The putative signal sequence extends from amino acid positions 1-17. Clone DNA92929-2534 was deposited with ATCC and was assigned ATCC deposit no. 203586.
The full-length hlL-1ra2 protein shown in Figure 32 has an estimated molecular weight of about 14,927 daltons and a pi of about 4.8.
Based on a BLAST and FastA sequence alignment analysis (using the ALIGN-2 computer program) of the full-length sequence, hlL-lRa2 (Figure 32, SEQ ID NO:63) shows significant amino acid sequence identity to hIL-lRap protein. hIL-lRa2 is believed to be a splice variant of hIL- lRp.
EXAMPLE 20: Isolation of cDNA clones Encoding Human PRO4425 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary EST DNA database (LIFESEQ', Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymology 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA81099.
In light of an observed sequence homology between the DNA81099 sequence and an EST sequence contained within the EST clone no. AA448744, the EST clone AA448744 was purchased from Merck and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 33 and is herein designated as DNA93011-2637.
The full length clone shown in Figure 33 contained a single open reading frame with an -pparent translational initiation siteat nucleotide positions 27-29 and ending at the stop codon found at nucleotide positions 435-437 (Figure 33; SEQ ID NO:64). The predicted polypeptide precursor (Figure 34, SEQ ID NO:65) is 136 amino acids long. PRO4425 has a calculated molecular weight of approximately 15,577 daltons and an estimated
O
Spl of approximately 8.88.
0. An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 34 (SEQ ID NO:65), revealed homology between the PR04425 amino acid sequence and the following Dayhoff sequences: HGS_RE295, S44655, YOJ8_CAEEL, C VBRI CLVK, P R39520, P_R65332, P_R39388, TGL4_HUMAN, YKAB_CAEEL, and S71105.
Clone DNA93011-2637 was deposited with the ATCC and is assigned ATCC deposit no. EXAMPLE 21: Isolation of cDNA clones Encoding Human PRO5990 t A consensus DNA sequence vas assembled relative to other EST sequences using phrap as described Cl1 in Example I above. This consensus sequence is herein designated DNA86602. Based on the DNA86602 S 10 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained o the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR05990.
PCR primers (forward and reverse) were synthesized: forward PCR primer: 5'-CGTCACAGGAACTTCAGCACCC-3' (SEQ ID NO:68) reverse PCR primer: 5'-GTCTTGGCTTCCTCCAGGTTTGG-3' (SEQ ID NO:69) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA86602 sequence which had the following nucleotide sequence hybridization probe: 5'-GGACAGCGCTCCCCTCTACCTGGAGACTTGACTCCCGC-3 (SEQ ID RNA for construction of the cDNA libraries was isolated from human fetal brain tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for a fulllength PR05990 polypeptide (designated herein as DNA96042-2682 [Figure 35, SEQ ID NO:66]) and the derived protein sequence for that PR05990 polypeptide.
The full length clone identified above contained a single open reading frame with an apparent translational initiation site at nucleotide positions 265-267 and a stop signal at nucleotide positions 1669-1671 (Figure 35, SEQ ID NO:66). The predicted polypeptide precursor is 468 amino acids long, has a calculated molecular weight of approximately 53,005 daltons and an estimated pl of approximately 4.98. Analysis of the full-length PR05990 sequence shown in Figure 36 (SEQ ID NO:67) evidences the presence of a variety of important polypeptide domains as shown in Figure 36, wherein the locations given for those important polypeptide domains are approximate as described above. Clone DNA96042-2682 has been deposited with ATCC on July 20, 1999 and is assigned ATCC deposit no. 382-PTA.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using the ALIGN-2 sequence alignment analysis of the full-length sequence shown in Figure 36 (SEQ ID NO:67), evidenced sequence identity between the PRO990 amino arid sequence and the following Dayhoff sequences: SG3_MOUSE; SG3_RAT; GENI4673; ENHMHCAXI; MYS2_DICDI; NFU431921; US01_YEAST; A56577; PFLSA13_1; CELF12F3 3.
O
O EXAMPLE 22: Isolation of cDNA clones Encoding Human PRO6030 SUse of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ® (Incyte Pharmaceuticals, Palo Alto, CA) database, designated herein as CLU20900. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) Cdatabases which included public EST databases Genbank) and a proprietary EST DNA database (L[FESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymology 266:460- 480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not Sencode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" C(Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is o 10 herein designated DNA81229.
O In light of an observed sequence homology between the DNA81229 sequence and an EST sequence encompassed within clone no. 4020130HI from the Incyte (Incyte Pharmaceuticals, Palo Alto, CA) database, clone no.4020130111 was purchased and the cDNA insert was obtained and sequenced, It was found herein that that cDNA insert encoded a full-length protein. The sequence of this cDNA insert is shown in Figure 37 and is herein designated as DNA96850-2705.
Clone DNA96850-2705 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 60-62 and ending at the stop codon at nucleotide positions 1026-1028 (Figure 37).
The predicted polypeptide precursor is 322 amino acids long (Figure 38). The full-length PR06030 protein shown in Figure 38 has an estimated molecular weight of about 34,793 daltons and a pl of about 6.34. Analysis of the full-length PR06030 sequence shown in Figure 38 (SEQ ID NO:72) evidences the presence of a variety of important polypeptide domains as shown in Figure 38, wherein the locations given for those important polypeptide domains are approximate as described above. Clone DNA96850-2705 has been deposited with ATCC on August 3, 1999 and is assigned ATCC Deposit No. 479-PTA.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using the ALIGN-2 sequence alignment analysis of the full-length sequence shown in Figure 38 (SEQ ID NO:72), evidenced sequence identity between the PR06030 amino acid sequence and the following Dayhoff sequences: AF059571 1; 138346; AF0358351; P_W83138; P R54714; P R65166; P_P93995; BGPI HUMAN; P_W06873; A43165_1.
EXAMPLE 23: Isolation of cDNA clones Encoding Human PR04424 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmolog 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The extended consensus
O
o sequence obtained therefrom is designated DNA80820. In light of DNA80820, DNA96857-2636 was identified and sequenced.
SThe full length clone shown in Figure 39 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 52-54 and ending at the stop codon found at nucleotide positions n 715-717 (Figure 39; SEQ ID NO:73). The predicted polypeptide precursor (Figure 40, SEQ ID NO:74) is 221 amino acids long, PRO4424 has acalculated molecular weight of approximately 23,598 daltons and an estimated pl ot approximately 6.96.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence Q alignment analysisof the full-length sequence shown in Figure 40 (SEQ ID NO:74). revealed homology between C, the PRO4424 amino acid sequence and the following Dayhoff sequences: HGS_A135, JC5105, PR88555, JC5106. P_R88556, CELRI2E2_13, DMC34F3 8, ATG13D4_7, HGS_A204, 558331.
o Clone DNA96857-2636 was deposited with the ATCC and is assigned ATCC deposit no. I7-PTA.
EXAMPLE 24: Isolation of cDNA clones Encoding Human PR04422 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA80134. In light of DNA80134, DNA96867-2620 was identified and sequenced in full.
The full length clone shown in Figure 41 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 318-320 and ending at the stop codon found at nucleotide positions 900-902 (Figure 41; SEQ ID NO:75). The predicted polypeptide precursor (Figure 42, SEQ ID NO:76) is 194 amino acids long. PR04422 has a calculated molecular weight of approximately 21,431 daltons and an estimated pi of approximately 8.57.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 42 (SEQ ID NO:76), revealed homology between the PR04422 amino acid sequence and the following Dayhoff sequences: LYGCHICK, LYG_CYGAT, LYG ANSAN, LYG_STRCA, PW69515, ATAC0036807, ACCA HAEIN, 164065, A70853 and AF074611_71.
Clone DNA96867-2620 was deposited with the ATCC and is assigned ATCC deposit no. 203972.
O
0 EXAMPLE 25: Isolation of cDNA clones Encoding Human PRO4430 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST Scluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary C EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvymolog 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with Sthe program "phrap" (Phil Green, University of Washington, Seattle, Washington). The extended consensus l sequence obtained therefrom is herein designated DNA82380. In light of DNA82380, DNA96878-2626 was identified and sequenced.
SThe full length clone shown in Figure 43 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 56-58 and ending at the stop codon found at nucleotide positions 431-433 (Figure 43; SEQ ID NO:77). The predicted polypeptide precursor (Figure 44, SEQ ID NO:78) is 125 amino acids long. PR04430 has a calculated molecular weight of approximately 13,821 daltons and an estimated pi of approximately 8.6.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 44 (SEQ ID NO:78), revealed homology between the PR04430 amino acid sequence and the following Dayhoff sequences: MMHC213L3_9, A45835, D45835, UPAR_MOUSE, AF043498 I, P_W62066, LY6C MOUSE. LY6A_MOUSE, P_R58710, and P R86315.
Clone DNA96878-2626 was deposited with the ATCC and is assigned ATCC deposit no. 23-PTA.
EXAMPLE 26: Isolation ofcDNA clonesEncoding Human PRO4499 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmolog 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA81155. In light of DNA81155, DNA96889-2641 was identified and sequenced.
The full length clone shown in Figure 45 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 185-187 and ending at the stop codon found at nucleotide positions 1202-1204 (Figure 45; SEQ ID NO:79). The predicted polypeptide precursor (Figure 46, SEQ ID is 339 amino acids long. PR04499 has a calculated molecular weight of approximately 36,975 daltons and an estimated pi of approximately 7.85.
O
0 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence bJ) alignment analysis of the full-length sequence shown in Figure 46 (SEQ ID NO:80), revealed homology between Sthe PR04499 amino acid sequence and the following Dayhoff sequences: CEF38B7_4, D70575, AF073993_1, PNAPA_1, AF098967_1, AF007140_1, ROA3_HUMAN, E70969, CEY53Cl2B 5 and CEY53Cl2B 6.
c Clone DNA96889-2641 was deposited with the ATCC and is assigned ATCC deposit no. 119-PTA.
EXAMPLE 27: Use of PRO as a hybridization probe SThe following method describes use of a nucleotide sequence encoding PRO as a hybridization probe.
o DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as I a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants of PRO) in human 0 10 tissue cDNA libraries or human tissue genomic libraries.
SHybridization and washing of filters containing either library DNAs is performed under the following high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a solution of 50% formamide, 5x SSC, 0.1 SDS, 0.1 sodium pyrophosphate, 50 mM sodium phosphate, pH 6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42"C for 20 hours. Washing of the filters is performed in an aqueous solution of 0. x SSC and 0.1% SDS at 420C.
DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO can then be identified using standard techniques known in the art.
EXAMPLE 28: Expression of PRO in E. coli This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in E. coli.
The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected expression vector. A variety of expression vectors may be employed. An example of a suitable vector is pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for ampicillin and tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an argU gene.
The ligation mixture is then used to transform a selected E. coli strain using the methods described in Sambrook et al., supra. Transformants are identified by their ability to grow on LB plates and antibiotic resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA sequencing.
Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with antibiotics. The overnight culture may subsequrnly be used to inoculate a larger scale culture. The cells are then grown to a desired optical density, during which the expression promoter is turned on.
Cl After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell P pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the protein.
PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme f sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation O column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then yl ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110 o 10 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(laclq). Transformants are first grown in LB containing 50 mg/mn l carbenicillin at 30 0 C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH4),SO,, 0.71 g sodium citrate*2H20. 1.07 g KCI, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgSO,) and grown for approximately 20-30 hours at 30 0 C with shaking. Samples are removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell pellets are frozen until purification and refolding.
E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes in 7 M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final concentrations of 0. M and 0.02 M, respectively, and the solution is stirred overnight at 4 0 C. This step results in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate column buffer (6 M-guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol grade), pH 7.4. The protein is eluted with buffer containinLg 250 mM imidazole. Fractions containing the desired protein are pooled and stored at 4 Protein concentration is estimated by its absorbance at 280 nm using the calculated extinction coefficient based on its amino acid sequence.
The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting of: 20 mM Tris, pH 8.6, 0.3 M NaCI, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA.
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The refolding solution is stirred gently at 4 0 C for 12-36 hours. The refolding reaction is quenched by the addition of TFA to a final concentration of 0.4% (pH of approximately Before further purification of the protein, the solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The refolded protein is chromatographed on a Poros RI/H reversed phase column r-ing a mobile buffer of 0.1% TF-A with clution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled.
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile
O
0c since those species are the most compact with their hydrophobic interiors shielded from interaction with the 0) reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition Sto resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin _from the samples.
SFractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 0.14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins -equilibrated in the formulation buffer and sterile filtered.
o Many of the PRO polypeptides disclosed herein were successfully expressed as described above.
O 10 EXAMPLE 29: Expression of PRO in mammalian cells 0 This example illustrates preparation of a potentially glycosylated form of PRO by recombinant expression in mammalian cells.
The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector.
Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO DNA using ligation methods such as described in Sambrook et al., supra. The resulting vector is called
PRO.
In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and optionally, nutrient components and/or antibiotics. About 10 pg pRK5-PRO DNA is mixed with about 1 pg DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 31:543 (1982)] and dissolved in 500 pi of I mM Tris-HCI, 0.1 mM EDTA, 0.227 M CaCI,. To this mixture is added, dropwise, 500 p1 of 50 mM HEPES (pH 7.35), 280 mM NaCI, 1.5 mM NaPO 4 and a precipitate is allowed to form for 10 minutes at 25°C. The precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37*C. The culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days.
Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture medium (alone) or culture medium containing 200 pCilml "S-cysteine and 200 pCi/ml "S-methionine. After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in serum free medium) and the medium is tested in selected bioassays.
In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate method described by Somparyrac et al., Proc. Natl. Acad. Sci., 12:7575 (1981). 293 cells are grown to maximal dr.city in a spinner flask and 700 pg pRKS-PRO DNA is added. The cells are first concentrated from the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture medium, and re-introduced into the spinner flask containing tissue culture medium, 5 pg/ml bovine insulin and
O
S0.1 g/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove cells and debris. The sample containing expressed PRO can then be concentrated and purified by.any selected Smethod, such as dialysis and/or column chromatography.
In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into C CHO cells using known reagents such as CaPO, or DEAE-dextran. As described above, the cell cultures can be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as "S-methionine. After determining the presence of PRO polypeptide, the culture medium may be replaced with serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium o is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selec,ed i method.
Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a polyhis tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned into a driven vector containing a selection marker such as DHFR for selection of stable clones: Finally, the CHO cells can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described above, to verify expression. The culture medium containing the expressed poly-his tagged PRO can then be concentrated and purified by any selected method, such as by Ni"-chelate affinity chromatography.
PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO cells by another stable expression procedure.
Stable expression in CHO cells is performed using the following procedure. The proteins are expressed as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms extracellular domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 and CH2 domains and/or is a poly-His tagged form.
Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology, Unit 3.16, John Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR expression permits selection for stable maintenance of the plasmid following transfection.
Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells using commercially available transfection reagents Superfect' (Quiagen), Dosper' or Fugene' (Boehringer Mannheim). The cells are grown as described in Lucas et al., supra. Approximately 3 x 10" cells are frozen in an ampule for further growth and production as described below.
The ampules containing the plasmid DNA are thawed by placement into vwter bath and "ixed by vortexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 pm filtered PS20 with 5% 0.2 pm diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL
O
O
c spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 0. filled with 150 mL selective growth medium and incubated at 37 0 C. After another 2-3 days, 250 mL, 500 mL 4 and 2000 mL spinners are seeded with 3 x 10' cells/mL. The cell media is exchanged with fresh media by centrifugation and resuspension in production medium. Although any suitable CHO media may be employed, c a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 3 L production spinner is seeded at 1.2 x 106 cells/mL. On day 0, the cell number pH ie determined. On day 1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the temperature shifted to 33 0 C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam O polydimethylsiloxane emulsion, Dow Coming 365 Medical Grade Emulsion) taken. Throughout the production, the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below O 10 70%, the cell culture is harvested by centrifugation and filtering through a 0.22 pm filter. The filtrate was either 0 stored at 4"C or immediately loaded onto columns for purification.
For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4'C. After loading, the column is washed with additional equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCI and 4% mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80 0
C.
Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting I mi fractions into tubes containing 275 pL of 1 M Tris buffer, pH 9. The highly purified protein is subsequently desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation.
Many of the PRO polypeptides disclosed herein were successfully expressed as deatribed above.
EXAMPLE 30: Expression of PRO in Yeast The following method describes recombinant expression of PRO in yeast.
First, yeast expression vectors are constructed for intracellular production or secretion of PRO from the ADH2/GAPDH promoter. DNA encoding PRO and the promoter is inserted into suitable restriction enzyme sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO signal peptide or nther mamma!izn signal peptide, or, for example, a yeast alpha-factor or invertase secretory signal/leader sequence, and linker sequences (if needed) for expression of PRO.
Yeast cells, such as yeast strain AB110, can then be transformed with the expression plasmids described above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by
O
0c" precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with Coomassie Blue stain.
SRecombinant PRO can subsequently be isolated and purified by removing the yeast cells from the _fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The
C
concentrate containing PRO may further be purified using selected column chromatography resins.
Many of the PRO polypeptides disclosed herein were successfully expressed as described above.
1 EXAMPLE 31: Expression of PRO in Baculovirus-lnfected Insect Cells o The following method describes recombinant expression of PRO in Baculovirus-infected insect cells.
The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus O 10 expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG).
c".i A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of PRO such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product is then digested with those selected restriction enzymes and subcloned into the expression vector.
Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGoldT virus DNA (Pharmingen) into Spodopterafrugiperda cells (ATCC CRL 1711) using lipofectin (commercially available from GIBCO-BRL). After 4 5 days of incubation at 28 0 C, the released "iruses are harvested and used for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et al., Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford University Press (1994).
Expressed poly-his tagged PRO can then be purified, for example, by Ni 7 "-chelate affinity chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by Rupert et al., Nature, 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer mL Hepes, pH 7.9; 12.5 mM MgCl,; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KCI), and sonicated twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted in loading buffer (50 mM phosphate, 300 mM NaCI, 10% glycerol, pH 7.8) and filtered through a 0.45 pm filter. A Ni"-NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is loaded onto the column at 0.5 mL per minute. The column is washed to baseline A, with loading buffer, at which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM phosphate; 300 mM NaCI, 10% glycerol, pH which elutes nonspecifically bound protein. After reaching
A
2 baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash buffer. One mL fractions are collectcd and analyzed by SDS-PAGE and silver staining or Western blot with NiZ'-NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His,-iagged PRO are pooled and dialyzed against loading buffer.
O
O
cI Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 0.0 chromatography techniques, including for instance, Protein A or protein G column chromatography.
Many of the PRO polypeptides disclosed herein were successfully expressed as described above.
EXAMPLE 32: Preparation of Antibodies that Bind PRO This example illustrates preparation of monoclonal antibodies which can specifically bind PRO.
Techniques for producing the monoclonal antibodies are known in tl:e art and are described, for Sinstance, in Coding, supra. Immunogens that may be employed include purified PRO, fusion proteins containing O PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by the skilled artisan without undue experimentation.
0 10 Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's Nc adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be boosted with additional immunization injections. Serum samples may be periodically obtained from the mice by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies.
After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma cell line such as P3X63AgU.I, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids.
The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of 'positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the an.
The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, affinity chromatography based upon binding of antibody to protein A or protein G can be employed.
EXAMPLE 33: Purification of PRO Polvpeptides Using Specific Antibodies Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the art of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide is purified by immunoaffinity chromatography using antibodies specific for the PRO polypeptide of interest. In general, an immunoaffinity column is constructed by covalently coupling the anti-PRO polypeptide antibody to an activated chromatographic resin.
O
Cl Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium jsulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, 2 Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a chromatographic resin such as CnBr-activated SEPHAROSETM (Pharmacia LKB Biotechnology). The antibody is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's Sinstructions.
Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction O from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the l) whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by S 10 other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may be secreted in useful quantity into the medium in which the cells are grown.
A soluble PRO polypeptide-containing preparation is passed over the immunoaffinity column, and the column is washed under conditions that allow the preferential absorbance of PRO polypeptide high ionic strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt antibody/PRO polypeptide binding a low pH buffer such as approximately pH 2-3, or a high concentration of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected.
EXAMPLE 34: Drug Screening This invention is particularly useful for screening compounds by using PRO polypeptides or binding fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be used for standard binding assays. One may measure, for example, the formation of complexes between PRO polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested.
Thus, the present invention provides methods of screening for drugs or any other agents which can affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent with an PRO polypeptide or fragment thereof and assaying for the presence of a complex between the agent and the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular agent to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell complex.
Another technique for drug screening provides high throughput screening for compounds having suitable binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984.
O
c Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 00) as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art.
_Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening n techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the solid support.
This invention ilso contemplates the use of competitive drug screening assays in which necitralizing Santibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO o polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any peptide which shares one or more antigenic determinants with PRO polypeptide.
o 0 EXAMPLE 35: Rational Drug Design The goal of rational drug design is to produce structural analogs of biologically active polypeptide of interest a PRO polypeptide) or of small molecules with which they interact, agonists, antagonists, or inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo Hodgson, Bio/Technology, 9: 19-21 (1991)).
In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically.
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins.
In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved activity or stability as shown by Braxton and Wells, Biochemistry. 31:7796-7801 (1992) or which act as inhibitors, agonists, or antagonists of native peptides as shown by Athauda et at., 3. Biochem., J11:742-746 (1993).
It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated peptides would then act as the pharmacore.
By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide amino acid sequence provided herein will provide guidance to those employing computer modeling techniques in place of or in addition to x-ray crystallography.
O
"1 EXAMPLE 36: Chondrocyte Re-differentiation Assay (Assay 110) (3 This assay shows that certain polypeptides of the invention act to induce redifferentiation of chondrocytes, therefore, are expected to be useful for the treatment of various bone and/or cartilage disorders such as, for example, sports injuries and arthritis. The assay is performed as follows. Porcine chondrocytes C are isolated by overnight collagenase digestion of articulary cartilage of metacarpophalangeal joints of 4-6 month old female pigs. The isolated cells are then seeded at 25,000 cells/cm 2 in Ham F-12 containing 10% FBS and 4 pg/ml gentamycin. The culture media is changed every third day and the cells are then seeded in 96 well 17 plates at 5,000 cells/well in 100pl of the same media without serum and 100 pl of the test PRO polypeptide, SnM staurosporin (positive control) or medium alone (negative control) is added to give a final volume of 200 C, l/well. After 5 days of incubation at 37 0 C. a picture of each well is taken and the differentiation state of the chondrucytes is determined. A positive result in the assay occurs when the redifferentiation of the chondrocytes Sis determined to be more similar to the positive control than the negative control.
The following polypeptide tested positive in this assay: PRO1484. PRO1890, PROI887, PR04353, PR04357, PR04405, PR05737 and PR05990.
EXAMPLE 37: Detection of Polypeptides That Affect Glucose or FFA Uptake in Skeletal Muscle (Assay 106) This assay is designed to determine whether PRO polypeptides show the ability to affect glucose or FFA uptake by skeletal muscle cells. PRO polypeptides testing positive in this assay would be expected to be useful for the therapeutic treatment of disorders where either the stimulation or inhibition of glucose uptake by skeletal muscle would be beneficial including, for example, diabetes or hyper- or hypo-insulinemia.
In a 96 well format, PRO polypeptides to be assayed are added to primary rat differentiated skeletal muscle, and allowed to incubate overnight. Then fresh media with the PRO polypeptide and insulin are added to the wells. The sample media is then monitored to determine glucose and FFA uptake by the skeletal muscle cells. The insulin will stimulate glucose and FFA uptake by the skeletal muscle, and insulin in media without the PRO polypeptide is used as a positive control, aid a limit for scoring. As the PRO polypeptide being tested may either stimulate or inhibit glucose and FFA uptake, results are scored as positive in the assay if greater than 1.5 times or less than 0.5 times the insulin control.
The following PRO polypeptides tested positive as either stimulators or inhibitors of glucose and/or FFA uptake in this assay: PR01484, PRO 1122, PRO1889, PR04357 and PR04380.
EXAMPLE 38: Detection of PRO Polvpeptides That Affect Glucose or FFA Uptake by Primary Rat Adipocytes (Assay 94) This assay is designed to determine whether PRO polypeptides show the ability to affect glucose or FFA uptake by adipocyte cells. PRO polypeptides testing positive in this assay would be expected to be useful for the therapeutic treatment of disorders where either the stimulation or inhibition t "glucose uptake by adipocytes would be beneficial including, for example, obesity, diabetes or hyper- or hypo-insulinemia.
In a 96 well format, PRO polypeptides to be assayed are added to primary rat adipocytes, and allowed to incubate overnight. Samples are taken at 4 and 16 hours and assayed for glycerol, glucose and FFA uptake.
O
0. After the 16 hour incubation, insulin is added to the media and allowed to incubate for 4 hours. At this time; a 3 sample is taken and glycerol, glucose and FFA uptake is measured. Media containing insulin without the PRO Spolypeptide is used as a positive reference control. As the PRO polypeptide being tested may either stimulate or inhibit glucose and FFA uptake, results are scored as positive in the assay if greater than 1.5 times or less C than 0.5 times the insulin control.
The following PRO polypeptides tested positive as stimulators of glucose and/or FFA uptake in this assay: PROI890, PR01785 and PR04422.
The following PRO polypeptides tested positive as inhibitors of glucose and/or FFA uptake in this n assay: PR04334, PR04425, PRO4424 and PR04430.
S 10 EXAMPLE 39: Induction of Pancreatic 0-Cell Precursor Differentiation(Assav 89) SThis assay shows that certain polypeptides of the invention act to induce differentiation of pancreatic P-cell precursor cells into mature pancreatic p-cells and, therefore, are useful for treating various insulin deficient states in mammals, including diabetes mellitus. The assay is performed as follows. The assay uses a primary culture of mouse fetal pancreatic cells and the primary readout is an alteration in the expression of markers that represent either p-cell precursors or mature P-cells. Marker expression is measured by real time quantitative PCR (RTQ-PCR); wherein the marker being evaluated is insulin.
The pancreata are dissected from E14 embryos (CDI mice). The pancreata are then digested with collagenaseldispase in F12/DMEM at 37°C for 40 to 60 minutes (collagenase/dispase, 1.37 mg/ml, Boehringer Mannheim, #1097113). The digestion is then neutralized with an equal volume of 5% BSA and the cells are washed once with RPMI11640. At day 1, the cells are seeded into 12-well tissue culture plates (pre-coated with laminin, 20,g/ml in PBS, Boehringer Mannheim, #124317). Cells from pancreata from 1-2 embryos are distributed per well. The culture medium for this primary cuture is 14F/1640. At day 2. the media is removed and the attached cells washed with RPMI/1640. Two mis of minimal media are added in addition to the protein to be tested. At day 4, the media is removed and RNA prepared from the cells and marker expression analyzed by real time quantitative RT-PCR. A protein is considered to be active in the assay if it increases the expression of the relevant A-cell marker as compared to untreated controls.
14F/1640 is RPMI1640 (Gibco) plus the following: group A 1:1000 group B 1:1000 recombinant human insulin 10 pg/ml Aprotinin (50pg/ml) 1:2000 (Boehringer manheim #981532) Bovine pituitary extract (BPE) Gentamycin 100 ng/ml Group A fin 10ml PBS) Transferrin, 100mg (Sigma T2252) Epidermal Growth Factor, 100pg (BRL 100004) Triiodothyronine, iO/l of 5xl 0 M (Sigma T5516)
O
c Ethanolamine, 100pI of 10' M (Sigma E0135) Phosphoethalamine, 100pl of 10' M (Sigma P0503) Selenium, 4 pl of 10' M (Aesar #12574) Group C (in 10ml 100% ethanol) e Hydrocortisone, 2p1 of 5X10' M (Sigma #H0135) Progesterone, 100pl of IX l O' M (Sigma #P6149) Forskolin, 5001 of 20mM (Calbiochem #344270) Minimal media: SRPMI 1640 plus transferrin (10 pg/dm), insulin (I pg/ml), gentamycin(100 ng/ml), aprotinin (50 gg,,nl) C and BPE (15 pg/ml).
O 10 Defined media: SRPMI 1640 plus transferrin (10 pg/ml), insulin (1 pg/ml), gentamycin (100 ng/ml) and aprotinin pg/ml).
The following polypeptides were positive in this assay: PRO4356.
EXAMPLE 40: Fetal Hemoglobin Induction in an Ervyhroblastic Cell Line (Assay 107) This assay is useful for screening PRO polypeptides for the ability to induce the switch from adult hemoglobin to fetal hemoglobin in an erythroblastic cell line. Molecules testing positive in this assay are expected to be useful for therapeutically treating various mammalian hemoglobin-associated disorders such as the various thalassemias. The assay is performed as follows. Erythroblastic cells are plated in standard growth medium at 1000 cells/well in a 96 well format. PRO polypeptides are added to the growth medium at a concentration of 0.2% or 2% and the cells are incubated for 5 days at 37 0 C. As a positive control, cells are treated with 100pM hemin and as a negative control, the cells are untreated. After 5 days, cell lysates are prepared and analyzed for the expression of gamma globin (a fetal marker). A positive in the assay is a gamma globin level at least 2-fold above the negative control.
The following polypeptides tested positive inthisassay: PR04352, PR04354, PR04408, PR06030 and PR04499.
EXAMPLE 41: Mouse Kidney Mesangial Cell Proliferation Assay (Assav 92) This assay shows that certain polypeptides of the invention act to induce proliferation of mammalian kidney mesangial cells and, therefore, are useful for treating kidney disorders associated with decreased mesangial cell function such as Bcrger disease or other nephropathies associated with Sch6nlein-Henoch purpura, celiac disease, dermatitis herpetiformis or Crohn disease. The assay is performed as follows. On day one, mouse kidney mesangial cells are plated on a 96 well plate in growth media (3:1 mixture of Dulbecco's modified Eagle's medium and Ham's F12 medium, 95% fetal bovine serum, 5% supplemented "'lh 14 mM H-PES) and grown overnight. On day 2, PRO polypeptides are diluted at 2 concentrations( and in serum-free medium and added to the cells. Control samples are serum-free medium alone. On day 4, 20/l of the Cell Titer 96 Aqueous one solution reagent (Progema) was added to each well and the cotormetric reaction was allowed 0 to proceed for 2 hours. The absorbance (OD) is then measured at 490 am, A positive in the asay is anything that gives an absorbance reading which is at least 15% above the control reading.
;Z The following polypeptide tested positive in this assay: PR04380, PR04408 and PR04425.
Deposit of Material The following materials have been deposited with the American Type Culture Collection, 12301 Parklawri Drive, Rockville, MD, USA (ATCC): Table7 ClMaterial ATCC Dep. No. Deposit Date DNA44686-1653 203581 January 12, 1999 ODNA59608-2577 203870 March 23, 1999 ClDNA62377-1381 203552 December 22, 1998 DNA77623-2524 203546 December 22, 1998 DNA79230-2525 203549 December 22, 1998 DNA79862-2522 203550 December 22, 1998 DNA80136-2503 203541 December 15, 1998 DNA80145-2594 204-PTA June 8, 1999 D)NA849l7-2597 203863 March 23, 1999 DNA84920-2614 203966 April 27, 1999 DNA86576-2595 203868 March 23, 1999 DNA87976-2593 203888 March 30, 1999 DNA92234-2602 203948 April 20, 1999 DNA92256-2596 203891 March 30, 1999 DNA92274-2617 203971 April 27, 1999 DNA92929-2534 203586 January 12, 1999 DNA93011-2637 20-PTA May 4, 1999 DNA96042-2682 382-PTA July 20, 1999 DNA96850-2705 479-PTrA August 3, 1999 DNA96857-2630 17-PTA May 4. 1999 DNA96867-2620 203972 April 27, 1999 DNA96878-2626 23-PTA May 4, 1999 DNA96899-2641 119-PTA May 25, 1999 These deposits were made under the provisions of the Budapest Treaty on the international Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of the culture of the deposit to the public upon'issuance of the pertinent U.S. patent or upon laying open to the public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to USC 122 and the Commissioner's rules pursuant thereto (including 37 CFR 1. 14 with particular reference to 886 00 638).
jthe assignee of the present application has agreed that if a culture of the materials on deposit should die or be lost or destroyed when cultivated under suitable conditions, the materials he promptly replaced on notification with another of the same. Availahility of the deposited material is not to he construed as a license to practice the invention in contravention of the rights granted under the authority of any government in accordance with its patent laws.
The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice the invention. The present invention is not to be limited in scope by the construct deposited, since the deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs that are functionally equivalent are within the scope of this invention. The deposit of material herein does not constitute an admission that the written description herein contained is inadequate to enable the practice of any aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description and fall within the scope of the appended claims.
The entire disclosure in the complete specification of our Australian Patent Application No.
2003261484 is by this cross-reference incorporated into the present specification.
In the claims which follow and in the preceding description of the invention, except where the context requires otherwise due to express language or necessary implication, the word "comprise" or variations such as "comprises" or "comprising" is used in an inclusive sense, i.e. to specify the presence of the stated features but not to preclude the presence or addition of further features in various embodiments of the invention.
It is to be understood that, if any prior art publication is referred to herein, such reference does not constitute an admission that the publication forms a part of the common general knowledge in the art, in Australia or any other country.
Claims (40)
- 2. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide Ssequence having the nucleotide sequence shown in Figure 21 (SEQ ID NO:49), in which the nucleic acid l encodes a polypeptide which has the ability to induce differentiation of pancreatic f-cell precursor cells O 10 into mature pancreatic B-cells.
- 3. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide O sequence having the full-length coding sequence of the nucleotide sequence shown in Figure 21 (SEQ ID C]l NO:49), in which the nucleic acid encodes a polypeptide which has the ability to induce differentiation of pancreatic 3-cell precursor cells into mature pancreatic fl-cells.
- 4. Isolated nucleic acid having at least 80% nucleic acid sequence identity to the full- length coding sequence of the DNA deposited under ATCC accession number 203868. Isolated nucleic acid according to any one of claims I to 4 in which sequence identity is at least
- 6. Isolated nucleic acid according to any one of claims I to 4 in which sequence identity is at least
- 7. Isolated nucleic acid according to any one of claims I to 4 in which sequence identity is at least
- 8. Isolated nucleic acid according to any one of claims I to 4 in which sequence identity is at least 99%.
- 9. Isolated nucleic acid having the nucleotide sequence shown in Figure 21 (SEQ ID: NO. 49). Isolated nucleic acid encoding the amino acid sequence shown in Figure 22 (SEQ ID: NO.
- 11. Isolated nucleic acid having the nucleotide sequence of the full length coding sequence of the DNA deposited under the ATCC accession number 203868.
- 12. A vector comprising the nucleic acid of any one of Claims I to 11.
- 13. The vector of Claim 12, operably linked to control sequences recognized by a host cell transformed with the vector.
- 14. A host cell comprising the vector of Claim 12 or Claim 13.
- 15. The host cell of Claim 14, wherein said cell is a CHO cell.
- 16. The host cell of Claim 14, wherein said cell is an E. coli
- 17. The host cell of Claim 14, wherein said cell is a yeast cell.
- 18. A process for producing a PR04356 polypeptide, comprising culturing the host cell of Claim 14 under conditions suitable for expression of the PR04356 polypeptide and recovering the PR04356 polypeptide from the cell culture. 171 H.VuanitalKcep'paleni PSI 822 div pro43 56 doC 31/OR/05 O O 19. An isolated polypeptide having at least 80% amino acid sequence identity having the amino acid sequence shown in Figure 22 (SEQ ID NO:50), in which the polypeptide has the ability to induce differentiation ofpancreatic (-cell precursor cells into mature pancreatic #-cells. An isolated polypeptide scoring at least 80% positives when compared to the amino acid sequence shown in Figure 22 (SEQ ID NO:50), in which the polypeptide has the ability to induce differentiation of pancreatic (-cell precursor cells into mature pancreatic 3-cells.
- 21. An isolated polypeptide having at least 80% amino acid sequence identity to an amino n acid sequence encoded by the full-length coding sequence of the DNA deposited under ATCC accession Snumber 203858, in which the polypeptide has the ability to induce differentiation of pancreatic 0-cell O 10 precursor cells into mature pancreatic 0-cells.
- 22. An isolated polypeptide according to any one of claims 19 to 21 in which the sequence Sidentity is at least CN 23. An isolated polypeptide according to any one of claims 19 to 21 in which the sequence identity is at least
- 24. An isolated polypeptide according to any one of claims 19 to 21 in which the sequence identity is at least An isolated polypeptide according to any one of claims 19 to 21 in which the sequence identity is at least 99%.
- 26. An isolated polypeptide having the amino acid sequence shown in Figure 22 (SEQ ID
- 27. An isolated polypeptide having the amino acid sequence encoded by the nucleotide sequence shown in Figure 21 (SEQ ID NO:49).
- 28. An isolated polypeptide having the amino acid sequence encoded by the full-length coding sequence of the DNA deposited under ATCC accession number 203868.
- 29. A chimeric molecule comprising a polypeptide according to any one of Claims 19 to 28, fused to a heterologous amino acid sequence. The chimeric molecule of Claim 29, wherein said heterologous amino acid sequence is an epitope tag sequence.
- 31. The chimeric molecule of Claim 29, wherein said heterologous amino acid sequence is a Fc region of an immunoglobulin.
- 32. An antibody which specifically binds to a polypeptide according to any one of Claims 19 to 29.
- 33. The antibody of Claim 32, wherein said antibody is a monoclonal antibody, a humanized antibody or a single-chain antibody.
- 34. Isolated nucleic acid having at least 80% nucleic acid sequence identity to: a nucleotide sequence encoding the polypeptide shown in Figure 22 (SEQ ID lacking its associated signal peptide; a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 22 (SEQ ID NO:50), with its associated signal peptide; or a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 22 (SEQ ID NO:50), lacking its associated signal peptide, in which the polypeptide has the ability to induce differentiation of pancreatic 3-cell precursor cells into mature pancreatic (-cells. 172 IH:unila\Keep\patemlP 51822 div pro4356 doc 311 08OO O O 35. An isolated nucleic acid according to claim 34 in which the sequence identity is at least S36. An isolated nucleic acid according to claim 34 in which the sequence identity is at least
- 37. An isolated nucleic acid according to claim 34 in which the sequence identity is at least
- 38. An isolated nucleic acid according to claim 34 in which the sequence identity is at least ICn 99%. S39. Isolated nucleic acid having: O 10 a nucleotide sequence encoding the polypeptide shown in Figure 22 (SEQ ID Slacking its associated signal peptide; S(b) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in C, Figure 22 (SEQ ID NO:50), with its associated signal peptide; or a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 22 (SEQ ID NO:50), lacking its associated signal peptide, in which the polypeptide has the ability to induce differentiation of pancreatic f-cell precursor cells into mature pancreatic f-cells. An isolated polypeptide having at least 80% amino acid sequence identity to: the polypeptide shown in Figure 22 (SEQ ID NO:50), lacking its associated signal peptide; an extracellular domain of the polypeptide shown in Figure 22 (SEQ ID NO:50), with its associated signal peptide; or an extracellular domain of the polypeptide shown in Figure 22 (SEQ ID lacking its associated signal peptide, in which the polypeptide has the ability to induce differentiation of pancreatic f-cell precursor cells into mature pancreatic B-cells.
- 41. An isolated polypeptide according to claim 40 in which the sequence identity is at least
- 42. An isolated polypeptide according to claim 40 in which the sequence identity is at least
- 43. An isolated polypeptide according to claim 40 in which the sequence identity is at least
- 44. An isolated polypeptide according to claim 40 in which the sequence identity is at least An isolated polypeptide having the amino acid sequence of: the polypeptide shown in Figure 22 (SEQ ID NO:50), lacking its associated signal peptide; an extracellular domain of the polypeptide shown in Figure 22 (SEQ ID NO:50), with its associated signal peptide; or an extracellular domain of the polypeptide shown in Figure 22 (SEQ ID lacking its associated signal peptide,
- 46. A composition comprising an isolated polypeptide according to any one of claims 40 to together with a pharmaceutically-acceptable carrier. 173 H:UuanitcaKeep\pa cnt\P5 122 div pro4J 6 doc 31 ^O 47. A composition comprising an antibody according to claim 32 or claim 33, together with a pharmaceutically-acceptable carrier. S48. A composition comprising a chimeric molecule according to any one of claims 29 to 31, together with a pharmaceutically-acceptable carrier.
- 49. Complement of the isolated nucleic acid according to any one of claims I to 11 or 34 to 39. DNA hybridising to the complement according to claim 49 between about residues 112 and about 807 inclusive of Figure 21 (SEQ ID NO: 49) under conditions of high stringency.
- 51. Fragments of the isolated nucleic acid according to any one of claims 1 to I I or 34 to 39, which are specific to the sequence shown in Figure 21 (SEQ ID NO: 49). S 10 52. Antagonists of PR04356, which antagonists are antibodies specific to PR04356 or Santisense molecules specific for PR04356. O 53. A method of treatment of disorder involving insulin deficiency, comprising administering to C a subject in need thereof an effective amount of an isolated nucleic acid according to any one of claims 1 to 11 or 34 to 39, an isolated polypeptide according to any one of claims 19 to 28 or 40 to 45, a chimeric molecule according to any one of claims 29 to 31, an antibody according to claim 32 or 33, a composition according to claim 46 to 48 or an antagonist according to claim 52.
- 54. Use of an isolated nucleic acid according to any one of claims 1 to 11 or 34 to 39, an isolated polypeptide according to any one of claims 19 to 28 or 40 to 45, a chimeric molecule according to any one of claims 29 to 31, an antibody according to claim 32 or 33, a composition according to claim 46 to 48 or an antagonist according to claim 52, in the manufacture of a medicament for treating a disorder involving insulin deficiency. A method of diagnosis of a disorder involving insulin deficiency, comprising assaying for an isolated nucleic acid according to any one of claims 1 to 11 or 34 to 39, an isolated polypeptide according to any one of claims 19 to 28 or 40 to 45, a chimeric molecule according to any one of claims 29 to 31 an antibody according to claim 32 or 33, a composition according to claim 46 to 48, or an antagonist according to claim 52.
- 56. Use of an isolated nucleic acid according to any one of claims I to 11 or 34 to 39, an isolated polypeptide according to any one of claims 19 to 28 or 40 to 45, a chimeric molecule according to any one of claims 29 to 31, an antibody according to claim 32 or 33, a composition according to claim 46 to 48, or an antagonist according to claim 52, in the manufacture of a medicament for diagnosing a disorder involving insulin deficiency.
- 57. A method according to claim 53 or claim 55 or use according to claim 54 or claim 56, in which the disorder is diabetes mellitus.
- 58. A method according to claim 53 or claim 55 or use according to claim 54 or claim 56, substantially as hereinbefore described, with reference to the examples, and, or drawings. 174 H.Uinila\Kecp\p tca\PeaI 22 div pro4356 doc 31(08105 O O
- 59. An isolated nucleic acid according to any one of claims 1 to 11 or 34 to 39, an isolated Spolypeptide according to any one of claims 19 to 28 or 40 to 45, a chimeric molecule according to any one of claims 29 to 31, an antibody according to claim 32 or 33, a composition according to claim 46 to 48 or an antagonist according to claim 49, substantially as hereinbefore described, with reference to the examples, and, or drawings. n Dated this 31 st day of August 2005 n GENENTECH. INC. C 10 By their Patent Attorneys 0GRIFFITH HACK i Fellows Institute of Patent and Trade Mark Attorneys of Australia H:U uanita\Keep\ptent\P51822 div pro4356 doe 31108/05
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2005205754A AU2005205754B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
Applications Claiming Priority (19)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US60/125778 | 1999-03-23 | ||
US60/125774 | 1999-03-23 | ||
US60/125826 | 1999-03-24 | ||
US60/127035 | 1999-03-31 | ||
US60/127706 | 1999-04-05 | ||
US60/130359 | 1999-04-21 | ||
US60/131270 | 1999-04-27 | ||
US60/131291 | 1999-04-27 | ||
US60/131272 | 1999-04-27 | ||
US60/132383 | 1999-05-04 | ||
US60/132379 | 1999-05-04 | ||
US60/132371 | 1999-05-04 | ||
US60/135750 | 1999-05-25 | ||
US60/138166 | 1999-06-08 | ||
US60/144791 | 1999-07-20 | ||
US60/146970 | 1999-08-03 | ||
US60/170262 | 1999-12-09 | ||
AU2003261484A AU2003261484B2 (en) | 1999-03-23 | 2003-11-06 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
AU2005205754A AU2005205754B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2003261484A Division AU2003261484B2 (en) | 1999-03-23 | 2003-11-06 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
Publications (2)
Publication Number | Publication Date |
---|---|
AU2005205754A1 AU2005205754A1 (en) | 2005-09-22 |
AU2005205754B2 true AU2005205754B2 (en) | 2007-09-20 |
Family
ID=35057829
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2005205754A Ceased AU2005205754B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
AU2005205755A Ceased AU2005205755B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
AU2005205758A Ceased AU2005205758B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
AU2005205752A Ceased AU2005205752B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
Family Applications After (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2005205755A Ceased AU2005205755B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
AU2005205758A Ceased AU2005205758B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
AU2005205752A Ceased AU2005205752B2 (en) | 1999-03-23 | 2005-08-31 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
Country Status (1)
Country | Link |
---|---|
AU (4) | AU2005205754B2 (en) |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1200590B1 (en) * | 1999-08-12 | 2009-01-07 | Agensys, Inc. | C-type lectin transmembrane antigen expressed in human prostate cancer and uses thereof |
-
2005
- 2005-08-31 AU AU2005205754A patent/AU2005205754B2/en not_active Ceased
- 2005-08-31 AU AU2005205755A patent/AU2005205755B2/en not_active Ceased
- 2005-08-31 AU AU2005205758A patent/AU2005205758B2/en not_active Ceased
- 2005-08-31 AU AU2005205752A patent/AU2005205752B2/en not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
AU2005205752A1 (en) | 2005-09-22 |
AU2005205754A1 (en) | 2005-09-22 |
AU2005205755A1 (en) | 2005-09-22 |
AU2005205752B2 (en) | 2007-06-07 |
AU2005205755B2 (en) | 2007-10-04 |
AU2005205758A1 (en) | 2005-09-22 |
AU2005205758B2 (en) | 2007-09-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1210418B1 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
US20060073579A1 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
AU764092B2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
US7105639B2 (en) | Anti-PRO 4405 antibodies | |
US7192587B2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
US7435793B2 (en) | Peptides that induce chondrocyte redifferentiation | |
US20060073549A1 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
US6984519B2 (en) | Nucleic acids encoding peptides that induce chondrocyte redifferentiation | |
AU2003261484B2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
AU2005205754B2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
EP1612220A2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
EP1300417B1 (en) | Secreted and transmembrane polypeptide and nucleic acid encoding the same | |
EP1591452A2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
EP1621619B1 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
EP1624061A1 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
EP1956030B1 (en) | Secreted and transmembrane polypeptides and nucleic acids endoding the same | |
US7425613B2 (en) | PRO1375 polypeptides | |
US20060073547A1 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
EP1944317A2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
CA2512479A1 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FGA | Letters patent sealed or granted (standard patent) | ||
MK14 | Patent ceased section 143(a) (annual fees not paid) or expired |